Skip to content

Add Are.na channel extractor (resolves #5847)#8509

Merged
mikf merged 9 commits intomikf:masterfrom
dev-KingMaster:master
Nov 11, 2025
Merged

Add Are.na channel extractor (resolves #5847)#8509
mikf merged 9 commits intomikf:masterfrom
dev-KingMaster:master

Conversation

@dev-KingMaster
Copy link
Contributor

@dev-KingMaster dev-KingMaster commented Nov 4, 2025

Summary

Details

  • API usage:
  • Extracts:
    • Images via image.original.url (fallback to display/large when needed)
    • File attachments via attachment.url
    • Adds source_url when present (for Link blocks)
  • Skips non-downloadable items (e.g., nested channels without media).
  • Populates directory/filename metadata using GalleryExtractor defaults (includes channel slug/id, title, numbering).
  • Honors extractor.page-reverse.

URL Patterns

Testing

Contribution by Gittensor, learn more at https://gittensor.io/

@dev-KingMaster
Copy link
Contributor Author

Hi @mikf Could you review this pr and leave any comments? Thanks

mikf added 6 commits November 10, 2025 21:59
- prevent unnecessary request to 'page_url'
- fix pagination
- simplify block extraction code

TODO: - rewrite without GalleryExtractor
      - extractors for Blocks, Users, etc
@mikf mikf merged commit 793956d into mikf:master Nov 11, 2025
9 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[Site Support] are.na

2 participants