Skip to content

Release v0.3.9

Choose a tag to compare

@github-actions github-actions released this 17 Nov 01:38
· 246 commits to main since this release
e6e65ba

A new update with many important changes

πŸš€ New Stuff and quality of life changes

  • Now the impersonate argument in Fetcher and FetcherSession can accept a list of browsers that the library will choose a random browser from them with each request.
from scrapling.fetchers import FetcherSession

with FetcherSession(impersonate=['chrome', 'firefox', 'safari']) as s:
  s.get('https://github.com/D4Vinci/Scrapling')
  • A new argument to the clean method in TextHandler to remove html entities from the current text easily.
  • Huge improvements to the documentation with more precise explanations of many parts and automatic translations of the main README.md file.

πŸ› Bug Fixes

  • Fixed a big issue with retrieving responses from browser-based fetchers. Now, there is intelligent content type detection that ensures response.body contains the rendered browser content only if the content is HTML; otherwise, it contains the raw content of the last request made. This allows you to download binary files and text-based files without having to find them wrapped in HTML tags, while being able to retrieve the rendered content you want from the website when fetching it.

πŸ”¨ Misc

  • Updated the contributing guide to make it clearer and easier.
  • Add a new workflow to enforce code quality tools (Same ones used as pre-commit hooks).

πŸ™ Special thanks to our Discord community for all the continuous testing and feedback


Big shoutout to our biggest Sponsors