pdfplumber inserts \n between lines based on the y_tolerance parameter.
It would be great to also detect paragraph breaks: when a larger vertical gap is found (above a separate threshold), emit \n\n instead of \n. This would make paragraph boundaries detectable.
Thank you
pdfplumber inserts \n between lines based on the y_tolerance parameter.
It would be great to also detect paragraph breaks: when a larger vertical gap is found (above a separate threshold), emit \n\n instead of \n. This would make paragraph boundaries detectable.
Thank you