Library.add_files(params, max_chunk_size=n) often creates record in db where chunk size vastly exceed n - often representing an entire document page of text
simply as described.
appears to be more associated with the parsing of pdf documents that have entire pages comprised of a scanned image
are these types of record included in embedding? if so, problematic, right?
macos 15.x
llmware v 0.3.8
active_db: sqlite