Skip to content

Conversation

@thliang01
Copy link
Contributor

This PR enhances the "Working with Apache Arrow" notebook that demonstrates DuckDB's seamless integration with Apache Arrow. The changes improve educational value by providing clearer explanations of memory operations and better code organization.

Changes Made

  • Update marimo version from 0.14.11 to 0.14.12
  • Reorganize imports: move psutil and os to module level for better code organization
  • Clarify memory behavior: note Pandas Copy-on-Write (CoW) impact on copy operations
  • Improve readability: reformat performance benefits section with clearer language
  • Fix grammar: change "data bigger than" to "data larger than" for consistency

These changes enhance the notebook's educational value by providing more accurate technical details about memory operations and improving overall code structure.

📝 Summary

This PR improves the Apache Arrow + DuckDB integration notebook by clarifying technical details about memory operations, reorganizing imports for better code structure, and enhancing readability. The notebook demonstrates how DuckDB leverages Apache Arrow's zero-copy capabilities for efficient data processing across different Python data libraries (Pandas, Polars).

📋 Checklist

  • I have included package dependencies in the notebook file using --sandbox
  • If adding a course, include a README.md
  • Keep language direct and simple

- Update marimo version from 0.14.11 to 0.14.12
- Reorganize imports: move psutil and os to module level for better code organization
- Clarify memory behavior: note Pandas Copy-on-Write (CoW) impact on copy operations
- Improve readability: reformat performance benefits section with clearer language
- Fix grammar: change "data bigger than" to "data larger than" for consistency

These changes enhance the notebook's educational value by providing more accurate
technical details about memory operations and improving overall code structure.
@thliang01 thliang01 changed the title Duckdb: Improve Apache Arrow + DuckDB notebook clarity and technical accuracy DuckDB : Improve Apache Arrow + DuckDB notebook clarity and technical accuracy Jul 21, 2025
@Haleshot Haleshot merged commit 911073a into marimo-team:main Jul 24, 2025
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants