DuckDB : Improve Apache Arrow + DuckDB notebook clarity and technical accuracy #119

thliang01 · 2025-07-21T09:35:21Z

This PR enhances the "Working with Apache Arrow" notebook that demonstrates DuckDB's seamless integration with Apache Arrow. The changes improve educational value by providing clearer explanations of memory operations and better code organization.

Changes Made

- Update marimo version from 0.14.11 to 0.14.12
- Reorganize imports: move psutil and os to module level for better code organization
- Clarify memory behavior: note Pandas Copy-on-Write (CoW) impact on copy operations
- Improve readability: reformat performance benefits section with clearer language
- Fix grammar: change "data bigger than" to "data larger than" for consistency

These changes enhance the notebook's educational value by providing more accurate technical details about memory operations and improving overall code structure.
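
For reviewers unfamiliar with the Copy-on-Write point above, here is a minimal sketch of the behavior being described. It is not taken from the notebook; the DataFrame, the column name `a`, and the variable names are illustrative assumptions, and it assumes pandas 2.x, where CoW is enabled through the `mode.copy_on_write` option.

```python
import numpy as np
import pandas as pd

# Assumption: pandas 2.x, where Copy-on-Write is opt-in via this option.
pd.set_option("mode.copy_on_write", True)

# Hypothetical data, used only for illustration.
df = pd.DataFrame({"a": np.arange(5, dtype="int64")})

# A shallow copy does not duplicate the underlying buffer up front.
shallow = df.copy(deep=False)
shares_before = np.shares_memory(df["a"].to_numpy(), shallow["a"].to_numpy())

# The first write to the copy triggers the actual data copy,
# so the original DataFrame is left untouched.
shallow.loc[0, "a"] = 99
shares_after = np.shares_memory(df["a"].to_numpy(), shallow["a"].to_numpy())

print("shares memory before write:", shares_before)
print("shares memory after write:", shares_after)
print("original value:", df.loc[0, "a"], "| copy value:", shallow.loc[0, "a"])
```

Under CoW, a memory measurement taken right after a copy can therefore understate the eventual cost, because the data is only duplicated once one side is modified.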

📝 Summary

This PR improves the Apache Arrow + DuckDB integration notebook by clarifying technical details about memory operations, reorganizing imports for better code structure, and enhancing readability. The notebook demonstrates how DuckDB leverages Apache Arrow's zero-copy capabilities for efficient data processing across different Python data libraries (Pandas, Polars).

📋 Checklist

I have included package dependencies in the notebook file using --sandbox
If adding a course, include a README.md
Keep language direct and simple


thliang01 changed the title ~~Duckdb: Improve Apache Arrow + DuckDB notebook clarity and technical accuracy~~ DuckDB : Improve Apache Arrow + DuckDB notebook clarity and technical accuracy Jul 21, 2025

Haleshot approved these changes Jul 24, 2025


Haleshot merged commit 911073a into marimo-team:main Jul 24, 2025
2 checks passed
