feat(callgraph): Add Python statement extraction for intra-procedural dataflow #344

shivasurya · 2025-11-04T00:33:48Z

Summary

Implements Python statement extraction from AST to support intra-procedural dataflow analysis. This is part 2 of the intra-procedural dataflow feature.

Changes

Add statement extraction for Python functions
Extract assignments, calls, and returns with def-use information
Comprehensive test coverage (87.3%)
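
To make "def-use information" concrete, here is a minimal sketch of per-statement extraction. It is not the PR's code: the PR implements this in Go, while the sketch uses Python's standard `ast` module. The names `Statement`, `names_used`, and `extract_statements` are hypothetical. Unlike the PR, this sketch also counts the callee name (for example `print`) among the uses.

```python
import ast
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Statement:
    """Hypothetical record: one statement plus its def-use information."""
    kind: str                      # "assignment", "call", or "return"
    line: int                      # 1-based source line
    defines: Optional[str] = None  # variable written by the statement, if any
    uses: List[str] = field(default_factory=list)  # variables read


def names_used(node: Optional[ast.AST]) -> List[str]:
    """Return the sorted, deduplicated names that appear inside an expression."""
    if node is None:
        return []
    return sorted({n.id for n in ast.walk(node) if isinstance(n, ast.Name)})


def extract_statements(source: str) -> List[Statement]:
    """Extract top-level assignments, calls, and returns from the first function."""
    func = ast.parse(source).body[0]
    result: List[Statement] = []
    for stmt in func.body:
        if isinstance(stmt, ast.Assign):
            # Only simple `x = expr`; tuple unpacking and attribute or
            # subscript targets are skipped, since they define no single local.
            if len(stmt.targets) == 1 and isinstance(stmt.targets[0], ast.Name):
                result.append(Statement("assignment", stmt.lineno,
                                        stmt.targets[0].id, names_used(stmt.value)))
        elif isinstance(stmt, ast.Expr) and isinstance(stmt.value, ast.Call):
            result.append(Statement("call", stmt.lineno, None, names_used(stmt.value)))
        elif isinstance(stmt, ast.Return):
            result.append(Statement("return", stmt.lineno, None, names_used(stmt.value)))
        # Control flow (if/for/while/...) is ignored in this sketch.
    return result


SRC = """
def f(a):
    x = a + 1
    print(x)
    return x
"""

for s in extract_statements(SRC):
    print(s.kind, s.line, s.defines, s.uses)
```

Each record pairs the variable a statement defines with the variables it reads, which is the input a later def-use chain or taint pass would need.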

Testing

20+ tests covering all statement types
All tests passing
Build and lint clean

Stacked on #343

🤖 Generated with Claude Code

Co-Authored-By: Claude [email protected]

shivasurya · 2025-11-04T00:34:05Z

This stack of pull requests is managed by Graphite. Learn more about stacking.

codecov · 2025-11-04T00:35:00Z

Codecov Report

❌ Patch coverage is 83.04348% with 39 lines in your changes missing coverage. Please review.
✅ Project coverage is 75.79%. Comparing base (1f7bc7a) to head (980d41f).
⚠️ Report is 1 commits behind head on main.

Files with missing lines	Patch %	Lines
...ode-parser/graph/callgraph/statement_extraction.go	83.04%	23 Missing and 16 partials ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #344      +/-   ##
==========================================
+ Coverage   75.50%   75.79%   +0.29%     
==========================================
  Files          49       50       +1     
  Lines        5699     5929     +230     
==========================================
+ Hits         4303     4494     +191     
- Misses       1221     1244      +23     
- Partials      175      191      +16
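
The numbers in this report can be checked by hand. The sketch below assumes coverage is computed as hits / (hits + misses + partials). The two-decimal truncation is inferred from the figures shown here, not from a stated rule. The helper names are hypothetical.

```python
import math


def coverage_percent(hits: int, misses: int, partials: int) -> float:
    """Coverage as a percentage: hits / (hits + misses + partials)."""
    return 100.0 * hits / (hits + misses + partials)


def truncate2(value: float) -> float:
    """Drop everything past two decimals (assumed display rule)."""
    return math.floor(value * 100) / 100


# Patch: the 230 lines added by this PR (+191 hits, +23 misses, +16 partials).
patch = coverage_percent(hits=191, misses=23, partials=16)

# Project totals for base (main) and head (this PR), taken from the diff table.
base = coverage_percent(hits=4303, misses=1221, partials=175)
head = coverage_percent(hits=4494, misses=1244, partials=191)

print(round(patch, 5))                    # 83.04348
print(truncate2(base), truncate2(head))   # 75.5 75.79
```

The 39 uncovered patch lines are the 23 misses plus the 16 partials, so 191 of 230 lines are fully covered.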


safedep · 2025-11-04T00:50:54Z

SafeDep Report Summary

No dependency changes detected. Nothing to scan.

This report is generated by SafeDep Github App

shivasurya · 2025-11-04T01:51:36Z

Merge activity

Nov 4, 1:51 AM UTC: A user started a stack merge that includes this pull request via Graphite.
Nov 4, 1:52 AM UTC: Graphite rebased this pull request as part of a merge.
Nov 4, 1:53 AM UTC: @shivasurya merged this pull request with Graphite.

… dataflow

Implements statement-level extraction from Python AST to support intra-procedural dataflow analysis and taint propagation. This is PR #2 of the intra-procedural dataflow feature implementation.

**Key Features:**
- Extract assignments, augmented assignments, calls, and returns
- Build def-use information for each statement
- Conservative identifier extraction for security analysis
- Handle Python AST node wrapping (expression_statement)
- Filter Python keywords and 'self' references
- Extract method names from chained calls (obj.a.b.method)

**Implementation Details:**
- `ExtractStatements`: Main entry point, iterates function body
- `extractAssignment`: Handles simple assignments (x = expr)
  - Stores RHS expression in CallTarget field
  - Skips tuple unpacking (requires multiple defs)
  - Skips attribute/subscript assignments (no local defs)
- `extractAugmentedAssignment`: Handles x += expr (def and use)
- `extractCall`: Extracts function/method calls
  - CallTarget contains method name (not full chain)
  - CallArgs contains literal argument values
  - Uses contains all identifiers (recursive extraction)
- `extractReturn`: Handles return statements
  - Stores expression in CallTarget
- `extractIdentifiers`: Recursive identifier extraction
  - Filters Python keywords and 'self'
  - Deduplicates results

**Test Coverage:**
- 20+ comprehensive tests covering all statement types
- 87.3% overall coverage
- Edge cases: empty functions, control flow skipped, nested calls
- Tests for keyword filtering, deduplication, self references

**Compliance:**
- All tests passing
- Build successful
- Linter clean (nolint comments for false-positive unconvert warnings)

Related to #340

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <[email protected]>
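
The identifier filtering and chained-call handling described in this commit message can be illustrated with a small sketch. It is not the Go implementation from the PR: it uses Python's standard `ast` module, and the function names `extract_identifiers` and `call_target` are hypothetical stand-ins. Python's own parser never produces keywords as `Name` nodes, so the keyword check here only mirrors the filter the commit describes.

```python
import ast
import keyword
from typing import List


def extract_identifiers(node: ast.AST) -> List[str]:
    """Recursively collect variable names in source order.

    Skips Python keywords and 'self', and drops duplicates.
    """
    found: List[str] = []

    def visit(n: ast.AST) -> None:
        if isinstance(n, ast.Name):
            name = n.id
            if name != "self" and not keyword.iskeyword(name) and name not in found:
                found.append(name)
        for child in ast.iter_child_nodes(n):
            visit(child)

    visit(node)
    return found


def call_target(call: ast.Call) -> str:
    """Return only the final name of a call chain, not the full chain."""
    func = call.func
    if isinstance(func, ast.Attribute):   # obj.a.b.method(...) -> "method"
        return func.attr
    if isinstance(func, ast.Name):        # helper(...) -> "helper"
        return func.id
    return ast.unparse(func)              # e.g. a lambda being called directly


call = ast.parse("obj.a.b.method(x, self.y, x, key=z)", mode="eval").body
print(call_target(call))          # method
print(extract_identifiers(call))  # ['obj', 'x', 'z']
```

Dropping `self` and duplicates keeps the use set small. Keeping every other name, including the receiver `obj`, errs on the conservative side.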

…erage

Adds 15+ additional tests to improve coverage from 87.3% to 87.7%.

**New Test Coverage:**
- Augmented assignment with attributes/subscripts
- Complex call target expressions (lambda calls)
- Nil node safety checks
- Line number tracking
- Nested keyword arguments
- Assignment from literals
- Return with multiple identifiers
- Edge cases for defensive coding

**Coverage Improvements:**
- extractIdentifiersFromArgs: 92.0% → 96.0%
- extractCallArgs: 91.3% → 95.7%
- extractIdentifiers: 88.9% → 94.4%
- ExtractStatements: 88.9% → 92.6%

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <[email protected]>

shivasurya mentioned this pull request Nov 4, 2025

feat(dataflow): Add core data structures for intra-procedural taint analysis #343

Merged

shivasurya self-assigned this Nov 4, 2025

shivasurya added the labels enhancement (New feature or request) and go (Pull requests that update go code) Nov 4, 2025

shivasurya mentioned this pull request Nov 4, 2025

feat(callgraph): Add def-use chain construction (PR #3) #345

Merged

shivasurya marked this pull request as ready for review November 4, 2025 00:50

This was referenced Nov 4, 2025

feat(taint): Implement intra-procedural taint propagation #346

Merged

feat(callgraph): Integrate taint analysis into call graph builder #347

Merged

Fix intra-procedural vulnerability detection #348

Merged

shivasurya changed the base branch from feat/intra-procedural-dataflow-pr1-data-structures to graphite-base/344 November 4, 2025 01:51

shivasurya changed the base branch from graphite-base/344 to main November 4, 2025 01:51

shivasurya and others added 2 commits November 4, 2025 01:52

shivasurya force-pushed the feat/intra-procedural-dataflow-pr2-statement-extraction branch from 9f592e5 to 980d41f Compare November 4, 2025 01:52

shivasurya merged commit 29eb111 into main Nov 4, 2025
5 checks passed

shivasurya deleted the feat/intra-procedural-dataflow-pr2-statement-extraction branch November 4, 2025 01:53
