JuliaSyntax: fix anonymous function parsing #60221

IanButterworth · 2025-11-24T03:05:09Z

julia> function zip_missing(::Tuple{}, longer)
		      map(function (second_one)
		          (missing, second_one)
		      end, longer)
		  end
zip_missing (generic function with 1 method)

Written by Claude

Keno · 2025-11-24T04:28:50Z

Just for reference, @KristofferC just touched this code in JuliaLang/JuliaSyntax.jl#580

JuliaSyntax/test/parser.jl

IanButterworth · 2025-12-03T15:05:40Z

The fix here has become quite complicated.. @c42f @KristofferC

fogti · 2025-12-05T11:55:37Z

Would it be possible to add a unit test for this? (incl. for the previous change that affected this part of the code)

Did we actually find out where the error comes from (by bisecting starting from Julia-1.12)?

IanButterworth · 2025-12-05T14:28:15Z

There are a couple of tests added already.

I believe the issue was introduced by JuliaLang/JuliaSyntax.jl#580

JuliaSyntax/src/julia/parser.jl

fogti · 2025-12-07T09:05:24Z

JuliaSyntax/src/julia/parser.jl

+                # Check if there's a newline between `)` and the next `(` or `.`.
+                # We need to find where `)` is and check what immediately follows it.
+                # If peek(1, skip_newlines=false) is `)`, we're directly before it.
+                # Otherwise there's whitespace/newline before `)`.
+                next_token_pos = if peek(ps, 1, skip_newlines=false) == K")"
+                    # Directly before ), token after ) is at 2
+                    2
+                else
+                    # There's whitespace before ), so ) is at 2
+                    # and what follows ) is at 3
+                    3
+                end
+                token_after_paren = peek(ps, next_token_pos, skip_newlines=false)
+                # If token_after_paren is a newline, this is an anonymous function
+                has_newline_after_paren = _maybe_grouping_parens && token_after_paren == K"NewlineWs"
+                # Get the next significant token to determine if we need to parse a call
+                next_kind = peek(ps, 2, skip_newlines=_maybe_grouping_parens && !has_newline_after_paren)


I think it would make a lot of sense to split the logic with a case distinction on _maybe_grouping_parens:

Suggested change

# Check if there's a newline between `)` and the next `(` or `.`.

# We need to find where `)` is and check what immediately follows it.

# If peek(1, skip_newlines=false) is `)`, we're directly before it.

# Otherwise there's whitespace/newline before `)`.

next_token_pos = if peek(ps, 1, skip_newlines=false) == K")"

# Directly before ), token after ) is at 2

2

else

# There's whitespace before ), so ) is at 2

# and what follows ) is at 3

3

end

token_after_paren = peek(ps, next_token_pos, skip_newlines=false)

# If token_after_paren is a newline, this is an anonymous function

has_newline_after_paren = _maybe_grouping_parens && token_after_paren == K"NewlineWs"

# Get the next significant token to determine if we need to parse a call

next_kind = peek(ps, 2, skip_newlines=_maybe_grouping_parens && !has_newline_after_paren)

next_kind = if _maybe_grouping_parens

# Check if there's a newline between `)` and the next `(` or `.`.

# We need to find where `)` is and check what immediately follows it.

# If peek(1, skip_newlines=false) is `)`, we're directly before it.

# Otherwise there's whitespace/newline before `)`.

next_token_pos = if peek(ps, 1, skip_newlines=false) == K")"

# Directly before ), token after ) is at 2

2

else

# There's whitespace before ), so ) is at 2

# and what follows ) is at 3

3

end

token_after_paren = peek(ps, next_token_pos, skip_newlines=false)

# If token_after_paren is a newline, this is an anonymous function

# Get the next significant token to determine if we need to parse a call

peek(ps, 2, skip_newlines= token_after_paren != K"NewlineWs")

else

# Get the next significant token to determine if we need to parse a call

peek(ps, 2, skip_newlines=false)

end

Subsequently, it makes sense to inspect more closely what happens if _maybe_grouping_parens is true: If the next tokens are:

K")", K"NewlineWs", then we call peek(ps, 2, skip_newlines=false) = token_after_paren

K")", !K"NewlineWs", then we call peek(ps, 2, skip_newlines=true) = token_after_paren

!K")", x/*we expect K")" here, but don't check that*/, K"NewlineWs", then we call peek(ps, 2, skip_newlines=false)=token_after_paren (I think (hope I interpret it correctly) that __lookahead_index treats the case with skip_newlines=true such that it also skips newlines entirely, including in the count up to n=2)

!K")", x, !K"NewlineWs", then we call peek(ps, 2, skip_newlines=false)=token_after_paren`

So, I think we can omit the peek(ps, 2, skip_newlines= token_after_paren != K"NewlineWs") entirely.

Suggested change

# Check if there's a newline between `)` and the next `(` or `.`.

# We need to find where `)` is and check what immediately follows it.

# If peek(1, skip_newlines=false) is `)`, we're directly before it.

# Otherwise there's whitespace/newline before `)`.

next_token_pos = if peek(ps, 1, skip_newlines=false) == K")"

# Directly before ), token after ) is at 2

2

else

# There's whitespace before ), so ) is at 2

# and what follows ) is at 3

3

end

token_after_paren = peek(ps, next_token_pos, skip_newlines=false)

# If token_after_paren is a newline, this is an anonymous function

has_newline_after_paren = _maybe_grouping_parens && token_after_paren == K"NewlineWs"

# Get the next significant token to determine if we need to parse a call

next_kind = peek(ps, 2, skip_newlines=_maybe_grouping_parens && !has_newline_after_paren)

next_token_pos = if _maybe_grouping_paren

# Check if there's a newline between `)` and the next `(` or `.`.

# We need to find where `)` is and check what immediately follows it.

# If peek(1, skip_newlines=false) is `)`, we're directly before it.

# Otherwise there's whitespace/newline before `)`.

if peek(ps, 1, skip_newlines=false) == K")"

# Directly before ), token after ) is at 2

2

else

# There's whitespace before ), so ) is at 2

# and what follows ) is at 3

3

end

else

2

end

# Get the next significant token to determine if we need to parse a call

next_kind = peek(ps, next_token_pos, skip_newlines=false)

IanButterworth added the parser Language parsing and surface syntax label Nov 24, 2025

topolarity requested review from c42f and mlechu November 24, 2025 14:06

fingolfin reviewed Nov 24, 2025

View reviewed changes

JuliaSyntax/test/parser.jl Show resolved Hide resolved

lgoettgens mentioned this pull request Nov 26, 2025

Nightly book test broken (once again) oscar-system/Oscar.jl#5592

Open

IanButterworth force-pushed the ib/syntax branch from e5769e5 to 4b253d6 Compare December 3, 2025 14:57

IanButterworth force-pushed the ib/syntax branch from 4b253d6 to 2f3972c Compare December 6, 2025 09:32

IanButterworth requested a review from yuyichao December 6, 2025 13:11

fogti reviewed Dec 6, 2025

View reviewed changes

JuliaSyntax/src/julia/parser.jl Outdated Show resolved Hide resolved

fix anonymous function parsing

28bf44b

IanButterworth force-pushed the ib/syntax branch from 2f3972c to 28bf44b Compare December 6, 2025 18:16

fogti reviewed Dec 7, 2025

View reviewed changes

oscardssmith added the bugfix This change fixes an existing bug label Dec 10, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

JuliaSyntax: fix anonymous function parsing #60221

JuliaSyntax: fix anonymous function parsing #60221

IanButterworth commented Nov 24, 2025

Uh oh!

Keno commented Nov 24, 2025

Uh oh!

Uh oh!

IanButterworth commented Dec 3, 2025

Uh oh!

fogti commented Dec 5, 2025 •

edited

Loading

Uh oh!

IanButterworth commented Dec 5, 2025

Uh oh!

Uh oh!

fogti Dec 7, 2025 •

edited

Loading

Uh oh!

fogti Dec 7, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Uh oh!

JuliaSyntax: fix anonymous function parsing #60221

Are you sure you want to change the base?

JuliaSyntax: fix anonymous function parsing #60221

Conversation

IanButterworth commented Nov 24, 2025

Uh oh!

Keno commented Nov 24, 2025

Uh oh!

Uh oh!

IanButterworth commented Dec 3, 2025

Uh oh!

fogti commented Dec 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

IanButterworth commented Dec 5, 2025

Uh oh!

Uh oh!

fogti Dec 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

fogti Dec 7, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

fogti commented Dec 5, 2025 •

edited

Loading

fogti Dec 7, 2025 •

edited

Loading