JavaTokenParser.ident doesn't correctly parse Java identifiers #6478

scabug · 2012-10-05T02:14:25Z

I assume JavaTokenParser.ident is meant to parse all valid Java identifiers? If so, it doesn't. For example, MODULE$ is a valid Java identifier (one that Scala itself uses extensively), but it won't parse. Java identifiers can also contain non-ASCII Unicode letters, so ø is a valid identifier. Fortunately, Java provides a very simple way to parse Java identifiers, using the following regular expression:

\p{javaJavaIdentifierStart}\p{javaJavaIdentifierPart}*

It uses the Character.isJavaIdentifierStart and Character.isJavaIdentifierPart methods for the first character and the subsequent characters respectively, so it matches every valid Java identifier. It also matches keywords, which are not valid identifiers, so those still have to be excluded separately.
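To make the behaviour of that regular expression concrete, here is a minimal sketch in plain Java. The class name `JavaIdentCheck` and the helper `matchesIdent` are made up for illustration and are not part of the Scala library or of the pull request; only `java.util.regex.Pattern` and the pattern quoted above come from the JDK and from this report.

```java
import java.util.regex.Pattern;

// Hypothetical helper, for illustration only.
final class JavaIdentCheck {
    // The regular expression quoted above. The javaJavaIdentifierStart and
    // javaJavaIdentifierPart classes delegate to
    // Character.isJavaIdentifierStart and Character.isJavaIdentifierPart.
    static final Pattern IDENT =
        Pattern.compile("\\p{javaJavaIdentifierStart}\\p{javaJavaIdentifierPart}*");

    // True when the whole string matches the pattern. Keywords such as
    // "class" match as well, so a real parser must reject them separately.
    static boolean matchesIdent(String s) {
        return IDENT.matcher(s).matches();
    }

    public static void main(String[] args) {
        String[] samples = {"MODULE$", "_with", "caf\u00e9", "3start", "with-dash", "class"};
        for (String s : samples) {
            System.out.println(s + " -> " + matchesIdent(s));
        }
    }
}
```

With these inputs, MODULE$, _with and café match, while 3start and with-dash do not. class matches too, which shows why keywords need a separate check.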

For language spec nuts:

http://docs.oracle.com/javase/specs/jls/se7/html/jls-3.html#jls-3.8

scabug · 2012-10-05T02:14:25Z

Imported From: https://issues.scala-lang.org/browse/SI-6478?orig=1
Reporter: @jroper
Assignee: @JamesIry
Affected Versions: 2.9.2

scabug · 2012-10-05T02:23:41Z

@jroper said:
Pull request here: scala/scala#1466

scabug · 2012-11-15T04:19:10Z

@jedesah said:
I see the pull request got merged.

If the bug has been fixed we might want to go ahead and close this.

scabug · 2013-01-22T23:53:02Z

@adriaanm said:
reopening for 2.10.1-RC1 backport

scabug · 2013-02-07T23:13:07Z

@JamesIry said:
scala/scala#2091

scabug closed this as completed Feb 8, 2013

scabug added the quickfix, backport, and has PR labels Apr 7, 2017

scabug added this to the 2.10.1 milestone Apr 7, 2017
