Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support extended Latin characters in auto-completion list #36

Closed
zufuliu opened this issue Aug 5, 2018 · 2 comments
Closed

Support extended Latin characters in auto-completion list #36

zufuliu opened this issue Aug 5, 2018 · 2 comments

Comments

@zufuliu
Copy link
Owner

zufuliu commented Aug 5, 2018

Currently only _0-9A-Za-z and some punctuation (depends on current scheme) is included when populate auto-completion list, because other characters can't (or not recommended to) be used as an identifier in most programming languages.
However, this will truncate words in many languages. e.g. Café is truncated as Caf.

@zufuliu
Copy link
Owner Author

zufuliu commented Aug 6, 2018

Implement this will require decode bytes retrieved from Scintilla to Unicode code point (UTF-32), and then checking character ranges.

For Unicode (UTF-8, UTF-16) encoding, decoding is simple.
For one byte extended ANSI (like Windows-1252) encoding, decoding need a map or a call to MultiByteToWideChar(), which is slow.
For DBCS/MBCS (CJK) encoding, decoding need a call to MultiByteToWideChar(), which is more slower than one byte extended ANSI.

In commit 7063 (0d5edc93e280), Scintilla introduced some new methods for indexing document, maybe useful to implement this.

zufuliu added a commit that referenced this issue Jan 13, 2019
zufuliu added a commit that referenced this issue Jan 13, 2019
…an (Windows-1250), Cyrillic (Windows-1251), Western European (Windows-1252), Greek (Windows-1253), Turkish (Windows-1254), Hebrew (Windows-1255), Arabic (Windows-1256), Baltic (Windows-1257), Vietnamese (Windows-1258), and Thai (Windows-874).

Improve word selection and auto-completion (issue #36) for ANSI-encoded documents on system with these code pages.
@zufuliu
Copy link
Owner Author

zufuliu commented Jan 13, 2019

Fixed by commit c4f533e and f449b46.

For DBCS (Chinese, Japanese, and Korean), see https://sourceforge.net/p/scintilla/feature-requests/1226/.

@zufuliu zufuliu closed this as completed Jan 13, 2019
@zufuliu zufuliu added the font label Dec 25, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant