Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

RAW/UCS string codes? #3

Open
dpkristensen opened this issue Aug 3, 2018 · 0 comments
Open

RAW/UCS string codes? #3

dpkristensen opened this issue Aug 3, 2018 · 0 comments
Labels
enhancement New feature or request question Further information is requested

Comments

@dpkristensen
Copy link
Owner

Should allow proprietary text formats by defining a "RAW" string code. Ex: RAW-8, etc... Terminators and Padding characters may be specified if needed, meaning the specification writer only has to define a compatible length determination method.

This would have the effect for all intents and purposes of having 1:1 correspondence with Unicode code points when it is passed to the application; which means it starts to sound a lot like UCS (ISO/IEC 10646) in practical terms.

I think UCS should be supported for sure (UCS-8, etc...); the RAW encoding would potentially need some way to integrate into the parser. So the source encoding probably must be available to the application layer in some form. RAW encodings should not have any textual representation in a GUI, other than as a hex-coded byte stream of the appropriate width (ex: 2-byte hex for RAW-8).

One benefit of RAW encoding support is to use RAW-8 as a fall-back for unknown encodings.

@dpkristensen dpkristensen added enhancement New feature or request question Further information is requested labels Aug 3, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request question Further information is requested
Projects
None yet
Development

No branches or pull requests

1 participant