Picture to People: Understanding Unicode characters

What is Picture to People ?

"Picture to People" (P2P) is a huge Computer Graphics project. It was started to create new softwares able to make 2D drawing, 3D rendering, vexel drawing, text effects, photo effects, image filtering and other complex Computer Graphics operations. It has been made from scratch, including its low level Computer Graphics libraries like Maccala. Nowadays, most final features produced for this project are released as free online tools available from its official website. This blog talks about Computer Graphics, mainly concerning Picture to People development.

Tuesday, July 23, 2013

Understanding Unicode characters

In a previous post, I have talked about my adventures trying to understand and implement character encodes used to represent Unicode strings.

Despite it needs much more tests, I have been successfully implementing a library that can understand and manipulate strings in the following Unicode representations:
- UTF8;
- UTF16-LE;
- UTF16-BE;
- UTF32-LE;
- UTF32-BE.

By implementing the encodes above, I will automatically be able to deal with other simpler standards, like for example:
- Latin 1;
- UC2;
- UC4.

But what does it all mean? It means I will understand the binary encode of Unicode strings, or, in other words, I will know what code points a string represents. Code points are unique numbers representing characters or symbols.

It's pretty nice, but it's just half the way to go. Since I have the code points of a string, I know for sure what it should mean, but I don't know how the related chars or symbols look like. There is a long and hard way for a complete solution.

Subscribe in a reader

What is Picture to People ?

Tuesday, July 23, 2013

Understanding Unicode characters

No comments: