2025-09-12 Hacker News Top Articles and Their Summaries
1. UTF-8 is a brilliant design
Total comment count: 23
Summary: UTF-8 is a Unicode encoding that uses 1 to 4 bytes per character and stays backward-compatible with ASCII for the first 128 code points. The leading bits of the first byte indicate how many bytes the sequence contains; every continuation byte starts with the bits 10. For example, the Devanagari letter अ (U+0905) is encoded in three bytes. A text like ‘Hey👋 Buddy’ contains a non-ASCII character and takes 13 bytes in total, while the ASCII-only text ‘Hey Buddy’ is already valid UTF-8 at 9 bytes...
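
The byte counts in the summary can be checked with a minimal Python sketch. The helper names `utf8_length_from_lead_byte` and `is_continuation` are hypothetical and exist only for this illustration; the sketch assumes the standard lead-byte bit patterns (0xxxxxxx, 110xxxxx, 1110xxxx, 11110xxx) and is not a full UTF-8 validator.

```python
def utf8_length_from_lead_byte(lead: int) -> int:
    """Return the sequence length implied by the top bits of a lead byte.

    Hypothetical helper for illustration; it does not reject overlong
    forms or other invalid sequences.
    """
    if lead >> 7 == 0b0:        # 0xxxxxxx: single-byte (ASCII) character
        return 1
    if lead >> 5 == 0b110:      # 110xxxxx: two-byte sequence
        return 2
    if lead >> 4 == 0b1110:     # 1110xxxx: three-byte sequence
        return 3
    if lead >> 3 == 0b11110:    # 11110xxx: four-byte sequence
        return 4
    raise ValueError(f"0x{lead:02x} is not a valid UTF-8 lead byte")


def is_continuation(byte: int) -> bool:
    """True if the byte has the 10xxxxxx continuation pattern."""
    return byte >> 6 == 0b10


# Devanagari letter A (U+0905) and the two greetings from the summary.
a = "\u0905".encode("utf-8")
greeting = "Hey\U0001F44B Buddy".encode("utf-8")
plain = "Hey Buddy".encode("utf-8")

print(a.hex(" "))                         # e0 a4 85
print(utf8_length_from_lead_byte(a[0]))   # 3
print(len(greeting), len(plain))          # 13 9
```

The waving-hand emoji accounts for 4 of the 13 bytes; the remaining 9 characters are ASCII and take one byte each, which is why the plain greeting encodes to the same bytes in ASCII and UTF-8.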