Video Discription |
EVENT: Internationalization & Unicode Conference 45 (IUC 45)
DATE: October 13-15, 2021
DESCRIPTION:
We will present and discuss new developments in the Unicode regular expressions standard (UTS #18) and related proposals for improvements in JavaScript (ECMAScript) regular expressions.
Driven by developer demand for detecting and working with emoji in web apps, the UTC has added
definitions for “properties of strings” and clarified how those interact with regular expression “character classes”. We are working with Ecma TC39 on proposals to bring several emoji properties to JavaScript developers, as well as set operator syntax to make it possible and easy to build character classes based on properties but with additions and exceptions, which are often necessary. We are aiming for developers to be able to write concise, readable, self-updating regular expressions, replacing many kilobytes of generated character class data or custom code.
More generally, increasing support for multi-character strings will also help remove hurdles for handling “characters” that are encoded as sequences of code points.
BIOS:
Mathias Bynens, JavaScript Whisperer, Google, LLC
Mathias is a web standards enthusiast from Belgium who currently works on Chrome DevTools. He likes HTML, CSS, JavaScript, Unicode, performance, and security.
Mark Davis, Lead Internationalization Architect, Google, Inc.
Mark Davis is the president and co-founder of the Unicode Consortium. He also founded the Unicode ICU project, the premier Unicode software internationalization library, and also co-founded and is the chair of the Unicode CLDR project, providing language-based data for internationalization. The Unicode Standard, CLDR, and ICU are used by every modern computer and mobile phone to support different languages. More recently, he was one of the team that introduced emoji into Unicode and was the co-founder and chair of the Unicode Emoji Subcommittee up through 2019.
Davis is one of the key technical contributors to other Unicode specifications, being the primary author or co-author of LDML (structured data for languages and locales), Collation (sorting and searching different languages), the Bidirectional Algorithm (used worldwide to display Arabic and Hebrew text), Normalization, Scripts, Text Segmentation, Identifiers, Regular Expressions, Security, and Emoji. He is a co-author of BCP 47, the specification used for identifying languages and countries in modern software. The contributions to Unicode reflect Davis’s software internationalization work over the course of his career, starting with Apple in 1985, continuing at Taligent and IBM, and then at Google since 2006. His goal has always been to advance, broaden and deepen the support for languages in the digital world, and he is very grateful for the support for this goal from these companies.
Marcus Scherer, Unicode Software Engineer, Google LLC
Markus Scherer is a tech lead in the Google software internationalization team, focusing on the effective use of Unicode and on the development and deployment of cross-product internationalization libraries. He has been a major contributor to ICU since 1999 and designed and developed significant portions of the character conversion, BiDirectional support, normalization, Unicode properties, and collation functionality. He is the Chair of the ICU Technical Committee, and the Chair of the Unicode/UTC Properties & Algorithms group.
LINKS:
Visit the Unicode Website: https://home.unicode.org
Get Started with Unicode:
- Technical Quick Start Guide: https://home.unicode.org/technical-quick-start-guide/
- Overview of Internationalization and Unicode Projects: YouTube Playlist: https://www.youtube.com/playlist?list=PLMc927ywQmTNQrscw7yvaJbAbMJDIjeBh
Become a Member:
- Organizations: https://home.unicode.org/membership/membership-levels/
- Individuals: https://www.unicode.org/consortium/joinform.html
Learn Different Ways to Support Unicode: https://home.unicode.org/support-unicode/ [ZjLk67itwuw] |