UTF-8: Some Printable Characters

Characters

UTF-8: Some Printable Characters

Introduction

This page is for me for quick reference and it's listing some of the printable characters in UTF-8 including UTF-8 codes, HTML numbers, HTML names, and descriptions. When I started creating Google News Quotes pages, some of the pages I was quoting were using special characters that couldn't just be copied and shown correctly, so I had to do some research and find the equivalent HTML names or numbers to represent them. Some web pages are using incorrect characters for quoting, so in addition to listing characters that are used correctly or incorrectly as quotation marks I'm also prepared with some extra characters that could be used incorrectly for quoting in the future. Please note that some special characters add or remove whole or partial spacing characters, so the columns are not perfectly lined in every case and I'm not using HTML tables to correct that, because this way you can easily copy and paste this list for your own testing purposes.

List 1 has a maximum line length of 70 characters, so it should be easy to copy and paste the list to your Usenet newsreader, email application or web forum (if testing is allowed on a certain newsgroup or forum). List 2 has longer lines (longest line length is 83 characters) as it contains more information per line (HTML names and HTML numbers). Note that using List 2 the test lines will probably brake in the wrong places depending on your Usenet/email application settings or forum software, so use List 1 for copy-paste testing and List 2 for looking up the corresponding HTML name or HTML number for a UTF-8 character.

List 1: Shorter Lines For Copy-Pasting

QUOTATION MARKS, ETC.

=====================

Char Code Description

---- ---- -----------

" U+0022 quotation mark

« U+00AB left-pointing double angle quotation mark

» U+00BB right-pointing double angle quotation mark

‘ U+2018 left single quotation mark

’ U+2019 right single quotation mark

‚ U+201A single low-9 quotation mark

‛ U+201B single high-reversed-9 quotation mark

“ U+201C left double quotation mark

” U+201D right double quotation mark

„ U+201E double low-9 quotation mark

′ U+2032 prime

″ U+2033 double prime

‹ U+2039 single left-pointing angle quotation mark

› U+203A single right-pointing angle quotation mark

〞 U+301E double prime quotation mark


DIACRITICS

==========

Char Code Description

---- ---- -----------

' U+0027 apostrophe

` U+0060 grave accent

¨ U+00A8 diaeresis or umlaut

´ U+00B4 acute accent

¸ U+00B8 cedilla

˝ U+02DD double acute accent


COMBINING DIACRITICS

====================

Char Code Description

---- ---- -----------

̀ U+0300 combining grave accent

̃ U+0303 combining tilde


MODIFIER CHARACTERS

===================

Char Code Description

---- ---- -----------

ʻ U+02BB modifier letter turned comma

ʹ U+0374 Greek numeral sign

᾿ U+1FBF Greek psili


FINNISH CHARACTERS

==================

Char Code Description

---- ---- -----------

å U+00E5 latin small letter a with ring above

ä U+00E4 latin small letter a with diaeresis

ö U+00F6 latin small letter o with diaeresis


MATHEMATICAL NOTATION

=====================

Char Code Description

---- ---- -----------

Ṽ U+1E7C latin capital letter V with tilde


HORIZONTAL LINES

================

Char Code Description

---- ---- -----------

– U+2013 en dash

— U+2014 em dash

― U+2015 horizontal bar

− U+2212 minus sign


INTELLECTUAL PROPERTY SYMBOLS

=============================

Char Code Description

---- ---- -----------

© U+00A9 copyright

® U+00AE registered sign

™ U+2122 trade mark sign


OTHER CHARACTERS

================

Char Code Description

---- ---- -----------

á U+00E1 latin small letter a with acute

é U+00E9 latin small letter e with acute

• U+2022 bullet

∼ U+223C tilde operator

List 2: More Info Per Line

QUOTATION MARKS, ETC.

=====================

Char Code Number Name Description

---- ---- ------ ---- -----------

" U+0022 " " quotation mark

« U+00AB « « left-pointing double angle quotation mark

» U+00BB » » right-pointing double angle quotation mark

‘ U+2018 ‘ ‘ left single quotation mark

’ U+2019 ’ ’ right single quotation mark

‚ U+201A ‚ ‚ single low-9 quotation mark

‛ U+201B ‛ single high-reversed-9 quotation mark

“ U+201C “ “ left double quotation mark

” U+201D ” ” right double quotation mark

„ U+201E „ „ double low-9 quotation mark

′ U+2032 ′ ′ prime

″ U+2033 ″ ″ double prime

‹ U+2039 ‹ ‹ single left-pointing angle quotation mark

› U+203A › › single right-pointing angle quotation mark

〞 U+301E 〞 double prime quotation mark


DIACRITICS

==========

Char Code Number Name Description

---- ---- ------ ---- -----------

' U+0027 ' apostrophe

` U+0060 ` grave accent

¨ U+00A8 ¨ ¨ diaeresis or umlaut

´ U+00B4 ´ ´ acute accent

¸ U+00B8 ¸ ¸ cedilla

˝ U+02DD ˝ double acute accent


COMBINING DIACRITICS

====================

Char Code Number Name Description

---- ---- ------ ---- -----------

̀ U+0300 ̀ combining grave accent

̃ U+0303 ̃ combining tilde


MODIFIER CHARACTERS

===================

Char Code Number Name Description

---- ---- ------ ---- -----------

ʻ U+02BB ʻ modifier letter turned comma

ʹ U+0374 ʹ Greek numeral sign

᾿ U+1FBF ᾿ Greek psili


FINNISH CHARACTERS

==================

Char Code Number Name Description

---- ---- ------ ---- -----------

å U+00E5 å å latin small letter a with ring above

ä U+00E4 ä ä latin small letter a with diaeresis

ö U+00F6 ö ö latin small letter o with diaeresis


MATHEMATICAL NOTATION

=====================

Char Code Number Name Description

---- ---- ------ ---- -----------

Ṽ U+1E7C Ṽ latin capital letter V with tilde


HORIZONTAL LINES

================

Char Code Number Name Description

---- ---- ------ ---- -----------

– U+2013 – – en dash

— U+2014 — — em dash

― U+2015 ― horizontal bar

− U+2212 − − minus sign


INTELLECTUAL PROPERTY SYMBOLS

=============================

Char Code Number Name Description

---- ---- ------ ---- -----------

© U+00A9 © © copyright

® U+00AE ® ® registered sign

™ U+2122 ™ ™ trade mark sign


OTHER CHARACTERS

================

Char Code Number Name Description

---- ---- ------ ---- -----------

á U+00E1 á á latin small letter a with acute

é U+00E9 é é latin small letter e with acute

• U+2022 • • bullet

∼ U+223C ∼ ∼ tilde operator

Last modified: January 28th, 2011

Author: Tomi Häsä (tomi.hasa@gmail.com)

URL: http://sites.google.com/site/tomihasa/utf-8-printable

Back to Printable Characters.