What's that Unicode character in my clipboard?

54

17

Is there a quick and easy way to find the Unicode code point for any character? For example, I see a funny character on a web page, or a PDF file, or some other document.

What I currently do is copy the character to the clipboard, save it to a file, and look at the file with a hex viewer. Alternatively, I can open Microsoft Word, paste, and press Alt+X. Both of these methods are a bit cumbersome. Is there an easier way?

I use Notepad++ so if there's any way to do that with Notepad++, it would be a suitable answer (it's less cumbersome than having to open Word). Or maybe there's a way to do it with a small specialised application?

DodgyCodeException

Posted 2017-12-15T10:10:47.500

Reputation: 675

5

http://www.ltg.ed.ac.uk/~richard/utf-8.html Nice tool, worth a bookmark! – tkausl – 2017-12-16T04:14:24.907

How can the accepted answer be accepted? It does not mention Unicode at all and does not work well for Unicode. How exactly did it solve your question? Try and see that it won't work. It doesn't even display the character. – Thomas Weller – 2017-12-17T10:18:57.680

@ThomasWeller you make a good point, and I've changed my accepted answer. It seems that Notepad++ can only cope with 16-bit characters and doesn't understand code points greater than 0xFFFF, which your example character is. – DodgyCodeException – 2017-12-17T10:30:58.727

@user202729 It's not 2 years old. It says "asked Dec 15" which means "December the 15th of this year", not "December 2015". So it's only 3 days old. I don't understand why this question keeps getting upvoted. It was only a simple question about something simple that I was wondering about. – DodgyCodeException – 2017-12-18T13:15:00.543

I thought it was from 2015. As for why it's upvoted, it's because it's on the Hot Network Questions list. (And it's somewhat useful, too, though I'm not sure everyone encounters weird Unicode characters all the time.) – user202729 – 2017-12-18T13:16:43.747

You wonder why your question got so many upvotes, but it has generated a plethora of useful (and also less useful) answers, so, yeah, that deserves some merit. And it also popped up when I fired up my SE app – thymaro – 2017-12-19T08:53:32.147

1

An online tool is What Unicode character is this, http://www.babelstone.co.uk/Unicode/whatisit.html

– DrMoishe Pippik – 2018-01-08T22:14:44.657

Answers

32

I work a lot with Unicode characters, so I have written a small Windows application specifically for this:

Unicode Character Informer (Documentation)

In addition, my text editor, Rejbrand Text Editor, has extensive Unicode character support.

Andreas Rejbrand

Posted 2017-12-15T10:10:47.500

Reputation: 756

1

Thanks! That's quite useful. May I suggest F1 should be a help function, which would bring up a small window containing a link to your documentation web page. – DodgyCodeException – 2017-12-15T13:03:54.117

6

Out of curiosity, what job are you working? – Ooker – 2017-12-15T15:59:51.277

In bash, I use uniprops, which seems to output similar info. – TRiG – 2017-12-15T17:36:04.953

FYI I downloaded your unicode character informer and my virus software was very afraid of it. Wouldn't let me use it unless I added it to the white list, so I'm not going to use it. Do you have a virus-free copy? haha jk I know it probably doesn't actually have a virus in it -- but still, why does my AV hate it so much? BTW it's Symantec Endpoint Protection. – rory.ap – 2019-05-08T17:55:45.630

@rory.ap: I guarantee that it is completely free of viruses. Unfortunately, false-positive issues are common (maybe especially for applications written in Delphi). This is something I am very aware of, and something that does indeed make me sad. Today it is difficult being a spare-time Win32 software developer. I just wish people would stop making malware so serious hobby developers wouldn't be affected by false positives... Maybe code signing would improve the situation, but that isn't free, and all software I produce I give away for free. :( – Andreas Rejbrand – 2019-05-08T20:13:09.560

35

Notepad++ has a pre-installed plug-in called Converter that has an option to convert ASCII to HEX and vice versa. This is useful for data files in HEX format that must be converted to ASCII to be read:

enter image description here

That is how it works:

enter image description here
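For readers who would rather script the same kind of conversion, here is a minimal Python sketch of an ASCII-to-hex round trip. It is not how the Converter plug-in is implemented; the function names `ascii_to_hex` and `hex_to_ascii` are made up for illustration, and the sketch deliberately accepts ASCII input only.

```python
def ascii_to_hex(text: str) -> str:
    """Return the hex digits of each ASCII byte in `text`."""
    # encode("ascii") raises UnicodeEncodeError for anything outside ASCII
    return text.encode("ascii").hex()


def hex_to_ascii(hex_digits: str) -> str:
    """Decode a string of hex digit pairs back into ASCII text."""
    return bytes.fromhex(hex_digits).decode("ascii")


print(ascii_to_hex("Hi"))    # 4869
print(hex_to_ascii("4869"))  # Hi
```

Because the sketch is strict about ASCII, a character such as ♲ raises `UnicodeEncodeError` instead of producing a partial or misleading value.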

duDE

Posted 2017-12-15T10:10:47.500

Reputation: 14 097

2

Thanks! By the way, what software do you use to create an animated gif like that? – DodgyCodeException – 2017-12-15T10:56:40.053

44

Ha ha! I just noticed the word "octadecimal" in that dialog box. That means base 18. The developer means octal, of course (base 8). – DodgyCodeException – 2017-12-15T10:59:38.633

5

@DodgyCodeException I guess LICEcap

– user11153 – 2017-12-15T11:37:02.487

3

@DodgyCodeException http://www.screentogif.com/ is nice for recording animated gifs.

– David – 2017-12-15T12:38:16.553

1

@DodgyCodeException ShareX has gained in popularity.

– Gabriel Fair – 2017-12-15T14:56:40.620

1

@DodgyCodeException And it's shown working for non-ASCII characters, despite the label of the first box. – David Richerby – 2017-12-15T16:59:23.890

5

When I paste there, it says D83D. So it's only half of the truth and does NOT work well with Unicode characters. – Thomas Weller – 2017-12-17T10:17:39.173

My personal favorite for tasks like this is ezgif.com. It creates a surprisingly small GIF and has lots of tools built in.

– Tamás Sengel – 2017-12-19T09:46:19.020

17

When I'm faced with this problem, a quick Google search usually provides a quick answer. For example, when I google "😍 unicode", I get a result like this:

Google search for the "smiling face with heart shaped eyes" emoji

I like this method because:

  • It works on any computer with internet
  • You don't have to install anything
  • The keypresses required (Ctrl+C & Ctrl+T & Ctrl+V & Enter) are muscle memory actions for me, and probably for most other developers/typists.

Eric Seastrand

Posted 2017-12-15T10:10:47.500

Reputation: 271

11

There's a nice little website called Unicode Character Inspector (built by Tim Whitlock) that does just that. I find it way more convenient than a text editor or desktop program.

Baptiste Candellier

Posted 2017-12-15T10:10:47.500

Reputation: 237

3

To make that even faster, you can set up a search engine shortcut in your browser and have it take you straight to the information page about the entered character. This works because you can specify the character as a querystring parameter like https://apps.timwhitlock.info/unicode/inspect?s={some unicode character here}.

– Eric Seastrand – 2017-12-15T22:14:24.333
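As a rough illustration of the query-string idea in the comment above, this Python sketch builds such a URL for a given character. The URL prefix is the one quoted in the comment; that the site accepts a percent-encoded UTF-8 value for `s` is an assumption (it is what browsers normally send), and `inspector_url` is a hypothetical helper name.

```python
from urllib.parse import quote

# URL prefix taken from the comment above; only the value of "s" varies.
INSPECT_URL = "https://apps.timwhitlock.info/unicode/inspect?s="


def inspector_url(text: str) -> str:
    """Percent-encode `text` as UTF-8 and append it to the inspector URL."""
    return INSPECT_URL + quote(text, safe="")


print(inspector_url("\u2672"))
# https://apps.timwhitlock.info/unicode/inspect?s=%E2%99%B2
```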

9

On a Unix-like system*:

unicode -s "$(xsel -ob)"

You can alias this or create a script to run it.

The output looks like this:

U+2672 UNIVERSAL RECYCLING SYMBOL
UTF-8: e2 99 b2 UTF-16BE: 2672 Decimal: &#9842; Octal: \023162
♲ (♲)
Uppercase: 2672
Category: So (Symbol, Other)
Bidi: ON (Other Neutrals)

* It looks like the original poster is probably using Windows, but (a) this isn't specified, and (b) this solution might help others.
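Where the `unicode` utility is not installed, much of the same information can be read from Python's standard `unicodedata` module. The sketch below is an approximation of the output above, not a reimplementation of the tool; `describe` is a hypothetical helper name, and reading the clipboard is left out, so the character is passed in directly.

```python
import unicodedata


def describe(ch: str) -> dict:
    """Collect a few Unicode properties for a single character."""
    cp = ord(ch)
    return {
        "code_point": f"U+{cp:04X}",
        "name": unicodedata.name(ch, "<unnamed>"),
        "utf8": " ".join(f"{b:02x}" for b in ch.encode("utf-8")),
        "utf16be": ch.encode("utf-16-be").hex(),
        "decimal": cp,
        "octal": f"{cp:o}",
        "category": unicodedata.category(ch),
        "bidi": unicodedata.bidirectional(ch),
    }


for key, value in describe("\u2672").items():
    print(f"{key}: {value}")
```

For U+2672 this reports the name UNIVERSAL RECYCLING SYMBOL, the UTF-8 bytes e2 99 b2, category So and bidi class ON, matching the listing above.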

wchargin

Posted 2017-12-15T10:10:47.500

Reputation: 794

Thanks! Although I'm using Windows mostly, I do also use Linux, and in any case I can probably download a Windows port of this utility (such as GnuWin). – DodgyCodeException – 2017-12-16T18:41:17.667

6

You can use PowerShell!

[char]::ConvertToUtf32((gcb), 0)

This prints the first Unicode code point of the text on the clipboard.

If you don't have to worry about characters outside the Basic Multilingual Plane (that would be represented in .NET strings as a high and low surrogate), you can use this instead:

[int](gcb)[0]

If you'd prefer it in hex, you can use a format specifier:

'0x{0:x}' -f [char]::ConvertToUtf32((gcb), 0)
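The difference between the two PowerShell forms comes down to UTF-16 surrogate pairs. The Python sketch below shows the standard arithmetic for an assumed example character, U+1F60D; the helper names `utf16_units` and `combine_surrogates` are made up for illustration. A lone value such as D83D, as one commenter saw in Notepad++, is only the high half of such a pair.

```python
from typing import List


def utf16_units(ch: str) -> List[int]:
    """Return the 16-bit UTF-16 code units that encode `ch`."""
    data = ch.encode("utf-16-be")
    return [int.from_bytes(data[i:i + 2], "big") for i in range(0, len(data), 2)]


def combine_surrogates(high: int, low: int) -> int:
    """Combine a high (D800-DBFF) and low (DC00-DFFF) surrogate into a code point."""
    return 0x10000 + ((high - 0xD800) << 10) + (low - 0xDC00)


units = utf16_units("\U0001F60D")
print([f"{u:04X}" for u in units])          # ['D83D', 'DE0D']
print(f"U+{combine_surrogates(*units):X}")  # U+1F60D
```

Reading only the first code unit, as `[int](gcb)[0]` does, yields 0xD83D for this character; `ConvertToUtf32` performs the combination step and yields 0x1F60D.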

Ben N

Posted 2017-12-15T10:10:47.500

Reputation: 32 973

5

A note for any Emacs users: you can type C-u C-x = and it will give you a bunch of information about the character under the cursor, including the Unicode code point, the name in the Unicode database and the categories etc.

             position: 146 of 147 (99%), column: 0
            character: ♲ (displayed as ♲) (codepoint 9842, #o23162, #x2672)
    preferred charset: unicode (Unicode (ISO10646))
code point in charset: 0x2672
               script: symbol
               syntax: w    which means: word
             category: .:Base
             to input: type "C-x 8 RET 2672" or "C-x 8 RET UNIVERSAL RECYCLING SYMBOL"
          buffer code: #xE2 #x99 #xB2
            file code: #xE2 #x99 #xB2 (encoded by coding system utf-8-unix)
              display: by this font (glyph code)
    xft:-PfEd-Mensch-normal-normal-normal-*-16-*-*-*-m-0-iso10646-1 (#x985)

Character code properties: customize what to show
  name: UNIVERSAL RECYCLING SYMBOL
  general-category: So (Symbol, Other)
  decomposition: (9842) ('♲')

Neil Roberts

Posted 2017-12-15T10:10:47.500

Reputation: 151

3

Got Vim? Just paste it in, put your cursor on it, and hit ga. I use this all the time for weird characters.

SilverWolf - Reinstate Monica

Posted 2017-12-15T10:10:47.500

Reputation: 141

2

I use http://unicode.scarfboy.com, which is simple and works well.

No don't shown my real name

Posted 2017-12-15T10:10:47.500

Reputation: 121

2

Please add a brief description of what the site does, if there's any specific way to use it, and how it specifically addresses the OP's question. A one-line answer usually isn't sufficient to truly answer a question. – computercarguy – 2017-12-15T19:05:19.943

2

I find Richard Ishida's Unicode code converter (github link) very useful for finding Unicode character codes, amongst other things. It also provides translations/conversions to other code points, encodings and, for instance, escape sequences.

Unicode Converter

You may also want to check out Richard Ishida's main webpage (rishida.net), as it contains (links to) a lot of valuable tools and information, especially if you're interested in internationalisation and character encoding. For instance, another very useful tool linked there is his Uniview tool (github link).

Uniview

And finally, also very useful, I find, although mostly relevant to Mac users, is macOS's Character Viewer, accessible through the Input Menu, which can be enabled in System Preferences > Keyboard.

Although the Apple support website mainly focuses on how to insert emojis (…), the Character Viewer is actually very useful for looking up specific ('special') characters and their code points in several different encodings, as well as for finding which fonts on your system contain specific glyphs.

Character Viewer

Cheers!

arri

Posted 2017-12-15T10:10:47.500

Reputation: 131

Character Viewer as pictured here is, I think, at least one major revision (of the OS) old; High Sierra gives a much less convenient “Show Emoji & Symbols”. – Anton Sherwood – 2017-12-16T21:05:12.837

@AntonSherwood nothing has changed that I can tell. You can decide in the Emoji and Symbols option "Customize List" which tables to show. The Unicode one isn't visible by default, I believe. – user0721090601 – 2017-12-17T03:04:45.243

I can confirm that Sierra has the full Characters panel–High Sierra doesn't? – SilverWolf - Reinstate Monica – 2017-12-17T20:58:32.853

Well, we learn something every day. Thanks @guifa. I made it somewhat more to my liking – but can't reproduce the screenshot shown. – Anton Sherwood – 2017-12-18T21:42:50.603

2

You can also use the following site: https://unicode-table.com/en/ Just paste your character, and you'll get a Unicode code point and HTML code as well.

Alina Ladygina

Posted 2017-12-15T10:10:47.500

Reputation: 21

1

If you have Microsoft Word, paste the text there, select the character (or click to the right of it), and press Alt+X.

G-Man Says 'Reinstate Monica'

Posted 2017-12-15T10:10:47.500

Reputation: 6 509

This is already provided by the question itself. – Stephan Bijzitter – 2017-12-18T15:50:17.033

0

I am going to mention http://amp-what.com/ as it is really easy to use with its quick search field and supports different notations (& code, Unicode codepoints, URI encode character sequence).

Example image

493msi

Posted 2017-12-15T10:10:47.500

Reputation: 1

0

Here's one more answer using an idea from user202729:

Bookmark the URL javascript:alert(prompt().codePointAt(0).toString(16)) and use a browser to run it. (Works on Chrome and Firefox. Doesn't appear to work on IE but this may be due to security settings.)

Unlike other answers, no internet connection is required, no external utility to download, not OS-specific.
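One limitation worth knowing: `codePointAt(0)` reports only the first code point, while something that looks like a single character on screen may consist of several. A minimal Python sketch (the helper name `code_points` is hypothetical) that lists every code point in a pasted string:

```python
def code_points(text: str) -> list:
    """Return every code point in `text` in U+XXXX notation."""
    # Iterating over a Python str yields whole code points, not UTF-16 units.
    return [f"U+{ord(ch):04X}" for ch in text]


print(code_points("e\u0301"))      # ['U+0065', 'U+0301']
print(code_points("\U0001F60D"))   # ['U+1F60D']
```

The first example is an "e" followed by a combining acute accent, which typically renders as one character but is two code points.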

DodgyCodeException

Posted 2017-12-15T10:10:47.500

Reputation: 675