QuadR, 65 bytes

Question

The famous constructed language Esperanto uses the Latin alphabet (mostly; see the linked Wikipedia page for details). However, there are some characters with accents: ĉ, ĝ, ĥ, ĵ, ŝ, and ŭ (c-circumflex, g-circumflex, h-circumflex, j-circumflex, s-circumflex, and u-breve). Naturally, these characters are very hard to type. Even for this question, I had to search in the Unicode selector for the characters. Because of this, a convention using the letter "x" has been developed for electronic use. For example, "cxu" is used for "ĉu". (Note: the letter "x" is not normally used in the Esperanto alphabet.)

However, I am a language purist! This *air quote* x nonsense is killing me! I need a program to fix this, preferably as short as possible so I can type it into my terminal as fast as possible!

Challenge

Your mission is to take a string of Esperanto using x-convention and convert it to real Esperanto.

In effect, you have to map:

cx: ĉ
gx: ĝ
hx: ĥ
jx: ĵ
sx: ŝ
ux: ŭ
Cx: Ĉ
Gx: Ĝ
Hx: Ĥ
Jx: Ĵ
Sx: Ŝ
Ux: Ŭ

All other printable ASCII characters should be accepted and not changed. Unicode would be nice, but not necessary.

Input and output can be in any format reasonable to your language. Good luck!
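For reference, the mapping above can be sketched in ungolfed Python. This is only an illustration of the spec, not a competitive answer; the names `ACCENTED` and `from_x_convention` are made up for this example.

```python
import re

# Lookup table built from the twelve pairs listed in the challenge.
# (Hypothetical helper names; nothing here comes from a posted answer.)
ACCENTED = dict(zip("cghjsuCGHJSU", "ĉĝĥĵŝŭĈĜĤĴŜŬ"))


def from_x_convention(text: str) -> str:
    """Replace c, g, h, j, s, u (either case) followed by a lowercase x."""
    # re.sub scans left to right without overlapping matches, so "cxx"
    # consumes only the first "cx" and leaves the trailing "x" untouched.
    return re.sub(r"([cghjsuCGHJSU])x", lambda m: ACCENTED[m.group(1)], text)


if __name__ == "__main__":
    print(from_x_convention("Cxu sxi sxatas katojn aux hundojn?"))
```

Every character outside the twelve listed pairs passes through unchanged, which is what the challenge asks for with other printable ASCII.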

Testcases

"input" : "output"
_____________
"gxi estas varma" : "ĝi estas varma"
"Cxu sxi sxatas katojn aux hundojn?" : "Ĉu ŝi ŝatas katojn aŭ hundojn?"
"Uxcxsxabcd(hxSx)efg{};" : "Ŭĉŝabcd(ĥŜ)efg{};"
"qwertyuiop" : "qwertyuiop"
" " : " "
"" : ""
"x" : "x"
"xc" : "xc"
"xcx" : "xĉ"
"cxx" : "ĉx"

Scoring

This is code-golf. Answers are scored by smallest bytecount in the language's default encoding.

Here is a Stack Snippet to generate both a regular leaderboard and an overview of winners by language.

To make sure that your answer shows up, please start your answer with a headline, using the following Markdown template:

# Language Name, N bytes

where N is the size of your submission. If you improve your score, you can keep old scores in the headline, by striking them through. For instance:

# Ruby, <s>104</s> <s>101</s> 96 bytes

If you want to include multiple numbers in your header (e.g. because your score is the sum of two files or you want to list interpreter flag penalties separately), make sure that the actual score is the last number in the header:

# Perl, 43 + 2 (-p flag) = 45 bytes

You can also make the language name a link which will then show up in the leaderboard snippet:

# [><>](http://esolangs.org/wiki/Fish), 121 bytes


Good luck, have fun, and feel free to suggest improvements!

Clarifications:

You only need to worry about printable ASCII characters.
You only need to output a character that looks like the correct output. Yes, this means you can tack the accent onto the standard character.
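The second clarification can be illustrated with a rough Python sketch that keeps the base letter and appends a combining accent instead of emitting a precomposed character. It assumes the combining circumflex (U+0302) and combining breve (U+0306) are the intended accents; the function name is hypothetical.

```python
import re
import unicodedata

COMBINING_CIRCUMFLEX = "\u0302"  # used for c, g, h, j, s
COMBINING_BREVE = "\u0306"       # used for u


def from_x_convention_combining(text: str) -> str:
    """Swap the trailing x for a combining accent on the preceding letter."""
    def repl(match):
        letter = match.group(1)
        mark = COMBINING_BREVE if letter in "uU" else COMBINING_CIRCUMFLEX
        return letter + mark

    return re.sub(r"([cghjsuCGHJSU])x", repl, text)


if __name__ == "__main__":
    decomposed = from_x_convention_combining("aux")
    # NFC normalization folds letter + accent into the precomposed character.
    print(unicodedata.normalize("NFC", decomposed) == "aŭ")
    # Each combining accent occupies two bytes in UTF-8.
    print(len(COMBINING_CIRCUMFLEX.encode("utf-8")))
```

The output looks the same as the precomposed form in most fonts, but the strings differ at the code-point level unless they are normalized.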

OldBunny2800

Posted 2017-11-28T01:08:20.037

Reputation: 1 379

ASCII here means 20-7E printable characters, 00-7F, or what? – user202729 – 2017-11-28T02:06:24.693

All the printable ones. – OldBunny2800 – 2017-11-28T02:07:11.237

Note: I added a clarification that you can use the letter and the modifier accent. – OldBunny2800 – 2017-11-28T02:15:23.767

Combining circumflex is at 0302 ̂, and combining breve is at 0306 ̆. – user202729 – 2017-11-28T02:23:39.487

^ Each one takes 2 bytes in UTF-8, as TIO counts. – user202729 – 2017-11-28T02:28:53.730

@user202729 A language purist would most probably hate combining chars, but those are actually easy to type with compose key. – Erik the Outgolfer – 2017-11-28T12:25:25.943

@EriktheOutgolfer what do you mean “compose key”? – OldBunny2800 – 2017-11-28T12:26:54.347

I have to point out that your second test sentence, although grammatically correct, should more likely end with an "n" ("hundojn") – etuardu – 2017-11-28T14:48:33.800

My bad, thanks. Mi pardonpetas. – OldBunny2800 – 2017-11-28T14:51:26.317

Try feeding it the input "Linux" – Arturo Torres Sánchez – 2017-11-28T21:00:29.497

@EriktheOutgolfer Why would a language purist care for different representations of the same grapheme? – Arturo Torres Sánchez – 2017-11-28T21:01:50.230

Parse my Esperanto!

Challenge

Testcases

Scoring

Clarifications:

Answers

QuadR, 65 bytes

Retina, 27 bytes

C, 173 154 bytes

Python 3, 81 bytes

///, 75 bytes

Python 3, 95 bytes

Retina, 55 bytes

Perl 5, 101 + 1 (`-p`) = 102 bytes

JavaScript (ES6), 92 bytes

C, 145 144 bytes

QuadR, 25 bytes

APL (Dyalog Unicode), 57 bytes

Perl 5, 49 + 2 (`-p -C`) = 61 51 bytes

J, 64 63 bytes

R, 75 70 bytes

Explanation

Befunge, 2x48 +1 = 99 bytes

How it works

Mathematica, 81 bytes or 57 bytes

CJam, 51 bytes

sed, 108 bytes

PowerShell, 58 bytes

Clojure, 126 115 bytes

JavaScript (ES6), 91 bytes

Scala, 110 bytes

JavaScript, 35 chars, 36 bytes

sed, 40 bytes (38 chars)
