Find out which ending is the most common in a language

0

1

I would like to work out what the most useful TLDs are for domain hacking (i.e., creating short FQDNs that spell words by combining with a TLD, e.g., ho.me) in English.

Here is a list of public suffixes https://publicsuffix.org/list/public_suffix_list.dat, which I'd like to check against words in the English language to see which are most popular.

How can I achieve this, e.g., using a script?

michaelmcandrew

Posted 2018-12-18T13:25:23.057

Reputation: 119

Question was closed 2018-12-18T13:55:18.707

Fair enough. I decided to answer it myself with a quick bash script. Feel free to close in any case - I appreciate that it is off topic. – michaelmcandrew – 2018-12-18T13:50:29.233

This might be more suitable on code golf?

– Attie – 2018-12-18T13:53:13.817

Yes - I thought the same, but wondered if it was too easy. Feel free to move there if appropriate. – michaelmcandrew – 2018-12-18T13:55:52.840

2I guess if the question could be slightly re-worded it could stay here, say, you already have an input list, and you're looking for a way to analyze it using a script … that's the kind of question we get often. – slhck – 2018-12-18T13:56:56.603

OK - I've edited it a bit. – michaelmcandrew – 2018-12-18T14:04:14.500

I took the liberty of editing a little bit, assuming you were using Bash. Voted to reopen. – slhck – 2018-12-18T14:07:39.617

Answers

1

This bash script works, using a list of tlds from https://publicsuffix.org/list/public_suffix_list.dat

for TLD in $(grep -E '^[a-z]+$' public_suffix_list.dat); do echo $(grep $TLD /usr/share/dict/words | wc -l) $TLD; done | grep -v ^0 | sort -nr

Top 10 are:

16147 in
13129 es
9573 ng
9276 re
9174 at
8625 st
8331 ing
7383 ar
6762 li
6469 al

See https://gist.github.com/michaelmcandrew/a7de275eb57053206a17c6e9316ea86a for a full list.

michaelmcandrew

Posted 2018-12-18T13:25:23.057

Reputation: 119

What is .ing? It's not a top-level domain. – Scott – 2018-12-23T06:59:18.233