Hyphen

The hyphen is a punctuation mark used to join words, and to separate syllables of a single word. The use of hyphens is called hyphenation.[1] Non-hyphenated is an example of a hyphenated word. The hyphen should not be confused with dashes (figure dash , en dash , em dash , Horizontal bar ), which are longer and have different uses, or with the minus sign , which is also longer and more vertically centred in some typefaces.

Hyphen
-
Hyphen-minus Non-breaking hyphen Hyphen bullet

As an orthographic concept, the hyphen is a single entity. In terms of character encoding and display, the entity is represented by any of several characters and glyphs (including hard hyphens, soft or optional hyphens, and nonbreaking hyphens), depending on the context of use (discussed below).

Although hyphens are not to be confused with en dashes and minus signs, there are some overlaps in usage (in which either a hyphen or an en dash may be acceptable, depending on user preference, as discussed below) and in informal typing (which often uses the same character  called a "hyphen-minus"  to represent any of the hyphen, the minus sign and the en dash, see below).

Etymology

The word is derived from Ancient Greek ὑφ᾽ ἕν (hyph' hén), contracted from ὑπό ἕν (hypó hén), "in one" (literally "under one").[2][3] The word (ἡ) ὑφέν ((he) hyphén) was used for an undertie-like sign written below two consecutive letters to indicate that they belong to the same word when it was necessary to avoid ambiguity, before the space 'character' was in regular use.

Use in English

The English language does not have definitive hyphenation rules,[4] though various style guides provide detailed usage recommendations, and have a significant amount of overlap in what they advise. Hyphens are mostly used to break single words into parts, or to join ordinarily separate words into single words. Spaces are not placed between a hyphen and either of the elements it connects except when using a suspended or "hanging" hyphen that stands in for a repeated word (e.g., nineteenth- and twentieth-century writers). Style conventions that apply to hyphens (and dashes) have evolved to support ease of reading in complex constructions; editors often accept deviations if they aid rather than hinder easy comprehension.

The use of the hyphen in English compound nouns and verbs has, in general, been steadily declining. Compounds that might once have been hyphenated are increasingly left with spaces or are combined into one word. Reflecting this changing usage, in 2007, the sixth edition of the Shorter Oxford English Dictionary removed the hyphens from 16,000 entries, such as fig-leaf (now fig leaf), pot-belly (now pot belly) and pigeon-hole (now pigeonhole).[5] The increasing prevalence of computer technology and the advent of the Internet have given rise to a subset of common nouns that might have been hyphenated in the past (e.g. toolbar, hyperlink, pastebin).

Despite decreased use, hyphenation remains the norm in certain compound-modifier constructions and, among some authors, with certain prefixes (see below). Hyphenation is also routinely used as part of syllabification in justified texts to avoid unsightly spacing (especially in columns with narrow measure, as when used with newspapers).

Separating

Justification and line-wrapping

When flowing text, it is sometimes preferable to break a word in half so that it continues on another line rather than moving the entire word to the next line. The word may be divided at the nearest break point between syllables (syllabification), and a hyphen inserted to indicate that the letters form a word fragment, rather than a full word. This allows more efficient use of paper, allows flush appearance of right-side margins (justification) without oddly large word spaces, and decreases the problem of rivers. This kind of hyphenation is most useful when the width of the column (called the measure in typography) is very narrow. For example:

Justified text
without hyphenation
Justified text
with hyphenation

We,       therefore,      the
representatives of the United
States of America ...

  

We, therefore, the represen-
tatives of the United States
of America ...

The details of doing this properly are complex and language-dependent, and can interact with other orthographic and typesetting practices. Hyphenation algorithms, when employed in concert with dictionaries, are sufficient for all but the most formal texts.

It may be necessary to distinguish an incidental line-break hyphen from one integral to a word being mentioned (as when used in a dictionary) or present in an original text being quoted (when in a critical edition)—not only to control its word wrap behavior (which encoding handles with hard and soft hyphens having the same glyph) but also to differentiate appearance (with a different glyph). Webster's Third New International Dictionary[6] and the Chambers Dictionary[7] use a double hyphen for integral hyphens and a single hyphen for line-breaks, whereas Kromhout's Afrikaans–English dictionary uses the opposite convention.[8] The Concise Oxford Dictionary (fifth edition) repeated an integral hyphen at the start of the following line.[9]

Prefixes and suffixes

Prefixes (such as de-, pre-, re-, and non-[10]) and suffixes (such as -less, -like, -ness, and -hood) may or may not be hyphenated. (The unhyphenated style is also called closed up or solid.) A rule of thumb is that they are not hyphenated unless the lack of a hyphen hurts clarity—specifically, clarity at first glance rather than clarity upon a second look or a moment's pause. The clear–unclear distinction involves some subjectivity, because what is instantly clear to one reader may not be to another (depending on, for example, subject matter familiarity). Nonetheless, consensus among users of a language often reduces that subjectivity for many words. This is explained further below.

Many long-established words, such as disgusted, degrade, and refresh, do not require a hyphen because they are fully fused to the point that their first syllable is barely even thought about as having a prefix function. Many other words, such as prewashed or repainted, may not be quite so fully fused (the prefix function may be slightly more prominent in consciousness), but nonetheless they require no hyphen, because (1) most readers recognize the closed-up word as a familiar one and thus have no trouble parsing the syllables, and (2) if all such words were hyphenated, the many hyphens throughout the text would seem superfluous.

In contrast, for some other words, the closed-up style may not be as clear, and the hyphen can ensure clarity and avoid awkwardness, including odd appearance or misguided parsing of syllables. An example of avoiding misguided parsing would be to hyphenate the word co-worker (versus coworker) to prevent the reader's eye being caught automatically by the letter group cow (which might suggest cow (/k/) before backtracking and reparsing occurred). In such cases, styling varies depending on individual preference, regional preference, occupational specialty, or style guide preference, because the definition of awkwardness for any given word depends on who is judging it.

Words for which prefix hyphenation is least subjective, to the point that closed-up style is widely rejected, are of several classes. One such class consists of a few words that require a hyphen to distinguish them from other words that would otherwise be homographs, such as recreation (fun or sport) versus re-creation (the act of creating again), retreat (turn back) versus re-treat (give therapy again), and un-ionized (not in ion form) versus unionized (organized into trade unions). The other classes are those in which the prefix is applied to (1) a proper (capitalized) noun or adjective (un-American, de-Stalinisation);[11][12] (2) an acronym (anti-TNF antibody, non-SI units); or (3) a number (pre-1949 diplomacy, pre-1492 cartography).

Style guides codify rules to minimize inconsistency, the ultimate goal of which is to have the style unnoticed by the reader (that is, to avoid catching the reader's eye, either with trivial differences or with a lot of superfluous hyphens). The style guide rules allow exceptions to avoid awkwardness. For example, a guide will typically say to follow dictionary X's style for any word entered therein, and for words not entered, to close up by default and thus hyphenate only to avoid awkwardness. Such a rule successfully codifies almost all choices and thus leaves little to discretion except a few rare or neologistic words, which are safely hyphenated. This ensures high intradocument and inter-document consistency. Rules about avoiding doubled vowels or doubled consonants are often mentioned in style guides. These appropriately cascade only downstream, not upstream, of the "follow dictionary X" rule, because most dictionaries close up many well-established doubled-letter pairs. (For example, any style that follows Merriam-Webster's Collegiate Dictionary thus closes up preempt, reexamine, deemphasize, nonnegotiable, posttransfusion, and hundreds of others.) As mentioned earlier, the definition of "awkwardness" for any given word is inherently subjective but nonetheless also subject to consensus. For example, reexamine and deemphasize are accepted as nonawkward by a broad consensus; to prefer the hyphenated styling is a matter of opinion, but to insist that the solid styling is awkward would be considered pedantic by many educated readers. However, some doublings attract smaller majorities than others in such a consensus; with the co-worker/coworker example (mentioned earlier) or with antiinflammatory/anti-inflammatory, many readers may consider solid styling nonawkward whereas many others do not, and in such cases, dictionary styles vary (Dorland's, antiinflammatory; Merriam-Webster's Medical Dictionary, anti-inflammatory). Tripled letters rarely occur, but when they do, the hyphen is considered mandatory (thus shell-like, not shelllike).

There is a trend that words that once were hyphenated for clarity lose the hyphen as their familiarity grows. An excellent example is email/e-mail; the number of people who find email awkward dropped from the 1990s to the 2010s, and thus the hyphen has been dropped increasingly. For some instances, the consensus depends on occupational speciality or subspecialty. Although proto-oncogene is still hyphenated by most users (and by both Dorland's and Merriam-Webster's Medical), the solid styling (protooncogene) is gaining popularity, with oncologists and geneticists (for whom the term is most familiar) leading the way.

A hyphen can clarify that two adjacent vowels—whether two of the same letter (e.g., oo, ee) or two different letters (e.g., ae, ei)—are pronounced separately rather than being merged in a diphthong. The question is how necessary the clarification is. Thus, hyphenated de-escalate and co-operation have plenty of support, consensus-wise (plenty of users consider their hyphens as not superfluous), although solid deescalate and cooperation have plenty of support as well (plenty of users consider the hyphens superfluous). Consensus for styling varies by class, subclass, and even by individual word, with the common theme being that internal punctuation drops out of any combination judged as instantly recognizable enough in its context not to need it. As classes, there are doubling (namely, aa, ee, ii, oo, uu, yy) and nondoubling (for example, a+e, a+i, a+o; e+e, e+i, e+o). Several subclasses exist. There are combinations that are not rare in English as diphthongs and also not rare as nondiphthongs for users willing to style prefixed words solidly (such as ee and ei); regarding de+e/re+e/pre+e and de+i/re+i/pre+i, nearly everyone agrees that some fully fused examples (such as reiterate and reinforce) need no hyphen, but other examples have more evenly split pluralities (such as reexamine/re-examine or deemphasize/de-emphasize). There are combinations that are rare in English as diphthongs (for example, aa and ii) but not rare in prefixed words for those willing to style them solidly; and thus either they hardly need clarification within prefixed words (the solidification argument; thus intraarterial and antiinflammatory) or they need a hyphen to avoid looking like rare diphthongs, which are "odd-looking" because rare (the hyphenation argument, thus intra-arterial and anti-inflammatory).

A diaeresis can also sometimes be used, either to indicate non-diphthong status (e.g., coöperation and naïve) or to indicate non-silent terminal -e (e.g., Brontë), but there are several implicit boundaries on this style's use; it is now rare (its peak of popularity was in the late 19th and early 20th centuries), and it was never applied extensively across the language (only a handful of examples, including coöperation, naïve, and Brontë, are encountered with any appreciable frequency in English; for whatever reason, it never had any popularity in the de+e/re+e/pre+e or de+i/re+i/pre+i subclasses—thus seldom reëxamine, reïterate, deëmphasize, or others). Many users (and various dictionaries) consider the diaeresis optional in naive/naïve (on the basis that it is not necessary in order for the reader to recognize the word), and *na-ive draws attention to itself as a style that is never used.

Syllabification and spelling

Hyphens are occasionally used to denote syllabification, as in syl-la-bi-fi-ca-tion. Various British and North American dictionaries use an interpunct, sometimes called a "middle dot" or "hyphenation point", for this purpose, as in syl·la·bi·fi·ca·tion. This allows the hyphen to be reserved only for places where a hard hyphen is intended (for example, self-con·scious, un·self-con·scious, long-stand·ing). Similarly, hyphens may be used to indicate a word is being or should be spelled. For example, W-O-R-D spells "word".

In nineteenth-century American literature, hyphens were also used irregularly to divide syllables in words from indigenous North American languages, without regard for etymology or pronunciation:[13] e.g. "Shuh-shuh-gah" (from Ojibwe zhashagi, "blue heron") in The Song of Hiawatha.[14] This usage is now rare and proscribed, except in some place names such as Ah-gwah-ching.

Joining

Compound modifiers

Compound modifiers are groups of two or more words that jointly modify the meaning of another word. When a compound modifier other than an adverbadjective combination appears before a term, the compound modifier is often hyphenated to prevent misunderstanding, such as in American-football player or little-celebrated paintings. Without the hyphen, there is potential confusion about whether the writer means a "player of American football" or an "American player of football" and whether the writer means paintings that are "little celebrated" or "celebrated paintings" that are little.[15] Compound modifiers can extend to three or more words, as in ice-cream-flavored candy, and can be adverbial as well as adjectival (spine-tinglingly frightening). However, if the compound is a familiar one, it is usually unhyphenated. For example, some style guides prefer the construction high school students, to high-school students.[16][17] Although the expression is technically ambiguous ("students of a high school"/"school students who are high"), it would normally be formulated differently if other than the first meaning were intended. Noun–noun compound modifiers may also be written without a hyphen when no confusion is likely: grade point average and department store manager.[17]

When a compound modifier follows the term to which it applies, a hyphen is typically not used if the compound is a temporary compound. For example, "that gentleman is well respected", not "that gentleman is well-respected"; or "a patient-centered approach was used" but "the approach was patient centered."[18] But permanent compounds, found as headwords in dictionaries, are treated as invariable, so if they are hyphenated in the cited dictionary, the hyphenation will be used in both attributive and predicative positions. For example, "A cost-effective method was used" and "The method was cost-effective" (cost-effective is a permanent compound that is hyphenated as a headword in various dictionaries). When one of the parts of the modifier is a proper noun or a proper adjective, there is no hyphen (e.g., "a South American actor").[19]

When the first modifier in a compound is an adverb ending in -ly (e.g., "a poorly written novel"), various style guides advise no hyphen.[19] However, some do allow for this use. For example, The Economist Style Guide advises: "Adverbs do not need to be linked to participles or adjectives by hyphens in simple constructions ... Less common adverbs, including all those that end -ly, are less likely to need hyphens."[20] In the 19th century, it was common to hyphenate adverb–adjective modifiers with the adverb ending in -ly (e.g., "a craftily-constructed chair"). However, this has become rare. For example, wholly owned subsidiary and quickly moving vehicle are unambiguous, because the adverbs clearly modify the adjectives: "quickly" cannot modify "vehicle".

However, if an adverb can also function as an adjective, then a hyphen may be or should be used for clarity, depending on the style guide.[12] For example, the phrase more-important reasons ("reasons that are more important") is distinguished from more important reasons ("additional important reasons"), where more is an adjective. Similarly, more-beautiful scenery (with a mass-noun) is distinct from more beautiful scenery. (In contrast, the hyphen in "a more-important reason" is not necessary, because the syntax cannot be misinterpreted.) A few short and common words—such as well, ill, little, and much—attract special attention in this category.[20] The hyphen in "well-[past_participled] noun", such as in "well-differentiated cells", might reasonably be judged superfluous (the syntax is unlikely to be misinterpreted), yet plenty of style guides call for it. Because early has both adverbial and adjectival senses, its hyphenation can attract attention; some editors, due to comparison with advanced-stage disease and adult-onset disease, like the parallelism of early-stage disease and early-onset disease. Similarly, the hyphen in little-celebrated paintings clarifies that one is not speaking of little paintings.

Hyphens are usually used to connect numbers and words in modifying phrases. Such is the case when used to describe dimensional measurements of weight, size, and time, under the rationale that, like other compound modifiers, they take hyphens in attributive position (before the modified noun),[21] although not in predicative position (after the modified noun). This is applied whether numerals or words are used for the numbers. Thus 28-year-old woman and twenty-eight-year-old woman or 32-foot wingspan and thirty-two-foot wingspan, but the woman is 28 years old and a wingspan of 32 feet.[lower-alpha 1] However, with symbols for SI units (such as m or kg)—as opposed to the names of these units (such as metre or kilogram)—both the International Bureau of Weights and Measures and the U.S. National Institute of Standards and Technology recommend use without a hyphen: a 25 kg sphere. When the units are spelled out, this recommendation does not apply: a 25-kilogram sphere, a roll of 35-millimeter film.[22][23]

In spelled-out fractions, hyphens are usually used when the fraction is used as an adjective but not when it is used as a noun: thus two-thirds majority[lower-alpha 1] and one-eighth portion but I drank two-thirds of the bottle or I kept three-quarters of it for myself.[24] However, at least one major style guide[21] hyphenates spelled-out fractions invariably (whether adjective or noun).

In English, an en dash, , sometimes replaces the hyphen in hyphenated compounds if either of its constituent parts is already hyphenated or contains a space (for example, San Francisco–area residents, hormone receptor–positive cells, cell cycle–related factors, and public-school–private-school rivalries).[25] A commonly used alternative style is the hyphenated string (hormone-receptor-positive cells, cell-cycle-related factors). (For other aspects of en dash–versus–hyphen use, see Dash > En dash.)

Object–verbal-noun compounds

When an object is compounded with a verbal noun, such as egg-beater (a tool that beats eggs), the result is sometimes hyphenated. Some authors do this consistently, others only for disambiguation; in this case, egg-beater, egg beater, and eggbeater are all common.

An example of an ambiguous phrase appears in they stood near a group of alien lovers, which without a hyphen implies that they stood near a group of lovers who were aliens; they stood near a group of alien-lovers clarifies that they stood near a group of people who loved aliens, as "alien" can be either an adjective or a noun. On the other hand, in the phrase a hungry pizza-lover, the hyphen will often be omitted (a hungry pizza lover), as "pizza" cannot be an adjective and the phrase is therefore unambiguous.

Similarly, there's a man-eating shark in these waters is nearly the opposite of there's a man eating shark at table 6; the first is a shark, and the second a man. A government-monitoring program is a program that monitors the government, whereas a government monitoring program is a government program that monitors something else.

Personal names

Some married couples compose a new surname (sometimes referred to as a double-barrelled name) for their new family by combining their two surnames with a hyphen. Jane Doe and John Smith might become Jane and John Smith-Doe, or Doe-Smith, for instance. In some countries only the woman hyphenates her birth surname, appending her husband's surname.

With already-hyphenated names, some parts are typically dropped. For example, Aaron Johnson and Samantha Taylor-Wood became Aaron Taylor-Johnson and Sam Taylor-Johnson. Not all hyphenated surnames are the result of marriage. For example Julia Louis-Dreyfus is a descendant of Louis Lemlé Dreyfus whose son was Léopold Louis-Dreyfus.

Other compounds

Connecting hyphens are used in a large number of miscellaneous compounds, other than modifiers, such as in lily-of-the-valley, cock-a-hoop, clever-clever, tittle-tattle and orang-utan. Use is often dictated by convention rather than fixed rules, and hyphenation styles may vary between authors; for example, orang-utan is also written as orangutan or orang utan, and lily-of-the-valley may be hyphenated or not.

Suspended hyphens

A suspended hyphen (also called a suspensive hyphen or hanging hyphen, or less commonly a dangling or floating hyphen) may be used when a single base word is used with separate, consecutive, hyphenated words that are connected by "and", "or", or "to". For example, nineteenth-century and twentieth-century may be written as nineteenth- and twentieth-century. This usage is now common and specifically recommended in some style guides.[17] Suspended hyphens are also used, though less commonly, when the base word comes first, such as in "investor-owned and -operated". Uses such as "applied and sociolinguistics" (instead of "applied linguistics and sociolinguistics") are frowned upon; the Indiana University style guide uses this example and says "Do not 'take a shortcut' when the first expression is ordinarily open" (i.e., ordinarily two separate words).[17] This is different, however, from instances where prefixes that are normally closed up (styled solidly) are used suspensively. For example, preoperative and postoperative becomes pre- and postoperative (not pre- and post-operative) when suspended. Some editors prefer to avoid suspending such pairs, choosing instead to write out both words in full.[21]

Other uses

A hyphen may be used to connect groups of numbers, such as in dates (see below), telephone numbers or sports scores. It can also be used to indicate a range of values, although many styles prefer an en dash (see examples at Dash § En dash §§ Ranges of values).

The hyphen is sometimes used to hide letters in words (filleting for redaction or censoring), as in G-d, although an en dash can be used as well ("G–d").

The hyphen is often used in reduplicatives.

Varied meanings

Some stark examples of semantic changes caused by the placement of hyphens to mark attributive phrases:

  • Disease-causing poor nutrition is poor nutrition that causes disease.
  • Disease causing poor nutrition is a disease that causes poor nutrition.
  • A hard-working man is a man who works hard.
  • A hard working man is a working man who is tough.
  • A man-eating shark is a shark that eats humans.
  • A man eating shark is a man who is eating shark meat.
  • Three-hundred-year-old trees are an indeterminate number of trees that are each 300 years old.
  • Three hundred-year-old trees are three trees that are each 100 years old.
  • Three hundred year-old trees are 300 trees that are each a year old.

Origin and history

First page of the first volume: the epistle of St Jerome to Paulinus from the University of Texas copy. The page has 40 lines.

The first known documentation of the hyphen is in the grammatical works of Dionysius Thrax. At the time hyphenation was joining two words that would otherwise be read separately by a low tie mark between the two words.[26] In Greek these marks were known as enotikon, officially romanized as a hyphen.[27]

With the introduction of letter-spacing in the Middle Ages, the hyphen, still written beneath the text, reversed its meaning. Scribes used the mark to connect two words that had been incorrectly separated by a space. This era also saw the introduction of the marginal hyphen, for words broken across lines.[28]

The modern format of the hyphen originated with Johannes Gutenberg of Mainz, Germany, c.1455 with the publication of his 42-line Bible. His tools did not allow for a sublinear hyphen, and he thus moved it to the middle of the line.[29] Examination of an original copy on vellum (Hubay index #35) in the U. S. Library of Congress shows that Gutenberg's movable type was set justified in a uniform style, 42 equal lines per page. The Gutenberg printing press required words made up of individual letters of type to be held in place by a surrounding non-printing rigid frame. Gutenberg solved the problem of making each line the same length to fit the frame by inserting a hyphen as the last element at the right-side margin. This interrupted the letters in the last word, requiring the remaining letters be carried over to the start of the line below. His double hyphen appears throughout the Bible as a short, double line inclined to the right at a 60-degree angle:

In computing

Hyphen-minus

In the ASCII character encoding, the hyphen is encoded as character 45. This character is called the hyphen-minus, as it is also used as a minus sign and even sometimes for dashes. In Unicode, the hyphen-minus is encoded as U+002D - so that Unicode remains compatible with ASCII. Unicode also encodes the hyphen and minus separately, as U+2010 and U+2212 respectively, along with the em dash U+2014 , en dash U+2013 and other related characters: these are the forms used in professional type-setting. The hyphen-minus is a general-purpose character that attempts to fulfil several roles, and wherever optimal typography is desired, the formal hyphen, minus, or other symbol should be used instead. For example, compare 4+3−2=5 (minus) and 4+3-2=5 (hyphen-minus); in most fonts the hyphen-minus will not have the optimal width, thickness, or vertical position, whereas the minus character will.

However, the Unicode hyphen is inconvenient to enter on most keyboards, so the hyphen-minus character remains very common. It is often used instead of dashes or minus signs in situations where the preferred characters are unavailable (such as ASCII-only text), where the preferred characters take effort to enter (via dialog boxes or multi-key, unmemorable keyboard shortcuts), or when the writer is unaware of the distinction. Some writers use two hyphen-minuses (--) to represent an emdash in ASCII text.

The hyphen-minus character is also often used when specifying command-line options. The character is usually followed by one or more letters that indicate specific actions. Typically it is called a dash or switch in this context. Various implementations of the getopt function to parse command-line options additionally allow the use of two hyphen-minus characters, --, to specify long option names that are more descriptive than their single-letter equivalents. Another use of hyphens is that employed by programs written with pipelining in mind: a single hyphen may be recognized in lieu of a filename, with the hyphen then serving as an indicator that a standard stream, instead of a file, is to be worked with.

Hard and soft hyphens

Although software (hyphenation algorithms) can often automatically make decisions on when to hyphenate a word at a line break, it is also sometimes useful for the user to be able to insert cues for those decisions (which are dynamic in the online medium, given that text can be reflowed). For this purpose, the concept of a soft hyphen (discretionary hyphen, optional hyphen) was introduced, allowing such manual specification of a place where a hyphenated break is allowed but not forced. That is, it does not force a line break in an inconvenient place when the text is later reflowed.

In contrast, a hyphen that is always displayed and printed is called a hard hyphen (although some use this term to refer to a non-breaking hyphen; see below). Soft hyphens are inserted into the text at the positions where hyphenation may occur. It can be a tedious task to insert the soft hyphens by hand, and tools using hyphenation algorithms are available that do this automatically. Current modules of the Cascading Style Sheets (CSS) standard provide language-specific hyphenation dictionaries.

Nonbreaking hyphens

The nonbreaking hyphen, non-breaking hyphen, or no-break hyphen looks identical to the regular hyphen, but word processors treat it as a letter, namely that the hyphenated word will not be divided at the hyphen should this fall at what would be the end of a line of text; instead, the whole hyphenated word either will remain in full at the end of the line or will go in full to the beginning of the next line. The non-breaking space exists for similar reasons.

The word segmentation rules of most text systems consider a hyphen to be a word boundary and a valid point at which to break a line when flowing text. However, this is not always desirable behavior, especially when it could lead to ambiguity (such as in the examples given before, where recreation and re‑creation would be indistinguishable), or in languages other than English (e.g., a line break at the hyphen in Irish an t‑athair or Romanian s‑a would be undesirable). In Unicode it is defined as U+2011 NON-BREAKING HYPHEN (HTML ‑).

Usage in date notation

In parts of Europe, the hyphen is used to delineate parts within a written date. Germans and Slavs also used Roman numerals for the month; 14‑VII‑1789 (14 July 1789), for example, is one way of writing the first Bastille Day, though this usage is rapidly falling out of favour. Plaques on the wall of the Moscow Kremlin are written this way. Use of hyphens, as opposed to the slashes used in the English language, is specified for international standards.

International standard ISO 8601, which was accepted as European Standard EN 28601 and incorporated into various typographic style guides (e.g., DIN 5008 in Germany), brought about a new standard using the hyphen. Now all official European governmental documents use this. These norms prescribe writing dates using hyphens: 1789-07-14 is the new way of writing the first Bastille Day. This is also the typical date format used in large parts of Eastern Europe and Asia, although sometimes with other separators than the hyphen.

This method has gained influence within North America, as most common computer filesystems make the use of slashes difficult or impossible. DOS, OS/2 and Windows simultaneously support both \ and / as directory separators, but / is also used to introduce and separate switches to shell commands (unless reconfigured to use the hyphen-minus in DOS). Unix-like systems use / as a directory separator and, while \ is legal in filenames, it is awkward to use as the shell uses it as an escape character. Unix also uses a space followed by a hyphen to introduce switches. Apart from the separator used the non-year form of the date format is also identical to the standard American representation.

Unicode

Apart from dash and minus sign, Unicode has multiple hyphen characters:[30]

  • U+002D - HYPHEN-MINUS (HTML -), a character of multiple uses
  • U+00AD SOFT HYPHEN (HTML ­ · ­) (see note)
  • U+2010 HYPHEN (HTML ‐ · ‐, ‐)
  • U+2011 NON-BREAKING HYPHEN (HTML ‑)

Note: The SOFT HYPHEN serves as an invisible marker used to specify a place in text where a hyphenated break is allowed without forcing a line break in an inconvenient place if the text is re-flowed. It becomes visible only after word wrapping at the end of a line.

And in non-Latin scripts:[30]

  • U+058A ֊ ARMENIAN HYPHEN (HTML ֊)
  • U+1806 MONGOLIAN TODO SOFT HYPHEN (HTML ᠆)
  • U+1B60 BALINESE PAMENENG (HTML ᭠) (used only as a line-breaking hyphen)
  • U+2E17 DOUBLE OBLIQUE HYPHEN (HTML ⸗) (used in ancient Near-Eastern linguistics and in blackletter typefaces)
  • U+30FB KATAKANA MIDDLE DOT (HTML ・) (has the Unicode property of "Hyphen" despite its name)
  • U+FE63 SMALL HYPHEN-MINUS (HTML ﹣) (compatibility character for a small hyphen-minus, used in East Asian typography)
  • U+FF0D FULLWIDTH HYPHEN-MINUS (HTML -) (compatibility character for a wide hyphen-minus, used in East Asian typography)
  • U+FF65 HALFWIDTH KATAKANA MIDDLE DOT (HTML ・) (compatibility character for a wide katakana middle dot, has the Unicode property of "Hyphen" despite its name)

Unicode distinguishes the hyphen from the general interpunct. The characters below do not have the Unicode property of "Hyphen" despite their names:[30]

  • U+1400 CANADIAN SYLLABICS HYPHEN (HTML ᐀)
  • U+2027 HYPHENATION POINT (HTML ‧)
  • U+2043 HYPHEN BULLET (HTML ⁃ · ⁃)
  • U+2E1A HYPHEN WITH DIAERESIS (HTML ⸚)
  • U+2E40 DOUBLE HYPHEN (HTML ⹀)
  • U+30A0 KATAKANA-HIRAGANA DOUBLE HYPHEN (HTML ゠)
  • U+10EAD 𐺭 YEZIDI HYPHENATION MARK (HTML 𐺭)

(See interpunct and bullet (typography) for more round characters.)

gollark: Bees.
gollark: I think I can eliminate a match that way.
gollark: Oh yes, that might exist, mightn't it.
gollark: But if I split it into multiple functions, it would not actually work.
gollark: Look, it has pattern matching in it, therefore good.

See also

Notes

  1. With numbers greater than two, where a plural noun would normally be used in an unhyphenated predicative position, the singular form of the noun is generally used in the hyphenated form used attributively. Thus a woman who is 28 years old becomes a 28-year-old woman. There are occasional exceptions to this general rule, for instance with fractions (a two-thirds majority) and irregular plurals (a two-criteria review, a two-teeth bridge).

References

  1. "Hyphen Definition". dictionary.com. Retrieved 18 June 2015.
  2. ὑφέν. Liddell, Henry George; Scott, Robert; A Greek–English Lexicon at the Perseus Project.
  3. Harper, Douglas. "hyphen". Online Etymology Dictionary.
  4. Wroe, Ann, ed. (2015). The Economist Style Guide (11th ed.). London / New York: Profile Books / PublicAffairs. p. 74. hyphens  There is no firm rule to help you decide which words are run together, hyphenated or left separate.
  5. "Small object of grammatical desire". BBC News. London: British Broadcasting Corporation. 20 September 2007..
  6. Gove, Philip Babcock (1993). Webster's Third New International Dictionary of the English Language, Unabridged. Merriam-Webster. p. 14a, § 1.6.1. ISBN 978-0-87779-201-7. Retrieved 28 November 2014.
  7. Chambers, Allied (2006). The Chambers Dictionary. Allied Publishers. p. xxxviii, § 8. ISBN 978-8186062258. Retrieved 28 November 2014.
  8. Kromhout, Jan (2001). Afrikaans–English, English–Afrikaans Dictionary. Hippocrene Books. p. 182, § 5. ISBN 978-0-7818-0846-0. Retrieved 28 November 2014.
  9. Hartmann, R. Rf. K. (1986). The History of Lexicography: Papers from the Dictionary Research Centre Seminar at Exeter, March 1986. John Benjamins Publishing. p. 9. ISBN 978-9027245236.
  10. A fairly comprehensive list, although not exhaustive, is given at Prefix > List of English derivational prefixes.
  11. "Hyphenated Words: A Guide", The Grammar Curmudgeon, City slide.
  12. "Hyphens", Punctuation, Grammar book.
  13. Liberman, Mark. "American Indian Hyphens". Language Log.
  14. Longfellow, Henry Wadsworth. The Song of Hiawatha.
  15. Gary Blake and Robert W. Bly, The Elements of Technical Writing, p. 48. New York: Macmillan Publishers, 1993. ISBN 0020130856
  16. E.g. "H". Bloomberg School Style Manual. Johns Hopkins Bloomberg School of Public Health. Retrieved 9 March 2019.
  17. E.g. "H". The IU editorial style guide. Indiana University. Retrieved 9 March 2019.
  18. Davis, John (30 November 2004). "Using Hyphens in Compound Adjectives (and Exceptions to the Rule)" (Grammar tip). UHV. Archived from the original on 9 January 2010. Retrieved 5 January 2010.
  19. "Hyphenated Compound Words". englishplus.com. Retrieved 18 November 2014.
  20. Wroe, Ann, ed. (2015). The Economist Style Guide (11th ed.). London / New York: Profile Books / PublicAffairs. pp. 77–78. hyphens   ... 12. Adverbs: Adverbs do not need to be linked to participles or adjectives by hyphens in simple constructions [examples elided]. But if the adverb is one of two words together being used adjectivally, a hyphen may be needed [examples elided]. The hyphen is especially likely to be needed if the adverb is short and common, such as ill, little, much and well. Less common adverbs, including all those that end -ly, are less likely to need hyphens [example elided].
  21. Iverson, Cheryl, et al. (eds) (2007). "8.3.1". AMA Manual of Style (10th ed.). Oxford, Oxfordshire: Oxford University Press. ISBN 978-0-19-517633-9.CS1 maint: extra text: authors list (link)
  22. "The International System of Units (SI)", Bureau International des Poids et Mesures, 2006
  23. "Guide for the Use of the International System of Units (SI)", NIST Special Publication 811, National Institute of Standards and Technology, March 2008
  24. American Psychological Association (APA) (2010), The Publication Manual of the American Psychological Association (6th ed.), Washington, DC: American Psychological Association, ISBN 978-1-4338-0562-2.
  25. Gary Lutz; Diane Stevenson (2005). The Writer's Digest grammar desk reference. Writer's Digest Books. p. 296. ISBN 978-1-58297-335-7.
  26. Nicolas, Nick. "Greek Unicode Issues: Punctuation Archived 6 August 2012 at Archive.today". 2005. Accessed 7 October 2014.
  27. Ελληνικός Οργανισμός Τυποποίησης [Ellīnikós Organismós Typopoíīsīs, "Hellenic Organization for Standardization"]. ΕΛΟΤ 743, 2η Έκδοση [ELOT 743, 2ī Ekdosī, "ELOT 743, 2nd ed."]. ELOT (Athens), 2001. (in Greek)
  28. Keith Houston (2013). Shady Characters: The Secret Life of Punctuation, Symbols, and Other Typographical Marks. W.W. Norton & Company. p. 121. ISBN 978-0-393-06442-1.
  29. Keith Houston (2013). Shady Characters: The Secret Life of Punctuation, Symbols, and Other Typographical Marks. W.W. Norton & Company. p. 132. ISBN 978-0-393-06442-1.
  30. "Unicode 13.0 UCD: PropList.txt". 27 November 2019. Retrieved 17 March 2020.
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.