Chinese Text Project

Chinese Text Project
Traditional Chinese	中國哲學書電子化計劃
Simplified Chinese	中国哲学书电子化计划
Literal meaning	The Chinese Philosophical Book Digitization Project
Transcriptions
Standard Mandarin
Hanyu Pinyin	Zhōngguó zhéxué shū diànzǐhuà jìhuà

Chinese Text Project
Type of site	Digital library
Available in	English and Chinese
Owner	Donald Sturgeon
Created by	Donald Sturgeon
URL	ctext.org
Alexa rank	38,208 (November 2016)[1]
Commercial	No
Registration	required to contribute
Launched	2006
Current status	active

The Chinese Text Project (CTP; Chinese: 中國哲學書電子化計劃) is a digital library project that assembles collections of early Chinese texts. The name of the project in Chinese literally means "The Chinese Philosophical Book Digitization Project", showing its focus on books related to Chinese philosophy. It aims at providing accessible and accurate versions of a wide range of texts,[2] particularly those relating to Chinese philosophy, and the site is credited with providing one of the most comprehensive and accurate collections of classical Chinese texts on the Internet,[3][4] as well as being one of the most useful textual databases for scholars of early Chinese texts.[5][6]

Site contents

Texts are divided into pre-Qin and Han texts, and post-Han texts, with the former categorized by school of thought and the latter by dynasty. The ancient (pre-Qin and Han) section of the database contains over 5 million Chinese characters, the post-Han database over 20 million characters, and the publicly editable wiki section over 5 billion characters.[7] Many texts also have English and Chinese translations, which are paired with the original text paragraph by paragraph as well as phrase by phrase for ease of comparison; this makes it possible for the system to be used as a useful scholarly research tool even by students with little or no knowledge of Chinese.[8]

As well as providing customized search functionality suited to Chinese texts,[9][10] the site also attempts to make use of the unique format of the web to offer a range of features relevant to sinologists, including an integrated dictionary, word lists, parallel passage information,[11] scanned source texts, concordance and index data,[12] a metadata system, Chinese commentary display,[13] a published resources database, and a discussion forum in which threads can be linked to specific data on the site.[14][15] The "Library" section of the site also includes scanned copies of over 25 million pages of early Chinese texts,[16][7] linked line by line to transcriptions in the full-text database, many creating using Optical Character Recognition,[17] and edited and maintained using an online crowd-sourcing wiki system.[18][19] Textual data and metadata can also be exported using an Application Programming Interface, allowing integration with other online tools as well as use in text mining and digital humanities projects.[18][20]

gollark: Protocol Sigma.

gollark: There will be no negotiation.

gollark: If my pancreas betrays me, I will have it *punished*.

gollark: Ħ.

gollark: Doesn't it use x and y the wrong way round for some stupid reason?

References

"Ctext.org Site Info". Alexa Internet. Retrieved 2016-11-07.
Elman, Benjamin A. "Classical Historiography for Chinese History: Databases & electronic texts". Princeton University. Archived from the original on August 5, 2016. Retrieved June 3, 2016.
"Association of Chinese Philosophers in North America (北美中国哲学学者协会)". Archived from the original on 2010-12-25. Retrieved 2010-09-25.
Chris Fraser, Department of Philosophy, University of Hong Kong
http://warpweftandway.com/support-the-chinese-text-project/
http://languagehat.com/chinese-text-project/
http://ctext.org/system-statistics
Connolly, Tim (2012). "Learning Chinese Philosophy with Commentaries". Teaching Philosophy. Philosophy Documentation Center. 35 (1): 1–18. doi:10.5840/teachphil20123511.
http://ctext.org/instructions/advanced-search
http://ctext.org/faq/normalization
Sturgeon, Donald (2017). "Unsupervised identification of text reuse in early Chinese literature". Digital Scholarship in the Humanities. Oxford University Press. Retrieved November 21, 2017.
Xu, Jiajin (2015). "Corpus-based Chinese studies: A historical review from the 1920s to the present". Chinese Language and Discourse. John Benjamins Publishing Company. 6 (2): 218–244. doi:10.1075/cld.6.2.06xu.
Adkins, Martha A. (2016). "Web Review: Online Resources for the Study of Chinese Religion and Philosophy". Theological Librarianship. American Theological Library Association. 9 (2): 5–8. doi:10.31046/tl.v9i2.435.
Holger Schneider and Jeff Tharsen, http://dissertationreviews.org/archives/9213
http://ctext.org/introduction
http://ctext.org/library.pl?if=en
Sturgeon, Donald (2017). Unsupervised Extraction of Training Data for Pre-Modern Chinese OCR. The Thirtieth International Flairs Conference. AAAI. Retrieved November 21, 2017.
https://cpianalysis.org/2016/06/08/crowdsourcing-apis-and-a-digital-library-of-chinese/ Archived 2017-03-20 at the Wayback Machine, China Policy Institute, University of Nottingham
http://ctext.org/instructions/ocr
http://ctext.org/tools/api

External links

Chinese Text Project
中國哲學書電子化計劃 (in Chinese)
Chinese Text Project at Douban (in Chinese)

This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.

[alexa-1] "Ctext.org Site Info". Alexa Internet. Retrieved 2016-11-07.

[2] Elman, Benjamin A. "Classical Historiography for Chinese History: Databases & electronic texts". Princeton University. Archived from the original on August 5, 2016. Retrieved June 3, 2016.

[3] "Association of Chinese Philosophers in North America (北美中国哲学学者协会)". Archived from the original on 2010-12-25. Retrieved 2010-09-25.

[4] Chris Fraser, Department of Philosophy, University of Hong Kong

[5] ttp://warpweftandway.com/support-the-chinese-text-project/

[6] ttp://languagehat.com/chinese-text-project/

[ctext.org-7] ttp://ctext.org/system-statistics

[8] Connolly, Tim (2012). "Learning Chinese Philosophy with Commentaries". Teaching Philosophy. Philosophy Documentation Center. 35 (1): 1–18. doi:10.5840/teachphil20123511.

[9] ttp://ctext.org/instructions/advanced-search

[10] ttp://ctext.org/faq/normalization

[11] Sturgeon, Donald (2017). "Unsupervised identification of text reuse in early Chinese literature". Digital Scholarship in the Humanities. Oxford University Press. Retrieved November 21, 2017.

[12] Xu, Jiajin (2015). "Corpus-based Chinese studies: A historical review from the 1920s to the present". Chinese Language and Discourse. John Benjamins Publishing Company. 6 (2): 218–244. doi:10.1075/cld.6.2.06xu.

[13] Adkins, Martha A. (2016). "Web Review: Online Resources for the Study of Chinese Religion and Philosophy". Theological Librarianship. American Theological Library Association. 9 (2): 5–8. doi:10.31046/tl.v9i2.435.

[14] Holger Schneider and Jeff Tharsen, http://dissertationreviews.org/archives/9213

[15] ttp://ctext.org/introduction

[16] ttp://ctext.org/library.pl?if=en

[17] Sturgeon, Donald (2017). Unsupervised Extraction of Training Data for Pre-Modern Chinese OCR. The Thirtieth International Flairs Conference. AAAI. Retrieved November 21, 2017.

[cpi-18] ttps://cpianalysis.org/2016/06/08/crowdsourcing-apis-and-a-digital-library-of-chinese/ Archived 2017-03-20 at the Wayback Machine, China Policy Institute, University of Nottingham

[19] ttp://ctext.org/instructions/ocr

[20] ttp://ctext.org/tools/api