DEC Activities and Works

Programme based

Postgraduate

Research Degrees

Research Projects

Treebank Visualization Tool

A Treebank is a database of sentences annotated with syntactic information showing how different elements in a sentence relate to one another. An online treebank visualisation tool was created to enable users without linguistics or computing background to understand texts in a graphical way and find patterns on word usage which might not be identifiable through normal search engines.

Award Student Supervisor
CLASS DEC Competition 2014-15
Postgraduate Champion (Individual)

YEUNG, Chak Yan

Dr LEE, John

Publications

Translation Crowdsourcing and the Transmutation of Translator’s Subjectivity

网络众包翻译模式与译者主体性的嬗变

Abstract: Crowdsourcing translation emerged with the internet era, and it distinguishes itself from the established translation industry. The subjectivity of the internet crowdsourcing translators is fundamentally different from their off-line, traditional counterparts, because the nature of crowdsourcing enables the crowdsourcers to liberate translation from the hands of the few “elite translators”. Thus, translation becomes an open and shared social resource. Crowdsourcing translation relies largely on the power of the crowd to discover the best way of translating, as well as to reveal the meaning of the original. Meaning, during the process of translation, flows through different individual translating subjects and unites the latter. In this article, through further discussion on the subjectivity of crowdsourcing translators, as well as the organizational pattern of several crowdsourcing translation cases, the following conclusion is achieved: crowdsourcing translation, aside from promoting the subjectivity of individual translators, reveals a collective subjectivity on a higher level. Such collective subjectivity, vigorous and powerful, would keep growing stronger rather than being suppressed by the established translation industry.

摘要:网络众包翻译是网络时代翻译的一个重要现象,和传统翻译有着明显区别,前者的译者主体性与后者相比发生了根本变化。众包翻译译者的主体性特点决定了译者及其翻译模式已经从传统的翻译权力体系中解放了出来,翻译不再垄断在社会“精英”手中,而成为社会共享资源。众包翻译是靠主体间的力量来揭示文本意义的最佳途径和方式,而意义本身也最适合在主体间传递,并在传递过程中把众主体连结起来,形成一个意义世界。本文认为众包翻译体现了集体主体性,具有强大的生命力,不但不会轻易地被传统翻译权力系统收编,而且会顽强地生存下去。

Published in:

East Journal of Translation, (2015), Vol.1, p.31-34, 91.
(东方翻译,(2015), 第一期, 31-34頁, 91)

External link:

http://www.ejtrans.com/mulu/33/3301.aspx?id=5

Internet Crowd Participation in Translation: History, Present, and Future

互联网大众翻译模式微探:历史、现时、未来

Abstract: The development of the internet and computer technology has brought about the democratization of information. More and more grass-root users start to initiate and participate in translation activities, and attracted public attention. The development of China’s internet translation communities reveals a pattern. That is, the translation communities started from non-profit grass-root initiation, gradually become organized, patent-protected, and even for-profit productions. “User-generated translation” and “crowdsourcing translation” are the theoretical frameworks that can explain these phenomena. This article traces the development of China’s internet translation communities in the past ten years, proposes the cultural and market value of such translation mode, and points out possible future challenges for the non-professional online translators. As for the non-professional translation communities, they have the following characters: firstly, they are carried out by grass-root internet users. Secondly, some of the communities survived through their conflict with the dominant discourse, while some failed to. Thirdly, crowdsourcing translation is more efficient when translating multimodal texts than the plain texts. All these characters provide shed a light on the new direction for the translation industry in China.

摘要:随着互联网的普及,计算机技术的发展,以及信息流动的民主化,由网民集体发起和参与的翻译现象变得越来越普遍,越来越受到广泛的关注。在中国的网络翻译社区的发展中,我们可以明显地观察到这样的发展脉络, 即, 从网民集体自发的翻译行为到有规律、有组织、有版权保护、甚至有盈利的翻译活动。本文将这种现象纳入“用户生成翻译”(user-generated translation)和“众包翻译”(crowdsourcing translation)的理论框架,通过追溯中国各类互联网翻译社区近十年来的发展,总结出中国众包翻译社区的文化创新意义及未来的市场价值,并指出其可能面对的挑战。本文认为,互联大众翻译社区存在着以下特点:1.翻译过程主要由网络草根用户集体进行;2.在与主流与权威翻译话语体系的对话中,一部分众包翻译社区遭到淘汰,另一部分得以存活并且壮大起来;3.该模式在翻译多模态文本时呈现出比翻译纯文字文本更大的优势,这为中国翻译行业的发展指出了新方向。

Published in:

Chinese Translators Journal, (2015), Vol. 5, p.78-82
(中国翻译,(2015), 第五期, 78-82頁)

External link:

http://www.tac-online.org.cn/ch/tran/2015-09/10/content_8225656.htm

Crowdsourcing Translation in Contemporary China: An Inspiring Perspective of Translation in the Web 2.0 Age

Abstract: With the popularization of internet, the development of computer science, and the democratization of information, internet crowdsourcing and crowdsourcing translation has become increasingly popular. Crowdsourcing translation is broadly applied in a variety of online communities, including: social networks, informative websites, movie and animation fansubbing groups, humanitarian organizations (language-learning websites, and so on. As crowdsourcing translation has accumulated attention from the professional circle, it also aroused controversial debates on copy-right issues, as well as quality problems and evaluation standards. To a large extent, crowdsourcing translation shares the main features with internet crowdsourcing: voluntary participation, crowdsourcers’ multiple identities, and in most cases, translation for free.

This research aims at investigating the history and analyzing the features of crowdsourcing translation in nowadays China, so as to expound how the internet has democratized the production and consumption of translation, how the subjects involved in crowdsourcing translation are dynamically interrelated to each other, and how the concepts and culture of translation have changed with the decentralization of the translation power structure.

To start with, China is a monolingual country without neither a mature economic system nor a strictly regulated translation industry. Thus although crowdsourcing translation has existed for about one decade, and has immensely developed in scale and popularity, the crowdsourcing spirit stayed the same: non-profit, personal goals prevail money. Also, in Chinese crowdsourcing communities, the aim for translation is majorly introducing in foreign cultural products. And in the most cases, the source language is English, the largest second language of Chinese people since the opening up policy, an indication of cultural hegemony. Further, in China, a country exerts a harsh media censorship, the utopian-like crowdsourcing communities at their early stage are in fact, protesting discourses against the official ideology.

Published in:

Meta: Journal des traducteurs/Meta: Translators’ Journal, (2015), Vol.60, No. 2, p.316, doi: 10.7202/1032867ar

External link:

http://www.erudit.org/revue/meta/2015/v60/n2/...

An Exploration of Chinese News Audio-visual-oral Teaching Mode with an Open Vision

Abstract:In teaching Chinese as a second language, the news video is rarely applied, which may cause the situation that the students' listening comprehension ability can't match the requirement. Hereby, the present study applies the Chinese news audio-visual-oral teaching mode to solve this problem. This paper firstly introduces the foundation of this mode, including the theoretical bases of audio-visual-oral mode, and the current situation of the teaching materials of Chinese News Listening and Chinese Audio-visual-oral class. Then an example is provided to show how to design the class in this mode, including instructional objectives, the guides for choosing news topics, processing the news material, compiling the exercises and the teaching arrangement. Finally, a suggestion is put forward, namely, to build an open corpus, where every Chinese teacher can share his/her processed news materials with others.

中文摘要:在對外漢語教學中,新聞視頻素材很少被挖掘,這也導致學生的聽力水準與能力要求的水準之間有一定差距,所以我們開展了“漢語新聞視聽說”教學模式的探索。本文首先介紹了這種教學模式的基礎,包括“視”“聽”“說”一體化的理論基礎、新聞聽力方面和視聽說方面對外漢語教材的現狀;其次給出了該教學模式的教學目標和設計思路,包括選擇新聞話題的標準、新聞素材的加工要求、編寫練習的建議和教學課時安排;最後根據該教學模式的特點提出了教學資源分享的構想,即興建開放性語料庫,同行參與、素材共用的建設模式。

Published in:

TCSOL Studies, (2014), Vol 4, p.57-86
(華文教學與研究, (2014), 第四期, 57-86頁)

External link:

http://ronline.ro.cityu.edu.hk/scholar/download.do?fileUuid=...

Cue Competition between Animacy and Word Order: Acquisition of Chinese Notional Passives by L2 Learners

(with Xu, C)

ABSTRACT: Based on the Competition Model (Bates & MacWhinney, 1978; MacWhinney, 2005; MacWhinney, 2012), the present study investigates L2 cue strategies in the acquisition of Chinese notional passives by English-speaking and Japanese-speaking learners. Two experiments were conducted to examine both the comprehension and production of Chinese notional passives. The main findings included: 1) L2 learners’ acceptability of notional passive increased with improved Chinese proficiency but even advanced learners showed significant difference from Chinese native speakers; 2) L2 learners produced more notional passive sentences than bei-passive sentences and advanced learners showed no difference from Chinese native speakers; 3) Cross-linguistic influence seemed to affect L2 learners’ comprehension and production of Chinese notional passives. The results support the universality of animacy cue proposed by Gass (1987) but also suggest that word order and pragmatic factors may affect L2 learners’ cue strategies. The study also evidences the contribution of the input to the development of L2 cue strategies, which is in line with the predictions of the Competition Model.

Published in:

Open Journal of Modern Linguistics, (2015) Vol. 5 No. 2, p.213-224, doi: 10.4236/ojml.2015.52017

External link:

http://www.scirp.org/journal/PaperInformation.aspx?paperID=56060

Corpus-Based Learning of Cantonese for Mandarin Speakers

(with LEE John)

Abstract. This paper reports our experience in using a parallel corpus to teach Cantonese, a variety of Chinese spoken in Hong Kong, as a second language. The parallel corpus consists of pairs of word-aligned sentences in Cantonese and Mandarin Chinese, drawn from television programs in Hong Kong (Lee, 2011). We evaluated our pedagogical approach with Mandarin-speaking students at a university course. For each student, we first diagnosed the set of Cantonese words with which s/he experienced difficulties. Then, on a web-based interface, the student independently searched in the parallel corpus for sentence pairs involving this set of Cantonese words, and analysed the translations and usage examples. Our experiments showed that, in both the short- and long-term, the corpus-based pedagogical method helped students better retain their knowledge of difficult Cantonese words.

Published in:

Proceedings of the 2014 EUROCALL Conference - CALL Design: Principles and Practice,(2014), p.196-201, doi: 10.14705/rpnet.2014.000217

External link:

http://reference.research-publishing.net/display_article.php?...

On the Development of Sentence Final Particles (and Utterance tags) in Chinese

(with Foong Ha Yap and Ying Yang)

Abstract: This paper holds the opinion that although lacking unique markers for counterfactuals. Chinese indeed has counterfactual thoughts as well as expressions. Based on the definition of counterfactual conditionals, this paper lists three different types of counterfactual conditionals in Chinese for analysis of the relationship between the past tense and counterfactuality. Besides, samples from other languages are also collected to discuss the relationship between the temporal markers and counterfactuality. Based on the above analysis, we conclude that Chinese belongs to the past-tense- counterfactuality language although the tense is not fake. Past tense cannot be grammaticalized into counterfactual markers because of the isolating characteristics of Chinese.

Published by:

Discourse Functions at the Left and Right Periphery: Crosslinguistic Investigations of Language Use and Language Change, (2014), p.179−220

External link:

http://www.brill.com/cn/products/book/discourse-functions-left-...

Automatic Detection of Comma Splices

In English text, independent clauses should be demarcated with full-stops (periods), or linked together with conjunctions. Non-native speakers are often prone to linking them improperly with commas instead of conjunctions, producing comma splices. This paper describes a method to detect comma splices using Conditional Random Fields (CRF), with features derived from parse tree patterns. In experiments, our model achieved an average of 0.91 precision and 0.28 recall in detecting comma splices, significantly outperforming both a baseline model using only local features and a widely used commercial grammar checker.

Published in:

Proceedings of the 28th Pacific Asia Conference on Language, Information and Computation, (2014), p.551-560

External link:

http://www.aclweb.org/anthology/Y14-1063

CityU Corpus of Essay Drafts of English Language Learners: a Corpus of Textual Revision in Second Language Writing

Learner corpora consist of texts produced by non-native speakers. In addition to these texts, some learner corpora also contain error annotations, which can reveal common errors made by language learners, and provide training material for automatic error correction. We present a novel type of error-annotated learner corpus containing sequences of revised essay drafts written by non-native speakers of English. Sentences in these drafts are annotated with comments by language tutors, and are aligned to sentences in subsequent drafts. We describe the compilation process of our corpus, present its encoding in TEI XML, and report agreement levels on the error annotations. Further, we demonstrate the potential of the corpus to facilitate research on textual revision in L2 writing, by conducting a case study on verb tenses using ANNIS, a corpus search and visualization platform.

Published by:

Language Resources and Evaluation, (2015), Vol. 49, No. 3, p.659-683, doi: 10.1007/s10579-015-9301-z

External link:

http://link.springer.com/article/10.1007%2Fs10579-015-9301-z

The Typological Stage of Morphology in Chinese Counterfactuals

Abstract: Counterfactuality is used to express a counter-to-factual opinion, here the factual which is the truth held by the speaker does not always coincide with the objective world. Based on cross-linguistic research, we adduce many examples in favor of the opinion that Chinese is not a language based on CF ( Counterfactuality) markers but HE ( Hypotheticality Enhancing) markers. HE markers differ from CF markers in that they could not mark CF exclusively, independently and obligatorily.

Published in:

外國語(上海外國語大學學報), (2015), 第三十八卷一期, 三十至四十一頁

External link:

http://big5.oversea.cnki.net/kcms...

A Typological Research Towards Counterfactual Conditionals

Abstract: This paper holds the opinion that although lacking unique markers for counterfactuals. Chinese indeed has counterfactual thoughts as well as expressions. Based on the definition of counterfactual conditionals, this paper lists three different types of counterfactual conditionals in Chinese for analysis of the relationship between the past tense and counterfactuality. Besides, samples from other languages are also collected to discuss the relationship between the temporal markers and counterfactuality. Based on the above analysis, we conclude that Chinese belongs to the past-tense- counterfactuality language although the tense is not fake. Past tense cannot be grammaticalized into counterfactual markers because of the isolating characteristics of Chinese.

Published in:

外國語(上海外國語大學學報), (2014), 第三十七卷三期, 五十九至七十頁

External link:

http://big5.oversea.cnki.net/kcms...

 

Disclaimer

To make use of any material hosted at this page, you must give appropriate credit, provide a link to the source of origin, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the copyright owners endorses you or your use.

You may not use the material for commercial purposes.

 

Last updated: 15 December 2015