The Handbook of Austroasiatic Languages (2 Vols.) 9004283579, 9789004283572

The Handbook of the Austroasiatic Languages is the first comprehensive reference work on this important language family

1,466 116 12MB

English Pages 1358 [1359] Year 2014

Report DMCA / Copyright

DOWNLOAD FILE

Polecaj historie

Handbook of Metaphysics and Ontology (2 vols) [1-2] 9783884050804

282 60 29MB Read more

Handbook of Metaphysics and Ontology (2 Vols) 388405080X, 9783884050804

Book by

392 56 29MB Read more

Handbook of Amazonian Languages: Volume 2 Handbook Amazonian Languages 9783110860382, 9783110114959

249 57 102MB Read more

Chess player, Vols. 2

730 120 17MB Read more

Handbook on Computer Learning and Intelligence [2 vols., 2 ed.] 9811245142, 9789811245145

The Handbook on Computer Learning and Intelligence is a second edition which aims to be a one-stop-shop for the various

307 13 90MB Read more

The Principles of Psychology, Vols. 1 & 2 0674706250, 9780674706255

The publication of William James's acknowledged masterpiece marked a turning point in the development of psychology

507 21 19MB Read more

Handbook of Amazonian Languages: Volume 4 HANDBOOK AMAZONIAN LANGUAGES 9783110822120, 9783110149913

252 33 146MB Read more

Handbook of Amazonian Languages: Volume 1 HANDBOOK AMAZONIAN LANGUAGES 9783110850819, 9783110102574

206 101 131MB Read more

Word-Formation. Volume 2 Word-Formation: An International Handbook of the Languages of Europe 9783110246278, 9783110246261

This handbook comprises an in-depth presentation of the state of the art in word-formation. The five volumes contain 207

344 79 3MB Read more

Handbook of Australian Languages: Volume 3 0708112153

1,083 37 30MB Read more

The Handbook of Austroasiatic Languages (2 Vols.)
9004283579, 9789004283572

Author / Uploaded
Paul Sidwell
Mathias Jenny

Table of contents :
Contents
Acknowledgements
List of Maps
Abbreviations
Notes on Contributors
Volume 1
Part 1 General Chapters
Chapter 1 Editors’ Introduction
Chapter 2 The Austroasiatic Languages: A Typological Overview
Chapter 3 Austroasiatic Classification
Chapter 4 Austroasiatic Comparative-Historical Reconstruction: An Overview
Chapter 5 Overview of the Munda Languages
Part 2 Grammar Sketches
Section 1 Aslian
Chapter 1 Northern Aslian
Chapter 2 Semaq Beri
Section 2 Monic
Chapter 3 Old Mon
Chapter 4 Modern Mon
Section 3 Pearic
Chapter 5 Chong
Section 4 Khmeric
Chapter 6 Old Khmer
Chapter 7 Modern Khmer
Volume 2
Section 5 Bahnaric
Chapter 8 Bunong
Chapter 9 Kơho-Sre
Chapter 10 Sedang
Section 6 Katuic
Chapter 11 Kui Ntua
Chapter 12 Pacoh
Section 7 Vietic
Chapter 13 Vietnamese (Tiếng Việt)
Section 8 Khmuic
Chapter 14 Kammu
Chapter 15 Mlabri
Section 9 Mangic
Chapter 16 Bugan
Section 10 Palaungic
Chapter 17 Dara’ang Palaung
Chapter 18 Danau
Section 11 Khasian
Chapter 19 Standard Khasi
Chapter 20 Pnar
Section 12 Nicobarese
Chapter 21 Car Nicobarese
Appendix
Index of Languages
Index of Subjects

Citation preview

The Handbook of Austroasiatic Languages Volume 1

Grammars and Language Sketches of the World’s Languages Mainland and Insular South East Asia Editor Paul Sidwell (Australian National University)

The titles published in this series are listed at brill.com/gswl

The Handbook of Austroasiatic Languages VOLUME 1

Edited by

Mathias Jenny and Paul Sidwell

LEIDEN | BOSTON

Library of Congress Cataloging-in-Publication Data The handbook of Austroasiatic languages / Edited by Mathias Jenny and Paul Sidwell. p. cm. Includes bibliographical references and index. ISBN 978-90-04-28295-7 (hardback : alk. paper) — ISBN 978-90-04-28357-2 (e-book) 1. Austroasiatic languages—Handbooks, manuals, etc. I. Jenny, Mathias, editor. II. Sidwell, Paul, editor. PL4283.H36 2014 495.9’3—dc23 2014033059

This publication has been typeset in the multilingual “Brill” typeface. With over 5,100 characters covering Latin, IPA, Greek, and Cyrillic, this typeface is especially suitable for use in the humanities. For more information, please see www.brill.com/brill-typeface. issn 2352-9342 isbn 978-90-04-28295-7 (set hardback) isbn 978-90-04-28750-1 (vol. 1) isbn 978-90-04-28752-5 (vol. 2) isbn 978-90-04-28357-2 (e-book) Copyright 2015 by Koninklijke Brill nv, Leiden, The Netherlands. Koninklijke Brill nv incorporates the imprints Brill, Brill Nijhoff and Hotei Publishing. All rights reserved. No part of this publication may be reproduced, translated, stored in a retrieval system, or transmitted in any form or by any means, electronic, mechanical, photocopying, recording or otherwise, without prior written permission from the publisher. Authorization to photocopy items for internal or personal use is granted by Koninklijke Brill nv provided that the appropriate fees are paid directly to The Copyright Clearance Center, 222 Rosewood Drive, Suite 910, Danvers, ma 01923, usa. Fees are subject to change. This book is printed on acid-free paper.

Contents volume 1 Acknowledgements ix List of Maps x Abbreviations xi Notes on Contributors xiv

Part 1 General Chapters 1 Editors’ Introduction 3 Mathias Jenny and Paul Sidwell 2 The Austroasiatic Languages: A Typological Overview 13 Mathias Jenny, Tobias Weber and Rachel Weymuth 3 Austroasiatic Classification 144 Paul Sidwell 4 Austroasiatic Comparative-Historical Reconstruction: An Overview 221 Paul Sidwell and Felix Rau 5 Overview of the Munda Languages 364 Gregory D. S. Anderson

part 2 Grammar Sketches Section 1 Aslian 1 Northern Aslian 419 Nicole Kruspe, Niclas Burenhult and Ewelina Wnuk

vi 2 Semaq Beri 475 Nicole Kruspe

Section 2 Monic 3 Old Mon 519 Mathias Jenny and Patrick McCormick 4 Modern Mon 553 Mathias Jenny

Section 3 Pearic 5 Chong 603 Suwilai Premsrirat and Nattamon Rojanakul

Section 4 Khmeric 6 Old Khmer 643 Paul Sidwell 7 Modern Khmer 677 Walter Bisang

volume 2 Section 5 Bahnaric 8 Bunong 719 Becky Butler 9 Kơho-Sre 746 Neil H. Olsen

Contents

Contents

10 Sedang 789 Kenneth Smith and Paul Sidwell

Section 6 Katuic 11 Kui Ntua 837 Kees Jan Bos and Paul Sidwell 12 Pacoh 881 Mark J. Alves

Section 7 Vietic 13 Vietnamese (Tiếng Việt) 909 Marc Brunelle

Section 8 Khmuic 14 Kammu 957 Jan-Olof Svantesson and Arthur Holmer 15 Mlabri 1003 Kevin Bätscher

Section 9 Mangic 16 Bugan 1033 Jinfang Li and Yongxian Luo

Section 10 Palaungic 17 Dara’ang Palaung 1065 Sujaritlak Deepadung, Ampika Rattanapitak and Supakit Buakaw

vii

viii 18 Danau 1104 Aung Si

Section 11 Khasian 19 Standard Khasi 1145 K.S. Nagaraja 20 Pnar 1186 Hiram Ring

Section 12 Nicobarese 21 Car Nicobarese 1229 Paul Sidwell Аppendix 1266 Index of Languages 1320 Index of Subjects 1326

Contents

Acknowledgements

This book would not have been possible without the work, cooperation, and goodwill of the contributors, editorial assistants, and our publisher. We would like to thank the Institute of Comparative Linguistics, University of Zurich, for affording Mathias time and resources to work on the project, including financial support for students to assist with copyediting and formatting manuscripts. The Institute also assisted Paul with accommodation and facilities during several visits to Zurich for work on the volume. Work done by Paul towards the project during 2009–11 was made possible by financial support provided by the National Endowment for the Humanities (Washington) for his Mon-Khmer Languages Project, and since December 2012 by financial support provided by the Australian Research Council under its Future Fellowship scheme.

List of Maps Chapter 3 1 Branches of the Austroasiatic Phylum 144 Chapter 5 2 Location of the Munda languages in India 364 3 Munda languages of Odisha 366 4 Location of languages represented in the grammar sketches 416

Abbreviations 1 2 3 i, ii, iii, . . . A ABL ABIL ABS ACC ACP ACS ACT ADD ADDR ADH ADJ ADV ADVRS AGR AGT ALL ANA ANIM ANTIP AOR APPL APPR APRX ART ASP ASRT ASSOC ATTR AUX

first person second person third person verb/tense class (Munda) agent ablative abilitive absolutive accusative attention calling particle accessible active additive address particle adhortative adjective adverb(alizer) adversative agreement agent(ive) allative anaphoric animate antipassive aorist applicative approving particle aproximative article aspect assertive particle associative attributive auxiliary

AVC BEN CAUS CLF COLL COM COMP COMPAR COMPL COND CONS CONSUL CONT CONTR COP COORD CSP CVB DAT DCON DECL DEF DEM DES DESC DET DIM DIR DRCT DISC DIST DISTR DS DU DUR

auxiliary verb construction benefactive causative classifier collective comitative complementizer comparative completive conditional consequence consultative continuous contrastive copula coordinating particle consent seeking particle converb dative discontinuous declarative definite demonstrative desiderative descriptive determiner diminutive directional direct (speech) discourse particle distal demonstrative distributive different subject dual durative

xii DYN ECHO EMPH EMOT ENUM EQUD EPIST ERG EUPH EXCL EXCLAM EXP F FAM FILLER FIN FOC FREQ FUT G GEN GOAL GRNDV H HAPP HES HON HORT HUM IDEOPH IDF, id IMM IMP INCEP INCL INCLIN INAN INC.OBJ INCL

Abbreviations dynamic aspect echo word emphatic emotional particle enumerative particle equidistal demonstrative epistemic particle ergative euphonic exclusive marker exclamation experience feminine familiar filler finite focus frequentative future goal, recipient genitive goal gerundive high honorific happenstance hesitation honorific hortative human ideophone identificational particle immediate (future) imperative inceptive inclusive marker inclinative inanimate incorporated object inclusive

iNCLIN IND INDF INDR INF INS INTER INTR INTS IPFV IRR ITER L LINK LOC M MANN MEDL MID MIR MPROX N NNEG NEWINF NFIN NML NOM NSIT NVIS NVOL OBJ OBL OPT ORD P PART PASS PERF

inclinative indicative indefinite indirect (speech) infinitive instrumental interrogative intransitive intensifier imperfective irrealis iterative low honorific linker locative masculine manner medial demonstrative middle voice mirative mid-proximal neuter nonnegation, negative new information non-finite nominal(izer) nominative new situation non-visible non-volitional object oblique optative ordinal number patient partitive passive perfect

xiii

Abbreviations PFV PL PN POL POSS PP PRED PROG PROH PROX PRS PST PTCP PTCL PURP Q QUOT REAL RECP RED REF REFL REL REM REP REP.SP REPET REQ RES RESTR RHET

perfective plural proper name politeness particle possessive preposition phrase predicative progressive prohibitive proximal demonstrative present past participle particle purposive question marker quotative realis reciprocal reduplication referential reflexive relativizer remote reported reported speech repetitive request particle resultative restrictive rhetorical question

RLZ ROY RTOP RTR S SBJ SBJV SEQ SG SNS SRP SS STAT SUB SUPER SURP T TAG TCL TEMP TITLE TOP TR UNIT VENT VIS VOC VOT

realizing particle royal language resumptive topic retracted tongue root single argument subject subjunctive sequential singular sensed (non-visual) self reflecting particle same subject static aspect, stative copula subordinate superlative particle expressing surpise theme tag particle topic-comment linker temporal title topic transitive unitized ventive visible vocative voice onset time

Notes on Contributors Mark Alves has been a professor in the Department of Reading, ESL, and Linguistics at Montgomery College in Rockville, Maryland (USA) since 2004. His presentations and publications over the past 20 years have focused on historical, comparative, and typological linguistics in Southeast Asia, especially Vietnamese and Mon-Khmer languages. Gregory D.S. Anderson is founder and director of Living Tongues Institute for Endangered Languages. His research focuses on the description and history of the Munda languages of India, covering topics ranging from comparative phonology to verb typology, and to situate these in broader South Asian areal and Austroasiatic historical contexts. Aung Si is a McKenzie postdoctoral fellow at the School of Languages and Linguistics, University of Melbourne, Australia. His interests include ethnobiology, anthropological linguistics and language contact. He has carried out fieldwork in southern India (Karnataka state), Myanmar (Shan state), Bhutan and Australia (west Arnhem Land). Kevin Bätscher (MA, Simon Fraser University) is pursuing his PhD degree in linguistics at the University of Hawaiʻi at Mānoa. His primary interests lie in clause linkage, spatial relations and language revitalization, with a focus on the linguistic areas of South East Asia (Austroasiatic, Austronesian, Sinitc, Thai) and the Pacific Northwest (Salishan). Walter Bisang is professor of General and Comparative Linguistics at the University of Mainz since 1992. He has studied General Linguistics, Chinese and Georgian at the University of Zürich (Switzerland). Main interests: linguistic typology, grammaticalization, complexity. Languages of interest: East and mainland Southeast Asian languages, Austronesian, Caucasian, Yoruba.

Notes On Contributors

xv

Kees Jan Bos is a linguist with SIL International. For the past nine years he has worked with the Kui language in Cambodia, doing linguistic analysis, language development and translation. Marc Brunelle is an associate professor in linguistics at the University of Ottawa. His main research interests are the phonetics and phonology of Southeast Asian languages, with a special emphasis on tones and prosody. His research mostly focuses on Vietnamese and Eastern Cham. Supakit Buakaw is a lecturer of linguistics at the Faculty of Humanities and Social Sciences, Khon Kaen University, Thailand. His main fields of interest are phonetics, phonology, and Southeast Asian linguistics. Niclas Burenhult is associate professor of General Linguistics at Lund University and research associate at the Max Planck Institute for Psycholinguistics, Nijmegen. His research interests include the relationship between language, culture and cognition, semantic typology, language documentation and description, and linguistic prehistory, with particular focus on Austroasiatic and Southeast Asia. Becky Butler recently completed her Ph.D. in linguistics at Cornell University and currently works as an ESL specialist at the University of North Carolina. She researches the phonology and phonetics—both articulatory and acoustic—of languages in mainland Southeast Asia, with a special focus on Austroasiatic languages. Sujaritlak Deepadung is an associate professor of linguistics at The Research Institute for Languages and Cultures of Asia, formerly known as Institute of Language and Culture for Rural Development, Mahidol University, Thailand. Her main fields of interest are pragmatics, field methods in linguistics, and Austroasiatic linguistics, on which she has conducted field work and published over the past 15 years.

xvi

Notes on Contributors

Arthur Holmer is associate professor at the Department of Linguistics in the Centre for Languages and Literature at Lund University (Sweden). His primary research focus lies within Formosan (Austronesian) syntax, but he has also worked on the syntax of other East and Southeast Asian languages, especially Kammu (Austroasiatic: Laos). Mathias Jenny is a senior researcher and lecturer at the Department of Comparative Linguistics, University of Zurich, Switzerland. His main fields of interest are language contact and language change in Southeast Asia, with a special focus on the languages of Myanmar/Burma, on which he has conducted fieldwork and widely published over the past twenty years. Nicole Kruspe is a researcher at the Centre for Languages and Literature, Lund University, Sweden. She is a specialist in the documentation and description of the Aslian languages, and has conducted research on the Semelai, Mah Meri, Semaq Beri and Ceq Wong languages. Jinfang Li is a professor at Minzu University of China. His fields of research interest are Tai-Kadai languages and other non-Han minority languages in south China and surrounding regions, with a focus on documentation of lesser-known languages. Yongxian Luo is an associate professor at the University of Melbourne. The focus of his research is in the comparative reconstruction of Tai and Kam-Sui language history, with particular emphasis on the Sino-Tai connection. Patrick McCormick is a research fellow at the Department of Comparative Linguistics at the University of Zurich and is also a historian affiliated with the École française d’Extrême-Orient in Rangoon, Myanmar/Burma. His main field of linguistic interest is the history of language contact and convergence in Mainland Southeast Asia.

Notes On Contributors

xvii

Keralapura Shreenivasaiah Nagaraja professor of linguistics at Deccan College Post-Graduate & Research Institute, Pune, India, from 1981 until his retirement in 2009. His specialization is Austroasiatic and other languages of India, on which he has conducted extensive field work, including Austroasiatic Khasi and Korku, Tibeto-Burman Konyak, as well as the language isolate Nihali. He has published numerous books and articles in national and international journals. Neil H. Olsen earned his M.A. and Ph.D. from the University of Utah. As a documentary linguist, his research interests include the Austroasiatic languages, with a special focus on the minority languages of Vietnam, especially Kơho, on which he has conducted fieldwork and published over the past forty years. Suwilai Premsrirat is an emeritus professor at at the Resource Center for Documentation and Revitalization of Endangered Languages, Research Institute for Languages and Cultures of Asia, Mahidol University, Thailand. Her main fields of interest are Austroasiatic linguistics, language documentation and description, and language revitalization and maintenance. Ampika Rattanapitak is a lecturer at the Burmese Division of the Oriental Languages Department, Faculty of Humanities, Chiang Mai University, Thailand. Her main fields of interest are pragmatics and the Burmese language, on which she has taught and conducted field-work over the past five years. Felix Rau (University of Cologne) has conducted extensive fieldwork on Gorum and Gutob and has been working in the Koraput Area of Odisha (India) since 2002. His research interest includes the Munda languages and the historical relationship of this branch to the rest of the Austro-Asiatic family. Hiram Ring is a linguist and researcher at Nanyang Technological University, Singapore. His main interests are in phonology, morphology and syntax, particularly of Khasian/Meghalayan languages.

xviii

Notes on Contributors

Nattamon Rojanakul is a researcher at the Resource Center for Documentation and Revitalization of Endangered Languages, Research Institute for Languages and Cultures of Asia, Mahidol University, Thailand. Her fields of interest focus on Pearic languages research and digital documentation for the endangered language. Paul Sidwell is a senior lecturer and research fellow at the Department of Linguistics, College of Asia and the Pacific, Australian National University (Canberra). The focus of his research lies in the comparative reconstruction of Austroasiatic language history and its wider implications for the history of Mainland Southeast Asia. Kenneth D. Smith served with the Summer Institute of Linguistics since 1962 in Vietnam, Malaysia, Philippines and Thailand. His linguistic research concentrated on the Sedang language of the Vietnam Central Highlands. After 1975 he was an administrator with SIL in Southeast Asia and since 2011 has been a finance officer of the Graduate Institute of Applied Linguistics in Dallas, Texas. Jan-Olof Svantesson is Professor Emeritus in General Linguistics at Lund University, Sweden. His main research area is the phonology, morphology and phonetics of Southeast Asian languages and Mongolian languages. Tobias Weber is a Ph.D. student and lecturer at the University of Regensburg, Germany. His main research interests include the typology of case systems and their diachronic development, phonology, and the areal-typological profile of Southeast Asia. Rachel Weymuth is a master student of General Linguistics at the Department of Comparative Linguistics, University of Zurich, Switzerland. Her main interests are the three varieties of Palaung: Shwe, Rumai and Ruching, the interplay with their contact languages as well as the features of negation in the Austroasiatic languages.

Notes On Contributors

xix

Ewelina Wnuk is a PhD candidate in the Language and Cognition Department at the Max Planck Institute for Psycholinguistics in Nijmegen, the Netherlands. She has conducted fieldwork among the speakers of Maniq since 2009. Her research interests are: language description and language documentation, ethnobiology, semantics, in particular semantic specificity and language of perception.

Part 1 General Chapters

⸪

chapter 1

Editors’ Introduction

Background to This Handbook

Seeing the Handbook of Austroasiatic Languages come to fruition is something like seeing a difficult adopted son finally leave home; taking him on was the right thing to do, it was necessary, and it felt almost noble at the beginning, but we didn’t really know what would be necessary and now there is a real sense of relief that things have moved on. There are always orphans that need taking care of, and Austroasiatic was definitely in that category. While some find good homes, others are left uncared for. There are false starts, broken promises, the failures to follow through. Institutional arrangements exist, but in the end it is the decisions made by individuals that matter. Other language families have more congenial histories: family reunions that draw hundreds of participants, homes of bricks and mortar that provide reliable shelter over decades, and wealthy uncles and rich inheritances underlying well documented lineages. But in this world of diverse circumstances and mixed fortunes fates are not fixed in stone. One can still make a go of things, and to the bold, as well as the lucky, can come the opportunities. A major step forward for our side came in 2008 with the publication of The Munda Languages (Routledge) edited by Gregory Anderson (who also provides a chapter in this volume). This achievement both inspired and shamed us by not only showing what could be done, but by starkly reminding us what had yet to be done, as the Munda languages are but one branch beside a dozen others on which scores of linguists have been working over decades. The concerns and emphases among Austroasiatic scholars vary considerably according to geography, with consequences for the study of such a dispersed language family. Our counterparts in the sub-continent toiled through the 20th century to collect extensive vernacular texts, compiled extraordinarily encyclopaedic dictionaries (such as the 16 volumes of Hoffman & Emelen’s Encyclopedia Mundarica 1928–37, and Bodding’s five volume A Santali dictionary 1929–36), and invested heavily in understanding morpho-syntax. More or less from the beginning of modern linguistic inquiry, Munda languages were rendered into writing without fuss; their easily distinguished syllables and modest segmental inventories were readily Romanised. And within the Indian cultural and political context, in which diverse ethnic groups have recognition

© koninklijke brill nv, leiden, ��5 | doi ��.��63/9789004283572_002

4

Editors ’ Introduction

and often official status, and epic narratives enjoy great currency, it is understandable that extensive texts would be collected and grammatical analyses could proceed. At the same time, comparative studies have been neglected; historical linguistics within India is dominated by Sanskrit and does not take an especially outward looking stance. The master work of Pinnow (1959) was exceptional, and despite great hopes did not catalyse subsequent comparative Munda reconstruction in phonology or lexicon among that generation. A somewhat different environment prevailed in Southeast Asia, especially with the blooming of American structuralist linguistics from the 1960s. Field workers enthusiastically collected survey wordlists and fussed endlessly over issues of confusing and competing phonological analyses. This has fed a veritable frenzy of mostly low-level comparative Austroasiatic work (see Sidwell’s 2011 historiography of such studies) and a fascination with phonology. Complex tone and phonation systems, bewildering arrays of vowels and different consonant types packed into highly asymmetrical syllables still present fundamental problems of analysis, providing ongoing fodder for research. By comparison, syntax has not seemed quite so interesting. With little in the way of morphology, the apparent areal dominance of national languages such as Thai and Vietnamese, and the popularity of questionnaire-based data elicitation methods, a narrative of syntactic convergence has come to powerfully influence thinking. There are also the problems that arise in connection with local national scholarly traditions. The huge region encompassing South and Southeast Asia is divided between many countries, and there is little harmony or cooperation between academic traditions. Local research practices have not favoured innovation or otherwise kept up with the rest of the world. Thus, even when nations such as India, Thailand, Vietnam, Malaysia and China have produced descriptive materials over recent decades in the form of dissertations and monographs, the methods have often been highly formulaic and lacking in sophistication, the output frequently not in international languages, and the volumes can be difficult to obtain even if you can find the responsible university or publisher. When language development work is done, circumstances often demand ill-fitting local orthographies and other unfortunate accommodations to national languages. As a consequence these descriptions and published vernacular text collections are very mixed in their extent, quality, transcription systems, and accessibility. Additionally, even experienced western scholars can find it difficult to cooperate or work beyond rather narrow local or personal imperatives. It has proved profoundly difficult to organise international meetings, or to otherwise coordinate activities. Although there is in principle a single journal of record

Editors ’ Introduction

5

for Austroasiatic linguistics, the Mon-Khmer Studies journal, the bulk of output by scholars is dispersed and difficult to keep track of. During more than a century of Austroasiatic linguistics, there has been a lamentable history of failure to follow through; perhaps the most egregious being the failure to publish the papers of the second ICAAL meeting (also known as SICAL, held in Mysore 1978). In 1983 the editors quietly abandoned the project with approximately half the papers already finalised and typeset. At around the same time Harry Shorto also put aside the extensive Mon-Khmer Comparative Dictionary he had been compiling and refining since the 1960s, and that manuscript was only resurrected and published in 2006, a decade after his passing. It is fair to say that the 1980s marked a period of failure of leadership in Austroasiatic studies. There has been a tendency to blame wider circumstances for decline experienced through this period: Austroasiatic linguistics also went into a period of relative decline as empirical linguistic research and analytical language documentation went out of fashion due to the international influence of anti-empiricist trends in linguistic theory prevailing at the time. van Driem, http://www.himalayanlanguages.org/icaal

However, this appeal to fashion is not enough to explain the broad atomisation of Austroasiatic studies, with its hallmark lack of programmatic coordination over many decades, especially the failure to leverage off the bold syntheses offered by figures such as Schmidt, Blagden, Pinnow, and Shorto. These scholars achieved what they did by going beyond local or fashionable concerns, taking a broader view in the conduct of their work. We paid a heavy price for the failure of leadership and international coordination through the “Long Pause” (as van Driem calls it). By the end of the 20th century there had been relatively few informative descriptions of Austroasiatic languages written in contemporary typological terms. Thankfully the situation began improving with the new millennium, particularly stimulated by the language documentation movement. Grammars began appearing such as Kruspe (2004) A grammar of Semelai, Burenhult (2005) A grammar of Jahai, Jenny (2005) The verb system of Mon, Alves (2006) A grammar of Pacoh, Peterson (2011) A grammar of Kharia, and so forth. But the output has remained relatively meagre against the absolute scale of Austroasiatic diversity and compared to the prodigious efforts of scholars working in the neighboring Tibeto-Burman or Austronesian worlds. In recent years we also began to regain some of the momentum that high level comparative reconstruction had attained 30 years before, and in

6

Editors ’ Introduction

2006 Sidwell oversaw the posthumous publication of Shorto’s Mon-Khmer Comparative Dictionary. That work had grown directly out of Shorto’s (1971) Dictionary of Mon Inscriptions, with partial incorporation of new data and insights achieved in the 1970s. Despite its late appearance, the need was great as no such comparative Austroasiatic compilation had appeared in print since the work of Schmidt (1905) and Blagden (1906), a full century before. Consequently the field finally acquired a basic comparative reference comparable to works of similar scope and quality for other language families of the region, such as Dempwolff’s (1938) Vergleichende Lautlehre des austronesischen Wortschatzes or Emeneau and Burrows & Emeneau’s (1961) Dravidian Etymological Dictionary. Importantly with this publication came the imperative to surpass it, to demonstrate to the field what we have learned in the meantime and outline how our work can best proceed. A direct consequence was the establishment of the Mon-Khmer Languages Project with its extensive online resources for comparative work (http://sealang.net/monkhmer). The new zeitgeist of interest and activity also saw the irregular resumption of the ICAAL (International Conference on Austroasiatic Linguistics) meetings in 2007, with the 3rd ICAAL held at Deccan College Post-Graduate & Research Institute in Pune (India), after a hiatus of nearly 30 years. The meeting was organised by K.S. Nagaraja, with encouragement from (in particular) George van Driem. Although a modestly sized meeting, it brought together various old and new generation figures, stimulating a cautious optimism. Crucially, it brought together Indian scholars, working on Munda, Khasian and Nicobarese, with SEAsian and Western scholars who work in Thailand, IndoChina, Myanmar and Malaysia. This was properly the formal re-establishing of Austroasiatic-wide scholarly relations after decades of purely locally focussed distributed efforts. It was followed by ICAAL4 in Bangkok (2009) and ICAAL5 in Canberra (2013). A meeting had been planned for 2011 in Bangkok but had to be abandoned due to flooding in central Thailand at that time.

The Birth of the Handbook

With the resumption of international contacts the circumstances for the conception of this volume came into being. The idea first emerged during a casual conversation in London at the 40th International Conference on Sino-Tibetan Languages and Linguistics, on a fateful September day in 2008. The gentlemen who would eventually edit this volume were making small talk to provide distraction—and some sense of scholarly solidarity—during a somewhat unimpressive paper presentation. Discussing the recent resumption of ICAAL,

Editors ’ Introduction

7

the appearance of various new grammars, and Anderson’s Munda handbook, the theme of conversation turned to the possibility of a broad Austroasiatic handbook. More than one academic publisher has its long-standing language family handbook series yet apparently none had yet commissioned a volume for the phylum that stands at the geographical crossroads of the East’s great civilizations and which itself gave birth to one of history’s greatest monumental cultures, Angkor. The more we thought about it the more we recognised an obligation to act rather than complain. The following year ICAAL was held in Bangkok (at Mahidol University Salaya campus), the largest gathering of its kind so far. At the business meeting on the evening of the last day discussions crystallized into a formal proposal, as Mathias suggested that the time was right to put together a handbook. There was much discussion over the hour; in particular there was some debate over whether to go with a printed volume at all or proceed to a purely digital living work. Objections to a printed volume included:

• •

a book quickly becomes outdated as knowledge improves; hardcovers are expensive—a publisher could charge three or four hundred dollars a copy for the initial print run, while a paperback might take five years to appear, or not appear at all. Yet others present asserted that their institutions would only recognize hardcopy publication, and this would not necessarily rule out an electronic version that could be developed as an extensible resource. It was also put that a book project is positive because it forces coordination and deadlines such that work comes to fruition that might otherwise drag on, unfinished and unpublished, an all too familiar problem in our field. Eventually agreement was reached and a number of scholars present pledged to contribute chapters to the handbook. It was agreed that Mathias and Paul would approach publishers and negotiate a satisfactory deal, and several potential publishers were nominated. In fact not much happened for the next two years as other business kept us busy, until in 2011 Mathias created the project website (sites.google.com/site/icaalprojects) and began laying out details, including proposed table of contents, standardized notational conventions, invitations and suggestions for contributions. The initial plan was intended to be straightforward: 24 language sketches of 20 pages each (two for each branch except Munda, which was already well presented in Anderson’s handbook) plus historical and typological overview chapters. The whole thing would be around 650 pages and our naive expectation was that this would be minimally burdensome on the authors, who would be encouraged to adapt

8

Editors ’ Introduction

existing work into our template. The publication was not intended to be primarily a showcase for new and interesting results, but to give an overview of the language family.

How the Book Came Together

The project took some time to gain momentum, with relatively little happening over the first year and a half. The situation turned with the holding of a workshop for participants at the 45th International Conference on Sino-Tibetan Languages and Linguistics, at Nanyang Technological University, Singapore, 25 October 2012. This was an all day workshop with 13 presentations, which included discussion of draft sketches by Mathias Jenny (Mon) and Paul Sidwell (Sedang). Many practical issues were raised and discussed. It was in discussions at this meeting that we realized that some adjustments to our original plan would be necessary. A particular sticking point was that everybody with a substantial amount of data on their particular language seemed to want to provide as full a description as they could muster, and there was mention of drafts of 60, 70 or more pages length. Given the publisher’s limit of being able to accommodate a book of no more than approx. 1000 pages, and being reluctant to produce multiple volumes, we suggested that sketches should be limited to 30 pages or around 10,000 words maximum. In the end we came to something like this result, with the shortest sketches being just over 6,000 words and the longest a little more than 13,000.1 Another pleasing outcome of the Singapore meeting was the participation of Gregory Anderson. The project was initially launched as the MonKhmer Languages Handbook, intended to complement Anderson’s The Munda Languages volume, consistent with the tendency over the past half century to recognize Austroasiatic as a phylum consisting of Munda and Mon-Khmer families. But as comparative-historical work has progressed in recent years it has become increasingly clear that there is no such dichotomy, there is not even such a thing as a Mon-Khmer family, but rather a phylum with some 13 branches which coordinate historically in a manner that is not quite clear. Also, the publications by Anderson and others have revealed that Munda is underlyingly much more similar to what was previously regarded as the MonKhmer type, and it would not be appropriate to continue the historical mischaracterisation of Austroasiatic by crystallizing the notion in a Mon-Khmer 1 As the manuscripts were typeset it became evident that the book would count around 1300 pages, which forced the publisher to split the publication into two volumes.

Editors ’ Introduction

9

handbook. It was with great satisfaction that Anderson agreed to contribute his Munda overview chapter to the handbook, allowing it to have properly Austroasiatic scope and carry Austroasiatic in the title. There was also discussion in Singapore about notational conventions and standards for the handbook. It has always been the editors’ strong intention that the content of the sketches be consistent in presentation, utilizing a single TOC template, a unified list set of glossing abbreviations, and consistent use of IPA in vernacular transcription.2 Remarkably, this attitude did encounter some resistance. The inertia of long-time local traditions and expectation of deference to authorial preferences over editorial authority run deep in linguistics, where egos are all too often placed above the greater good of the discipline. We managed to largely stand our ground, making some minor accommodation in order to admit some sketches, and ultimately a couple of originally intending contributors did withdraw from participation, but we are satisfied that we took the right approach. Throughout the project we have endeavored to conduct things in a transparent and cooperative manner, nonetheless we could not accommodate the full diversity of views and intentions that have been expressed at times. A case in point is that from the beginning Gérard Diffloth expressed strong support for the project and volunteered to come on board as an editor. We were quite pleased at the prospect of Gérard contributing the benefit of his specialist knowledge and vast experience, not the least of which was the prospect of a historical and classification overview chapter that he proposed to contribute. However, it was not to be, as eventually irreconcilable editorial differences emerged, and Gérard withdrew from the project. Readers will notice that we did not quite achieve our aim of two representative sketches for each Austroasiatic branch. Various specialists in the respective languages who are well placed to make contributions were contacted and did signal agreement. However, the academic life is a demanding one, and it can happen that one may not find the time to put together two dozen pages of notes within a timeframe of three or four years, despite one’s best intentions, and as editors we recognize these limitations. In a couple of cases we were provided with drafts, but they were so divergent from what was requested that they could not be used. In other cases authors withdrew quite late in the project creating serious possible gaps in our plan. Fortunately we were able to find willing and able authors who contributed 2 The parallel layout of the grammar sketches allows the reader to easily find specific topics in each language. This also allowed the editors to restrict the index to the first part of the publication, covering the general chapters.

10

Editors ’ Introduction

chapters at relatively short notice, and in this connection we would like to especially thank: Becky Butler, Hiram Ring, and Aung Si. Consequently only Vietic, Pearic, Nicobarese and Mangic are represented by single sketches in this handbook. Also, it is significant that we include a number of grammar sketches of languages for which previously there were no grammatical descriptions available, in addition to some languages that are already quite well known.

A Unifying Typological Approach

Our original editorial intention was quite clear; it was not really so much about a book, as a new programmatic vision for Austroasiatic studies, to overcome the lack of direct comparability that has characterized work until now. We take a broad typological approach, recognizing that recurrent features of languages ought to be appropriately described in a consistent manner, without imposing theoretical bias on the languages described. This led to our insistence on consistent glossing, even to the extent that it can make the translations appear somewhat opaque outside the areal context. An example is the semantically bleached verbs used in modal/aspectual functions, such as the ‘give’ benefactive and causative constructions that frequently turn up in the sketches. Our attitude is that these should be glossed with their lexical meanings, rather than labeled according to their contextual functions— the latter should always be clear from the translations. This practice nicely captures an important recurring feature of the language family and area, of the kind inherently interesting to typology, historical linguistics, contact linguistics, cognitive studies etc. and which might otherwise be obscured under a diversity of labels, interpretations and competing theoretical imperatives. Understandably, as drafts came in, it became apparent that our recommendations to authors were not always followed. This should not have been a surprise, as what we have been intending all along is a new level of coordination in our field. This typological approach specifically prioritizes the coherence of the work above the autonomy of the individual writers. This is a paradigm shift from the established culture of the field, which failed to spontaneously manifest coherence to the extent that we ought to expect of a mature field. As scholars it is important that we conduct and report our work in a manner that maximizes its accessibility and intelligibility; not just within AA studies but to the broader context comprised of all subfields of linguistics.

Editors ’ Introduction

11

Consequently we had to deal with the extensive editorial challenge of finessing the texts to conform to the volume requirements without interfering with the individual authors’ analyses. This task, both time consuming and frustrating, yielded substantial benefits. As a consequence of working through all the chapters we have now developed an elaborated list of glossing conventions that covers a wider and more appropriate range of categories than the standard Leipzig Glossing Rules list, unified the application of IPA to segmental representations, and consolidated a range of descriptive and analytical tools for ongoing use. This is the basis of our unifying typological approach with which we intend to coordinate efforts in Austroasiatic studies going beyond this handbook.3 Mathias Jenny and Paul Sidwell June 2014 Zurich References Alves, Mark. 2006. A Grammar of Pacoh: a Mon-Khmer language of the central highlands of Vietnam. Canberra: Pacific Linguistics (PL580). Anderson, Gregory (ed.). 2008., The Munda languages. London/New York: Routledge. Blagden, Charles Otto. 1906. Pagan races of the Malay Peninsula. Vol. 2. London: Macmillan. Bodding, P.O. 1929–1936. A Santali dictionary (5 volumes). Oslo: I Kommisjon Hos Jacob Dybwab (Norwegian Academy of Science and Letters). Burenhult, Niclas. 2005. A grammar of Jahai. Canberra: Pacific Linguistics. Burrow, Thomas & Murray Barnson Emeneau. 1961. Dravidian etymological dictionary. Oxford: Clarendon Press. Dempwolff, Otto. 1938. Vergleichende Lautlehre des austronesisches Wortschatzes. Bd. III: Austronesisches Wörterverzeichnis. Berlin and Hamburg: Dietrich Reimer. Hoffmann, John & Arthur van Emelen. 1928–37. Encyclopedia Mundarica (16 volumes). Patna: Government Superintendent Printing

3 Our attitude is also consistent with the advocacy of unified methods in data warehousing and sharing by Doug Cooper of the Centre for Research in Computational Linguistics (Bangkok), such as presented the 24th SEALS meeting in Yangon, in May 2014.

12

Editors ’ Introduction

Jenner, Philip N., Laurence C. Thompson, and Stanley Starosta (eds.). 1976. Austroasiatic Studies (2 vols.). Honolulu: University of Hawaii (Oceanic Linguistics, Special Publication, No. 13). Jenny, Mathias. 2005. The Verb System of Mon. Zurich: Universität Zürich. Kruspe, Nicole D. 2004. A grammar of Semelai. Cambridge: Cambridge University Press. Peterson, John. 2011. A grammar of Kharia: a South Munda language. Leiden/Boston: Brill. Pinnow, Heinz-Jürgen. 1959. Versuch Einer Historischen Lautlehre Der Kharia-Sprache. Wiesbaden: Otto Harrassowitz. Schmidt, Wilhelm. 1905. Grundzüge einer Lautlehre der Mon-Khmer-Sprachen. Denkschrift der Akademie der Wissenschaften. Wien, Philologisch-Historische Klasse 51:1–233. Shorto, Harry L. 1971. A dictionary of the Mon inscriptions, from the sixth to the sixteenth centuries, incorporating materials collected by the late C.O. Blagden (London Oriental Series 24). London: Oxford University Press. Sidwell, Paul. 2011. Comparative Mon-Khmer Linguistics in the 20th Century: where from, where to? In K.S. Nagaraja (ed.) Austro-Asiatic Linguistics: In memory of R. Elangaiyan. (Proceedings of the 3rd International Conference on Austroasiatic Languages) Mysore: Central Institute of Indian Languages, 38–104.

chapter 2

The Austroasiatic Languages: A Typological Overview Mathias Jenny, Tobias Weber and Rachel Weymuth The Austroasiatic (AA) languages are geographically widespread, occurring in mostly discontinuous pockets of speech communities. The vast majority of AA languages have no official status; the only exceptions are Vietnamese and Khmer, which are the national languages of Vietnam and Cambodia, respectively, and Khasi, which has official status in the Indian state of Meghalaya, where it is effectively used as a majority language by one million plus speakers for many purposes. Serving as vernaculars of non-majority populations in most countries where they are spoken, the AA languages are exposed to external influence to varying degrees. This has led to the great diversity of structures found in the AA languages today, which is partly due to different degrees of retention of inherited features, partly to language change, mostly under the influence of dominant neighboring languages. As the overall structure of the proto-language is not well known apart from various morphological processes found in branches separated by hundreds or thousands of kilometers for many centuries, it is often difficult to make conclusive statements concerning the origin of specific structural features in individual languages. While the verb final constituent order in the Munda languages can be attributed to areal South Asian influence with some confidence, we cannot readily explain the presence of verb initial structures found in Nicobarese as well as some Khasian and Palaungic languages. Typologically, the AA languages can be superficially grouped into three distinct subgroups which belong to three geographic regions of the AA speaking area. The Munda languages in central and eastern India are consistently verbfinal agglutinating languages, with a large number of affixes expressing derivational processes as well as case relations with nominals, and tense-aspect and person with verbs. The Nicobarese languages, spoken on the Nicobar islands in the Andaman Sea, are generally verb-initial and exhibit complex morphological processes, including prefixes, infixes, and suffixes. The rest of the family, mostly spoken in Mainland Southeast Asia, is generally verb-medial, and apart from traces of inherited derivational morphology, isolating. A large number of other features and isoglosses cut across the boundaries of these three © koninklijke brill nv, leiden, ��5 | doi ��.��63/9789004283572_003

14

Jenny, weber and weymuth

groups, though. In addition, at least in the case of the Munda languages, the preset typological profile is at least partly due to the areal influence of South Asian languages. No such influence can be conclusively claimed in the case of Nicobarese languages, but in both Munda and Nicobarese there are traces of different word orders, which may be seen as older. The present typological appearance is therefore not enough for an internal classification of the AA languages into three groups, but may rather be the (recent) outcome of events in the history of these languages. This chapter brings together data from a large number of AA languages covering all known or described branches. The examples are mostly taken from the grammar sketches by various authors in this publication, complemented with data from other published sources. No complete typological profile of the AA family is attempted here, as the state of AA studies is still far from a point where this undertaking would be possible. Our aim is rather to give a short overview of structures found in the different AA languages, illustrating the diversity of the family. Extensive text corpora on which a more in-depth study of morphosyntactic structures would have to based are to the present day nonexistent in next to all AA languages. Also comprehensive descriptive grammars are, with a few exceptions, not available for most AA languages, and the available material covers only a small part of the whole family. We are therefore not yet in a position to synthesize a complete typological profile of AA, and must content ourselves with a descriptive presentation of the structures encountered in the individual languages. 1

Phonetics and Phonology

AA languages generally tend to reflect the typological phonological profile of the geographical areas in which they are spoken. Hence, Munda languages follow patterns typical of South Asia, while the AA languages spoken in Mainland Southeast Asia follow patterns typical of that area. However, AA languages also exhibit features that are frequent in the family but not typical of the languages of other families of the respective areas. Besides the characteristic features of AA, individual languages in some cases considerably differ from each other and exhibit features that are rare—from an areal or even from a worldwide perspective. The branches of AA show differing degrees of internal phonological diversity. Fairly diverse branches are Bahnaric, Palaungic, Aslian, and Vietic (cf. Diffloth 1980: 3). The reasons for that might be partly the number of languages that belong to each branch (Bahnaric being the largest) and partly the areal profile of the languages (small languages spoken in remote areas and/or in

The Austroasiatic Languages: A Typological Overview

15

contact with different dominant languages tend to be more diverse, cf. Nichols 2003), but the broader picture of the exact nature of the typological diversity found among AA languages remains to be established. 1.1 Word and Syllable Structure Austroasiatic languages usually have monosyllabic and disyllabic phonological words. Additionally, due to morphological derivations and loanwords from other languages, most prominently Sanskrit and Pāli (often mediated through third languages such as Thai or Lao) and later also English and French, many allow for polysyllabic words. A characteristic feature of many Austroasiatic languages (although less so especially in Munda, Vietnamese, and Nicobarese) are phonological words that consist of two syllables, whereby an initial unstressed syllable (often called ‘minor syllable’ or ‘presyllable’) is followed by a stressed full syllable (‘main syllable’). This word structure has also been called ‘sesquisyllable’ (‘one-and-ahalf’ syllables long) since Matisoff (1973). First raised by Shorto (1963: 46), the original idea of this term was to describe a situation where an onset consonant was preceded by another (‘pre-initial’) consonant, and for phonotactic reasons an unstressed vowel occurs between the two consonants, creating the nucleus of an unstressed syllable. The nature of the first unstressed syllable, however, differs from language to language. A typical structure of such sesquisyllabic words is the following, taken from Mon: (1) Mon1 (Cə).C(C)V(C)

Other languages also allow for closed minor syllables. Pacoh is a case in point: (2) Pacoh CV(C).C(C)V(C)

Sesquisyllabicity has also been postulated for Proto-AA (Donegan & Stampe 2004, Shorto 2006, Sidwell this volume). Furthermore, it has also been postulated for some other language families of Southeast Asia and adjacent areas, namely for Proto-Sino-Tibetan (Benedict 1972, Handel 1998), and also for the Kam-Sui branch of Tai-Kadai (Solnit 1988). Moreover, lexemes reconstructed for Proto-Tai-Kadai (Ostapirat 2005) and Proto-Hmong-Mien (Ratliff 2010) 1 All examples, unless otherwise indicated, are taken or adapted from the respective grammar sketches in the two volumes of this publication. In addition, some of the Mon examples are from MJ’s own field notes, and the Rumai data is mostly from RW’s fieldnotes.

16

Jenny, weber and weymuth

have maximally two syllables. On the other hand, words in Proto-Austronesian are longer due to affixation (Dempwolff 1934, Blust 1988, Wolff 2010). In sesquisyllabic words of AA languages, generally only the first consonant of the second (main) syllable has the full consonant inventory. The inventory of the second consonant of the main syllable is often limited to sonorants. The inventory of the final consonants often roughly corresponds to the inventory of the onset consonants of main syllable but lacking an apparent voicing or VOT contrast;2 in some cases there are further restrictions in fricatives and liquids. Some AA languages only permit closed main syllables, e.g. Mlabri, Old Mon (with some exceptions), and some Aslian languages (e.g. Jahai). That is, there is a strong tendency for stressed syllables to be heavy, i.e. bimoraic. Anderson (2004) describes the concept of a ‘bimoraic constraint’, which can explain why some monosyllabic free forms in Munda have to be expanded (i.e. resyllabified and/or reduplicated) in order to remain heavy. The combinations of consonants in the main syllable are often heavily restricted. Usually all consonants can occur in the first position, if no consonant follows. Otherwise, the first position is usually also restricted to stops and, in some cases, additionally also to some fricatives, while only liquids and occasionally glides are possible in the second position. In Mon, for instance, the following clusters are permitted: (3) Mon kj kr kl kw khj khr khl khw pj pr pl phj phr phl

Other AA languages allow for more cluster combinations. Sedang additionally has nasal-plus-liquid and stop-plus-h combinations: (4) Sedang -h: ph th (ch) kh -l: pl (tl) kl bl gl ɓl ml 2 We say ‘apparent’ because there is a tendency—for example in Khasi (see Nagaraja this publication)—for codas to be longer after short vowels than after long nuclei. Conceivably this could relate to a historical voicing or VOT feature in syllable codas.

The Austroasiatic Languages: A Typological Overview -r: pr tr br dr ɓr ɗr mr ˀmr

17

kr gr ŋr ˀŋr ŋ̊ r

In Khasi morphemes are predominantly monosyllabic, and the syllables have the following structure: (C)CV(C). Most syllables are closed, and there are many combinations of onset consonants, some of them go against the sonority hierarchy, but sequences of the same place of articulation are avoided. Dental+alveolar clusters such as tn-, tr-, tl-, tʰn-, tʰr-, tʰl do occur, however. Khasi onset clusters include the sequences of two stops, one of them voiceless and the other voiced (Henderson 1989–1990: 62): (5) Khasi bti bthi dkar dkhar dpei

‘to lead by the hand’ ‘sticky’ tortoise’ ‘plainsman’ ‘ashes’

pdot pdeng tbian tba

‘throat’ ‘middle’ ‘floor’ ‘to feel’

Some rare sequences of two voiceless stops are also attested (ibid.): (6) Khasi tkor-tkor ptej-ptej kpa

‘plump and tender’ (used of something very sticky) ‘father’

Furthermore, Khasi has nasal+stop clusters such as mtung ‘in a large heap’ and mthin ‘strong and stout’. Khmer also has a lot of CC combinations in the onset of the main stressed syllable. Most notably there are no restrictions in the second onset position, and the first onset onset position is restricted to p, t, c, k, ʔ, s, d, m and l. Furthermore, Huffman (1967) lists some monosyllabic loanwords that have three consonants in the onset: (7) Khmer (Huffman 1967) sthaːn ‘place’ lkhaon ‘theater’

18

Jenny, weber and weymuth

Sre permits maximally three consonants in the onset. The third consonant has to be a glide: (8) Kơho ŋgwi krjaŋ

‘to sit down’ (type of hard wood)

On the other hand, Nicobarese and some Munda languages can only have maximally one consonant in syllable onsets. The syllable structure of Kharia is the following: (9) Kharia (Peterson 2011: 32) (C)V(C)

The consonant inventory of the first (minor) syllable is often also smaller than that of the second (main) syllable. The vowel inventory of the minor syllable is often limited to [ə], which in many languages does not have phonemic status. Some AA languages also allow a number of vowels other than [ə] in that position, but not the full range of vowels that can occur in the main syllable. In North Aslian languages, the default epenthetic vowel is [ə], but [i] and [u] also occur, depending on the onset consonant of the following syllable: palatal consonants condition [i], bilabial, labiovelar and velar consonants condition [u]. In some languages the vowel of the minor syllable is phonemic, i.e. it is not predictable from the environment of the syllable. In Pacoh, for instance, the minor syllable has the structure CV(C). In the onset position any consonant except postglottalized glides and aspirated stops can occur, the nucleus can be filled with the vowels a, i, and u, and in the coda position nasals and liquids are allowed. Kri also permits a, i, and u. In the same position, r can also occur: (10) Kri (Enfield & Diffloth 2009: 35) C{V/r}.(C)CV(C)

Mlabri shows some variation in the vowel of the minor syllable, which however is not phonemic: (11) Mlabri təguk ~ tuguk chəmɔn ~ chɔmɔn chr̩kɛŋ ~ chərkɛŋ bm̩ bwaj ~ bumbwaj

19

The Austroasiatic Languages: A Typological Overview

Northern Kammu distinguishes between non-tonal and tonal minor syllables: non-tonal minor syllables consist of a consonant and (in careful speech) an epenthetic (non-phonemic) ə and bear the same tone as the following main syllable. Tonal minor syllables, on the other hand, consist of a consonant and a nasal, between which usually an epenthetic ə is inserted; this second syllable type carries a tone that is independent of the tone of the following main syllable: (12) Northern Kammu non-tonal minor syllable: tonal minor syllabe:

c.mə̀ km̀ .múʔ

[cə̀.mə̀ ʔ] [kə̀ m.múʔ]

‘rope’ ‘human being’

A minority of AA languages such as Khmer or Kui have CC onsets in minor syllables. The second consonant is restricted to l and r. (13) Kui crə.lia ‘thorn’ krə.wiən ‘sickle’ trə.bɒh ‘break’

Minor syllables that are created by partial reduplication of the main syllable are also attested in AA. In Sedang, the onset consonant or even onset clusters consisting of two consonants can be reduplicated. In some instances, the vowel of the main syllable is also reduplicated: (14) Sedang reduplication of the (only) onset C of the main syllable: reduplication of two onset Cs of the main syllable: reduplication of the onset C and the nucleus V of the main syllable:

cicow, dudat, ɲiɲɔn krikro̰w, tritrow kiki, hihiə, iʔiw

Monosyllabic words are also frequent in most other AA languages, although far less frequent than in Khasi. The typical structure of monosyllabic words in AA has the following shape: (15) e.g. Jahai CV(C)

Vietnamese restructured its phonology considerably under Sinitic influence. Consequently, it became predominantly monosyllabic as it lost all the minor

20

Jenny, weber and weymuth

syllables of sesquisyllabic words (see Ferlus 1997 for an overview). But in contrast to Khasi, and also to Khmer, Vietnamese allows only one consonant in the onset (followed by an optional w, however): (16) Vietnamese C(w)V(V)(C)

Contemporary Vietnamese has driven monosyllabicity to the extent that it can be analyzed without postulating the phonological word as a separate prosodic unit from the syllable (cf. Schiering et al. 2010), i.e. there does not appear to be a prosodic unit between the syllable and the phonological phrase that is relevant for any phonological rule or process. However, this does not entail that Vietnamese lexemes cannot be di- or polysyllabic, as the following examples illustrate. The semantics of single syllables of di- and polysyllabic lexemes is not always transparent: (17) Vietnamese semantically transparent:

quán ăn

[kwanB1 ɁănA1]

‘restaurant’ (lit. ‘shop eat’) semi-opaque compounds: Hà Nội [haA2-nojB2] ‘Hanoi’ (lit. ‘river interior’) loanwords: ban công [ɓanA1-kowŋ͡mA1] ‘balcony’ (< Fr. balcon)

However, although it has been recognized that there is a tendency for sesquisyllabic words to become monosyllabic (cf. e.g. Matisoff 2001, Brunelle 2009, Michaud 2012), sesquisyllabic words in AA appear to be diachronically very stable, and Vietnamese is a rare deviation from that stability. Moreover, Brunelle & Pittayaporn (2012: 412) argue that a change from sesquisyllabicity to monosyllabicity is less common than generally assumed. In a number of AA languages, words can be more complex. Disyllabic words in which the first syllable is not reduced also occur in some AA languages that otherwise predominantly have monosyllabic and sesquisyllabic words but are rarer and usually occur in derivations and loanwords (cf. Section 2). Lexemes do not always consist of a single phonological word. In Sedang, mono- or disyllabic words can be partly or fully reduplicated, creating lexemes, which especially also include onopatopoeia. (18) Sedang: complete word reduplication ʔblut ʔblut ‘little by little’

The Austroasiatic Languages: A Typological Overview (19) Sedang: partial reduplication CV reduplication: Initial and final C reduplication: Initial C reduplication: Minor syllable and initial C reduplication:

baŋ bal kuŋ kiŋ kɔŋ cek caŋ kəʔblou kəʔbla

21 (kind of catfish) ‘elbow’ ‘to lie on one’s side’ ‘very crowded’

Due to affixation and compounding, phonological words in Khasi can also be more complex, consisting of up to five syllables: (20) Khasi two syllables: three syllables: four syllables: five syllables:

laŋ.brot kʰa.basa eːr.laŋ.tʰa.ri pɨn.hiar.mɨn.siem.set

‘lamb’ ‘fish variety’ ‘whirl-wind’ ‘snore’

Some Aslian languages also allow for more complex words. Jahai, for instance, has tetrasyllabic words with the form /C.C.C(V)C.CVC/ as in t-b-tadɔʔ (relprog-wait) ‘waiting’. The main stress is always on the last syllable. Infixes in AA are always inserted after the first onset consonant. Aslian and Nicobarese have a process whereby the coda of the final syllable is reduplicated and inserted in the empty coda position of the first unstressed syllable, as in Semaq Beri bəq ‘to tie’ vs. nk-bək [nək.bək] ‘tying’. This process has also been referred to as ‘incopyfixation’ (cf. Matisoff 2003). Outside Aslian and Nicobarese, Khmuic also shows some traces of coda copying, which increases the likelihood that it was already present in Proto-AA (cf. Sidwell’s overwiew chapter on comparative-historical reconstruction). Munda (with the exception of Gtaʔ, see Anderson this volume) and Nicobarese languages do not have sesquisyllabic words of the types found in Mainland Southeast Asia. They allow for more complex morphology, including suffixation. To what degree such more complex morphological words correspond to phonological words is still under-investigated. 1.2 Phoneme Ιnventory and Phonotactics 1.2.1 Consonants The size and structure of Austroasiatic consonant inventories vary considerably. As already mentioned, the phonemic inventories of AA languages are largest in the first onset position of the main (stressed) syllable. Fairly small consonant inventories are found in different branches of AA and include those of Car Nicobarese and Khmer (17 consonants each). On the other hand, several AA languages have consonant inventories consisting

22

Jenny, weber and weymuth

of 30 or more consonants, e.g. Kharia (34, including consonants occurring in loanwords only, Peterson 2011: 29), Ruching Palaung (31), Bugan (36), or Sedang (39). The majority of AA languages have between 20 and 30 consonants, which are also about average-sized inventories from a worldwide perspective (Maddieson 2013a). AA languages typically have five places of articulation for stops and four for nasals: bilabial, dental/alveolar, palatal, velar, and glottal (for stops only). The most common fricatives are s and h, f is rarer, and further fricatives yet rarer. Furthermore, AA languages typically have one rhotic (r), one lateral approximant (l), and two glides (w and j). AA languages differ with respect to the stop series. Several have two stop series, usually distinguishing between voiceless (unaspirated) and voiced stops, which in some languages are implosive. They are found in most branches of AA. Khmer is an example for a language that has two stop series, voiceless (plain) vs. voiced: (21) Khmer consonant phoneme inventory (fully available in the initial position of main syllables) p t c k ʔ b d m n ɲ ŋ (f) s h r v l j

Phonetically, however stop+[h] sequences occur in the onset. Before a vowel, they are the realization of the phonemic sequence of a stop plus h, as in khɤŋ ‘angry’. On the other hand, the first stop of a two-stop cluster is inherently aspirated (e.g. [khp]; see also Huffman 1972). Many other AA languages (including the reconstructed Proto-AA) have three stop series. Chong has three stop series: voiceless plain, voiceless aspirated, and voiced: (22) Chong p t c k ʔ ph th ch kh b d m n ɲ ŋ ( f ) s h r w l j

The Austroasiatic Languages: A Typological Overview

23

Implosive stops are a further feature found in many languages of Mainland Southeast Asia but only very rarely in other parts of Eurasia and also somewhat rarely worldwide (Maddieson 2013c). Following this areal trend, they are also found frequently in the AA languages of Mainland Southeast Asia. The phonemic status of the implosive stops differs, however. Aslian languages do not have implosive stops. Kammu has three stop series, including an implosive series: (23) Kammu p t c k ʔ ph th ch kh ɓ ɗ m n ɲ ŋ s h r w l j ʔw ʔj

Note the typologically fairly unusual contrast between plain and glottalized glides, which are also found in a number of other AA languages. Car Nicobarese has only one stop series, namely unaspirated voiceless stops. (24) Car p t c k ʔ m n ɲ ŋ f s h r ɽ v l j

r is unusual in that it is realized as prestopped rhotic in initial position, as [t] before voiceless consonants and as [d] elsewehere. On the other side of the extreme, some Munda languages such as Kharia follow the South Asian trend in having four stop series, including retroflex stops: voiceless unaspirated (p, t, ʈ, c, k), voiceless aspirated (ph, th, ʈh, ch, kh), voiced plain (b, d, ɖ, ɟ, g), and breathy voiced (bʱ, dʱ, ɖʱ, ɟʱ, gʱ). (25) Kharia (Peterson 2011: 29) p t ʈ c k (ʔ) tʰ ʈʰ cʰ kʰ b d ɖ ɟ g bʱ dʱ ɖʱ ɟʱ gʱ m n (ɳ) ɲ ŋ

24

Jenny, weber and weymuth f s h r (ɽ) (ɽʱ) w l j

Munda languages generally have retroflex stops, both voiceless ʈ and voiced ɖ. The retroflex nasal ɳ is also common but does not occur in all Munda languages, while a phonemic contrast between r and ɽ is more frequent. Thus, with the presence of retroflex consonants the Munda languages again follow the pattern typical of South Asia. They may stem from contact influence of Dravidian (and/or later Indo-Aryan). However, the phoneme inventories of other Munda languages such as Santali are much smaller; they have only two stop series, namely voiceless and voiced. Khasi, on the other hand, as a non-Munda language spoken more in the periphery of South Asia, also exhibits the four stop series described for Kharia, but lacks retroflex sounds. (26) Khasi p t k ʔ pʰ tʰ kʰ b d ʤ bʱ dʱ ʤʱ m n ɲ ŋ s ʃ h r w l j

Closely related Pnar has six contrastive places of articulation, distinguishing between apical and laminal stops, but lacks the breathy voiced stop series found in Khasi:3 (27) Pnar p t t̪ ʧ k ʔ ph th kh b d d̪ ʤ

3 In the cognate lexicon in Pnar, the breathy stops are resyllabified, e.g. compare Khasi bʱa ‘good’ with Pnar băha ‘good’.

The Austroasiatic Languages: A Typological Overview

25

m n ɲ ŋ s h r w l j

Katu has a different set of four stop series; it lacks breathy voiced stops but has a distinction between plain voiced and implosive: (28) Katu (Costello 1971) p t c k ʔ pʰ tʰ kʰ b d ɟ g ɓ ɗ ʄ m n ɲ ŋ s h r w l j

Danau also has the same four stop series, but the fricative inventory is larger and includes five places of articulation and voicing: (29) Danau p t c k ʔ ph th ch kh b d ɟ g ɓ ɗ m n ɲ ŋ f s ç x h sh v z r w l j

The rare fricative sh is also found in varieties of Burmese, thus the presence of this phoneme can be explained in terms of languages contact. v occurs only rarely, e.g. in dəvaɛ ‘tiger’, which is realized with an inserted glide, i.e. [dəvwaɛ]. Mon has three stop series, voiceless non-aspirated, voiceless aspirated, and implosive, and a voicing contrast in nasals and glides:

26

Jenny, weber and weymuth

(30) Mon p t c k ʔ ph th ch kh ɓ ɗ m n ɲ ŋ m̥ n̥ ɲ̊ s ɕ h r w l j w̥ l ̥

Northern Vietnamese has the same three stop series (although the voiceless aspirated series is restricted to tʰ) and a fricative inventory that is unusually large in the AA context: (31) Northern Vietnamese (initial consonants) t ʧ c k Ɂ tʰ ɓ ɗ m n ɲ ŋ f s z x h v ɣ l

Also note the absence of voiceless bilabial stops (p and pʰ), g, r, and glides, which, however, occur in other positions of the syllable. The consonant inventory of Northern Vietnamese differs from the one of Southern Vietnamese in several ways: Southern Vietnamese r and j are merged into z, and Southern Vietnamese has retroflex consonants and g instead of ɣ. As already mentioned, the vast majority of AA languages have four nasals: m, n, ɲ, ŋ. They can occur in both onset and coda positions. Nasals frequently occur word-finally, but are relatively infrequent in the initial position of monosyllabic words. The distribution of word-initial ŋ in AA follows areal tendencies. In Eurasia, it is present in the northeast and in the southeast, but not elsewhere (cf. Anderson 2013, Comrie 2007: 33). Accordingly, it is present in the AA languages of Southeast Asia but rare in the Munda languages—at least as word-initial onset consonants—as in other languages of South Asia. Some AA languages (including Munda languages) have syllabic nasals, cf. Remo ŋkuj or Gtaʔ ŋkui ‘water pot’ (Bhattacharya 1975: 47).

The Austroasiatic Languages: A Typological Overview

27

A number of AA languages has prenasalized stops in the onset position of main syllables. In Sedang, the voiced stop series is obligatorily prenasalized, i.e. prenasalization is not contrastive. On the other hand, Bunong has both a plain voiceless and a prenasalized voiceless stop series: (32) Bunong p t c k ʔ ph th ch kh ɓ ɗ (ʄ) (ɠ) ɲc ŋk mp nt ç h r w l j ˀw ˀj

Note also the glottalized glides. Bugan has more nasal+stop sequences, including both voiceless affricates and voiced stops and affricates, but they may be analyzed as sequences of two phonemes: (33) Bugan onsets p t ʦ k q ʔ pʰ tʰ ʦʰ kʰ mʦ mʦʰ b d g mb nd ndz ŋg ŋq md mdz m n ȵ ŋ f θ, s ɕ x h ʑ ɣ w l

Bugan has further unusual segments: a large inventory of fricatives, including both voiceless and voiced palatal and velar, a voiceless interdental θ, and the voiceless uvular stop q. Some Palaungic and Khmuic languages, among them Mlabri and Eastern Kammu, have a voicing contrast in both stops (p, t, c, k vs. b, d, ɟ, g) and sonorants (m̥ , n̥ , ɲ̊ , ŋ̊ , l ̥, r̥, w̥ , j̊ vs. m, n, ɲ, ŋ, l, r, w, j). Northern and Western Kammu have lost this distinction and retained only the unmarked members of the series, i.e. voiceless stops and voiced sonorants.

28

Jenny, weber and weymuth

The differentiation between voiced and voiceless sonorants was also present in Proto-Southwestern Tai and is still present in a number of other non-AA languages of Southeast Asia, especially in Burmese and other Sino-Tibetan languages spoken in Myanmar and adjacent areas in China and India. Sedang is unusual4 in having three sonorant series: voiced, glottalized, and voiceless: (34) Sedang p t c k ʔ b d ɟ g ɓ ɗ (ʄ) m n ɲ ŋ ˀm ˀn ˀɲ ˀŋ m̥ n̥ ɲ̊ ŋ̊ s, ʂ h r ˀr r̥ w l j ˀw ˀl ˀj w̥ l ̥ j̊

The voiced stops are generally realized as prenasalized. Stop plus [h] can be analyzed as the realization of a sequence of two phonemes. The inventory of syllable-final consonants is smaller than that of onset consonants in AA languages in that there is only one series of stops, which are always voiceless and typically unreleased, i.e. accompanied by glottal restriction which stops the airflow. The other consonants that can occur in the onset position of main syllables are typically permitted, including nasals in all four places of articulation. Some languages, however, have more restrictions. In Kui, for instance, s occurs in syllable-initial position but cannot occur in the coda:

4 The contrast of glottalized and voiceless sonorants is probably much more common among Bahnaric and Katuic languages than reported. E.g. see the phonology of Bahnar as described by Banker, Banker and Mơ (1979).

The Austroasiatic Languages: A Typological Overview

29

(35) Kui coda consonants p t c k ʔ m n ɲ ŋ h r w l j

In Danau, x is the only fricative occurring in coda position, which is in sharp contrast to the large fricative inventory in the initial position of stressed syllables. Khasi further restricts its coda consonant inventory; fricatives and l are not allowed: (36) Khasi coda consonants p t ʲt k ʔ m n ɲ ŋ r w j

However, r can occur. Bunong, on the other hand, does not allow any liquids in coda position. Bugan is yet more restrictive and limits its coda consonant inventory to the nasals m, n, ŋ and the stops p, t, k. Thus, the coda inventory is much smaller than in most other AA languages, but fits fairly well into the areal Chinese context. Southern Vietnamese also has a very restricted set of coda consonants. Palatals and fricatives are not permitted, and the alveodentals and the velars occur in allophonic distribution which can be predicted by preceding vowels: (37) Southern Vietnamese p t/k m n/ŋ w j

Munda languages, on the other hand, are an exception in that they allow for final stops that are described as voiced, i.e. the glottal restriction is released while the oral closure is still maintained, cf. Kharia mɔ’ɖ. In some northern Munda languages final stops can be nasally released; Osada (2008) transcribes Mundari b and d phonetically as [ʔb̥ m], [ʔd̥ n].

30

Jenny, weber and weymuth

In North Aslian languages, word-final nasals are preploded ([bm, dn, ᶡɲ, gŋ]) if they occur after an oral vowel. Aslian languages share these syllable-final stop-plus-nasal sequencies with Dayak languages of Borneo (cf. Matisoff 2003: 19). 1.2.2 Vowels AA languages typically have very large vowel inventories. For instance, Vương Hừu Lễ (1999) claims 42 vowel phonemes for Bru. The full vowel inventory is only available in main stressed syllables. Minor syllables usually only have ə (often non-phonemic), some languages, esp. Aslian and Khmuic permit a, i, and u. The standard (simple) vowels are represented in the system as it is found in Chong. All vowels qualities have a length contrast: (38) Chong i/iː ɨ/ɨː u/uː e/eː ə/əː o/oː ɛ/ɛː ɔ/ɔː a/aː

Typically, AA languages have two mid-vowel series and also two central vowels, although ɨ is missing in some languages. Munda vowel systems show a greater degree of diversity, but are generally smaller. Mundari, Keraʔ, Korku, Kharia, Sora and Gutob have a 5-vowel system. (39) Kharia (Peterson 2011: 27) i u e o a

The central unrounded non-low vowels ə and ɨ are absent in most languages, but Sora has both, Santali only ə, and Gtaʔ only ɨ. Munda languages often apparently only have one series of mid-level vowels. Juang has a constrast between o and ɔ, Santali, Gtaʔ and some Ho varieties distinguish both e vs. ɛ and o vs. ɔ. Santali exhibits morphologically-driven vowel harmony, with a vowel alternation between e and ɛ in some affixes, depending on the vowels in the stem. The third singular agreement marker is a case in point:

The Austroasiatic Languages: A Typological Overview

31

(40) Santali (a) ɛɽɛ=jɛ rɔɽ-a (b) səri=je rɔɽ-a lie=3 speak-fin truth=3 speak-fin ‘he tells lies’ ‘he tells the truth’

Khasi also has only one series of mid-high vowels. Note the presence of ɨ and the absence of ə. All vowels except ɨ can be both short and long. (41) Khasi i/iː ɨ u/uː e/eː o/o: a/aː

Car, on the other hand, has two mid-vowel series: (42) Car i e ɛ (æ)

ɨ u ɤ o ə ɔ a

Most notably, Car has a rarely occurring phonemically relevant distinction between /ɤ/ and /ə/, as in lɤ́ːkə ‘to be gone along’, by way of’ vs. lə́ːkə ‘to do well’. The phoneme /æ/ is restricted to English loanwords. Some AA languages, such as Kơho, functionally have four degrees of vowel hight, giving them two low non-front vowels: (43) Kơho i ɨ u e ǝ o ɛ ɔ a ɑ

Bugan notably has the front rounded vowel y: (44) Bugan i, y ɯ u e ə o ɛ ɔ a

32

Jenny, weber and weymuth

Front rounded vowels are frequent in the languages of northern Eurasia including China but rare in the whole rest of the world, including South and Southeast Asia (Maddieson 2013b). Thus, the occurrence of y in Bugan is peculiar in the AA context but fits into the areal picture for a language spoken in China. In most AA languages vowel length is contrastive. In some cases, however, contrastive length is restricted to a subset of vowel qualities, i.e. the inventory of long vowels is smaller than that of short vowels. Thus, for instance in Bunong, front vowels do not have a length contrast: (45) Bunong i ɨ/ɨː u/uː e ə/əː o/oː ɛ ɔ/ɔː a/aː

Short vowels in Bunong are generally more centralized and lax. Kammu also has fewer long vowel phonemes than short vowel phonemes, but the restrictions are of a different kind than those of Bunong in that midopen vowels are always short: (46) Kammu i/iː ɨ/ɨː u/uː e/eː ə/ə o/oː ɛ ʌ ɔ a/aː

In Northern Vietnamese, out of 12 vowels only a and ə contrast in length, rarely also ɛ and ɔ. Several AA languages, among them Munda, Aslian, Palaungic languages, have lost the phonemic length contrast. However, non-phonemic vowel length is still present in some of these languages. In Danau, for instance, monophthongs also have different durations, depending on the syllable type and tone; vowels in closed syllables with falling tone are shorter than vowels with high-falling and high tone. In Mon, monophthongs are still realized with various degrees of length. Longer vowels occur in open syllables and in syllables ending in a nasal or an approximant. Vowels in checked syllables (i.e. ending in a stop) are between long and short, while syllables ending in a glottal consonant (ʔ or h) contain short monophthongs. AA languages often have a large diphthong inventory. Diphthongs are also present in several AA languages that do not have a phonemic length contrast. Danau has the following diphthongs:

The Austroasiatic Languages: A Typological Overview

33

(47) Danau eɪ ui ɪɛ ou ɛə eɯ

In addition, Danau also has nasalized diphthongs, which differ from oral diphthongs in their place of articulation: (48) Danau eɪ̃ oũ ɐɪ̃ ɐũ æɛ̃

Mon is another example of a language which lacks a phonemic vowel length contrast but has a fairly rich inventory of diphthongs: (49) Mon iə uə ɒə eə oə ao ɛə ɔə

An abundance of diphthongs and triphthongs are also described for some Munda languages, e.g. in Santali (Ghosh 2008: 24f). There are also some AA languages with only few diphthongs. Kammu, for instance, only has three diphthongs, namely ia, ɨa and ua, and Pacoh has iə, ɨə, uə. Surin Khmer is remarkable in being described as having no diphthonged vowels in closed syllables (see Chantrapanth & Phromjakgarin 1978). 1.3 Suprasegmentals Many AA languages have phonologically relevant register and/or tone distinctions. Register (i.e. phonation type) and tone have often been treated separately, i.e. the relevant contrasts were analyzed in terms of register or tone only. However, the two phenomena are often not independent of one another and should therefore be analyzed in a comprehensive way. For instance, creaky voice is often associated with a higher pitch, while breathy voice is associated with lower voice. Traditionally, AA languages have been characterized as register languages, i.e. register has been considered to be phonologically relevant, while tone is considered to be merely a phonetic epiphenomenon, which is not phonologically relevant. Unless more detailed phonetic analyses are available, we follow this practice here, with the caveat that most AA languages are still

34

Jenny, weber and weymuth

in need of more thorough phonetic analyses. Register differences are worldwide fairly uncommon, but most notably they are rare in Southeast Asia outside AA. Still, it is interesting to note that tone is not a prevalent feature of AA, although complex tone systems are very prominent in the non-AA languages of Mainland Southeast Asia (Sino-Tibetan, Tai-Kadai, Hmong-Mien). 1.3.1 Registers/Phonation Types AA languages are well known for having phonologically relevant phonation types which are relevant for vowels, usually referred to as ‘registers’. In principle, the following phonation types are relevant in the context of AA: – –

–

Modal (or clear) voice: default phonation type Creaky voice (laryngealization): vocal folds are vibrating while the arytenoid cartilages are much closer together than in modal voice; laryngeal musculature is tensed, the vocal folds do not vibrate as a whole (Ladefoged and Maddieson 1996: 53) Breathy voice (murmur): vocal folds vibrate in a looser form, sounds have a higher flow rate (Ladefoged & Maddieson 1996: 57)

Many AA languages have a two-way contrast between breathy and modal voice. Languages of this type include Wa, Mon, Kri (Enfield & Diffloth 2009), Pacoh, Paraok, Mon, Bunong, Kui, and some Western Khmer dialects, but not Standard Khmer as spoken in the Phnom Penh. Other AA languages such as Sedang have a contrast between modal and creaky voice. The following examples illustrate minimal pairs. (50) Sedang modal creaky ka ‘to eat’ ka̰ kan ‘big’ ka̰n bau ‘to wash face’ ba̰w pəla ‘chaff’ pəla̰ ma ‘eye’ ma̰ soj ‘to err’ so̰ j

‘fish’ ‘chief’ ‘field rice’ ‘elephant tusk; between’ ‘we two (excl.)’ ‘to sarifice’

The two registers of Bunong differ in their realization, depending on the vowel quality, as the following table illustrates.

The Austroasiatic Languages: A Typological Overview Table 2.1

35

Registers in Bunong

High and mid vowels Low vowels

Register 1: – lowered in mouth – modal Lower pitch, diphthong Equal pitch, monophthong

Register 2: – raised in mouth – breathy Higher pitch, monophthong Equal pitch, diphthong

Bugan also has a tense versus lax distinction in all vowels but y. For tense vowels, the tongue position is lower and further back. For instance, in ta³¹ ‘near’, the vowel is realized as [a], but in ta̠³¹ ‘to bet’, it is close to [ɑ]. In Suoy, breathy vowels tend towards diphthongization (Michaud 2012: 123). Register is not always a property of just the vowel, but can also spread to other syllables. In Mon, for instance, which has a two-register contrast between clear and breathy voice, the vowel of the first unstressed syllable is always in the same register as the vowel of the second stressed syllable. Phonemic register distinctions are also found in Munda languages: Gorum has creaky voice (cf. Anderson & Rau 2008), and Sora has glottalized vowels (Zide 2008). In Santali, certain vowels are associated with a degree of laryngeal tension (cf. Zide 2000: 9). The most complex register systems are found in Pearic languages. Chong has a four-way contrast (see also DiCanio 2009 for a detailed phonetic analysis): (51) Chong Register 1: Register 2: Register 3: Register 4:

clear-modal clear-creaky breathy breathy-creaky

Aslian languages, on the other hand, do not exhibit register contrasts (with the apparent exception of Mah Meri, cf. Stevens et al. 2006), and neither does Car Nicobarese. 1.3.2 Nazalization A minority of AA languages belonging to different branches exhibit a phonemic contrast between oral and nasalized vowels. Among them are several Munda languages such as Santali; Sedang; Aslian languages, and Nicobarese.

36

Jenny, weber and weymuth

Nasality is often subject to certain constraints. Usually, the inventory of nasal vowels is smaller than that of oral vowels. Bugan has the nasal vowel phonemes ã õ ũ ũ ə̃, as opposed to an inventory of the ten oral vowels a, ɔ, o, u, i, e, ɛ, ɨ, ə, and y. The syllable structure can also restrict nasalization. Some AA languages (e.g. Sedang) do not allow nasalized vowels in syllables ending in nasal consonants. Finally, further restrictions include the combination of nasalization and phonation type. In Sedang, for instance, nasalization and creaky voice can only co-occur in open syllables and syllables ending in j or w. Overall, nasalized vowels are fairly rare in Sedang. 1.3.3 Tones Tone and registers are often dependent on one another; this might pose a problem to phonological descriptions, if languages should be classified as ‘tone languages’ or ‘register languages’. Often, one of the two domains is more relevant for phonology (i.e. for phonological processes or rules). Marginal tone contrasts have been reported for some Northern Aslian languages (Schebesta 1928 for Jahai, Bishop 1996 for Kensiw), and Hajek (2003) also assumes tonal activity in Northern Aslian languages, Wnuk and Burenhult (fc.), however, question the evidence brought up. Tone systems are especially prominent in Palaungic languages. Danau has a four-tone system: (52) Danau high-level (55) long-falling (442) short-falling (31) low-level (11)

ā (e.g. shī ‘sun’) â (e.g. kɨ̂ ‘go’) à (e.g. plù ‘thigh’) a (e.g. thwɐn ‘lime’)

Additionally, Danau also exhibits tone-sandhi, but the exact rules need further investigation. Other branches of Palaungic also have tone systems: Angkuic developed a tone system out of the vowel length contrasts, and Lamet has a two-tone system (cf. Sidwell’s chapter on the reconstruction). Some Khmuic languages have simple tone systems: Northern and Western (but not Eastern) Kammu dialects have a tone system consisting of a high level and a low level tone, which can occur in all syllable types. Originally these varieties had a voicing distinction in initial consonants. The loss of this distinction led to the emergence of a tonal

The Austroasiatic Languages: A Typological Overview

37

distinction of the following vowels in Northern Kammu. In Western Kammu, the voiced stops became aspirated, but otherwise the tonogenesis took place along similar lines. Eastern Kammu remained non-tonal. The following examples illustrate the processes: (53) Kammu tonogenesis Eastern Northern Western a. klaːŋ kláːŋ kláːŋ ‘eagle’ glaːŋ klàːŋ kʰlàːŋ ‘stone’ m̥ aːn máːn máːn ‘to bury’ maːn màːn màːn ‘pregnant’ b. kʰuʔ kʰúʔ kʰúʔ ‘weaver ant’ sɨaŋ sɨ́aŋ sɨ́aŋ ‘pig’ ˀjiak ˀjíak ˀjíak ‘shit’ ˀmec ɓéc ɓéc question particle ʔɔŋ ʔɔ̀ ŋ ʔɔ́ ŋ ‘hornet’

Bugan has a much more complex tone system, consisting of six tones in unchecked syllables: high level 55, mid-level 33, rising 35 and 13, and falling 31, neutral; 55, 33, 31 also occur in checked syllables; and finally the neutral tone occurs in some prefixes. Tones in Bugan are apparently fairly unstable, and many forms are recorded with alternate tones, e.g. ʑou³³/⁵⁵ ‘earth’, or χau⁵⁵/³¹ ‘valley’. The best described tone system of an AA language is certainly the one of Vietnamese. Northern Vietnamese has six tones that can occur in unchecked syllables; they combine pitch and phonation type contrasts: (54) Northern Vietnamese tones (Brunelle 2009: 80) A1 (‘ngan’) level, modal voice A2 (‘huyền’) starting low and falling smoothly, modal or breathy voice B1 (‘sằc’) starting mid-range and rising rapidly B2 (‘nặng’) falling, short, ending in a glottal stop C1 (‘hỏi’) falling dramatically, then accompanied by a slight laryngealization C2 (‘ngã’) falling, then interrupted by a glottalization, ending on a dramatic rise

Additionally, Northern Vietnamese has two tones that occur in checked syllables. They are similar to the tones B1 and B2. Southern Vietnamese has five tones, and, in contrast to Northern Vietnamese, phonation type is not relevant.

38

Jenny, weber and weymuth

2.3.3 Stress The vast majority of AA languages, which are sesquisyllabic, are strongly iambic, i.e. the first minor syllable is weak and unstressed and followed by a strong stressed main syllable. This pattern is not confined to AA languages in mainland Southeast Asia but is also found in other languages of the area, e.g. in Thai (Bennett 1994), or Burmese (Green 2005). South Asian languages, on the other hand, are trochaic, i.e. they have feet consisting of a first stressed syllabe and a second unstressed syllable. Munda languages have also been described along these lines (Donegan 1993, Donegan & Stampe 2004). And indeed Tibeto-Burman languages of eastern India, Bhutan and Nepal (LaPolla 2003) have also been described as trochaic. However, metric patterns in Munda are a fairly complex matter. Their exact nature is still under-researched, but often different types of metric patterns seem to coexist. Phonological and morphological words are often distinct and have different prosodic patterns. One of the patterns is the iambic stress pattern. Peterson (2011: 35) describes the prosodic pattern of phonological words in Kharia as low-high. Similarly, in Santali (Ghosh 2008) and Plains Remo (Anderson & Harrison 2008), the stress is always on the second syllable. Car is usually also iambic, but due to morphological processes, the stress can also fall on the penultimate or antepenultimate syllable. However, it remains to be established whether the syllables that follow the stressed syllable are also part of the same phonological word. Car has been described as having three contrastive stress degrees: primary, weak, and emphatic. The stress domain is not always the single word. In Mon, for instance, the stress domain can also be the phrase, as the following examples show: (55) Mon hɒəʔ ˈpèh ɗɔə ˈhɒəʔ ciəʔ ˈpɤŋ

2

‘your house’ ‘in the house’ ‘eat rice’

Word Formation

2.1 Compounding Different types of compounding are found in AA languages, expressing different functions. Compounds can consist of two (or more) nouns, a noun and a verb, or two (or more) verbs. Compounds can in most cases be distinguished from phrases only by stress pattern and semantics. In one common kind of

The Austroasiatic Languages: A Typological Overview

39

compounding, the nouns joined asyndetically result in a meaning that includes both parts, as seen in Vietnamese cha mẹ ‘father mother’, meaning ‘parents’, or qua�n áo ‘pant shirt’ meaning clothes’ (both from Brunelle, this publication). The order of the elements is fixed and the resulting meaning, though semantically transparent, is conventionalized. Other examples of juxtapositional noun-noun compounds are given in the following example. Interestingly, many AA languages have the order ‘mother-father’ for ‘parents’, unlike most neighboring languages, such as Thai and Burmese, which consistently show the inverse order of the elements. (56) Noun-noun compounds Khmer ʔoːpùk-mdaːj ‘father-mother’ ‘parents’ Bugan mau³³-na³³ ‘younger.brother-younger.sister’ ‘sibling’ Mlabri diŋ-roj ‘older.sibling-younger.sibling’ ‘siblings’ Kơho mɛʔ-baːp ‘mother-father’ ‘parents’ Mon mìʔ-mɛ̀ʔ ‘mother-father’ ‘parents’ Pacoh ʔaʔi-ʔaʔam ‘mother-father’ ‘parents’ Sedang now-pa ‘mother-father’ ‘parents’ Sedang ɟa-poə ‘grandmother-grandfather’ ‘grandparents’ Kơho muh-mat ‘nose-eye’ ‘face’ Pacoh duŋ-vḛːl ‘house-village’ ‘neighborhood, society’ Mon ɗɤŋ-kwan ‘town-village’ ‘land, country’

In endocentric compounds, one of the parts modifies or specifies the other. The first element of the compound usually is the head of the whole expression. In Car assimilation and deletion rules apply to compounds. Examples are given in (57). (57) Modifying compounds Car cɛ́hɛcóːn cɛ́hɛn ‘thing’ + cóːn ‘tree’ ‘bird’ Car kɤɲkúːn kóːɲ ‘male’ + kúːn ‘child’ ‘son-in-law’ Chong n9aːj-tɔŋ ‘owner-house’ ‘head of the family’ Khmer nɛːək-srae ‘man-rice.field’ ‘farmer’

In a few cases the head noun is the second element of the compound. This can be due to wholesale borrowing of the expression, as is the case in Mon pɤ̀-kjɤ̀k ‘commander’ from Burmese bo-ʨʰouʔ [bo.ʥouʔ] (lit. ‘forces-cover’). The Bugan word for ‘rice field’, da³⁵-ta̠ɯ̱³⁵ ‘water-field’, is made up of native elements, but shows head-final structure, perhaps influenced by neighboring head-final languages such as Chinese.

40

Jenny, weber and weymuth

One common type of modifying compounds is a class term followed by a specifying term, as seen in the following examples. This kind of compound can be called specifying compound, though the distinction from other modifying compounds is not always clear-cut. (58) Specifying compounds Mon ɗɤŋ-sem Danau souʔ-pəlwe

‘town-Thai’ ‘Thailand’ ‘plant-banana’ ‘banana plant’

Besides nominal compounding, the compound form can consist of verbal elements or verb phrases, including a P argument. (59) Chong tʰa̤ːk cʰaː water eat ‘drinking water’ (60) Khmer nɛːək-cap-trɤj man-catch-fish ‘fisherman’

Like nouns, verbs can also be combined to achieve new meanings or nuances of meanings. The resulting meaning can be the addition of the meanings of the parts of the compound, as in (61). In some cases, as in the Bunong (Phillips 1973b: 131) and Vietnamese (Brunelle, this publication) examples in (62), no clear difference of meaning between the elements and the compound can be detected. Verbal compounds are not always clearly distinguishable from multiverb predicates or serial verb constructions. (61) Additive verbal compounds Vietnamese mua-bán ‘buy-sell’ ‘trade’ Mon sɒh-ràn ‘sell-buy’ ‘trade’ (62) Euphonic verbal compounds Bunong cʰoŋ-cʰaː ‘eat-eat’ ‘eat’ Vietnamese lười-biếng ‘be.lazy-be.lazy’ ‘be lazy’

In Car, nouns can be compounded with prepositional elements, resulting in nouns.

The Austroasiatic Languages: A Typological Overview

41

(63) Car ʔɛlrə́ːn ʔɛl ‘loc’ + rə́ːn ‘foot’ ‘sole’ ʔɛlkɛ́ːl ʔɛl ‘loc’ + kɛ́ːl ‘arm’ ‘armpit’

Unlike most other AA languages, in which no morphological clues can be used to distinguish compounds from phrases, Khasi makes a clear distinction between the two. This is true for nominal and verbal modifying compounds, as seen in the following examples, where the phrase level expressions on the right side have an overt linker and no morphological reduction of the elements. (64) Khasi (a) ka=um-mat (b) ka=um ʤoŋ ka=kʰmat f=water-eye f=water gen f=eye ‘tear’ ‘water of the eye’ (65) Khasi (a) u=laŋ-jap (b) u=phlaŋ (u) ba jap m=grass-die m=grass (m) rel die ‘straw’ ‘grass (that) which dead’ (66) Khasi (a) u=riew-bʱa (b) u=briew (u) ba bʱa m=man-good m=man (m) rel good ‘good man’ ‘man (he) who is good’ (67) Khasi (a) jaːʲt-waːt (b) jaːʲt ban waːt walk-search walk inf search ‘to walk searching’ ‘to walk in order to search’

Khasi also shows noun incorporation, whereby a noun loses its syntactic independence and is closely linked to a verb. (68) Khasi (a) thiet-kot (b) thiet ja ka=kot buy-book buy acc 3sg=book ‘to purchase (a) book’ ‘to purchase a book’

42

Jenny, weber and weymuth

(69) Khasi (a) jaːʲt-lɨnti (b) jaːʲt ha ka=lɨnti walk-path walk on path ‘to travel’ ‘to walk on the path/way’

Similarly, nouns can be incorporated in a verbal expression in Pnar. The incorporated noun can be either the P argument or an oblique participant. As in Khasi, the incorporated noun loses its syntactic independence, as can be seen by the loss of the class marker. (70) Pnar ku u [Ø] da ka=jiŋ.ken. climb m - ins f=ladder ‘he climbs (the tree) with/by the ladder’ (71) Pnar ku jiŋ.ken u [Ø] climb ladder m ‘he ladder-climbs (the tree)’ (72) Pnar ku deɲ u da ka=jiŋ.ken. climb tree m ins f=ladder ‘he tree-climbs with/by the ladder’

2.2 Reduplication Another common type of word formation found in AA languages is reduplication. Reduplication appears in different forms, as full or partial reduplication, with or without phonetic change in the reduplicated part. Reduplication has different functions in AA languages, the most common being adverbial and intensifying for verbs, and plurality or non-specificity for other categories. Full reduplication without alternation is attested in most or all AA languages with different functions, including intensification and plurality. (73) Intensifying Vietnamese Bunong Khasi Kui

đỏ kwɔŋ suki plɜm

‘red’ ‘big’ ‘slowly’ ‘fat’

đỏ-đỏ kwɔŋ-kwɔŋ suki-suki plɜm-plɜm

‘reddish’ ‘really big’ ‘very slowly’ ‘very fat’

The Austroasiatic Languages: A Typological Overview

43

(74) Chong ne̤ːw~ne̤ːw coːk ceːn c9aːk tɔŋ. child~red run come from home ‘The children run from their home.’ (75) Bugan hɛ⁵⁵~hɛ⁵⁵ day~red ‘every day, always’

In Khasi, full reduplication of verbs can result in iterative or continuous meaning, or express indirect causative. (76) Khasi her kʰlik bɨnrap haːp

‘fly’ ‘flicker’ ‘collect, gather’ ‘fall’

her~her kʰlik~kʰlik bɨnrap~bɨnrap haːp~haːp

‘continue to fly’ ‘continue to flicker’ ‘ask someone to put things together’ ‘ask somebody to fall’

Mon uses full reduplication to express indefinite notions. The reduplicated element can be verbal or nominal, as seen in the following examples. (77) Mon pɤ̀m lɒ ɗɛh hɒm~hɒm, ʔuə hùʔ pətɛh pùh. manner inter 3 speak~red 1sg neg believe neg ‘However he speaks, I don’t believe him.’ (78) Mon ɲèh~ɲèh ʔuə~ʔuə klùə klɤŋ cɒp phɛ̀ə roŋ. person~red 1sg~red gather come arrive monastery asrt ‘Everyone will come and gather at the monastery.’ (79) Mon mənìh mùə~mùə human one~red ‘someone’

In Kharia, monosyllabic verbs are reduplicated to form verbal nouns (‘masdars’), which cover different functions, including attributive and complement clauses.

44

Jenny, weber and weymuth

(80) Kharia (Peterson 2011: 94f) ho=kaɽ=aʔ ter~ter jo=joˀɟ. that=sg.hum=gen give~red see=act.pst.1sg ‘I saw him give/his (act of) giving [something to someone].’

Reduplication with stem alternation includes full reduplication with change in onset, vowel, rhyme, or tone, as well as partial reduplication. Partial reduplication usually consists of the repetition of the initial consonant with the same or different vowel. The functions covered by reduplication with alteration are very diverse and cannot easily be categorized. In some cases the function of the reduplicated form is not evident, or the basic form may be missing altogether. Partial reduplication is already well attested in the Old Khmer corpus, with examples like krek /krɤːk/ ‘to stir’ > kakrek /kə krɤːk/ ‘to quake’. (81) Vietnamese: Change in tone đỏ ‘red’ đo đỏ ‘reddish’ nhẹ ‘light’ nhè nhẹ ‘rather light’ (82) Khmer: Change in onset mɛ̀ːn-tɛ̀ːn ‘really’ slòːt-bòːt ‘nice, kind, agreeable’ (83) Sedang: Change in vowel and rhyme mɛ ˀni~ˀnɔ ku~ka mi~mot tuŋ mɛ so red~pull.out red~bite red~enter inside so ‘(The snake) came out (of the water) killed (the girl) and entered into that (water).’ (84) Partial reduplication Khmer keh ‘scratch’ kə-keh ‘to scratch constantly’ Khmer baoh ‘sweep, brush’ bə-baoh ‘to brush or caress lightly and frequently’

Aslian languages have complex systems of reduplication, including full reduplication without alternation, full reduplication with vowel alternation, full reduplication with vowel loss, and partial reduplication. The following examples illustrate these processes in Jahai and Semelai. (85) Jahai (Burenhult 2005: 94f) cɨp ‘go’ cɨp-cɨp continuative

The Austroasiatic Languages: A Typological Overview cip-cɨp ca-cɨp cp-cɨp

45

durative recpiprocal imperfective

Semelai (Kruspe 2004: 110f) goŋ gŋ-goŋ

‘carry’ imperfective

In a number of cases a lexeme looks like a reduplicated form, usually with vowel alternation, but no basic form is attested in the language. This is the case for example in Mlabri, Kui, and Kơho. (86) No basic form attested Mlabri mic-mɛc Mlabri ɲ̊ im-ɲ̊ ɛm Kui leːp-laːp Kơho cɔk-cɛk

‘ant’ ‘eyelashes’ ‘butterfly’ ‘to gossip’

2.3 Derivation AA languages have a native system of derivational affixes, consisting mainly of prefixes and infixes. Only few cases of suffixes are attested. A few of the derivational affixes can be reconstructed to the proto-language with well defined functions. In a number of languages the old derivational processes have lost their productivity and in some cases new derivational affixes were introduced to the language. While Old Mon had a full set of derivational affixes, the derived forms, though surviving in modern Mon, have been merged in many cases, resulting in the opposition basic form—derivate, without distinguishing between different function. The process of underspecified derivation by means of a prefix hə- has retained some degree of productivity in spoken Mon. (87) Development of derivation in Mon Old Mon gloss process gloṅ ‘be many’ base gumloṅ ‘many’ attr girloṅ ‘quantity’ nml guloṅ ‘increase’ caus

Modern Mon klɒ̀ ɲ həlɒ̀ ɲ həlɒ̀ ɲ həlɒ̀ ɲ

In some AA languages, all derivational morphology has been lost and compounding is the only way to derive new words. Examples are Vietnamese, Chong, and Bunong. In these languages, verbs can be nominalized by adding a

46

Jenny, weber and weymuth

noun with a general meaning ‘matter, affair, fact’ or similar, as in the following examples. In the case of Chong, the nominalizing noun kʰw9aːm ‘matter, affair’ is a loan from Thai, where it serves a similar function. (88) Bunong rawe > nau-rawe ‘worry’ ‘matter-worry’ (89) Chong pʰoʔ > kʰw9aːm-pʰoʔ ‘dream’ ‘matter-dream’

2.3.1 Nominal derivation The range of nominal derivation by affixation is illustrated by Old Khmer. There are both prefixes and infixes, and they can be applied to nominal as well as verbal roots. (90) Old Khmer pN-, ’N- j9av ‘barter’ > pamj9av ‘bartered goods’ ruṅ ‘big, mature’ > ʼaṃruṅ ‘size, extent, area’ -n- pvas ‘enter holy orders’ > phnvas ‘holy orders’ sre ‘riceland’ > sanre ‘ricefield’ -m- pre ‘use’ > pamre ‘servant’ cer ‘transgress’ > cmer ‘transgressor’ -mn- jvan ‘offer’ > jaṃnvan ‘offering’ -p- car ‘plant in a row’ > c(h)par ‘flower garden, plot’

Modern Khmer has retained some of of the inherited affixes of Old Khmer, with a number of innovations. Not all affixes in modern Khmer are fully productive, and some are found only in fossilized forms. (91) Khmer k-: s-: m-: N-: -b-: -m-: -n-: -vmn-: bvN-:

ba ŋ pɪ̀ːən hoːp baos rɔːəm sòːm kɪ̀ːəp dam tùk

‘to screen, to shade/cover sth.’ > k-baŋ ‘screen, movable curtain’ ‘pass over, traverse’ > s-pɪ̀ːən ‘bridge’ ‘eat’ > m-hoːp ‘food’ ‘sweep’ > ʔɔm-baos ‘brush, n.’ ‘dance’ > rəbam ‘dance, n.’ ‘ask’ > smòːm ‘beggar (so. who asks)’ ‘squeeze, apply pincers’ > khnɪ̀ːəp ‘pincers’ ‘plant, v.’ > dɔmnam ‘plant, n.’ ‘put away, keep’ > bɔntùk ‘cargo, load’

The Austroasiatic Languages: A Typological Overview

47

kvN-: cas ‘old’ > kɔɲcas ‘old man [derogative]’ svN-: bɔ̀ ːk ‘peel, strip of bark or skin’ > sɔmbɔ̀ ːk ‘shell, husk, bark, skin’

In other languages, the number of retained derivational affixes is usually smaller and in many cases they have lost their productivity, so that the derived forms are to be considered lexical forms. A frequently found general nominalizing device is an -n- infix, in some cases with phonotactic change. The antiquity of this infix is shown by its occurrence in all branches of AA, including Nicobarese and Munda. (92) Pacoh katɨp tapaʔ taŋɨh kar

‘cork’ ‘make fish sauce’ ‘breathe’ ‘drill a hole’

(93) Sedang ciə ‘dig’ soə̰ŋ ‘divide’ pa̰n ‘raise’ (94) Kơho pat sɛ blɔ

həniə hənoə̰ŋ məna̰n

‘to knead, squeeze’ ‘to turn, detour’ ‘to wear in the ear’

kəntɨp təmpaʔ tərŋɨh kanaːr

‘cork, n.’ ‘fish sauce’ ‘breath’ ‘hole-driller’

‘shove-hoe’ ‘problem’ ‘domestic animals’ pənat sənɛ bənɔ

‘s.t. kneaded (clay, dough, etc.)’ ‘place where detour begins or ends’ ‘earring’

(95) Mlabri chrɛt ‘comb’

chnrɛt

‘com, n.’

(96) Mon ceh

‘descend’

hneh

‘pier’ ()

(97) Car ŋát fóh kɛ́ːl

‘tie’ ‘whip’ ‘arm’

ŋanát fanɔ́ h kanɛ́ːl

(98) Mundari (Osada 2008: 116) dub ‘sit’ dunub ol ‘write’ onol tebaʔ ‘arrive’ tenebaʔ

‘cord’ ‘whip, n.’ ‘tusk’

‘meeting’ ‘writing’ ‘arrival’

48

Jenny, weber and weymuth

Other infixes are also used to derive nouns from verbs, as seen in the following examples. (99) Kơho kal ‘bolt, bar’ saːc ‘fish by draining’ sɔːl ‘illuminate’ (100) Car tɨ́ːc ‘plant, v.’ séːɲ ‘mince’

kərnal ‘wooden door bolt’ səmpaːc ‘the place one stands to fish by draining’ səmpɔːl ‘torch’

tahɨ́ːc sahéːɲ

‘what is planted’ ‘something minced’

(101) Kammu tə́ːm ‘sing’ húːc ‘sting, v.’

tŕ.nə̀ːm hŕ.nùːc

‘song’ ‘sting (of an insect)’

(102) Mlabri tɛk tɯj kwac gwɛc

trnɛk trnɯj krwac grwɛc

‘hammer’ ‘mortar’ ‘broom’ ‘finger’

‘hit’ ‘pound’ ‘sweep’ ‘poke’

A number of languages use prefixes to derive nouns. One common prefix is a nominalizing ʔi-, associated in some languages with female personal names and kinship terms, as in Mon, or general with humans, as in Dara’ang. A similar prefix, ʔə- is found in many AA languages as general nominalizer, as in Danau. In Mon it is often associated with male kinship terms. (103) Mon ʔiʔnàj ʔənàj

‘aunt’ ‘uncle’

(104) Dara’ang Ɂikăt Ɂimaj Ɂiŋauh

‘older brother or older sister’ ‘male, husband’ ‘killer’

(105) Danau mɛt sòn

‘all’ ‘red’

ə-mɛt ə-sòn

‘all of it/them’ ‘the red one’

The Austroasiatic Languages: A Typological Overview

49

Other nominalizing prefixes are found in AA languages, some of which may be related to the infixes described above. This is the case for example in Kammu. Car uses a prefix ta-/tə-, together with changes in the stem and/or suffixes. (106) Kammu tèn ‘sit’ cét ‘sour’ (107) Car vúːj ŋɛ́ːʔ rɨ́ːla ʔamínuvə

rǹ.tèn rǹ.cét

‘chair’ ‘sourness’

tavúːj taŋɛ̃ː́ ʔ tərɨ́ːla ́ təʔamĩːnu

‘be hot’ ‘be hard’ ‘be crawling’ ‘possess’

‘sun’ ‘bone’ ‘animal, beast’ ‘owner’

Khasi and Pnar have a general nominalizer ʤiŋ, which is prefixed to verbs, besides more specific agentive and instrumental nominalizing prefixes. Another way to nominalize verbs is by attaching a noun class marker to them directly. The choice of noun class markers determines the exact function of the nominalized form. (108) Pnar i=ʤiŋ-ɛʔ n=nml-be.difficult ‘difficulty’

ka=krɛʔ f=work ‘the work (completed)’

i=krɛʔ n=work ‘the working (ongoing)’

In Dara’ang, a prefix da- is used to derive ethnic names. The origin of this prefix is unknown. It has apparently become a fixed part of the ethnonym Dara’ang. (109) Dara’ang dasiam dakhε daraɁaŋ

‘Tai people’ ‘Chinese people’ ‘Palaung people’

A set of prefixed nouns in Sedang can be compared to Vietnamese forms which include a generic class term cái. This may give a hint regarding the origin of at least some of the derived forms also in other languages. (110) Sedang and Vietnamese Sedang kəɓaŋ ‘table’

Vietnamese cái bàn

50

Jenny, weber and weymuth kəhop kəkeo̰ kəli kəmajʔ

‘can’ ‘bottle’ ‘glass’ ‘machine’

cái hộp cái kéo cái ly cái máy

Nicobarese and Munda languages are unlike other members of the family in having not only derivational prefixes and infixes, but also suffixes. In Car, suffixes can be combined with other affixes in nominal derivation, as the following examples illustrate. (111) Car ʔáːp cóːŋ rát lím tɔ́ ːp ɲáh

‘canoe’ ‘ship’ ‘slice’ ‘fold’ ‘grasp’ ‘beat’

héŋ miʔáːpə héŋ micóːŋə miráːtə milíːmə tɔ́ ːpə ɲáhã

‘one canoe plus contents’ ‘one ship plus contents’ ‘slice, n.’ ‘parcel’ ‘tongs’ ‘injury’

(112) Juang gata ‘say’

agatae

‘untold story’

(113) Remo re sil aŋ

regəta sita aŋto

‘wound’ ‘pain, sickness’ ‘taboo’

‘cut’ ‘sick’ ‘be taboo’

2.3.2 Verbal Derivation The most wide-spread type of verbal derivation in AA languages is the labial causative prefix, which appears to have reflexes in all branches, at least in fossilized lexical forms. The realization of the prefix varies greatly among the languages, with three main underlying forms, namely pə-, pN-, and mə-. The connection between the different forms and their distribution patterns are not always clear. (114) Causative (labial prefix) Old Khmer tval ‘fall’ > phtval ‘fell, bring down, overthrow, topple’ Old Khmer ket ‘born’ > paṅketa ‘give birth, beget, topple’ Khmer kaət ‘born, arise, happen’ > prəkaət ‘cause, bring about’ Khmer dac ‘break, be torn apart’ > phdac ‘break, separate, tr.’

The Austroasiatic Languages: A Typological Overview Chong Sedang Sedang Sedang Sedang Pacoh Kui Kơho Kơho Kơho Mon Mon Nyahkur Kammu Kammu Mlabri Samlong Khasi Khasi Khasi Pnar Car Car Gorum Kharia

hoːc ta̰ caj loj rɛh sər boːl sɔŋ kap haːp tɛt ceh tun màh ràːs bɯl jʌm jap baːm laʲt jap huɽíŋ ɽáːc gidʒu ɲoʔ

51

‘die’ > mahoːc ‘kill’ ‘go around’ > məda̰ ‘cause to go around’ ‘sick’ > məɟaj ‘cause to be sick’ ‘abandon’ > pəloj ‘cause to abandon’ ‘alive’ > pərɛh/mərɛh ‘nourish’ ‘go up’ > pasər ‘raise’ ‘drunk’ > pəmboːl ‘intoxicate’ ‘straight’ > bəsɔŋ ‘straighten’ ‘bite’ > bəkap ‘cause to bite’ ‘envious’ > pənhaːp ‘beautify’ ‘exit’ > pətɛt ‘take out’ ‘descend’ > phjeh ‘take down’ ‘rise’ > pətun ‘arm a spring-trap’ ‘eat’ > pń.màh ‘feed’ ‘dry’ > pś.ràːs ‘dry’ ‘die’ > pabɯl ‘kill’ ‘die’ > pjʌm ‘kill’ ‘die’ > pɨnjap ‘kill’ ‘eat’ > pɨnbaːm ‘feed’ ‘escape’ > pɨllaʲt ‘set free’ ‘die’ > pn̩ -jap ‘kill’ ‘black’ > miɽíŋ ‘make black’ ‘hot’ > haɽáːc ‘to heat’ ‘see’ > abgidʒu ‘show’ ‘eat’ > oˀbɲoʔ ‘feed’

Khasi allows recursive causativization, as in the following example. (115) Khasi u-san u pʰaʔ-pɨn-jap ja u=ʤɔn ha u=bil m-pn m caus-caus-die acc m=pn dat m=pn ‘San made Bill kill John.’

Other prefixes with causative function also occur in different languages. (116) Bunong (Vogel 2006: 173) (a) kə̤p pi̤c (b) kə̤p rpi̤c kɔːn ci̤ː 1sg lie 1sg lay child ill ‘I lie down/sleep.’ ‘I put my sick child to bed.’

52

Jenny, weber and weymuth

(117) Semelai (Kruspe 2004: 128f) kɒh ‘come undone’ > tar-kɒh ‘take off’ pər ‘fly’ > taŋ-pər ‘get sth. to fly’’ (118) Kơho ʔaɲ tən-git gə kʰaj ʔin 1sg caus-know 3 3sg dat ‘I cause (or make) him to know it.’ (119) Bunong khət > ŋ-khət ‘die’ ‘kill’ (120) Nyahkur (Diffloth 1984) tho̤w > kətho̤w ‘hurry away’ ‘take away’

Another causative affix consists of a labial or vocalic infix. The former is found in Old and modern Khmer and Kui, the latter in Old Mon. (121) Old Khmer sl9ap ‘die’ cren ‘grow’

> >

saṃl9ap caṃren

‘cause to die’ ‘cause to grow’

(122) Khmer sʔaːt ‘clean’ > sɔmʔaːt

‘clean, tr.’

(123) Kui təkɒw təliəj

‘cause to sit’ ‘break’

‘sit’ > təmkɒw ‘broken’ > təmliəj

(124) Old Mon gloṅ ‘many, much’

>

guloṅ

‘increase’

In Pacoh, an anti-causative prefix is used with some verbs. (125) Pacoh hɛʔ

‘tear’

>

tihɛʔ

‘torn’

The Austroasiatic Languages: A Typological Overview

53

In some cases Car uses vowel alterations in intransitive-transitive pairs. In this case it is not evident which form is to be taken as basic and which as derivate. (126) Car non-agentive agentive gloss intransitive transitive cavíː cuvíː ‘rotate, circle, turn around’ falál filál ‘break’ kaín kijín ‘shake’

Other verbal derivational affixes are used to express reciprocity, adversity, nonvolitionality, and others. The distribution of these affixes and the wide range of functions associated with similar forms in different languages do not allow a reconstruction to the proto-language of specific forms, though the system as such may be postulated to have been in existence already in the ancestor language. (127) Pacoh (reciprocal) ʔacuən ‘hit’ >

ʔrcuən

‘hit each other’

(128) Sedang (reciprocal) ma̰ tə-cuə̰ dej pɔ̰ 1du recp-obey with each.other ‘We two obey each other.’ (129) Sedang (reciprocal) prḛj tə-weə̰ŋ dej pɔ̰ 3du recp-help with each.other ‘Those two help each other.’ (130) Sedang (reciprocal) mɛ prḛj tə-l ̥ɔ dej pɔ̰ then 3du recp-see with each.other ‘Then those two saw each other.’ (131) Dara’ang (reciprocal) katuh ‘bump against’ kamuan ‘talk together’

katun kathăɁ

‘meet’ ‘clap with hands’

54

Jenny, weber and weymuth

(132) Pacoh (non-volitional) pḛɲ ‘shoot’ > tapḛɲ

‘shoot accidentally’

(133) Sedang (adversative) poə ʔɛh lə-how grandfather 2sg advrs-hate ‘Grandfather, you hate us two.’

ma̰ 1du.excl

(134) Sedang ʔa̰ saw ʔɛh lə-kəde ʔa̰ 1sg fear 2sg advrs-kill 1sg ‘I’m afraid you will kill me’

In Danau verbs can be gerundivized with the prefic pə- to form a non-finite verbal construction that can take direct objects. (135) Danau tə-kɨ̂ pin tʰæ̀ x bēj nì=nə pə-jɔx ɐ̀ kùtʰokæ̃ ɛ̆ cond-go kill life other prox=top nfin-do sin ‘If (you) go and kill another, it is the same as committing sin.’

ɐnɐnɐj same

2.3.3 Other Types of Derivation The prefix hə- forms locative adverbials in Kơho, similar to the prefix pain Chong and na- in Kammu. (136) Kơho gah nɛ ɗap ɗaŋ ɗəm (137) Chong dɨŋ heːn tʰɛ̤ːw nih kə̰ːt cʰaː

‘side’ > hə-gah ‘on the side/edge’ ‘that’ (equidistant from the object) > hə-nɛ ‘over there’ ‘the portion of land in front of’ > hə-ɗap ‘in front of’ ‘up, above, top’ > hə-ɗaŋ ‘on top of’ ‘below, under(neath)’ > hə-ɗəm ‘underneath’

‘on’ > padɨŋ ‘that’ > paheːn ‘other’ > patʰɛ̤ːw ‘where’ > panih ‘low’ > pakə̰ːt ‘to eat’ > pacʰaː

‘above’ ‘there’ ‘elsewhere’ ‘where’ ‘below’ ‘a place for food’

The Austroasiatic Languages: A Typological Overview

55

(138) Kammu locative prefix na- signifies a location. na-Ɂac ‘before, in front of’ na-påan ‘behind’ na-khεŋ ‘inside, in the back’ na-rec ‘outside’

In Mlabri the marker luŋ/nuŋ is also used in combination with the demonstratives to form luŋgʌh ‘here’ and luŋɲʌʔ ‘there’. The reason why this compound is analyzed as one word, rather than as a phrase, is that the locative marker luŋ seems to undergo the same vowel assimilation that is found in minor syllables. (139) Mlabri mɛh chi ɗaj ŋ̊ uh luŋ-luah lɛh 2sg want get sit loc-outside q ‘Do you want to go sit outside?’ (140) Mlabri leh lɯŋ-kal come loc-front ‘[He] came before [the others]’ or ‘[He will] come later’

Kơho forms interrogatives by prefixing a nasal consonant to demonstratives and nouns. The nasal has the same place of articulation as the initial of the stem. (141) Kơho bɛ bəh *ɗɑː teːŋ caw cʰi

‘that; as’ ‘from’ (origin, source) [no base form] ‘place, direction’ ‘person’ ‘(any-, some-) thing’

> > > > > >

mbɛ mbəh nɗɑː nteːŋ ɲcaw ɲcʰi

‘how’ ‘(from) where; what source’ ‘how much/many’ ‘where’ ‘who, which’ ‘what’

In Mon, different prefixes are attached to the demonstrative and interrogative stems to derive pronouns and adverbs. While the pronominal and locative prefixes originate in old nominal prefixes, the manner adverbial prefix is shortened from the noun sac ‘kind, type, color’, which still appears in this form in literary Mon. The prefix ʔiʔ- in colloquial Mon merges with the initial of the stem and causes palatalization, possibly by a process of metathesis which results in the prefixes becoming an infix (ʔiʔnɔʔ >*niɔ > ɲɔʔ) (see Jenny 2003).

56 (142) Mon Base Pronoun

Jenny, weber and weymuth

nɔʔ ‘this’ ʔiʔ-nɔʔ ~ ɲɔʔ ‘this one’ ʔə-nɔʔ ‘here’

Adverb 1 locative Adverb 2 hə-nɔʔ ‘like this’ manner

kɔ̀ h ‘that’ ʔiʔ-kɔ̀ h ~ cɔ̀ h ‘that one’ ʔə-kɔ̀ h ‘there’

tɤʔ ‘that (over there)’ lɒ ‘which’ ʔiʔ-tɤʔ ~ cɤʔ ʔiʔ-lɒ ~yɒ ‘that one’ ‘which one’ ʔə-tɤʔ ‘there’ ʔə-lɒ ‘where’

hə-kɔ̀ h ‘like that’

(hə-tɤʔ)5

hə-lɒ ‘how’

Sedang has two prefixes that attach to numerals, resulting in different derivative meanings. The prefix tə- forms measure words, the prefix mə- ordinal 5 numerals. (143) Sedang peə̰ ‘two’ > pa̰j ‘three’ > pṵn ‘four’ >

təbeə̰ ‘two fingers’ width’ təpa̰j ‘three fingers’ width’ təpṵn ‘four fingers’ width’

(144) Sedang məmoj ‘first’ məpeə̰ ‘second’ məpa̰j ‘third’ məpṵn ‘fourth’

A complex and heterogeneous system of derivation is found in Bugan, where a change of vowel, tone or consonant results in a new, but related meaning. (145) Bugan vowel/tone/consonant alternation derivation ɕi⁵⁵po³¹ ‘yesterday evening’ ɕa⁵⁵po³¹ ‘tomorrow evening’ ŋɔ³³ ‘peppery’ ŋa³³ ‘salted’ ʦa̠¹³ ‘to bite’ ʦo̱ u̱³¹ ‘to eat’ la̠³⁵ ‘to return’ li̱³⁵ ‘to change’ ʦa̠n²⁴ ‘to smell’ ʦïan³³ ‘smelly’ kʰo⁴⁴ ‘ladle’ kʰo²⁴ ‘to scoop’ mʦʰa¹³ ‘to kill’ mʦa³¹ ‘to die’ nap⁵⁵ ‘be closed’ ȵap⁵⁵ ‘to close one’s eyes, to shut up’

5 The form hətɤʔ is not in common use. A periphrastic construction like pɤ̀m tɤʔ ‘that manner’ is used instead.

The Austroasiatic Languages: A Typological Overview

3

57

Clause Structure

3.1 Simple Clauses 3.1.1 Intransitive Clauses Intransitive clauses in most AA languages exhibit the basic word order SV. The single argument (‘subject’) of the clause precedes the verb, which may or may not be accompanied by tense-aspect-modality markers, directionals, or other modifiers. This word order is already attested in the earliest documents of AA, namely Old Mon and Old Khmer. (146) Old Khmer ’ry9aṃ l9a. elder.sibling depart ‘[My] older brother took his leave.’

In Khasi, the single argument is cross-referenced in the verb phrase by the gender marker, which precedes the verbs and tam markers. (147) Khasi u=briew u la wan. m=man m pst come ‘The man came.’

In some languages, the normal order of constituents may be inverted, as in Mon, where the orders SV and VS are both possible with a small number of verbs, mostly existential or presentational verbs. The two word orders have different information-structural functions, as seen in the following examples. The different readings are in line with the general topic-comment structure of the language (see also section 3.6). (148) Mon pɤŋ seh mɔ̀ ŋ ɲìʔ thɔ̀ raʔ cooked.rice remain stay little only foc ‘There’s a little bit of rice left.’ (149) Mon seh mɔ̀ ŋ chaʔ pɤŋ, hwaʔ ʔɒt ʔa jaʔ left stay excl cooked.rice curry all go nsit ‘There’s only rice left, the curry is gone.’

58

Jenny, weber and weymuth

Also in Dara’ang, the subject of an intransitive expression can occur after the verb. The exact function of this inversion in Dara’ang is not clear, but there are presumably information-structural factors. (150) Dara’ang Ɂăw jic maso. neg get.up dog ‘The dog didn’t get up.’

In intransitive clauses in Aslian languages, the subject may either precede or follow the predicate, as seen in example (151) from Semelai and (152) from Semaq Beri. (151) Semelai (Kruspe 2004: 255) ga=dmdəm ʔəɲ, bhih ɲɔʔ lpəc. imm=lie.down 1f be.full very stomach ‘I’m going to lie down, (my) stomach is really full.’ (152) Semaq Beri kɛ jlaŋ. 3sg be.long ‘It is long.’

In Munda languages, such as Santali, the order is SV if the S argument is overtly expressed by a full NP or by a pronoun. (153) Santali (Ghosh 2008: 55) khan-gɛ paʈoari dɔ-e then-foc pn top-3sg.sbj ‘The Patoari ran away.’

dəɽ-ked-a. run-pst.act-fin

If the single argument is omitted, it is marked on the verb by a suffix which appears in the final slot of the verb complex, after the finite marker. The order of elements within the intransitive verb form is therefore V-S. (154) Santali (Ghosh 2008: 61) sɛn-akan-a-e. go-perf.mid-fin-3sg.sbj ‘He has gone.’

The Austroasiatic Languages: A Typological Overview

59

A handful of languages are exceptional in the AA family in having verb initial basic word order. These include Pnar, Amwi and War (but not the closely related Khasi), Wa (but not other Palaungic languages), and the Nicobarese languages. (155) Car lə́ːk an ŋam máj calm 3sg the sea ‘The sea is calm.’ (156) Pnar jap u. die m ‘He died.’ (157) Pnar jap u=wɔʔ kiaŋ.naŋ.baʔ die m=hon pn ‘Mr. Kiang Nangbah died.’

3.1.2 Transitive Clauses The most frequently encountered word order in transitive expressions in AA languages is AVP, that is, the agent precedes the main verb, the patient follows it. This is also the normal word order in the Old Mon and Old Khmer inscriptions, and it is found throughout the eastern (non-Munda, non-Nicobarese) branches of AA, including standard Khasi, which coreferences the A in the verb phrase by the appropriate gender marker. (158) Old Khmer paṃnvas cya sl9a monks eat areca ‘The monks were chewing areca nut.’ (159) Khmer (Jacob 1968: 59) mdaːj thvɤ̀ ː mhoːp. mother make food ‘Mother is making the meal.’

60

Jenny, weber and weymuth

(160) Sedang ga̰ w̥ aŋ ga̰ tuŋ trap 3sg throw 3sg inside mud ‘He threw him in the mud’ (161) Kơho kʰaj ləh hiw pa 3sg build house new ‘He built a new house.’ (162) Bugan pə⁵⁵kan³³ ȵu³³mʦaŋ³⁵ kai³³ ʔo⁵⁵ biou³³ bugan celebrate festival mountain top ‘The Bugan celebrate their festivals on top of the mountain.’ (163) Bunong (Vogel 2006: 110) paŋ kɛ̤h rəwɛh 3sg have elephant ‘He has (an) elephant(s).’ (164) Danau ō swe bɯ̄ 1sg eat rice ‘I am eating (rice).’ (165) Kammu róːj l̀.wàːŋ kóh cŕ.kùːl.tíʔ kə̀ spirit sky cut finger 3sg.m ‘The sky spirit cut off his fingers.’ (166) Mlabri ʔoh kɔbɔ boŋ cin 1sg neg eat meat ‘I don’t eat meat’ (167) Mon kalaʔ kɔ̀ h mìʔ.kon.plɛm kɔ̀ h pətɛt kəsɔp raʔ. time medl pn medl caus.exit nml.think.over foc ‘Then Mi Kon Plem made her plans.’

The Austroasiatic Languages: A Typological Overview

61

(168) Khasi u=ksew u baːm doʔ m=dog m eat flesh ‘The dog eats flesh.’

Deviations from the standard constituent order are frequent for pragmatic reasons (see section 3.6), as is the omission of contextually retrievable arguments and other constituents. The Munda languages exhibit basic verb final word order if the arguments are overtly expressed by full NPs or pronouns. In the following example from Santali, the A argument is expressed by a free pronoun as well as a clitic, and the object is expressed by a full NP and cross-referenced on the verb by a suffix. The constituent order in this case can be taken as AVP, with originally extra-clausal NPs expressing A and P: [A P] AVP. The fact that the finite marker appears after the P suffix suggests that it was originally a free form and the P formed a closer unit with the verb. (169) Santali (Ghosh 2008: 75) am ɖaŋra=m əgu-e-a. 2sg bullock=2sg.sbj bring-3sg.obj-fin ‘You will bring the bullock.’

If both arguments are omitted, they are indicated by affixes on the verb. The order in this case is VPA, and the finite marker occurs between the two argument suffixes. (170) Santali (Ghosh 2008: 61) dal-akad-e-a-m. beat-perf.act-3sg.obj-fin-2sg.sbj ‘You have beaten him.’

If we take AVP as the underlying original word order in Santali, the position of the agent after the finite marker can be explained as anti-topic, which is frequent in the languages of he area, including verb-final languages. Alternatively, we might take VPA as the original word order, as suggested by the pronominalized verb forms in Santali. This is found as the basic constituent order in the Khasian languages Kudeng War and Amwi, as well as in Nicobarese, as in the following examples.

62

Jenny, weber and weymuth

(171) Kudeng War (Anne Daladier, p.c.) ə əri viɛr ʔi hun kə. decl leave lose pl child f ‘She left her children for ever.’ (172) Amwi (Weidert 1975: 214) jaʔlə beʔ škɛ ʔi hnthlɛ cuprəw ʔi. go.together hunt deer 1pl seven person 1pl ‘We went deer hunting together, seven of us.’ (173) Nancowry (Reid 1994: 333) kalóʔ nɔ́ t cə́ -n steal pig my-nom ‘The thief stole my pig.’

kamalóʔ. thief

In Wa, unlike other Palaungic languages, the basic constituent order appears to be VAP, though alternative orders are also attested, as seen in the following examples from Tangyan Wa (Shan State). (174) Wa (field notes MJ) sɔm ʔɤʔ ʔɯp. eat 1sg rice ‘I eat rice.’ (175) Wa (field notes MJ) ʔaŋ ʔɤʔ seʔ sɔm ʔɯp. neg 1sg tam eat rice ‘I don’t eat rice.’

Semelai, an Aslian language of the Malay peninsula, shows different word orders depending on the context (Kruspe 2004: 253ff). In what Kruspe (2004: 254) calls “universal clauses”, the verb occupies the position between A and P, as seen in example (176). (176) Semelai (Kruspe 2004: 254) pɔdɔŋ ca smaʔ. tiger eat person ‘Tigers eat people.’

The Austroasiatic Languages: A Typological Overview

63

In perfective clauses, the verbal predicate occurs in the clause-initial position, as in example (177). Notice though that a pronominal agent proclitic occurs before the predicate, so that one may still argue that the underlying structure is AVP, with the A resumptively expressed in postverbal position. (177) Semelai (Kruspe 2004: 255) ki=bukɒʔ la=kniek hn=pintuʔ. 3a=open a=husband p=door ‘The husband opened the door.’

The A argument can be fronted to the preverbal position, as seen in example (178). (178) Semelai (Kruspe 2004: 257) smaʔ, ki=tampɔŋ cɔ. person 3a=hit dog ‘The person, he hit the dog.’

3.1.3 Ditransitive Clauses Ditransitive clauses contain three arguments, agent, theme, and goal or recipient. The order of these arguments with respect to the verb and to each other differs among the AA languages. Like with transitive clauses, the most frequent word order is apparently AVGT or AVTG. In many AA languages there is only a handful of real ditransitive verbs, in some cases with ‘give’ as only member of the category. The constituent order AVGT is found, for example, in Vietnamese, Kui, Sedang, and Mon. In these cases no overt marker of grammatical role is found and the word order and semantics alone determine the function of the arguments. (179) Vietnamese Hương cho họ.hàng thuốc. pn give relatives drug ‘Hương gives relatives drugs.’ (180) Kui kruː.pɛ̤ːt ʔɒːn beːʔ ʔiː krən.caɲ muŋ doctor give person ill malaria mosquito.net ‘The doctor is giving the malaria patients mosquito nets.’

64

Jenny, weber and weymuth

(181) Sedang ŋin ʔa̰m ʔɛh pəkeə̰ŋ 1pl.excl put 2sg powder ‘We give you the powder.’ (182) Sedang pin ɟa ʔɛh troə̰ŋ həda 1pl.incl show 2sg path escape ‘We will show you the escape route.’ (183) Mon ʔəmè kɒ kon kwaɲ rɤ̀h ŋuə. mother give offspring sweets every day ‘The mother gives her child sweets everyday.’

In Khmer, Chong and Mlabri, the order is AVTG, with no overt marker of G or T. (184) Khmer khɲom ʔaoj luj kòət. 1sg give money 3sg ‘I give him money.’ (185) Chong ʔuːɲ ʔɨt kataŋ boːt father give money younger.sibling ‘Father gives money to my younger brother.’ (186) Mlabri maʔ khanom mɛh give cookie 2sg ‘give the cookie to you’

Old Khmer and a number of other AA languages mark the recipient as oblique by means of a dative or other preposition. The word order in Old Khmer, Kammu, Dara’ang, Khasi, and Semaq Beri is AVTG.

The Austroasiatic Languages: A Typological Overview

65

(187) Old Khmer . . . kamrateṅ kaṃtvan ’añ viṅ oy pras9ada bhūmi srat9ac . . . high.lord maternal 1sg again give favour land pn ṛdval ta loñ vasudeva pn obl title pn ‘. . . My High Lord in the maternal line again give the tracts in Sratāc [and] Ṛdval as a royal grant to the loñ Vāsudeva. . .’ (188) Dara’ang Ɂo taih khamun dɨ Ɂalɔt 1sg give sweet goal pn ‘I gave Alot some sweets.’ (189) Kammu ʔòʔ ʔùːn k.múːl jʌ̀ kə̀ 1sg give money ins 3sg.m ‘I handed money to him.’ (190) Khasi ŋi-n aːj ja ka=kot ha pʰi 1pl-fut give acc f=book dat 2 ‘We will give the book to you.’ (191) Semaq Beri kɛ jon lapan ratus haʔ ʔəɲ 3pl give eight hundred loc 1sg ‘They gave me 800 (ringgit).’

In Kơho, the recipient is marked by a dative postposition and occurs between V and T. (192) Kơho kʰaj ʔaj ʔaɲ ʔin do məŋan dɑːʔ 3sg give 1sg dat one bowl water ‘S/he gave me a bowl of water.’

Pnar has the basic constituent order VATG, with the recipient marked as oblique and the T left unmarked.

66

Jenny, weber and weymuth

(193) Pnar e kɔ ka=hu.kum ja o give 3sg.f f=command ben 3sg.m.top ‘She gave the command (to him).’

In many languages ditransitive expressions with overt G and T are usually avoided, omitting or fronting either G or T, as seen in the following example from Danau. (194) Danau nì=nǝ ō ɐ kɔʔ mǝ̀ prox=top 1sg give neg.cop 2sg ‘I won’t give this to you.’ (lit. ‘This, I won’t give to you.’)

Another strategy is to use biclausal constructions, as illustrated by the following example from Kammu. (195) Kammu ʔòʔ cə̀ ʔùːn mè 1sg irr give 2sg.m ‘I will give you some rice seeds.’

tèʔ ŋɔ́ ʔ sḿ.làh get rice seed

3.2 Subordinate Clauses Subordinate clauses are clauses that cannot form a complete sentence by themselves and have one of a number of functions. Traditionally, subordinate clauses are divided into relative or attributive clauses, complement clauses, and adverbial clauses. The latter category is very diverse, both cross-linguistically and often also within a particular language; furthermore, it cannot be seen as a uniform class apart from the general semantics of ‘event modification or specification’. In many AA languages, subordinate clauses cannot readily be distinguished from coordinate or juxtaposed independent clauses, as often no overt marker is used and the function of the clause is determined by the context alone. This underspecification in clause types may be due to the fact that the majority of AA languages do not have literary traditions, and orally transmitted languages tend towards simpler clauses and juxtaposition (see e.g. Karlsson 2009). The subordinators most commonly used in the AA languages can be classified into three groups, namely, 1. grammatical function words, which often cover a wide range of functions, including prepositions, 2. partly desemanticized verbs, and 3. desemanticized nouns. Desemanticized

The Austroasiatic Languages: A Typological Overview

67

verbs, usually with the lexical meaning ‘say’ or similar, are frequently used as complementizers, while verbs or nouns with a wide range of lexical meanings are used to introduce adverbial clauses. In Car, the word order in subordinate clauses differs from matrix clauses in that matrix clauses are generally verb initial, while subordinate clauses are verb medial and the S/A pronoun takes a special subordinate form, as illustrated in the following examples. (196) Car (a) Independent clause: (b) Dependent clause: ləːk an ŋam maj heː ŋam maj nə calm 3sg def sea time def sea 3sg.sub ‘The sea is calm.’ ‘when the sea is calm’

leːk calm

(197) Car vɛ́ːɲə naŋ cɔːn tə cu min ŋac umuh tell ear pn link 1sg.poss fut finish there ‘Tell John that I will be there.’

3.2.1 Relative Clauses Relative clauses in AA languages in most cases follow the head noun they modify, in line with the usual position of postposed modifiers in general. The relative clause may be introduced by a relativizer, or they may be formally unmarked. In Bunong, the demonstrative determiner iː also functions as relativizer, as seen in (198). (198) Bunong kə̤p rəwat rawɛːh iː cʰa reːb ta nɛʔ 1sg buy elepant rel eat herb loc there ‘I bought the elephant that is eating grass there.’

Khasi and Pnar use a relativizer together with the appropriate noun class marker to introduce postnominal relative clauses. The noun class marker agrees with the head noun. (199) Khasi ka=briew ka ba tʰoʔ siti f=person f rel write letter ‘the woman who writes a letter. . .’

68

Jenny, weber and weymuth

(200) Pnar man kɔ ka=kn̩ .ten ka wa jar bha. be 3sg.f f=word f rel wide very ‘It is a very wide/encompassing word.’

Khmer, Kammu, Sedang, and other languages use a relativizer to introduce relative clauses. The relativized function of the head noun covers different ranges in different languages, including possessive and oblique roles in Khmer and Kammu. A relativizer is also used in Kui relative clauses with the referentiality marker as a dummy nominal head. (201) Khmer mənùh dael khɲom nɪ̀jɪ̀ːəj human rel 1sg talk ‘the man about whom I talk’ (202) Kammu c.mə̀ʔ kə̀m kə̀ túk púas k.nìʔ rope rel 3sg.m tie deer medl ‘the string with which he had tied the deer’ (203) Kammu kòn kə̀m róːj.ʔèːs túk hŕ.màːl k.nàːj person rel area.spirit tie soul dist ‘the person whose soul the area spirits have tied’ (204) Sedang tədroə̰ŋ ki ʔa̰ ˀnaj thing attr 1sg know ‘the things I know’ (205) Kui ʔaː ləm kuəj bɒn srɒn mənia~mənia ref rel person beg ask how.many~red ‘That which the people had begged [during the past year].’

The relative marker in Danau appears in the immediate preverbal position, and the relative clause precedes the modified head noun. This non-typical position of both the whole relative clause and the relative marker could be due to Burmese influence, which has the same syntactic structure.

The Austroasiatic Languages: A Typological Overview

69

(206) Danau kɐntʰɯ̄ =nə mə̀ pəʔ swe tū nɐ̄ ɔ=nə tū ɲɯŋ jɪn yesterday=top 2sg rel eat curry that=top curry meat chicken ‘The curry you ate yesterday was chicken curry.’

Headless relative clauses functioning as nominals exhibit the same structure as headed relative expressions. The relativizer may be omitted, as in example (207). (207) Danau ɐn tə̄ŋ tʰə̀ mɛʔ nɐ̄ ɔ=nəʔ kɐt soʊʔ kədɯ 3sg prog sit obl dist=top under tree pine ‘Where he is sitting is under a pine tree.’

Old Mon uses a relativizer ~ to introduce attributive clauses. Similar to Danau, Middle Mon places the relativizer directly before the verb, but in Mon the whole relative clause follows the modified head noun. This structure is found also in literary Mon, while the relativizer has been all but lost in most varieties of colloquial Mon. The following examples illustrate the development of relative clauses in Mon. (208) Old Mon (Shorto 1971: 123) mirmas guṇ ma smiṅ ʔiñcim jirku. remember favour rel king sustain body ‘He remembered the favors with which the king had sustained him.’ (209) Middle Mon (Shorto 1971: 282) gal9an dewat9au ma huim word god rel speak ‘the words that the gods spoke’ (210) Literary Mon həlàn ɲèh mə=hɒm kɔ̀ h word person rel=speak medl ‘the words that he spoke’ (211) Modern Mon ʔərè ɗɛh hɒm language 3 speak ‘the words he spoke’

70

Jenny, weber and weymuth

In Mlabri, relative clauses, like other subordinate clauses, are adjoined to their head noun without being formally marked. Like other subordinate clauses, relative expressions are not frequently used in the language and examples are hard to find in the texts. (212) Mlabri [β] (Rischel 1995: 190) khep mɤm maʔ ʔoh ʔa ɲon, ʔa tac slippers father give 1sg nsit neg.exist nsit break ‘The slippers father gave me are used up, they’re broken.’

Also other AA languages put the attributive clause after the head noun without using overt relative markers. (213) Bugan ʔɔ³¹ ʑaɯ³¹ bɔ⁵⁵ mə³³ bɔ⁵⁵ mda³³ te³¹ qou³³ nan⁵⁵~nan⁵⁵ 1sg see star one clf shine very bright idioph~red ‘I saw a star (which was) shining bright.’ (214) Chong nēːm ʔuːt ʔuːɲ kat.praʔoː pēn neːm̤ .ŋɔ̄ ʔ tree father cut yesterday be rambutan.tree ‘The tree that father cut down yesterday is a rambutan tree.’ (215) Kơho baːr nəm sraʔ təj ʔaɲ kʰaj bləj ŋaj ʔɔr dɔ two clf book French 1sg 3sg buy day previous prox ‘these two French books of mine he bought yesterday’ (216) Dara’ang Ɂo nap Ɂikăt tuc dɨ dɔj 1sg know elder sit goal there ‘I know the elder (who) sits there.’ (217) Kui kəː pə̤ːʔ cuəh tʰaːw bɛc kuə lɜŋ pə̤ːŋ brɒw 1sg see tiger lie sleep be.at in cave mountain ‘I saw a tiger sleeping in a mountain cave.’

In some languages, the relative clause is marked not by a clause initial relativizer, but rather by a clause final referential marker, indicating that the clause is

The Austroasiatic Languages: A Typological Overview

71

to be understood as non-predicative expression. One case in point is Mon, as seen in example (218). (218) Mon kɤ̀ʔ tɛ̀h ɓɛ̀ʔ kon ŋèə həkaoʔ klàj kɔ̀ h. get hit ref offspring frog body seek medl ‘He got the little frog he was looking for (by coincidence).’ (NM_frog01)

Old Khmer and Vietnamese use a general linker or subordinator to introduce relative clauses. (219) Old Khmer saṅ khñuṃ jmaḥ khdic ’añ jvan ta vraḥ śivaliṅga make slave name pn 1sg offer link holy śivaliṅga ‘[He] made [me] the pious gift of a slave named Khdic, whom I offered up to the holy śivaliṅga.’ (220) Vietnamese Nhà (mà) Ø bán giò.chả ở đa�u đường giàu. house comp sell k.o.sausage reside head street rich ‘The family that sells sausage at the beginning of the street is rich.’

3.2.2 Complement Clauses Complement clauses most frequently occur as clausal arguments of verbs of saying, perception, and cognition. In many cases, perhaps in the majority of instances, complement clauses are unmarked and directly adjoined to the matrix clause. This is also the case when more than one complement clause occurs in a sentence, as in the double embedded construction in the Sedang example (222). Unmarked complement clauses can easily be found in most branches of AA, as illustrated by the following examples. (221) Vietnamese Nhiều người Hà.Nội gốc tha�y là mình tinh.tế hơn many person Hanoi root see cop self sophisticated more người khác. person different ‘Many old-time Hanoians think that they are more sophisticated than other people.’

72

Jenny, weber and weymuth

(222) Sedang ʔa̰ taŋ waj to̰ j kla̰ hiə̰ŋ ka co̰ j 1sg hear 3sg tell tiger already bite deer ‘I hear (that) they say (that) a tiger has killed a deer.’ (223) Kui kəː kət paːj daːʔ.kʰiəl haj sɔt me̤ːn.te̤ːn 1sg think say honey 1 pure intns ‘I think that my honey is pure indeed.’ (224) Pnar jo-luti ɔ bam kɔ see-way 1sg eat 3sg.f ‘I hope she eats.’

In Semaq Beri, the complement clause is not marked by a subordinator and usually follows the matrix clause, though it may precede it. (225) Semaq Beri ʔəɲ btʰɔŋ swak~swak gaʔ bri 1sg be.afraid walk~cont goal forest ‘I am afraid to keep walking in the forest.’ (226) Semaq Beri kɛ, kɨŋ kɛ kʰɛʔ  3sg make.shelter 3sg know ‘Her, she knows (how) to make a shelter.’

Also when embedding questions, both polar and content, as in the following Bunong, Chong and Mlabri examples, no complementizer is necessary but the question is apparent by the presence of the question word. (227) Bunong (Bequette 2008: 79) paŋ iriː oːp cə̤t mɔːh naw maj 3sg FOC ask more what matter 2sg.m.poss ‘He (the bull owner) asked further, “What is your problem?”’ (228) Chong wa̤ːj tʰ9aːm wə̤t cʰaː panih tiger ask search eat where ‘A tiger asked “where will you find the food?”’

The Austroasiatic Languages: A Typological Overview

73

(229) Mlabri mɔj sop leh ɗaj lɛh 3sg ask come can q ‘He asks if he can come.’

An overt complementizer may be used optionally in Bunong and Chong, though, as illustrated by the following example. (230) Bunong (Bequette 2008: 50) kə̤p ntoːn lah ntro̤ j maj ŋan 1sg suspect comp fiancée 2sg.m.poss truly ‘I suspected that she was really your fiancée.’ (231) Chong pʰɨ̄ən tʰ9aːm lok ʔiːn pih wə̄j friend ask sub exist what disc ‘Friends ask what it happened.’

Car uses a general linker, which also serves other functions, to introduce complement clauses. (232) Car vɛ́ːɲə náŋ cɔ́ ːn tə́ cu min ŋac umúh tell ear John link 1sg.poss fut finish loc.dist ‘Tell John that I will be there.’ (233) Car kɔ́ ːʔtəkúːʔ cáʔa, tə́ ɔ nə kucíːkə ɽɔ́ ŋ find 3du.pl link 3sg.anim 3 tasty fruit ‘They found that it’s fruit was tasty.’

Khmer, Kui, Kammu, Dara’ang and Mon, among others, use a grammaticalized verb ‘say’ to introduce complement clauses, besides the possibility of asyndetically joining them to the matrix. (234) Khmer (Jacob 1968: 100) krùː prap khɲom thaː kmeːŋ nùh prəlɔːŋ cɔːəp. teacher tell 1sg say boy dist sit.for.examination pass ‘The teacher told me that the boy will pass the examination.’

74

Jenny, weber and weymuth

(235) Kui pɛːn suər paːj naːw mɜʔ haːn mṳːt baːŋ ʔɛŋ ʔəː Pen tell say 3 neg dare enter stream ana.dist emph ‘Pen says he doesn’t dare to enter that stream.’ (236) Kammu km̀ .múʔ tŕ.kə̀t sáh lòːk kì ʔàh sáːm cèn Kammu believe say world prox have three level ‘The Kammu believe that this world has three levels.’ (237) Dara’ang daraɁaŋ ɲɨm tauh boŋ Ɂakja kɨn~kɨn dɨ pən dɨ houɁ wpn believe tell make merit lot~red goal get goal ascend məŋdεŋ heaven ‘Dara’ang people believe (that) making lots of merit will make them be able to go to heaven.’ (238) Mon ɲèh thiəŋ.həjaʔ kɛ̀h rɔ̀ ə həkaoʔ khjɒt ʔa jaʔ 3h think say friend body die go nsit ‘He thought that his friend was dead.’

In the following example from Danau, two independent clauses are juxtaposed and the whole sentence can be read as ‘I called you, did you hear me?’. (239) Danau ō heiʔ mǝ̀ mǝ̀ on 1sg call 2sg 2sg hear ‘Did you hear me call you?’

jɐ̂ q

An alternative construction in Danau is to have the complement clause precede the matrix like a fronted topic. This is also a frequent construction in Mon, where the fronted complement clause is overtly marked as topical or non-predicative by the medial demonstrative. (240) Danau ɐn=nə ɐn klɐj tɯ̀ lò kətʰê ō on tʰənəʔ=nə 3sg=top 3sg become perf person rich 1sg hear like.that=top ‘I heard that he has become a rich man.’

The Austroasiatic Languages: A Typological Overview

75

(241) Mon kɤ̀ʔ pɔŋ.phak ʔa mùə.han kom.kaoʔ kao plàj get associate go together with older.brother young.man kɔ̀ h tɔ̀ h hùʔ màn raʔ medl be neg win foc ‘It is impossible that I will stay together with you, my brother.’ (MKP)

3.2.3 Adverbial Clauses Adverbial clauses cover a wide range of functions and do not form a structurally or semantically uniform category. This is reflected by the greatly diverging constructions found in the different AA languages to express adverbial information about the clause. The main functions expressed by adverbial clauses include conditional, causal, temporal, and manner, among others. As with other subordinate clauses, many AA languages employ an asyndetic strategy, juxtaposing two formally independent clauses. This strategy is available for different functional types of adverbial clauses. This leads in many cases to an ambiguity in the expression which must be resolved by the context. (242) Kammu mè ʔùːn ʔòʔ tèʔ sɨ́p màn, ʔòʔ cə̀ ʔə̀h 2sg.m give 1sg get ten piastre, 1sg irr do ‘If you give me ten piastres, I will do it.’ (243) Mlabri [β] (Rischel 1995: 193) kibi bi:ʔ boŋ di chɛʔ neg satisfied eat link much ‘If you aren’t full yet, eat some more!’ (244) Dara’ang Ɂăn raɲ ɡan kasɔ ʔăn pən rən ʔicak 3sg do work hard 3sg get money little ‘(Though) he works hard, he gets small amount of money.’ (245) Danau ō pə-ljɐk jɪ̄n ɲɐ̄ =nə ɐn mɐ̀ mɐ̃ ɔ tɯ̀ 1sg nfin-arrive return home=top 3sg already dark perf ‘When I arrived home, it was already dark.’

76 (246) Danau ɲɯŋ wæx=nə ō kʰrɯ̄ k meat cow=top 1sg like ‘Although I like beef, I don’t eat (it).’

Jenny, weber and weymuth

mə̄nɔʔ, ō lɐ-swe kɔʔ stat 1sg neg-eat neg.cop

(247) Pacoh ʔrbaŋ bɔː mat lɨː sky rain cool very ‘When it rains, it’s very cool.’ (248) Pnar man da slap, tɛ happen real rain nvis ‘When it rains, then it is cold.’

kʤam be.cold

Besides unmarked, and therefore often ambiguous, subordinate clauses, overt subordinators are commonly found in many languages in different functions. These markers may or may not have transparent semantics, that is, they may be synchronically linked to lexical verbs or nouns. In Chong, the conditional subordinator tʰ9aː is clearly a loan from Thai tʰâː ‘if’. (249) Chong tʰ9aː me̤ː ʔuːɲ dak ceːn || cʰ9an kɔ̄ ː tɔ̄ ŋ ceːw ʔih if mother father 3 come 1sg then must go neg ‘If their parents come, I do not go with them.’ (250) Khmer (Haiman 2011: 316) baə ʔaoj daə taːm thɔːəmmedaː, prəhael mɯ̀ n dɔl tèː. if give walk follow normal perhaps neg arrive neg ‘If [they] had made me walk normally, I might not arrive at all.’ (251) Sedang taŋ cuo tɔ̰ poj ta wɛh ʔoh if in.law laugh imp neg turn neg ‘If you (my in-law) laugh, don’t turn away.’ (252) Bunong (Bequette 2008: 51) lah aj ŋkɔ̤ːn aj ɔn ta̤ːk wiː . . . if 2sg.f cook 2sg.f put.aside water rice.water . . . ‘If you cook rice, you put aside the rice water for me . . .

The Austroasiatic Languages: A Typological Overview

77

(253) Dara’ang se Ɂo mə̆h rən Ɂo dɨ raɲ kaŋ kamaj if 1sg have money 1sg goal do house new ‘If I have money, I will build a new house.’ (254) Kui prənɒː bɜː mia cənap lic tuɒn həːj krənaː tomorrow if rain strong flood again nsit road ‘If it rains hard tomorrow the road will be flooded again.’ (255) Mon jɔ̀ ʔuə hùʔ chɤ klɤŋ ɲèh mùə hùʔ chɤ if 1sg neg find come person one neg find ‘If I hadn’t found it, no one would have found it.’ (256) Pacoh nam ʔarbaŋ bɔː kɨː ləjʔ boːn cho̰ ː tuː duŋ if weather rain 1sg neg get return to home ‘If it’s raining, I can’t go home.’

The conditional subordinator in Danau occurs as a prefix on the verb, rather than in clause initial position. (257) Danau lo-si-nì kəlē də-pəʔ=nə̀ ō kɨ̂ now-day-prox rainfall cond-rain=top 1sg go ‘If it rains today, I won’t go to Aungban.’

kɔʔ ŋbæ̃ ̂ ɛ̆ neg.cop Aungban

Temporal clauses may be expressed by a subordinator. They may express anteriority, simultaneity, or posteriority. (258) Bunong (Bequette 2008: 51) ce̤h caː plɔn hjak ri ɲut lah . . . after eat eggplant type dist pn say . . . ‘After (she) ate that eggplant, Nhut said. . .’ (259) Khasi katba u la da tʰjaʔ ki la leʲt haŋtaj while m pst ins sleep pl pst go there ‘While he slept they went there.’

78

Jenny, weber and weymuth

(260) Khasi ɨnda ka la dep baːm dep diʔ ka la ʃim ja when f pst finish eat finish drink f pst take acc ka=ʃaŋkwaːj ban baːm kwaːj. f=betel-basket inf eat betelnut ‘When she finished eating and drinking, she took the betel-basket to eat betelnut.’ (261) Kui pɒ̤ h cɛːn bəːn kuəj naːw taʔ caː swaːj to̤ ʔ when ripe get person 3 pick eat mango dist ‘When [they are] ripe, people will pick and eat those mango’s over there.’

Causal subordination is usually expressed by a clause initial conjunction. Danau is exceptional here in having a clause final causal marker. This may be due to influence from Burmese, which consistently uses clause final subordinators. (262) Vietnamese Duy ngủ không sâu vì hàng.xóm đang xây nhà. pn sleep neg deep because neighbor prog build house ‘Duy does not sleep well because the neighbors are building a house.’ (263) Sedang ga̰ ko̰ pa̰ŋ rədej klaj suə̰ ga̰. pa.seə̰ŋ weə̰ŋ ga̰ mɛ 3sg prox able strong what because 3sg father.spirit help 3sg so ‘This one was quite strong because he—the father spirit—helped him.’ (264) Kui cɒw.rɜː naːw səʔiə bəːn bəːn naːw kʰɒm.wuɒ kaːr cənap because 3 want get get 3 make.effort labour strong ‘It’s because he wants to get [it] that he’s working so hard.’ (265) Danau kʰæ̃ ̂ wæ̃ ̂ sɐʔouʔ hō hɔʔ (kʰɐũ� ) ō æ̃ ̂ kɨ̂ mɛʔ tǝ-sʰɐĩ purp buy book exist cop (because) 1sg fut go obl all-shop ‘I will go to the shop in order to buy a book.’ [‘. . . because there is a need to buy a book.’]

The Austroasiatic Languages: A Typological Overview

79

(266) Danau jwɐn.swe kæ̃ ɛ̆=nə̀ kʰɐ̃ ɔ̂ ̆ son.klɛ̀ hɔʔ lu hɔʔ believe karma=top because sacrifice cop donate cop ‘Because you believe in karma, you give away possessions, you perform charity.’

Causal clauses in Kammu are formed with the instrumental preposition jʌ̀ ‘instrumental’ or the subordinator pʰɔ́ ʔ wà, borrowed from Tai. (267) Kammu jàːm jʌ̀ tè cə̀ pə́ pɨ̀an pə̀ʔ màh mɨ̀ s.kí cry ins refl irr neg can eat rice day today ‘[I] am crying because I will not be able to eat anything today.’ (268) Kammu kĺ.ʔàːk nɔ́ ŋ jíaŋ cén kʰ.ɗíʔ, pʰɔ́ ʔ.wà trɨ́al tà kə̀ jʌ̀ crow still black until now, because pheasant paint 3sg.m ins kǹ.sáh charcoal ‘The crow is still black until now because the pheasant painted it with charcoal.’

Danau has a special negative clause final subordinator mɐ̄ to mark subordinate clauses with the meaning ‘without V-ing’’. The structure and function of this subordinator is similar to the corresponding Burmese negative subordinator mə-V-pɛ̀ ‘neg-V-neg.sub’, which may be have served as model of the Danau construction. (269) Danau ɐn mə-swe kɔʔ bɯ̄ mɐ̄ ɐn kɨ̂ pon nthɯn 3sg irr-eat neg.cop rice neg.sub 3sg go exit out ‘He went out without eating.’

A common structure found not only in AA languages, but also other languages of Southeast Asia, is the use of a (topic-comment) linker (tcl) to connect clauses. The linker, which may otherwise function as topic marker as in Mon, is usually placed between the two clauses and can be translated as ‘then, but, because, so’, among other possibilities, depending on the context. The first clause may or may not be introduced by a subordinator. Through prosodic reanalysis, the linker may become attached to the first clause, resulting in what looks on the surface like a clause final subordinator.

80

Jenny, weber and weymuth

(270) Mon pɤ̀m.praŋ pèh nɔʔ jɔ̀.raʔ wətɒə tɛm klɤŋ chɤ teh, appearance 2 prox if pn know come find top ɗɛh chanɗu noŋ, pèh thiəŋ ha. 3 pity asrt 2 think q ‘Do you think Wati would feel pity for you if she knew about your condition and came to see you?’ (271) Dara’ang mε si haw mɨ haw 2sg want go tcl go ‘(If) you want to go, then go.’ (272) Kơho dilah gəs jət naʔ caw taːm rəndɛh, gɛn hat ŋan if exist ten clf people loc vehicle tcl crowded very ‘If there are ten people in the car, then it is very crowded.’ (273) Kơho kʰaj at do paŋ sraʔ, gɛn sraʔ ruŋ de uː 3sg hold one clf paper tcl paper fall to ground ‘He held a sheet of paper, (and) it fell to the ground.’

Purposive clauses are frequently expressed by a verb meaning ‘give’, which in many languages also has permissive, jussive, or general causative meaning. (274) Khmer (Bisang 1992: 442) kɔːət prap tɤːu kmeːŋ ʔaoj jɔ̀ ːk tɤːu cùːn cɔmpùəh nɛːək.srɤj bɔndoːl. 3sg tell go boy give take go give to Miss pn ‘He told the boy to take [the parcel] and deliver it to Miss Bondol.’ (275) Pacoh pḛːh cɛːn ɟo̰ ːn kɨː ŋɔjʔ daːʔ get glass give 1sg drink water ‘Get a glass for me to drink water.’

In Kammu, the verb ‘give’ appears to have come to indicate conditional relations as well. The following example can possibly be paraphrased as ‘Let your stone sing and we will give you all this merchandise’.

The Austroasiatic Languages: A Typological Overview

81

(276) Kammu (ʔùːn = ‘give’?) ʔùːn klàːŋ mè k.nìʔ tə́ːm sáh, ʔìʔ cə̀ mʌ̀t kʰɔ́ ŋ.krùa if stone 2sg.m medl sing quot 1pl irr take merchandise kì ʔùːn mè tèʔ lòːc prox give 2sg.m get exhaust ‘If your stone sings, we will give you all this merchandise.’

3.3 Clause Chaining and Coordination In many AA languages, clauses expressing individual, but connected events can be chained without overt linker. An intonational pause may or may not occur between the clauses, and the status of the clauses as dependent or independent is not always clearly definable. Also the distinction between chained events and serial verb constructions is not always obvious, as the definition of a single event vs. multiple events can be elusive. Frequently, paratactic chaining is used when there is no change of subject from one clause to the next, and the events occur in consecutive order. (277) Old Khmer dep saṅ goś9ala duk thpal ta gi saṅ chp9ar next build stable assign herd link it build garden ‘[They] next built a cow stable, assigned a herd to it, [and] made a garden’ (278) Bunong (Philips 1973: 132) ɲup paŋ choŋ klup təm ntrṳŋ catch 3sg carry throw loc pen ‘Catch it, carry it over and throw it into the pen.’ (279) Pacoh kinoː kɨː poːk mamɔjʔ boːn pḛɲ kunɛː yesterday 1sg go hunt get shoot deer ‘Yesterday, I went hunting and was able to shoot a deer.’ (280) Kui beːʔ.re̤ːn ʔɛːl kɒŋ cih ciə dɔŋ ciə student take bicycle drive go house go ‘The students took their bikes and rode back home.’

krɔːj back

82

Jenny, weber and weymuth

(281) Kammu r̀.wàːc k.máŋ tè, cùːr tà kàːŋ, rù hḿ.ràŋ, pàk grab sword refl go.down loc house lead horse, ride hḿ.ràŋ tè jɔ̀ h horse refl go ‘[He] grabbed his sword, went down from his house, fetched his horse and rode away.’ (282) Danau ō jɪ̄n mɛ 1sg return fut ‘I’ll go home and eat.’

ɲɐ̄ house

jɪ̄n return

swe eat

bɯ̄ . rice

The two clauses may also express a contrast, without overt concessive marker, or they may include different subjects, as in the example from Chong. In this case, there is an intonational pause between the clauses, which are syntactically independent. (283) Danau ɲɯŋ wæx=nə ō kʰrɯ̄ k meat cow=top 1sg like ‘Although I like beef, I don’t eat (it).’

mə̄nɔʔ, ō stat 1sg

lɐ-swe neg-eat

kɔʔ neg.cop

(284) Chong lūŋ tʰɔː sūən || tʰɔm kɨj tɔŋ kʰīlɛ uncle do garden aunt stay home disc ‘My uncle is planting garden and my aunt is staying at home.’

A number of languages use dedicated coordinating conjunctions, such as the Chong disjunctive particle rɨ̄ ‘or’ (probably from Thai rɯ̌ ː ‘or’) or Kui rɜː ‘and’, which occurs together with the clause final inclusive marker lɒŋ. (285) Chong kʰe̤ːc ceːw cʰuːn rɨ̄ː ʔuːɲ kɛ̄ː dak ceːn c9a 2sg fut go send or father 3 come receive ‘You will send (her) home or her father will come (to take her home).’

The Austroasiatic Languages: A Typological Overview

83

(286) Kui prənɒː haj ciə srɛː croːt srɒː lɒŋ wa̤h daːʔ tomorrow 1 go fields harvest rice incl make.incision water srɛː lɒŋ rɜː taʔ pələj cətɒːŋ lɒŋ. fields incl and pick.pluck fruit beans incl ‘Tomorrow I go to the fields to harvest rice, to regulate the water and to pick beans.’

In other cases prepositions are used to join clauses or sentences, as in Kơho, where the instrumental marker mə appears in coordinating function. (287) Kơho ŋgum kɔj mə pʰɛ ŋkʰjaŋ kʰaj 3sg winnow paddy ins separate uncooked.rice ‘She winnows the paddy and [she] separates the rice.’

An additive marker ‘also, even’ after the first clause may indicate concessive or consecutive events, as in Dara’ang. (288) Dara’ang ma tauh dɨ mana bə hom dop ɡuɲ Ɂăn lε mana mother tell goal pn wait eat with father 3sg add pn tu bǝ neg wait ‘Mother told Mana to wait and eat with her father, but Mana did not wait.’

Some languages employ more or less desemanticized verbs like ‘finish’ to link events, as seen in Mon. (289) Mon kraoh pèh həlɒ̀ c kətɒ nù kəmot toə kəliəŋ.cao klɤŋ kəpac male 2 shake rise abl fire finish return come side hɒəʔ pèh raʔ. house 2 foc ‘Your husband got up from the fire and came back to your house.’

84

Jenny, weber and weymuth

3.4 Questions Interrogative expressions are of two types, namely polar questions and content questions. Polar questions may be marked only by a change of intonation, usually a rise of pitch towards the end of the sentence. (290) Dara’ang mε dɨ tăh kəɲ̊ a tomorrow 2sg goal come ‘Tomorrow, will you come again?’ (291) Kammu mè mʌ̀t mɨ́ar 2sg.m take loincloth ‘Did you take my loincloth?’

pɔlεɁ again

ʔòʔ 1sg

(292) Kharia (Peterson 2011: 369) beghma, am ɖaʔ uʈh=oˀb? pn 2sg water drink=act.pst.2sg ‘Beghmna, did you drink water?’

In Kui a yes-no question can be asked with a raising intonation clause finally, or with a falling final pitch if the topic is fronted. (293) Kui ʔaːw niː prox shirt ‘Is this your shirt?’

maj ↑ 2

(294) Kui maj niː ↓ ʔaːw shirt 2 prox ‘Is this your shirt?’

Very common are sentence final questions particles, which mark the utterance as an interrogative. (295) Dara’ang mε dɨ loŋ dop khu hεɁ 2sg goal descend with teacher q ‘Will you go down (from Angkhang mountain) with me?’

85

The Austroasiatic Languages: A Typological Overview (296) Mon pèh ciəʔ pɤŋ 2 eat cooked.rice ‘Have you eaten?’

toə finish

(297) Kammu kì pɔ̀ kùːɲ plàːŋ rɔ̀ t 2pl see pn come prox ‘Have you seen Plàaŋ coming here?’ (298) Mlabri mɛh kɔbɔ ɟak 2sg neg go ‘Aren’t you going?’

ja ʔ nsit

ha. q

há q

lɛh q

(299) Kơho me lɔːt draːʔ 2sg.m go market ‘Are you going to market?’

səl q

(300) Semaq Beri gi lompot bəh, beh bəh? hn 3pl flee q neg q quot ‘“Have they fled, or not?” he (asked himself)?’

kɛ  3sg

(301) Kharia (Peterson 2011: 370) kinbhar=te munga daru on=oˀb no, courtyard=obl munga tree plant=act.pst.2sg q ‘Did you plant the munga tree in the courtyard, oh mother?’

m9a mother

joy. voc

Some AA languages, for example Chong, have different question particles, expressing different kinds or nuances of polar questions. The exact functions and meanings of these particles are not always clear or retrievable from the available material. (302) Chong cəʔ ʔ9aj pʰ9aj meet addr Phai ‘Did you see Phai?’

ʔih q

w9a sp

86

Jenny, weber and weymuth

(303) Chong hiw plɔŋ hungry cooked.rice ‘Are you hungry?’

hiːt q

In Danau, the allomorphy of the question particle seems to be caused by the phonetic environment, though the rules are not clear. (304) Danau mə̀ sù kəɲwɐ̄ n 2sg walk path ‘Are you walking?’ (305) Danau ō tʰɯ̄ tənī 1sg sit there ‘Can I sit there?’

bɪ̃ get

ɐ̂ q

jɐ̂ q

Clause initial interrogative particles also occur, though apparently less commonly than clause or sentence final markers. (306) Sedang ʔa̰j ʔa hom ʔo̰ j q q remain exist ‘Is there still more wine?’

drow wine

neo̰ more

(307) Kammu k.màʔ rɔ̀ t ɓéc hŋ́ .kúːr q storm rain come ‘Has there been a rainstorm?’

Another possibility to express polar questions encountered in AA languages is by juxtaposition of the affirmative and negative form of the predicate, or by adding a negative element as question marker.

The Austroasiatic Languages: A Typological Overview

87

(308) Pnar ha ka=ʧi-snɛm, ʧi-sen ʧi-snɛm ʤoʔ laj phi nɛ m̩ loc f=one-year one-time one-year same go 2pl tag neg ɛm nɛ, mɔ have tag q ‘Once a year also, you go or not?’

Content questions contain an interrogative pronoun or adverb, which usually occurs in situ, that is, in the position where the element about which information is asked would occur. Temporal interrogatives in Kui are distinguished as to whether the event is in the past (təh naː) or in the future (pɒ̤ h nia). Both forms are found in situ. (309) Kui ciə pʰsaːr pɒ̤ h.nia go market when ‘When do [you] go to town?’ (310) Kui təh.naː kɒːn maj n̩ druh sɛːŋ when child 2 fall descend ‘When did your child fall from the house?’ (311) Chong cʰɔː tap tōŋ nih dog bite loc where ‘Where did the dog bite you?’ (312) Mlabri naŋsɯ ʔoh dɔk gəlɛŋ book 1sg put where ‘Where [shall] I put the book?’ (313) Kui maj səʔiə ca ː ntuɒ. 2 want eat what ‘What do you want to eat?’

rɜː from

dɔŋ house

88

Jenny, weber and weymuth

(314) Kammu pàːr klám mə̀ʔ cə̀ who irr carry two ‘Who will carry two [loads]?’ (315) Dara’ang mε və̆h malouh dop mase 2sg open door with what ‘With what did you (use to) open the door?’

Interrogative pronouns and adverbs may be fronted as in the following examples. Mon also allows some fronted interrogatives, most commonly mùʔ ‘what’, to be resumptively repeated in situ. (316) Mon hɒm. mùʔ ɗɛh what 3 speak ‘What did he say?’ (317) Mon ʔaŋkəlòc kɔ̀ h mùʔ ɗɒc klɤŋ mùʔ. English medl what ride come what ‘What did the English ride when they came here?’ (318) Kơho nteːŋ me lɔːt where 2sg.m go ‘Where are you going?’

(tajh) (q)

(319) Pacoh ʔa.məh maj pa.pi: what 2sg say ‘What are you saying?’

In Kharia, the interrogative pronouns or adverbs regularly appear in the immediate preverbal position, which is the default focus position in this language. (320) Kharia (Peterson 2011: 370) laʔ tama raij ber calay=e. then now kingdom who drive=act.irr ‘Now who will lead their kingdom?’

89

The Austroasiatic Languages: A Typological Overview

Some AA languages have a sentence final content question marker, which is different from the polar question marker. Danau uses the same question marker in content questions as in polar questions. The use of the content question marker is often optional. (321) Mon hnòk ʔɒt kɔ̀ h klon ʔəwao older.brother big all medl do ‘What does your oldest brother do?’ (322) Kơho me lɔːt nteːŋ 2sg.m go where ‘Where are you going?’

mɔ̀ ŋ stay

mùʔ what

rao. q

(tajh) (q)

(323) Pacoh tikuəj pako̰ h viː klɨŋ jaːʔ people Pacoh have many family.names ‘Do the Pacoh people have many family names?’

ləʔj q

(324) Danau dəmwɐ̀-rɐ̂ where-q ‘Where?’ (325) Danau ɐn lɯ kətʰɐn 3sg desire what ‘What does he want?’

ɐ̂ q

Car uses a special set of subject pronouns in interrogative expressions. (326) Car sítih kanɔ́ ːʔŋəváhtíːʔ how catch.prfv ‘How did you catch it?’

ʔəm 2sg.sub

tə́ rel

(327) Car iɽɔ́ ːŋə ʔac véː how.many 3du.pl.q prox.pl.anim ‘How many men there are!’

ʔɔ 3sg.anim.q

tarík people

90

Jenny, weber and weymuth

Alternative questions may be either formed by a disjunctive conjunction, or by parallel affirmative-negative questions, as in Mon, or by juxtaposing two lexemes with opposite meanings, as in Old Khmer. (328) Mon pèh ket pɤŋ ha, 2 take cooked.rice q ‘Do you want rice or a snack?’

kwaɲ sweets

ha. q

(329) Old Khmer vnaṃ khvuṅ d9ap ○ khvuṅ hill high low high ‘Is the hill high or low? [It] is high’

3.5 Imperatives Imperative expressions are used to issue commands to the addressee. In many cases, the most simple form of an imperative expression is the plain verb, with or without the addressee overtly expressed. (330) Chong kʰɨŋ kəpaw hold bag ‘Hold my bag!’

ʔɨt give

mɔŋ. also

(331) Kammu mʌ̀t ˀwɨ́ar jɔ̀ h kɨ̀t take knife go cut ‘Go cut it with a knife!’ (332) Mlabri (mɛh) (di) 2sg link ‘Smile!’

rɛʔ smile

(333) Mlabri mɛh gʌm ɟak 2sg proh go ‘Don’t go [there]!’

kə̀. 3sg.m

The Austroasiatic Languages: A Typological Overview

91

(334) Khasi leʲt go ‘Go!’ (335) Khasi kʰala.kʰala wan come soon ‘Come soon!’

A downtoning word may be added to an imperative expression in order to soften the one and make the expression appear more polite. The downtoner may be a sentence particle or a lexical verb, often with a meaning like ‘ask’, ‘invite’, or similar. (336) Khmer (Bisang 1992: 439) soːm taː ʔɔɲcɤ̀ːɲ6 ask grandfather invite ‘Please, you go and thank him.’ (337) Kơho ɗəs mham ʔet dan please speak fast little ‘Please speak a little faster.’

tɤːu go 6

ʔɔːkùn thank

kèː 3sg

coh. imp

taj more

In other cases, an imperative marker is added, as in Kơho and Kui, or a reinforcing verb, frequently with the lexical meaning ‘go’, is added, as seen in the Chong example. (338) Kơho lɔːt laʔ, ʔaɲ ʔɑːʔ go imp 1sg neg ‘(You) go, I’m not going!’

lɔːt go

6 ʔɔɲcɤ̀ːɲ is a polite word that typically is used to show respect, and has nothing to do with imperatives—e.g. ‘Grandfather went’ would also use ʔɔɲcɤ̀ːɲ (Franklin Huffman p.c.).

92

Jenny, weber and weymuth

(339) Kơho lɔːt tɛʔ bal.mə bɔl.hi go imp with 1pl.excl ‘(You) come along with us.’ (340) Kui ciə maj sɒh go 2 ascend ‘Go into the house!’

dɔŋ house

ciə imp

(341) Chong ceːw leː̤ŋ ŋa̤ːˀj~ŋa̤ːˀj ceːw go play far~red go ‘Go and play far from here!’

Negative imperatives or prohibitives are often expressed differently from negative statements by employing a special prohibitive marker. This will be discussed in section 4.3.3. 3.6 Pragmatics and Syntax Many AA languages exhibit strong tendencies towards pragmatically determined sentence structure. Constituents can be fronted for informationstructural or other pragmatic effects, and retrievable or known arguments can freely be omitted. Topicalization is much favored already in the Old Khmer inscriptions. Topicalized NPs are commonly but not always introduced by the prepositions ri ~ ri e, e, or nau ~ nau ru. Typically, topicalized NPs serve as the direct object, brought up to the sentence head. Occasionally they are the subject, and are followed by an anaphoric pronoun, or a genitive NP. Example of a genitive topic: (342) Old Khmer k9ap thpvaṅ nau ta yokk neḥ ta roḥ neḥ ti pre top link take prox link manner this link order chop head ‘Of those who take these aforesaid—[the executioner] shall be ordered to cut off [their] heads.’

Similar constructions are found in many AA languages, often without overt marking of the topic and with a wide range of semantic roles of the fronted element.

93

The Austroasiatic Languages: A Typological Overview (343) Khmer (Haiman 2011: 211) cɪ̀ːə bɤj phtɛːəh thmɤj nɪ̀h kèː khan house new prox 3pl divide be three ‘This new house, they divided into three rooms.’

bɔntùp. room

(344) Mlabri naŋsɯ ʔoh dɔk gəlɛŋ book 1sg put where ‘Where [shall] I put the book?’ (345) Kammu kʰ.ɗíʔ prìaŋ pháːn sɨ́aŋ, prìaŋ pə̀ʔ ʔàh sɨ́aŋ, sɔ́ ʔ prìaŋ pə́ pháːn now people kill pig, people eat meat pig, dog people neg kill ‘Nowadays people slaughter pigs and eat pork, but dogs they don’t slaughter.’ (346) Mlabri kap boŋ duck eat ‘(We do) eat ducks.’ (347) Mon mənìh prèə kɔ̀ h ɲèh kok.khao human female medl person call ‘The woman was called Mi Lim Canay.’

ciəʔ eat

mìʔ.lìm.cənaj pn

raʔ. foc

(348) Pacoh kərnaː ʔn.nɛh ʔipoːk ʔɔː lɨː road prox 3sg.go good very ‘As for this road, one goes well.’ or ‘This road is very good to go on.’ (349) Dara’ang maɡreih haj Ɂăn hεɁ bear ferocious 3sg already ‘The ferocious bear, he already killed.’

ŋauh kill

If the subject is topicalized in Pnar, the clause structure is AVP, with fronted object the constituent order becomes PVA.

94

Jenny, weber and weymuth

(350) Pnar u=bru ʧim u ka=wat̪ m=person take m f=sword ‘the man, he took the sword’ (351) Pnar ka=wat̪ ʧim u=bru f=sword take m=person ‘the sword, the man took (it)’

In Mon, fronted topical arguments are usually, though not obligatorily, marked as such by the medial demonstrative kɔ̀ h. This is also true in clauses with recursive fronting, that is, with the A argument fronted after the object has been moved to the clause initial position. In this case the A can be resumptively expressed by a pronoun in situ. (352) Mon ʔəpa ʔuə (kɔ̀ h), ʔərɛ̀k ( ɲèh) hùʔ sɤŋ, ɓɔk ( ɲèh) hùʔ sɤŋ. father 1sg medl liquor person neg drink cigar person neg drink ‘My father, he doesn’t drink, nor does he smoke.’

Danau and other AA languages have dedicated topic markers. (353) Danau mǝ̄nɐ̃ɔ khɯ̀ =kɐ̄ khɯ̀ .sû ē pɐj kɔʔ mɛ̀ speech 3pl=top pol 1pl can neg.cop epist ‘We could not speak their language (as you probably know).’

A more elaborate form of topic marking in Danau involves the embedding of the topicalised NP between the preposition kwe ‘on’ and the demonstrativetopic marker complex nì=nǝ or nɐ̄ ɔ=nǝ. (354) Danau kwe ǝlu.ǝtæ̃ ̂ ɛ̆ pɯ̄ nɐ̄ ɔ=nǝ, nɔǝ̆t kǝdɐj nì hō kɔʔ on charity 2pl dist=top mind stinginess prox exist neg.cop ‘When it comes to your being charitable, [there should be] no meanness of spirit.’

The Austroasiatic Languages: A Typological Overview

4

95

Grammatical Categories

4.1 TAM and Directionals 4.1.1 Tense and Aspect The grammatical category tense is not present in the majority of AA languages, though some languages express a distinction future vs. non-future grammatically. This is usually achieved by a particle indicating future tense, though it is not clear that the marker glossed as ‘future’ is not rather a marker of irrealis or probability, and should be treated as indicator of modality rather than of tense. (355) Kơho nam daʔ kʰaj rəp lɔːt year next 3sg fut go ‘Next year, I will go to Đà Lạt.’

tam to

Daːlaːc Dalat

(356) Danau ō æ̃ ̂ ɛ kɨ̂ tǝ-shɐ̃i 1sg fut go all-shop ‘I will go to the shop.’

Besides the preverbal future marker æ̃ ̂ ɛ, Danau also employs a posterbal marker mɛ, which appears to be a loan from the Burmese postverbal future or irrealis marker mɛ. (357) Danau ō jɪ̄n mɛ 1sg return fut ‘I’ll go home and eat.’

ɲɐ̄ house

jɪ̄n return

swe eat

bɯ̄ . rice

Khasi overtly marks past and future tense, while the present remains unmarked. The future marker appears as a clitic on the subject. (358) Khasi ki la wan pl pst come ‘They came.’

96

Jenny, weber and weymuth

(359) Khasi ki=n wan pl=fut come ‘They will come.’

In Car, future and past are marked overtly by free markers. The past tense is also indicated by the use of a special set of past tense pronouns. (360) Car kaláh cin min jih try 1sg fut come ‘I will try to come to you.’

ʔin obl

mɛh 2sg.poss

(361) Car heŋ taka ək misíːʔ tafísi tə one person 3sg.pst pst widow link ‘Once there was a widow who had four children.’

fɛːn four

manɨ́k clf

kuːn child

Kharia and other Munda languages mark past and present tense together with active and middle in fusional morphemes. The future is expressed by the irrealis marker, which is also fused with the voice marker. These markers are clitics which may attach to verbal or nominal elements alike. (362) Kharia (Peterson 2011: 295) koɖpuɽuʔ=ki gam=na=ʔ hosiyar gam man=pl say=inf=gen intelligent say ‘Men call themselves intelligent in (= of) speech.’

ɖom=ta=ki. refl=mid.prs=pl

(363) Kharia (Peterson 2011: 297) u kamu ikon=oˀj. this work do=act.pst.1sg ‘I did this work.’

Imperfective events are expressed by the addition of an aspect marker or a verb with lexical meaning ‘continue’ or ‘be at, stay’. (364) Pacoh ʔacaːj ʔat toːŋ older.sibling be.at say ‘What are you saying?’

ʔaməh what

The Austroasiatic Languages: A Typological Overview

97

(365) Sedang da̰ deə loj ʔɛh 2sg continue water release ‘You continue to pour out (the liquid).’ (366) Bugan ʔɔ³¹ sai³³ 1sg dur ‘I’m writing.’

pioŋ¹³ write

(367) Bugan mɛ⁵⁵ be³⁵ ʦu³¹ ʔɔŋ³⁵(¹³) child hold cont sugar.cane ‘This boy is holding two sugar canes.’ (368) Danau ɐn tɐ̄m swe 3sg prog eat ‘He is still eating.’

bi³¹ two

ʦʰe¹³ clf

bɯ̄ rice

In Kharia, different morphemes are used to express imperfective, perfective, inceptive and progressive, among others. The imperfective marker laʔ appears as free morpheme, to which tense and person markers may be cliticized. The main predicate appears in the infinitive. (369) Kharia (Peterson 2011: 342) khaɽiya lebu=ki pujapaʈh katay=na laʔ=ki=may. Kharia man=pl sacrifice do=inf ipfv=mid.pst=3pl ‘The Kharia men used to perform sacrifices.’

Imperfective markers may also appear in negated contexts, as in the following example from Danau. (370) Danau ɐn khɐ kɨ̂ kɔʔ ŋbæ̃ ̂ ɛ̆ 3sg cont go neg.cop Aungban ‘He hasn’t gone to Aungban as yet.’

Perfective situations are often expressed by the addition of a secondary verb with intrinsic telic meaning, such as ‘throw’, ‘complete’, or ‘finish’. The latter has

98

Jenny, weber and weymuth

in a number of languages developed into a marker of ‘new situation’ (nsit), an aspectual category widespread in Southeast Asia which expresses that a situation has been established after a change of state. In negated expressions, the reading is regularly ‘not anymore’. nsit expressions are often (wrongly) translated or analyzed as perfect or past tense. It is not always evident from the available material whether an element glossed as past or perfect(ive) is actually an nsit marker or not. This is the case in the following example from Kơho and Car, where ŋác has the lexical meaning ‘finish’. (371) Kơho kʰaj nɛh lɔːt tam Daːlaːc 3sg pst go to Dalat ‘S/he went to Đà Lạt yesterday.’

ŋa j day

(372) Car jɔ́ ːklə ŋác ŋam ʔám (nə) climb finish def dog 3 ‘The dog climbed up on his back.’

i ʔúk on back

ʔɔr previous

ɔ 3sg.anim

In Khasi, it is not clear whether the preverbal dep ‘finish’ expresses only completed events (perfective), or also events that have begun (nsit). (373) Khasi ɨnda ka la dep baːm dep diʔ ka la ʃim ja when f pst finish eat finish drink f pst take acc ka=ʃaŋkwaːj ban baːm kwaːj. F=betel.basket inf eat betelnut ‘When she finished eating and drinking, she took the betel-basket to eat betelnut.’

In Kui, the sentence final particle həːj is apparently a loan from Khmer haəj ‘finish, nsit’. (374) Kui haj wuɒ bəːn həːj 1 make get nsit ‘I have done/completed it.’

99

The Austroasiatic Languages: A Typological Overview (375) Kui to̤ ʔ cɛːn həːj swaːj mango dist ripe nsit ‘That mango is already ripe.’

Other languages have the category nsit, but the marker is not related to the verb ‘finish’, or any other known lexical source, as is the case in Mon and Mlabri. (376) Mon ɲèh thiəŋ.həjaʔ kɛ̀h rɔ̀ ə həkaoʔ 3h think say friend body ‘He thought that his friend was dead.’

khjɒt die

ʔa go

(377) Mlabri [β] (Rischel 1995: 190) khep mɤm maʔ ʔoh ʔa ɲon, ʔa slippers father give 1sg nsit n.exist nsit ‘The slippers father gave me are used up, they’re broken.’

jaʔ. nsit

tac break

(378) Mlabri ki a-pluŋ neg nsit-hot ‘not hot anymore’

The spread and exact range of functions in different languages, both Austroasiatic and others in Southeast Asia, is still to be established and needs more in-depth research in the relevant languages. 4.1.2 Modality Like aspect, modality is mostly expressed in AA languages by secondary verbs. One common construction is a main lexical verb combined with a grammaticalized verb meaning ‘get’ to express possibility or similar notions. This pattern is very widespread throughout Southeast Asia and beyond (Enfield 2003). In the following example from Kammu, the lexical meaning of the verb pɨ̀an ‘get’ is still transparent, while it is more bleached in other instances. The position of potential ‘get’ is either preverbal or postverbal.

100

Jenny, weber and weymuth

(379) Kammu pə́ pɨan pə̀ʔ màh mə̀ʔ pə́ hóːc hń.túʔ cń.tràŋ tè indf neg finish hole pole refl neg get eat rice ‘Those who have not finished their hole for the pole will not get anything to eat.’ (380) Dara’ang Ɂăw pən dɨ kauɁ Ɂăn neg get goal bite 3sg ‘The bear was not able to bite him.’ (381) Sedang ka pin ca 1pl.incl get eat ‘We can eat boar.’ (382) Danau mə̀ swe nɐ̄ɔ 2sg eat that ‘You can eat that.’

bɪ̃ get

rəka̰j boar

mə̄nɔʔ stat

In Mon, the verb kɤ̀ʔ ‘get’ occurs in two postverbal position, either directly after the main verb, or after the object. The reading of the two structures is different, as illustrated in the following examples. (383) Mon ɗɛh rɔ̀ p kɤ̀ʔ kaʔ. 3 catch get fish ‘He caught a fish.’ (‘He succeeded in catching a fish.’) (384) Mon ɗɛh rɔ̀ p kaʔ kɤ̀ʔ. 3 catch fish get ‘He can catch fish.’ (‘It is possible/allowed for him to catch fish.’)

Other modals are frequent in preverbal position, expressing different notions. Their lexical sources are in many cases not known and they must be regarded as purely grammatical markers in many languages. In Khasi, some modal verbs are linked to the main verb by the infinitive marker.

The Austroasiatic Languages: A Typological Overview (385) Kui mɒːŋ mɜʔ kɜːt haj səʔiə ʔɒːn sɛːm.saːj pələj kroːc 1 des give relatives fruit orange but neg exist ‘We want to give oranges to my relatives, but we don’t have many.’ (386) Danau ō (kǝ)lɐ swe 1sg dare eat ‘I dare to eat it.’

101 kəlɜŋ much

mə̄nɔʔ stat

(387) Khasi u=n ʤinda m=fut may ‘He may go.’

leʲt go

(388) Khasi u dej ba n m must inf ‘He must eat.’

baːm. eat

(389) Khasi u=n sa laʔ m=fut definitely able ‘He will be able to go.’

ban inf

leʲt go

In Kharia, the lexical verb lam ‘look for’ is used as postverbal desiderative auxiliary and appears with the infinitive of the main predicate. Tense and person markers are attached to the auxiliary. (390) Kharia (Peterson 2011: 347) iɲ u ikuˀɖ sundar kontheˀɖ=ki=te bajhay=kon satay=na 1sg prox very beautiful bird=pl=obl trap=seq torment=inf um=iɲ lam=te. neg=1sg want=act.pres ‘I don’t want to trap these beautiful birds and torment them.’

4.1.3 Directionals Secondary verbs with lexical meanings ‘come’ and ‘go’, as well as a small number of others, function in many AA languages as directionals with spatial as well as temporal meaning.

102

Jenny, weber and weymuth

(391) Chong m9an hɔːt ceːw 3 drag go ‘It drags (him) away.’ (392) Chong ne̤ːw~ne̤ːw coːk ceːn c9aːk child~red run come from ‘The children run from their home.’

tɔŋ home

(393) Mon klɒ kɔ̀ h həɗiəŋ krìp ʔa teh, hətum ceh ʔa ɗɔə ɓi. dog medl chase run go top fall descend go loc river ‘As the dog ran after the deer, it fell down into a river.’ (394) Khmer (Bisang 1992: 442) kɔːət prap tɤːu kmeːŋ ʔaoj jɔ̀ ːk tɤːu cùːn cɔmpùəh nɛːək-srɤj bɔndoːl. 3sg tell go boy give take go give to person-woman pn ‘He told the boy to take [the parcel] and deliver it to Miss Bondol.’

Mon has a kind of transitivity harmony in complex predicates involving a main verb and a directional. If the patient is principally affected by the event described by the transitive predicate, the directional must appear in the transitive/causative form. If the agent is principally affected, the directional remains in the basic/intransitive form, irrespective of the transitivity value of the main verb. The choice of the form of the directional in Mon is obviously based on affectedness, rather than syntactic transitivity. (395) Mon ɗɛh pɛ̀k ʔa ka. 3 follow go car ‘He drove the car (away from here).’ (396) Mon ɗɛh pɛ̀k na klèə. 3 drive caus.go cow ‘He drove the cattle (away from here).’

4.1.4 Other Secondary Verbs Other secondary verbs can be used to express notions that are not easily classified and do not fall in the categories of tam or directionality. One example

103

The Austroasiatic Languages: A Typological Overview

is a morpheme expressing non-volitionality or non-intentionality of the agent, indicating that he or she did something without meaning to do it or without considering the consequences. In Mon, the verb tɛ̀h ‘hit, touch, come into contact with’ is used in this function. In Danau, no lexical source has been identified. (397) Mon kɤ̀ʔ tɛ̀h ɓɛ̀ʔ kon ŋèə həkaoʔ klàj kɔ̀ h. get hit ref offspring frog body seek medl ‘He got the little frog he was looking for (by coincidence).’ (398) Danau ō swe mɛ̃j bɯ̄ frɐ̄ 1sg eat nvol rice Buddha ‘I accidentally ate the rice (meant as an offering) for Buddha.’ (399) Danau mə̀ pə-heiʔ ō=nə 2sg nfin-call 1sg=top ‘I didn’t hear you calling me.’

ō 1sg

lɐ-on neg-hear

mɛ̃j nvol

kɔʔ neg.cop

Other secondary verbs found in AA languages combine aspectual with manner notions. A verb with the lexical meaning ‘eat’, for example, expresses that something is done regularly or habitually, with a connotation of inward-directedness or self-benefactive. A verb meaning ‘throw, throw away’ is often used to indicate that an activity is carried out completely and impulsively, often irreversibly. The verbs involved in these constructions and the range of functions varies greatly among AA languages, so that hardly any general pattern across the family can be established. 4.2 Causative and Passive Constructions Besides the morphological causatives described in section 2.3.2, many AA languages also have the possibility to form causatives periphrastically. This is frequently achieved by a preverbal auxiliary or secondary verb with the lexical meaning ‘give’. This pattern is widespread in the languages of Southeast Asia, and also found in other parts of the world, though the exact form of the construction may vary considerably. In some cases the lexical semantics is transparent, as in the following example from Pacoh, while in other cases the verb ‘give’ is more grammaticalized. The structure commonly found in AA languages is ‘CAUSER GIVE CAUSEE V’, with the causee appearing as subject of its own predicate. This construction is already found in Old Khmer. In modern Khmer,

104

Jenny, weber and weymuth

‘give’ may occur on its own or in connection with the verb meaning ‘do, make’. The latter construction is also found in Khmer and Mon, especially with nonhuman causers. (400) Old Khmer ta sruk pvan oy śapata ’anak give swear person link village four ‘to administer the oath to men of four villages’ (‘make them swear to the men of the four villages’) (401) Pacoh pḛːh cɛːn ɟo̰ ːn kɨː ŋɔjʔ get glass give 1sg drink ‘Get a glass for me to drink water.’

daːʔ water

(402) Khmer (Haiman 2011: 316) baə ʔaoj daə taːm thɔːəmmedaː, prəhael mɯ̀ n dɔl if give walk follow normal perhaps neg arrive ‘If [they] had made me walk normally, I might not arrive at all.’

tèː. neg

(403) Mon ʔəŋàn mùə ŋuə ɗɛh kɒ, ɗɛh kwɒ klon raʔ, ɗɛh kɒ quota one day 3 give 3 neg.give make foc 3 give cù raʔ ʔiʔ-kɔ̀ h ɗɛh kɒ cù. rest foc nml-medl 3 give rest ‘They gave us (the money) for one day’s work quota, then they wouldn’t let us work anymore, they let us rest; they let us rest like then.’ (404) Khmer (Bisang 1992: 441) thvɤ̀ː-ʔaoj khɲom ceh.tae cɔŋ nɪ̀ːjɪ̀ːəj cɔŋ khɤ̀ːɲ heːŋ make-give 1sg constantly want speak want see pn ‘[This] made me constantly want to talk and to see you, Heng.’ (405) Mon kja pɒc paʔ kɒ taj wind blow do give hut ‘The wind destroyed the hut.’

lɤ̀m destroy

ʔa. go

ʔaeŋ. 2sg

The Austroasiatic Languages: A Typological Overview

105

The periphrastic permissive causative in Kharia is formed by combining the main predicate in the infinitive with the verb ter ‘give’, which takes tense and person markers, or combines with aspectual auxiliaries. (406) Kharia (Peterson 2011: 347) beʈ=ɖom kosu buŋ jhalay=kon kamu=na=jo um son=3poss sickness ins be.troubled=seq work=inf=add neg ter=na laʔ=ki. give=inf ipfv=mid.pst ‘With her son suffering so much from the illness, [his mother] didn’t even let him work.’

In Bugan, periphrastic causatives are constructed with ŋgɔ̱³⁵, which is derived from the lexical meaning ‘to drive out’. (407) Bugan xo̱ u̱³⁵ ‘ride (a horse)’ pa̠ɯ̱³¹ ‘get up’

ŋgɔ̱³⁵xo̱ u̱³⁵ ŋgɔ̱³⁵ pa̠ɯ̱³¹

‘to make/force sb. to ride (a horse )’ ‘to make sb. get up’

In other cases, such as Kammu and Danau, the lexical source of the periphrastic causative marker is not known. (408) Kammu kə̀ lə̀ tòk k.sés róːŋ 3sg.m then caus fall mouth.harp ‘He then dropped his mouth harp (by mistake).’

tè refl

(409) Danau mɐj ɐn=nǝ sho kɨ̂ lùpē ŋbæ̃ ̂ ɛ̆ mother 3sg=top caus go child Aungban ‘The mother sends the child to Aungban.’ (lit: His mother makes the child go to Aungban.)

Few AA languages have special morphological or periphrastic passives. Frequently, unmarked verb forms can receive passive reading, depending on the context. A limited number of NPs in Old Khmer consisting of a head + genitive dependent, when attributive to a head, can be interpreted as passivized by virtue of their position. In this case the genitive dependent may be said to function as the agent.

106

Jenny, weber and weymuth

(410) Old Khmer lon sre pradāna poñ riceland gifted title pn ‘Rice land gifted by the poñ Lon’ (411) Old Khmer . . . prasap sre daṃnuñ chloñ vidy9amaya . . . abut field purchased title pn ‘. . . [it] abuts on the ricefield purchased by the chloñ Vidyāmaya’

Old Khmer and Old Mon have particles marking the passive. In Old Khmer, the marker ti is of unknown origin, while Old Mon ñin is related to an AA verb root meaning ‘receive, accept’ (Shorto 1971: 133). Note that the verb cincon in Old Mon can be translated as ‘decorate’ or ‘be decorated’, that is, the passive marker is not compulsory to achieve passive reading. The agent in Old Khmer is not demoted to an oblique role but apears in its normal preverbal position. No passive clause with overt agent is found in the Old Mon data. (412) Old Khmer mrat9añ oy ta ti vraḥ sruk sre ta village field link pass lord give link holy ‘village [and] ricefields which were given by the lord to the divinity’ (413) Old Mon (Shorto 1971: 132) dinṅal thar ma ñin cincon mirror gold rel pass decorate ‘a golden mirror set with gems’

na ins

rat gem

In Vietnamese, the passive voice can be expressed by one of a number of auxiliaries. Like in Old Khmer, the agent is not demoted in these constructions but appears in the position before the main verb and does not receive any oblique marking. (414) Vietnamese Quyển sách này do Vũ.Trọng.Phụng viết clf book prox pass pn write ‘This book was written by Vũ Trọng Phụng in Hanoi.’

ở reside

Hà.Nội. Hanoi

In Jahai, progressive forms can be used with passive meaning and the patient is marked as subject. Nothing in the morphology of the verb indicates the passive function, though.

107

The Austroasiatic Languages: A Typological Overview (415) Jahai (Burenhult 2005: 158) ka=gtah, wa=bk-ʔɛk wa=bk-ʔɛk irr.3sg=prog-give sbj=rubber irr.3sg=prog-give ‘Rubber will be given, oil palms will be given.’

ka=klapah.sawit sbj=oil.palm

Kơho and Pnar have passive markers, the use of which results in the patient appearing as subject and the agent being demoted to an oblique role. (416) Kơho caːl paːʔ mpoːŋ wind open door ‘The wind opened the door.’ (417) Kơho mpoːŋ gə-paːʔ mə caːl door pass-open ins wind ‘The door was opened by the wind.’ (418) Pnar ʧ aʔ pn̩ -jap ka=khla ha pass caus-die f=tiger loc ‘the tigress was killed by the man’

u=bru m=person

In Car, as one of the few AA languages with a morphological passive, the suffix -ə is attched to the verb stem, which may in turn be expanded by aspect and other markers. This suffix has a number of allomorphs: ə ~ a ~ Ø ~ əŋə ~ və ~ u ~ u(ə). The patient appears as subject of the clause, and the agent is demoted to an oblique role, marked with the general linker tə. (419) Car mɨ́kka ən ŋíh look.ipfv.pass 3sg.prox prox.sg.inan ‘This book is looked after by Jane.’

líːpərɛ book

tə link

céːn. pn

4.3 Negation The constructions of negative sentences in the AA languages are very diverse, so that hardly any feature can be found that may be claimed as general AA. There may be a tendency for preverbal negation, but for too many languages no or only little data is available, so that a valid conclusion is not possible. There is no negator that is reconstructible to the protolanguage, though a number of negators or words used as negators are found

108

Jenny, weber and weymuth

in languages from different branches. A negator found in Mlabri (kɔbɔ), Danau (kɔʔ), and a number of Munda languages (Ho, Mundari ka) contains the initial k-. In Old Mon, the verb kaḥ is a negative copula, which may be related. Another negator that originates in a verb is AA *ʔət ‘lack, be finished, exhausted’, which is found as negative marker in Car (ʔət) and Munda languages (Gorum or-, Juang ar-). 4.3.1 Statements Austroasiatic main negators may be particles, negative auxiliaries, pronominalized negators or negative copulas. Preverbal negators usually have narrow scope over the following verb, while negative copulas and clause final negators may have wide scope over the clause. The main negator in most AA languages is a preverbal particle, as seen in in Vietnamese and Dara’ang. (420) Vietnamese thảo không làm ở khoa. pn neg work reside faculty ‘Thảo does not work at the faculty.’ (421) Dara’ang sanaɁ mɨ tu gun tcl neg ‘Gun, (I) don’t have.’

mə́h have

In many Munda languages, the negator is a negative auxiliary that gets the person marking from the main verb, like in Santali. (422) Santali (Neukom 2001: 149) ba-ko baḍae-a. neg-3pl know-ind ‘They don’t know.’

Muöt (Nancowry) has three negative particles and additionally also a set of pronominalized negators. These are probably the result of a contraction of the personal pronouns with the negative particle nit (Rajasingh 2013: 56). Example (423) contains the first person singular pronominalized negator. (423) Muöt (Rajasingh 2013: 56) cit juaŋsisɛ ʔitcaccə 1sg.neg prog read ‘I am not reading this book.’

nɛʔ leːpəɹɛ. prox.vis book

109

The Austroasiatic Languages: A Typological Overview

The principal negator in Danau is the postverbal negative copula kɔʔ. The prefix lə-, attached to the main verb, is optional in colloquial speech. (424) Danau ō (lǝ-)kɨ̂ lɔn 1sg (neg-)go accompany ‘I won’t go with you.’

kɔʔ neg.cop

mǝ̀ 2sg

Postverbal negation is also found in Rumai (unlike other Palaung varieties) and Chong, in the latter case interchangeably with a preverbal negator. (425) Chong cʰăn ceːw wə̤t mɛʔ tɛ̀ː mɛʔ 1sg go meet grandmother but grandmother ‘I went to meet my mother but she was not there.’

kɨj stay

ʔih neg

(426) Rumai ʔɐwː mʌk mă tho. 1sg like neg bean ‘I do not like beans.’

A negative copula may also be used to negate nonverbal predicates like in Pacoh. (427) Pacoh ʔrviət ʔn.nɛh ʔih pen prox neg.cop ‘This is not your pen.’

ʔm.maj 2sg.poss

Nonverbal predicates in Mon are negated by the postposed form hùʔ siəŋ ‘it is not (the case that)’, with hùʔ, the general negator in Mon, and siəŋ ‘be so’. (428) Mon həkaoʔ nɔʔ kɔ̀ h ʔəca kəsao body prox medl teacher nml.write ‘She (knew that she) was no writer.’

lɛ add

hùʔ neg

siəŋ. be.so

In addition to the main negators, there are several features like emphasising negative particles, negative verb forms and negators for other purposes. Mon has a final negative particle pùh that reinforces the negation. In colloquial speech, pùh sometimes occurs without the main negator hùʔ.

110

Jenny, weber and weymuth

Additionally, there are some frequent verbs with a labial infix as negation marker, like in kwɤ̀ʔ (kɤ̀ʔ ‘get’). Both features are shown in example (429). (429) Mon pèh ʔa kwɤ̀ʔ 2 go neg.get ‘You cannot go.’

pùh neg

A clause-final negator also occurs in Khmer. In colloquial informal speech, it is used either alone or, with a certain emphatic effect, together with a preverbal negator. (430) Khmer mdaːj mɯ̀ n thvɤ̀ː mhoːp mother neg make meal ‘Mother is not making the meal.’

(tèː). neg

In Dara’ang (Ruching) and in the other varieties of Palaung, Shwe and Rumai, there is a preverbal negator ɲ̥am ‘not yet’, the origin of which is unknown. There may be a connection with the homonymous verb with the meaning ‘be dilatory; be stiff’. (431) Dara’ang ɲ̊ am hɔc r̥aɲ. kaŋ house not.yet finish do ‘The house has not finished (being built) yet.’

4.3.2 Interrogative Sentences Mlabri simple negative sentences are built by inserting a preverbal negative particle, the same also counts for interrogative sentences. (432) Mlabri mɛh kɔbɔ ɟak 2sg neg go ‘Aren’t you going?’

lɛh. q

In Rumai, there are two different question particles used, namely lʌh for positive (433) and mɯ for negative (434) questions.

The Austroasiatic Languages: A Typological Overview (433) Rumai maj haːw lʌh pa 2sg go q loc ‘Do you go to work?’

111

jɯːŋ. work

(434) Rumai maj haːw mă mɯ 2sg go neg q ‘Do you not go to work?’

pa loc

jɯːŋ. work

4.3.3 Prohibitive A number of AA languages use a special marker for prohibitive expressions, which is different from the general negation marker. The negative imperative is often expressed by a clause initial prohibitive particle like in Chong, Khmer, and Khasi. (435) Chong cʰaː kʰɔ̄ ːŋ puk ma̰ːj proh eat thing rotten ‘Do not eat the rotten food!’

lɔː. disc

(436) Khmer (Huffman 1970: 174) kom dak mtèːh craən pèːk proh put chili much too.much ‘Don’t put too much chili on it.’

nɤh. excl

(437) Khasi wat leʲt proh go ‘Don’t go!’

In Semelai the prohibitive particle bɔy is used clause initially, but there is a difference between intransitive (438) and transitive (439) negative imperative clauses in that transitive verbs, unlike intransitive ones, host the irrealis marker ma= (Kruspe 2004: 337). (438) Semelai (Kruspe 2004: 336) bɔy ʔyotǃ proh return ‘Don’t go backǃ’

112

Jenny, weber and weymuth

(439) Semelai (Kruspe 2004: 337) ʔəɲǃ bɔy ma=kɔŋ proh irr=hit 1f ‘Don’t hit meǃ’

Kơho has a prohibitive marker, which co-occurs with the imperative marker.7 (440) Kơho baɲ ɲim taj ɲɔʔ tɛʔ proh cry more laugh imp ‘Don’t cry any more, laugh!

Negative imperatives in Danau include the default negative prefix lɐ- and an additional prohibitive suffix -nɐʔ on the verb. (441) Danau lɐ-kɨ̂-nɐʔ! neg-go-proh ‘Don’t go!’

There are also languages that use an overt second person pronoun subject. In Kammu this pronoun is optional. (442) Kammu tá (mè) 2sg.m proh ‘Don’t bite me!’

pók bite

ʔòʔ 1sg

4.3.4 Symmetry Many Austroasiatic languages have symmetric structures of positive and negative sentences such that, except the insertion of the negator, there is no difference between the positive sentence and its negative counterpart. This feature occurs also in Mon.

7 This is common in other AA languages as well, such as Khmer and Vietnamese (Franklin Huffman p.c.).

The Austroasiatic Languages: A Typological Overview (443) Mon nɔʔ ʔuə ʔəpat hətaʔ week in.front prox 1sg ‘Next week I will go to Yangon.’

ʔa go

(444) Mon ʔəpat hətaʔ nɔʔ ʔuə hùʔ week in.front prox 1sg neg ‘Next week I will not go to Yangon.’

113

ʔəkɤ̀ŋ. Yangon

ʔa go

ʔəkɤ̀ŋ. Yangon

As already mentioned, in many Munda languages the negator gets the person marking from the main verb (see also (422)). This leads to an asymmetry like in Kharia, where the negative marker is um. (445) Kharia (Peterson 2011: 335) juŋ=oʔ=may. ask=act.pst=3pl ‘They asked.’ (446) Kharia (Peterson 2011: 335) umay8 juŋ=oʔ. neg.3pl ask=act.pst ‘They didn’t ask.’

8

In Rumai symmetric structures are very common. However, the future marker nɐŋ cannot occur in negative sentences. (447) Rumai ʔɐwː nɐŋ dih caʔuʔ 1sg fut read book ‘I will read this book.’

niːn. prox

(448) Rumai ʔɐwː (*nɐŋ) dih mă 1sg fut read neg ‘I will not read this book.’

caʔuʔ book

8 from *um=may ‘neg=3pl’ (Peterson 2011: 335).

niːn. prox

114

Jenny, weber and weymuth

4.4 Demonstratives The demonstrative systems found in AA languages vary from simple twoway distinctions to elaborated multi-dimensional systems. Car has only two degrees of distance, proximal and distal, but different forms are used according to animacy and number. In the case of dual animate, no difference seems to be made between proximal and distal, and some of he distal forms look like derivates of the corresponding proximal demonstratives. (449) Car Proximal ŋɔ́ h sg.anim véː pl.anim náː du.anim ŋíh sg.inan nɛ̃ ́ pl.inan

Distal ŋamɔ́ h mɛ̃ː́ ná\ː ŋamúh minɛ̃ː́

sg.anim pl.anim du.anim sg.inan pl.inan

Three grades of distance are found for example in Sedang, Mon, and Kharia. They include proximal (near the speaker), medial (near the hearer or medium distance from both the speaker or hearer, depending on the language), and distal (away from both hearer and speaker, may be visible or invisible, often non-specific). (450) Sedang ko̰ prox mɛ medl ta̰ dist (451) Mon nɔʔ prox kɔ̀ h medl tɤʔ dist (452) Kharia (Peterson 2011: 183) u prox ho medl hin, han dist

In Kui, the three degrees of distance indicate proximal, distal-visible, and distal-invisible.

The Austroasiatic Languages: A Typological Overview

115

(453) Kui niː prox to̤ ʔ dist tɨh nvis

While many languages use the demonstratives exophorically and anaphorically, Kui has a special set of anaphoric demonstratives. (454) Kui nɛŋ ana.prox ʔɛŋ ana.dist ʔɛɲ ana.emph

Kơho makes a five-way distinction, with two medial demonstratives and the distinction between visible and non-visible. The non-visible demonstrative also serves as anaphoric. Similarly Semaq Beri distinguishes five degrees of remoteness, with three distal demonstratives, including one for remote places, usually on a vertical axis, and one for invisible referents. The medial demonstrative in Semaq Beri also serves as anaphoric. (455) Kơho dɔ nɛ dɛn/gɛn daʔ həʔ

‘this’ (closer to speaker; the first of a pair of objects being compared or contrasted) ‘that’ (object not proximate but equidistant from speaker and addressee) ‘that’ (closer to addressee, not speaker) ‘that’ (the second of a pair of objects being compared or contrasted) ‘that’ (not visible, spatially or temporally; anaphoric, indicating old information)

(456) Semaq Beri9 nɔ̃ ʔ prox təʔ medl ʔin dist.vis ʔɛn dist.vert hnɛ̃ʔ nvis 9 The Semaq Beri demonstrative system is not fully understood yet and the labels used here are tentative.

116

Jenny, weber and weymuth

In Kammu the demonstratives differentiate four degrees of closeness in relation to the speaker and listener, as well as a height dimension indicating a location higher or lower than the speaker or speech situation. (457) Kammu kì prox nìʔ medl nàːj dist1 hóʔ dist2 ɗɨ́ŋ dist.up súʔ dist.down

close to speaker close to listener away from speaker and listener far away higher than speaker lower than speaker

Similar systems are found in Khasi, which has six degrees of remoteness and displacement on the vertical axis. In Khasi, the demonstratives are combined with the noun class markers or prepositions to form different kinds of demonstrative expressions. (458) Khasi ne prox to medl taj dist ta nvis tej dist.up tʰie dist.down

Closely related Pnar does not seem to make a distinction on the vertical axis, but differentiates five degrees of remoteness from the deictic center. (459) Pnar ni te tu taj tɛ

prox medl dist1 dist2 dist3

within reach of speaker just out of reach, visible slightly farther away than te quite far away, possibly out of sight, but known to speaker and hearer out of sight, potentially unknown to speaker and hearer

Pacoh makes an eight-way distinction, including fore/higher, aft/lower and beside for medial and distal demonstratives. In the proximal degree, these distinctions are neutralized for obvious semantic reasons. The first medial

The Austroasiatic Languages: A Typological Overview

117

degree, which is also neutralized in terms of the vertical and horizontal axis, is presumably used also in anaphoric function. It it noteworthy that the second medial and distal degrees are distinguished iconically by vowel length alone.10 (460) Pacoh Distance prox medl1 medl2 dist

Neutral ʔnnɛh / nɛh ʔŋkoh / koh

Fore/Higher

Aft/Lower

Beside

ʔntih ʔntiːh

ʔnto̰ h ʔnto̰ ːh

ʔntrah ʔntraːh

4.5 Pronominal Systems 4.5.1 Personal Pronouns The systems of personal pronouns in AA languages fall broadly into two types. On the one hand, there are languages with minimal grammatical distinctions in their systems but elaborate honorific distinctions. Real personal pronouns are often avoided in these languages and replaced by kinship or professional terms, or terms designating a social status or function. In many cases these terms are semantically bleached and have come to be generally used like real pronouns, though the original lexical meaning is usually still present. One case in point is the polite Khmer first person pronoun khɲom, which originates in the homonymous noun meaning ‘slave, servant’. On the other hand, there are languages with purely grammatical sets of pronouns, often distinguishing singular and plural, in many cases also dual, and inclusive vs. exclusive in the nonsingular first person prounouns. A number of languages have reduced sets of pronouns, with number distinction only in the first person. A more detailed overview and historical study of AA personal pronouns is given in Pinnow (1965). Examples of the first (honorific) type of pronominal system, if they may be called this, are Khmer and Vietnamese.

10 A similar phenomenon is sporadically encountered in other AA languages, e.g. in Mon, where a remote distal is formed by lengthening the vowel of the distal, that is, tɤʔ ‘that’ becomes tɤːʔ ‘that far away’. There is no way to indicate this distinction in Mon orthography, and the form is not generally accepted as ‘correct’.

118

Jenny, weber and weymuth

(461) Khmer 1st sg: khɲom ʔaɲ ʔaːtmaː tùːl-bɔŋkùm 1st pl: jɤ̀ːŋ 2nd: nɛːək nɛək-srɤj ʔaeŋ lòːk lòːk-srɤj chan 3rd: kɔ̀ ət vɪ̀ːə kèː prɛəh-ʔɔŋ

speaker is acting humble or polite speaker is arrogant or on intimate terms with addressee speaker is a monk speaker addressing royalty neutral addressee is younger or of lower status addressee is a woman of middle class addressee is of much lower status addressee is older or of higher status addressee is a married female of higher status speaker and addressee are both monks referent is respected other referent is disrespected other referent may be any other referent is royalty or clergy

Grammatical pronominal systems may include as few as four basic forms, as seen in Mon, where only the first person distinguishes between singular and plural. In the second and third persons a secondary distinction of politeness is found. (462) Mon 1sg 1pl 2 3

11

ʔuə poj pèh ɗɛh

(informal ɓɛ̀ʔ;11 formal mənɛ̀h) (honorific ɲèh)

In other languages, all persons distinguish singular and plural. This is seen for example in Danau. (463) Danau sg 1 ō 2 mə̀ 3 ɐn

pl ē pɯ̄ khɯ̀

11 The intimate form ɓɛ̀ʔ is a special use of the referential marker/dummy nominal head. It is considered rude by most educated speakers.

The Austroasiatic Languages: A Typological Overview

119

Mlabri distinguishes three numbers in the first and second persons. The plural forms are built from the dual by adding tʰɤŋ ‘five’, which is also used as a plural marker. (464) Mlabri sg 1 ʔoh 2 mɛh 3 mɔj

du ʔ ah ba h mɔj

pl ʔah thɤŋ bah thɤŋ mɔj

One frequently found pronominal system in AA languages makes a three-way number distinction and have different forms for inclusive vs. exclusive first person non-singular. This is exemplified by Dara’ang, Sedang, the Northern Aslian Ceq Wong, and Mundari. Note that in Sedang dual and plural are neutralized in the second person. (465) Sedang 1excl 1incl 2 3 (466) Dara’ang

sg ʔa̰

du ma̰ pa̰ ʔɛh pɔ̰ ga̰ prḛj

mε ʔăn

du j aj Ɂaj baj ɡaj

pl je Ɂe be ɡe

(467) Ceq Wong sg 1excl ʔiŋ 1incl 2 mɨʔ̃ 3 ʔuh

du jah h ay juh teʔ

pl jaʔ ( jaʔ paʔ) hɛʔ jin gən

1excl 1incl 2 3

sg Ɂo

pl ŋin pin pɔ̰ w aj

120

Jenny, weber and weymuth

(468) Mundari (Osada 2008: 109) sg du pl 1excl añ aliŋ ale 1incl alaŋ abu 2 am aben ape 3 aeʔ akin ako

A number of languages make further distinction of masculine vs. feminine in the second and or third person. One example is the Northern Aslian Menriq, which neutralizes number distinctions in the second person. (469) Menriq 1excl 1incl 2m 2f 3

sg ʔəɲ

kɛ

pl yɛ hɛ

heʔ ja

gi

Kammu makes a gender distinction in the second and third person, but does not have the inclusive vs. exclusive distinction. In the second person dual and plural the gender distinction is neutralized, as is the number distinction in the third person masculine. (470) Kammu sg 1 ʔòʔ 2m mè 2f pà 3m kə̀ 3f nà

du ʔàʔ s.pá s.pá kə̀ s.ná

pl ʔìʔ pɔ̀ pɔ̀ kə̀ nɔ̀

Standard Khasi distinguishes three genders (masculine, feminine, neuter/ diminutive) only in the third person singular, while Pnar makes the distinction in second and third person singular. The basic personal pronouns of the third person are identical to the noun class markers. The second person in Khasi does not have a number distinction.

The Austroasiatic Languages: A Typological Overview (471) Khasi 1 2 3m 3f 3n (472) Pnar

1 2m 2f 3m 3f 3n

sg ŋa pʰi u ka i

121

pl ŋi ki ki ki 12

Topic/oblique sg pl ŋa i me phi pha phi o12 ki ka ki i ki

Default/post-verbal sg pl ɔ i mi phi phɔ phi u ki kɔ ki i ki

The most complex set of personal pronouns is probably found in the Nicobarese languages, as exemplified here by Car. In Car, there are different forms of pronouns for different syntactic functions, including subordination and interrogatives, and the pronouns can be contracted with subordinators and other grammatical markers, such as negators and modals. The following list only shows the non-past forms of the pronouns of Car. A fuller discussion can be found in the chapter on Car in this publication. (473) Car 1sg 1du.excl 1du.incl 1pl.excl 1pl.incl 2sg 2du 2pl

sbj cin(i) ʔan ʔan ʔin ʔin man(a) nan jin(a)

inter ʔəcu ʔajə ʔəjah ʔəjih, ʔihə ʔəjiʔə ʔəmɛh ʔanah ʔəj

sub ʔəc ʔəʔaj ʔəj ʔəj ʔəm ʔan ʔəj

obl cu ʔaj haːʔ ʔih hiːʔ mɛh naːn jiːʔ

12 The third person singular forms masculine o and feminine kɔ are used with animate referents. Inanimate referents take the regular forms u and ka.

122

Jenny, weber and weymuth

3sg.hum 3sg.nhum 3dl 3pl.hum 3pl.nhum

sbj.vis ʔan(a) ʔən nan jin(a) nun

sbj.nvis ʔap, ʔəp nup nap jip nup

inter.vis ʔaŋ, ʔaŋah ʔən ʔac, naːn ʔac(a) ʔənuh

inter.nvis ʔap(a) ʔap(a) na p jip(a) nup(a)

sub ʔɔ ʔɛ nə nə nə

obl ʔɔ ʔɛ ca ca ʔu

4.5.2 Reflexives and Logophorics Besides reflexive morphology, as found in Nicobarese and Munda languages, a number of AA languages have special reflexive pronouns that are used to refer to the subject of the same clause. They appear in different functions, including object and possessive, as seen in the examples from Kammu and Mon, where the noun həkaoʔ ‘body’ takes the function of a reflexive pronoun. This use is already attested in Old Mon. (474) Kammu kə̀ lə̀ tòk k.sés róːŋ tè 3sg.m then caus fall mouth.harp refl ‘He then dropped his mouth harp (by mistake).’ (475) Old Mon (Shorto 1971: 123) mirmas guṇ ma smiṅ ʔiñcim jirku. remember favour rel king sustain body ‘He remembered the favors with which the king had sustained him.’ (476) Mon ɲèh thiəŋ.həjaʔ kɛ̀h rɔ̀ ə həkaoʔ 3h think say friend body ‘He thought that his friend was dead.’

khjɒt die

ʔa go

jaʔ. nsit

Besides non-subject functions, these reflexives can also have the role of the subject if they have the same referent as the main clause or a preceding clause. The subject does not have to overtly expressed. This function is known as logophoric or long distance anaphora (see Huang 2000). In Kammu, this function is available for all persons, while in Mon it is restricted to the second and, more commonly, third persons. (477) Kammu tè cə̀ pə́ pɨ̀an pə̀ʔ màh mɨ̀ jàːm jʌ̀ cry ins refl irr neg get eat rice day ‘[I] am crying because I will not be able to eat anything today.’

s.kí today

The Austroasiatic Languages: A Typological Overview

123

(478) Mon kɔ̀ h. kɤ̀ʔ tɛ̀h ɓɛ̀ʔ kon ŋèə həkaoʔ klàj get hit ref offspring frog body seek medl ‘He got the little frog he was looking for (by coincidence).’

Palaung (Shwe) has a pronoun de, which is probably related to the Kammu reflexive tè. This pronoun is used in the second and third persons, both in subject and non-subject function. When the subject of a clause is repeated, it is usually expressed by de if it is second or third person. (479) Palaung (Milne 1921: 19) pɛ hɲʌm bʌp 2sg not.yet necessary ‘You don’t have to go yet.’

de refl

(480) Palaung (Milne 1921: 37) ge ʔɤ bɛ de 3pl prox able refl ‘They can work.’

rɤr. do

lɔh. go

The form and function of the Palaung and Kammu forms are similar, but not identical, to the Old Mon third person pronoun ḍeḥ, which, according to Shorto (1971: 137f) is used “usually resuming or referring to subj. of statement, of first of series of statements, or head cl. of statement”. In modern Mon, the reflex of Old Mon ḍeḥ, that is, ɗɛh, is the normal form of the general third person pronoun. 4.6 Classifiers Jones (1970) argues that classifier systems in AA languages are not native to the family but borrowed from neighboring languages such as Thai, Burmese, and Chinese. Adams (1989; 1991) elaborates on Jones’s claims and identifies possible sources of a number of classifier constructions in the eastern groups of Austroasiatic. She concludes that “classifiers have been in AA languages long enough for them to possess many features which are not derivative but are unique to the language family” (Adams 1991: 75). Though classifiers may be part of the genetic inheritance of AA languages, the individual languages show a variety of structures, some arguably influenced by neighboring languages, and a number of AA languages do not make regular use of classifiers at all. In the case of Khasian, there is a typologically rare interplay of classifier and gender or noun class systems.

124

Jenny, weber and weymuth

Vietnamese shows the structure num-clf-n, as in example (481). (481) Vietnamese Uyên mổ ba con gà pn cut.open three clf chicken ‘Uyên kills three chickens in the sink.’

trong inside

la.va.bô. sink

The same structure is found in Sedang, Pacoh and Bunong, with demonstratives following the quantifier phrase. (482) Sedang rəta̰m pṵn ŋḛ four clf young.man ‘four young men’ (483) Pacoh bar lam ʔakaj nɛh three clf child prox ‘These three children.’ (484) Bunong (Vogel 2006: 26) pa̤ːr po̤ ːk rəwɛh two clf elephant ‘two elephants’

Car uses a small number of classifiers, including taka ‘people’, manɨ́k ‘children’, máːʔ ‘plants or trees’, nɔŋ ‘animals; long, slender objects’, and kaŋɛ́n ‘kind, type’, among others. The word order here also is num-clf-n. (485) Car héŋ tahɔ́ ːl kúk one pair coconut ‘One pair of coconuts.’ (486) Car héŋ taka ək misíːʔ tafísi tə one person 3sg.pst pst widow rel ‘Once there was a widow who had four children.’

fɛ́ːn four

manɨ́k clf

kúːn child

The Austroasiatic Languages: A Typological Overview

125

In Khasi, the word order in the quantifier phrase is identical, but the demonstrative determiner precedes the quantifier phrase. Unlike other classifier languages, Khasi uses only two numeral classifiers, one for animate and one for inanimate nouns. Their use is obligatory when counting from ‘two’ onwards, although they are optional with spaʔ ‘100’ and haʤaːr ‘1000’ and not used with lak ‘100,000’ or ʤur ‘pair’. In addition, Khasi employs a three-way noun class system, which is neutralized in the plural. (487) Khasi ki-ne laːj pl-prox three ‘these three girls’

ŋut clf

(488) Khasi ki-ne saːw tɨlli pl-prox four clf ‘these four villages’

ki=kɨnthej pl=female

ki=ʃnoŋ pl=village

In other languages, including Kammu and Mlabri, the order of elements in quantifier phrases is n-num-clf. (489) Kammu cḿ.rɔ̀ ʔ pàːr man two ‘two men’

kòn clf

(490) Mlabri chɯ cəbut bɛr buy pig two ‘buy two pigs’

klɔʔ clf

The same order is also found in Old and modern Khmer, Kơho, and Kui. Demon stratives follow the quantifier phrase. In modern Khmer, the use of numeral classifiers is not obligatory, at least not in all cases, as will be shown below. (491) Old Khmer kantai pi woman three ‘three women’

s9at being

126 (492) Khmer baːrɤj pɪ̀ː cigarette two ‘two cigarettes’

Jenny, weber and weymuth daəm clf

(493) Kơho caw baːr naɁ two clf person ‘two women’

Ɂuːr woman

(494) Kui cuəh baːr naʔ man two person ‘those two men’

ʔɛŋ ana.dist

Old Khmer has the alternative order n-clf-num, as in example (495). (495) Old Khmer ’aṃc9as ’na elder person ‘four elders’

pvan four

In Bugan, when the quantity of the noun is one, it can be directly modified by bɔ̱⁵⁵ ‘one, classifier’, which is a fusion of num + clf. In other cases, the word order is n-num-clf. The head noun may be separated from the quantifier expression by a verb, as seen in (498). (496) Bugan wɔ̱³³ bɔ̱⁵⁵ stone clf ‘one stone’ (497) Bugan piau¹³⁽³⁵⁾ bi³¹ pau³¹ person two clf ‘two persons’ (498) Bugan li⁵⁵ ka̠i³³ mi³³ biə³³ ox have five clf ‘There are five oxen.’

127

The Austroasiatic Languages: A Typological Overview

In Kammu, the quantifier phrase may be separated from the head noun by verbal and adverbial constituents and appear at the end of the clause. (499) Kammu kə̀ píɲ k.néʔ háːn tà 3sg.m shoot rat die loc ‘He shot one rat to death in my house.’

kàːŋ house

ʔòʔ 1sg

mòːj one

tó clf

A number of AA languages do not make obligatory use of numeral classifiers. This is common in Chong and Kammu, among others. Kruspe (2004: 206) states that the “set of classifiers in Semelai is small and rigid, and their use is not obligatory. The small inventory does not result in many entities employing the same classifier, but rather that many entities are simply not associated with a classifier.” (500) Chong j9aːŋlōp pʰa̰ːj eraser two ‘two erasers’ (501) Chong pʰa̰ːj cʰɔ̄ ː two dog ‘two dogs’ (502) Kammu túːt crìʔ mòːj tree banyan one ‘one big banyan tree’

(túːt) clf

nám big

Mon uses classifiers only very sparingly. The only case where the use of a classifier is quasi-obligatory is in counting monks, in which case the noun həkaoʔ ‘body’ is used as a classifier. With other nouns, no classifier usually occurs, but the word order is different for common nouns (n-num) and measures (num-n). In some AA languages the classifier can be the repetition of the head noun, as seen in Kammu, or a derivate form thereof, as in Old Khmer.

128 (503) Kammu t.lɔ̀ ŋ nám log big ‘one big log’

Jenny, weber and weymuth

mòːj one

t.lɔ̀ ŋ log

(504) Old Khmer sre moy sanre field one field ‘one rice field’

4.7 Case and Аdpositions 4.7.1 Case Мarking and Аdpositions Most AA languages do not distinguish cases morphologically, but grammatical and semantic relations are indicated by free markers, usually preposistions, verbal and nominal semantics, as well as word order. An exception are the Munda languages with their extensive agglutinating morphology, which uses prefixes and suffixes to express case relations (see Anderson (this publication) for an overview). In Nicobarese, prefixes and free forms are used to indicate semantic relations (Car i ‘loc’, ɛl/ɛl- ‘loc’, tə/tə- ‘general linker, oblique’). Like other aspects of Nicobarese grammar, the functioning of the prefixes and prepositions is still poorly understood. In Khasi, arguments are marked as accusative, locative/dative, instrumental, or other by means of prepositions, as seen in the following examples. The prepositions occur before the noun class markers, which are cliticized to the noun. (505) Khasi ŋa pʰaʔ ja ki=kʰuːn 1sg send acc pl=child ‘I send the children to school.’

ʃa all

(506) Khasi ŋi-n aːj ja ka=kot 1pl-fut give acc f=book ‘We will give the book to you.’

ka=skul. f=school

ha loc

pʰi. 2

The instrumental is also used with verbs to achieve temporal (simultaneous) expressions. In this case the instrumental preposition intervenes between the tense marker and the verb.

129

The Austroasiatic Languages: A Typological Overview (507) Khasi da tʰjaʔ katba u la while m pst ins sleep ‘while he slept they went there.’

ki pl

la pst

leʲt go

haŋtaj. there

In Semaq Beri, the locative preposition can be combined with nouns and pronouns. It is also used to express recipients, addressees, and direct objects with some verbs. (508) Semaq Beri ʔəɲ gaʔ dəm haʔ bri  1 sg imm sleep loc forest  ‘(I’m) going to sleep outside.’ (509) Semaq Beri ʔəɲ knal 1sg know ‘I know you.’

haʔ loc

ja 2sg.f

Other prepositions in Semaq Beri indicate a goal (gaʔ ‘allative’) or source (tən ‘ablative’). Relator nouns may also be used instead of prepositions to express more specific relations, as in the following example. (510) Semaq Beri kɛ kʰom kloc bri 3sg sit inside forest ‘He stayed inside the forest.’

General locative preposistions, covering allative and locative (including more specific notions like inessive, adessive, superessive, etc.) are frequently found in AA languages. (511) Mon paʔ mùʔ~mùʔ rao, ɗɔə càt kɔ̀ h. do what~red q loc performance medl ‘What do you do in this theater?’ (512) Kơho dilah gəs jət naʔ caw tɑːm rəndɛh, gɛn hat if exist ten clf people loc vehicle then crowded ‘If there are ten people in the car, then it is very crowded.’

ŋan very

130

Jenny, weber and weymuth

(513) Mlabri ŋ̊ uh (ni/ti) sit loc/loc ‘be at home’

gɛŋ house

(514) Pnar ha ka=ʧi-snɛm, ʧi-sen ʧi-snɛm ʤoʔ laj pʰi nɛ m̩ loc f=one-year one-time one-year same go 2pl tag neg ɛm nɛ, mɔ have tag q ‘once a year also, you go or not?’

In Kơho and Kammu, the instrumental is expressed by a preposition, which in Kơho is also used to mark the agent in passive constructions. The preposition glossed instrumental in Kammu is also used to indicate the recipient in ditransitive expressions and is probably a general oblique marker. (515) Kơho mpoːŋ gə-paːʔ mə caːl. door pass-open ins wind ‘The door was opened by the wind.’ (516) Kammu ʔòʔ ʔùːn k.múːl jʌ̀ 1sg give money ins ‘I handed money to him.’

kə̀ 3sg.m

Danau has a general oblique marker mɛʔ, which combines with more specific locative expessions (relator nouns), or with prefixed role markers to form complex prepositions. (517) Danau mə̀ tʰɯ̄ mɛʔ kɔŋ 2sg sit obl inside ‘He is sitting in the house.’

ɲɐ̄ house

The Austroasiatic Languages: A Typological Overview

131

(518) Danau kʰæ̃ ̂ wæ̃ ̂ sɐʔouʔ hō hɔʔ (kʰɐũ ̂) ō æ̃ ̂ kɨ̂ mɛʔ tǝ-sʰɐĩ purp buy book exist cop (because) 1sg fut go obl all-shop ‘I will go to the shop in order to buy a book.’ [‘. . . because there is a need to buy a book.’]

The oblique marker is also used with adverbial expressions. (519) Danau ɐn tin ljɐk mɛʔ lonì kô 3sg sleep arrive obl now newinf ‘He’s still asleep!’ [lit. ‘He is sleeping till now!’]

Similar to Danau, Mlabri combines general prepositions with specific relator nouns to form complex adpositions with concrete or more specialized meaning. Interestingly, in Mlabri the relator noun follows the lexical noun rather than preceding it. The result is bipartite adpositions with parts before and after the noun. (520) Mlabri ʔac ŋ̊ uh ni toʔ bird sit loc table ‘The bird is on the table.’

lətɤŋ top

Mon also uses complex prepositions, but in this case the general oblique marker follows rather than precedes the specific preposition. Prepositions in Mon can be used with nouns and pronouns as well as clauses. (521) Mon ɗɛh klɤŋ nù kɒ phɛ̀ə. 3 come abl obl monastery ‘He is coming from school.’ (522) Mon ʔe ɗɔə kɒ ɗɛh həpak ceh kɔ̀ h ʔaŋkəlòc kəpɒc eh loc obl 3 stab descend medl English dash phjeh na kɔ̀ h khjɒt ʔɒt.tah. caus.descend caus.go medl die completely ‘Now when [the airplane] came crashing down, the English struck it down, they died all.’

132

Jenny, weber and weymuth

In Kơho, postpositions are used as in addition to prepositions. This is the case for dative and possessive relations, as illustrated in the following examples. (523) Kơho kʰaj ʔaj tərnɑːm rəpu ʔin. 3sg give rice.wine buffalo dat ‘S/he gave rice wine to the water buffalo.’ (524) Kơho koɲ ʔaɲ sen.gar ʔəsɛh caw pɑː uncle 1sg take.care.of horse people chief ‘My uncle takes care of the village chief’s horse.’

ɓɔn village

ʔin. dat

(525) Kơho sraʔ kɔn de book child poss ‘the child’s book’ (526) Kơho sraʔ ʔaɲ ɟe book 1sg poss ‘my book’

As postpositions are unexpected in verb-medial Kơho, which also uses regular prepositions, the occurrence of these needs an explanation. Sidwell (2000: 87) reconstructs a form *ʔin to proto-South Bahnaric as an ‘auxiliary particle’, with the function of ‘demonstrative, benefactive particle’, but he does not list the Kơho form. Semantically a postposed demonstrative as source of a benefactive marker would not be impossible. No source for the possessive marker can be established at the present state of research. 4.7.2 Directive (Goal) Marker and Linker A number of AA languages belonging to groups far apart have a grammatical marker that shows striking similarity in shape and function. It can be used as preposition, usually indicating a goal, or conjunction, or a general linker between connected elements. The phonological shape is a dental stop followed by a vowel. It is possible (or even likely) that these markers in different languages have unrelated origins, but the general meaning is very similar across the groups and could be compared to the range of functions of English ‘to’. In the individual grammars the respective morphemes are variously glossed as

The Austroasiatic Languages: A Typological Overview

133

linker, marker of goal, dative, locative, allative or general oblique. In Old Khmer here are two distinct forms, ta ‘general relator’ and ti ‘directional preposition, benefactive, purposive marker’, which may be unrelated to each other but share at least some of their functions. In Old Mon, the preposition ta marks benefactive and directional relations (Shorto 1971: 138). (527) Old Khmer vraḥ kñuṃ ’aṃnoy kur9ak hv9ar ta slave given kurāk pn link divinity ‘slaves given by the kurāk of Hvār to the divinity’ (528) Old Khmer nau ta yokk neḥ ta roḥ neḥ ti pre k9ap thpvaṅ top link take prox link manner prox link order chop head ‘Of those who take these aforesaid—[the executioner] shall be ordered to cut off [their] heads.’

A marker dɨ is found in Dara’ang, which covers a wide range of functions, including marking of the recipient and as connector between two verbs in multi-verb predicates. Similarly, Mlabri uses di to connect verbs in multi-verb predicates, sometimes with purposive reading. In Mlabri, the same marker is also used to indicate possessive relations. (529) Dara’ang daraɁaŋ ɲɨm tauh boŋ Ɂakja kɨn~kɨn dɨ pən dɨ pn believe tell make merit lot~red goal get goal houɁ məŋdεŋ ascend heaven ‘Dara’ang people believe (that) making lots of merit will make them be able to go to heaven.’ (530) Dara’ang Ɂăw pən dɨ kauɁ Ɂăn neg get goal bite 3sg ‘[The bear] was not able to bite him.’

134

Jenny, weber and weymuth

(531) Mlabri [β] (Rischel 1995: 193) kibi bi:ʔ, boŋ di chɛʔ neg satisfied eat link much ‘If you aren’t full yet, eat some more!’

Car has a general ‘relator’ morpheme, which is used as subordinator and oblique preposition. It’s form and function is similar to the Jahai relative marker t-, but the latter attaches only to word-level units, not to clauses or phrases (Burenhult 2005: 124). (532) Car vɛ́ːɲə náŋ cɔ́ ːn tə́ cu tell ear John link 1sg.poss ‘Tell John that I will be there.’ (533) Car mɨ́kka ən ŋíh look 3sg.prox prox.sg.inan ‘This book is looked after by Jane.’

min fut

líːpərɛ book

ŋ ac finish

tə link

umúh loc.dist

céːn pn

Munda languages like Mundari and Kharia have oblique or instrumental postpositions or suffixes of the shape -te, which is attached to nominal or verbal elements. In Danau, the prefix (or proclitic?) tə- indicates an allative relation, and in the languages further east there are similar markers for locative relations, in some cases including allative. Palaung (Shwe) has the allative preposition ta, besides the general linker (and future marker) di. (534) Danau kʰæ̃ ̂ wæ̃ ̂ sɐʔouʔ hō hɔʔ (kʰɐũ ̂) ō æ̃ ̂ kɨ̂ mɛʔ tǝ-sʰɐĩ purp buy book exist cop (because) 1sg fut go obl all-shop ‘I will go to the shop in order to buy a book.’ [‘. . . because there is a need to buy a book.’] (535) Pacoh nam ʔarbaŋ bɔː kɨː ləjʔ if weather rain 1sg neg ‘If it’s raining, I can’t go home.’

boːn get

cho̰ ː return

tuː to

duŋ home

The Austroasiatic Languages: A Typological Overview

135

(536) Kơho kʰaj cəm luʔ te/de sɔ. 3sg throw rock loc/loc dog ‘He throws rock at the dog.’ (537) Kơho ʔaɲ ʔəm te/de 1sg be.at loc/loc ‘I am in the doorway.’

bər mouth

mpoːŋ door

(538) Kammu ta ̀ nɔ̀ jèt réʔ. 3pl stay loc field ‘They were staying in the field.’

Much remains to be explored in regard to the directive markers or general linkers, and a more in-depth analysis must be based on text usage of the respective elements in the languages where they appear. Possibly there was a linker morpheme in proto-AA which developed in different directions in the individual languages, surfacing in a variety of similar but different forms and functions. 4.8 Clause and Sentence Particles Sentence particles are an important part of the make-up of any AA language. These particles fulfill a wide range of functions, which are in most cases not easily describable. The main functions of sentence particles are to convey speakers attitude towards the event described or the interlocutors, express illocutionary force, and structure the information flow. Mon and Semelai have sentence or utterance final particles indicating that the sentence is reported speech, without the original speaker obligatorily mentioned. In Mon the particle kɛ̀h from an all but obsolete verb kɛ̀h ‘say’ is used in this function, which also rarely occurs as complementizer with verbs of saying, hoping, and cognition. Unlike Mon kɛ̀h, Semelai kʰləŋ can be used with or without overt speaker. A similarly functioning particle is found in Danau, which, like Mon, is exposed to Burmese influence. The origin of both the Mon and Danau reportative markers may be a replication of the corresponding Burmese sentence final reportative tɛ́ ‘it is said, they say’.

136

Jenny, weber and weymuth

(539) Semelai (Kruspe 2004: 406) ʔnɔʔnɔʔ, kʰləŋ. sak tah, kʰləŋ, bɔs peel q quot sugar.cane prox quot ‘Aren’t you going to peel, (he) asked, this sugar cane? (he) asked.’ (540) Mon pɤŋ hwaʔ tom lɔ̀ swak mənìh plàj cooked.rice curry cook deposit for human young.man pɔn kɔ̀ h lɛ cut lɔ̀ kjìʔ noŋ kɛ̀h. four medl add put deposit poison asrt say ‘She had put poison in the rice and curry for the four young men, it is said.’ (541) Danau frɐ̄ hɔ̄ pɯ̄ t sənə̀ nɛ̀ holy preach come like.this rep ‘The Buddha has been preaching thus (I am told).’

Other sentence final particles are hard to classify at the present state of research. As their function is in most cases pragmatic and directly discourse related, more extensive and better annotated text material of the respective languages would be needed. In most cases the translations and descriptions available do not allow a more detailed description or analysis, and frequently the particles are glossed merely as such. The following examples are given to illustrate different uses of sentence final particles found in the available material. (542) Bugan ʑau³¹ ɣau³¹.sau⁵⁵ ʦu³¹ mə⁵⁵ go read cont one ‘He has gone to school for one year.’ (543) Chong jɔʔ cʰ9an waj ʔih 1sg able neg disc ‘I’m not able [to do this].’ (544) Chong kaːn taː-lan kʰimaŋ work addr-pn disc ‘It might be Mr. Lan’s work.’

nam⁵⁵ year

ʔa³³ asrt

The Austroasiatic Languages: A Typological Overview (545) Danau nì=nǝ thɪn prox=top belong ‘Isn’t this yours?’

mǝ̀ 2sg

137

dē tag

(546) Danau bēj bɪn rɐ̀ khɯ̀ zɛ̄ others get afraid 3pl newinf ‘Everyone was afraid of them.’ [new information] (547) Danau ʤǝpæ̃ ɛ̆ cwɐn pʰū mɛ̀ Japanese meet exp asrt ‘(Of course) I have met the Japanese!’ (548) Danau frɐttǝrɐ̂ kʰɯ̀ =kɐ̄ ɐ̄cjɐ̀ damn 3pl=top strength ‘Damn, they were strong!’

ŋ̄ kò plenty

bjɯ̄ emot

5 Conclusion We have seen that the AA languages exhibit a great diversity in their structures, which is not surprising, given the geographic distribution and the known histories of the AA speaking peoples. As historical documents of some time depth are available for only a small number of AA languages (Khmer, Mon, Vietnamese), we have only synchronic language material to work with in most cases. The AA languages of Southeast Asia (including Khasic, but excluding Vietnamese) share a number of phonological features, such as sesquisyllabic words (also present in Gtaʔ), large vowel inventories (often including large numbers of diphthongs), and—to a lesser extent—register distinctions. The consonant inventories of these languages are more diverse but typically include two or three stop series (one of which is often an implosive series) and four nasals. The Munda languages resemble their South Asian neighbors in having retroflex consonants, full first syllables (i.e. no sesquisyllabicity) and smaller vowel inventories. Vietnamese, under Sinitic influence, has developed a complex tone system and monosyllabicity.

138

Jenny, weber and weymuth

There is not much that can be conclusively said about the morphosyntactic structure of the proto-language, unlike its vocabulary, which can be reconstructed with considerable certainty, at least in part. The three superficially distinct typological groups Munda (verb-final), Nicobarese (verb-initial) and the eastern languages (verb-medial) are not confirmed by other isoglosses and hardly constitute major divisions in the family. If we take the word order of the majority of AA languages, namely AVP, as the original order of the proto-language, the deviant word orders of the other two groups have to be explained. A change in word order can indeed be claimed to have occurred in Munda under areal influence, with all surrounding languages exhibiting consistent verb-final order. A number of Munda constructions suggest an earlier verbmedial (or possibly verb-initial) order. The same is the case in Nicobarese, where the order AVP is found in subordinate clauses. On the other hand, most or all eastern AA languages with verb-medial word order are located in areas with dominant verb-medial structure, so that it might as well be the eastern languages that have changed an original verb-initial (or verb-final) order under areal pressure. Yet, with little evidence of earlier non-verb-medial structures in the eastern languages, and firm evidence of verb-medial order in Munda and Nicobarese, it may be safe to take AVP as the original order of AA, with the possibility of pragmatic changes, including fronting of topical and focal elements, post-verbal subjects in certain information-structural contexts, and free omission of contextually retrievable arguments. Multiple clauses forming complex sentences, included embedded clauses, are in many AA languages juxtaposed asyndetically, as is not uncommon in nonliterary languages. The conjunctions that are found in the individual languages are frequently loans from neighboring dominant languages or transparently derived from lexical verbs or nouns. One general linker may be reconstructed, at least in its function, to the proto-language. This linker, which appears in many languages, covers a wide range of functions, which can be subsumed under the notion ‘directive, goal oriented, A relates to B’ or similar. This linker also functions as a preposition indicating locative or allative relations, or as a general oblique marker linking nouns to verbs, nouns, or clauses. Other relations are usually expressed by grammaticalized lexical verbs or nouns. The pronominal systems found in the AA languages suggest a three-way distinction in number (singular, dual, and plural), at least in the first and second person, and an inclusive vs. exclusive distinction in the non-singular first person pronouns. Demonstratives most frequently show three degrees of distance, though it is not clear whether this was true for the proto-language. The question whether proto-AA made use of classifiers is undecided. Classifiers occur in all branches of the family, including Munda and Nicobarese,

The Austroasiatic Languages: A Typological Overview

139

but the systems exhibit great variation and are in some cases clearly borrowed from (or at least influenced by) neighboring languages. In most cases classifiers play a marginal role in the grammatical make-up of the AA languages In terms of morphology, a number of derivational affixes can be firmly reconstructed to proto-AA, and it is evident that the language had a rich system of prefixes and infixes. Among the most widespread affixes are the causative labial prefix p-, pə-, the instrumental nominalizing infix -n-/-rn-, and the nominalizing prefix ʔi-/ʔə-. Suffixes occur only in a few AA languages and are probably of a more recent date. Inflectional morphology, such as person and tense marking as found in the Munda languages, seem to be more recent innovations. The picture that emerges of proto-AA is far from clear in most respects, and only more in-depth study of the individual languages, based on more extensive text material and better descriptive grammars that are available for most languages at the present, together with a better understanding of the history of the individual languages and the contact scenarios they entered into during their history, will shed more light on the structural development. 6 References Adams, Karen L. 1989. Systems of numeral classification in the Mon-Khmer, Nicobarese, and Aslian subfamilies of Austroasiatic. Canberra: Pacific Linguistics. ———. 1991. The influence of non-Austroasiatic languages on numeral classification in Austroasiatic. Journal of the American Oriental Society 111.1: 62–81. Anderson, Gregory D. S. 2004. Advances in Proto-Munda reconstruction. Mon-Khmer Studies 34: 159–184. ———. 2007. The Munda verb. Berlin & New York: Mouton de Gruyter. ———. 2013. The velar nasal. In Matthew Dryer & Martin Haspelmath (eds.), chapter 9. http://wals.info/chapter/9. ——— (ed.). 2008. The Munda languages. London & New York: Routledge. Anderson, Gregory D.S. & K. David Harrison. 2008. Remo (Bonda). In Gregory D.S. Anderson (ed.), 557–632. Banker, John, Elizabeth Banker, & Mơ. 1979. Bahnar dictionary: Plei Bong—Mang Yang dialect. Huntington Beach, CA: Summer Institute of Linguistics. Bequette, Rebecca. 2008. Participant reference, deixis, and anaphora in Bunong narrative discourse (MA thesis). Graduate Institute of Applied Linguistics, Duncanville, TX. Benedict, Paul K. 1972. Sino-Tibetan: A conspectus. Cambridge: Cambridge University Press. Bennett, Fraser J. 1994. Iambicity in Thai. Studies in the Linguistic Sciences 24.1: 39–57.

140

Jenny, weber and weymuth

Bhattacharya, Sudhibhushan. 1975. Studies in comparative Munda linguistics. Simla: Indian Institute of Advanced Study. Bisang, Walter. 1992. Das Verb im Chinesischen, Hmong, Vietnamesischen, Thai und Khmer. Tübingen: Gunter Narr. Bishop, Nancy. 1996. A preliminary description of Kensiw (Maniq) phonology. MonKhmer Studies 25: 227–254. Blust, Robert. 1988. Austronesian root theory: An essay on the limits of morphology. Amsterdam: Benjamins. Brunelle, Marc. 2009a. Diglossia and monosyllabization in Eastern Cham: A sociolinguistic study. In James Stanford & Dennis Preston (eds.), Variation in indigenous minority languages. Amsterdam: Benjamins, 47–75. ———. 2009b. Tone perception in Northern and Southern Vietnamese. Journal of Phonetics 37: 79–96. Burenhult, Niclas. 2005. A grammar of Jahai. Canberra: Pacific Linguistics. Chantrupanth, Dhanan, & Chatchai Phromjakgarin. 1978. Khmer (Surin)–Thai–English Dictionary. Bangkok: Chulalongkorn University Language Institute. Comrie, Bernard. 2007. Areal typology of Mainland Southeast Asia: What we learn from the Wals maps. Manusya 13: 18–47. Costello, Nancy A. 1971. Ngữ-vựng Katu: Katu Vocabulary. Saigon: Department of Education. Dempwolff, Otto. 1934. Vergleichende Lautlehre des austronesischen Wortschatzes. Band I. Induktiver Aufbau einer indonesischen Ursprache. Berlin: Reimer. DiCanio, Christian T. 2009. The phonetics of register in Thakhian Thong Chong. Journal of the International Phonetic Association 39.2: 162–188. Diffloth, Gérard. 1980. The Wa languages. Linguistics of the Tibeto-Burman Area 5.2: 1–182. ———. 1984. Dvaravati Old Mon language and Nyah Kur. Bangkok: Chulalongkorn University. Dryer, Matthew & Martin Haspelmath (eds.). 2013. The World Atlas of Language Structures Online. Leipzig: Max Planck Institute for Evolutionary Anthropology. http://wals.info Donegan, Patricia J. 1993. Rhythm and vocalic drift in Munda and Mon-Khmer. Linguistics in the Tibeto-Burman Area 16.1: 1–43. Donegan, Patricia & David Stampe. 2004. Rhythm and the synthetic drift of Munda. In Rajendra Singh (ed.), The Yearbook of South Asian Languages and Linguistics. New Delhi: Thousand Oaks, 3–36. Enfield, N.J. 2003. Linguistic epidemiology. Semantics and grammar of language contact in mainland Southeast Asia. London & New York: RoutledgeCurzon. Enfield, N.J. & Gérard Diffloth. 2009. Phononogy and sketch grammar of Kri, a Vietic language of Laos. Cahiers de Linguistique—Asie Orientale 38.1: 3–69.

The Austroasiatic Languages: A Typological Overview

141

Ferlus, Michel. 1997. Le maleng brô et le vietnamien. Mon Khmer Studies 27: 55–66. Ghosh, Arun. 2008. Santali. In Gregory D.S. Anderson (ed.), 11–98. Green, Anthony Dubach. 2005. Word, foot, and syllable structure in Burmese. In Justin Watkins (ed.), Studies in Burmese linguistics. Canberra: Pacific Linguistics, 1–25. Haiman, John. 2011. Cambodian. Khmer. Amsterdam & Philadelphia: Benjamins. Handel, Zev. 1998. The medial systems of Old Chinese and Proto-Sino-Tibetan. Ph.D. dissertation, University of California, Berkeley. Henderson, Eugénie. 1989–1990. Khasi clusters and Greenberg’s universals. Mon-Khmer Studies 18–19: 61–66. Huang, Yan. 2000. Anaphora. A cross-linguistic study. Oxford: Oxford University Press. Huffman, Franklin E. 1967. An outline of Cambodian grammar. Ph.D. dissertation, Cornell University. ———. 1970. Modern Spoken Cambodian. New Haven: Yale University Press. ———. 1972. The boundary between the monosyllable and the disyllable in Cambodian. Lingua 29.1: 54–66. Jacob, Judith M. 1968. Introduction to Cambodian. London: Oxford University Press. Jenny, Mathias. 2003. New infixes in Spoken Mon. In Mon-Khmer Studies 33: 183–94. Jones, Robert B. 1970. Classifier constructions in Southeast Asia. Journal of the American Oriental Society 90.1: 1–12. Karlsson, Fred. 2009. Origin and maintenance of clausal embedding complexity. In Geoffrey Sampson, David Gil & Peter Trudgill (eds.), Language complexity as an evolving variable. Oxford: Oxford University Press, 192–202. Kruspe, Nicole. 2004. A grammar of Semelai. Cambridge: Cambridge University Press. Ladefoged, Peter & Ian Maddieson. 1996. The sounds of the world’s languages. Oxford: Blackwell. LaPolla, Randy J. 2003. Overview of Sino-Tibetan morphosyntax. In Graham Thurgood & Randy J. La Polla (eds.), The Sino-Tibetan languages. London: Routledge, 22–42. Maddieson, Ian. 2013a. Consonant inventories. In Matthew Dryer & Martin Haspelmath (eds.), chapter 1. http://wals.info/chapter/1. ———. 2013b. Front rounded vowels. In Matthew Dryer & Martin Haspelmath (eds.), chapter 11. http://wals.info/chapter/11. ———. 2013c. Glottalized consonants. In Matthew Dryer & Martin Haspelmath (eds.), chapter 7. http://wals.info/chapter/7. Matisoff, James A. 1973. Tonogenesis in Southeast Asia. In Larry M. Hyman (ed.), Consonant types and tone. Los Angeles: UCLA, 71–95. ———. 2001. Genetic versus contact relationship: prosodic diffusability in South-East Asian languages. In Alexandra Aikhenvald & R.M.W. Dixon (eds.), Areal diffusion and genetic inheritance: Problems in comparative linguistics. Oxford: Oxford University Press, 291–325. ———. 2003. Aslian: Mon-Khmer of the Malay Peninsula. Mon-Khmer Studies 33: 1–58.

142

Jenny, weber and weymuth

Michaud, Alexis. 2012. Monosyllabization: Patterns of evolution in Asian languages. In Nicole Nau, Thomas Stolz & Cornelia Stroh (eds.), Monosyllables: From phonology to typology. Berlin: Akademie Verlag, 115–130. Milne, Leslie. 1921. An elementary Palaung grammar. Oxford: Clarendon Press. Neukom, Lukas. 2001. Santali. München: Lincom Europa. Nichols, Johanna. 2003. Diversity and stability in language. In Brian D. Joseph & Richard D. Janda (eds.), The handbook of historical linguistics. Oxford: Blackwell, 283–310. Osada, Toshiki. 2008. Mundari. In Gregory D.S. Anderson (ed.), 99–164. Ostapirat, Weera. 2005. Kra-Dai and Austronesian: Notes on phonological correspondences and vocabulary distribution. In Laurent Sagart, Roger Blench and Alicia Sanchez-Mazas (eds.), The peopling of East Asia: Putting together archaeology, linguistics and genetics. London: RoutledgeCurzon, 107–131. Peterson, John. 2011. A grammar of Kharia. A South Munda language. Leiden & Boston: Brill. Phillips, Richard. 1973. A Mnong pegagogical grammar: The verb phrase and constructions with two or more verbs. Mon-Khmer Studies 4: 129–138. Pinnow, H.J. 1965. Personal pronouns in the Austroasiatic languages: a historical study. (Translated by H. L. Shorto). Lingua 14: 3–42. Rajasingh V.R. 2013. Negators in Muöt. Mon-Khmer Studies 41: 46–59. Ratliff, Martha. 2006. Prefix variation and reconstruction. In Thomas D. Cravens (ed.), Variation and reconstruction. Amsterdam: Benjamins, 165–178. Reid, Lawrence A. 1994. Morphological evidence for Austric. Oceanic Linguistics 33.2: 323–344. Rischel, Jørgen. 1995. Minor Mlabri: A hunter-gatherer language of northern Indochina. Copenhagen: Museum Tusculanum Press. Schebesta, Paul. 1928. Grammatical sketch of the Jahai dialect, spoken by a Negrito tribe of Ulu Perak and Ulu Kelantan, Malay Peninsula (translated by C.O. Bladgen). Bulletin of the School of Oriental and African Studies 4.4: 803–826. Schiering, René, Balthasar Bickel & Kristine A. Hildebrandt. 2010. The prosodic word is not universal, but emergent. Journal of Linguistics 46: 657–709. Shorto, Harry L. 1963. The structural patterns of Northern Mon-Khmer languages. In Harry L. Shorto (ed.), Linguistic comparison in South East Asia and the Pacific. London: School of Oriental and African Studies, 45–61. ———. 1971. A dictionary of the Mon inscriptions from the sixth to the sixteenth centuries. London: Oxford University Press. ———. 2006. A Mon-Khmer Comparative Dictionary. Canberra: Pacific Linguistics. Sidwell, Paul J. 2000. Proto South Bahnaric. A reconstruction of a Mon-Khmer language of Indo-China. Canberra: Pacific Linguistics.

The Austroasiatic Languages: A Typological Overview

143

Solnit, David B. 1988. The position of Lakkia within Kadai. In Jerold A. Edmondson & David B. Solnit (eds.), Comparative Kadai: Linguistic studies beyond Tai. Dallas: Summer Institute of Linguistics and the University of Texas at Arlington, 219–238. Steven, M., N. Kruspe & J. Hajek. 2006. Register in Mah Meri: A preliminary phonetic analysis. In R. Hoffmann & H. Mixdorff (eds.), Speech Prosody. 3rd International Conference Dresden, May 2–5, 2006. 269–272. Vogel, Sylvain. 2006. Introduction à la langue et aux dits traditionnels des Phnong de Mondulkiri. Phnom Penh: Éditions Funan. Vương Hừu Lễ. 1999. A new interpretation of the Bru Vân Kiêu vowel system. MonKhmer Studies 29: 97–106. Weidert, Alfons. 1975. I Tkong Amwi: Deskriptive Analyse eines Waridialektes des Khasi. Wiesbaden: Otto Harrassowitz. Wnuk, Ewelina & Niclas Burenhult. Forthcoming. The sound system of Maniq, an understudied Aslian outlier of Isthmian Thailand. Wolff, John U. 2010. Proto-Austronesian phonology with glossary. Ithaca, NY: Cornell South Asia Program Publications. Zide, Norman H. 2000. Three Munda scripts. Linguistics of the Tibeto-Burman Area 22: 199–232. ——— 2008. Korku. In Gregory D.S. Anderson (ed.), 256–298.

chapter 3

Austroasiatic Classification Paul Sidwell 1 Introduction The Austroasiatic (AA) phylum spans a vast although discontinuous area of mainland South and Southeast Asia, from central India to Malysia and southern China, plus a small community on the Nicobar Islands of the Andaman Sea. 171 named languages are listed at Ethnologue.com (2013 edition), although on linguistic grounds a more modest number, perhaps around 130 distinct languages should be recognised.1 The languages fall out into more than a dozen branches, although how the branches coordinate into a nested tree is controversial, and discussion of that question is the main topic of part 2 of this chapter. North Munda

South Munda

Khasian

Mangic

Palaungic

Vietic Khmuic

Bahnaric

Katuic Monic

Pearic Khmer

Vietic

Nicobarese Aslian

Map 3.1

Branches of the Austroasiatic Phylum.

1 Much AA classificatory work has been done in the lexicostatistical framework, and in the absence of intelligibility survey data, it is realistic to assign provisional language membership

© koninklijke brill nv, leiden, ��5 | doi ��.��63/9789004283572_004

Austroasiatic Classification

145

General discussions of the disposition and/or classification of AA languages are relatively few in number. A listing of notable works in English published in the last three decades that provide an overview, either primarily or as part of a wider discussion of the linguistic area, would include: Adams (1989), Parkin (1991), Diffloth & Zide (1992), Peiros (1998), Chazée (1999), van Driem (2001), Sidwell (2009), Nagaraja (2011). The general situation, as it was characterised in the 1980s, still prevails to this day: The disagreements among these linguists revolve around the alignment of these languages into subfamilies and into branches of subfamilies. The question of whether certain languages belong to the Austroasiatic group now concerns only a few poorly known languages. Adams 1989: 26

A generation of remarkable progress transpired in AA studies from the late 1950s to late 1970s, in which scholarly efforts coalesced into a coherent field— complete with conferences, canonical texts and a journal—and many fundamental problems were solved and/or clearly delineated. However, progress faltered in the 1980s as key projects (such as publication the SICAL2 papers, and Shorto’s Mon-Khmer Comparative Dictionary) were abandoned and key figures (such as Harry Shorto, Franklin Huffman) retired from academia without their places being taken by sufficiently dynamic and resourceful individuals. Consequently the rate of progress was reduced to fits and starts, and almost halted on the question of classification, such that overt expressions of exasperation ultimately began to flow from the keystrokes of concerned scholars: However, the relationships between these families within Austro-Asiatic is debated; in addition to the traditional classification, two recent proposals are given, neither of which accept traditional Mon-Khmer as a valid unit.

on the bases of lexicostatistical scores of 80% or more of shared basic vocabulary to identify a language. For example, in Miller & Miller’s (1996) comparison of 50 Katuic wordlists the threshold of 80% correlates well with uncontroversial language identifications. Consequently, the 171 AA language names assigned ISO codes are reduced to 129 in the Appendix to this chapter. To see the problem in more general terms, Xia et al. (2010) attempted to reconcile all ISO 639–3 language codes and the complete list of named languages in Ethnologue, and their analysis characterised 2,625 language names as ambiguous. 2 Second International Conference of Austroasiatic Linguistics, held in Mysore (India) December 19–21, 1978; copies of papers and handouts from the meeting are held at Cambridge University Library and CIIL Mysore Library.

146

Sidwell

It should be noted that little of the data used for competing classifications has ever been published, and therefore cannot be evaluated by peer review. Nagaraja 2010: 15

Austroasiatic languages are the most poorly researched of all those under discussion. Many are not documented at all and some recently discovered in China are effectively not classified. The genetics of Austroasiatic speakers are almost unresearched. Austroasiatic is conventionally divided into two families, Mon-Khmer (in SE Asia) and Muṇḍā (in India). Diffloth (2005, 79) now considers Austroasiatic to have three primary branches but no evidence for these realignments has been published. Indeed Austroasiatic classification has been dogged by a failure to publish data, making any evaluation of competing hypotheses by outsiders a merely speculative exercise. Blench 2008, 117–118

Thus, in to the second decade of the 21st century, and well into the second century of AA studies, we find ourselves without a clear classificatory consensus, and consequently floundering when it comes to addressing issues of prehistoric culture and population diversification, contacts and movements in Mainland Southeast Asia. AA is clearly an old phylum, probably the oldest in mainland Southeast Asia, as is clearly evident from its geographical distribution. The branches are distributed largely discontinuously (a tendency that is much greater outside of Indo-China), with speaker communities divided by other linguistic families known to be intrusive to the region (Tibeto-Burman, Kra-Tai, Austronesian, Hmong-Mien). AA is also typologically diverse, which is indicative of both great time depth and a history of contact with diverse external influences. The proto AA lexicon, according to our current state of knowledge (e.g. Diffloth 2005, Shorto 2006, Blench & Sidwell 2011) shows a variety of terms linked to the cultivation and consumption of rice, yet there are few if any apparently ancient terms related to metals or metal tools. There is also abundant lexicon for tropical wildlife, plus some domesticated species. Additonally, there are indications of ancient loans, especially of animal terms, between AA and neighbouring phyla such as Sino-Tibetan and Austronesian. This would seem to indicate a periodization of cultural development that may be tested by correlation with archaeological and genetic data. These are the kinds of considerations that Nagaraja has in mind when he writes:

147

Austroasiatic Classification

Regarding the homeland of Munda people, more research needs to be undertaken on a multi-disciplinary axis as no single disciplinary approach will bear any fruit. Nagaraja 2011: 14

Obviously real progress in the area of AA phylogenetic classification holds great promise for advancing our understanding of human history in Southeast Asia, which only has a written historical record that begins patchily in the mid first Millennium with inscriptional Mon and later Khmer. A reliable reconstruction of language history that demonstrates the branching structure of the phylum will have much to say about the migration and settlement history of the speaker communities, with implications for both absolute and relative chronologies of diversification events. One of the first problems is to determine the general shape of the family tree, since on general grounds we expect that a deeply nested tree structure will correlate with a very old language history, while a flatter or less structured tree would indicate a younger family whose initial diversification was a more rapid or punctuated event. The principle is indicated diagrammatically by Bellwood (2005) in the figure below. SPACE 1

1‒4 = Proto-languages

1

2

1‒5 = Proto-languages 3 4 5

TIME

2 3

4

A

subgroups

B

subgroups

DIRECTION OF GEOGRAPHIC EXPANSION OF FOUNDER COMMUNTIES Figure 3.1 Strong vs. weak nesting of branching relations within family trees (Bellwood 2005: 187).

148

Sidwell

The idea is that in a family with well-defined nested sub-branching distinguished by innovations at the nodes, substantial periods of time and/ or geographical separation must have occurred, and done so in an ordered chronology. On the other hand, a relatively rapid dispersal will see nodes only weakly differentiated, with less opportunity for nested groupings to arise, yielding a simple rake-like or radial structure (depending on how the tree is drawn). Proposals of both kinds have been made in respect of AA, and it is on this question that scholars have had the most difficulty reconciling their views. There are presently two main tendencies; one reflected by Diffloth (2005) divides AA into three families with substantial nested subgroupings; the other tendency (e.g. Sidwell & Blench 2011) proposes a more rake-like tree, reflecting a later and more rapid dispersal of the phylum. Various proposals for nested relations among branches will be discussed below; and solutions to these questions potentially hold the key to providing answers to the most promising and problematic issues facing the field, such as:

• • • • •

Does AA form an old deeply nested phylogeny or a flatter/rake-like tree? Is there a primary divide between Munda and Mon-Khmer, with one structurally archaic and the other innovative? Is the AA homeland in Inda, SEAsia, China, or elsewhere? How can the geographical dispersal of AA languages be reconciled with linguistic phylogeny? What is the chronology of linguistic expansions/dispersals, and how do these correlate with historical and archeological facts (e.g. rice agriculture, metal smelting, seafaring etc.)? How have linguistic contacts conditioned language change/restructuring over time?

•

There is, however, substantial consensus on the number and compostion of individual branches in the phylum. The general view at present acknowledges some thirteen AA branches, listed in alphabetical order here: Table 3.1

Austroasiatic branches

Branch

Main regions where spoken

Aslian Bahnaric Katuic Khasian

Malay Peninsula Central Indo-China Central Indo-China Meghalaya State of India

Austroasiatic Classification Branch

Main regions where spoken

Khmer Khmuic Mangic/Pakanic Monic Munda Nicobarese Palaungic Pearic Vietic

Cambodia and neighboring areas Northern Laos Southern China and Vietnam Southern Myanmar and central Thailand Eastern and Central India Nicobar Islands of India Shan State of Myanmar Cambodia and Thailand Vietnam and Central Laos

149

There is extraordinary variation in the number of languages and internal sub-groups within branches. At one extreme Khmer is represented by a single language, Monic by two and, both Nicobarese and Mangic/Pakanic each with three (although this is not entirely clear). At the other extreme the largest branch is probably Bahnaric, with at least 30 distinct languages and extensive internal typological diversity, indicating that it has been diversifying for several thousands of years. In addition to the list above, there are recent proposals that other AA branches existed and have left traces in various languages today (Sidwell 2005, Blench 2011, Blench & Sidwell 2009), but these are not dealt with here. 2

A Short History of AA Classification Proposals

A range of proposals emerged around the middle of the 19th century that recognised a substantial language family that we would today identify with AA or a subset of AA, and until the 1950s there were ongoing disputes over the membership and the limits of the phylum. In particular there were deep divisions over questions of whether languages as typologically divergent as Vietnamese and Mundari could even be related to each other, which were only resolved with progress in comparative studies. The period since the 1950s and continuing to the present is very different in character; debate over the extent of the phylum settled down after the publication of various landmark studies, and a coherent field began to emerge, complete with international meetings, a journal, and canonical reference publications. A new era dawned in which field workers generated extensive new data written in modern linguistic frameworks, and this in turn facilitated a growth in comparative and other analytical

150

Sidwell

work, especially branch-level comparative reconstruction and classification. Rapid progress followed through the late 1960s and into the 1970s as many problems of the “low hanging fruit” kind were solved, followed by a relatively quiet phase through the 1980–90s in which no dedicated conferences were held and several important projects were abandoned or otherwise delayed. The pace of work picked up again in the late 1990s, in part due to the new emphasis on language documentation as a field, and later improved application of computational methods and data sharing provided a much needed boost to comparative work. International conferences finally resumed in 2007 with ICAAL3 being held in Pune (India). Very recently new computational phylogenetic studies are being conducted with AA data and beginning to yield new and interesting results. The first recognition of some kind of AA language relationships come in a footnote on page 345 of Vol. IV (1850) of The Journal of the Indian Archipelago: I was first led, some years ago, to recognise the peculiar connection of Anam with the Indo-Asianesian races on finding, when amongst the Basisi of Bukit Panchür in Malacca, that several of their non-Malayu words were Anamese. Logan 1850: 345

Logan’s otherwise uninformative remark was followed in 1854 by Mason who provided—in his grammar of Mon (Talaing)—a list of Mon and Kol (the Munda languge also known as Ho) lexical comparisons that provided strong indications of a genetic relationship. Within a generation there was wide recognition of the “Mon-Anam” family, although diffusionists would remain sceptical well into the 20th century. For example, while Cust (1878) laid out a recognisable characterisation of the family: The Group is composed of twenty Languages—1. The Mon or Peguan. 2. the Kambojan. 3. The Annamite. 4. The Paloung. 5–20. The Languages of the sixteen Wild Tribes inhabiting the upper basin of the River Mekong. Cust 1878, 124

Keane on the other hand offered outright dismissal: Here no place is given to a so-called “Mon-Annam” linguistic family, of which a good deal has recently been heard, and which is supposed to embrace the Annamese, . . . Keane 1880, 286

Austroasiatic Classification

151

And as late as 1942, Sebeok was pointing to the various structural differences between, for example, Munda, Vietic and Aslian languages, offering only incredulity at the prospect that they could be related. Perhaps the most influential scholar suspicious of the genetic hypothesis was Charles Otto Blagden, who served for many years as Reader in Malay Studies in the University of London and made great contributions to Aslian linguistics and Mon and Burmese epigraphy. An enthusiastic diffusionist, Blagden saw waves of linguistic influence where others would see a genetic family. He imagined multiple phases of MonAnnam influence working their way into the aboriginal cultures of the Malay peninsula throughout the first millennium, especially citing lexical differences as evidence. Despite the strong diffusionist tendency, the neogrammarians would eventually prevail. Through the late 1800s more and more data came in from French Indochina, and from the turn of the century the great Linguistic Survey of India (or LSI, published in multiple volumes from 1903 to 1928, edited by George Grierson) provided a mass of well organised data of AA languages in what are today India, Bangladesh, Myanmar, and the Nicobar Islands. Although on structural grounds Grierson separated Munda and Mon-Khmer—a scheme that would echo down to the present—the LSI helped to provide a solid basis for comparative AA linguistics, from the beginning of the 20th century until well into the 1960s. Using materials obtained through Blagden, Grierson and French sources, Wilhelm Schmidt produced a series of monograph length papers:

• 1901: analysed the Aslian group (“Sakai und Semang auf Malacca”) in relation to Mon-Khmer (published in English 1903); • 1904: analysed Khasi and Palaungic (“Palaung-Wa-, und Riang-Sprachen” or Salwin Group) 1905: • often refered to simply as the Grundzüge (‘Foundations’), is a comparative analysis of Mon, Khmer, Bahnar and Stieng, compiling more than 900 etymologies; 1906: outlines the case for an Austric super-phylum (published in French in 1907).

•

Schmidt’s work put comparative AA studies on a strong footing, even though he was unable to solve many problems with the limited data at hand. For example, the problem of proto-vocalism so frustrated him that he expressed doubts that regular correspondences even existed, and the problem bedevils the field to some extent even today. Schmidt (1906) suggested an internal classification of Austroasiatic, based upon various lexical and morphological isoglosses, arriving at a classification

152

Sidwell

that he considered “einleuchtender und gesicherter” (reasonable and secure), as follows: Table 3.2 Austroasiatic classification by Schmidt (1906)

I. II.

III.

a) b) a) b) c) a) b) c)

Semang Senoi (Sakei, Tembe); als Michungen mit Bersisi, siehe IIIc. Khasi Nikobar Wa, Palong, Riang; nach Mon-Khmer neigend, siehe IIIa. Mon-Khmer (mit Bahnar, Stieng usw., siehe oben S. 18.) Muṇḍā-Spachen Tscham, Rade usw., s,ob.18; Mischungen mit austronesischen Sprachen

Schmidt’s Mon-Khmer group included languages that are today classified into six distinct branches: Monic, Khmer, Bahnaric, Pearic, Katuic, and Khmuic. Altogether it is evident that 11 Austroasiatic branches are represented in Schmidt’s work, with only Vietic and Mangic/Pakanic absent. The minor Vietic languages, which are typologically much more like other ‘Mon-Khmer’ than the highly monosyllabic and tonal Vietnamese, were unknown, so it is understandable that Vietamese was put aside as an oddity. On the other hand, Mangic/Pakanic would not even be known to scholars until the later part of the century. Finck (1909) followed Schmidt (1906) closely, and showing more confidence explicitly maintained Vietnamese (his Annamitisch) within Austroasiatic. The classification, explained in the text rather than explicitly tabled, plays out as follows: Table 3.3 Austroasiatic classification by Finck (1909)

Austroasiatische 1– Kurku-Annamitisch or Munda und die Mon-Khmer Gruppe 2– Khassi-Nikobarisch 3– Proto-Malakkisch

153

Austroasiatic Classification

Finck placed Vietnamese within his Kurku-Annamitisch group, explaining that it had “received a new and peculiar imprint” due to Chinese influence. The Khassi-Nikobarisch corresponds to Schmidt’s group II, and the third (Semangische and Senoische, today’s Aslian) is equivalent to Schmidt’s group I. Within francophone linguistics a strong tendency resisted the classification of Vietnamese within AA. Maspero in 1912 (translated into English by Haudricourt 1966) treated Vietnamese as a Tai language, asserting that the tonal system must have been so inherited; so profound was Maspero’s influence that he was still being cited as authoritative decades later by westeners (e.g. Sebeok 1942) and until recently still enjoyed significant support by Chinese and other Asian scholars. And some other Europeans had their own original ideas, of particular note von Hevesy (1928; 1930; 1932; 1934) repeatedly ridiculed the Munda-MK connection, instead suggesting a genetic relation between Munda and the Finno-Ugric languages. Przyłuski (1924) offered a classification in his chapters “Les langues austroasiatiques”, “Les langues mon-khmer” and “L’annamite” in an edition of Meillet and Cohen’s Les langues du monde. He championed the AA hypothesis whilst being scathing of the bolder Austric hypothesis “La construction est grandiose; elle est encore assez fragile.” (The construction is magnificent, it is still quite fragile) (p. 25). Przyłuski divided AA into three families: Munda, MonKhmer and Annamite (Vietic), and offered lexical and structural evidence, such as the following set of numeral comparisons: Table 3.4 Austroasiatic numeral comparisons by Przyłuski (1924: 386)

‘un’ ‘deux ‘trois’ ‘quatre’

Santali (Munda)

Mon

Muong (Vietic)

mit bar pä pōn

mwai b9a pi pan

môt hal pa pôn

Sebeok (1942) offered a wide ranging and sceptical review of Schmidt’s Austric hypothesis, critiquing his comparisons on lexical, phonological and structural grounds. He found reasons to doubt not only Austric but also the main pillars of AA. Consequently, while Sebeok accepted a core Mon-Khmer family

154

Sidwell

(which included Chamic), he rejected any genetic connection to Munda or Vietnamese. A key flaw in Sebeok’s argument was his ironic acceptance of one of Schmidt’s real errors; the insistence that Mon-Khmer languages have exclusively monosyllabic roots. Sebeok pointed to the real risk of “statistical convergence” in comparison of monosyllables (see Ringe 1999 for a more recent discussion of the problem) and was thus able to cast doubt generally on the lexical comparisons upon which AA is based. However, this criticism is relevant only in respect of particular faulty comparisons, and cannot be held generally against the hypothesis; later work (especially Donegan & Stampe 1983, 2004 etc.) has built a coherent theory reconciling Munda and non-Munda AA roots, moving beyond a dogmatic insistence on historical CVC structure. In a very positive aspect of the paper, Sebeok opens with a detailed AA classification. “[. . .] as complete and as accurate as a comparison of the sources permits [. . .] because, scattered over many places, it is nowhere available in such a unified form.” (fn. 1942: 206). The list is reproduced here as a useful indication of the state of knowledge at the time. Table 3.5 Austroasiatic classification by Sebeok (1942)

Austroasiatic 1. Mon-Khmer 1.1. Mon-Khmer proper 1.101. Mon (or Talaing) 1.102. Khmer (or Cambodian) 1.103. Bahnar 1.104. Stieng 1.105. Rangao 1.106. Moi 1.107. Kha 1.108. Kuoi (about 4 dialects) 1.109. Chong 1.110. Pear 1.111. Penong 1.2. Eastern 1.21. Cham 1.211. Cambodian 1.212. Bình Thuân 1.22. Jarai

1.32208. Senoi of Ulu Pahang 1.32209. Chendariang 1.32210. Tapah 1.32211. Ulu Gedang 1.32212. Sungkai 1.32213. Slim 1.32214. Orang Tanjong of Ulu Langat 1.323. Southern 1.3231. Southwestern (5 dialects) 1.3232. Southeastern (4 or more dialects) 1.324. Eastern 1.3241. Inner Group (about 5 dialects) 1.3242. Outer Group (about 2 dialects) 1.4. Nicobarese 1.41. Kar Nicobar-Chowra 1.42. Teressa-Bompoka 1.43. Kamorta—Nankawri—Trinkat Kachal 1.44. Little and Great Nicobarese 1.45. Minor dialects

155

Austroasiatic Classification 1.221. Haban 1.222. Hedroŋ 1.223. Arap 1.224. Chur 1.225. Kruŋ 1.23. Radé 1.231. Kpa 1.232. Atham 1.233. Dle-rue 1.234. Ktul 1.235. Kaduŋ 1.24. Sedang 1.3. Malay Peninsula 1.31. Semang 1.311. Group One 1.3111. Kedah (3 dialects) 1.3112. Ulu Selama 1.3113. Ijok 1.3114. Jarum 1.3115. Plus and Jehrhr 1.3116. Pangan (12 dialects) 1.3117. Hill Semang 1.312. Group Two 1.3121. Juru 1.3122. Begbie’s Semang 1.3123. Newbold’s Semang 1.3124. Swamp Semang 1.3125. Unclassified dialects 1.32. Sakai 1.321. Northern 1.3211. Kenderong 1.3212. Grik 1.3213. Kenering 1.3214. Sungai Piah 1.3215. Po-Kloh 1.3216. Plus Korbu 1.3217. Ulu Kinta 1.3218. Tanjong Rambutan 1.3219. Tembe’ 1.322. Central

1.5. Salowen Basin 1.51. Palaung 1.52. Wa 1.53. Riang 1.54. Khamûk (or Khmu) 1.55. Le-Met 1.6. Khasi 2. Muṇḍa 2.1. Himalayan 2.101. Patan 2.102. Rangōi 2.103. Kanāshī 2.104. Kanāwri 2.105. Rangkas (or Sankiyā) 2.106. Dārūtiyā 2.107. Byāngsī 2.108. Chandāngsī 2.109. Vāyu 2.110. Khambu 2.111. Yākhā 2.112. Limbu 2.113. Thāmi 2.114. Dhimāl 2.2. Chota-Nagpur 2.21. Western 2.211. Kūrkū 2.212. Khariā 2.213. Juāng 2.22. Kherwārī 2.221. Santālī 2.222. Muṇḍārī 2.223. Bhumij 2.224. Bīrhar 2.225. Kōḍā 2.226. Ho 2.227. Tūrī 2.228. Asurī 2.229. Korwā 2.23. Two questionable groups 2.231. Savara

156

Sidwell

(cont.)

1.32201. Blanja (Lengkuas) 1.32202. Sungai Raya 1.32203. Ulu Bertang 1.32204. Ulu Kampar 1.32205. Mt. Berumban 1.32206. Jelai 1.32207. Serau

2.232. Gadabs 3. Annam-Muong 3.1. Annamese 3.11. Tonkinese 3.12. Upper Annamese 3.13. Cochinchinese 3.2. Muong (many dialects)

Sebeok’s scheme recognises three numbered divisions, 1 Mon-Khmer, 2 Munda, 3 Annam-Muong (Vietic), reflecting his concerns over demonstrating genetic relationship. The listing does include some languages that are well known these days as belonging to phyla other than AA. These include the Chamic (Austronesian) languages Cham, Jarai, Radé, mistakenly included in AA by Schmidt, plus a column of Tibeto-Burman languages misclassified as Himalayan Munda (continuing Schmidt’s earlier typolologically based classification). More than a third of the total list offered by Sebeok is a listing of Semang and Sakai (Aslian) languages, which is directly from the compilation of Blagden (1906). Generally the impression is of little progress in the three decades up to the 1940s. The 1950s saw a radical improvement in the field as the twin problems of the status of Vietnamese and Munda were independently and successfully tackled. Haudricourt (1953, 1954) brilliantly demonstrated that Vietnamese tones originated natively within the resources of the AA vocabulary, so there could no longer be reasonable doubt of its affiliation. Pinnow (1954, 1959) systematised lexical and phonological correspondences within Munda and between Munda and other AA languages, providing the first substantial addition to Schmidt’s work of half a century earlier. Like Schmidt before him, Pinnow’s work also took some minor dead ends; he mistakenly included Nahali in his Munda group when it should be regarded as an isolate (see Shafer 1940; Kuipers 1962, Zide 1996); and there are many unsafe comparisons in his data. Although it was the 1950s, Pinnow was working substantially with LSI sourced data, and was strongly influenced by Grierson’s views, so we can afford to be generous is assessing Pinnow’s substantial achievement. The classification he presented in 1959 is presented at Table 3.6.

Austroasiatic Classification

157

Table 3.6 Austroasiatic languages by Pinnow (1959: 3–6)

A. West-Northwest: Nahali B. Nordwest: Munda a) Ost: Kherwari b) West: Kurku c) Zentral: Khaɽia-Ɉuaŋ d) Süd: Sora-Gadaba C. Mon-Khmer: a) Mon b) Kambodja (e.g. Khmer, Pear, Stieng) c) Chəma (e.g. Ma, Chrau) d) Mnong e) Bahnar f) Sedang g) Brao h) Jəru (e.g. Boloven, Nhaheun) i) Kuoy j) Suoy (e.g. Kaseng, Alak, Laveh) D. Nordost: Palaung-Wa or Salwen Group a) West (e.g. Riang, Palaung, Wa etc.) b) Ost (e.g. Khmu, Lameet etc.) E. Nord: Khasi F. Südwest: Nikobar G. Səmang a) Seəmang-Pangan b) Tiefland-Semang H. Sakai a) Nord b) Zentral c) Süd d) Ost I. Ɉaku’d (Ɉakud’n, Ɉakun)

Examining the classification, we note that Pinnow treats Munda as one among nine coordinate banches. This is understandable as his thesis primarily concerns Kharia within Munda, and the wider AA lexical comparisons he offers

158

Sidwell

are unstructured, merely providing suggestive witness of AA etymology. So the groupings are being given in a flat list without presupposing any particular nested structure. A little later Pinnow (1960 in German, 1963 in English) followed up with a paper directly addressing the classification issue. The impact of Pinnow’s argument is explained well in the following extract: According to recent investigation the extensive subordination and the extremely synthetic structure of the Munda languages are of a secondary nature, just as the analytic structure and general co-ordination in the east are not original but are probably the result of secondary development and, in part, of the influence of the Thai and other languages. The complicated Munda conjugation, if pursued back to its origins in older Proto-Munda, shrinks to a fraction of its present size; thus in ProtoMunda no pronouns as indicators of either subject or object were incorporated into the verb. The decisive question here is whether the analytic simplification developed independently in the individual eastern groups or whether it began in a period before the formation of these groups, as was the case with the increased structuralization in the later period of Proto-Munda. If the latter supposition is correct, the Khmer-Nicobar languages form just such an historical unified group as the Munda languages do. It is unfortunately not yet possible to give a reliable answer to this question. (Pinnow 1963: 145)

So he tentively divides AA into coordinate Western and Eastern families, as tabled below: Table 3.7 Austroasiatic languages by Pinnow (1963: 278)

Western group (Nahali-Munda) (A) West: Nahali (?) (B) East: Munda (a) North Kherwari (Santali, Mundari, Korwa, etc.) Kurku (b) South 1. Central: Khari, Juang 2. South-East: Sora, Pareng, Gutob, Remo

Austroasiatic Classification

159

Eastern group (Khmer-Nicobar) (A) West: Nicobarese (Nancowry, Car, etc.) (B) East: Palaung-Khmer (a) West: Khasi (b) North: Palaung-Wa (Palaung, Wa, Riang, Lawa, etc.) (c) East: Mon-Khmer (Mon, Khmer, Bahnar, Sre, etc.) (d) South: Malacca 1. Sakai 2. Jakud 3. Semang

Missing from consideration was Vietic, explaining that: “Vietnamese, Müöng [. . . .] are clearly mixed languages” with an Austroasiatic substratum so small “that we may unhesitatingly regard Vietnamese and Müöng as Thai languages” (pp. 140–1) citing Maspero (1912) as the authority for this claim. Thus, at the beginning of the 1960s AA comparative studies were solidly grounded in neogrammarian studies, but quite behind the times; in effect hypotheses developed in the 1850s–80s had been fleshed out and partially proven using data that was mostly circa the first decade of the 20th century. The times they were a-changing, and the 1960s saw the field of AA studies make a serious play at fast catch-up. In addition to substantial francophone research activity, the Summer Institute of Linguistics (SIL) and allied (mainly American) researchers in Indo-China systematically collected survey word lists and other descriptive data, and keenly applied new methodologies such as lexicostatistics (Swadesh 1952, 1955 etc.) and new structuralist analytical frameworks. Out of initiatives taken on several continents an international movement came to life. At SOAS in London in July 1961 a group of scholars met and their papers were published in 1963 as Linguistic Comparison in South East Asia and the Pacific. Also at SOAS Harry Shorto had taken up a chair in Austroasiatic Languages, and in 1962 his A Dictionary of Modern Spoken Mon was published, and shortly followed by Bibliographies of Mon-Khmer and Tai Linguistics (Shorto, Jacob & Simmonds 1963). In 1964 the journal Mon-Khmer Studies was launched in Saigon, with the first issue including A Survey of Austroasiatic and Mon-Khmer Comparative Studies (Thomas 1964). In Chicago Norma Zide was particularly active, in 1966 editing Studies in Comparative Austroasiatic Linguistics and taking steps that ultimately led to the very successful first International Conference

160

Sidwell

in Austroasiatic Linguistics (ICAAL) held at the University of Hawaii in 1973 (and proceedings published in two volumes as Austroasiatic Lingusitics in 1976, Jenner et al. eds). A watershed advance in the classification of AA languages was reached with Thomas’ (1966) Mon-Khmer Subgroupings in Vietnam and a follow-on study coauthored with Headley (1970) More on Mon-Khmer subgroupings, both relying on lexicostatistical methods (and the willingness of colleagues to share their data, a practice that is not universal in this field). The papers were transformative, such that their results have successfully framed discussions of AA classification ever since, with several of their novel results withstanding the test of time well. The principal achievement of Thomas and Headley was to sort out the confusion regarding eastern AA groupings. As one may note in the classifications of Schmidt, Pinnow, and others, there was a strong tendency to group all of the AA languages of Indo-China and immediately neighbouring areas into a vague Mon-Khmer group, as if the many languages of the Indo-China Cordillera and Mekong Valley were one great dialect chain, with Mon and Khmer at one end and Khmu at the other. Thomas and Headley introduced structure into this linguistic diversity, clearly established Pearic, Katuic and Bahnaric as distinct branches, their tabling of cognate counts producing: . . . . fairly neat clustering of percentages, showing the distinctions between intra-branch comparisons and inter-branch comparisons; intra-branch figures run from about 40% up, inter-branch figures about 20–30%, inter-family figures 8–15%. (Below 6–8% genetic relationship becomes indistinguishable from chance similarity.) Thomas & Headley 1970: 409

Yet their data was still quite limited, and they followed earlier writers in treating Munda, Nicobar and Aslian (Malaccan) as coordinate families along with the newly internally delineated Mon-Khmer. The latter they classified as follows: Table 3.8 Mon-Khmer sub-groupings by Thomas & Headley (1970)

I. Pearic II. Khmer III. Bahnaric A. North Bahnaric B. West Bahnaric C. South Bahnaric

Austroasiatic Classification

161

IV. Katuic V. Khmuic VI. Monic VII. Palaungic VIII. Khasi IX. Viet-Muong

Thomas & Headley’s classification substantially informed the widely cited 1974 Encyclopædia Britannica article by Diffloth, who acknowledged that, “The main lines of the following classification have been established by the glotto chronological method, which involves the statistical comparison of similarities.” (Diffloth 1974: 481)3 The problem of classification became an important theme of the 1973 International Conference on Austroasiatic Languages (ICAAL) meeting in Hawai’i, with proceeding published in 1976 in two volumes under the title Austroasiatic Studies. Lexicostatistical studies included: Benjamin (delineating the internal structure of Aslian); Gregerson et al. (analysing the position of Bahnar in Bahnaric); Headley (examining the place of Khmer); Huffman (discussing specific issues arising in applying lexicostatistics to Mon-Khmer languages). Further lexicostatistical studies followed, including: Smith (1981), Migliazza (1992), Miller & Miller (1996), Peiros (1998, 2004), and various unpublished studies by Shorto (discussed in Sidwell 2009). All these studies follow similar methodologies, of the type that originated with Swadesh (1952 etc.) and (with the exception of Peiros) they did not include their complete wordlists and cognate assignments with their results. The lexicostatistical method has been widely criticized, especially for unreliability at greater time depths due to problems with the assumption of constant rate of change inherent in the method (see e.g. Bergsland & Vogt 1962, Rea 1973, Blust 2000). More recently Bayesian approaches to computational phylogenetics, adapted to linguistics from other fields such as genetics, are being adopted and have essentially displaced lexicostatistics. From the 1970s onwards Diffloth developed a model of AA classification, apparently based on clades of lexical and other innovations, although the justification has not been published. Early versions divide AA into Munda and 3 Diffloth himself does not practice lexicostatistics, and oddly his encyclopedia article does not credit Thomas and Headley, although the bibliography does mention Mon-Khmer Studies, characterising it as “collections of short technical articles mostly on the montagnard languages of South Vietnam.”

162

Sidwell

Mon-Khmer familes (following Pinnow 1959), and Mon-Khmer into three subfamilies (Northern, Eastern, Southern). A version is presented by Diffloth and Zide (1992): Table 3.9 Mon-Khmer sub-groupings by Diffloth & Zide (1992)

Northern Mon-Khmer Khasi Palaungic Khmuic Eastern Mon-Khmer Vietic Katuic Bahnaric Khmer Southern Mon-Khmer Monic Aslian Nicobarese

In 2005 Diffloth further revised his classification, elevating his Northern MonKhmer to a coordinate family, and collapsing the eastern and southern groups, giving us an AA phylum composed of three families: Munda, Khasi-Khmuic, and Mon-Khmer. The paper does not defend or demonstrate the classification but rather offers it as “the most recent form of a possible AA tree” (2005: 81). Some verbal explanation was offered by Diffloth at the Southeast Asian Linguistics Society (SEALS) meeting (May 21–21, 2008, Universiti Kebangsaan Malaysia), along the lines that it is based on analysis of lexical innovations. Although the facts behind these lexical arguments have not been published, their existence is asserted by others, e.g.: Gerard Diffloth (2005) provides a somewhat changed picture. Here rather than counting cognates, Diffloth compares reconstructions of various clades, and attempts to classify them based on shared innovations. Nagaraja 2011: 17

Of course, one must acknowledge that Diffloth has presented some phonological arguments from time to time—and these are discussed below—but serious

163

Austroasiatic Classification ad 1,000 ad 0

1,000 bc 2,000 bc 3,000 bc 4,000 bc 5,000 bc

Korku Kherwarian Kharia-Juang

Munda

Koraput Khasian Pakanic

Khasi Khmuic

Eastern Palaungic Western Palaungic Khmuic Vietic Eastern Katuic

Vieto-Katuic

Western Katuic

Khmero-Vietic

Western Bahnaric Northwestern Bahnaric Northern Bahnaric Central Bahnaric

Khmero-Bahnaric

Southern Bahnaric Khmeric Monic Northern Asli Senoic

Asli-Monic Nico-Monic

Southern Asli Nicobarese

Figure 3.2 Austroasiatic classification by Diffloth (2005).

detailed discussion of AA classification has been remarkably lacking since the heyday of AA studies in the 1970s. Through to the early 2000s discussions continued to be dominated by lexicostatistical analyses and this has not helped in terms of having the field taken seriously; the lexicostatistical method (as classically formulated) is widely held to be discredited and in practice has generated wildly varying results in respect of AA over 40 years of efforts. Consequently, in the first decade of the 21st century, the state of AA classification was in a very unsatisfactory state. The general picture was of multiple conflicting published claims, and a lack of access to the data upon which those claims were made. Furthermore, claims based on differing methods (i.e. morphosyntactic typology vs. sound change vs. lexicon etc.) had not produced convergent results. However, it is recognised that studies conducted so far have not always had access to sufficiently representative or comprehensive data (often due to lack of data sharing rather than an absolute lack of data collection) and individual researchers have faced difficulties dealing with large data sets.

164

Sidwell

Better use of software and information technology over the last decade has seen a dramatic improvement in data aggregation and sharing. Additionally, computational phylogenetics generally has begun to demonstrate results which are convergent with those derived from more traditional comparative methods (see Greenhill & Gray 2009 for discussion). Thus the propects for moving the field forward have improved both qualitatively and quantitatively (although it remains for academic culture to catch up to the possibilities created by new technology). Responding to the challenge of Diffloth’s classifications, this writer has taken a multi-pronged approach, pursuing systematic bottom-up reconstruction of the phonology and lexicon of AA branches plus the use of computational phylogenetic methods (the latter conducted in cooperation with Russel Grey of the University of Auckland and Simon Greenhill of the Australian National University4).5 The ongoing result of this work is an emerging synthesis that is summarised immediately below and discussed in more details in sections following that deal with individual branches. The computational phylogenetic project began with an analysis based on a data set of Swadesh 100 list items for 30 AA languages, the results of which were discussed at the 4th ICAAL meeting in 2009 and in print by Sidwell & Blench (2011). The limitations of that study were such that Mangic/Pakanic was not represented, and most branches were only represented by two or three languages. The result was a strongly branching tree with roughly equidistant “spokes” suggesting a rather flat or rake-like structure. Support was found for sub-branching of Katuic with Bahnaric, and also a northern group comprised of Palaungic, Khmuic and Khasian, but with no clear indication that these should be interpreted as genetic or areal constructs. A second trial was conducted in 2010, and the results posted on a website for the purposes of discussion/cooperation (people.anu.edu.au/~u9907217/ lexico/AAclassification). The dataset was expanded to 54 languages, including the addition of Mangic/Pakanic. The results were consistent with the previous study, this time broadly indicating 13 sub-groups in a strongly branching tree, 4 At the time the cooperation began, in 2009, Greenhill was at the University of Auckland, and late took up a fellowship at the ANU, Canberra). 5 It should be noted that this writer continues to be strongly criticised, mostly in anonymous referee reports on grant applications, for not pursuing an approach focused on seeking innovations in morphology and syntax as the basis of classification, citing the importance of this method for Indo-European studies. Meanwhile, in the real world, no scholar has proposed morphological or syntactic innovations that might plausibly link two or more AA branches, and this is not for lack of effort in searching for this kind of evidence.

165

IC

ur KS C_

_K

hm

er

B BA AHN HN A AR RIC BAH NAR IC_St _Sre IC_T ien BAHN g a ARIC mpuan _ BAHNARIC Bahnar _Sedang BAHNARIC_Halang a BAHNARIC_Cu u IC_ Jrn R A N BAH yaheu IC_N NAR BAH

KA KA TUI TU C_ IC Bru MO _K ui NIC MO _Nya NIC h_K _Mo ur n MAN GIC_ Man g MANGIC _Boylu MANGIC_Bugan

uAD ge Kat IC_N coh TU Pa KA IC_ TU KA

hm

_K

C UI

g

an

u uC

l Ma ri KM IC_ lab U M M _ K UIC KM lai eme i N_S A I Sema ASL ASLIAN_ emiar ASLIAN_T ASLIAN_Jahai ASLIAN_Kensiw NICOB ARIC_C NICO AR BAR IC_N MU anco ND wri A MU MUN _ Juan g ND DA A_ _Ko Mu rku M UN nd ari DA _S or a

m ru Go A_ ND o MU ngS avu _Th TIC VIE alieng c _M Ru ese TIC TIC_ VIE VIE TIC_Vietnam VIE VIETIC_MuongHB

KAT

KHASIAN_WarLamin nar KHASIAN_P hasi IAN_K KHAS

IC AR

ER

I ER

M

_ UIC

H ong _Ch ng RIC _Kaso PT PEA RIC PearK _ PEA in

PE M KH

KH

PALAUNGIC_U PA PAL LAUNGIC_ Lamet AU PAL NGIC_ Wa AU NG PA IC_ LA KM Pal UN aun UI G IC_ g C_ D Ks a na inm w ul

Austroasiatic Classification

Figure 3.3 Neighbour Net of 50 Austroasiatic languages by Sidwell & Greenhill (2014).

again with weak support for Katuic-Bahnaric and Palaungic-Khmuic-Khasian (with Khasian falling closer to Palaungic). In early 2014 a third trial was conducted on an expanded data set and the data and results put online (sites.google.com/site/austroasiaticlanguages). The lists were extended to 192 lexical items for each language (an edited version of the Matisoff CALMSEA6 list), while the number of languages was reduced to 50 as several were not represented by sufficiently rich sources. A Neighbour Net of the 50 languages (created by Greenhill using SplitsTree4 software) is reproduced here. Again the results are strongly in line with previous trials, which (in the context of the much expanded and repeatedly reviewed dataset) provides 6 Culturally Appropriate List for Mainland Southeast Asia, see Matisoff 1978. Some items were edited from the list due to their consistent lack of representation in the sources.

166

Sidwell

substantial comfort that the methodology is delivering useful results. One other Neighbour Net analysis of AA relations is published, being a detailed treatment of Aslian internal relation by Dunn et al. (2013) and also reproduced in Benjamin (2011). It is noted that, consistent with the results of my 50 languages trial, Dunn et al. also find three branchings that include Kensiw and Jahai, Temiar and Semai, and Semelai respectively. This most recent AA Neighbor Net continues to indicate a strongly branching, largely flat tree, with weak support for Katuic-Bahnaric and PalaungicKhmuic-Khasian sub-families, plus it reveals important information about internal relations within the more divers branches (such as Bahnaric, Palaungic, etc.), which is discussed further below. Interesing aspects of the Neighbor Net include:

•

The Munda branch is coherent, and shows as a coordinate branch at the same level as others, so there is no support for treating it as a sister family of a greater Mon-Khmer family. Pearic and Khmeric are grouped in the Neighbor Net, but this is apparently due to borrowings, predominantly from Khmer into Pearic. There are no serious proposals to sub-group these; in fact at the 2013 SEALS meeting (Argay, France) Diffloth suggested a Northern Mon-Khmer affiliation for Pearic, citing Khasi isoglosses. Palaungic and Khasian are subgrouped, consistent with a postion that I arrived at by comparative reconstruction (see discussion below), but there is only much weaker indication of a link to Khmuic. The putative Mangic branch is indicated, consistent with Peiros’ (2004) lexicostatistics, while earlier suggestions of a connection to Palaungic (Thomas & Headley 1970, Loi et al. 2008) are not supported. A Katuic-Bahnaric sub-family is strongly indicated. Calculations suggest that there are in the order of approximately 30 unique isoglosses in the 194 word list, which suggests either a coordinate sub-family, or a coherent local sprachbund, or both. There is weak support for a southern grouping of Nicobar-Aslian, but these do not group with Mon, as Diffloth (2005) suggests.

• • • • •

Below, we will briefly review the evidence for several sub-grouping proposals, before concluding with an examination of the internal structure of the consensus branches.

Austroasiatic Classification

3

167

Proposals for AA Families & Sub-Families

3.1 Munda vs. Mon-Khmer It is quite striking that Schmidt’s (1906) classification grouped Munda with the eastern Mon-Khmer languages, reflecting particular lexical agreements which are found in basic vocabulary, pronouns and lower numerals. A half century later, especially following Pinnow’s work (1959, 1960, 1963) which gave priority to typological differences, a view would emerge and become widely accepted that AA consists of a two coordinate families, Munda and Mon-Khmer (or Khmer-Nicobar in Pinnow’s formulation). Benjamin describes the present situation neatly: It has been widely assumed that the two branches constitute genetically well-founded sub-families, but the view currently being proposed is that all the Austroasiatic groups are of more equal status and that the high-level division between ‘Mon-Khmer’ and ‘Munda’ should therefore be regarded as arbitrary (Sidwell 2010, Sidwell & Blench 2011). While this ‘flat’ view may turn out to be true, linguistic research continues in practice to treat the Munda and Mon-Khmer branches separately. Benjamin 2012: 137fn

However, statements such as the above obscure the fact that there is a history of fluidity and competing hypotheses. Even Pinnow expressed the reservation that his classification may, “. . . . have only structural and geographical justification.” (1963: 150) Consequently the problem was never simply put aside, and in particular through the 1970s and 1980s both Diffloth and Shorto independently considered the idea that AA actually has two coordinate families, Northern and Southern, with the former consisting of Munda, Khasian, Palaungic and Khmuic, and the remainder making up the latter. In an unpublished draft by Diffloth held in the Cornell Library manuscript collection he muses: I have in mind in particular a dendrogram which would place Munda within, or as a coordinate of, the Northern branch of Mon-Khmer. This idea is supported by historical innovations (FN . . .)7 and is therefore superior to other classifications which are based mostly on typological considerations. But the question remains entirely open, . . . . 7 Although an explanatory footnote was intended, it does not occur in the manscript.

168

Sidwell

The note is accompanied by a handout from a conference presentation Diffloth gave in Moscow in 19898 on the question of AA classification, which presents details on two etymologies (for ‘daylight’ and ‘stone’) that suggest a regular correspondence of *s-:*t- in minor syllable position, with Munda, Khasian, Palaungic and Khmuic reflecting *s- and the rest of the phylum reflecting *t-. The same question had been considerd by Shorto; one of the important changes that had been made in his Mon-Khmer Comparative Dictionary between the first draft in the late 1960s and the second draft a decade later, was the reconstruction of a *t2 proto-phoneme to account for the same correspondence that was to trouble Diffloth. Shorto, writing about the supposition of a Northern MK family, remarked: The whole of this group is characterized by a shift of *t in initial position in structures *CCVC (in some cases) to a sibilant, prima facie via an affricate stage. This minor shift is interesting because it apparently extends to Munda. Its incidence may be conditioned by the lost (in Mon-Khmer) V1 of Proto-Austroasiatic *CVCVC, or it may entail reconstructing an additional proto-phoneme (*t1,) *t2. Shorto (2006: x–xi)

Shorto (2006) proposed 14 etymologies in support of his preconsonantal *t₂-; on close examination I find that only 5 are sustainable, yet these still need explaining, and it is not clear that we have a definitive solution. Presently, this writer (Sidwell) proposes that since all the reasonably secure examples involve s:t before a sonorant, they may reflect lenition of an original stop following juncturally conditioned aspiration, similar that that which occurs in Cambodian. This is at least as reasonable as Shorto’s proposal of assimilation to a pre-pAA *i vowel, yet both solutions still hint at the proposed Northern-AA family. Harking back to Schmidt, it is surely significant that Munda languages retain important AA lexical items that have otherwise been replaced or restricted in Khasian, Palaungic and Khmuic. Most significant among these are the lower numerals; prima facie we can reconstruct numeral forms one to nine for pAA, based on cognate forms retained in Bahnaric, Vietic, Mangic, Monic, Aslian and Munda. Elsewhere only forms one to four or five are generally retained, and forms five to nine are replaced by a diversity of roots which are difficult to etymologise. In a different pattern again, a pAA form *gaːl ‘ten’ is apparently reconstructable based on reflexes in Vietic, Mangic, Khmuic, Palaungic, and 8 Linguistic Reconstruction and the Ancient History of the East, Institute for Oriental Studies, Moscow, May 28–June 2, 1989. Presentation: “Sub- and supra-classification of Mon-Khmer”.

Austroasiatic Classification

169

Munda. It would appear that in this respect Munda is broadly conservative, while the other supposedly Northern languages show more losses/innovation in their numerals. Consistent with the numerals evidence, it is also apparent that Munda languages do not share supposed Northern lexical innovations (see next sub-section for details and discussion) but retain more ancient forms in common with languages on the eastern periphery of AA. The typological argument also needs to be dealt with. It has been noted in several studies that Munda languages are typologically very different from the rest of the phylum, especially in their having highly synthetic morphosyntax, including extensive suffixation and polysyllabic words with rather simple syllable structure and phoneme inventories (see e.g. Pinnow 1959, Donegan & Stampe 1983, 2004). By itself this typological contrast does not require positing a phylogenetic opposition between Munda and the rest of AA as a single coordinating family. Donegan & Stampe have provided a proposal for how Munda typology could have evolved out of the eastern AA isolating iambic type, and no scholar has proposed any innovation or other factor that would require the recognition of all the non-Munda AA languages as a single phylogenetic unit. The closest we have come the latter is the highly problematic s:t correspondence that seems to link Munda with Khasian, Palaungic and Khmuic, and yet which is contradicted by the failure of Munda to show any lexical or other structural innovations that might link it to this putative group. Consequently, we can say that there is no basis for a single Munda-MK split underlying the AA family tree; on examination it seems to have been a misreading of Pinnow’s (1960/1963) typological classification. 3.2 Northern Mon-Khmer? A number of significant references mention or discuss a putative Northern Mon-Khmer family (e.g. Ferlus 1974, Diffloth 1979, Diffloth & Zide 1992 and others), and it remains an important feature of the influential (2005) classification proposal of Diffloth, under the rubric Khasi-Khmuic. Thomas & Headley’s (1970) lexicostatistics weakly suggested a northern vs. southern division, and the authors cautioned that, “this would be hard to sustain in the face of the rest of the figures”. More confidently, they grouped Khmuic and Palaungic sharing between 25% and 30% cognates, plus Mang (only recently identified at the time) was found to be closer to Palaungic. Diffloth (1979) explicitly identifies a north-south division in his map of AA languages, with Khasian, Palaungic, Khmuic, Mang and Vietic in the north, although without explicit justification. Diffloth & Zide (1992: 137) straightforwardly state: “The Northern division of MK contains three branches: Khmuic, Palaungic, and Khasian. [. . .] To this North MK division may be added the very small Mang language located on the border of Vietnam and China.” Diffloth’s (2005) scheme has Khasian branching

170

Sidwell

from the highest node, sitting above a sub-family in which Khmuic coordinates with Palaungic-Pakanic, Mang is not mentioned but it may be that Diffloth’s Pakanic includes Mang. Other lexicostatistical studies also provided weak indications of a northern family; for example, Peiros (1998) grouped Palaungic and Khmuic sharing 26% cognates, coincidentally the same percentage counted for Palaungic-Khmuic as found by Huffman (1978). However, these results seem to be an artefact of the selection of languages for comparison; in my lexicostatistical trials comparing five Palaungic languages with four Khmuic, the range of percentages of cognates varied from 10% between Danau and Mal, to 27% between Rumai and Khmu Cuang, and this indicates that the lexicostatistical bases for Northern Mon-Khmer are at best ambiguous. Additionally, some lexical innovations have been suggested in support of a northern family. In the following table of examples from basic vocabulary, innovations shared by two or more branches are shaded: Table 3.10 Select Northern AA isoglosses Khasian

‘blood’ ‘water’ ‘two’ ‘hair’ ‘fingernail’ ‘rain’ ‘eye’

War

pPalaungic Danau

Lameet

snaːm ʔum ʔaːr ʃɲuʔ trsim slap kmat

rnʊə ʔam ʔũ/ʔʊ͂ ə sʊʔ (kǝcar) slɛ/sla mat

*snaːm *ʔoːm *-ʔaːr *suk *rnsiːm *səlɛʔ *ŋaːj

kənɑn⁴ ʔun⁴ ʔɑn⁴ ɲok¹ (kəleəŋ⁴) kəlɪ¹ ŋɑj²

naːm ʔoːm ʔlaːr khuk lmhiːm səlɛʔ ŋaːj

Mang

Mangic Bugan

Bolyu

ham¹ ʑum¹ ʑɨəj⁴ hɔk⁷ — ma² mat⁷

sa³³ da³⁵ biɔ³¹ sak⁵⁵ mi³³ kʰou³⁵ mɛ³³

saːm⁵³ nde⁵³ mbi⁵⁵ suk⁵³ maːi¹³ qɔ⁵⁵ mat⁵³

Khmu

‘blood’ ‘water’ ‘two’ ‘hair’ ‘fingernail’ ‘rain’ ‘eye’

Palaungic

Khasi

Khmuic Mlabri

maːm ʔom, ʔɔːk baːr gləʔ tmʰmɔːŋ kmaʔ mat

mɛːm ɟrʌːk bɛːr thmbʌc chŋkɛr mɛːʔ mat

Austroasiatic Classification

171

The isoglosses given above seem to contradict the notion of a close PalaungicKhmuic relation in favour of Khasi-Palaung; and this is also the consistent result of the computational phylogenetic analyses of Sidwell and Greenhill. The etymology of ‘blood’ terms is problematic, although significant light was thrown onto the question by Ferlus (2008). A problem is that there are two main and apparently related groups of forms. One group is based on a root *saːm ‘to bleed’, which underlies the nominalised stem *snaːm, which is unambiguously represented in Khasian and Palaungic. Additionally there are forms with a mainsyllable -haːm and either a palatal or a bilabial presyllable (e.g. Khmer chiəm, Mon chim, Bahnar phaːm, Semalai maham etc.) which suggest pAA *ɟhaːm or *bhaːm/*mhaːm. Diffloth (1977) and Shorto (2006) both proposed to explain the diversity of forms by positing a nasal infix in an original pAA *ɟhaːm > *ɟnhaːm > *ɟmhaːm, but the identification of the ‘to bleed’ root by Ferlus (continued in Vietnamese as tươm ‘to ooze’) seriously undermines Diffloth’s and Shorto’s ad hoc reconstruction. Consequently, is appears that there are historically two roots, perhaps phonological variants in prepAA. In contrast to Khasian and Palaungic, Khmuic forms reflect pre-pKhmuic *mhaːm with regular loss of medial *h (hence, e.g. Khmu maːm) presenting a serious objection to the Khmuic-Palaungic model. The Mangic languages also present forms that are not infixed, but still pose problems of analysis: Mang reflects both pAA *s and *h as /h/, and so is ambiguous, while the Bugan and Bolyu reflexes both suggest *saːm, although they could also reflect the clustered onset *ɟh reduced to /s/, but this is not indicated by Mang, so probably all three reflect *saːm, which is elsewhere only found in Vietic. Particularly intriguing is the distribution of terms for ‘water’. Most AA branches reflect *ɗaːk ‘water’ (including Munda) while Khasian, Palaungic, and a subset of Khmuic language reflect *ʔoːm, which is clearly an innovation (apparently from a root meaning ‘to wash’). Strikingly, within Khmuic there is a profusion of ‘water’ terms, e.g.: Mlabri ɟrʌːk, Mal phuaj, Phong paʔaŋ, TaiHat paj; Khmu dialects show ʔom as the general term for ‘water’, ʔɔːk for ‘drinking water’, and may reflect pAA *ɗaːk in daʔ ‘spring (water)’. The apparent Khmu homeland area is in the northwest of Laos, in immediate proximity to Palaungic speakers, so this writer is suggesting that the Palaungic word for ‘water’ was borrowed into Khmu at some point before Khmu expanded out over its present range. Within Mangic, we note that *ɗaːk is reflected in Bugan da³⁵ and Bolyu nde⁵³, while Mang shows ʑum1. The Mang form is curious; it can be compared to Waic (Palaungic) *rʔom ‘water’, which is superficially attractive, but this is not unambiguously supported by historical phonology. All the examples of Mang words with initial /ʑ/ that this writer has been able to reliably etymologise reflect forms with original clusters *kr-, *kj-, e.g. ‘wind’

172

Sidwell

ʑiː4 < *kjaːl, ‘road’ ʑaː¹ < *kraʔ. These suggest a preMang ‘water’ *krum or *kjum, neither of which support the comparison with Waic. A similar conundrum prevails over the numeral ‘two’. Khmuic and Bugan/ Bolyu clearly reflect pAA *ɓaːr, while Khasian and Palaungic reflect a variant *ʔaːr, with a lateral preffixed to Palaungic forms (therefore *lʔaːr). Mang. ʑɨəj4 may be compared to Palaungic, but again it is not clear that the correspondences are viable, and the form remains problematic. Very striking is the apparent isogloss for ‘hair’ between Danau (Palaungic) and Standard Khasi, in which both show an infixed palatal nasal, which is not a regular formation. Both groups also have the regular AA root with the nasal, so it appears that either we have an astonishing coincidence, or a common inheritance of a one-off innovation. Not listed above, as it falls outside of the Swadesh list, there is another morphological doublet in words for ‘porcupine’. Khasi has the form dŋkʰɛːt with a front vowel while War has ʃkɔt with a back vowel; the forms are similar but the correspondences do not appear to be regular. Within Palaungic, most languages show a back vowel, such as Riang rəŋkɔs¹, while Danau has a front vowel: tɔŋ² kiɛt¹ˈ³. Reflexes further afield, in Aslian and Katuic also show back vowels, so the marked forms are in Danau and Khasi. Thus we appear to have two examples of non-trivial morphological doublets between Khasian and Palaungic. Additionally, Khasian and Palaungic share an innovated form for ‘fingernail’, and what looks suspiciously like a common innovation replacing the AA root for ‘rain’ (while Khmuic and Mang reflect the more archaic pAA *gmaʔ ‘rain’). The final /p/ of the Khasi reflex is problematic, as it may be original and War may have dropped it, in which case the similarity to Palaungic is irregular. On the other hand, the final /p/ could be due to consonant sandhi, in a phrasal construction with the Khasian verb *pat ‘to fall’, and the connection to Palaungic is strengthened. Finally, the last row in the table above lists forms for ‘eye’. Most AA languages reflect pAA *mat with minimal or no change, whereas Palaugic languages show reflexes of *ŋaːj, apparently from a root meaning ‘pupil’. On balance, it seems reasonable to suggest that the immediate proximity of Khmuic and Palaungic occasioned inter-branch borrowings, and these may be sufficient to explain the similarities that have been noted between these languages. At the same time, the unambiguous geographical separation between Khasian and Palaungic means that it is more difficult to invoke borrowing to explain the isoglosses between these two. Also, except the two questionable Mang words, there is a lack of evidence connecting Mangic/Pakanic with any supposed Northern Mon-Khmer sub-family. Consequently, the most parsimonious analysis groups Khasian and Palaungic in a sub-family, while treating Khmuic and Mangic/Pakanic as independent AA branches.

Austroasiatic Classification

173

3.3 Katuic-Bahnaric? While a major finding of Thomas (1966) was that the Katuic and Bahnaric branches should be delineated within AA, subsequent lexicostatistical studies suggested that these two groups might form a sub-family. Arguments for this have been exclusively lexical, as no common phonological or other structural innovations have been proposed, nor are other indications evident to this writer after nearly two decades of looking at the question. Of particular influence was Huffman’s presentation to the second ICAAL meeting (Mysore, 1978) which although unpublished, was widely circulated and acknowledged. Huffman found very high cognate percentages between Katuic and Bahnaric. He wrote, “It became obvious that the cognate percentages between the Katuic and Bahnaric branches were sufficiently high (52–55) and sufficiently similar to be considered a single branch at the interbranch level of comparison . . .” (1978: 3) Huffman’s figures were also remarkable because he found that all AA branches tend to share higher cognate percentages with Katuic-Bahnaric than than they do with each other, with the effect broadly correlating with geographical proximity to Katuic-Bahnaric. This was interpreted as suggesting that that region of Indo-China was the original centre of expansion of Austroasiatic. Subsequent studies still found a higher percentage of shared cognates between Katuic and Bahnaric, but in ranges much lower than found by Huffman. For example, Smith (1981) found between 24% and 39% of cognates between Katuic and Bahnaric, while Peiros (1998) counted 40%.9 This writer’s (2010) lexicosatistical trial found a range of 23% to 36% between Katuic and Bahnaric, which is essentially consistent with Smith’s findings. It appears that Huffman’s cognate counts were skewed upward by a combination of factors, including the selection of languages and a failure to screen out borrowings. Sidwell & Jacq (2003) identified a substantial stratum of Katuic loans into Bahnaric, especially West Bahnaric, and this sub-branch is overly represented in Huffman’s lexicostatistics as he collected his Bahnaric data from speakers taking refuge in Thailand during the Indo-China war. The recent computational phylogenetic analyses by Sidwell and Greenhill, including Neighbour Net and Covarian tree (relaxed-clock) found strong indications of Katuic-Bahnaric branching from a single node, the Covarian analysis grouping Katuic and Bahnaric in 97 out of every 100 possible trees, which indicates a significant number of isoglosses underlying the computational results. Examining the lexicons used for the phylogenetic analyses, it is not 9 It is significant that Peiros only use Kui to represent Katuic, and this language has a disproportionately higher rate of loans from Khmer.

174

Sidwell

Table 3.11 Katuic-Bahnaric proto-branch basic lexicon isoglosses Gloss

pKatuic

pBahnaric

‘bark of tree’ ‘claw/nail’ ‘skin’ ‘to stand up’ ‘tree/wood’ ‘crossbow’ ‘horn’ ‘palm, sole’ ‘salt’ ‘to steal’ ‘ten’

*ʔnɗɔh *knrias *ʔŋkar *dɨk *ʔalɔːŋ *pnaɲ *ʔakiː *trpaːŋ *bɔːh *toŋ *ɟit

*kɗuh *krʔniəh *ʔəkaːr *dɨk *ʔlɔːŋ *pnaɲ *ʔəkɛː *-paːŋ *bɔh *toŋ *cit

Notes

cf. Khmer kiəh ‘to scratch’ may be borrowed from Chamic cf. pKhmuic *cʔɔːŋ cf. OldMon pnaɲ ‘army’

difficult to identify apparent isoglosses, and a compilation of examples that seem to be reconstructable to both branches (rather than later loans) in the list used for the computational phylogenetic analyses is given above. Eleven proto-language isoglosses are found in the list, and it is not clear whether such should be treated as common innovations or early borrowings. Generally it is evident that there has been extensive borrowing between Katuic and Bahnaric; in the course of compiling etymologies for this writer’s 2011 pBahnaric reconstruction, out of an initial compilation of 1912 Bahnaric comparisons, 335 were identified as loans from Katuic, or about 1/6th of the total. Even if a substantial proportion of these are misidentified as loans from Bahnaric into Katuic, or as common loans from a third source (such as Chamic or Khmer) this strongly suggests a language contact area. All indications are that Katuic is a younger and less lexically diverse group than Bahnaric, and Katuic is also lexically well-defined with numerous distinctive lexical innovations well represented across the language and reconstructable to pKatuic (see discussion of the Katuic branch below). These innovations are across a range of semantic fields, including significantly the numerals from five to nine. By contrast, Bahnaric is not well defined lexicaly, in fact only some three lexical innovations have been identified as uniquely defining the branch. These facts suggest that each branch has had a very different deep history, along the following lines:

Austroasiatic Classification

175

• •

Bahnaric is an older group with its root quite close to pAA, and commensurately has a deeply nested internal branching structure; Katuic is a relatively younger branch which diversified much later and more rapidly from a single rather lexically innovative proto-language, and came into contact with Bahnaric (and Chamic) forming a language area.10 It is very telling that within Bahnaric, Chamic loans are extensive in Central, Northern and Eastern sub-branches, but almost absent from West Bahnaric (see Sidwell & Jacq 2003 for discussion and examples), strongly indicating that Bahnaric diversification preceded the arrival of the Chams in Indo-China in the first Millennium BC. By contrast, there are some Chamic loans that are found right across Katuic (e.g.: *jaːŋ ‘spirit’, *kəw ‘I’, *tdɨa ‘chest’, *rpuːʔ ‘storm’, *cuah ‘sand’ and others) correlating with a later date for Katuic diversification. We may see a hint of the nature of pKatuic society in the recent life of the Katu: their language is phonologically conservative yet lexically innovative, Chazee (1999: 87) remarks on their culture, and the: “independent and autarchic attitude of this population, who negotiate its acculturation harshly.” Consequently, this writer’s analysis is that the extent of Katuic-Bahnaric borrowing is so great that it has been difficult to control for in comparative studies. Detailed analysis with the benefit of extensive databases reveals that lexical isoglosses between these two branches are most likely due to contact loans after each had followed very different early trajectories after independently emerging from pAA. 3.4 Vieto-Katuic? Diffloth (1991b) suggested a sub-family of Vietic and Katuic, and this features consistently in his subsequent published classifications; additionally Diffloth (1989) remarked in the typological parallel of Vietic and Katuic languages having similar creaky phonation contrasts, although was not able to demonstrate a common etymological origin of the feature. Nguyễn (1995) also remarked upon the tendency of both Vietic and Katuic languages to add minor-syllables to AA monosyllabic roots, such as in animal names, in a manner that appears strikingly parallel. Alves (2005) provided an examination of the hypothesis with reference to 100 lexical comparisons. However, all of the viable comparisons provided by Alves have wider AA etymologies, and so there are no immediate consequences for the sub-grouping argument. 10 One may speculate that contact with Chamic may be related to the spread of Katuic.

176

Sidwell

The most important evidence is the small set of lexical comparisons proposed by Diffloth (1991) illustrating the proposed correspondence of pAA *ʔ- to Katuic *h- to Vietic *s-. The data is partly extracted (all etymologies are represented, not all languages) and tabled here: Table 3.12 Katuic-Vietic isoglosses from Diffloth (1991) Thavung Maleng Tum

Pacoh

‘centipede’ kasḭːp

kasḭːp

liːp-siːp kahḛːp

‘bone’

—

səːŋ

siəŋ

‘to cough’

—

—

—

‘to fart’

—

—

somʔ

‘to breath’ pəsʌ̰ːmʔ pəsʌːmʔ — ‘blood’

—

ʔasaːmʔ —

Kui

Katu

Other AA

kahɛ̰ːp kahip

Bahnar kəʔɛːp, Khmer kʔaɛp ŋha̰ ːŋ ŋha̰ ːŋ ŋhaːŋ Khmer cʔəŋ, Khmu cʔaːŋ kahɔ̰ːʔ ŋho̰ ʔ — Khmer kʔoːʔ, Brao kʔɔk — — — Bahnar phoːm, Khmu puːm palho̤ ːm pəhɒ̰ ːm — Khmer ɗɑŋhaəm, Car ʔuhɔːm ʔaha̰ ːm ŋha̰ ːm ʔahaːm Bahnar phaːm, Khmu maːm

The examples presented by Diffloth (1991) establish that in at least two etyma there is a Vietic /s/ corresponding to /h/ in Katuic and /ʔ/ elsewhere, plus one example of Katuic /h/ lacking Vietic cognate; additionally there is a correspondence of Vietic /s/ to /h/ in Katuic and elsewhere, supported by three examples. This is a curious problem, since it suggests an unusual direction of phonetic change (i.e.: ʔ > h > s), and in the absence of any apparent motivation such as a fronting environment. This suggests something grammatical and/ or analogical underlying these. What does not appear to have been suggested is that there is a possible semantic link in the above items; firstly, discounting the ‘blood’ etymology, which is discussed above already, the terms ‘cough’, ‘fart’, ‘breath’ all involve release of gas from the body and are susceptible to sound symbolism with sibilants. This does not immediately explain the ‘bone’ and ‘centipede’ cases, although these may have been restructured secondarily through alliterative rhyming or similar word play. Whatever the explanations,

Austroasiatic Classification

177

the correspondence is real, if sparcely represented, so it hints at either a close contact situation between Vietic and Katuic, or common inheritance. In the same paper Diffloth also asserted that the lexical evidence favours his Vieto-Katuic, writing: It also agrees with the general impression one gathers from inspecting the lexicon of Vietic languages like Thavung, Maleng and Tum, namely, that there are many unique and remarkable agreements between them and the Katuic languages. These agreements do not seem to favor one sub-branch of Katuic or another, but are distributed throughout the branch. There are also cases of unique lexical agreements between Muong and the Katuic languages, and occasionally even of Vietnamese agreeing alone with the Katuic branch. Arguments about lexical replacement require ample space and time to be presented convincingly, which I hope to be able to do on another occasion. Diffloth 1991b: 136

It is a matter of regret that a convenient occasion has not arisen in the past quarter century to present the lexical agreements that are apparently so readily identified, despite the efforts of Alves and this writer. On balance it would appear that the Vieto-Katuic hypothesis has only ambiguous support; the branches are in close proximity and have experienced historical contact, although even the evidence presented for this is not extensive. Despite unsubstantiated claims of significant lexical isoglosses, there are no indications of these in lexicostatistics or computational phylogenetic studies, and the small number of etyma showing non-trivial phonological correspondences is not well enough understood to inform a sub-grouping argument. Additionally, there are also similar problems of the sort encounterd in assessing the KatuicBahnaric hypothesis; in this case, it is evident that both Vietic and Katuic diversified internally a significant time after they split from pAA, and later both came under Chamic influence (e.g.: see Gregerson 1991 for a discussion of Austronesian lexicon in Vietnamese, and note that southern Vietic groups even refer to themselves by the Chamic word for mountain chưt). Consequently it is this writer’s assessment tha the Vieto-Katuic hypothesis is not sufficiently supported. 3.5 Nico-Monic? A consistent feature of Diffloth’s AA family trees is the identification of a Southern Mon-Khmer or Nico-Monic sub-family; the scheme has a binary split

178

Sidwell

between Nicobarese and an Aslian-Mon sub-grouping. So far as this writer is aware, two pieces of evidence have been presented for this hypothesis by Diffloth:

•

there is a non-trivial correspondence between Aslian and Nicobarese diphthongs (mentioned by Diffloth 1991, and supported with some examples in a conference handout distributed at the 18th SEALS meeting, May 21–21, 2008, Universiti Kebangsaan Malaysia); it was suggested at the 4th ICAAL meeting (Oct. 29–30, 2009, Mahidol University Thailand) that the etymology Old Mon kandar, pNyahKur *kəmta̤ r ‘wife’ (reflecting pAA *ua > pMonic *a sound change) has regular reflexes in Aslian such as Semai kərdɔːr, etc.

•

To the above, we can add also the following taken from the core vocabulary: Table 3.13 Southern-AA proto-branch basic lexicon isoglosses

Gloss

Monic Old Mon pNyahKur

Aslian Nicobarese Kensiw Cewong Jah-Hut Semelai Semai Car Nancowry

‘night’ ‘nail/claw’ ‘to die’ ‘head’

bərtɔm (sənlem) (kcøt) (kɗøp)

— ca s gabis kuj

*pəta̱ m (*khənli̱am) (*kəce̱t) (*khdə̱p)

bətom — — —

ptɔm cərwɛs cros kəbɨs kəbəs — khoj

tɯp — — kuj

hataːm sɔh k a pa h kuj

daːm kisoah fah kɔj

The rather scant isogloss data suggests that while there are at least a couple of basic vocabulary isoglosses potentially supporting Diffloth’s Nico-Monic hypothesis, the isoglosses identified here, and the diphthong evidence adduced by Diffloth, would tend to support a closer relation between Nicobarese and Aslian within any such alignment. Additionally it is worth considering the evidence of Shom Pen, the enigmatic language of Great Nicobar Island for which only somewhat problematic sources are presently available. Blench & Sidwell (2011) reviewed the most extensive published source (Chattopadhyay & Mukhopadhyay 2003, henseforth C&W), and found that Shom Pen is most likely an innovative Nicobarese language, or less likely, a sister of Nicobarese and Aslian within an AsliNicobar sub-family. Our analysis of C&W’s Shompen lexicon found 47 unique

Austroasiatic Classification

179

isoglosses with Nicobarese, and 31 unique isoglosses with Aslian languages. Additionally, we identified AA etymologies for some 65 Shom Pen items, but no unique Shompen-Mon isoglosses; those isoglosses with Mon we were able to find are all in the context of wider AA etymologies. It is our contention that the Shom Pen analysis is strongly consistent with there being a closer relationship between Nicobarese and Aslian than between either of these and Monic. The above facts are also consistent with Sidwell & Green’s computational phylogenetic analyses which weakly groups Nicobarese and Aslian yet indicates no support for a special connection with Monic. In terms of reconciling language histories, one may suggest that ancient contacts could explain apparent Monic-Aslian isoglosses; Briggs’ (1950) analysis of Chinese historical sources indicates Khmer and/or Mon presence on the Malay Peninsula for several hundred years during the first millennium CE. However, in respect of the ‘night’ etymology, the phonological changes are regular, indicating a very old form rather than a borrowing, and Shorto (2006) reconstructs pAA *btɔm on the basis of reflexes in all three groups, treating it as a retention. Weighing the above evidence, this writer rejects the Nico-Monic hypothesis as insufficiently supported, and instead proposes a Nico-Aslian subfamily, which has both phonological and lexical indications. 3.6 Provisional AA Classification Having regard to the discussion above, representing a summary of our best efforts to reconcile lexical, lexicostatistical, computational phylogenetic, and phonological studies, the following provisional AA classification is offered (characterised as ‘provisional’ as any hypothesis is open to revision in the light of evidence and analysis): Austroasiatic

Munda

Khasian

Palaungic Mangic Katuic Bahnaric Monic Nicobarese Khmuic Vietic Khmeric Pearic Aslian

The AA tree is characterised as strongly branching, with 11 primary nodes, potentially indicating that there was a phase of rapid differentiation at some point in the history of the phylum. This is broadly consistent with the findings of Sidwell & Blench (2011) and contrasts with the more deeply nested tree model of Diffloth (2005). In terms of modelling language history, the challenge

180

Sidwell

is to reconcile the world represented by the proto-language and its apparent time depth. On one hand, terms connected with rice agriculture and domesticated species seem to be clearly reconstructable for pAA (Diffloth 2005, Sidwell & Blench 2011 etc.) yet at the same time Diffloth (2005) suggests localising pAA older than 5,000 BP, and similarly old dates are suggested by calibrated computational analyses. This is problematic because archaeology (e.g.: Higham 2002, Chi & Hung 2009) tells us that the earliest agriculture in Mainland Southeast Asia appears only around 2000~2500 BP, and significantly older dates for agriculture require localisation either much further north into China or westward into South Asia, which would arguably be even more problematic for modelling the linguistic history. The suggestion made here is that pAA is very old, and was relatively stable over millennia in a dialect chain or “linkage” that underlies the present division into branches. The culture was at home along the Mekong and surrounding uplands, perhaps as far afield as the Red River valley; it was characterised by foraging, hunting, fishery and tuberculture. The combination of dry and wet rice cultivation was adopted circa 4000 BP, and increased the adaptability and vitality of AA communities, who then quickly radiated southward and westward over a range roughly corresponding to what we recognise today. 4

Internal Phylogeny of Austroasiatic Branches

4.1 Aslian Aslian is the most southerly branch of the Mon-Khmer family; located in the interior of the Malay Peninsula while the coastal lowlands came to be dominated by (mostly) Malays. They are spoken by relatively small tribal groups, totalling only a few tens of thousands. Being geographically isolated from other mainland linguistic groups, they never came under strong influence from morphologically isolating and tonal languages like Chinese or Thai, and thus have escaped many of the changes that spread through other Austroasiatic languages. For excellent survey articles about the Aslian branch, see Benjamin (1976) and Matisoff (2003). The internal classification of the languages is essentially now a settled question, particularly in the light of the computational phylogenetic studies of Dunn et al. (2011) (also discussed in Burenhult et al. 2011) and the pAslian reconstruction of Philips (2013). The Aslian languages fall into distinct Northern, Central, and Southern groups, plus one language—Jah Hut—occupies an ambiguous position between Northern and Central branches and may represent a fourth branch. This is seen in the following family tree from Dunn et al. (2011):

181

Austroasiatic Classification

Ten’en Palian Ten’en Paborn 97 Kensiw Perak Kensiw Kedah 61 Kintaq 100 Menriq Rual 99 Jahai Rual 81 Jahai Banun 29 Batek Teh Taku 47 45 Menriq Lah Batek Teh Lebir Batek Teq 82 53 100 Batek Deq Terengganu Batek Deq Koh Ceq Wong 39

Mon

64

Figure 3.4 Aslian family tree, rooted on Mon ( from Dunn et al.). This is a Maximum Clade Consistency tree with branch length equal to the median length of all congruent branches found in the sample. Numbers on the branches indicate percentage of the tree sample supporting each bifurcation.

The results of computational phylogenetics map very neatly on to the patterns of sound change identified by Philips (2013) in his reconstruction of Aslian historical phonology, especially in the vocalism. For example:

• •

Central Aslian preserves long-short vowel distinctions lost to mergers in other branches; PAslian diphthongs *ua, *uə are retained (and distinguished) in Jah Hut (as wɔ~wa~wɛ and wo~o repectively) while they are reflected as low and front monopthongs in North Aslian and mostly as back monophthongs in Central and South Aslian; North Aslian has central vowel reflexes of various front and back pAslian vowels.

•

Southern

92 66

Lanoh Kertei 95 Semnam Malau 100 Semnam Bal 98 Temiar Perak 77 Temiar Kelantan Semai Ringlet 100 Semai Kampar Semaq Beri Berua Semaq Beri Pergam Semelai Mah Meri 89

Central

100

Jah Hut

?

100

Northern Menraq-Batek

35

Maniq

100

100

182

Sidwell

Generally the consonants in Aslian language are well preserved, except for the complete loss of pAA implosion, such that original *ɓ, *ɗ merged with *b, *d in pAslian, and the most common pattern now is for a simple voiced-voiceless contrast in the stop system. From the areal perspective, this is consistent with being in a region of Malayic influence, with convergence on the western Austronesian pattern of multisyllabic words of CVC syllables and less frequent diphthongisation and/or vowel quantity distinctions. In terms of innovations that clearly define the Aslian branch, this is problematic, as no specific proposals have been made for common Aslian innovations either lexical or phonological (beyond the loss of implosion). Consequently, the justification for Aslian remains primarily lexicostatistical, with various studies indicating higher rates of cognates between Aslian sub-branches than with other AA groups; additionally recent computational phylogenetic trials also indicate that the Aslian branch is real. However, even this evidence is not unambiguous: The landmark lexicostatistical study by Benjamin (1976) found cognacy rates as low as 15% with Aslian (e.g. between Kensiw and Temoq) although averaging around 25% between South and North Aslian, and later studies are consistent with these findings. Beyond Aslian these are cognacy rates more typical of the distinction between branches; in the lexicostatistical trial conducted by Sidwell in 2010 various relatively high cognate counts between Aslian and other languages were found, especially with Katuic (e.g. Semelai sharing 26% with Katu, 27% with Kui, and 30% with Bru). These figures indicate a very variable rate of lexical change within Aslian, suggesting conservatism in the south and more rapid lexical change in the north, and this was idependently noted as being indicated by the anlysis of Dunn et al. (2011). 4.2 Bahnaric The Bahnaric family is one of the more diverse AA branches, spoken by hundreds of thousands of mostly hill-tribes (montagnards), living in Vietnam, Cambodia and Laos. While there is significant typological diversity within Bahnaric, comparative reconstruction indicates that pBahanric was very conservative, closely approaching pAA phonologically. Yet there are a couple of somewhat odd lexical innovations that seem to characterise the branch. In particular, there is a form for ‘fire’ with a final nasal, when we would otherwise expect to see a final fricative, since the initial and vowel appear to correspond to the general pAA term. The word for ‘bone’ has a high vowel and a pre-vocalic alveolar (s or t) that leave it looking suspiciously like an adaptation from Vietic (cf. Vietnamese xương ‘bone’). Also, the word for ‘tongue’ is a uniquely infixed derivative of *liət ‘lick’. If one finds two of these forms in a language it is clearly Bahnaric:

183

Austroasiatic Classification Table 3.14 Bahnaric lexical innovations Gloss

Laven (West)

Sedang (North)

Stieng (Central)

pBahnaric

pAA

‘bone’ ‘fire’ ‘tongue’

ktɨəŋ ʔuɲ hapiat

kəsiəŋ ʔuɲ rəpiɩ

ntiːŋ ʔoɲ lpiat

*kʦɨːŋ *ʔuɲ *lpiət

*cʔaːŋ *ʔus *lntaːk

These apparent lexical innovations are not entirely unproblematic; with two languages showing exceptions. Srê (in the extreme south of the Bahnaric range) shows ʔos ‘fire’, while Cua (in the extreme north-east) has ʔolh ‘fire’ (with -lh the regular reflex of *-s). Perhaps the best explanation is that there were competing *ʔus,*ʔuɲ forms in pBahnaric, and *ʔuɲ simply won out in most cases. In addition to the above lexical innovations, there is a striking pattern of selective retention of pAA final glottal stop. In other AA branches the pattern is that word final glottal stop is either generally retained or lost, but in Bahnaric there is split in which the glottal is retained in about half the relevant vocabulary and lost in the other half (e.g. Bahnar buː ‘who’, sɔː ‘dog’ but baʔ ‘father’, tɔʔ ‘hot’). This pattern is found throughout the branch, except in South Bahnaric where all orginal final glottal stops have been lost. The internal classification was extremely problematic until recent times as scholars did not have access to sufficiently representative data (for example, researchers working with data obtained in Vietnam missed entire subbranches and thus could not successfully configure the nesting of nodes in the tree). Thomas (1966) and Thomas & Headley (1970) correctly managed to separate the then Stiengan and Bahnaran groups from the mass of eastern Mon-Khmer languages and group and relabeled them as South Bahnaric and North Bahanric respectively. Then Smith (1972), Gregerson et al. (1976) and others noted that Bahnar is structurally more like South Bahnaric than the other North Bahnaric languages (which are marked by having contrastive phonation types) and a compromise proposal of a Central Bahnaric sub-branch was advanced (to accommodate Bahnar, and also Tampuon of Cambodia and Alak from Laos). Thomas & Headly (1970) also identified a West Bahnaric group in Laos, although it was studies such as Ferlus (1974) that really confirmed its status as another sub-branch. Smith (1973) also questioned the status of Cua and Katua, proposing a North-East goup, although lately Sidwell (2009) clarified

184

Sidwell

this problem, showing that Katua is North Bahnaric and Cua belongs in a separate East Bahnaric branch. Sidwell (2002, 2009, 2010) comprehensively reviewed and revised the internal classification of Bahnaric based on phonological innovations, yielding the following schema of four coordinate sub-branches: Table 3.15 Bahnaric classification by Sidwell (2010) West Bahnaric

Central Bahnaric

Jru’ (Laven), Juk, Su’ Taliang (Kasseng) Nyaheun Alak Oi, The, Sok, Sapuan, Cheng Central South Brao, Laveh, Krưng, Kravet Tampuon Bahnar South Bahnaric Chrau Sre Stieng Mnong

North Bahnaric

East Bahnaric

Halang, Kayong Cua (Kor) Jeh Kotau Tadrah, Modrah Sedang Hrê Mơnơm (Bơnâm) Rengao Kaco’, Ramam

The classification based on historical phonology is largly consistent with later computational phylogenetic trials, including the Neighbour Net discussed above, the Bahnaric section of which is extracted and enlarged here. BA BAH HNAR NAR I IC C_Sr BAHN ARIC _Stieng e _Tam pu BAHNAR IC_Bahn an ar BAHNARIC_Sedang ng ala _H BAHNARIC a RIC_Cu ru BAHNA C I _J NAR un BAH Nyahe _ RIC NA H A B

Figure 3.5 Bahnaric section of AA Neighbour net (Sidwell & Greenhill).

185

Austroasiatic Classification

The Neighbor Net analysis correctly pairs up Jru and Nyaheun (West Bahnaric), Sedang and Halang (North Bahnaric) and Sre and Stieng (South Bahnaric); it isolates Cua (East Bahnaric), and perhaps most strikingly places Bahnar and Tampuon together, even though lexicostatistical analyses consistently group it within North Bahanric, apparently due to undetected lexical borrowings (this is still indicated by the proximity of Bahnar to Sedang and Halang in the Neighbour net). 4.3 Katuic Katuic is a branch of about 15 distinct languages (with significant dialectal diversity) spoken by communities in Thailand, Cambodia, Laos and Vietnam. The current edition of ethnologue lists 19 languages, but this erroneously included Bahnaric languages (Kasseng, Tareng) and multiple ISO codes are allocated to dialects of Bru and Kui. The majority of speakers use dialects of the West Katuic or Kui-Bru sub-branch, living mostly in eastern Thailand and Cambodia (although there are significant Bru communities in Laos and Vietnam), while the greatest diversity of Katuic languages lies among a patchwork of small communities the Salavan and Sekong provinces of Laos and adjacent border areas of Vietnam. For a recent bibliography and quite full discussion of the disposition of Katuic languages see Choo (2012). Katuic was recognised in print as a branch by Thomas (1966), who mentions that suggestions were made in earlier unpublished work, naming a Phillips but without giving a clear reference. A listing of 18 putative Katuic languages appeared in Thomas & Headley (1970) but erroneously included three Bahnaric languages, and is the source of the Ethnologue error mentioned above. Katuic is readily identified by numerous lexical innovations in core vocabulary and lower numerals, and some of these innovations have diffused into West Bahnaric and some Central Bahnaric languages, which contributed to earlier confusion over classification. Some examples of Katuic lexical innovations that replaced AA roots: Table 3.16 Select Katuic isogloss Gloss

pAA

Katu

Pacoh

Bru

‘wife’ ‘year’ ‘cobra’

*kdɔːr *cnam *ɟaːt

kadiel kamaː tuːr

(kapaj) kumɔː tur

kandɛːl kumɑː tṳːr

186

Sidwell

(cont.) Gloss

pAA

Katu

Pacoh

Bru

‘mushroom’ ‘bone’ ‘six’ ‘eight’

*psit *cʔaːŋ *tpraw *thaːm

triː ŋhaːŋ sapat takɔːl

triə ŋhaːŋ tupat tikɔːl

tri̤aʔ ŋhaːŋ tapa̤t tikṳal

Once Katuic was identified, progress in classification followed. Ferlus (1974) delineated West and East Katuic groups, correctly putting Kui, Bru, So and their dialects together in the West and lumped the remaining languages in the East. Several lexicostatistical studies followed (e.g. Smith (1981), Migliazza (1992), Miller and Miller (1996), Peiros (1996)), with varying results. Both Smith (1981) and Peiros (1996) suggested a primary split between Katu and the rest of the branch, and this was also followed by L-Thongkhum (2001). The analysis by this writer (Sidwell 2005) divides Katuic into four equidistant sub-branches, based on the patterns of independent and overlapping sound changes. Three sub-branches have undergone devoicing of stops and show phonation contrasts in their phonologies, while a fourth, Katu (with several dialects), is conservative and preserves a phonological system that resembles pAA. Superficially this is consistent with the proposals of Smith (1981) and Peiros (1996) but close analysis demonstrates that the devoicing of stop series and phonologisation of phonation occurred independently multiple times in the history of Katuic, so it is not appropriate to use these features to recognise larger nested groupings. The following classification is reported by Sidwell (2005): Katuic

West Katuic: Kui, Souei Bru, Sô Ta’oih: Ta’oih, Katang, Talan/Ong/Ir/Inh Kriang/Ngeq Chatong

Austroasiatic Classification

187

Katu: Kantu, Katu, Phuong, Triw, Dakkang Pacoh: Pacoh Figure 3.6 Katuic classification by Sidwell (2005).

The Neighbour Net analysis of Sidwell & Greenhill included five Katuic languages, and yielded a result consistent with the lexicostatistics of Smith (1981) and Peiros (1996), placing Katu on a node above the rest of the branch. Further studies are clearly indicated to investigate the real lexical diversity within the Katu dialect chain, as it is quite possible that the Katu AnDiem dialect used by Sidwell & Greenhill and used by the lexicostatistical studies mentioned above (all sourced from Costello 1971) is not sufficiently representative. KATU

IC_Ka

tuAD KATUIC_Nge KATUIC_Pacoh ru KATUIC_B i IC_Ku U T A K

Figure 3.7 Katuic section of AA Neighbour net (Sidwell & Greenhill).

There is presently an active research effort on Katuic centered at Payap University in Thailand, and consequently our understanding of the branch is rapidly improving to the point where unresolved issues over its internal classification should become quite tractable. 4.4 Khasian The Khasian (also Khasic, or more lately Meghalayan) languages are spoken in the Meghalaya State of India and neighbouring parts of Bangladesh. In India the variety known as Khasi has official status and a very healthy literature, including dictionaries, grammars and popular texts. Since the 19th century (e.g. Roberts 1875, 1891 especially) the tendency has been to treat Khasi as a single language with various dialects, the latter often characterised as unprestigious and consequently neglected as objects of study or language development. The most important early survey work is the Linguistic Survey of India (or LSI, Grierson 1903), which identifies four languages: Standard (Khasi), Lyngngam,

188

Sidwell

Synteng or Pnar, and War, and although latter work has documented a plethora of named local varieties (see Daladier 2002) it appears that Grierson’s four categories are fundamentally sound. The Khasian branch is readily identified by both sound changes and lexical innovations. All the languages share a distinctive phonological change in which pre-vocalic voiceless stops became aspirated, leaving only a small number of words with /p, t, k/ in that position (probably from a variety of sources). This change is discussed by Haudricourt (1965) who characterised it as a Germanic shift; it is notable that similar shifts are uncommon elsewhere in AA, and are not a typical feature of the linguistic area. Also the old Austroasiatic implosives merged with the plain voiced stops, so that the dominant stop phonation contrast is between voiced and aspirated, which is somewhat odd typologically. Some indicative examples of the Germanic shift in Khasian follow: Table 3.17 Khasian Germanic shift examples Gloss

Khasi

War

pAA

‘tiger’ ‘fish’ ‘crab’ ‘to weave’

kʰlaː kʰaː tʰaːm tʰaːɲ

kʰli hi tʰə tʰin

*klaʔ *kaʔ *ktaːm *taːɲ

It is also apparent that pAA word final *-s and *-h became *-t in pKhasian, which is an extremely marked change, and particularly indicative of membership of the branch. E.g.: Table 3.18 Khasian sound change *-s, *-h > *-t Gloss

Khasi

War

pAA

‘fungus’ ‘to swell’ ‘deer’ ‘bellows’

tit ʔ at kʰiat prsut

tet ʔat — pərsʊ͂ t

*ptis *ʔas *kiəh ‘mountain goat’ *cuh ‘to kindle’

189

Austroasiatic Classification

Internally the phylogeny is straightforward; Khasi and Pnar are unambiguously close, with only minor differences in lexicon and phonology. The next highest node is Lyngngam—spoken by just a few thousand speakers in the west of the Khasi hills—it is more lexically divergent but phonologically similar to KhasiPnar. War branches from the highest node; spoken in the south of the hills and also into Bangladesh, and some communities in the north, apparently due to migrations. War is not only lexically more different, it is radically restructured phonologically: stops are devoiced, final nasals are frequently lost, low vowels are raised and/or diphthongised in many contexts, vowel quantity contrasts are lost completely, and other changes (see Sidwell 2011 for more on the historical phonology). In recent years several lexicostatistical and computational phylogenetic studies were carried out: Nagaraja (2004), Brighthill et al. (2007), Sidwell (2009), Nagaraja et al. (2013). The results of these analyses are consistent with the phonological indications that Khasi-Pnar-Lyngngam fall closer together while War branches form the highest node within the tree. The last of the studies list above is the most rigorous; Nagaraja’s 2004 data was revised and extended, and Palaung was added to root the tree and test overall coherence of Khasian. A Bayesian Phylogenetic analysis was run by Greenhill using BEAST v1.7.4. The tree is constrained to indicate a time depth of 500 years BP for the KhasiPnar split; having regard to the fact that the Buranji chronicles of the Ahom kingdom (see Gait 1906) reference the Pnar kingdom at Jaintia from around 500 BP, which suggests a floor under the separation of Khasi and Pnar. The Bayesian analysis places the War branching at around 1350 years (95% HPD = 1028–1737 years), while the glottochronological calculation reported in the same paper places it at 2054 years BP, probably the true figure lies somewhere between these.

1

PnarJaintia Khasi

1

Lyngngam

1

WarLamin

1

Palaung 3000.0

2500.0

2000.0

1500.0

1000.0

500.0

Figure 3.8 Khasian computational phylogenetic tree (Nagaraga et al. 2013).

0.0

190

Sidwell

Further work is required to fully map the diversity of Khasian languages, but it appears that the outlines of the phylogenetic structure of the branch are now reasonably securely understood and provide a solid basis for ongoing comparative studies. 4.5 Khmer Khmer is the national language of Cambodia, with a written tradition going back at least 1400 years. We can speak of a Khmeric branch which consists of a single language with several very closely related dialects. Several dialects are conventionally distinguished (disregarding the continuum of rural speech varieties):

• • • •

Standard Khmer: used in cities such a Phnom Penh and Siem Reap Surin or Northern Khmer: spoken in northeastern Thailand Cardamon or Western Khmer: spoken in the south-west of Cambodia Khmer Krom of the Mekong Delta

It is apparent that all of these dialects diverged during the Middle Khmer period (roughly between 1400 and 1800 CE) after unifying influence of the Angkorian state collapsed. Khmer shows no particular affinity to any other AA branch, although Diffloth (2005 and elsewhere) has consistently grouped it with Bahnaric, explaining verbally that this is on the basis that they share a metathesized reflex of the word for ‘mushroom’ (e.g. Khmer psət, Sre bəsit, Laven pseːt, pAA *ptis) although this writer suspects borrowing into Bahnaric from Khmer, which is also attested with Bahnar bəməw and Tampuon ma̤ w ‘mushroom’ being borrowed from Chamic. Headley (1976) applied multiple methodologies to the problem of the place of Khmer within AA, and found only ambiguous results, so it appears that we must treat the language as forming a coordinate branch within the phylum. 4.6 Khmuic The Khmuic branch is dominated by Khmu (also Kmhmu, Kmhmu’, Kamu etc.), being a large dialect chain spoken through Northern Laos and adjacent areas of Thailand, Vietnam and Yunnan. Additionally there are some minor Khmuic languages spoken by small communities on the western and eastern peripheries of the Khmu speaking area. These geographical groupings are:

191

Austroasiatic Classification

• • •

Khmu dialects (Cuang, Khuen, Rok, Yuan etc.) Western: Mlabri/Yumbri, Mal-Pray/Tinic (Lua’, Pray, Phay, Mal, T’in/Htin) Eastern: Ksingmul, Pramic (Puoc, Phong, Tai Hat, Kaniang, Then, Ơdu) Studies and reference literature on Khmuic classification have been sparce and ambiguous; a typical pattern, such followed by Filbeck (1978) and Proschan (1996) is to simply give an unstructured list of Khmuic languages. The most developed comparative studies have been Filbeck (1978) and Rischel (2007) on pTinic, with the latter also giving special attention to Mlabri, the position of which he considers to be ambiguous (being either a sister of Tinic or an independent AA branch with a Tinic statum). This writer (Sidwell) takes the view that Khmuic (including Mlabri) is a coherent coordinate AA branch, and is working actively on the reconstruction of pKhmuic (a recent version of the reconstruction was released online in 2013 at http://sealang.net/monkhmer and updates will follow). The branch is readily distinguished by a unique sound change within AA, being the loss of medial *h, e.g.: Table 3.19 Khmuic loss of medial *h Gloss

Khmu Cuang

Mlabri

Mal

Other AA

‘full/sated’ ‘blood’ ‘to fart’

biʔ maːm puːm

biːʔ mɛːm —

piʔ miam —

Laven phɛʔ, Chong phəː Stieng mhaːm, Chong məhaːm Laven phoːm, Chong phuːˀm

However, it must be acknowledged that lexically Khmuic is not especially coherent; in this writer’s lexicostatical trials (such as reported in Sidwell 2010a) cognate scores between sub-branchs typically fall below 40% and as low as 21% (between Mlabri and Ksingmul). Consistent with this, our Neighbour net analysis (Khmuic fragment below) also shows the languages branching close to the root and relatively equidistantly. This writer has recently proposed an internal classification based on sequencing phonological changes within Khmuic (discussed at the 5th ICAAL meeting held in Canberra September 4–5th 2013). The proposal is that pKhmuic *aː

192

Sidwell l Ksinmu KMUIC_ KMUIC_KhmuCuang KMUIC_Mal KMUIC _Mlabri

Figure 3.9 Khmuic Neighbour Net

remained unchanged in Khmu but regularly fronted and raised elsewhere in the branch along a cline: *aː> *ɛː > *iə > *iː. Somewhat problematically for the historical phonology, there was a secondary expansion of Khmu, accompanied by a substantial phase of dialect borrowing of forms retaining *aː (and other conservative features) from Khmu into the other languages, creating multiple correspondence patterns. Taking care to remove the loan correspondences, we find that, broadly, Mlabri reflects pKhmuic *aː as /ɛː/, the Tinic group show /iə/, and Pramic reflexes are /iː/. This allows us to construct a tree as follows: Table 3.20 Khnuic classification

/aː/ Khmu (Cuang, Khuen, Rok, Yuan etc.) Proto-Khmuic *aː

/ɛː/ Mlabri /iə/ Khsingmul, Mal-Pray/Tinic /iː/ Pramic

4.7 Mangic/Pakanic The status of several minor languages only described in recent decades, namely: ‘Mang (spoken in Northern Vietnam and nearby in China), Bolyu/ Paliu/Lai (in Yunnan and Guangxi) and Bugan/Pakan (in Yunnan)’ poses an intriguing problem. It appears that these form a sub-group, variously called Mangic (favoured by this writer) or Pakanic (favoured by Diffloth). Mang first came to the attention of scholars in the 1960s, apparently with the report of Vương Hoang Tuyên (1963), providing some data collected Lai Châu Province. A useful lexicon and description of the tone system later

193

Austroasiatic Classification

appeared as Nguyễn Văn Lợi et al. (2008). There is some documentation on Mang in Chinese but it has not been consulted as part of this study. Bolyu first came to the wider attention of scholars with the account of Liang Min (1984). Benedict (1990) followed up with a discussion of the classification of Bolyu, arguing that it represents a distinct branch of Mon-Khmer. Edmondson (1996) provides a lexicon of Bolyu, and Edmondson & Gregerson (1996) analysed the Bolyu tone system, making typological and etymological comparison to Vietic. Their data was collected at Longlin Various Nationalities Autonomous County in western Guangxi. A little later Li Xulian (1999) also provided a lexicon of Bolyu (in Chinese), and recently Qin & Li (2012) reported on the status of Bolyu (counting the population at only 1,030 in 2006). Bugan was documented by Li Jinfang (1996) with a brief sketch that also includes comparisons with Bolyu, Vietnamese, Cambodian, Lawa, Khmu’ and Old Mon. Additionally there is an unpublished lexicon of Bugan by Li Jinfang (recorded at Guangnan County, Eastern Yunnan) of about a thousand items circulating privately. Discussion of the classification of these languages begins with Thomas & Headley (1970). Their brief note is as follows: Mang (Mang U’) in North Vietnam near the China border (Tuyên 1963), which we expected from its geographical location to be Khmuic, shows highest cognateness with Palaungic, next highest with Viet-Mương. Though the data is sketchy, we are tentatively classifying Mang with Palaungic. Thomas & Headley 1970: 403

More recently Nguyen Van Loi (2008) provided lexical arguments for classifying Mang with Palaungic, perhaps within the Samtau sub-group, on the basis of the forms for ‘water’ and ‘two’. This view is apparently reflected in Diffloth (2005) classification, see fragment reproduced at Figure 3.10. Khasian Khasi-Khmuic

Pakanic Eastern Palaungic Western Palaungic Khmuic Figure 3.10

Pakanic classified with Palaungic by Diffloth (2005).

194

Sidwell

Peiros’ (2004) lexicostatistics unambiguously grouped Mangic, Bolyu and Bugan into a single branch (coordinating with Vietic and Palaungic-Khmuic in his tree). Similarly this writer’s (2010) lexicostatistical study clearly groups the three languages, scoring Bolyu-Bugan at 47% cognate, Mang-Bolyu at 29%, and Mang-Bugan 25%, figures which are comparable to what we find within, for example, Khmuic. The highest inter-branch count is 22% between Mang and Khmu, and all other pairwise comparisons across AA have yielded cognate counts no higher than 20%. Comparative analysis for Mangic is not highly developed. All three languages are tonal, and heavily restructured phonologically (reduction in clusters, extensive vowel shifts, etc.), and this also hinders identification of cognates, so in all likelihood the group is probably a little more lexically coherent than the figures above suggest. The compilation and analysis so far done by this writer remains largely inconclusive; no lexical or phonological innovations have been identified that unambiguously unify the branch. The vocabulary that has been etymologised is almost entirely general AA vocabulary with no marked tendency to resemble another branch, except for the lower numerals which are closest to Vietic (discussed by Edmondson 1996). Consequently, the identification of Mangic currently relies primarily on statistical justification, in the absence of compelling evidence of another type. 4.8 Monic The Monic branch consists of two languages, Mon and Nyah Kur, spoken today by minorities in Myanmar and Thailand. These languages are apparently descendents of the vernacular speech of the inhabitants of the Old Mon civilisation of Dvāravatī that flourished in central and northeastern Thailand from the 6th to the 10th centuries. Dvāravatī society disappeared from history as the Khmer and Siamese absorbed much of the Old Mon culture over time. Using the inscriptional materials, comparison of written and Modern spoken Mon, Nyah Kur, and the evidence of borrowings from and into neighbouring languages, Ferlus (1983) and Diffloth (1984) reconstructed Old Mon or Proto Monic (as the comparative reconstruction is more properly called), and the results show a language which is very similar to proto-Austroasiatic, preserving all three stop series and a variety or other archaic features. Contemporary Monic languages have changed so substantially that, like Old and Modern English, they are almost unrecognisable. 4.9 Munda The Munda branch consists of 11 languages spoken in eastern and central India, ranging from large languages such as Santali (ca. 6 million speakers) and Mundari (ca. 1.5 million speakers) with considerable dialectal diversity down

Austroasiatic Classification

195

to very small members such as Gtaʔ ( k Mixed southeastern group *q > zero.

Pinnow (1959), in his extensive comparative treatment of Munda languages, divided them into four coordinate sub-branches: a) b) c) d)

East: Kherwari (Santali, Mundari including seven named dialects) West: Kurku Central: Kharia, Juang South: Sora-Gadaba (Sora, Pareng/Gadaba, Gutob, Remo)

Pinnow then revised his classification to reflect a basic North-South split, explaining: Recent investigation has shown that the verbal inflection of all Munda languages is traceable to a Proto-Munda inflectional system, which was later expanded in the north and considerably reduced in the south. From this evidence and on the basis of lexical differences the Munda languages may be divided into a Northern group with the sub-groups Kurku and Kherwari (Santali, Mundari, Korwa, etc., belong to the latter branch), and a Southern group, which is further subdivided into a Central group (including Kharia and Juang), and a South-Eastern group (including Sora, Pareng, Gutob, and Remo). Pinnow 1963: 141

196

Sidwell

There subsequently emerged an essentially consensus classification, which was generally received for the next four decades, reproduced here from Anderson (2006): Proto-Munda North Munda Kherwarian

Korku

Santali Mundari etc.

South Munda Kharia-Juang

Koraput Munda

Kharia Juang Gutob-Remo-Gtaʔ Sora-Gorum Gutob-Remo Proto-Gtaʔ Sora Gorum Gutob-Remo Plains-Gtaʔ Hill Gtaʔ

Figure 3.11

Traditional family tree of Munda (Anderson 2006: 2).

Several Munda scholars, especially Anderson (2001, 2004, 2007, 2008 etc.), have emphasised verb morphology as the principal legitimate criteria for classifying the languages, claiming that lexical and phonological features are more easily borrowed and thus less reliable indicators of phylogeny. Anderson and Zide in particular have argued that, “the Proto-Munda language was indeed morphologically complex, and it is far from the case that all such morphological features are attributable to South Asian areal diffusion.” (Anderson & Zide 2001: 24) And subsequently Anderson suggested a revision of the South-Munda part of the family tree to reflect his reconstruction of morphological changes: Proto-South Munda Proto-Sora- Gorum Proto-Juang

Sora-Juray Gorum Figure 3.12

←→ Proto-Gtaʔ ←→ Proto-Kharia ←→ Proto-Gutob-Remo Juang

Kharia

Gutob

Remo Plains Gtaʔ Hill Gtaʔ

Revised south-Munda tree (Anderson 2007: 4) with ←→ indicating sharing of innovative isoglosses.

197

Austroasiatic Classification uang DA_ J MUN orku MUNDA_K MUNDA_Mundari

MUNDA_Sora MUND A_Gor um

Figure 3.13

Munda section of AA Neighbor Net (Sidwell & Greenhill 2014).

Views on Munda classification continue to evolve, and scholars including David Stampe and Felix Rau, have begun to suggest that the North-South division is not real, and that North Munda may be better regarded as coordinating along-side the South-Munda sub-branches, but we await concrete proposals. In this regard, it is notable that the Munda section of our recent Neighbour Net, reproduced above, firmly places the putative Northern languages Kurku and Mundari, between Sora-Gorum and Juang. Consequently the present indications are that the internal structure of Munda is rather flatter than earlier proposals had it, with perhaps six coordinate sub-branches as follows: Munda

Figure 3.14

North Munda: Korku Santali, Mundari Sora-Gorum Juang Kharia Gutob-Remo Gtaʔ

Revised Munda classification.

4.10 Nicobarese (Nicobaric) Remarkably, the Nicobarese are the only Austroasiatic-speaking community who do not live on the Mainland, but on a small chain of idyllic tropical islands. This is not to say that the Nicobarese are, or have been, especially isolated; their islands have always been on an important trade route since classical times. In the atlas of Claudius Ptolemy (ca. 150 CE) the Nicobars are apparently identified as Bazacata Insula, where we are told the natives always go naked. And we are given the following remarkable account of the multilingualism of the natives observed in the 19th century:

198

Sidwell

The oldest men yet speak the corrupted Portuguese that still lingers in the East. Middle aged men speak very often a little bad sailor-English; the young men, especially South and East, speak Burmese; the boys a little Hindustani: all talk Malay and their own language. At Car Nicobar they talk English pretty well. de Röepstorff 1875: 14

The above should put to rest any Rousseauian fanatasy that the conditions of the Nicobarese or their language are the result of great isolation; the great difficulty that scholars have found accessing the islands and speakers in the last century have more to do with the restrictions of colonial and Indian administrations, and more recently the circumstances occasioned by the 2004 Boxing Day Tsunami. The above being said, we know something about the languages, especially the two main languages thanks to the PhD theses by Brain (1970) on Car, and by Radhakrishnan (1981) on Nancowry (also lately called Müot). Additionally there are extensive dictionaries and some grammars from the colonial period (e.g. de Röepstorff 1975, 1884, Man 1889, Temple 1902, Whitehead 1925). Whitehead (1925) listed six languages as follows: 1) Car 2) Chowra 3) Teresa/Bompoka 4) Central (Camorta, Nancowry, Trinkat, Katcall) 5) Southern (Great and Little Nicobar, Pulo Milo, Kondul) 6) Shom Pen However, the above list is geographical rather than based on linguistic criteria, and without access to good data on more than Car and Nancowry it is difficult to offer useful conclusions. None the less, the position is not hopeless, as in fact Radhakrishnan did circulate a basic word list of 68 items in seven languages at the SICAL meeting in 1978, and a copy of this was made available to this writer by David Stampe. The languages are: Nancowry, Katcall, Pulo Milo, Kondul, Camorta, Teresa, Car. It is immediately evident that all of the languages except for Car are lexically very close, with only minor phonetic differences between them for most etyma. In terms of the apparent percentage cognates between the lists, compared to Nancowry (as the presumptive reference dialect for the putative Central Nicobarese), we can construct a tree,

199

Austroasiatic Classification

additionally interpolating the position of Shom Pen on the basis of Blench and Sidwell (2011): Nancowry Katcall Camorta Kondul Pulo Milo Teressa Car Shom Pen Figure 3.15

100% 100% 100% 83% 76% 70% 38% ?

Central Nicobar Dialect Chain Proto-Nicobarese

Proposed Nicobarese family tree.

This proposed Nicobarese family tree identifies three primary branches: a Central Nicobarese dialect chain, plus two substantially distinct languages, Car and Shom Pen, respectively at the northern and southern extremes of the island chain. Thus we could characterise Nicobarese as having North, Central and South sub-branches. 4.11 Palaungic Palaungic (or Palaung-Wa in older sources) is an internally diverse branch, with about two dozen languages11 (and numerous named dialects) spread over a wide area that overlaps Thailand, Myanmar, China and Laos. They are generally distributed in small discontiguous pockets surrounded by other ethnic groups, mostly Tai, Meo and Tibeto-Burmans who repeatedly colonised the upper Mekong and Salween valleys over the last couple millennia. Perhaps because of this disparate distribution and intense language contact (or inspite of it?), Palaungic sub-groups have undergone some rather dramatic phonological changes, frequently involving tonogenesis by diverse mechanisms (such as devoicing initials, merging codas, reanalysing vowel length, etc; see Diffloth 1980 on tonogenesis in Waic, Svantesson 1988, 1991 on tonogenesis in Angkuic). The first properly coherent Palaungic classification was presented by Mitani in 1978 (SICAL meeting, Mysore, India) based on lexicostatistics, reproduced here at Figure 3.16: 11 23 ISO 693–3 language codes listed in the 2013 Ethnologue.

200

Sidwell Danau Riang Palaung Wa Angku

Figure 3.16

Palaungic classification by Mitani (1978).

Lamet

C_U UNGI PALA AUNGIC_Lamet PAL PALAUNGIC_Wa PALAUNGIC _Palaung PALAU NGIC_ Danaw

Figure 3.17

Palaungic fragment of AA Neighbor Net (Sidwell & Greenhill 2014).

Mitani’s scheme recognises a primary split between Danau (a single language spoken in the Shan State, see Luce 1965) and the rest of the branch, the latter divided into Eastern (Palaung-Riang) and Western (Wa-Angku-Lameet) sub-brances. Strikingly, this scheme agrees neatly with our recent Neighbor Net analysis at Figure 3.17. Diffloth (1977) proposed a somewhat different tree, based on analysis of sound changes affecting palatals and sibilants, and tonogenesis. This classification still puts Danau at the highest node, as it retains distinct reflexes of pPalaungic *ɟ, *c, *s, *h while there are several distinct patterns of mergers among these reflected in other Palaungic sub-groups. The main difference in Diffloth’s scheme is that Angkuic is placed at a higher node, coordinating with Palaung-Riang. Diffloth (1980) followed up with a reconstruction of Waic lexicon and phonology, including more than 500 lexical reconstructions, and a detailed classification of some 28 dialects within Waic. Paulsen (1989–90) followed with a similarly extensive reconstruction for the Samtau/Plang/Bulang sub-group within Waic. That paper substantially expands and improves classification within Waic, classifying the 28 named sources used in the text into a Waic subbranch with coordinate Wa-Lawa and Samtau nodes. The Santau group, also known as Plang in some sources, and is the subject of a phonological reconstruction by Paulsen (1989–90).

201

Austroasiatic Classification

-*ḉ → *s-

Danaw

Angku

Riang (Black) Lamet

North Wa (e.g., Tung Va)

South Wa

-*š- → sh-, s-*j- → c-*š- → sh-, s- (merger with *h-) Palaung (Milne)

-*c- → ¢ -

register → tone

-register affects vowel quality

Lawa (Bo Luang) -register disappears Lawa (Ban Phae)

-register disappears -breathiness becomes aspiration Lawa (Umphai)

-*j- → *c- + breathy-low vowel Riang (White Striped)

-*s- → ɵ-*j- → *dƷ- → ḉ-*s- → *š-

Proto-Wa

Figure 3.18

Proto-Palaungic

Proto-Palaung-Lawa

Palaungic family tree by phonological innovations (Diffloth 1977:49).

In 1982 Diffloth circulated a classification at the 15th International Conference on Sino-Tibetan Language and Linguistics (Beijing, China) (repoduced in Kasisopa 2003, and Deepadung 2007) in which Danau is tenatively demoted to a sister of Palaung Riang-(indicated with a dotted line in his diagram), and Angkuic demoted back to being a sister of Waic and Lameet within East Palaungic. The basis of this reclassification has not been explained. In 2010 this writer compiled a Palaungic comparative lexicon, and proposed a phonological and lexical reconstruction, posting more than 1300 reconstructions and etymologies online at sealang.net/monkhmer. Additionally the analysis was discussed at the May 2010 Southeast Asian Linguistics Society meeting in Bangkok, and a recent computational phyologenetic analysis presented at the May 2014 Southeast Asian Linguistics Society meeting in Yangon. The results are consistent with Mitani’s (1978) proposal, solidly placing Danau at the highest branching node of Palanguic, below which East and West Palaungic sub-branches are recognised, see Figure 3.19. This work still leaves unresolved the status of a couple of languages, Khabit (e.g. Kingsada & Kosada 1999) spoken in Northern Laos, Kháng/Bumang (e.g. Mikami 2003, Edmondson 2010) spoken in Vietnam and nearby in China, which

202

Sidwell Lameet Lua U Muak Lawa Wa Palaungic

Figure 3.19

Rumai Palaung Riang Danau

Proposed Palaungic family tree (Sidwell).

are erroneously listed as Khmuic in the 2013 Ethnologue. Diffloth (1982) tentatively grouped Khabit with Lameet, and Edmondson (2010) compared Kháng with several Khmuic languages. However, this writer argues that while both these languages share some lexical isoglosses with Khmuic, this can be readily explained by contact. On the other hand, both share isoglosses for ‘eye’ and ‘blood’ which are strongly diagnostic for Palaungic: Kháng ŋaj² ‘face’, nɯm² ‘blood’, Khabit pəŋaːj ‘eye’, sənuə̆m ‘blood’, regularly reflect pPalaungic *ŋaːj ‘eye’ and *snaːm ‘blood’, which are very difficult to explain unless these languages are geographically displaced Palaungic varieties. Examination of the published lexcions leads this writer to the conclusion that Diffloth (1982) was on the right track, and both of these group with Lameet within Palaungic. 4.12 Pearic The Pearic branch is a small, highly endangered group of languages spoken mainly in Southwest Cambodia and Trat Province of Thailand, and small pockets elsewhere including some communities to the east of Siam Reap. Suffering an extremely endangered status, much good work has been done recently by Thailand based researchers in language documentation and revitalization (see Malone Premsrirat 2005). The most extensive documentation of Pearic in Cambodia was done in the 1930s by Baradat (1941a,b) and Ferlus (2011) provides an exhaustive list of sources of published Pearic data up to 2009, as well as a recent phonological reconstruction. The languages are noted for having four voice registers, such that any stressed syllable can be plain, breathy, creaky, or breathy-creaky. This in itself distinguishes the Pearic from the rest of Austroasiatic, although there are also numerous lexical innovations that are worth noting, and a selection are noted here (a more extensive list is provided at Headley 1985: 463–465):

203

Austroasiatic Classification Table 3.21 Select Pearic lexical innovations Gloss

pAA

Kasong

Chong of Trat

Pear of Kompong Thom

‘fish’ ‘fire’ ‘bone’ ‘chicken’

*kaʔ *ʔus *cʔaːŋ *ʔiər

mê̤ːw ple̤ːw klɔːŋ lɛ́ːk

meːl pleːw klaːŋ lɛːk

miəl pʰlou klâŋ lêk

Early lexicostatistical studies of Pearic (Thomas & Headley 1970, Martin 1974) revealed a pattern that confirmed by later comparative studies (Headley 1985, Ostapirat 2009, Ferlus 2011): the branch is characterised by a binary split between Pear of Kompong Thom (Eastern Pearic) and a diversity of Chong dialects (Western Pearic) which are apparently highly mutually intelligible. While Headley (1985: 464) provides an isogloss map delineating some 9 phonological features, it is sufficient to note here that Western Pearic is distinguished by having both plain and aspirated reflexes of prevocalic pAA plain voiceless stops, while Eastern Pearic shows only non-aspirated reflexes. Following Sidwell (2009) the languages/dialects can be classified as follows: Pearic

Figure 3.20

Eastern Pearic: Pear of Kompong Thom Western Pearic: Chong Southern Chong: Suoi, Saoch, Chung Western Chong: Chong of Chantaburi Central Chong: Samre, Kasong, Chong of Trat Northern Chong: Somray, Somre of Siam Reap (extinct) Proposed Pearic classification.

4.13 Vietic The Vietic languages consist of Vietnamese and the closely related Mương languages, together recognized as Viet-Muong, plus a handful of very small languages spoken in the Vietnam-Lao borderlands (generally refered to as Chứt in Vietnamese sources); for informative general introductions see Parkin (1991), Hayes (1992), Ferlus (1996). Into the 1980s it was normal to refer to this branch as Viet-Muong, and to treat all non-Vietnamese members as Mương languages or dialects. Among all researchers concerned with Vietic, Ferlus was ahead of his time when in 1979 he proposed the following family tree for Viet-Muong at Figure 3.21.

204

Sidwell Vietnamien Nguôn Muong (nord) Muong (centre et sud)

Viet-Muong

Muong (Uý-1ô) Pong, toum Langues chứt Pakatan, phôn-soung Thavung

Figure 3.21

Vietic (‘Viet-Muong’) family tree by Ferlus (1979).

Though the 1980s it became increasingly clear that Vietnamese and Mương are not especially representative of the branch, especially as better data on the languages of the Lao-Vietnam borderlands revealed them to be quite distinctive, especially in phonology. For example, while Vietnamese and Mương varieties have variously five or six tones depending on dialect, the Arem variety described by Ferlus (1997, 2001) and Kri of Enfield & Diffloth (2009) are toneless, but have a breathy phonation contrast, and plain versus glottalised finals. These and other closely related dialects were recognised as reflecting an archaic sub-group, and recognised as such by Ferlus in his (1989–90) classification: Vietic

Figure 3.22

1.1 Eastern archaic: Mãliềng, Arem, Chứt 1.2 Western archaic: Thavung, Pakatan 2. Pong-Toum group 3. Mương 4. Vietnamese

Vietic sub-groups after Ferlus (1989–90).

Austroasiatic Classification

205

Hayes (1992) proposed to rename the branch Vietic, and treat Viet-Muong as the designation for the apparent sub-grouping of Vietnamese and Mương varieties. He reviewed Vietic classification on the basis of lexicostatistics and phonological innovations, and proposed three coordinate sub-branches as follows: Vietic

Figure 3.23

West Vietic: Thavung, Tong Luông, Bô, Phọng and others Central Vietic: Ɖan Lai, Hung, Toum, Cọi, Cuôi and others East Vietic: 1) Chứt (Arem, Mãliềng, Mày, Rục, Sách, Mụ Già) 2) Mương-Nguồn, Vietnamese (various dialects)

Vietic classification after Hayes (1992).

However, based on the comparative work of Ferlus (e.g.: 1982, 1992, 1996, 1997, 1998, 2001 etc.) it is evident that Hayes erred in sub-grouping Chứt and VietMuong. Probably Vietnamese and Mương form a sub-group which coordinates alongside several other Vietic sub-groups. It also needs to be recognised that recently Phan (2012) presented evidence that Mương is a paraphyletic taxon, i.e. does not uniquely share a common ancestor, but does so with Vietnamese, such that the latter is just a divergent Mương language. This effectively leads us back to the scheme proposed by Ferlus (1979) which can be revised as follows (calling Ferlus’ Archaic group Chut to follow Ethnologue): Vietic

Viet-Muong: Vietnamese (various dialects), Mương Muốt, Mương Nàbái, Mương Chỏi etc. Pong-Toum: Phong, Ɖan Lai, Hung, Toum and others Chut: East: Mãliềng, Maleng, Arem, Kri, Chứt (Mày, Rục, Sách, Mụ Già) West: Thavung, Pakatan Figure 3.24 Vietic classification synthesizing the present state of knowledge.

5 Appendix: AA Language Names, ISO 639–3 Codes and Summary Classification The following list reconciles AA languages names and ISO 639–3 codes (at 2013 Ethnologue.com) with the classifications advanced in this chapter. There are diverse variant forms and spellings of language and dialect names in the literature, and these are covered extensively in the Ethnologue. Also, there are languages which are lacking distinct ISO codes—not listed here—so the following list should not be treated as

206

Sidwell

exhaustive. Multiple items listed together on a single line are regarded as dialects or name variants of one language (for example: there are three codes for Bru reflecting the fact that is spoken in three countries). The 171 ISO coded language names of the Ethnologue are consolidated into 129 on linguistic grounds. Austroasiatic Munda North Munda Korku (kfq) Santali-Mundari Santhali (sat), Mahali (mjx), Turi (trd) Mundari (unr, unx), Asuri (asr), Birhor (biy), Ho (hoc), Koda (cdz), Kol (ekl), Korwa (kfp), Kodaku (ksz), Bijori (bix), Agariya (agi) Sora-Gorum Sora (srb), Juray (juy) Gorum (pcj) Juang (jun) Kharia (khr) Gutob-Remo Remo (bfw) Gutob (gbj) Gtaʔ (gaq) Khasi-Palaung Khasian Khasi-Pnar Khasi (kha) Pnar (pbv) War (aml) Lyngngam (lyg) Palaungic East Palaungic Danau (dnu) West Palaungic Palaung-Riang Palaung (pce, pll) Rumai (rbb) Riang (ril), Yinchia (yin)

Austroasiatic Classification Lametic Lameet (lbn) Khabit (bgk), Buxinhua (bxt) Kháng (kjm), Khao (xao) Angkuic U (uuu) Hu (huo) Mok (mqt) Man Met [mml], Kemiehua (kfj) Tai Loi (tlq) Kiorr (xko), Con (cno), Kon Keu (kkn) Waic Samtau Blang (blr) Bumang (bvp) Samtao (stu) Lawa Lawa (lwo, lcp) Phalok (lwl) Wa Wa (vbm) Parauk (prk) Awa (vwa) Khmuic Khmu Khmu’ (kjg), Khuen (khf), Kuanhua (xnh) Pray-Pram Mlabri (mra) Mal-Pray Mal (mlf) Prai (prt), Pray (pry) Lua’ (prb) Khsingmul (puo) Pramic Phong-Kniang (pnx) Ơdu (tyh)

207

208 Mangic Mang (zng) Pakanic Bugan (bgh), Bogan (bbh) Bolyu (ply) Vietic Viet-Muong Vietnamese (vie) Mương (mtq) Bo (bgl) Nguon (nuo) Pong-Toum Pong (hnu) Cuối (tou) Chut East Chut Arem (aem) Maleng (pkt) Chứt (scb) West Chut Thavung (thm) Katuic West Katuic Bru (bru, brv, xhv) Sô (sss) Kuy (kdt) Nyeu (nyl) Ta’oih Ir (irr), Ong (oog) Ta’oih (tto, tth) Kataang (kgd) Ngeq (ngt), Khlor (llo) Pacoh (pac) Katu Eastern Katu (ktv) Western Katu (kuf) Phuong (phg)

Sidwell

Austroasiatic Classification Khmeric Central Khmer (khm) Northern Khmer (kxm) Bahnaric North Bahnaric Halang (hal), Doan (hld) Jeh (jeh) Kayong (kxy) Takua (tkz) Katua (kta) Hre (hre) Sedang (sed) Todrah (tdr) Monom (moo) Rengao (ren) Kaco’ (xkk), Romam (rmx), Lamam (lmm) East Bahnaric Cua (cua) West Bahnaric Brao-Kravet Kavet (krv), Krưng (krr), Lave (brb) Laven Jru’ (lbo), Sou (sqq) Oi-The Oy (oyb), Sapuan (spu), Sok (skk), The (thx), Jeng (jeg) Nyaheun (nev) Central Bahnaric Alak (alk) Trieng (stg), Talieng (sdf), Kasseng (kgc) South-Central Bahnaric Tampuon (tpu) Bahnar (bdq) South Bahnaric Stieng (sti), Budeh Stieng (stt) Chrau (crw) Sre (kpm) Maa (cma) Mnong

209

210

Sidwell Eastern Mnong (mng) Central Mnong (cmo), Kraol (rka) Southern Mnong (mnn)

Pearic Pear (pcb) Chong Chong (cog) Sa’och (scq) Samre (sxm) Somray (smu) Suoy (syo) Monic Mon (mnw) Nyah Kur (cbn) Asli-Nicobar Aslian North Aslian Batek (btq) Chewong (cwg) Jehai (jhi) Kensiw (kns) Kintaq (knq) Mendriq (mnq) Mintil (mzt) Tonga (tnz) Jah Hut (jah) Central Aslian Lanoh (lnh) Sabum (sbo) Semai (sea) Semnam (ssm) Temiar (tea) South Aslian Mah Meri (mhe) Semaq Beri (szc) Semelai (sza) Temoq (tmo)

Austroasiatic Classification

211

Nicobarese North-Nicobarese Car (caq) Central-Nicobarese Nancowry, Katchal, Camorta (ncb) Kondul (nik) Pulo-Milo (nik) Teressa (tef), Chowra (crv) South-Nicobarese Shom Pen (sii)

6 References Adams, Karen Lee. 1989. Systems of numeral classification in the Mon-Khmer, Nicobarese and Aslian subfamilies of Austroasiatic. Canberra: Pacific Linguistics. Alves 2005. The Vieto-Katuic hypothesis: lexical evidence. In Paul Sidwell (ed.), SEALS XV Papers from the 15th Annual Meeting of the Southeast Asian Linguistics Society 2005. Canberra: Pacific Linguistics, 169–176. Anderson, Gregory D.S. (ed.). 2008. The Munda Languages. London & New York: Routledge. ———. 2001. Recent Advances in Proto-Munda and Proto-Austroasiatic Reconstruction. Paper presented at the 3rd Harvard Round Table on the Prehistory of Central and South Asia, Cambridge, MA, May, 2001. ———. 2004. Advances in Proto-Munda reconstruction. Mon-Khmer Studies Journal 34: 159–184. ———. 2007. The Munda Verb. Typological Perspectives. Berlin: Mouton de Gruyter (Trends in Linguistics, Studies and Monographs, 174). Anderson, Gregory D.S. & Norman H. Zide. 2001. Recent Advances in the Reconstruc tion of the Proto-Munda Verb. In Laurel Brinton (ed.), Historical Linguistics 1999. Amsterdam: Benjamins, 13–30. Baradat, R. 1941a. Les Samrê ou Pear, population primitive de l’Ouest du Cambodge. BEFEO 4.1: 1–150. ———. 1941b. Les dialectes des tribus samre. Manuscript held at l’Ecole Francaise d’Extreme-Orient, Paris. 267p. Bellwood, Peter 2005. First farmers: The origins of agricultural societies. Oxford: Blackwell. Benedict, Paul K. 1990. How to Tell Lai: An Exercise in Classification. Linguistics of the Tibeto-Burman Area 13.2: 1–26.

212

Sidwell

Benjamin, Geoffrey. 1976. Austroasiatic Subgroupings and Prehistory in the Malay Peninsula. In Philip N. Jenner et al. (eds), 37–128. ———. 2012. The Aslian languages of Malaysia and Thailand: an assessment. Language Documentation and Description 11: 136–230. Bergsland, Knut & Hans Vogt. 1962. On the validity of glottochronology. Current Anthropology 3: 115–153. Blagden, Charles Otto. 1906. Pagan Races of the Malay Peninsula Vol. 2, edited by W.W. Skeat and C.O. Blagden. London: Macmillan. Blench, Roger and Paula Sidwell. 2011. Is Shom Pen a Distinct Branch of Austroasiatic? In Sophana Srichampa et al. (eds.), 90–101. Blench, Roger. 2008. Stratification in the peopling of China: how far does the linguistic evidence match genetics and archaeology? In Alicia Sanchez-Mazas, Roger M. Blench, M.D. Ross, Ilia Peiros & Marie Lin (eds.), Human migrations in continental East Asia and Taiwan. Matching archaeology, linguistics and genetics. London: Routledge, 105–132. ———. 2011. Was there an Austroasiatic presence in island SE Asia prior to the Austro nesian expansion? Bulletin of the Indo-Pacific Prehistory Association 30: 133–144. Blust, Robert. 2000. Why lexicostatistics doesn’t work. The ‘universal constant’ hypothesis and the Austronesian languages. In Colin Renfrew, April McMahon, & L. Trask (eds.), Time depth in historical linguistics. Cambridge: The McDonald Institute for Archaeological Research. 311–331. Braine, Jean (Critchfield). Nicobarese Grammar (Car Dialect). Ph.D. dissertation, University of California, Berkeley. Briggs, Lawrence Palmer. 1950. The Khmer Empire and the Malay Peninsula. The Far Eastern Quarterly 9.3: 256–305. Brightbill, Jeremy, Amy Kim, and Seung Kim. 2007. The War-Jaintia in Bangladesh: a sociolinguistic survey. SIL Electronic Survey Reports. Chattopadhyay, Subhash Chandra & Asok Kumar Mukhopadhyay. 2003. The language of the Shompen of Great Nicobar: a preliminary appraisal. Kolkata: Anthropological Survey of India. Chazée, Laurent. 1999. The peoples of Laos: Rural and Ethnic Diversities. Bangkok: White Lotus. Chi, Zhang and Hsiao-chun Hung. 2009. The emergence of agriculture in southern China. Antiquity 84: 11–25. Choo, Marcus. 2012. The Status of Katuic. Chiang Mai, Survey Unit Linguistics Institute Payap University (Research Project SU15). Costello, Nancy A. 1971. Ngữ-vựng Katu: Katu vocabulary. Vietnam Montagnard Language Series 5. Saigon: Department of Education. Cust, R.N. 1878. A sketch of the modern languages of the East Indies. London: Trübner.

Austroasiatic Classification

213

Daladier, Anne. 2002. Khasi. In Encyclopédie des sciences du langage, Dictionnaire des langues. Paris: PUF. Davidson, Jeremy (ed.). 1991. Austroasiatic Languages: Essays in honor of H.L. Shorto. London: SOAS. de Röepstorff, F. Adolph. 1884. Dictionary of the Nancowry Dialect of the Nicobarese language. Calcutta. ———. 1875. Vocabulary of dialects spoken in the Nicobar and Andaman Isles: Calcutta. Diffloth, Gérard, and Norman H. Zide. 1992. Austro-Asiatic languages. In William Bright (ed.), International Encyclopedia of Linguistics. New York: Oxford University Press, Vol. I: 137–142. Diffloth, Gérard. 1974. Austro-Asiatic Languages. In Encyclopaedia Britannica: Chicago: Encyclopaedia Britannica Inc. ———. 1977. Mon-Khmer Initial Palatals and “substratumized” Austro-Thai. MonKhmer Studies 6: 39–57. ———. 1979. Aslian languages and Southeast Asian prehistory. Federation Museums Journal 24ns: 3–16. ———. 1980. The Wa languages. Linguistics of the Tibeto-Burman Area 5.2: 1–182. ———. 1982. Registres, dévoisement, timbres vocaliques: leur histoire en Katouique. Mon-Khmer Studies 11: 47–82. ———. 1982b. Subclassification of Palaungic and notes on ‘P’uman’. Paper presented at the 15th International Conference on Sino-Tibetan Language and Linguistics, August 1982. Beijing, China. ———. 1984. The Dvāravatī-Old Mon Language and Nyah Kur, Monic Language Studies. Bangkok: Chulalongkorn University Printing House. ———. 1989. Proto-Austroasiatic Creaky Voice. Mon-Khmer Studies 15: 139–154. ———. 1991a. Palaungic vowels in Mon-Khmer perspective. In J.H.C.S. Davison (ed.), Austroasiatic Languages, essays in honour of H.L. Shorto. London: SOAS, 13–28. ———. 1991b. Vietnamese as a Mon-Khmer language. In Martha Ratliff & Eric Schiller (eds.), Papers from the First Annual Meeting of the Southeast Asian Linguistics Society, 125–139. ———. 2005. The contribution of linguistic palaeontology to the homeland of Austroasiatic. In Laurent Sagart, Roger Blench and A. Sanchez-Mazas (eds.), The peopling of East Asia: Putting together archaeology, linguistics and genetics London: RoutledgeCurzon, 77–80. Donegan, Patricia J. & David Stampe. 1983. Rhythm and the holistic organization of linguistic structure. Chicago Linguistic Society 19.2: 337–353. ———. 2004. Rhythm and the synthetic drift of Munda. In R. Singh (ed.), The Yearbook of South Asian Languages and Linguistics. Berlin & New York: Mouton de Gruyter, 3–36.

214

Sidwell

Dunn, Micheal, Nicole Kruspe, and Niclas Burenhult. 2013. Time and place in the prehistory of the Aslian languages. Human Biology 85: 383–399. Dunn, Michael, Niclas Burenhult, Nicole Kruspe, Sylvia Tufesson, and Neele Becker. 2011. Aslian linguistics prehistory: a case study in comoutational phylogenetics. Diachronica 28.3: 291–323. Edmondson, Jerold & Kenneth Gregerson. 1996. Bolyu tone in Vietic perspective. MonKhmer Studies 26: 117–133. ———. 1996. English-Bolyu Glossary. Mon-Khmer Studies 24: 133–159. ———. 2010. The Khang language of Vietnam in comparison to Ksingmul (Xinh-mun). In Kenneth A. McElhanon & Ger Reesnik (eds), A mosaic of languages and cultures: studies celebrating the career of Karl J. Franklin. SIL e-Books 19. SIL International: http://www.sil.org/silpubs. Enfield, Nicholas J. and Gérard Diffloth. 2009. Phonology and sketch grammar of Kri, a Vietic language of Laos. Cahiers de Linguistique Asie Orientale 38.1: 3–69. Ferlus, Michel. 1979. Lexique thavung-français. Cahiers de Linguistique, Asie Orientale 5:71–94. ———. 1974. Les langues du groupe austroasiatiques-nord. Asie du Sud-Est et Monde Insulindien 5.1: 39–68. ———. 1982. Spirantisation des obstruantes médiales et formation du systèm consonantique du vietnamien. Cahiers de Linguistique, Asie Orientale 11.1: 83–106. ———. 1983. Essai de phonétique historique de môn. Mon-Khmer Studies 12: 1–90. ———. 1989–1990. Sur l’origine des langues Việt-Mường, Mon-Khmer Studies 18–19: 52–59. ———. 1996. Langues et peuples viet-muong. Mon-Khmer Studies 26: 7–28. ———. 1997. Problèmes de la formation du systèm vocalique du vietnamien. Cahiers de Linguistique, Asie Orientale 26.1, 37–51. ———. 1998. Les systèmes de tons dans les langues viet-muong. Diachronica 15.1: 1–27. ———. 2001. The origin of tones in Viet-Muong. Paper presented at the XI meeting of the Southeast Asia Linguistics Society, Mahidol University, Bangkok, May 16–18 2001. ———. 2004. The origin of tones in Viet-Muong. In S. Burusphat (ed.), Papers from the Eleventh Annual Meeting of the Southeast Asian Linguistics Society. Tempe, Arizona: Arizona State University, 297–313. ———. 2008. Le mot ‘sang’ en austroasiatique. Mon-Khmer Studies 38: 25–39. ———. 2011. Toward Proto-Pearic: Problems and Historical Implications. In Sophana Srichampa et al. (eds.), 38–51. Filbeck, David. 1978. T’in: a historical study. Canberra: Pacific Linguistics. Finck, Franz Nikolaus. 1909. Die Sprachstämme des Erdkreises. Leipzig: B. G. Teubner. Gait, E.A. 1906. A History of Assam. Calcutta: Thacker, Spink & Co. Greenhill, Simon J. & R.D. Gray. 2009. Austronesian language phylogenies: Myths and misconceptions about Bayesian computational methods. In A. Adelaar & A. Pawley

Austroasiatic Classification

215

(eds.), Austronesian historical linguistics and culture history: a festschrift for Robert Blust. Canberra: Pacific Linguistics, 375–397. Gregerson, Kenneth J., Kenneth D. Smith, and David D. Thomas. 1976. The place of Bahnar within Bahnaric. In P.N. Jenner et al. (eds.), 323–369. Gregerson, Kenneth. 1991. On Austronesian Lexicon in Vietnamese. In Jeremy Davidson (ed.), 81–94. Grierson, George Abraham. 1903. Languages of India: being a reprint of the chapter on languages contributed to the Report on the census of India 1901, together with the census statistics of language. Calcutta: Office of the Superintendent of Government Printing India. ———. 1904. Mon-Khmer and Siamese-Chinese Families. Vol. II, Linguistic Survey of India. Delhi: Banarashidaas. ———. 1906. Munda and Dravidian Families. Vol. IV, Linguistic Survey of India. Delhi: Banarashidaas. Haudricourt, André-Georges. 1953. La place du viêtnamien dans les langues austroasiatiques. Bulletin de la Société de Linguistique de Paris 49.1: 122–128. ———. 1954. De l’origine des tons en viêtnamien. Journal Asiatique 242: 69–82. ———. 1965. Mutation consonantique en Mon-Khmer. Bulletin de la Société Linguistique de Paris 60: 160–72. Hayes, La Vaughn H. 1992. Vietic and Viet-Muong: a new subgrouping in Mon-Khmer. Mon-Khmer Studies 21: 211–228. Headley, Robert K., Jr. 1976. Some Considerations On the Classification of Khmer. In Philip N. Jenner et al. (eds.), 431–451. Higham, Charles 2002. Language and farming dispersals: Austroasiatic language and rice cultivation. In Peter Bellwood & Colin Renfrew (eds.), Examining the farming/language dispersal hypothesis. Cambridge: Cambridge University Press, 223–232. Huffman, Franklin E. 1976. The Relevance of Lexicostatistics to Mon-Khmer Languages. In Philip N. Jenner et al. (eds.), 539–574. ———. 1978. On the centrality of Katuic-Bahnaric to Austroasiatic. Presentation at 2nd International Confernece on Austroasiatic, Mysore (India) December 18–21, 1978. Proceedings of the International Congress of Anthropological and Ethnological Sciences, 10.2: 78–79. Kasisopa, B. 2003. Phonological study and genetic classification of Dara-ang (Palaung) spoken at Nawlae village, Fang district, Chiang Mai province. MA thesis, Chulalongkorn University. (In Thai) Keane, Augustus Henry. 1880. On the relations of the Indo-Chinese and Indo-Oceanic races and languages. Journal of the Royal Anthropological Institute of Great Britain and Ireland 9: 254–289. Kingsada, Thongphet and Ryuichi Kosaka. 1999. Khabit. In Thongphet Kingsada & Tadahiko Shintani (eds.), Basic vocabularies of the languages spoken in Phongxaly,

216

Sidwell

Lao P.D.R. Tokyo: Institute for the Study of Languages and Cultures of Asia and Africa, 297–317 Kuiper, F.B.J. 1962. Nahali: A Comparative Study. Amsterdam: N.V. Noord-Hollandsche Uitgevers Maatschappij (Mededelingen der Koninklijke Nederlandse Akademie van Wetenschappen: Afd. Letterkunde; N.R. 25,5) Li Jinfang. 1996. Bugan—a new Mon-Khmer language of Yunnan Province, China. Mon-Khmer Studies 26: 135–160. Li Xulian. 1999. Laiyu yanjiu. Beijing: Zhongyang minzu daxue chubanshe. Liang, Min. 1984. A brief description of the Lai language. Minzu Yuwen 1984.4 Logan, J.R. 1850. On the Leading Charactersitics of the Papuan, Australian, and MalayuPolynesian Nations. The Journal of the Indian Archipelago 4: 344–478. Lợi, Nguyễn Văn. 2008. Tiếng Mảng (Mang Language) Hanoi, Nhả xuất bản Khoa học Xã hội. Luce, Gordon H. 1965. Danaw, a dying Austroasiatic language. Lingua 14: 98–129. Malone, Dennis L. and Suwilai Premsrirat. 2005. Language development and language revitalization in Asia. Mon-Khmer Studies 35: 101–120. Man, Edward Horace. 1889. Dictionary of the Central Nicobarese language (EnglishNicobarese and Nicobarese-English), with appendices containing a comparison of synonymous words in the remaining Nicobarese forms and other matters, preceded by notes on the grammar of the Central form. London: W.H. Allen and Co. Reprinted in 1975 by Sanskaran Prakashak, Delhi. Mason, Francis. 1854. The Talaing Language. Journal of American Oriental Society 4: 277–288. Maspero, Henri. 1912. Etude sur le phonétique de le langue annamite. Les initials. . Bulletin de l’Ecole Française d’Extrême Orient 12.1: 1–127. Matisoff, James A. 2003. Aslian: Mon-Khmer of the Malay Peninsula. The Mon-Khmer Studies Journal 33: 1–58. ———. 1978. Variational Semantics in Tibeto-Burman: the ‘organic’ approach to linguistic comparison. Occasional Papers of the Wolfenden Society on Tibeto-Burman Linguistics, Volume VI. Publication of the Institute for the Study of Human Issues (ISHI), Philadelphia. Migliazza, Brian. 1992. Lexicostatistic Analysis of some Katuic Languages. In Proceedings of the Third International Symposium on Language and Linguistics. Volume III: Bangkok: Chulalongkorn University Printing House. Mikami, Naomitsu. 2003. A Khang Phonology and Wordlist. In Hiromi Ueda (ed.), Reports on Minority Languages in Mainland Southeast Asia. Endangered Languages of the Pacific Rim, Faculty of Informatics, Osaka Gakuin University, 1–42. Miller, John & Carolyn Miller. 1996. Lexical comparison of Katuic Mon-Khmer languages with special focus on So-Bru groups in northeast Thailand. Mon-Khmer Studies 2.6: 255–290.

Austroasiatic Classification

217

Mitani, Yasuyuki. 1978. Problems in the classification of Palaungic. Paper presented at 2nd International Conference on Austroasiatic Linguistics, 19–21 December 1978. Mysore, India. Nagaraja, K.S. 2011. Austroasiatic Languages—an introduction. In K.S. Nagaraja (ed.), Austro-Asiatic Linguistics: In memory of R. Elangaiyan, Mysore: CIIL (India), 1–32. Nagaraja, K.S., Paul Sidwell and Simon Greenhill. 2012. A Lexicostatistical Study of the Khasian Languages: Khasi, Pnar, Lyngngam, and War. Mon-Khmer Studies 42: 1–11. Nguyễn, Tài Cẩn. 1995. Gíao trình lịch sử ngữ âm tiếng Việt (Textbook of Vietnamese historical phonology). Hà Nội: Nhà Xuất Bản Gíao Dục. Ostapirat, Weera. 2009. Early and Modern Pearic Registers/Tones. Paper presented at the 4th International Conference on Austroasiatic Linguistics, Mahidol University, Oct. 29–30 2009. Parkin, Robert. 1991. A guide to Austroasiatic speakers and their languages. Oceanic Linguistics Special Publications No.23. Honolulu: University of Hawaii Press. Paulsen, Debbie. 1989–1990. A Phonological Reconstruction of Proto-Plang. MonKhmer Studies 18–19: 160–222. Peiros, Ilia. 1998. Comparative linguistics in Southeast Asia. Canberra: Pacific Linguistics. ———. 2004. Geneticeskaja klassifikacija avstroaziatskix jazykov: Moskva: Rossijskij gosudarstvennyj gumanitarnyj universitet (doktorskaja dissertacija). Phan, John. 2012. Mường is not a subgroup: Phonological evidence for a paraphyletic taxon in the Viet-Muong sub-family. Mon-Khmer Studies 42:1–18. Phillips, Timothy C. 2013. Proto-Aslian: towards an understanding of its historical linguistic systems, principles and processes. Ph.D. thesis, Institut Alam Dan Tamadun Melayu Universiti Kebangsaan Malaysia, Bangi. Pinnow, Heinz-Jürgen. 1954. Grundzüge einer Phonetik der Kharia Sprache. Unpublished Manuscript, Berlin. ———. 1959. Versuch einer historischen Lautlehre der Kharia-Sprache: Wiesbaden: Otto Harrassowitz. ———. 1960. Uber der Ursprung der voneinander abweichenden strukturen der Munda- und Khmer-Nikobar-Sprachen. Indo-Iranian Journal 4: 81–103. ———. 1963. The position of the Munda languages within the Austroasiatic language family. In H.L. Shorto (eds.), Linguistic comparison in Southeast Asia and the Pacific. London: SOAS, 140–152. Premsrirat, Suwilai. 2002. Thesaurus of Khmu Dialects in Southeast Asia. Institute of Language and Culture for Rural Development Mahidol University Thailand. Proschan, Frank. 1996. A Survey of Khmuic and Palaungic Languages in Laos and Vietnam. Pan-Asiatic Linguistics 3: 895–919. Przyłuski, Jean. 1924. Les langues austroasiatiques. In Les Langues du Monde (Collection linguistique publiée par la société de linguistique de Paris) 16, edited by A. Meillet and M. Cohen: Paris Librarie Ancienne Edouard Champion.

218

Sidwell

Qin, Xiaohang and Fanglan Li. 2012. The status quo and trend of language use by Lai people. Mon-Khmer Studies 40: 107–115. Radhakrishnan, R. 1981. The Nancowry word: phonology, affixual morphology and roots of a Nicobarese language. Current inquiry into language and linguistics, 37, Edmonton, Alberta, Linguistic Research Inc. Rea, John A. 1973. The romance data of pilot studies for glottochronology. In Henry Max Hoenigswald & Robert H. Langacre (eds.), Diachronic, areal and typological linguistics.. The Hague: Mouton, 355–367. Ringe, Don. 1999. How hard is it to match CVC-roots? Transactions of the Philological Society 97.2: 213–244. Rischel, Jørgen. 2007. Mlabri and Mon-Khmer. Historisk-filosofiske Meddelelser 99, Copenhagen. Roberts, H. 1875. Anglo-Khasi Dictionary of the use of schools and colleges. Calcutta. [Revised ed., 1878, Calcutta] ———. 1891. A grammar of the Khassi language. London: Kegan Paul, Trench, Trübner and Co. Schmidt, Pater Wilhelm. 1906. Die Mon-Khmer-Völker, ein Bindeglied zwischen Völkern Zentralasiens und Austronesiens. Archiv für Anthropologie 5: 59–109. Schmidt, Wilhelm. 1901. Die Sprachen der Sakai und Semang auf Malacca und ihr Verhältnis zu den Mon-Khmer-Sprachen. Bijdragen tot de Taal-, Land-, en Volkenkunde van Nederlandsch-Indië, 52: 399–583. ———. 1904. Grundzüge einer Lautlehreder Khasi-Sprache in ihren Beziehungen zu derjenigen der Mon-Khmer-Sprachen. Mit einem Anhang: die Palaung-Wa-, und Riang-Sprachen des mittleren Salwin. Abh. Bayrischen Akademie der Wissenschaft, 1.22.3: 677–810. ———. 1905. Grundzüge einer Lautlehre der Mon-Khmer-Sprachen. Denkschrift der Akademie der Wissenschaften, Wien, Philologisch-Historische Klasse 5.1: 1–233. ———. 1906. Die Mon-Khmer-Völker, ein Bindeglied zwischen Völkern Zentralasiens und Austronesiens. Archiv für Anthropologie, Braunschweig, 5: 59–109. Sebeok, Thomas A. 1942. An examination of the Austro-Asiatic Language family. Language 1.8: 206–217. Shafer, Robert. 1940. Nahali, a linguistic study in Paleo-Ethnography. Harvard Journal of Asiatic Studies 5: 346–71. Shorto, Harry L. 1962. A dictionary of Modern Spoken Mon. London: Oxford University Press. ———. 2006. A Mon-Khmer comparative dictionary: Canberra: Pacific Linguistics. Shorto, Harry L., Judith M. Jacob, and E.H.S. Simmonds. 1963. Bibliographies of MonKhmer and Tai linguistics. London Oriental bibliographies 12. London: Oxford University Press. Sidwell, Paul & Pascale Jacq 2003. A Handbook of Comparative Bahnaric: Volume 1, West Bahnaric. Canberra: Pacific Linguistics.

Austroasiatic Classification

219

Sidwell, Paul and Roger Blench. 2011. The Austroasiatic Urheimat: the Southeastern Riverine Hypothesis. In Nicholas Enfield (ed.), Dynamics of human diversity. Canberra: Pacific Linguistics, 315–344. Sidwell, Paul. 2005. The Katuic languages: classification, reconstruction and comparative lexicon: Munich: Lincom Europa. ———. 2006. Preface to Shorto, Harry L. 2006. A Mon-Khmer Comparative Dictionary: Canberra Pacific Linguistics, vi–xxv. ———. 2009. http://people.anu.edu.au/~u9907217/lexico/AAclassification.html ———. 2010. The Austroasiatic central riverine hypothesis. Вопросы языкового родства/Journal of Language Relationship 4: 117–134. ———. 2011. Katuic-Bahnaric: Austroasiatic sub-family or convergence area? In Sophana Srichampa et al. (eds.), 100–111. ———. 2011. Proto-Khasian and Khasi-Palaungic. Journal of the Southeast Asian Lingistics Society 4.1: 144–168. Smith, Kenneth, D. 1981. A Lexico-statistical study of 45 Mon-Khmer languages. In Andre Gonzalez and David Thomas (eds), Linguistics across continents. Manila: SIL, 180–205. Srichampa, Sophana, Paul Sidwell & Kenneth Gregerson (eds.). 2009. Austroasiatic Studies: papers from the ICAAL4: Mon-Khmer Studies Journal Special Issue No. 3. Dallas: SIL International; Canberra, Pacific Linguistics; Salaya: Mahidol University. Svantesson, Jan-Olof. 1988. U. Linguistics of the Tibeto-Burman Area, 11.1: 64–133. ———. 1991. Hu—a language with unorthodox tonogenesis. In Jeremy Davidson (ed.), 67–80. Swadesh, Morris. 1952. Lexico-statistical dating of prehistoric ethnic contacts: With special reference to North American Indians and Eskimos. Proceedings of the American Philosophical Society 9.6: 452–463. ———. 1955. Toward greater accuracy in lexicostatistical dating. International Journal of American Linguistics 2.1: 121–37. Temple, R.C. 1902. A grammar of the Nicobarese language. Chapter IV, Part II, The Census Report on the Andaman and Nicobar Islands. Port Blair: Superintendent’s Press. Thomas, David. 1964. A survey of Austro-asiatic and Mon-Khmer comparative studies. Mon-Khmer Studies 1: 49–163. ———. 1966. Mon-Khmer subgroupings in Vietnam. In Norman H. Zide (ed.), Studies in comparative Austroasiatic linguistics. The Hague: Mouton, 194–202. Thomas, David, and Robert K. Headley, Jr. 1970. More on Mon-Khmer subgroupings. Lingua 2 (5):398–418. Tuyên, Vương Hoàng. 1963. Các dân tộc nguồn gốc Nam Á ở Tây Bắc Việt Nam. Nxb Giáo dục, Hà Nội. van Driem, George 2001. Languages of the Himalayas: An Ethnolinguistic Handbook of the Greater Himalayan Region: Containing an Introduction to the Symbiotic Theory of Language. Leiden: Brill.

220

Sidwell

von Hevesy, Wilhelm. 1928. Munda-Magyar-Maori: An Indian link between the antipodes; new tracks of Hungarian origins. London: Luzac. ———. 1930. On Wilhelm Schmidt’s Munda—Mon-Khmer comparisons (Does an “Austric” family of languages exist?). Bulletin of the School of Oriental Studies 6.1: 187–200. ———. 1932. Finnish-Ugrische aus Indien: Es gibt keine austische Sprach-familie— das vorarische Indien reilweise finnish-ugrisch. Wein: Manzsche Verlags- und Universitätsbuchhandlung. ———. 1934. A false linguistic family “The Austro-Asiatic”, Journal of the Bihar and Orissa Research Society 20.3–4: 251–259. Vương Hoàng Tuyên. 1963. Các dân tộc nguồn gốc Nam Á ở Tây Bắc Việt Nam (The people of South Asian origin in North Vietnam). Nxb Giáo dục, Hà Nội. Whitehead, George. 1925. Dictionary of the Car-Nicobarese language. Rangoon: American Baptist Mission Press. Zide, Norman H. 1996. On Nihali. Mother Tongue 2: 93–100.

chapter 4

Austroasiatic Comparative-Historical Reconstruction: An Overview Paul Sidwell and Felix Rau1 1 Introduction At the time of writing (early 2014) more than a century and a half has passed since the first recognisable characterisation of the Austroasiatic phylum (Logan 1854) appeared in print, and almost 110 years since Schmidt’s (1905) Grundzüge einer Lautlehre der Mon-Khmer-Sprachen (Outlines of a phonology of MonKhmer languages) made the first substantial presentation of cognate vocabulary and sound correspondences across multiple branches of the phylum. Since that promising start, progress in comparative Austroasiatic (AA) reconstruction has proceeded in fits and starts, largely the work of motivated individuals rather than any coordinated efforts, and significantly undermined by a reluctance to publish provisional results. As it stands, the most comprehensive published work on the subject is Shorto’s (2006) A Mon-Khmer Comparative Dictionary, which itself is a posthumous and incomplete editing of drafts produced between three and four decades ago, and was compiled without the benefit of substantial field data collected at around the same time. This is not to suggest that little has been happening in comparative AA studies— something that I hope is made clear in my recent survey paper (Sidwell 2011c)—but the activity has been largely focussed at branch level, and a provisional phylum-wide synthesis has been lacking. Adopting the working principle (credited to Voltaire) that, “Le mieux est l’ennemi du bien” (The best is the enemy of the good) I take the opportunity presented by the compilation of this volume, to offer an integrated synthesis, a provisional working model of AA 1 Section 3.12 (Proto Munda) of this chapter is jointed written by Felix Rau and Paul Sidwell. Sidwell’s work on this chapter was made possible by generous financial support from the Australian Research Council (project ID: FT120100241), and also builds on earlier work supported by grants from the National Endowment for the Humanities (Washington, Grants PM5001207, PW5038009). Any views, findings, conclusions or recommendations expressed in this publication do not necessarily represent those of the Australian Research Council or National Endowment for the Humanities.

© koninklijke brill nv, leiden, ��5 | doi ��.��63/9789004283572_005

222

sidwell and rau

historical phonology, word-formation and lexicon, while also briefly touching upon syntax. Limited by word count and general pressures of academic work, I am sure that what follows will variously inform and frustrate (if not enrage), but most importantly my hope is that it will move matters forward by stimulating critical response. Before diving into the technicalities, it ought to be asked, just what is at stake in pursuing the reconstruction proto-AA? There are many benefits to investigating the history of a language family beyond merely improving our typology of language change. These lie in extending and/or complementing what may be gained from allied disciplines such as history, archaeology, anthropology etc. In this case the recent prehistory of Southeast Asia, and extending into South Asia, is somewhat obscure beyond the reach of the inscriptional record back to the middle of the first Millennium. The third and fourth millennia before present were evidently important periods of human migration, cultural interaction and innovation, marked especially by the extensive development of rice agriculture, and in the later pre-historic period the spread of Hinduism and Buddhism and the emergence of important regional states. Not only were Austroasiatic speakers at the heart of these developments, but many traces of these transformations remain, directly and indirectly, encoded in their languages. Regions such as the Annamite Cordillera, the Upper Salween, and the Isthmus of Kra, and others, were important prehistoric crossroads of cultures, and their ancient AA -speaking inhabitants have descendents either still living there or relatively nearby. All other language groups of the region (with the exception perhaps of Andamanese) were apparently borne by incoming populations; their early interactions left fingerprints in the lexicon, the phonology, and the other linguistic sub-systems that are revealed by comparative reconstruction. Thus one can argue that protoAustroasiatic occupies something of a priority position in unlocking the human history of greater Mainland Southeast Asia. It must be acknowledged from the start that our knowledge of the AA phylum is somewhat incomplete, such that scholars are still unearthing new languages/dialects, and many AA languages that have been quite well known to scholarship for decades still lack adequate published descriptions. That being said, there are more or less extensive lexical recourses available for around three quarters of the perhaps 150 AA languages, including coverage of all recognised branches, and branch level studies for various branches are well developed. Furthermore, one is able to say that many fundamental problems—especially of the ‘low hanging fruit’ kind—were solved, partly or wholly, by scholarly work of the early to mid-twentieth century, and it is notable that subsequent work has built upon, rather than overturned, these accomplishments. The great pioneers who preceded us (such as Blagden, Schmidt, Haudricourt, Pinnow,

austroasiatic comparative-historical reconstruction

223

Shorto) were able, in the period from the end of the 19th century up to the late 1970s, to lay a foundation that has truly stood the test of time. With the tools of their day—the ledger card, the pencil, the shoebox,2 and the limits inherent in these—they fleshed out hundreds of etymologies, established sound correspondences, proposed various morphological analyses and recognised many of the genealogical groupings that are taken for granted today. Furthermore, and playing no less a role, our understanding of linguistic typology, especially typology of change in linguistic sub-systems such as phonology, has been revolutionised since the mid 1900s and this contributes greatly to the present task. Given the above, the present level of progress is quite sufficient to allow one to flesh out an informative and useful working model, being a crucial programmatic step at this juncture in the history of the field. It is to the shame of latter-day scholarship, spoiled with powerful information technology, telecommunications, cheap travel, and a century of hindsight, that the period from the late 1970s to the beginning of the 21st century saw a broad lack of progress or cooperation, with no dedicated international meetings, few compilations or substantial reference works;3 in short an utter failure of leadership (the “period of relative decline” as van Driem has called it).4 This is not to discount the real efforts of individual researchers that continued through the lean years, filling the pages of Mon-Khmer Studies and manifesting the occasional dissertation or survey work, but the efforts were atomised, uncoordinated rather than cooperative. But the inflorescence of AA studies that emerged in the 1990s—and which pleasingly continues to build momentum—has produced substantial new knowledge and progress, and it is appropriate to tie together the historical threads with the more recent insights to flesh out a programmatic basis for moving forward with the reconstruction. 2

Reconstruction of protoAustroasiatic (pAA)

2.1 Preliminary Remarks Linguistic reconstruction, relying on both comparative and internal methods (as we do in the present case), necessarily entails constructing an edifice that is hypothetical and potentially difficult to test. I take the view that the 2 The actual box for shoes, not the popular software from SIL International, now superceded by Toolbox. 3 Astonishingly the procedings of the Second International Conference on Austroasiatic Linguistics (held in Mysore in 1978) reached a high state of editorial readiness before being abandoned by its editors in 1983. 4 See: http://www.himalayanlanguages.org/icaal.

224

sidwell and rau

activity is legitimately scientific—in the sense that such hypotheses are clearly falsifiable—although the deeper in time we reconstruct the less likely that objective confirmations will be found and we are forced to rely on principles such as economy and typological plausibility to justify our preferred analyses. The reconstruction of pAA is a clear case in point; it is evident that pAA speakers did not leave written traces of their language, nor were numerous pAA loan words somehow preserved in contemporary languages which may be preserved in writing. Although there have been claims from time to time (e.g. Schuessler 2006 and others) that old AA words can be found lurking, for example, in Chinese, we are not able to follow the example of Gamkrelidze & Ivanov (1995) who test their ‘glottalic’ theory of proto-Indo-European in part by reference to apparent loans in Semitic. Instead, we are locked into a reconstruction based upon numerous attested spoken daughter languages plus a small number of older written languages. The latter (Old Mon and Old Khmer) for which there are inscriptional corpora going back to the middle of the first millennium, usefully deliver us back to a time before much of the invasion of SEAsia by rival language phyla, and crucially before the formation of the SEAsian Linguistic Area (see Enfield 2005 for overview) with its extensive phonological and syntactic restructuring. The earliest attempts at historical analysis of AA relied heavily upon reference to these written languages, and understandably so. When westerners came to Southeast Asia in the early and middle modern periods, they encountered Mon and Khmer as living written languages and subsequently linguists came to appreciate that the evident discrepancies between the writing systems of Indic origin and their modern phonetic readings could be used to infer phonological information correlating to older stages of the languages. Other AA languages—purely vernacular tongues such as Bahnar, Pear, Nicobarese, etc.—were also recognised and taken into account, but with the poor development of language documentation before the mid-20th century, nothing rivalled the extent and reliability of phonological information accessible in the inscriptional and epigraphic record of Mon and Khmer. Consequently, there arose a methodological tradition5 that broadly privileged inscriptional and epigraphic data for phonological reconstruction, while invoking evidence from diverse AA languages to support lexical reconstruction. This is not without precedent, and we do well to recall that comparative Indo-European studies had a similar genesis with initial reliance on Sanskrit, Greek, Latin etc. in the neo-grammarian period. From the mid 1960s another 5 First elaborated by Schmidt (e.g. 1904, 1905) and carried through to quite recent times culminating in the comparative work of Shorto (2006).

austroasiatic comparative-historical reconstruction

225

tendency emerged that worked bottom-up, reconstructing branchs and subbranchs, primarily using field data. A partial listing includes: Kherwarian (Munda 1968), Sora-Juray-Gorum (Zide 1982), North Bahnaric (Smith 1972), Mnong (Blood 1966), Katuic (Thomas 1967; Diffloth 1982; Sidwell 2005), VietMương (Barker 1963; Barker & Barker 1970; Ferlus 1992a; 1998; 2005), JehHalang (Thomas & Smith 1967), Semai (Diffloth 1977), Bahnaric (Sidwell 1998; L-Thongkum 2001), Aslian (Phillips 2005; 2012); see Sidwell (2011c) for a more extensive survey. But the bottom-up approach has yet to consolidate into a comprehensive pAA comparative dictionary and reconstruction. The closest thing to this that has appeared in print is the work of Shorto (2006) edited posthumously by myself (and Christian Bauer and Doug Cooper). Shorto relied primarily on the textual evidence of two criterion languages, Mon and Khmer, with etymologies fleshed out further with reference to available wordlists and previous published comparisons, concentrating on 20 main sources that covered nine branches (although with a disproportionate emphasis on Bahnaric that reflected a heavy reliance on Schmidt (1905)). Some adjustments to the reconstructions based narrowly on the criterion languages were made to account for details turning up in the non-criterion languages, but frequently these were presented as alternate reconstructions, so that the primacy of (Old)Mon and/or Khmer evidence was not undermined. The result represents an interim step (criticised as such by Diffloth in his review of 2008) in the broader programmatic imperative: a vital documentation of the state of knowledge four decades ago and framework for extending and refining etymologies and reconstructions. It is also well known that Gerard Diffloth and Ilia Peiros have independently pursued their own consolidated pAA reconstructions and comparative lexicons for several decades, although neither of these have been subject to editorial scrutiny. Peiros (1998: 111) reports that in the 1980s he compiled some 1500 AA etymologies using Khmer, Mon, Chrau, Vietnamese and Wa and devised a preliminary historical phonology.6 He later made an extended compilation accessible online via The Tower of Babel project at http://starling.rinet.ru/, apparently last updated in 2005. Unfortunately this work remains somewhat obscure; the online interface only permits searching by individual etymologies and no documentation supporting the underlying analyses is made available. Diffloth has similarly since the 1980s been reporting that he has a substantial AA compilation of more than 2000 etymologies supported by a phonological reconstruction, and has cited a modest number of provisional proto-forms in 6 In fact Peiros described the process of his compilation and analysis to me in detail in the mid 1990s when he was my graduate superviser at the University of Melbourne.

226

sidwell and rau

various papers (e.g. Diffloth (2005) cites some 20 of his pAA forms for rice terminology and fauna), but again supporting documentation has not been made available. Consequently the only elaborate published reconstruction which we can effectively discuss is that of Shorto (1976; 2006), plus various shorter works that deal with specific aspects of pAA (e.g. works by Patricia Donegan and David Stampe (1983; 2004), and independently Gregory Anderson (2004) to be mentioned below, that offer suggestions concerning the shape of the phonological word and such). As an editor of Shorto (2006), I reviewed the phonological and lexical reconstruction in my PREFACE to the volume. In summary, I arrived at several significant general findings:

· ·

Shorto’s pMK is effectively a pAA reconstruction, although skewed and incomplete in various respects; the historical phonology is heavily skewed towards Old Mon, with the additional factor that Khmer was relied upon for reconstruction of proto-diphthongs; the lexicon was heavily skewed by reliance on Bahnaric, and by the author’s wrong-headed acceptance of the Austric hypothesis (genetic relation to Austronesian), which was further skewed by the history of long term Austronesian (Malay and Chamic) contact with Bahnaric and Khmer; Shorto’s theory of vocalic ‘alternances’—which lead to an inelegant proliferation of alternate reconstructions—is wrong, and arose in part as an attempt to deal with the consequences of relying upon too few criterion languages.

· ·

Notwithstanding the above points, Shorto’s achievement was substantial, although regretfully not made available in the late 1970s when the second draft typescript was completed. The analytical shortcomings mentioned above would have been noted and discussed by scholars, and substantial progress facilitated with several decades’ head start over the situation in which we now find ourselves. Yet it is the nature of things that science progresses in fits and starts, and a generation further on we are in a good position to take stock and map out a basis for moving forward. In the sections that follow I outline my provisional reconstruction of Austroasiatic linguistic history, with particular reference to segmental phonology, phonotactics and lexicon. The chapter ends with a Swadesh list of pAA reconstructions and selected daughter reflexes. Additionally, the question of classification of the languages necessarily arises, although it is dealt with in detail in a separate chapter in this volume. The approach taken here is that

austroasiatic comparative-historical reconstruction

227

AA is treated as effectively consisting of a dozen coordinate branches, and the presence of a feature in at least two non-geographically adjacent branches is required to justify reconstruction to pAA. Insofar as multiple branches may constitute sub-families within a nested tree structure—a possibility which remains controversial—this does appear to materially affect the reconstruction. This is because while supposed AA sub-families remain controversial, geography is straightforward, and proximity of branches would suggest both the possibility of borrowing and special genealogical relations, so controlling for geographical proximity is relied upon to deal with both issues. 2.2 pAA Phonological Word The reconstruction of pAA must begin with a phonotactically reasonable model of root structure and phonological word, which subsequently constrains the segmental reconstruction, helping us to avoid positing unnecessarily complex structures or redundant entities. At the same time one ought to avoid forcing optimally economical or symmetrical analyses that are not supported by typology or otherwise seem phonetically unnatural. We begin here discussing Shorto’s (2006)7 proto-word and phonotactics, and then make reference to apparently conservative structures found in eastern AA language, and the general principles that were first laid out by Stampe in his 1978 presentation “The Austroasiatic Word” delivered at the Second International Conference on Austroasiatic Linguistics (Mysore, India). The reconstruction of Shorto is fairly simple; he posited monosyllables with or without initial clusters (*CVC, *CCVC), including forms that would be characterised as sesquisyllabic8 in contemporary terms, and a modest number of 7 Shorto characterised his reconstruction as Proto Mon-Khmer, as it only obliquely included Munda data, and the prevailing view held that Austroasiatic consists of two coordinate families, Mon-Khmer and Munda. This view is no longer tenable as it has become clear that Munda is simply one of many coordinate AA branches (see the chapter on Classification in this volume) and consequently Shorto’s reconstruction is effectively pAA. 8 Some discussion is needed concerning the term “sesquisyllable” which is much over-used in discussion of AA phonology and about which views differ somewhat. Coined by Matisoff (1973) the term means ‘a syllable and a half’ and was intended to characterise the situation in which a prevocalic consonant is preceded by a ‘pre-initial’ consonant, such as in Khmer psa: ‘market’ or kŋaok ‘peacock’. The areal tendency is to articulate such sequences with a short weak vowel, creating an unstressed minor syllable. The vowel, or from another perspective the syllabicity of this structure, carries no phonological weight, but may have significant downstream consequences for reanalysis and restructuring. Various studies (e.g. Thomas 1992, Thurgood 1999 and others) have identified unstressed penultimate syllables with contrastive vowels (i.e. having phonological weight, although with strong restrictions)

228

sidwell and rau

disyllables that, being iambs, can also be treated as sesquisyllabic. Shorto’s proto-V is long or short and the syllable is always closed (no zero codas, counting finals *h and *ʔ among final consonants). Shorto briefly explains his CCVC forms as follows: *CCVC forms comprise both simplexes containing *CCVC roots, and affixed forms of *CVC ones whether prefixed (*C-CVC) or infixed (*C/C/ VC). [. . . . .] All the consonants listed in the table on p.2.1 may occur at C₂ in structures *CCVC. Glottalized consonants, semivowels and *h do not occur at C₁. Of the nasal series we reconstruct a nasal homorganic with the following consonant (schematically *N), and *m; these two terms are distinctive only before liquids. Shorto 2006: 51

In other words, there are both CVC and CCVC roots, additionally some CCVC forms are prefixed or infixed CVC roots. Also, although their analysis it is not discussed by Shorto, an examination of the CCCVC forms reconstructed shows that in such cases the C₂ is a sonorant, suggesting an infixed CCVC. There are some aspects to this reconstruction that require further discussion. In the first place Shorto mentions that there are some restrictions to C₁ in CC onsets, namely the lack of glottalized consonants (implosives), semivowels and /h/. But this can be turned around and characterized somewhat differently, permitting us to posit two distinct kinds of CC onsets (where C is any consonant and D any plosive stop (oral, glottal or nasal) or /s/), namely: 1. 2.

DR, where are R is an approximant, including /h/; and DD and DĐ, where Đ is an implosive

Type 1 DR are unambiguous monosyllables that conform to the sonority sequencing principle (SSP, see Sievers 1881, Jespersen 1904, Selkirk 1984) and should arguably be grouped with the CVC roots. Potentially various DR onsets where R was /r/ or /l/ could reflect infixed forms, but their contribution appears to have been fairly marginal.

as a kind of sesquisyllable, treating sesquisyllables as a kind of disyllable rather than just a kind of monosyllable that permits initial clusters. The common factor in both definitions is that words are iambic, and have more than one consonant preceding the stressed (or main syllable) vowel, so here the term is used to indicate just these specific features.

austroasiatic comparative-historical reconstruction

229

Type 2 DĐ present a significant structural asymmetry which applies throughout Austroasiatic where implosives are attested; they are always prevocalic. Where the history of the languages are known or reconstructed with confidence, and their implosives are not direct reflexes of the pAA series (such as in Khmer, Vietnamese, Wa), they have always arisen from earlier pre-vocalic voiceless stops. This suggests an internal reconstruction of pre-pAA in which implosives are derived from plosives (see also Sidwell (2008) for speculation of a common origin of implosives and nasal infixes). The point here is that both DD and DĐ onsets reflect one historical phonological category with an internal asymmetry that is both reasonable and potentially explainable. The type 2 onsets include sequences with plateaued or falling sonority (e.g. *pp-, *gc-, *ds-, *rt-, *nr- etc.) and following characteristic areal typology we can expect that these were articulated with a short excrescent vowel (voiced or voiceless, as in Khmer for example; see Huffman et al. 1970:43, and symbolized with the superscript ǝ here), yielding sesquisyllables. As mentioned above, Shorto’s *CCC onsets all have either /r, l, n/ in C₂ position,9 and these would have created phonetically unstressed minor syllables (with or without vowel excrescence due to the sonority of the sonorant), so structurally they can be grouped with the DD/DĐ onsets. Consequently, Shorto’s *CC- onsets can be divided into two types; between monosyllabic and larger structures that do not conform to the SSP and can reasonably be expected to have surfaced as sesquisyllables or disyllables. The *CCVC monosyllables are better analysed as bearing a medial at C₂, as this permits us to identify a single main syllable initial slot (Ci) which further allows any C followed by a vowel in a CVC, or any D followed by a medial (Cm). Thus we have two distinct phonological templates which constitute a provisional pAA word template: pAA phonological word templates: monosyllables sesquisyllables/disyllables *Ci(Cm)VCf *(Cp(n/r/l))CiVCf

How does the above reconstruction square with the result of others? Diffloth has not (to this writer’s knowledge) explained his pAA reconstruction in print, but we can make some observations based on the provisional proto-forms he has presented. Diffloth (2005) discussed 20 proto-forms reflecting rice and faunal terms:

9 To these we might want to add *m, which would also go some way to explaining labial infixes in Khmer and Mon, but examples indicating wider distribution in AA are presently lacking.

230

sidwell and rau

Diffloth (2005) rice and faunal terms, provision pAA reconstructions: Rice terms: Faunal terms: ‘rice plant’ #(kə)ɓaːʔ ‘land/tree monitor’ ‘rice grain’ #rəŋkoːʔ ‘ant-eater’ ‘rice outer husk’ #cəŋkaːm ‘buffalo’ ‘rice inner husk’ #kəndək ‘bear-cat’ ‘rice bran’ #pheːʔ ‘mountain goat’ ‘mortar’ #təmpal ‘asian elephant’ ‘pestle’ #ɟənreʔ ‘peacock’ ‘winnowing tray’ #ɟəmpiər ‘rhinoceros’ ‘to winnow’ #guːm ‘bamboo-rat’ ‘dibbling-stick’ #ɟərmuəl ‘rice-complement’ #kəntuːʔ

#tərkuət #(bən-)ɟoːl, #ɟ(ərm)oːl #tənriak #tənjuːʔ #kiaɕ #kaciaŋ #mraːk #rəmaːs #dəkan

It is immediately clear that Diffloth’s reconstructions are structurally consistent with the templates elaborated from Shorto’s reconstruction; there is only the notational discrepancy that Diffloth insists on syllabifying forms with initial clusters that violate sonority sequencing with a predictable schwa (the /a/ vowel of ‘asian elephant’ appears to be a typo in the original). By contrast, the reconstructions of Peiros reflect a fundamentally different approach. Again, there is no published explanation (as far as I am aware) but Peiros did explain the principles to me verbally several times in the 1990s. Below I table Peiros pAA forms that have equivalents in the Diffloth set above. Peiros (2005) rice and faunal terms, provisional pAA reconstructions (paralleling Diffloth (2005)): Rice terms: Faunal terms: ‘rice plant’ *bʔa ‘land/tree monitor’ *kɔ:t ‘rice grain’ *kɔw ‘pangolin’ *[j]ual ‘rice outer husk’ *kaːm ‘buffalo’ *ra:k ‘rice inner husk’ *dʔǝk ‘mountain goat’ *kɛh ‘mortar’ *tal ‘asian elephant’ *cieŋ ‘pestle’ *rǝj / *rɛj ‘peacock’ *raːk ‘winnowing basket’ *pVr ‘rhinoceros’ *ma:jh ‘to winnow’ *gu:m / *ʔu:m ‘to dibbling’ *mu:l / *mɔ:l

The above proto-forms, and others presented on the website, reveal that Periros adheres strictly to Schmidt’s hypothesis that CVC monosyllables underlie all

austroasiatic comparative-historical reconstruction

231

pAA roots, with affixation accounting for more complex forms. Strikingly, implosives are generally explained as coming from clusters with glottal stops (a device that Shorto resorted to for just a couple of etyma), the final *-s of Shorto and Diffloth is explained as an –h suffix following yod (consistent with Peiros’ 1996 pKatuic, and Schmidt’s 1906 suggestion of Austric suffixes), and zero finals are permitted. Given the lack of textual explanation, and the evident lack of serious development beyond the ideas of Schmidt a century ago, further discussion of Peiros’ reconstruction is is not pursued here. There is additional theoretical and typological support for our model of AA word and root structure in the work of Patrica Donegan and David Stampe (e.g. Stampe 1978; Donegan & Stampe 1983, 2004; Donegan 1993). Beginning with a presentation to the 2nd International Conference on Austroasiatic Linguistics by Stampe in 1978 and a series of further papers, Donegan and Stampe (D&S) have elaborated a theory of the relation between speech rhythm, accent and syllable, which although somewhat controversial, is directly relevant this discussion. D&S demonstrate that the analytical syntax of non-Munda Austroasiatic languages such as Khmer corresponds neatly to the order of bound morphemes imbedded in Munda (e.g. Sora) verbs. The non-Munda Austroasiatic languages show sesquisyllabism, head-dependent ordered analytical syntax and extensive suffixing.10 In a typological mirror to this, Munda languages build words by concatenating phonologically simple syllables, employing dependent-head highly synthetic syntax, with extensive suffixing. And while Munda disyllabic roots are mostly stressed on the rightward syllable, the presence of suffixes means that stress is not on the ultima, delivering trochaic, rather than iambic rhythm. The explanation offered by D&S suggests that languages such as Khmer transmit the pAA pattern of head-dependent SVO (SV/AVP) syntax, while in Munda various originally free morphemes indexing subject, object etc. have grammaticalised into bound morphemes. This supposes a “synthetic drift” in Munda (and by extension independently in Nicobarese as well). However, the field is strongly divided on these questions. A tendency reflected by Pinnow (1959; 1960; 1963), Anderson and Zide (2001) etc. reconstructs elaborate verbal inflection for pMunda and sees apparent traces of historical inflection in various non-Munda languages. . . . They criticize Donegan and Stampe (1983) for the view that the Munda morphology must be seen as in large part due to the independent synthetic drift of the daughter languages rather than due to the breakdown 10 The Nicobarese languages are more Munda-like in this respect; see Critchfield-Brain (1970), Radhakrishnan (1981) for descriptions.

232

sidwell and rau

of a fully formed verbal system in proto-Munda. [. . .] To reconstruct synthetic morphology for proto-Austroasiatic implies that the vast majority of eastern languages lost verbal inflection and morphology and became analytic. But that would entail that in most Mon-Khmer languages that former affixes—elements characteristically faded in pronunciation, grammatical autonomy, and meaning—have been restored to full lexical specificity and function. It is far more likely that Munda and the few eastern languages that show signs of synthesis have innovated it. Dongegan and Stampe 2004: 8

I concur with D&S in this argument, adding phonological considerations. It is more parsimonious in both syllable structure and segmental phonology to reconstruct pAA as morphosyntactically more similar to Khmer than to Sora. Broadly speaking, Munda languages tend to build words by stringing together simple (C)V(C) syllables that lack distinctions of quantity, tone, or diphthongs (notwithstanding local secondary developments). One Munda language, Gta’, shows significant sesquisyllabism, and Anderson (2008a) argues that this is historically secondary. Munda languages also lack contrastive implosion, typically showing a single voiced stop series unmarked for +/– plosive (corresponding to a historical merger of pAA plosive and implosive voiced stops). Having recourse to Munda data alone, it is simply not possible to reliably predict whether cognate forms elsewhere in AA have imploded or plain voiced initials, long or short or diphthongal vowels. On the other hand, recourse to non-Munda forms provides high predictability of Munda cognates, and this is seen in the selected examples below. The ten etymologies below illustrate reflexes of various pAA monosyllables, with and without medials, mostly yielding disyllables in Munda, and several important patterns are evident. Firstly, one can see that Munda disyllables are created from monosyllables with echo vowels. Apparently when the etymological main syllable vowel was *i or *u, it is fully realised in the new syllable; while when it was *a, as in the ‘air, wind’, ‘road, way’ examples below, the quality of the echo vowel is affected by the initial (e.g. > [o] /k_). An exhaustive analysis is yet to be conducted, but the broad outline of the patterning is clear. We also see that the pAA diphthong *iə is consistently realised as a monophthong in Munda (similar for *uə but it is difficult to isolate unambiguous examples). And the last three examples also illustrate the merger of plain and implosive voiced stops in Munda. Finally, there is apparently no basis for predicting the length of the pAA vowel with recourse to Munda data. For the reasons discussed above, it seems reasonable to suggest that the special typological characteristics of Munda are the result of independent shift within that branch, and not inheritances from a Munda-like pAA.

233

austroasiatic comparative-historical reconstruction Non-Munda Munda gloss: ‘thigh’, Shorto (2006) *bluːʔ Khmer: phlɤ̀u Sora: buluBahnar: bluː Kharia: bhulu Palaung: blu Mundari: bulu Temiar: bəloʔ Ho: bulu Nicobarese: pulɔː Kurku: bulu

gloss: ‘dirty’, Shorto (2006) *kmuːʔ Khmer: khmau Mundari: humu Kuy: kmau Ho: homu Palaung: kəmu Kurku: kumu gloss: ‘forest’, Shorto (2006) *briːʔ Bahnar: briː Kharia: biru Praok: praj Mundari: bir Jah Hut: bəriʔ Bhumji: bir Old Khmer: βrai Ho: bir gloss: ‘long’, Shorto (2006) *ɟliːŋ Old Mon: ɟliŋ Mundari: ɟiliŋ Lawa: ʔleiɲ Bhumji: ɟiliŋ Nicobarese: cəliŋ gloss: ‘air, wind’, Shorto (2006) *kjaːl Old Mon: kjal Kharia: kɔjɔ Bahnar: kjaːl Juang: kojo Khmer: khjɔl Santali: hɔe Kuy: kjaːl Mundari: hɔjɔ gloss: ‘road, way’, Shorto (2006) *kraʔ Kuy: knaː Mundari: hora Praok: kra Bhumji: hora Mương: khá Kurku: kora gloss: ‘hail’, Shorto (2006) *priəl Bahnar: prɛl Palaung: preər Khasi: phria

Gutob: Mundari: Gorum:

bireːl, breːl aril areːl

234

sidwell and rau

gloss: ‘two’, Shorto (2006) *ɓaːr Bahnar: ɓaːr Old Khmer: ber Old Mon: ɓar

Kharia: Mundari: Kurku:

ubar bar bari

gloss: ‘spread a mat’, Shorto (2006) *b(e)l Stieng: biːl Palaung: per Riang: pɛl¹

Santali: Sora: Kurku:

bel belbindil

Santali: Sora:

ɟɛmbɛˀd bed-

gloss: ‘suck in mouth’, Shorto (2006) *[ɟ]ɓiət Bahnar:11 biəːt Khasi:12 biat Mon: həbet

Affixation On the basis of what has been discussed so far, we have more or less settled upon a model of pAA that permits mono- and sesquisyllabic word forms, with the acknowledgment that in addition to simple roots there were prefixed and infixed derivatives. Suffixing was unlikely to be a feature of pAA, as there is a lack of direct comparative evidence for this, and it would be inconsistent with the typological indications for a largely isolating iambic language, nonetheless, there is yet to be unanimity on this point among scholars. Pinnow was quite insistent on the antiquity of suffixing: It may be regarded as certain that a large number of both prefixes and suffixes were employed in the Austroasiatic languages at an earlier period. In fact once again neither Munda nor Nicobarese lost their suffixformations but rather extended the use of such formations through the erection of various new suffixes is evidence not for an actual common development but rather for a common tendency. Pinnow 1963: 144–5

The above assertion is both bold and unsupported, and contains within itself an important contradiction. The suffixing that is evident in Munda and Nicobarese is not cognate; and as Pinnow acknowledges, both groups have a tendency for “the erection of various new suffixes”. This is consistent with a

11 ‘to purse (lips) in token of disapproval’ 12 ‘to eat (pulses) a little at a time’

austroasiatic comparative-historical reconstruction

235

hypothetical shift away from pAA iambic rhythm in which more and more head-dependent constructions emerged and grammaticalised to clitics and suffixes. There is a plethora of evidence for the direction of change from analytic to synthetic, while examples of change in the other direction (e.g. such as loss of morphology in, say, Vietnamese) simply has no bearing on the issue. The general picture of word formation outside of Munda and Nicobar is quite clear: infixing is derivational, largely deriving nouns from verbs, and prefixing deals with valence changing and other verbal operations, and more marginally there is prefixing of nouns for deictic distinctions and case-marking (see Alves (2014) for a survey). Beyond that there is varying use of compounding and reduplication, but that is beyond the scope of the reconstruction discussed here. The following pAA affixes were reconstructed by Sidwell (2008): Prefixes causative *p-/*pCreciprocal *t-/*tNstative *h-/*hNInfixes nominalising nominalising iterative nominalising instrumental

*-n*-r-/-l*-p-

And there were certainly others, including the high likelihood of vowel affixes *-ə-, *i- (analysed as allomorphs of consonantal *-ʔ-, *j- by Shorto (1963) in his discussion of affixation in northern MK languages). Prefixes are either single consonants or consonants with sonorous increments (nasal, rhotic or a neutral vowel) that may have been additional infixes and/or played a phonotactic role of creating rhymes for unstressed syllables. Infixes are always inserted to the right of the most leftward consonant, regardless of the length of the word. Given typical restrictions on word length, and rhythmic structure, infixes may also be regarded as prefixes that attach directly to the prosodic head (such as the stance Kruspe (2004: 65–6) takes in her analysis of Semelai). Both forms of affixation lead to resyllabification, creating licit unstressed syllables. In Aslian and Nicobarese, and apparently vestigial in Khmuic, a process of “coda-copying” creates fully-formed unstressed syllables. Also cheekily referred to as ‘incopyfixation’ by Matisoff (2003, see same for a general discussion of the topic), coda-copying may have been a regular feature of pAA. In Aslian the process occurs when infixation of sesquisyllabic stems is accompanied by reduplication of the main syllable coda; a couple of examples from Jahai (Aslian, taken from Burenhult (2005)) follow:

236

sidwell and rau

stem

-n- infixation

tbɔh ‘to beat’

tbɔh

bdɛl ‘to shoot’

bdɛl

assimilation by local rule

mdɛl

coda copying

surface form

tnhbɔh

[tənahˈbɔh] ‘the act of beating’ [məlˈdɛl] ‘shot’

mldɛl

And some apparently fossil forms from Khmu Cuang (from Premsrirat 2001), typically fixed in adverbial expressions: kʰras smglim ‘laugh (smile on the lips all the time)’ klɔːk slkuəl ‘shade of white’ mat rpjeːp ‘wink quickly’

In Nicobarese languages there are no sesquisyllables, so the phonology is somewhat different. In Nancowry the rule is that prefixes of the form CV– when attached to CVC stems with final stops (oral or nasal)—get copy codas, while stems with other codas do not. Some examples (from Radhakrishnan (1970), who incidentally does not discuss the functions for the prefixes, just the phonology): - prefix + prefix vs. hiák ‘smooth’ ʔukhiák ‘broom’ káp ‘bite’ ʔupkáp ‘to bite’

- prefix téha ‘to cut smth.’ jál ‘pour, descend’

+ prefix ʔutéh ‘to cut, harvest’ ʔujálcaŋ ‘to irrigate’

If coda copying were simply restricted to Aslian and Nicobarese one could make an argument that it was a southern Austroasiatic areal development (in spite of the fact that the forms and rules are quite different), but the traces in Khumic are so strikingly similar to Aslian as to suggest a pAA feature. Unfortunately, comparisons that would indict specific pAA reconstructions with coda-copied forms are lacking, but it should be kept in mind as a real possibility. If we are to reconstruct pAA coda-copying, an immediate effect is the possible need to extend our phonological templates of the proto-word to accept sequences of at least four consonants preceding the stressed vowel, and we enter a territory that includes unambiguously disyllabic words. This would also be the case if multiple affixes were permitted to accrue on to mono- and

austroasiatic comparative-historical reconstruction

237

sesquisyllabic stems. This is apparently the situation seen in Katu, as described by Costello: In Katu there are many affixes, occurring mainly on verb roots but also on adjective, adverb and noun roots, and on numerals and locatives. Combinations of prefixes can occur in Katu, which is fairly rare in MonKhmer languages. Each of the main prefixes—causative, reciprocal, adjectivizer and involuntary—is able to occur in combination with the causative prefix. The combination of prefixes occur most extensively on verb roots, but are also able to occur on adjective and noun roots. A word in Katu has a maximum of four syllables, made possible when two prefixes are added to a two syllable root. [. . .] saruum13 ‘to fall’ pajuak ‘to persuade’ karauq ‘to hurt’

tapasaruum ‘to cause each other to fall’ tapajuak ‘to persuade each other’ pakaruaq ‘to cause to hurt’

Costello 1966: 63–4

However, as Costello remarks, such multiple affixation is “fairly rare” and like coda-copying, we lack direct comparisons across between branches that would suggest specific pAA forms with such polysyllabic forms. The comparative evidence we have (such as compiled by Shorto 2006) consistently suggest maximal proto-forms which conform to our *Ci(Cm)VCf and *(Cp(n/r/l)) CiVCf templates. 2.3 Segmental Phonology The first serious step toward a pAA phonology is in Schmidt’s Grundzüge (1905), the core of which is a comparison of written Mon and Khmer with contemporary sources for the Bahnaric languages Bahnar and Stieng. Schmidt’s analysis was confused by his interpretation of the retroflex and voiced aspirate signs adapted to Mon and Khmer from their Indic source, but these issues were more or less sorted out by Otto Blagden.14 A contemporary of Schmidt, Blagden deciphered Old Mon and compiled an enormous Aslian comparative lexicon (Skeat & Blagden 1905). Over four decades, until his death in 1948, he 13 Original notation is preserved for these examples. 14 Blagden's files are presumed lost after the passing of Harry Shorto, although there is a slim possibility that they may still be found (unlabeled or mislabeled) at the SOAS Library in London.

238

sidwell and rau

compiled some 5000 ledger cards (now lost) towards an etymological dictionary of Old Mon. Shorto used Blagden’s compilations and insights in the course of his reconstruction, as well as making liberal use of lexical comparisons and analyses of contemporaries (such as Gordon Luce, H.J. Pinnow and André Haudricourt). Shorto pAA segmental phonology is quite problematic in relation to vocalism, while his proto-consonant inventory is typologically quite typical for eastern AA languages, and appears to be mostly consistent with the model apparently reconstructed (without published explanation) by Diffloth, and it is adopted with only slight modification in the present discussion. 2.3.1 pAA *Ci Consonants In this section we discuss the reconstruction of the segments that occupy the *Ci slots in our *Ci(Cm)VCf and *(Cp(n/r/l))CiVCf pAA word templates. A major insight that our predecessors achieved in respect of this question is the recognition that the system reflected in the inventory of Old Mon consonants more or less directly reflects the pAA inventory unchanged. Later in the Middle Mon period the language underwent extensive phonological restructuring that introduced gross discrepancies between the written and spoken forms of the language, see the introduction to Shorto (1971) for an overview. Various field researchers have also revealed living languages that are of this “unrestructured type” (to use the terminology of Huffman (1985a)) such as Katu (Costello 1971, Costello & Sulavan 1996) and Bahnar (Banker et al. 1979; Guilleminet & Alberty 1959–63) and as will be seen below, various AA branch level reconstructions also indicate similar inventories. Consequently we can reconstruct pAA *Ci inventory as follows: pAA inventory of Ci segments * p t c k ʔ b d ɟ g ɓ ɗ (ʄ?) m n ɲ ŋ w r, l j s h

There remains a question of whether a palatal implosive *ʄ should be reconstructed. The contrast is apparent in Katuic (see Sidwell 2005) but not in other branches. Strict application of the logic of the comparative method would demand extending this to pAA, but I am ambivalent on this at present; refer forward to the section on Katuic below.

austroasiatic comparative-historical reconstruction

239

It is notable that the three series of oral stops (including the incomplete implosive series—an evident example of “velar gap”, see Maddieson 2005) have variously restructured into two series in most branches and ultimately in most present day languages. The general tendency is either to merge the voiced and voiceless plosives into one series or to merge the voiced plosive and voiced implosives into one series. This parallels somewhat the well-known Indo-European Centum-Satum problem that forces the reconstruction of three PIE guttural series. Consider the following table of comparanda of language data from eight branches, including Munda (Sora): ‘drunk’ ‘two’ ‘shoots’ ‘thick’ ‘obtain’

Old Mon Khmer Bahnar Katu Khmu Cuang bəlɓul pùl¹ bəɲul² ɓal — ɓar pìː ɓaːr ɓəːr baːr tɓaŋ tùmpɛ̀əŋ təɓaŋ ʔabaŋ (təpáŋ)3 — — həɓəl (kəbən)4 həmpɯl) — — — ɓoːn bɨan

Palaung Khasi Sora — buː ʔaːc buʔul — — bagu bəŋ — tabəŋ pu rbɛn — bɯn — —

¹ ‘vegetable poison’, ² ‘toadstool’, 3 Kammu Yuan,

It is evident from the above table that the implosive bilabial *ɓ survives in Old Mon, Bahnar and Katu, whereas it has variously merged with the equivalent plain voiced and plain voiceless stop elsewhere. The reflexes of pAA apical *ɗ follow a similar pattern: ‘bamboo ‘water’ ‘span’ ‘cover/hide’ ‘ripe’

Old Mon Khmer Bahnar Katu Khmu Cuang Palaung kɗɯŋ¹ — ɗiːŋ — (Mlabri ɗoŋ) diŋ ɗaik tɨk ɗaːk ɗəək — — cɗaʔ² — səɗaː cadaː sndaʔ (chətháʔ)3 kɗøp tùp kədəp paɗʌp dap dəp — tum ɗuːm ɗam hnduum4 kəndum5

Khasi Kharia tɨndɔŋ — — ɖaʔ sda — — ɖaˀb — —

¹ Middle Mon ‘areca-nut container’; ² Middle Mon; ³ Lamet; ⁴ ‘to ripen’; 5 ‘bud’

Aside from these stop series, the other *Ci correspondences (sonorants, approximants) are trivially similar and present no special problems of interpretation. A summary tabling of the correspondences at proto-branch level follows:

240

sidwell and rau

pPearic

Old Mon

pAslian

pNicobarese

pBahnaric

pKatuic

pVietic

*f*p*p*t*t*dr*h*-h*s-/h*s*c*k*k*m*n*ɲ*ŋ*w*ɽ*l*j-

pKhmuic

*p*b*b*t*d*d*h*-h*s*c*ɟ*k*g*m*n*ɲ*ŋ*w*r*l*j-

pPalaungic

*p*b*ɓ*t*d*ɗ*h*-h*s*c*ɟ*k*g*m*n*ɲ*ŋ*w*r*l*j-

pKhasian

*p*b*b*t*d*d*h*-h*s*c*ɟ*k*g*m*n*ɲ*ŋ*w*r*l*j-

pMunda

*p*p- *pʰ-/*p- *p- *p- *p*p*p*p*b*b*b*b- *b- *b*b*b*b*ɓ*b*b*ɓ- *b- *ɓ*ɓ- *ɓ-/*b- *b*t*t*tʰ-/*t- *t- *t- *t*t*t*t*d*d*d*d- *d- *d*d*d*d*ɗ*d*d*ɗ- *d- *ɗ*ɗ- *ɗ-/*d- *d*h- *Ø*h*h- *h- *h*h*h*h*-h- *-Ø*-h*-h- -Ø- *-h- *-h*-h*-h*s*s*s*s- *s- *s*s*s*s*c*s*c-/s- *c- *c- *c*c*c*c*ɟ*ɟ*ɟ-/*j- *ɟ- *ɟ- *ɟ- *ɟ-/*ʄ*ɟ*ɟ*k*k*kʰ*k- *k- *k*k*k*k*g*g*k*g- *g- *g*g*g*g*m- *m*m*m- *m- *m- *m*m*m*n*n*n*n- *n- *n*n*n*n*ɲ*ɲ*ɲ*ɲ- *ɲ- *ɲ*ɲ*ɲ*ɲ*ŋ- *ŋ-/*ɲ*ŋ*ŋ- *ŋ- *ŋ*ŋ*ŋ*ŋ*w- *w*w*w- *w- *w- *w*w*w*r*r*r*r- *r- *r*r*r*r*l*l*l*l- *l- *l*l*l*l*j*j*j*j- *j- *j*j*j*j-

pAA

pKhmer

AA summary proto-branch level Ci correspondences

2.3.1.1 Speculation Concerning Infixes The reconstruction of pAA consonants for each structural slot reveals an intriguing asymmetry: we observe that while nasal infixes are quite common, initial nasals in monosyllabic roots are relatively infrequent (about 5% of total), and often transparently secondary (e.g. West Bahnaric *niəm ‘good’ < pBahnaric *liəm ‘good’). By contrast, nasal codas are strongly preferred word finally, the proportion typically exceeding 25% of the lexicon. These tendencies suggest that during a specific historical stage nasals were not preferred as leading onsets. It is also curious that the implosives only occur prevocalically

austroasiatic comparative-historical reconstruction

241

in pAA, even though there are no articulatory reasons why they should not occur preconsonantally (as they do in Bahnar for example, see Banker et al. 1979). Thus we can say that the following were close to being in complementary distribution in pAA:

· ·

(C)ɗ/ɓV(C): permitted where the implosive is the etymological Ci (C)n/mV(C): permitted where the nasal is an infix or otherwise as second member of a cluster or secondary (by assimilation etc.) Consequently, it may be tentatively suggested that the nasal infixes may have originated as implosives, perhaps (speculatively) as metathesized prefixes. 2.3.2 pAA *Cf Consonants It is generally acknowledged that a characteristic typological trait among conservative AA languages is that the set of Cf segments is equivalent to the Ci set minus any distinction of voice onset timing or phonation. On this basis we hypothesize that pAA follows the same pattern, which can be illustrated with reference to a language such as Katu. Compare below the sets of initials and finals of Katu as described in works such as Costello (1971), Costello & Sulavan (1993), L-Thongkhum (2001) (with notation adjusted to IPA as appropriate, also note that Costello & Sulavan (1993) list complex finals /jʔ/ and /wʔ/ although these are explained as decomposed *c and a long back vowel before *-ʔ respectively): Katu: Ci consonants Cf consonants pʰ tʰ kʰ p t c k ʔ p t c k ʔ b d ɟ g ɓ ɗ ʄ m n ɲ ŋ m n ɲ ŋ w r, l j w r, l j s h s h

Additionally, there are various language specific differences—typically smaller inventories due to losses and/or mergers. In some cases mergers or losses among Cf segments are associated with the emergence of tones or phonation types (e.g.: loss of *-h, *-s is associated with hỏi and ngã tones in Vietnamese), or vowel changes (e.g.: fronting of vowels before palatals that may/may not have later depalatalized), or apparent voicing of codas associated with long

242

sidwell and rau

stressed vowels (e.g. in Khasi). To a great extent, changes in Cf segments leave traces which permit secure reconstructon of proto-values on internal grounds, confirming the pattern described above. Consequently, this typological argument leads us to reconstruct the following pAA Cf inventory: pAA Cf consonants * p t c k ʔ m n ɲ ŋ w r, l j s h

There are also strong indications that point to the absence of open (vowel final) stressed syllables in pAA, and this is a feature of Shorto’s reconstruction. This state of affairs persists in various AA languages, such as in Khmuic (e.g. Khmu Chuang, Mlabri), in North and Central Aslian, and in Old Mon, where words with vowel final main syllables are restricted to loans and a few monosyllabic grammatical morphemes. This is consistent with pAA being iambic in structure, as Donegan relates: Shorto (1960) described the word in the Mon-Khmer language Palaung as consisting of a heavy ‘major’ syllable preceded by an optional light ‘minor’ syllable. The major syllable is heavy, so its two moras fit into the accent: the rising accent is word-final. Donegan 1993: 6

With the stressed syllable necessarily heavy, the rhyme has at least two morae, and especially a short syllable must have a coda. This connects with the idea of a “bimoraic constraint” in AA, which has been offered by Anderson (e.g. 2004) to explain, among other things, the resyllabification and/or reduplication of monosyllabic free form nouns in Munda languages. It is also important to note that AA oral stop codas are typically unreleased and tense, accompanied by glottal restriction that halts the airflow through the larynx such that voicing ceases more or less simultaneously with oral closure. Consequently, there is no audible voicing that would occur if the airflow continued until equalisation of sub- and supra-glottal air pressure. From this perspective it may be more appropriate to notate/reconstruct the proto-oral stop codas as follows: * pˀ tˀ cˀ kˀ Øˀ

austroasiatic comparative-historical reconstruction

243

It is also worth making some points about word final /s/. It may be that this fricative coda is best reconstructed as a laminal [ɕ] or similar as many daughter languages do actually reflect this final as post-alveolar, or showing a wide allophonic range between alveolar and palatal. This is often noted in sources with a digraph notation ih, yh or similar, indicating a voiceless palatal fricative with noticeable vowel transition approximating a brief non-syllabic [i], [ɪ] or [e] (or also commonly misanalysed as a cluster). The indicative IPA notation for such a sound should be [ç] or [j̊], but these are rarely encountered in the relevant language documentation (an exception of note is the use of ẙ by Diffloth (1984) for this coda in Nyah Kur, with the non-IPA use of y for the palatal). This hints at another issue in AA linguistics which has not attracted much attention, namely that in terms of oral place of articulation, it is apparent that active articulator rather than passive articulator place of contact/ approximation is often primary in the articulatory phonology. However, this is not captured nicely in IPA notation (ɕ, ȶ, ȵ etc. commonly used by Sinologists for laminals are not IPA, which instead requires the rather inelegant “square below” diacritic) and it is the historical tendency in the field that phonetics is taught with a bias to classifying articulations primarily in terms of place of passive articulator contact/approximation. The upshot is that the use of the s glyph for this coda is for consistency with the equivalent onset consonant, rather than faithful phonetic characterisation. In any case, the fricative space is clearly wide open and a dynamic range of allophones is normal. While various languages have retained the set of proto-finals mostly unchanged, many also show a reduction in the number if distinctions, often quite drastically. Common changes include:

· · · ·

loss of final glottal stop, this happened generally in various branches, and partially in Bahnaric; loss of orals stop articulations with merger to glottal stop, especially /k, ʔ/ > /ʔ/; merger of palatals and velars (or less commonly palatals and apicals) leading to an absence of final palatal stops; mergers or total loss of finals /r, l/, or even merger with /n/ (e.g. in Khasi). On the geographical periphery of the AA speaking areas there are striking secondary phonetic developments among the Cf that do not necessary change the phonological oppositions, but do reflect interesting areal and internal typological restructuring issues, as well as occasionally leading to terminological and notational confusions.

244

sidwell and rau

An innovation common to the Munda languages was a reanalysis of the final stops as voiced, presumably when the languages became suffixing and, once in a medial position (rather than final position), these stops—unmarked for phonation—became associated/analogised with the voiced (less marked series of) initials. The glottal constriction remains part of their articulation and is much more acoustically salient since there is an audible interruption to voicing within the phonological word. The glottal constriction is commonly written with a raised comma before the stop by Mundarists (e.g. Kharia mɔ’ɖ, Mundari mɛ’d ‘eye’ etc.), reflecting the phonetic fact that the glottal stop is released and voicing resumes during oral closure. Word finally in free forms, the stop coda can be voiceless leading to a voicing alternation (e.g. Sora [pətoˀt] vs. [pətod-ən] ‘hole’, see Donegan and Stampe (2004: 22) for discussion). Within Aslian there is a phenomenon of denasalisation of nasal stop codas that can lead to complete merger of oral and nasal stops. We defer to Matisoff’s description of the problem: Even the earliest collectors of vocabulary from Aslian languages were struck by peculiar complex final nasal sounds whose onset was the homorganic stop: -pm, -tn, -cɲ, kŋ, Phonetically, sometimes the nasal component seemed to predominate, but sometimes the occlusive portion was more salient and the collector wrote the stop on the line and the nasal as a superscript: -pm, -tn, -cɲ, kŋ. Blagden aptly named these sounds ‘disintegrated nasals’ since they represent a breakdown of ordinary ‘nasal stops’ (where the oral occlusion and the lowering of the velum are simultaneous) into separate buccal and nasal phases (the velum is not raised until the oral occlusion has been phonated). Blagden deplored this ‘careless, slovenly articulation’ and observed that the dirty habit was by no means confined to the aboriginal tribes of the Peninsula but was shared by such insular Austronesian groups as the Dayaks of West Borneo. Matisoff 2003: 18–19

The patterning varies across Aslian and is one of the principle facts justifying the internal classification of the languages. The process is related to historical vowel length; as Matisoff relates in the case of Sabum, denasalisation is complete after short vowels yielding voiceless stops, while the original stop codas have become voiced. This yields the typologically unusual situation of contrasting voicing in stop codas (see discussion of Aslian further below). Finally, there have been suggestions from time to time that a larger series of finals is indicated for pAA. Diffloth (1976) tentatively proposed a *-s, *-ś

austroasiatic comparative-historical reconstruction

245

distinction on the basis of some correspondences, and Shorto (2006) reconstucted suffixes *-h, *-s (on the model of Benedict’s (1972) proposals for protoSino-Tibetan) to explain similarly anomalous apparent correspondences. However, such proposals are lacking typological and/or morphological justification, such that the correspondences underlying them are probably better regarded as epiphenomena rather than real. 2.3.3 pAA Creaky Voice? A marginal feature of Shorto’s (2006) reconstruction are 55 proto-forms with a post final glottal stop (e.g. §497 *kaːŋʔ ‘chin, jaw’, §1444 *kuːjʔ ‘to bend, nod, drowse’, etc.). The function of this glottal in the reconstruction is to account for Vietnamese forms with sonorant codas that have sắc and nặng tones which were otherwise explained by Haudricourt (1954) as reflecting stopped codas. This appears to be an ad hoc solution to a problem that is far more extensive than Shorto anticipated. Diffloth (1989) draws attention to the fact that several AA branches show languages with a contrast of creaky voice and/or preor post-glottalisation of finals that effectively constitute a register distinction (analogous to the tense/lax systems found in various language due to devoicing of initial stops but apparently unrelated to them). Throughout Vietic languages there is a regular contrast of plain versus post stopped continuants, even in non-tonal Vietic languages such as Arem. In Katuic languages Ong, Talan, Katang (depending on the description), a similar situation prevails, although with complications; tense register final nasals are post-stopped in short syllables and pre-stopped in long syllables, while etymological oral stop codas in tense syllables surface as post-stopped nasals. In Pearic, the Chong dialects have a three- or four-way contrast that variously combines modal, breathy and creaky phonation. The conventional explanation for voice registers in Austroasiatic languages relates breathy phonation of voiced onsets (see Ferlus (1980) for a wide ranging discussion, Huffman (1985a) for a schematic account, and Thurgood (2002) for a detailed articulatory analysis), but this does not explain the cases of creaky voice since—as Diffloth shows—there is no correlation between these distributions of this “creak” and the historical phonation of initials. To confuse matters further; Diffloth also demonstrates that the creak does not pattern across cognate vocabulary items, which appears to argue against a straightforward inheritance explanation. Nonetheless, Diffloth favours descent from pAA: A long interim period evidently separates Proto-Austroasiatic: from each of the proto-branch levels. I have mentioned a few systematic distributional differences between the creakiness of Proto-Katuic, Proto-Pearic

246

sidwell and rau

and Proto-Vietic; these differences are probably the result of distinct innovations which occurred during intermediate periods. If we can discover more of these early innovations, and identify inter-branch borrowings, we may be able to explain the inconclusive etymological evidence shown above and reconstruct creakiness at the PAA level; for the moment, it is just a presumption. Diffloth 1989: 150

In addition to the tree branches discussed by Diffloth, Zide (1965; 1987) suggests reconstructing a glottal prosody to explain certain phonological changes in Munda, and Shintani (2008) documents four southern Palaung dialects which show a similar contrast of pre-stopped nasals versus plain oral and nasal stops (in this case the pre-stopped nasals appear to reflect decomposed oral stops). Thus there are five AA branches with indications of some kind of glottal prosody, although phonetically diverse and not showing regular correspondences in cognate vocabulary. Furthermore, aside from the Munda examples in this case (which have a different explanation, see section on Munda below), the other examples all have an important historical-typological feature in common: restructuring of their onset stop series, with devoicing of voiced stops and/or aspiration of voiceless stops, changes that can be characterised as a general tensing of the phonological system. For the time being, this suggests to me that the origins of this feature lie in linguistic tendencies linked to sesquisyllabism and iambicity, quite liable to arise independently, consistent with the failure to pattern regularly across cognate vocabulary. Ferlus (2011) has suggested that the glottal stricture prosody in Pearic, Katuic and Vietic languages is linked to a combination of the more tense articulation of words with sesquisyllables (as opposed to simple monosyllables) plus contact influence from Middle Chinese. This relies on the observation that during the period from the 3rd to 8th Centuries there was an active overland trade route from Vietnam, over the mountains and though Cambodia to the Gulf of Thailand, so that contact between Chinese and local AA speakers would have occurred. This is clearly a speculative hypothesis, since it supposes a significant linguistic change without evidence of extensive Chinese lexical borrowing. Also it would not appear to explain the Palaungic examples, although it would be premature to deny any merit in Ferlus’ proposal. The bulk of the people traversing the overland trade route may well have been Vietic speakers working for/with the Chinese, so some kind of linguistic area is possible. And if the underlying driver of this restructuring lies within the shared typology of the languages it may not have taken too much of a push to head down a common path.

austroasiatic comparative-historical reconstruction

247

Considering all of the above, it would be premature to reconstruct creak or glottalisation as a feature of pAA or to posit it for specific etyma, and we withhold judgement for now. 2.3.4 pAA Cp Consonants The challenge of reconstructing pAA Cp segments by direct comparison and analysis of segmental correspondences, etymology by etymology, is quite significant. As Anderson (2004) remarks: One of the outstanding problems in comparative Austroasiatic linguistics is the at times frustrating lack of direct correspondences of full forms of nouns both within individual Austroasiatic subgroups as well as across these groups. For example in Munda, it is often easy to isolate a monosyllabic root in nouns, but free forms are unrecoverable for the protolanguage. Anderson 2004: 163

It is my assessment that Anderson is too pessimistic about regular correspondences, for example he made an unfortunate choice relying on Peiros’ (1996) Katuic compilation and reconstruction to test for regularity of correspondence, but he makes a good point. Given sesquisyllabism and iambicity, it is inherently more likely that Cp segments will be lost, replaced, reanalysed or analogised, therefore direct correspondences will be less frequent than for other segments. For the present, I provisionally accept and report the solution provided by Shorto (2006) in broad terms, though we may differ in relation to analysis of specific etyma. According to my reading of Shorto (2006), there were apparently only a few important constraints on the possible consonant sequences within initial clusters; namely /j, w, h, ɓ, ɗ/ did not occur as Cp. The main problem is to resolve the voice quality of stops in the preconsonantal position, since a strong tendency across AA is for many languages to permit no, or only limited, voicing contrast in this position. Shorto’s solution was to infer voice quality by reference to the phonation types associated with the registers of Mon and Khmer, proposing the following relations: pMK (pAA) voiceless + voiceless voiceless + voiced voiced + voiceless voiced + voiced

Mon head register chest register head register chest register

Khmer head register head register chest register chest register

248

sidwell and rau

The underlying principle is that Mon registers consistently reflect the voice quality of the prevocalic consonant, while Khmer registers regularly reflect the voice quality of the preceding consonant. The formulation appears—so far as I can tell—to hold up fairly robustly, although it requires more comparative work to show its reliability; for the present it is accepted as a working model for predicting voice quality of Cp segments. The typology of pAA initial clusters predicted by the model finds a very close parallel in contemporary Khasi (and other AA languages of Meghalaya). At this point it is worth quoting Henderson on Khasi: One of the most striking features of Khasi is the astonishing richness of its word-initial clustering possibilities. This great variety of initial clustering patterns is a typological feature of other Austroasiatic languages, as for example Khmer and Old Mon, and contrasts sharply with neighbouring Sino-Tibetan languages in which permitted initial clusters are relatively few in number and restricted almost exclusively to plosives plus following liquids or semi-vowels (l, r, y, w). Rabel, who describes the variety of Khasi initial clusters as “absolutely phantastic” (Rabel 1973),15 lists 127 two-consonant clusters with examples of the occurrences with each of twelve final consonants (Rabel 1961: 21–29). Henderson 1976: 523

Henderson’s paper draws attention several important points:

· · ·

the Khasi initial clusters are not unique, as similar typology is found in various AA branches; the clustering possibilities are very rich, including many that violate the general linguistic tendency to prefer rising sonority syllable initially; the clustering of initials is robustly distributed, regardless of adjacent main syllable vowels or codas. Henderson’s work points out that the more typologically unusual clusters are associated with expressive/iconic vocabulary. This is interesting, and maybe significant for our reconstruction; expressive/iconic vocabulary is generally a strong feature of AA languages and it is appropriate to reconstruct a similar situation in the proto-language. The usefulness of Henderson’s analysis for 15 It is not clear to which paper Henderson is referring here, as it is not listed in the References, but the year suggests a draft paper circulated at the 1973 ICAAL meeting in Hawaii.

austroasiatic comparative-historical reconstruction

249

phonological reconstruction—I suggest—lies in the consideration that if one is faced with ambiguous indications in phonology, the semantics of the etymology in question may influence our proto-phoneme assignment. Based on the above considerations we propose the following pAA Cp inventory: * p t c k ʔ b d ɟ g m N r, l, s

The *N represents a nasal that assimilates to the place of the following consonant, while *m appears to retain its phonological independence. 2.3.5 pAA Vowels The reconstruction of pAA vocalism is recognised as profoundly difficult. Thomas (1964) in his overview of comparative studies remarks: Still another difficulty, one that besets all Mon-Khmer comparativists, is the complexity of the vowel shifting that has taken place in Mon-Khmer, making it very difficult to establish regular patterns. Schmidt, after a careful survey of the situation, had to content himself with just statements about possible general trends, establishing no sound-laws. Other comparativists have stated flatly that regular sound-laws simply do not exist in Mon-Khmer vowels, and, indeed, no one has yet succeeded (in print, anyway) in establishing a regular pattern in Mon-Khmer vowel comparisons. I feel that the solution, however, lies in starting at the very lowest level of comparison, working on adjacent languages to establish protoforms at that level, then using these reconstructions as the basis for comparison on the next level. Only in this way, I feel, will the Mon-Khmer vowels be able to be solved. Thomas 1964: 160–1

Thomas was writing half a century ago, not long after Pinnow’s Versuch had appeared and low level reconstruction had not been attempted on a wide scale, largely a function of insufficient reliable data. Since that time, scores of comparative analyses, some quite short and others quite substantial in coverage, have appeared, and tremendous progress has been achieved more or less directly in the manner that Thomas suggested, albeit organically in an uncoordinated effort rather than by an overt program.

250

sidwell and rau

In parallel to the broad scale bottom-up approach, Shorto devised a kind of a shortcut, using just two languages (Mon and Khmer) to make a first approximation of the proto-vocalism. He explains: The new approach I propose here depends on the assumption that a number of types of vowel variation or gradation existed in the proto-language. Before setting out the data, it is desirable to examine the consequences of this assumption. The basic vowel system will be reflected in a series of “regular” correspondences from which those due to variants have to be distinguished. Given that the incidence of vowel variation is limited, and that variant forms are distributed more or less randomly in the modern dialects, it follows that the wider the extension of a cognate set the greater the chance of its including one or more vowel variant forms. A comparison which embraces the largest possible number of languages will elicit a very large number of over-lapping, partly similar correspondences. On the other hand, in a comparison restricted to two languages we could expect a relatively high number of regularly related cognates and a smaller number of “irregular” ones attributable to variation. Shorto 1976: 1041–2

Applying this method, Shorto (1976) proposed the following inventory, with one additional tentative member added to a later (2006) version of the reconstruction: */ i u iː uː e ə o eː əː oː a ɔ aː ɔː iə (ɯə) uə ai / pMK(AA) vowel inventory proposed by Shorto (2006)

This is the “proto-Mon-Khmer” vocalism used in his comparative dictionary, and it does have some merit, although the tolerance for multiple correspondences and variant forms is untidy, frustrating and frankly unrealistic. Shorto was not unreasonable to posit some kind of vowel variation or gradation in the proto-language, as variation does occur, especially in expressive or descriptive vocabulary, right across AA. For example, Nagaraja (1985) gives the following illustration from Khasi:

austroasiatic comparative-historical reconstruction

251

ap dŋaŋ ‘wait for- to get something’ ap dŋeŋ ‘wait for- an opportunity’ ap dŋoŋ ‘wait for- lest anything happens’

hap ‘open’ (the mouth) (small) hep ‘open’ (the mouth) (big) hop ‘open’ (the mouth) (small) hup ‘open’ (the mouth) (very large or big)

The problem is that while this kind of iconic vocalism, in which degrees of openness and backness broadly correlate with diminution/augmentation etc., varies tremendously across the phylum and direct comparisons simply multiply the correspondences, much as Shorto anticipated. I take the view that it is appropriate to devise a method that initially yields a “core” system of correspondences which can later be refined by more detailed work, but I suggest that this is best done by extending the range of languages in the comparison while excluding initially etyma showing variation that multiplies the correspondences in a problematic manner. The result insofar as Shorto’s reconstruction is concerned, is that while he has reconstructed a vowel inventory that is probably quite close to the real pAA values, we are often at a loss to know which particular vowel has priority for a given eytmon. In the most extreme example, §1058 ‘to pull hard on, pull up, out, reap’ there are 11 variant proto-forms, all with the shape *rVt or *rVc employing 11 different vowels. Similarly, §1669 ‘to go round, to turn round’ has eight variants with the shape *wVr. One can accept that such monosyllabic verbs are eminently liable for expressive elaboration or deformation, but the method employed by Shorto does not lend itself to discovering any constraints or other specific properties of the supposed system of gradation, and instead manufactures a monstrous proto-vocabulary. Shorto neglected to consider other sources of vowel variation, especially phonological change associated with inherent tendencies in the syllable structure, which only came to be reasonably well understood a decade or so after Shorto presented his proto-vocalism. AA languages often have very large vowel inventories, perhaps the largest in the world (e.g. 42 vowel phonemes claimed for Bru in the analysis of Vương Hừu Lễ (1999)), but we are in the happy position that the diachronic typology of AA vowel restructuring, including vowel splitting, raising, diphthongisation, and register-genesis, is quite well understood. Huffman (1985a) lays out a neat summary of the restructuring processes that frequently give rise to diphthongal vowels and phonation contrasts in Austroasiatic, frequently associated with consonant phonation:

252

1. 2.

sidwell and rau Proto-language

Conservative

Transitional

Register

Restructured

Tonal

*/kaː/ */gaː/

/kaː/ /gaː/

/kaː/ /kʰaː/

/kaː/ /kàː/

/kaː/ /kia/

/káː/ /kàː/

Huffman (1985a:141) idealized scheme of phonological restructuring in Austroasiatic

The basic parameters are that voiceless onsets are associated with tense or 1st register vocalism, and voiced onsets with lax or 2nd register vocalism. The effect, which is most marked among long vowels, is for more raised vowel onsets in second register and/or lowered onsets in the 1st. This potentially leads to extensive diphthongisation, such as Huffman diagrammed for Bru and Katang, both Katuic languages: Register

2nd 1st 2nd 1st 2nd 1st

Bru (Ubol)

Katang (Salavan)

iː ↓ eiː

ɨː ↓ ɤɨː

uː ↓ ouː

iː ↓ ɪː

ɨː ↓ ɤː

uː ↓ ʊː

eː ↓ ɛeː

ɤː ↓ əɤː

oː ↓ ɔoː

iːe ↑ eː

ɨː ↑ əː

uːo ↑ oː

(ɛː) ↓ aɛː

iːa ↑ aː

uːa ↑ ɔː

ɛː

ɨːa ↑ aː

uːa ↑ ɔː

Huffman (1985a: 142–3) scheme of vocalic restructuring in Bru and Katang.

A strong tendency is for high lax vowels and low tense vowels to remain stable, while mid vowels can go either way—note how Bru and Katang contrast in this respect. More broadly, it should be obvious that, even though Huffman is discussing languages that have more or less neatly phonologised register contrasts, the underlying tendencies are latently inherent in any system of syllables with differing degrees of tension in their onsets, and one would reasonably expect to see various examples of raising, lowering and diphthongisation

austroasiatic comparative-historical reconstruction

253

right across AA, resulting in an awkward proliferation of vowel correspondences. The theory thus laid out by Huffman gives us a ready framework to make sense of numerous restructured vowel systems found among AA daughter languages, and on that basis greatly reduce the number of problem vowel correspondences that plagued Shorto and earlier investigators. Reviewing Shorto’s correspondences, and integrating with newer branch level reconstructions, I propose the revised proto-vocalism below, which has several additional members, including two tentative additional diphthongs): */ i u iː uː e ə o eː əː oː ɛ a ɔ ɛː aː ɔː (ie) (uo) iə uə / pAA vowel inventory proposed by Sidwell

The pAA diphthongs are somewhat problematic; two are securely reconstructed, another two only tentatively. *iə, *uə are reasonably indicated, although it is likely that many of Shorto’s *iə, *uə forms are better reconstructed with *ɛː, *ɔː (see Sidwell 2009). On the other hand, Shorto’s *ɯə and *ai are not required, as the etyma in question can be explained variously by recourse to *aː, *ɛː and *iː. Strikingly some branches are reconstructed without diphthongs at all (e.g.: Vietic: Ferlus 1998; Headley 1985; Munda 1968; Pinnow 1959), and a couple with just one (*iə; Palaungic: Sidwell 2010; Khasian: Sidwell 2012, Pearic: Sidwell ms.). At the same time, it is apparently necessary for other branches to reconstruct three (Aslian: Phillips 2012), four (Nicobarese: Diffloth 1991) or even six (Katuic: Sidwell 2005) contrasting diphthongs. Yet it is difficult to establish regular correspondences between, in particular, the Nicobarese, Aslian and Katuic diphthongs, so the suggestion of *ie and *uo remains tentative. The remainder of this chapter briefly profiles the phonological reconstruction of each AA branch, except for Mangic, for which comparative analyses are yet not sufficiently developed. The chapter concludes with a Swadesh 100 list of branch-level proto lexicon (without Mangic, and with Nancowry forms standing in for pNicobarese), pAA forms from Shorto (2006), and revised pAA reconstructions by Sidwell.

254 3

sidwell and rau

Austroasiatic Branches

3.1 Aslian 3.1.1 Introduction The Aslian branch consists of perhaps ten or a dozen distinct languages in three or four sub-branches, with some showing considerable dialectal diversity. The total population of Aslian speakers is small, perhaps around 50,000, with some individual languages having hundreds or, in some cases, only dozens of speakers.16 Contemporary scholarship recognizes three subdivisions, North, Central and South Aslian (or Jehaic, Senoic and Semelaic, respectively), while the position of one language—Jah Hut—is ambiguous. For informative surveys see Benjamin (1976; 2011) and Matisoff (2003). Aslian languages are known for several typological quirks which are generally taken to be secondary within the branch. These include:

· · ·

a tendency for pre-stopping or post-stopping of final nasals, including complete mergers with the Cf stop series: a tendency to lose vowel length distinctions; the absence of implosive stops and a general tendency against tones or phonation contrasts (with an apparent exception of Mah Meri/Besisi, see Stevens et al. (2006)). Additionally, Aslian languages are known for the morphological phenomenon of coda-copying, also called ‘incopyfixation’ (Matisoff 2003), and they may be an especially important witness for the study of Austroasiatic morphology. It is a curious situation that there has been something of a reluctance to pursue comparative Aslian studies, despite the fact that they were among the first AA languages known to science, and substantial data compilations have been accessible for over a century. Nonetheless, there was early interest in the history of the branch; Schmidt (1901) offered a first substantial attempt at historical analysis, and Blagden (1906) presented a huge comparative Aslian vocabulary with etymological commentary. But there have always been substantial obstacles to progress with comparative studies. Benjamin spells out the situation:

16 Matisoff (2003, 10) compares population figures from various sources, and Benjamin (2011) gives some detailed recent figures. It would appear that Aslian populations have always been rather modest.

austroasiatic comparative-historical reconstruction

255

There are grounds for thinking that northern Peninsular Malaysia and the Isthmian regions of Thailand have witnessed not one, but three, layers of Mon-Khmer-speaking presence. The hundreds of place-names in the Aslian languages that are well-formed phonologically as Mon-Khmer words but which have no meaning in the present-day languages, suggests that there may have been a Mon-Khmer presence prior to the advent of Aslian. And there is evidence that Mon was present as the language of lowland civilisation in the Isthmian tracts and as far south as Perak and Kelantan until around 1200 CE. Khmer too seems to have left some traces in the same region, and even further south. In addition to their basic character as Mon-Khmer languages, the Aslian languages also contain lexical evidence of secondary contact with both the Mon and the Khmer languages specifically (Benjamin 1997: 105–112; Bauer 1992). Moreover, the Aslian languages also contain many words in their lexicon that are clearly of Austronesian, but not Malay, provenance (Benjamin 2009a). (There are many Malay loanwords too, of course.) Benjamin 2011: 14–15

Blagden, a dominating figure in the field during the first half of the 20th century, was especially scathing of the idea that the Aslian languages formed a family or were even directly Austroasiatic, despite having compiled an enormous comparative vocabulary incorporating many wider AA comparisons. Until recently, the most important contribution was Diffloth’s (1977) reconstruction of protoSemai, a Central Aslian dialect chain, presenting 284 etymologies/reconstructions. The author also mentions having collected an Aslian lexicon in excess of 26,000 items; it is regrettable that this resource remains unpublished/inaccessible. The tide of comparative Aslian studies only turned very recently with the PhD thesis of Timothy Phillips (2012) which presents a phonological and lexical reconstruction of pAslian, and the discussion that follows summarises and comments on Phillip’s results. 3.1.2 pAslian Reconstruction Phillips (2012, henceforth simply Phillips) posits the following syllable/word template for pAslian: monosyllabic: CV(C) sesquisyllabic: C(ə).CV(C) disyllabic: CV(C).CV(C)

256

sidwell and rau

Rather curiously, Phillips does not accept monosyllables with medial approximants, and generally posits a schwa vowel between a stop and an approximant without exception. This seems rather forced; although I did apply the same to proto-South Bahnaric (Sidwell 2000) under the influence of Ilia Peiros, I now consider this to have been a serious error. Phillips’ disyllables include the possibility of a phonemic vowel, and it is worth taking note of his discussion at this point: Furthermore, in the same paper Diffloth demonstrates that many of the non-schwa vowels in the penult cannot be traced to either a conditioning environment or to an inserted morpheme (productive or frozen). Therefore it is presumed that Aslian languages preserve some rather ancient phonemic vowels in the penult. These phonemic, pre-ultimate syllable vowels are /a, i, u/, with /a/ being the most frequent. [. . . .]. If these phonemic penultimate vowels existed in PMK, as seems likely, then iambic and monosyllabic pressures appear to have essentially eliminated them in most MK languages in due course. Iambic pressures have probably reduced many presyllable vowels to schwa even in Aslian over time. Phillips 2012: 36

A significant problem with this analysis is that the forms in which these /a, i, u/ occur do not have evident AA etymologies, except for a modest number of etyma with /a/, which appears to be harmonising with a long /aː/ in the stressed syllable (e.g. pAslian *mahaːm ‘blood’ etc.). Thus I suggest that it is more likely that the Aslian prefinal syllable vowel distinctions are secondary features, innovative to Aslian. Notwithstanding the above quibbles, Phillips’ pAslian syllable/word template is readily reconciled with our provisional pAA template, presenting no fundamental difficulties. The following pAslian segmental inventories are reconstructed by Phillips (2012): Main syllable onset consonants * p t c k ʔ b d ɟ g m n ɲ ŋ w r, l j s h

austroasiatic comparative-historical reconstruction

257

Main syllable coda consonants * p t c k ʔ m n ɲ ŋ w r, l j s h Minor syllable onset consonants * p t s c k ʔ b d ɟ g r, l h Cm Consonants * w r, l j

h

Main syllable vowels * i ɨ u ə ɛ a ɔ

iː ɨː ɯː uː iɛ uə eː oː ua ɛː aː ɔː

Minor syllable vowels * i u ə a

Several important phonological features are evident:

· · · ·

The pAA pattern of not permitting open syllables is continued in pAslian. Among the voiced stops, only the palatal takes the r/N rhyme (where N is the unmarked nasal that assimilates to the place of the next segment). The ʔN minor syllable occurs before oral stops, while occurs ʔ(ə) elsewhere. The frequency of voiced stops in minor syllables is increased by a general tendency to dissimilate voicing (i.e. become voiced before a voiceless consonant, e.g. ‘bone’ *ɟəʔaːŋ < pAA *cʔaːŋ). Generally the consonant correspondences are quite trivial and the reconstruction is straightforward. The group is generally marked by loss of implosion in stops (having merged with the plain voiced series without registrogenesis or tonogenesis) so there are only two stop series, and changes among sonorants are

258

sidwell and rau

minimal, aside from the pre-/post-stopping of final nasals (see Matisoff 2003) which is entirely secondary and poses no difficulties for the reconstruction. The pAslian vocalism, on the other hand, is not a trivial problem, and may prove to be very important for pAA. Diffloth (1991: 15), somewhat obscurely in a paper about Palaungic vocalism, remarked that four diphthongs could be reconstructed for pAslian, in parallel with Nicobarese, although gave no details. Phillips appears to have done a very thorough job of investigating pAslian vocalism, although his analysis is distributed through the text over many tables, making it difficult to follow. I have taken the liberty of consolidating his vowel correspondences into a single table, which is presented below. First of all it is important to note that the subscripted correspondences do not indicate additional phonemes, but multiple correspondences supporting individual proto-phonemes; the specific environments/distributions are discussed by Phillips. Note that the correspondences are divided into blocks corresponding to received sub-branches, and this matches the phonological patterns neatly. The Central Aslian tongues preserve the long/short distinction, mostly reflecting the pAslian diphthongs as long monophthongs. Jah Hut, which appears to stand apart from the three sub-branches, has very distinctive reflexes of the diphthongs, and appears to be important for confirming their reconstruction on internal grounds. Strikingly, the reconstruction is asymmetrical, positing two back diphthongs but only one front diphthong, apparently disconfirming Diffloth’s earlier speculation. Northern

Jah Hut

i i ə ɛ ɛ ə ɨ u a

iː iː eː eː ɛː ɛː ɨː ɤː aː

iː iː eː eː ɛː ɛː ɨː ? aː

iː iː eː eː ɛː ɛː ɨː ? aː

iː iː eː eː ɛː ɛː ɨː ? aː

Mah Meri

Chewong

i ɨ ə ɛ? ɛ/a o ɨ ɨ a

Temoq

Kensiw/Kintaq

i ɨ ɨ i ɛ o ɨ ɨ a

Semelai

Tonga-Mos

i ɨ ɨ i ɛ o ɨ ɨ a

Semaq Beri

Mintil

i ɨ ɨ i ɛ o ɨ ɨ a

Semnam

Minriq

i ɨ ɨ i ɛ o ɨ ɨ a

Lanoh

Jahai

i ɨ ɨ i ɛ o ɨ ɨ a

Temiar

Batek

i ɨ ɨ i ɛ o ɨ ɨ a

Southern

Semai

PAslian

*iː₁ *iː₂ *eː₁ *eː₂ *ɛː₁ *ɛː₂ *ɨː *ɯː *aː₁

Central

i/e i/e i/e i/e ? ? ? ? e e e e ɛ? ɛ? ɛ? ɛ? ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɛ ɨ ɨ o ɨ u u u u a a a a

259

austroasiatic comparative-historical reconstruction Northern

Jah Hut

Central

Southern

Minriq

Mintil

Tonga-Mos

Kensiw/Kintaq

Chewong

Semai

Temiar

Lanoh

Semnam

Semaq Beri

i/e ? u o ɔ ɨ ɔ ɔ

i/e ? u o ɔ ɨ ɔ ɔ

i/e ? u o ɔ ɨ ɔ ɔ

i/e ? u o ɔ ɨ ɔ ɔ

e ? u o ɔ ɨ ɔ ɔ

a a? o/ɔ o ɔ o ɔ ɔ

aː aː uː uː oː oː ɔː ɔː

aː aː uː uː oː oː ɔː ɔː

aː aː uː uː oː oː ɔː ɔː

aː aː uː uː oː oː ɔː ɔː

a ə o o? u u ɔ u

a a a ə ə ə o o o o? o? o? u u u u u u ɔ ɔ ɔ u u u

*i *ɛ *ɨ *ə *a₁ *a₂ *a3 *u *ɔ

i ɛ ɨ ə a ɛ ? u ɔ

i ɛ ɨ ə a ɛ ? u ɔ

i ɛ ɨ ə a ɛ ? u ɔ

i ɛ ɨ ə a ɛ ? u ɔ

i ɛ ɨ ə a ɛ ? u ɔ

i ɛ ɨ ə a ɛ ? u ɔ

i ɛ/a ɨ ə a ɛ ? u ɔ

i ɛ ɨ ə ? a/ɛ? a? o/ɔ ɔ

i ɛ ɨ ə a a a u ɔ

i ɛ ɨ ə a a a u ɔ

i ɛ ɨ ə a a a u ɔ

i ɛ ɨ ə a a a u ɔ

i ɛ ɨ ə a a/ɒ ə ɔ ɔ

i ɛ ɨ ə a ɒ ə ɔ ɔ

i ɛ o ɔ a ɒ ə ɔ ɔ

i ɛ ɨ ə a ɛ ə ɔ ɔ

*iɛ

ɛ

ɛ

?

iɛ

iɛ

ɛ

ɛ/iɛ

iɛː

?

ɛ

ɛ

ɛ

e

*ua

a

a

a

a

a

a

ɔː

ɔː

ieː/ ɛː ɔː

o

o

o

o

wɔː wɔː

? ?

oː oː

ɔ ɔ

ɔ ɔ

ɔ ɔ

ɔ ɔ

*uə a/e? *uəN ɛ

ə ɛ

a

wɔ / wa oː / wɛ a/e? a/e? a/e? a/e? ia? wo oː ɛ ɛ ɛ iɛ iɛ o? oː

Also significant are the high vowels *ɨ, *ɨː and *ɯː, which appear to be well supported. It is striking that the etyma reconstructed with these long high central vowels typically have high rounded back vowel reflexes elsewhere among AA languages. The following examples are compared with Shorto (2006) reconstructions:

Mah Meri

Jahai

i/e ? u o ɔ ɨ ɔ ɔ

Temoq

Batek

i/e ? u o ɔ ɨ ɔ ɔ

Semelai

PAslian

*aː₂ *aː₃ *uː₁ *uː₂ *oː₁ *oː₂ *ɔː₁ *ɔː₂

260

sidwell and rau

Shorto (2006)

pAslian

Bahnar

Mon

Khmu

Katu

Palaung

*phoːm ‘to fart’ (#1433) *ɓu(ː)l ‘drunk’ (#1765) *ɗuŋ ‘house’ (#492) *ɗu(ː)m ‘ripe’ (#1362) *su(ː)c ‘to sting’ (#873) *puːt ‘to blow’ (#1023)

*pəhɨːm

phoːm

—

puːm

—

–

*bɯːl

—

həbu —

ɓɑl

–

*dɯːŋ — *ʔəndɯːm ɗuːm

— —

— —

ɗɔŋ ɗɑm

*sɯːc

soːc

—

huːc

ʔasɑc

– kəndum (‘bud’) –

*pɨːt

–

–

poːt – (Ksingmul)

put

The lack of central vowel reflexes elsewhere in AA is suggestive of an Aslian innovation of pAA *uː > pAslian *ɯː/*ɨː, and thus is a potential candidate as a common Aslian innovation. Alternately it may be reasonable evidence for a pAA high central vowel. Additionally, the pAslian morphological system was quite rich, and Phillips reconstructs several affixes, all of which are consistent with our pAA reconstruction:

· · · ·

nominalizer *(-)n agentive nominalizer *(-)m causative *pər iterative *(–)l– Plus coda-copying, already discussed under §2.1. A Swadesh 100 list of Phillips pAsian lexical reconstruction (adjusted to IPA notational conventions) is included at the end of this chapter. 3.2 Nicobarese 3.2.1 Introduction The Nicobarese branch is very small; with just a few thousands of speakers on an island chain in the Andaman Sea (the true speaker population is unknown since the Boxing Day Tsunami). Although geographically isolated from the Asian mainland, the islands may never have been effectively socially isolated as they lie on the sea route between India and China, and their mention in Ptolemy’s atlas (c. 150 A.D.) means that they have been known to Europe since

austroasiatic comparative-historical reconstruction

261

classical times. In the introduction to de Röepstroff (1875), the islanders are described as having extensive trade relations with the outside world and many were familiar with numerous outside languages, and a high degree of intelligibility between islands is mentioned. It is later in British colonial times and under Indian rule that Nicobar islands served as a penal settlement and became inaccessible to outsiders. Consequently the most important published linguistic descriptive and analytic work has been done with speakers on the Indian mainland. The most extensive sources for the present investigation of Nicobarese are the colonial era dictionaries and grammars such as Man (1889), de Röepstroff (1884), Temple (1902), Whitehead (1925), plus a couple of later studies: Crichfield-Braine (1970), Radhakrishnan (1981) and Das (1977) which are written in modern linguistic frameworks, although even the latter present special difficulties of interpretation. These and other available sources only give extensive data for two varieties, Car and Nancowry. However, there are good reasons to proceed with an attempt at a reconstruction based on these two languages; the documentation is extensive and quite useful, and the languages appear to represent geographical extremes of what is clearly a dialect chain. A comparative word list of 68 items for seven Nicobar dialects presented by Radhakrishnan at the 1978 International Conference on Austroasiatic Linguistics (Mysore) suggests only minor phonological and lexical differences across the chain, so with the exception of Shompen, for which there are no useful extensive sources, Car and Nancowry together seem to reasonably present as criterion languages for a comparative reconstruction. There are some sources for the enigmatic Shompen language (e.g. Rizvi 1990; Elangaiyan et al. 1995; Chattopadhyay & Mukhopadhyay 2003), spoken by a small population on the island of Great Nicobar, but these are very difficult to work with. Having looked exhaustively through the available Shompen materials (see Blench & Sidwell 2011), I can say that it certainly contains substantial Nicobarese lexicon, although it also shows some Aslian and Austroasiatic isoglosses that have not been found elsewhere in Nicobarese. The significance of these facts is not clear, and for the moment the assumption is maintained that Shom Pen is a heavily restructured Nicobarese language, but its data is not used to support the reconstruction. The reconstruction of pNicobar is also made somewhat problematic by a low rate of cognates between Nicobarese and other AA languages (commonly ascribed to word tabooing), averaging around 10% on the 100 word list by my counts. This can make it difficult to test the provisional historical phonology by external comparison—a vital part of the method. There is also large cohort of loans from many sources (including English, Portuguese, and Malay) which can be difficult to distinguish.

262

sidwell and rau

In respect of previous comparative-historical work, there is an eight page draft pNicobar reconstruction by Norman Zide and D.P.S. Dwarikesh (1962, copy obtained directly from Zide), and Harry Shorto made some notes on Nicobarese historical phonology (also unpublished). In 2003 I prepared notes for a lecture series at the Australian National University, and these were subsequently extended in 2012 to make a list of 150+ Nicobarese etymologies. On the basis of these notes I present the beginnings of a Nicobarese historical phonology. pNicobar phonology Generally the Nicobarese languages restructured phonologically, eliminating all consonant clusters within syllables, and developing suffixing, in addition to making extensive use of prefixes and infixes. This phonology also involves much more extensive morphophonemic changes/alternations that are commonly found among AA languages, but we will not dwell upon these here. We will focus instead on indentifying the reflexes of AA roots and their segmental phonology. Similarly, the languages are now characterised by verb-initial syntax, but this is not developed here. Rather, a basic framework for phonological reconstruction is briefly sketched. Based on the common structure of the witness languages, pNicobar lexical roots can be:

· ·

monosyllabic CV(C) (always with onsets assuming that glottal stop is phonemic), or disyllabic CV(n).CV(C), where -n- is apparently not a productive infix, although both -n- and -m- appear in that slot as infixes in derivatives. The segmental phonology as described for Nancowry (Radhakrishnan 1981) and Car (Crichfield-Braine 1970) is straightforward enough, except for the diphthongs. Nancowry is described as having stress (typically on the second syllable in a disyllabic root/stem) and stressed vowels are always long while unstressed vowels are always short, with the former always marked with an acute diacritic by Radhakrishnan. Five diphthongs are described: ía [iːə], úa [uːə], iá [iaː], uá [uaː], and ɯ́ a [ɯːa] and these frequently correspond to diphthongs elsewhere in AA, except for ɯ́ a, which regularly corresponds to a back monophthong in Car and other AA languages. On the other hand, Car is described as having phonemic long and short vowels, although the contrast is restricted to the stressed syllable of lexical roots (with suffixing the stressed syllables may be penultimate or antepentultimate

austroasiatic comparative-historical reconstruction

263

in the phonological word). Crichfield-Braine describes three diphthongs ɛ́ɔ [ɛɔ], ɛɔ́ [ɛɔː], and ɛá [ɛaː], although these are infrequent, tending to appear in descriptive vocabulary. Otherwise, long monophthongs in Car tend to correspond to diphthongs in Nancowry. Consequently, it appears that Car examples must be relied upon to reconstruct pNicobararese vowel length, while Nancowry is necessary to establish the proto-diphthongs. Aside from these considerations, syllable structure is common to both, and other segmental differences are minor. The most significant non-trivial patterns of correspondences are among palatals, rhotic and apicals. The following pNicobar consonant inventories can be reconstructed for stressed syllables: Ci Consonants * p t c k ʔ b dr ɟ g m n ɲ ŋ w l ɽ j f s h Cf Consonants * p t c k ʔ m n ɲ ŋ w l j s h

The first thing is to note about the consonants is the relative equivalence between the Ci and Cf inventories. This is consistent with the reduction in syllable structure and loss of iambicity at word level; as the asymmetry in the syllable was lost, so was the asymmetry in the segmental inventory. The initials of unstressed syllables present the same range of possibilities as stressed syllables, although there are differences in frequency/distribution that are yet to be fully worked through. AA voiced Ci oral stops were devoiced in pNicobar, except for *ɗ, which surfaces as [dr-] in Car (Crichfield-Braine’s 1970 r̃, described as pre-stopped), and in Nancowry as a segment that is variously written r and/or d in the sources, strongly suggesting a segment directly comparable to [dr-], so *dr is reconstructed for pNicobar (e.g. riák ‘water’ < *draːk < pAA *ɗaːk). The pAA Ci *r became [j] in Nancowry, while in Car it is reflected as a retroflex [ɽ] (written as a typescript r by Crichfield-Braine), reconstructed as pNicobar *ɽ and reflected in etymologies such as:

264

sidwell and rau

gloss

pAA (Shorto #)

Nancowry

Car

‘deep’ ‘fly’ ‘to overflow

*ɟruːʔ (#172) *ru(ə)j (#1504) *ruəc (#843 ‘drip’)

cijaːw juaːj juaːc-ŋa

ʔaɽuːʔ inɽuːj taɽeːc-i

Additionally, pAA *p became pNicobar *f while pAA *b, *ɓ, *d, *ɟ and *g simply became their plain voiceless stop equivalents. pAA *c became pNicobar *s, while pAA *s became pNicobar *h. Consequently, the pattern of mergers is different at each place of articulation rather than simply according to natural classes. Selected examples follow: gloss

pAA (Shorto #)

Nancowry

Car

‘four’ ‘to tie, bind’ ‘to carry on back’ ‘breast’ ‘wall’ ‘water’ ‘to perch, alight’ ‘sour’ ‘child’

*puən (#1166) *bək (#357) *ɓɔʔ (#121) *tɔh (#1999) *bdaŋ (#580) *ɗaːk (#274) *ɗ(uː)n (#1158) *ɟuʔ (#50) *ku(ə)n (#1127)

fuːən poːk pɯːəʔ tuaːh taːŋ riːək rɯːən pacaːw kuaːn/koːn

feːn pɔk ka-poː-tə (‘carried away’) tɛh tanaŋ waːk (doubtful cognate) drɔːn kuːən

Finally a remark about affixation: Nicobarese languages also show an important morphological alternation in the causative prefix/infix. It is realised with two allomorphs, prefix ha- with monosyllabic stems, and infix -um- in disyllabic stems. Examples:

Nancowry

Car

ŋok ‘to eat’ / haŋok ‘to feed’ cim ‘to cry’ / hacim ‘to make someone cry’ lapəʔ ‘pretty’ / lumpəʔ ‘to make someone pretty’ karuʔ ‘large’ / kumdruʔ ‘to enlarge’

ɲaː ‘to eat’ / haɲaː ‘to feed’ pɯɲ ‘to cry’ / hapɯɲ-ɲɔː ‘to make cry’ kucik ‘be palatable’ / kumcik ‘to taste’ kale ‘brave’ / kumle ‘bravery’

austroasiatic comparative-historical reconstruction

265

The initial /h/ is the regular outcome of pAA *p in Ci in unstressed syllables. However, it is not clear whether ha-/-um- originates in conditioned reflexes of one morpheme, or a merger of two different but similar morphemes. If the former, it may hold a clue to the problem of the possible relationship between nasal infixes and implosive stops in AA mentioned under §2.3.1.1. The pNicobar vocalism is more problematic and still requires detailed working-out. Without having a full set of regular correspondences compiled and analysed, but relying on the apparent cognates and taking into account typological considerations, it seems appropriate to posit the following pNicobar vowel inventory as a provisional model: Vowels * i u e ə o ɛ a ɔ

iː ɯː uː iːə (ɯːə) uːə eː əː oː iaː uaː ɛː aː ɔː

At the 18th Southeast Asian Linguistics Society meeting (Bangi Melasia, May 22nd 2008) Gerard Diffloth presented a handout listing seven etymologies indicating regular correspondences between the Nancowry /uːə, uaː/ diphthongs and pAslian /*uə, *uɔ/ respectively, equivalent to Phillips (2012) pAslian /*uə, *ua/. Thus there would seem to be external support for pNicobar *uːə, *uaː and by analogy *iːə, *iaː, but pNicobar *ɯːə is more doubtful and is left in parentheses for now. Regretfully, a list of pNicobar is not included in the comparative 100 word lexicon at the end of this chapter. 3.3 Monic 3.3.1 Introduction The Monic Branch effectively consists of two languages, Mon and Nyah Kur, which are readily reconstructable as descendents of pMonic; apparently spoken in central Thailand in the First Millennium CE, and probably the vernacular language of the Dvaravati civilization that preceded the Siamese in that region between the 6th and 13th centuries.17 Today the Mon centre of gravity is in Myanmar, where perhaps ¾ of a million speakers still use the language, and there is also a sizable population of Mons in Thailand (the Ethnologue cites a speaker population of 851,000 for both countries at 2004). By contrast 17

We may speculate that Dvaratvati (and the predecessor of Angkor, Chen La) emerged from the break-up of Funan, a presumably ethnically diverse polity, which according to Chinese sources unified an area that spread from the Mekong Delta to the Malay Peninsula (Wheatley 1961, Vickery 2003).

266

sidwell and rau

Nyah Kur is a critically endangered language spoken in central Thailand in the hills of the Mekong and Menam Chao Phraya watershed (the Ethnologue cites 1,500 speakers and decreasing at 2006). Despite the lack of overall diversity within the branch, Mon has a long written history that dates back some 1500 years, with the oldest layer (Old Mon) indicating retention of important AA phonological features. Old Mon has received much attention from philological/epigraphic studies, and played an important role in Schmidt’s foundational work (1905; 1906 etc.) and later AA comparative studies. After much pioneering work on Mon inscriptions by figures such as Otto Blagden, George Coedes and Gordon Luce, a combination of comparative and philological studies in the period of the 1960s to 1980s filled out a reconstruction of Monic linguistic history that is reflected substantially in the following works:

·

Shorto (1971): this dictionary of the Mon inscriptions remains the principle consolidated source for Old Mon (although its coverage could be extended significantly with contemporary data) and includes extensive etymological commentary (it is really the foundation upon which Shorto 2006 was build). Ferlus (1983): this 90-page paper uses both comparative and philological methods to reconstruct the phonetic history of Mon and offers a phonological reconstruction of pMonic (proto môn in his terminology). Diffloth (1984): this monograph-length study relies mostly on fieldwork data to assemble some 680 etymologies, and reconstruct pMonic and intermediate forms for pMon, pNyah Kur. Huffman (1990): although strictly a synchronic comparison, the phonological systems and lexicons of Burmese and Thai Mon and Nyah Kur are compared, including a well organised 800+ item comparative lexicon

· · ·

Below we discuss the reconstruction of pMonic, as revealed by Ferlus and Diffloth, emphasizing the commonalities in their results, and a slightly revised pMonic reconstruction is proposed. 3.3.2 pMonic Reconstruction Broadly speaking, pMonic phonology poses no serious difficulties. Although Modern spoken Mon is substantially restructured and has lost many older phonological distinctions (including loss of consonant voicing, vowel length contrast, dropping medial sonorants in many places, and an almost complete merger of minor syllables to /h/), Old and Literary Mon forms preceded most of these changes and largely confirm the comparative reconstruction. Nyah

austroasiatic comparative-historical reconstruction

267

Kur, although also restructured, preserves the word final and minor syllable consonants more faithfully and still contrasts vowel length. Thus direct comparison of the two languages yields a reliable pMonic reconstruction, and this is confirmed by the fact that two independent historical reconstructions conducted at about the same time (Ferlus 1983; Diffloth 1984) yielded results that are not drastically discordant.18 Ferlus and Diffloth are in accord in reconstructing the following pMonic word structure: monosyllables disyllables *Ci(Cm)VCf *(Cp(Cson))CiVCf

Note that there are no vowel final lexical words. Ferlus identifies the monosyllables formed with medials (Cm) as quasi-monosyllables, although this seems an unnecessary distinction. Diffloth’s taxonomy divides words into one, two and three consonant onsets; and reflecting his concern for phonetic reality, he suggests that three consonant onsets—having a sonorant in the second position—may have been syllabified with an epenthetic schwa. The main difference between the Ferlus and Diffloth models is in the protovocalism, and both their proto-vowel inventories are given below, followed by discussion. Ci consonants * p t c k ʔ b d ɟ g ɓ ɗ ɟ g m n ɲ ŋ w r, l j s h Cm consonants * w r l j h

18 This level of concord in AA branch level reconstruction is not as frequent as one might hope. For example, in respect of pKatuic, the recontructions of Thomas (1967), Efimov (1983) and Peiros (1996) while broadly consistent with each other, are utterly discordant with the works produced by, for example, Diffloth (1982), L-Thongkhum (2001), Sidwell (2005).

268

sidwell and rau

Cp consonants * p t c k ʔ b d ɟ g l s Cson Consonants * m N r l Cf consonants * p t c k ʔ m n ɲ ŋ w r, l j s h Vowels (Ferlus) * i ɨ u e/ɛ ə o a ɔ Vowels (Diffloth) * i ɯ u e ə ɛ a ɔ

iː ɨː uː uo ɛː/ia aː ɔː/ua

iː ɯː uː eː əː oː ɛː aː ɔː iə

The differences in proto-vocalism are readily reconciled: Ferlus’s *uo and Diffloth’s *oː are the same phoneme; for the reconstruction Diffloth prefers the model of the Nyah Kur reflex, while Ferlus points out that Old Mon reflexes are written with the ū glyph. Diffloth distinguishes mid and low front vowels and *iə, making five distinctions amongst these, while Ferlus does not posit a contrast between short /e/ and /ɛ/, or between long /ɛː/ and diphthong /ia/, and does not reconstruct a long *eː. Ferlus’ ambiguous pairs *e/ɛ and *ɛː/ia reflect a lack of contrasting environments:

· ·

*e occurs before palatals and *w, ɛ elsewhere, although both are infrequent; and *ɛː occurs before glottal stop and *w, while *ia occurs elsewhere.

austroasiatic comparative-historical reconstruction

269

Diffloth only reconstructs five examples of *eː, two of these correspond to Ferlus’ *iː, while the others are ambiguous or appear to belong to the same correspondence as Ferlus’ *iː. Ferlus’ *o is probably unsafe; all but one of his examples corresponds to Diffloth’s *ə before *w, and the remainder is one root *poh ‘to propel, flick’ (Old Mon poh ‘to shoot with pellet bow’) for which Shorto (2006, entry #2024) suggests a long proto-vowel. Paralleling the front diphthong, Ferlus does not contrast *ɔː/ua; while he generally posits *ua in proto-forms, he allows for the phonetic difference at pMonic. While the loss of long/short contrast in Mon creates multiple correspondences, the broad pattern in Nyah Kur is that monophthong reflexes are found before velars and glottals while diphthonged reflexes occur elsewhere, and this complementary distribution prevailed in pMonic. In this case, Diffloth takes the phonemic approach and reconstructs a monophthong *ɔː for all cases of this correspondence, while Ferlus favours *ua. Some examples illustrate the vowel reconstructions discussed above: Gloss ‘river’ ‘rope’ ‘to sew’ ‘spicy’ ‘nest’ ‘hair’ ‘varan’ ‘firewood’ ‘to return’ ‘hand’ ‘tooth’ ‘that’

Ferlus Diffloth. OM *kruoŋ *kroːŋ kruṅ *ɟuok *ɟoːk juk *ɟiːŋ *ɟeːŋ jiṅ *riːŋ *reːŋ riṅ* *kmpuaj *cmpɔːj sapoy* *suak *sɔːk sok *tkuat *trkɔːt darkot *ʔuas *ʔɔːs ʼoh *cow *cəw caw *tej *təj tey *ŋiak *ŋiək ṅek *tɛːʔ *tɛːʔ te’

MonRao NKCentral krə́ŋ króːŋ cə̀k chòːk còiŋ chìːŋ ròiŋ rìːŋ həpúa mpúaj sók chɔ́ ːk həkót trkúat ʔúh ʔúaj̊ cáo cɔ́w tóa tɛ́j ŋɛ̀ak ŋìak téʔ tɛ́ːʔ

* Literary Mon

Considering the above, we can offer a moderately revised pMonic vocalism as follows: Revised pMonic vowels * i ɨ u iː ɨː ə əː ɛ a ɔ ɛː/ia aː

uː oː ɔː/ua

270

sidwell and rau

The short vowels *e and *o are not required, as they can generally be explained as conditioned reflexes of *ə. Ferlus’ *uo is merged with Diffloth’s *oː, while Diffloth’s *eː generally reflects *iː. Diphthongs *ia and *ua are in complementary distribution with *ɛː and *ɔː respectively, and I suggest that monophthong and diphthong allophones can be reconstructed to provide a more phonetically realistic proto-phonology. On this basis, we have compiled the Swadesh 100 list of pMonic lexicon at the end of this chapter. The forms are taken directly from Diffloth’s (1984) reconstruction, with vocalism adjusted to reflect the revision proposed above. While both Diffloth and Ferlus reconstruct the same consonant inventories, Diffloth reconstructs specific consonant values that account for more diverse reflex forms, based mainly on his use of extensive Nyah Kur field data. 3.4 Khmeric 3.4.1 Introduction The Khmeric branch is represented by a single language, Khmer (Cambodian), which shows three main dialects: Standard Khmer, Northern or Surin Khmer, and Western or Cardamom Khmer. The written standard has been used extensively in comparative works, especially since Schmidt, and like Mon, the older inscriptional record (which goes back to the 7th century CE) gives important clues as to the historical phonology. Through the Middle Khmer period (14th to 19th centuries CE) the language underwent extensive phonological restructuring, so that today there is a gross disparity between written and spoken Khmer. The contemporary language has also been elaborated with extensive borrowings, and new formations including hypercorrections and pseudo-archaisms, so one needs to apply special care when invoking Khmer data in comparative reconstruction. It is apparent that all the known dialects derive from Khmer of the Angkorian period (9th to 14th centuries CE) or later, and are younger than the oldest written sources, which is known as preAngkorian Old Khmer (7th and 8th centuries CE). However, preAngkorian is known only from a few hundred (mostly brief) inscriptions, and appears to show a rather special register—an administrative or court language—which is very specialised in lexicon and restricted in syntax,19 so it is only partly representative of vernacular speech of the preAngkorian populace. Furthermore, there is a degree of standardisation in the preAngkorian corpus that strongly suggests the language had already been written for some time, and this further implies that the vernacular base 19 For example, there are no second person pronouns attested at all in the entire Old Khmer corpus.

austroasiatic comparative-historical reconstruction

271

of preAngkorian, what we would call pKhmeric, was a living language for some time previously, around 500 AD. To address these issues and reconstruct pKhmeric therefore requires a combination of epigraphic analysis, comparative reconstruction, and consideration of the phonology and semantics of many loan words that entered the language at various stages, evidently from Mon, Tai and other sources. This is the approach taken by Ferlus (1992b), whose reconstruction of pKhmeric (proto khmer in his terminology) is outlined here. The phonology of Old Khmer, and its correspondence with written forms, is discussed briefly in dictionaries of Old Khmer, such as Long Seam (2000), Pou (1992, revised 2004), and Jenner (1980–86; 2009). There is a recent grammar of Old Khmer (Jenner and Sidwell 2010) and an internal reconstruction of Khmer morphology (Jenner and Pou 1980–81). The development of Khmer phonology from Old Khmer to the present is discussed in various works, including: Jenner (1974), Jacob (1976), Huffman (1978), Ferlus (1979; 1992c), Pinnow (1979) and others. 3.4.2 pKhmeric Reconstruction The syllable and word structure of pKhmeric is assumed to be the same as apparent for Old Khmer (in fact quite similar to Modern Khmer) and the latter is discussed in various works, such as Jacob (1960; 1993), Jenner (2009a,b), and Jenner & Sidwell (2010). Words of the native lexicon are conventionally analysed as either monosyllables, disyllables, or sub-disyllables (a term approximating ‘sesquisyllables’), as follows: monosyllables sub-disyllables *Ci(Cm)V(Cf) *Cp(ə/h)Ci(Cm)V(Cf)

disyllables *CpəN.Ci(Cm)V(Cf) *Cprə.Ci(Cm)V(Cf)

Words are iambic, and (sub-)disyllabic words are frequently and transparently formed by affixation. Syllabification can be accompanied with a schwa juncture or aspiration, consequently phonetically aspirated sequences could arise by juncture/affixation, but there was no distinct aspirated series as such. In the transition from pAA to pre-pKhmeric several consonant changes took place:

· ·

there was a general loss of final glottal stops, allowing words to be vowel final; the pAA implosives merged with the voiced stop series, creating a single voiced series.

272

sidwell and rau

Later, apparently during the Angkorian period, there were further consonant changes:

· · ·

prevocalic labial and apical voiceless stops became implosive, reconstituting an implosive series; final /s/ and /h/ began merging to /h/ (this may actually have occurred later); there was a period of confusion between /b/ and /ʋ/, possibly realised as a [β] allophone. The inventory of short vowels is strikingly small, as a number of mergers occurred in pre-pKhmeric, including reduction of some long vowels; at the same time there is evidence for four distinct diphthongs, based on epigraphic and loan word interpretation. The following pKhmer segmental inventories are reconstructed: Ci consonants * p t c k ʔ b d ɟ g m n ɲ ŋ ʋ r, l j s h Cm consonants * w r l j Cp consonants * p t c k ʔ b d ɟ g r, l s Cf consonants * p t c k ʔ m n ɲ ŋ w r, l j s h Vowels * i ɨ u ə a ɔ

iː ɨː uː ie ɨə uo eː əː oː ɛː aː ɔː ia ua

austroasiatic comparative-historical reconstruction

273

Some examples illustrating the sound changes discussed above are provided: Gloss pKhmeric preAngkorian Angkorian Mod Khmer reduction of ‘water’ *dɨk dik dik tɨk long vowels: ‘wind’ *kjəl kyol~kyel khyal kʰjal ‘bone’ *cʔəŋ - - c ǝʔɤŋ

pAA (Shorto#) *ɗaːk #274 *kjaːl #1782 *cʔaːŋ #488

reduction of ‘two’ *bier ver~ber vyar~ piː, pɨl diphthongs: byar~bir ‘oil’ *prieŋ pareṅ paryyaṅ~ preːŋ pareṅ ‘candle’ *dian den dy:an~dyan tiːən ‘to learn’ *rian ren ryyan~ riːən ryy:an~ry:an ‘boat’ *duok dok~dvak dvak tuːk ‘to offer’ *ɟuon jon~jvan jvan cuːn ‘four’ *puan pon pvan~pv:an ɓuːən ‘group’ *buak - vvak~ puːək vok~bok

*ɓaːr #1562

final glottal loss:

‘hand’ ‘day’

*taj *tŋaj

tai tai ɗaj tṅai~tṅaiy - tʰŋaj

Loss of ‘sugarcane’ *ʔmbaw - ʼaṃvau ʔampɤw implosion: ‘to cover’ *dap - dap tʊp

*pri(ə)ŋ #702 *riən #1192 *ɗuːk #336 *ɟuːn #1148 *puən[] #1166 *buək #362

*tiːʔ #66 *tŋiːʔ #31 *(t)ɓuːʔ #125 *ɗəp #1261

Note: preAngkorian and Angkorian inscriptional forms in bold-italic.

A Swadesh 100 list of pKhmeric lexical reconstruction is included at the end of this chapter. Note that they are not taken directly from Ferlus (1992c) but assembled by Sidwell on the basis of the sound laws that Ferlus has reconstructed.

274

sidwell and rau

3.5 Bahnaric 3.5.1 Introduction The Bahnaric is one of the more extensive and diverse branches of AA, with more than 30 distinct languages spread over five coordinate sub-branches; it is spoken by communities spread over substantial areas of central and southern Vietnam and neighbouring areas of Laos and Cambodia. As a witness to the reconstruction of pAA, Bahnaric is crucial as it includes various languages with unrestructured vocalism and minimal changes to consonants, in addition to various highly innovative languages. It is also important that Bahnaric subgroups fall over different historical language contact areas (variously in contact with Chamic, Khmer, Katuic etc.) and this provides insight into which lexical items and structural features are variously archaic or more recently acquired. French colonial officials and missionaries began working with and documenting the languages of Bahnaric-speaking peoples from the 1860s. A century later American researchers, especially individuals associated with SIL, began documentation and comparative work, and by the 1970s various important studies had appeared in print. From the late 1980s other scholars joined the effort, building eventually to a very useful body of scholarly work. These include:

· · · · · · · ·

Blood (1966) “A Reconstruction of Proto-Mnong (Including Tentative Reconstruction of Proto-South-Bahnaric” (MA thesis), Thomas & Smith (1967) “Proto-Jeh-Halang”, Smith (1967) “A Phonological Reconstruction of Proto Central North Bahnaric”, Efimov (1990) “Istoricheskaja Fonologija Juzhnobaxnaricheskix Jazykov” (Historical phonology of South Bahnaric languages), Sidwell (2000) “Proto-South-Bahnaric: a reconstruction of a Mon-Khmer language of Indo-China”, Jacq & Sidwell (2000) “A Comparative West Bahnaric Dictionary”, Theraphan L-Thongkum (2001) “Languages of the Tribes in Xekong Province Southern Laos”, Sidwell & Jacq (2003) “A Handbook of Comparative Bahnaric: Volume 1, West Bahnaric”. An annotated bibliography of Bahnaric linguistics was recently published (Cheeseman et al. 2013). Not published is Sidwell’s PhD thesis “A Reconstruction of Proto-Bahnaric” completed in 1998, and an updated version of my pBahnaric reconstruction (2011a) has been made available online at

austroasiatic comparative-historical reconstruction

275

sealang.net/monkhmer. The latter consolidates various data sources and subbranch level reconstructions done since 1998, and posits 1011 pBahnaric forms, plus sub-branch reconstructions that adds approximately 800 etymologies. The text that follows summarizes the findings of this reconstruction. 3.5.2 pBahnaric Phonology The pBahnaric syllable and word templates are reconstructed as being almost the same as posited pAA, with the addition of several extra members within minor syllable rhymes: monosyllables sesquisyllables/disyllables *Ci(Cm)VCf *(Cp(N/m/r/l/h/ə))CiVCf

The inventory of sounds is somewhat richer than pAA, including preglottalized sonorants (originating with old cluster reductions), plus an apical affricate that is assumed to originate as an old loan phoneme (paralleled in Vietic, although both may be borrowed from a third unknown source). The pBahnaric segmental inventories are as follows: Ci Consonants * p t ʦ c k ʔ b d ɟ g ɓ ɗ ʄ m n ɲ ŋ ʔm ʔn ʔɲ ʔŋ w r l j ʔw ʔl ʔj s h Cf Consonants * p t m n w r, l s

c k ʔ ɲ ŋ j h

Cp Consonants * p t c k ʔ b d ɟ g s h

276

sidwell and rau

Cm Consonants * w r l j h Main syllable vowels * i ɨ u iː ɨː uː iə uə e ə o eː əː oː ɛ a ɔ ɛː aː ɔː

Phonotactic notes:

· · · ·

Diphthongs are restricted to occurring in closed syllables, and only with heterorganic finals (*iə not with palatals, *uə not with labials or velars). Minor syllable *ʔ was vocalized to [ʔə]. A phonemic schwa could occur in disyllables before an approximant Ci. There was no length contrast in syllables with final *ʔ or *h and the vowels in such syllables are treated as short. Open main syllables are all long. The preglottalized sonorants (ʔm, etc.) are treated as unitary segments for the purposes of syllable structure despite the digraph notation. They are mostly reconstructed on the basis of reflexes in North Bahnaric. 3.5.3 pBahnaric Phonological Innovations For pBahnaric I reconstruct short and long high-central vowels *ɨ, *ɨː; these are mostly raised reflexes of pAA *a, *aː; e.g.: pBahnaric *ʔmɨː ‘rain’ *ɲɨːm ‘weep’ *prɨːt ‘banana’ *-mɨt ‘gall, bile’ *bnɨm ‘mountain’

cf. Khmu Cuang kmaʔ cf. Old Mon yam cf. Old Mon brat cf. Vietnamese mật cf. Old Khmer vnaṁ

A similar raising of central vowels also happened later within West Bahnaric (the conditioning is not clear, but seems to involve preceding approximants), e.g.: pBahnaric *kjaːl ‘wind, air’ *hraːj ‘maggot’ *klaː ‘tiger’

pWBahnaric Laven Bahnar *kəjɨːl kəjɨəl kjaːl *srɨːj srɨəj hraːj *klɨː klɨə klaː

austroasiatic comparative-historical reconstruction

277

Also, an affricate *ʦ reconstructed to account for a curious correspondence of /s ~ ʦ/ in North Bahnaric to /t/ elsewhere in the branch. The origin of *ʦ is obscure—Shorto (2006) mistakenly (in my view) included it in his *t2 correspondence set. Ferlus (2007) reconstructs doublets such as *ɟaːŋ / *tʃɨəŋ ‘bone’ for pVietic, which is at least suggestive of an old borrowed stratum into pBahnaric. Some examples: pBahnaric *-ʦaŋ ‘bitter’ *-ʦoːɲ ‘fern’ *pʦul ‘anthill’ *kʦɨːŋ ‘bone’

Bahnar Rengao Sedang taŋ ʦaŋ sa̰ ŋ ‘salty’ kasún - kətɔːɲ pətɔl bəʦṳl pətɔl kətiːŋ kəʦe̤ːŋ kəsiəŋ

The pBahnaric implosives *ɓ, *ɗ are reconstructed on their witness in Bahnar; it seems remarkable, but so far as I can tell only Bahnar preserves a contrast between these implosives and plain voiced stops; elsewhere they have merged with the plain voiced stops, or their subsequent voiceless reflexes where devoicing has occurred. Strikingly, implosives are recorded for Bahnar in most but not all etyma for which they would be expected on the basis of occurrences in, for example, Old Mon or Katu. However, Bahnar has experienced substantial contact and borrowing from North Bahnaric languages such as Rengao and Sedang, and so some irregularity in correspondences is understandable. Implosives, although lost elsewhere in the branch, have re-emerged in South Bahnaric languages such as in Sre, apparently from hardening of glottal-nasal sequences. At the sub-branch level, North Bahnaric stands out as phonologically innovative, developing tense/lax registers; for example see Smith’s (1972) comparative North Bahnaric reconstruction. Oddly, there was no general devoicing in North Bahnaric, and the registers do not correlate with historical voice quality of the initial stops. Instead, the explanation seems to be related to a reorganisation of vowel timbre differences, similar to what also seems to have occurred in the Katuic language Pacoh (see Diffloth 1982). The other Bahnaric subbranches are mostly unrestructured, generally just showing some mergers in their segmental inventories, although some individual languages have undergone their own restructuring. For example, within West Bahnaric, Nyaheun restructured all sesquisyllables into *Ci(Cm)VCf type monosyllables, and Lawi went through conventional stop devoicing and vowel splitting (see Sidwell and Jacq 2003). A Swadesh 100 list of pBahnaric lexical reconstruction is included at the end of this chapter.

278

sidwell and rau

3.6 Katuic 3.6.1 Introduction The Katuic languages are among the best documented and most thoroughly analysed from the perspective of comparative AA studies. These studies reveal a typologically diverse group that includes a range of historical outcomes, from perhaps the most conservative AA of all (Katu) to highly innovative tongues that include among the most complex vowel systems in the world. The more innovative Katuic languages (Bru and Kui) are spoken by communities distributed over a large area of NE Thailand, NE Cambodia, southern Laos and border-lands of Vietnam; at the same time the greatest diversity of Katuic languages lies in a relatively small region about the Salavan and Sekong provinces of Laos and adjacent areas of Vietnam. From a comparativehistorical point of view, Katuic gives us insights into ancient AA features plus an important understanding of the typology of phonological changes that have occurred in AA. We can now say that the history of the group is quite well understood, and the proto-lexicon is extensively documented. The most important works are:

· ·

Thomas (1967/1976): The first attempt at a Katuic comparative reconstruction. Diffloth (1982): An excellent analytical essay of the problems of Katuic historical phonology, focuses on the different paths of register development vis-à-vis West Katuic and Pacoh; offers a summary reconstruction with 138 etymologies. Efimov (1983): Extends and revises Thomas (1967), although some Khmuic data is mistakenly included as Katuic. Gainey (1985): a comparative phonology of Kui, Bru and So. Shorto (nd., ms.): in the early 1980s Shorto compared Bru, Kui, Pacoh and Katu, creating a total of 794 Katuic etymologies and reconstructions.20 Peiros (1996): offers a comparative lexicon of 1241 cognate sets, based on a modified version of Thomas’ (1967) historical phonology. Theraphan L-Thongkum (2001): extends our knowledge of the Katuic family with a substantial comparative lexicon for Kantu, Ta’Oi, Kriang, Chatong, Triw and Dakkang. There are 1406 comparisons, reflecting more than 1300 distinct etymologies, although the historical phonological analysis is somewhat unsystematic.

· · · · ·

20 The ms. is held by Sidwell, copies available on request.

austroasiatic comparative-historical reconstruction

279

·

Sidwell (2005): is a reconstruction based on 1395 Katuic etymologies, largely consistent with Diffloth (1982), but with a different analysis of the diphthongs. Special attention should go to Diffloth (1982), which really solved most of the important questions concerning the phonological history of Katuic. With reference to hardly more than 140 etymologies, Diffloth showed the secondary nature of the Bru and Kui register systems and restructured vowels, and the typologically divergent register system of Pacoh. Although some of the correspondences and sound laws he proposed are incomplete or subject to dispute, especially due to the brevity of the treatment, the analysis corrected significant earlier errors made by Thomas (1967/76), errors which were unfortunately retained and magnified in by Efimov (1983) and Peiros (1996). I fully acknowledge the analytical debt of gratitude to Diffloth, having taken on board most of his analysis in my later work. Subsequently, the discussion that follows is based largely on Sidwell (2005), which includes commentary and analysis of the works listed immediately above, except for the notes of Shorto, which were not known to the author at the time. 3.6.2 pKatuic Phonology The pKatuic sound system was quite similar to pAA, in terms of both phonotactics and available phonemes. The main differences lay in the presence of some additional members. The syllable and word templates are reconstructed as follows: monosyllables sesquisyllables/disyllables *Ci(Cm)VCf *(Cp(r/l/n/a))CiVCf

The rhymes of unstressed syllables included a phonemic vowel, symbolised here as *a, but the quality may have been variable or more like a schwa. Some languages, such as Pacoh, have developed contrastive vowel quality in minor syllables, and uses cliticised pronouns as case markers (see Alves 2006, but there is no indication that this is an old feature). The preconsonantal nasal generally assimilated to the place of the following segment, and when a nasal took the Cp position, it added an initial glottal stop to create a minor syllable with stop onset (e.g. *ʔɲcəj ‘head louse’, *ʔmpəw ‘dream’ etc.).

280

sidwell and rau

I reconstruct the following pKatuic segmental inventories: Ci Consonants * p t c k ʔ b d ɟ g ɓ ɗ ʄ m n ɲ ŋ w r, l j s h Cf Consonants * p t m n w r, l s

c k ʔ ɲ ŋ j h

Cp Consonants * p t c k ʔ b d ɟ g r, l, s h Cm Consonants * w r, l j h Main syllable vowels * i ɨ u iː ɨː uː e ə o eː əː oː ɛ a ɔ ɛː aː ɔː

iə ɨə uə ia ɨa ua

3.6.3 pKatuic Phonological Innovations The reconstruction largely models the Katu dialects as most closely reflecting the proto-language phonologically. However, it is not completely clear whether certain key facts reflect pKatuic innovations or archaisms: the pKatuic *ʄ, multiple levels of proto-diphthongs, and creaky voice/stopped finals that are found especially in some dialects of the Ta’oi group. A failing of my (2005) reconstruction of pKatuic was the inadequate treatment of the creaky voice problem, already mentioned at §2.3.3. The approach I took in 2005 was to effectively ignore the problem, assuming that it was a

austroasiatic comparative-historical reconstruction

281

local innovation of the Ta’oi sub-branch within Katuic. This assumption may well be correct, but is undemonstrated. My confidence that it is a secondary issue lies in the fact that there are no strong indications of reflexes of a pKatuic creaky voice across the Katuic branch generally, and the consideration that those languages which do reflect the feature (such as Ong, Talan, Katang and some others, although sources are somewhat confused) have all undergone a general devoicing of stops, which may be connected with the formation of creak (see Ferlus 1974; 1980). The other two aspects of the reconstruction that deserve further attention are less problematic, and will be discussed with special reference to the phonology of Katu, which is interpreted as being especially conservative. Costello’s (1971) multi-dialectal dictionary of Katu provides the following inventory of segments (adjusted to IPA notation following the author’s description): Katu consonants * pʰ tʰ kʰ p t c k ʔ b d ɟ g ɓ ɗ ʔj m n ɲ ŋ w r, l j s h Katu vowels * i ɨ u iː ɨː uː e ə o eː əː oː ɛ ʌ ɔ ɛː ʌ ː ɔː ɛ a ɑ aː ɑː

iə ɨə uə

The aspirated stops reflect sequences of stop + /h/, plus some cases of dialectal variants of the plain stop series; taking this into account the consonant inventory generally corresponds to pAA values, having not undergone a devoicing or other restructuring. The sound given as /ʔj/ by Costello is described by Wallace (1969: 65) as a “preglottalized voiced alveopalatal affricate”, which suggests phonetically [ʔʥ] or an underlying /ʄ/. If we table correspondences across Katuic for palatal stops the following patterning is found:

282

sidwell and rau

pKatuic Kui Bru Ta’oi Kriang Pacoh Katu *c c c c c c c *ɟ c~ch c c c c ɟ *ʄ ɟ~ʥ j ʔj~ɟ ʔj ʔj ʔj

This patterning closely parallels that for the labial and apical stops: pKatuic Kui Bru Ta’oi Kriang Pacoh Katu *p p p p p p p *b p~ph p p p p b *ɓ b b b b b ɓ~b *t t t t t t t *d t~th t t t t d *ɗ d d d d d ɗ~d

The correspondences make it clear that Katu is not actually needed to distinguish the implosive series in Katuic, since it is faithfully reflected by voiced stops generally, but Katu does indicate the reconstruction of the pKatuic plain voiced series, which merged with the voiceless series outside of Katu. The parallelism between the palatal and the labial and apical correspondences is compelling, and thus underlies the reconstruction of *ʄ. At a deeper level however both *ɟ and *ʄ regularly correspond to *ɟ in other AA branches. On the face of it this suggests a pAA distinction, but it does seem somewhat odd to just be represented in one branch. The Katuic diphthongs are more problematic; the reconstruction relies substantially, although not entirely, on distinctions present in Pacoh. The language has a tense/lax phonation contrast that is phonemic among the mid-level vowels, such that from another perspective, Pacoh has an extra degree of vowel aperture, as opposed to the three degrees typical across Katuic. Incidentally, Katu also has four degrees of vowel aperture, but it is a different problem, and is discussed further below. For Katu, Costello uses the notation ie/ia, ươ/ưa, uo/ua but these are just notational variants of iə, ɨə, uə based on Vietnamese orthographic practices. Correspondences for the pKatuic diphthongs, plus the high long monophthongs, are tabled below; note that Kui and Bru registers are not marked as they do not affect the pattern. Also the correspondences are consolidated rather than separated according to conditioning finals, so they are more systematic than they may appear here:

283

austroasiatic comparative-historical reconstruction pKatuic Kui Bru *ɨː ɨː/əː ɨː *ɨə ua/ɔa ɨa *ɨa iː/ia ia

Ta’oi ɨː ɨa iː

Kriang Pacoh ɨː ɨː ɨa əː iː/eː/ɨː/ɨa uː

Katu uː ɨə ɨə/iː

*iː *ie *ia

iː/aj/eː iː/ej/ia iː/eː iː eː/iː/ia ia/iː iː iː iː/æː ɛː/ia ɛː/ia ia

iː/ia/ɛa əj/ie/i eː i uː ia

*uː *uo *ua

uː/oː uː/uə uː/oː uː/oː uː/oː uː/uə/ɔː uː uː uː/ua/ɔː ua ua ua

uː oː ua/ɔa

uː/ua/u/ɔ/ɨ/ɑ uː/u uə

As can be seen to some extent in the tabled correspondences, in some environments Katu long vowels have become short, and this has led to a crowding of the short vowel space, and a modest restructuring resulting in an additional degree of aperture. The Katu long /eː, oː/ are infrequent and tend to occur in loans. Consequently, we can be reasonably confident that the proposed pKatuic vowel inventory explain the daughter inventories well without any significant unresolved difficulties. A Swadesh 100 list of pKatuic lexical reconstruction is included at the end of this chapter. 3.7 Khasian (Meghalayan) 3.7.1 Introduction The Khasian languages (I would prefer the more neutral term Meghalayan) are spoken by a substantial population in Meghalaya State of North East India; although the group itself is extremely asymmetrical, dominated socially by Standard Khasi (henceforth simply Khasi) while several other closely related languages and transitional dialects can be quite small. Perhaps the most important for reconstruction, Lyngngam, has only some thousands of speakers. Comparative-historical analyses of Khasian have been lacking in the literature; since the work of Schmidt (1904) the usual practice of scholars has been to cite Khasi forms as representative of the branch and pay scant attention to other Khasian languages. Still, Standard Khasi is important for comparative purposes; it shows a rich inventory of consonant clusters and vestiges of morphology. Also, being isolated geographically from SEAsia the branch has not been under the same areal pressures to restructure phonologically as many other AA groups.

284

sidwell and rau

Recently, more extensive sources of lexical and other data for Khasian languages have become available (e.g. Nagaraja 1996, Choudhary 2004, Bareh 2010, Ring 2012, Baker 2013 and others) and these have permitted the compilation of a progressively larger comparative Khasian lexicon, and an emerging pKhasian reconstruction. Details of the emerging reconstruction were discussed in Sidwell (2011b), and in early 2012, a working version of the lexical compilation and reconstruction was posted online at sealang.net/monkhmer. The following discussion reports the emerging results of this work. 3.7.2 pKhasian Phonology The pKhasian sound system is reconstructed with reference to four principal languages, Khasi, Pnar, War and Lyngngam. All four show aspirated stops at Ci corresponding to plain voiceless stops in many other AA tongues, indicating a sound change characterised by Haudricourt (1965) as a Germanic shift. A similar change has occurred in some individual languages/sub-branches elsewhere in AA, but only in Khasian has it affected a whole branch, and is therefore reconstructable to the pKhasian level. Additionally, there was a merger of pAA implosives and plain voiced stops yielding a single voiced series. A moderate number of etyma retain initial voiceless stops without the Germanic shift, for reasons that are unclear; also there are apparent lexical innovations at pKhasian that include voiced, voiceless and aspirated Ci stops. The pKhasian syllable and word templates are reconstructed as follows: monosyllables sesquisyllables/disyllables *Ci(Cm)VCf *(Cp(r/n))CiVCf

Consonant clusters/sesquisyllables are extensive, and have been the subject of discussion, including consideration of whether they violate some of Greenberg’s (1965) phonological universals (e.g. Henderson 1976; 1989–90). In summary, initial stop sequences show few restrictions, and clusters of voiced + voiceless stops (plain and aspirated) and stop + fricative are common. Such sequences are typically written orthographically with an intervening letter y, reflecting a tendency to vocalise the transition between stops with a short [ɨ̆]. Comparing Khasi to Greenberg’s universals, Henderson remarks: Underlying this generalization would seem to be the idea that there is a tendency to have the voiced consonants of a cluster closest to the vowel, i.e. adjacent to the voiced nucleus of the syllable. This may indeed be a tendency, but it is not a universal. Khasi has initial clusters bt-, bth-, bs-,

austroasiatic comparative-historical reconstruction

285

bsh- [bʃ], dk, dkh-, dp-. In fact, there seems in Khasi to be a deliberate dissimilation of voicing in initial clusters, especially when the cluster is of two stops. Thus, alongside bt-, dk- and dp-, we can have tb-, kd- and pd-, but pt- and tk- are reserved for a few onomatopoeic and what one might call ‘expressive’ words only. The general word stock prefers voice dissimilation. Henderson 1989–90: 62

Traces of nasal and rhotic infixation, apparently now unproductive, are frequent, and infixes can be assimilated to a following sonorant producing a long segment (cf. Khasi ʃɔŋ ‘to sit’ and nominalisation ʃɨ̆nɔŋ ‘village’; Khasi pɨ̆llac ‘set free’ shows an assimilated nasal, while the cognate in Pnar, pnlac ‘to cause’, retains an unassimilated nasal). In terms of vocalism, Khasian languages variously show either no length contrast, or an incomplete or asymmetrical length contrast among the main syllable vowels. The comparative reconstruction indicates an inventory which is strikingly similar to the inventory indicated for pPalaungic, including the presence of a single *ia diphthong (in place of ɛː); their diphthongs are secondary. The pKhasian phonemes are tabled as follows: Ci Consonants * pʰ tʰ kʰ p t c k ʔ b d ɟ m n ɲ ŋ w r, l j s h Cf Consonants * p t c k ʔ m n ɲ ŋ w r j h Cp Consonants * pʰ tʰ kʰ p t c k b d ɟ r, s h

286

sidwell and rau

Cm Consonants * w r l j h Main syllable vowels * i u iː uː e ə o eː əː oː ɛ a ɔ iə aː ɔː

The proto-vocalism is presently reconstructed on the assumption that Standard Khasi is reasonably conservative, although not without some changes. Most Khasian varieties have lost the AA vowel length contrast, with some concomitant restructuring, but length remains in the standard dialect, and Pnar (closely subgrouped to Khasi) also seems to have retained some old values not well represented in Khasi orthography. Together these varieties provide a strong basis of pKhasian vowel reconstruction. 3.7.3 pKhasian Phonological Innovations In addition to the Germanic shift mentioned above, there are a number of striking indentifying phonological innovations mong the Cf segments; all Khasian languages reflect the following changes: Change *-l > -n /[apical, stop]_ *-l > zero elsewhere *-s > t *-h > t *-ʔ > zero

pAA (Shorto#) pKhasian *ɗiəl (#1745a) > *deːn ‘tracks’ *priəl (#1791) > *pʰria ‘hail’ *kiəl (#1710) > *kʰia ‘cucumber’ *nɔːs (#1912) > *nɔːt ‘heart/liver’ *ʔas (#1871) > *ʔat ‘to swell’ *kpaːs (#1915) > *kmpaːt ‘cotton’ *kiəh (#1967) > *kʰiat ‘deer’ *cuh (#1984) > *prsut ‘bellows’ *caʔ (#8) > *bsa ‘to feed’ *kaʔ (#16) > *kʰa ‘fish’

While the changes among the Cf segments listed above are shared by all Khasian languages, there are also significant phonological changes within Khasian languages. Three specific developments are illustrated below.

austroasiatic comparative-historical reconstruction

287

1) Rhymes with zero/glide finals pattern distinctly according to sub-groups: proto-rhymes *aːj *aːw *əw *əj *uː *iː

Lyngngam Khasi (orth.) Pnar War pKhasian forms aj ái e ɛ/aj *laːj ‘3’, *kʰnaːj ‘rat’ aw áw o ia *-maːw ‘stone’, *saːw ‘red’ u ew aw ia *ksəw ‘dog’, *ɟəw ‘sour’ i ái aj ua *rwəj ‘sing’, *bnəj ‘moon’ u iew u əw/ow *cʰruː ‘taro’, *tʰluː ‘hole’ ɛj/i i i ɛ/i *sŋiː ‘sun’, *ʔiː ‘we’

2) In the War sub-group, which includes Mnar and Amwi, there is a raising of *a, *aː, often to [ə] and [i]; compare the following: pKhasian *kʰlaː ‘tiger’ *kʰaː ‘fish’ *snaːm ‘blood’ *ʔaːr ‘two’ *ksəw ‘dog’ *bnəj ‘moon’

Lyngngam Khasi(orth.) Pnar Mnar Amwi kʰla khla kʰla kʰli kʰli əkʰa ’kha dəkʰa - hi snam snam snam cʰim rnə ʔaːr ar ʔar iːr Ɂə̃ r ksi ksew ksaw ksou ksia bni bn§ai bnaj pni pnua

3) Among initial palatals and *s there is a four-way correspondence that indicates a partial lenition/frication of *c: pAA *ɟ *c /C_ (?) *c elsew. *s

pK Lyngngam *ɟ ɟ *s s *c c *s s

Khasi (orth.) j s sh s

Pnar ɟ s ʧ⁽ʰ⁾ s

War pKhasian forms c *ɟaː ‘rice’, *ɟəw ‘sour’ s *ksəw ‘dog’, *prsaː ‘nephew’ ʃ/cʰ *cɔŋ ‘to sit’, *cʔeːŋ ‘bone’ s/r *sim ‘bird’, *snaːm ‘blood’

Consequently, it would appear that the main problems of Khasian historical phonology are readily tackled, with the main challenge being to sort out finer details of the vowel correspondences, and to flesh out the comparative vocabulary. A Swadesh 100 list of pKatuic lexical reconstruction is included at the end of this chapter.

288

sidwell and rau

3.8 Khmuic 3.8.1 Introduction The Khmuic branch is primarily represented by the vast Khmu’ dialect chain that spreads across Northern Laos and into neighbouring countries, and another ten or so member languages located around the periphery of the Khmu’ area, and into Thailand, China and Vietnam. Khmuic is internally very diverse, with massive lexical change reducing the amount of cognate basic vocabulary between languages to below 30%; this makes it especially difficult to find sufficient cognates to satisfactorily fill out phonological correspondences. Nonetheless, there is sufficient material to prepare a useful and informative reconstruction. The lexicon of the Khmu’ dialects is richly represented in published sources (e.g. Lindell 1974, Premsrirat 2002), and although published lexicons for other Khmuic languages are fairly mixed, quite useful vocabularies are accessible (a bibliography of Khmuic studies is given by Proschan (1996)). Due to the lack of good published resources for the minor Khmuic languages, historical studies have either focused on sub-groups (such as Filbeck’s (1978) T’inic reconstruction) or invoked Khmu’ reflexes in discussion of wider AA issues (such as Haudricourt (1953, 1954) using Khmu’ cognates to solve the origin of Vietnamese tones). Significant comparative studies include:

·

Filbeck (1978): a reconstruction of proto T’inic based on Mal and Pray dialects, the text includes discussion of old loans from Thai and tables of lexical comparisons with Khmu’, Palaungic, Khmer and Mon. Lindell et al. (1980): laids out the framework for comparative analysis of Khmu’/Kammu with extensive well organized data for seven dialects. Egerod (1984): a paper presented to the 17th International Conference of Sino-Tibetan Languages and Linguistics, present brief comparative study of Mlabri with phonological correspondences with Kammu, Lamet, Mal, Proto-Palaung, Proto-Waic and Mương. Unchalee Singnoi (1988): this unpublished MA thesis develops much of the work in Filbeck (1978), including presentation of an extensive Mal-Pray lexicon (approx. 1250 entries). Suwilai Premsrirat (2001, 2004): two papers discuss tonogenisis/registrogenisis in Khmu’ dialects. Rischel (2007): analyses the history of Mlabri in terms of an elaborated pTinic reconstruction. Rischel indentifies various irregular or otherwise unexplained correspondences and sets up numerous numbered variants of proto-phonemes.

· · · · ·

austroasiatic comparative-historical reconstruction

289

In 2011 I compiled a comparative Khmuic lexicon and reconstruction, which is posted online at sealang.net/monkhmer, and details were presented in a talk given at the 5th International Conference on Austroasiatic Linguistics at Canberra in September 2013. The discussion below follows that analysis. 3.8.2 pKhmuic Phonology The pKhmuic sound system is complicated by several factors; it is apparent that in pre-history pKhumic spread from a homeland in the far northwest of Laos to cover more or less the northern half of the country. During this initial dispersal the speaker community separated into dialects that underlie the current linguistic diversity of the branch. Some time there was another expansion out of the north west, possibly associated with the establishment of a kingdom localised across an area roughly from Luang Prabang to Xieng Khouang (speculatively associated with the Plain of Jars culture). The second expansion accounts for the present distribution of Khmu’ dialects, and saw substantial inter-dialectal borrowings which shows up as multiple phonological correspondences between Khmu’ and the other languages. In more recent centuries, Khmu’ social dominance waned in favour of Lao, Northern Tai and Vietnamese, leading to substantial linguistic influence—including phonological restructuring—especially in the geographically peripheral languages. pKhmuic is characterised as unrestructured, although the pAA implosives are merged with the voiced stops. The vocalism is very conservative, and appears to provide good witness to pAA values, including retention of pAA length contrast and syllable final glottal stops. The comparative reconstruction relies heavily on Khmu’ Cuang which is a phonologically conservative dialect (Premsrirat 2002), and Kammu Yuan which is restructured and tonal (Lindell et al. 1980, Svantesson et al. 2002, Premsrirat 2001; 2004). In addition to these Khmu’ dialects, there are extensive lexicons for Mal/Pray (Filbeck 1978; 2009) and Mlabri (Rischel 1995; 2007), both spoken in the western periphery and into Thailand (where most field work was done). Mal/Pray is a dialect chain that shows a Germanic shift (aspiration of voiceless stops) while Mlabri is an enigmatic language that is lexically diverse but apparently phonologically conservative. On the eastern periphery, extending into Vietnam, are languages of the Pramic sub-group, plus Ksing-Mul (which which author suspects may be an aberrant Mal/Pray dialect); the published sources for these languages are not as thorough as one would like but are nonetheless useful.

290

sidwell and rau

The pKhmuic syllable and word templates are reconstructed as follows: monosyllables sesquisyllables/disyllables *Ci(Cm)VCf *(Cp(r/N/m/l))CiVCf

The minor syllable rhyme segments r/N/m/l are typically infixes, plus we should probably reconstruct coda-copying in expressive vocabulary to account for forms such as as Khmu’ Cuang klɔːk slkuəl ‘shade of white’, mat rpjeːp ‘wink quickly’ etc., but the lack of direct comparisons in other Khmuic languages means that no specific etymologies with copied codas have been identified in this study. There are also some marginal cases that suggest a phonemic minor syllable vowel before an approximant Ci but the evidence is not strong. The pKhmuic phonemes are reconstructed as follows: Ci Consonants * p t c k ʔ b d ɟ m n ɲ ŋ w r, l j s h Cf Consonants * p t c k ʔ m n ɲ ŋ w r j h Cp Consonants * p t c k (b) (d) ɟ g r, l s h Cm Consonants * w r l j Main syllable vowels * i u iː uː iə uə e ə o eː əː oː ɛ a ɔ ɛː aː ɔː

291

austroasiatic comparative-historical reconstruction

Phonotactic notes: The pAA pattern of not permitting open syllables is continued in pKhmuic. The Cp inventory presently reconstructed is quite small, including only two voiced oral stop members: *ɟ, *g, but one would expect also *b and *d on typolgical grounds, especially in infixed forms. Interestingly, Cp *ɟ is reconstructed for forms such as *ɟʔaː₁ŋ ‘bone’ which has a voiceless initial in Khmu’ Cuang /cʔaːŋ/, and we might have suggested that voiced reflexes (such as Mlabri /ɟiʔɛːŋ/) are due to voicing dissimilation much as we see in Aslian. However, we would expect *c to lenite to /s/ in Cuang, which has not happened, so the palatal stop in assumed to reflect *ɟ. Cm *h is not reconstructed, as pre-pKhmuic is marked by the regular loss of medial /h/. Later borrowing of aspirates, affixation and sound changes reintroduced medial /h/. The proto-vocalism presents some problematic correspondences that are provisionally dealt with by positing numbered variants. These are cases where there are apparently two distinct correspondences that continue a single pKhmuic vowel. The pattern appears to be that conservative forms, probably diffused from Khmu’, have replaced forms that would be expected to have undergone regular sound changes. Examples: pAA (Shorto#)

gloss

pKhmuic K-Cuang K-Yuan

Mlabri Mal

Ksing-Mul Pong

*ɟhaːm (#1430) *cʔaːŋ (#488) *sra(ː)ŋ (#709) *taːɲ (#898) *haːn (#1216) *smaːl (#1779) *par (#1633 ‘fly’) *cnam (#1368) *[t]ɓəl (#1768) *tmiʔ (#144) * piʔ (#257) *kləjʔ (#1519) *klmiəʔ (#137)

‘blood’ ‘bone’ ‘tooth’ ‘to weave’ ‘goose’ ‘soul’ ‘wing’ ‘year’ ‘thick’ ‘new’ ‘three’ ‘husband’ ‘sugarcane’

*maː₁m *ɟʔaː₁ŋ *sraː₁ŋ *taːɲ *haːn *-maːl *pna₁r *-na₁m *-ba₁l *-meʔ *peʔ *gle₁ʔ *klme₁ʔ

mɛːm ɟiʔɛːŋ thrɛːŋ taːɲ – hmaːl hnʌr hnʌm ɓʌn hmɛʔ pɛʔ – gilmɛʔ

miəm – həgiəŋ taːɲ – həlmaːl phəlnal – – həmeː – ʔəliː –

maːm cʔaːŋ ʰraːŋ taːɲ – hmmaːl pnɨr nɨm hmbɨl hmmeʔ peʔ gleʔ klmeʔ

màːm cəʔáːŋ raːŋ taːɲ háːn hərmàːl pənɯ́ ːr nɯ̀ m həmpɯl həmmeʔ péʔ klèʔ kəlmèʔ

miam ʔiaŋ siaŋ thaːɲ haːn maːl panəɰ – mpal mɛʔ phɛʔ – –

miːm sɁiːŋ hriːŋ taːɲ haːn maːl ʔannaːr – – trməːj – – kəlmiː

292

sidwell and rau

Each item has a strong AA etymology, so we can be rather confident that the phonetic values approximate what has been reconstructed. The correspondences with subscripts show expected or regular correspondences based on consideration of patterns across the wider data set, while the unmarked correspondences show trivially similar or unchanged vowel values. 3.8.3 pKhmuic Phonological Innovations There are two important phonological changes that belong to pKhmuic: 1. 2.

The loss of implosion and merger of pAA implosives and plain voiced stops. The loss of pAA Cm *h.

The loss of implosion is not an uncommon change, but it is important that it is reconstructable to the pKhmuic level. Implosives /ɓ, ɗ/ are recorded for Mlabri, sometimes in older Tai loans, but also in some AA etyma, but the Mlabri lexicon is so mixed in origins it is unsafe to reconstruct a pKhmuic contrast on that basis (see also Rischel 2007 for discussion). The loss of medial h is strikingly unique to Khmuic and produces distinctive reflexes, e.g. pAA (Shorto#)

pKhmuic

Khmu Cuang

Mlabri

Mal

Ksing-Mul

Pong

*ɟhaːm ‘blood’ (#1430) *bhiːʔ ‘to be sated’ (#259) *khəjˀ ‘moon’ (#1542) *phoːm ‘to fart’ (#1433)

*maː₁m *biʔ *kiʔ *poːm

maːm biʔ – puːm

mɛːm biʔ kiʔ –

miam piʔ – –

miəm – – poːm

miːm – kiː pom

A Swadesh 100 list of pKhmuic lexical reconstructions is included at the end of this chapter. 3.9 Palaungic 3.9.1 Introduction The Palaungic (or Palaung-Wa) branch is spoken by numerous scattered communities in the Shan state of Myanmar and neighbouring parts of Yunnan, Thailand, and Laos. Diffloth (1980) expresses the importance of Palaungic for AA reconstruction:

austroasiatic comparative-historical reconstruction

293

Among the fourteen or so extant branches of the Mon-Khmer family, only three or four have developed and preserved enough differentiation today to yield proto-branch reconstructions of great antiquity. They are: the Bahnaric, the Aslian, the Palaungic and probably the Viet-Mương branches. It is mostly from these reconstructions that we will be able some day to cast a glance at Proto-Mon-Khmer and beyond. Diffloth 1980: 3

Various sub-groups have been subject to comparative-historical studies, including an early important treatment by Schmidt (1904). Proschan (1996) and Gordon (2013) provide select bibliographies of Palaungic studies. Works immediately relevant to the comparative reconstruction include:

·

Shafer (1952): sets forth over a hundred comparisons for 11 Palaungic tongues, establishing regular correspondences for the consonants and vowels, proposing some 88 Proto-Palaungic items. Shorto (1963): provides an important phonological and morphological analysis of Palaungic word structure, including prefixation, crucially informing the reconstruction. Luce (1965): provides 245 lexical comparisons between Mon, Danau, Riang, Palaung and Wa, plus an appendix of wider comparisons, in addition to a rare Danau lexicon. Mitani (1977): compiles phonological correspondences for consonants and vowels based on 283 comparisons for languages of the Palaung-Riang sub-group. Mitani (1979): follows-up Mitani (1977) by working Riang into the analysis. Diffloth (1977): sketches important aspects of a Palaungic reconstruction, outlining a classification based on phonological innovations. Diffloth (1980): a monograph-length comparative study of the Waic subbranch, proposing 544 etymologies and reconstructions. Svantesson (1988): the most important analytical source on the Angkuic sub-branch, going into great detail analyzing the remarkable phonetic history of U, with extensive comparisons to other Palaungic data (especially with Hu and Lamet). Paulsen (1989/1991): effectively a reconstruction of the Samtao sub-branch of Waic, which lacks a distinct reconstruction in Diffloth’s treatment. The results include an etymological lexicon of 556 entries and reconstructions, comparing Plang, Shinman and Pangloh, drawn from the author’s own field data.

· · · · · · · ·

294

sidwell and rau

· ·

Diffloth (1991): a short follow-up to Diffloth (1980) that discusses data from the Bulang sub-branch of Waic and offers a partial vowel reconstruction. Shorto (2013): is an extensive Palaung lexicon with numerous etymological and comparative notes (facsimile publication of ms. dated 1957). In 2010 I compiled some 1250 comparisons and reconstructions based on Danau, Riang, Rumai, Palaung, Hu, U, Lamet and Diffloth’s pWaic lexicon, and this was made available online at sealang.net/monkhmer. The commentary that follows largely reflect those results. 3.9.2 pPalaungic Phonology Compared to pAA, the pPalaungic sound system shows various mergers that simplified the vocalism, and some consonant changes that resulted in preglottalized and preaspirated sonorants in monosyllables, otherwise the system is quite conservative, retaining the three pAA stop series. Subsequently pPalaungic dispersed into at least five sub-groups, each with a distinct phonological history. How these sub-branches coordinate is an open question (see discussion in the separate chapter on classification); but for the present purposes the reconstruction is not dependent on any particular model of philogeny and the sub-branches are treated as equally coordinate. Phonologically the sub-branches have the following secondary characteristics:

·

Danau: spoken in just a few villages west of Inle Lake (Shan State), Danau has been restructured by contact with Shan and Burmese, leading to loss of vowel length contrast, devoicing of stops, a four tone system, and the merger of final labials with apicals (*-p, *-m > /-t, -n/). Palaung-Riang: a large dialect chain the Shan State; while some dialects are innovative and somewhat restructured, the Palaung recorded by Shorto and used in his (2006) reconstruction is quite conservative and is an important witness to the reconstruction. Waic: A large number of Wa dialects are spoken in Thailand, Shan State and Yunnan; Diffloth (1980) reconstructed pWaic. Although various Waic dialects are restructured, the results indicate that pWaic distinguished voicing in stops although lacked any vowel length contrast. Lameet: Several Lameet dialects are spoken in Northern Laos near the Myanmar border. While at least one dialect has devoiced stops and structured to yield a two tone system, vowel length contrasts are retained consistently.

· · ·

austroasiatic comparative-historical reconstruction

295

·

Angkuic: a minor group in the east of the Shan State, Angkuic languages have uniquely developed tones out of vowel length contrasts. The group also underwent devoicing of stops and a Germanic consonant shift, but unconnected to the tonogenisis. Other languages identified as Palaungic by this author, Khabit and Khang, spoken in Laos and Vietnam respectively, are known only by very limited published materials and are not used in the reconstruction. The syllable and word templates are reconstructed as follows: monosyllables sesquisyllables/disyllables *Ci(Cm)VCf *(Cp(r/l/N))CiVCf *Ci[+son., -voice]VCf *Ci[+son., +preglttal]VCf

All stressed syllables are closed. I reconstruct the following segmental inventories: Ci Consonants * p t ʦ c k ʔ b d ɟ g ɓ ɗ m n ɲ ŋ m̥ n̥ ɲ̊ ŋ̊ ˀm ˀn ˀŋ w l, r j w̥ l̥, r̥ ˀw ˀl ˀj s h Cf Consonants * p t m n w r, l s

c k ʔ ɲ ŋ j h

Cp Consonants * p t c k ʔ b d ɟ g s h

296

sidwell and rau

Cm Consonants * w r l j h Main syllable vowels * i u (eʔ/h) ə o ɛ a ɔ

iː uː eː əː oː iə aː ɔː

In terms of Phonological structure, pPalaungic was not drastically different from pAA; especially if we threat the devoiced and preglottalised initial sonorants as clustered onsets (reflecting their likely origins). However, the subsequent history of Palaungic sub-groups was dominated by consonant devoicing and syllable reduction, and vowel systems simplified by loss of length distinctions, and in some cases the emergence of contour tones and phonation contrasts. It is clear that this complex history has been largely conditioned by language contact; Palaungic communities are dispersed and intermingled with numerous other ethnic groups over diverse upland plateau and river valley environments. The largely Tai and Tibeto-Burman languages of the area have dramatically different phonologies to the Austroasiatic prototype, and multilingualism is a social inevitability. Consolidated Ci and main syllable vowel correspondences are presented here: Palaungic C1 Consonant correspondences: *Ci

Palaung-Riang Danau Riang Rumai Palaung pWaic

Angkuic Lameet Hu U

tone series

*p *t *k *c *s *h *ʔ

p t k ʦ θ h ʔ

p t k s h h ʔ

p t k s h h ʔ

p t k s h h ʔ

*p *t *k *s *h *h *ʔ

p t k s h h ʔ

ph th kh s/ch s h ʔ

high high high high high high high

*ɓ *ɗ

b~ɓ d~ɗ

b~ɓ d~ɗ

b d

b d

*b (Low) p *d/nd (Low) t/nt

p p t/nt/th t/nt

high high

ph th kh s θ h ʔ

297

austroasiatic comparative-historical reconstruction *Ci

Palaung-Riang Danau Riang Rumai Palaung

pWaic

Angkuic Lameet Hu U

tone series

*b *d *ɟ *g

– t ts k

p – c k

b – ɟ g

b – ɟ g

*b *d *ɟ *g

– t c k

p – c k

– t/nth c/k k

low low low low

*m *n *ɲ *ŋ *w *l *r *j

m n ɲ ŋ w ~ vw l r j (~ ɲ)

m n ɲ ŋ w ~ vw l r j (~ ɲ)

m n ɲ ŋ w l ɹ j

m n ɲ ŋ v l r j

*m *n *ɲ *ŋ *w *l *r *j

m n ɲ ŋ w l r/ɣ j

m n ɲ ŋ v l ʁ j

m n ɲ ŋ v/f l χ j

low low low low low low low low

*m̥ *n̥ *ɲ̊ *ŋ̊ *l̥ *r̥ *w̥

m – – – l r ~ ʰr w

m n ɲ ŋ l r w

m̥ – ɲ̊ – l̥ hʲ w̥

hm hn – hŋ hl hr hv

*hm – *hɲ *hŋ *hl *hr *hw

– – – – l r –

– – – – l θ –

– – – – l χ –

high high high high high high high

*ˀj *ˀl *ˀm *ˀn *ˀŋ *ˀw

– – – – ŋ –

c – m – ŋ w

ɟ ɗ~d m – – –

j – m – – –

*ʔj *l *ʔm *ʔn *ʔŋ –

j – ʔm~m – ŋ –

– – m – – –

– – (m)m n ŋ ʔaw

high high high high high high

The reconstruction of the implosive-plain contrast, and the preglottalized and preaspirated sonorants, are dependent on correlation with tonal series in Waic and Riang reflexes, and preaspiration recorded in Palaung (Shorto 2006 and 2013) and Riang (Luce 1964). Plain sonorant initials are associated with low series tone, while preglottals and preaspirates are associated with high series;

298

sidwell and rau

various cases of incomplete etymologies mean that we have no direct segmental indications whether preglottalisation or preaspiration accounts for the high tone. Voiced stops are associated with low tones, while implosives correlate with high tones. Implosion is not well represented in the daughter languages; Luce (1964) vacillates in writing b ~ ɓ and d ~ ɗ for Danau and Riang, so tone series seems to be a better indication of original phonation type. Unfortunately the number of relevant low series stops in the comparisons is very moderate (note gaps in the table of correspondences) but typologically the reconstruction of implosives is on solid ground; the indications are that pPalaungic implosives outnumber the plain voiced stops by three to one or more, similar to the results obtained for pKatuic and by Shorto (2006) for his PMK. Main vowel syllable correspondences: pPalaungic

Danau

Rumai

Riang

Palaung

pWaic

Lamet

Hu

U

*iː *i *eː *e /_ʔ,h *iə *ɛ *əː *ə

ɪ/i/iə/əi/ə ɪ/i/ə ɪ ɪ iː/iɛ/iə/jæ ɪ/æ a/ə/iɛ/ji a/ə

iː/ʲaj iː/ɒː ɒː ɑ/e/ɔ/aj jɜ/jo/ɛ a/ɛ/æ/ɜ/ɑ ɑ/ə/a/ɜ/æ æ/ɜ/a/ɨ/ɒ/ɔ/o

i(ː) i/e ɛ/e e/æ ia ɛ/æ ə/ɔ/e/u ə

i i/e e e/ɛ iə/eə ɛ ɯ/ə i/ɯ/ə

e *i *i *i/*e *e/*ɛ *ɛ *o *ɤ

iː i iː/eː e/ɛ ɛː ɛ eː ɨ/ə/e/i

i e e/i i/e i/e e/ɛ/a e/o/ə ɨ/ə/ɛ/e

*aː *a *uː *u *oː *o *ɔː *ɔ

ɑ/a ɑ/a/ʊ/i V[-front] V[-front] o/u/ɤ/ə o/u V[-front] ɔ/o/ə

aː ɑ/a uː/ɛw ɨ o/u/æ/aw o/ɒw/aw ɔ/ɑ/o/æ/ɨ ɔ/o

ɑ/a a u(ː) u o/u o ua ɔ/o

a ə/a u u o o uə/ɔ ɔ

*a *ɒ *ɨ/*ɤ *ɨ *o *ɔ *ɔ

aː a u(ː) u oː o ɔː ɔ/o

a a u/o u o/u – ɔ ɔ

i i i i/e ɛ/e/i e/ɛ eː e/ɛ/ ə/o a a u u o/u – ɔ ɔ

The vowel reconstruction is largely consistent with the proto-Palaung-Rumai inventory proposed by Diffloth (1991):

austroasiatic comparative-historical reconstruction

299

Diffloth (1991: 25) proto-Palaung-Rumai vowels * i u ə eː əː oː a ɛː aː ɔː

Diffloth’s reconstruction is limited to syllables with final velar nasals, in which the full set of oppositions is substantially reduced. The point is made that the system is apparently transitional between a fully elaborated AA vocalism and a more Northern Tai-like system. Extending the comparative analysis to include five sub-branches and all coda consonant environments adds additional contrasts, yielding a more balanced system further illustrating how the vocalic distinctions have gradually collapsed over time in Palaungic. Of particular note, I reconstruct a pPalaungic diphthong *iə, corresponding to Diffloth’s *ɛː, and no corresponding back diphthong. Diphthongs /ua ~ uə/ do occur in Palaung and Riang, but they correspond to /ɔ(ː)/ elsewhere in Palaungic, and within Palaung [uə] and [ɔː] are in complementary distribution, with [ɔː] before velars and labials. However, diphthongal reflexes of *iə are found generally in the Palaung-Riang group and in Danau, and have diphthongal cognates externally (e.g. ‘hailstone’: Palaung /preər/, Pnar (Khasian) / pʰria/, Laven (Bahnaric) /prial/, Katu (Katuic) /priel/ etc.). It would be irrational to posit a reversing chain shift iə >ɛː > iə when a unidirectional iə > iə/ɛː will provide a general account. A Swadesh 100 list of pPalaungic lexical reconstructions is included at the end of this chapter. 3.10 Pearic 3.10.1 Introduction The Pearic branch is a small, highly endangered group of languages spoken mainly in western Cambodia, the Trat Province of Thailand, and small pockets to the east of Siam Reap. While a number of Pearic speech varieties are distinguished in the linguistic literature, it is evident that they effectively represent only a couple of distinct languages: Pear of Kompong Thom (KPT) in the north-east of the Pearic range, plus a Chong dialect chain that extends discontinuously across the south-west of Cambodia and into Thailand (Baradat (1941) and Choosi (2007) report high intelligibility across the whole chain). The group is generally known to linguistics as Pearic, based on the somewhat derogatory Khmer term, so one might suggest “Chongic” as an alternative based on the autonym /cʰɔ̤ ːˀŋ/, although Pearic is used here. Ferlus (2011) provides an extensive bibliography of lexical and comparative sources.

300

sidwell and rau

Heavily influenced by Khmer and Thai, and terminally endangered, Pearic languages have a remarkable 4-way register/phonation system that is typologically unusual and arguably significant for pAA, although views differ on this (see Huffman 1985b for a desciptionof the system in Chong, and Diffloth 1989 for comparative perspective). The group is so small and lacking in internal diversity that it probably reflects a single language that diversified into several dialects since the Angkor period. Additionally, major phonological restructuring, such as devoicing of stops, emergence of breathy phonation and associated vowel changes, probably occurred in circumstances of contact with Khmer and Siamese during relatively recent times (Ferlus (2011) dates the devoicing in Khmer and Siamese as occurring in the second half of the 17th Century; Headley (1998) confirms this with reference to Chamic loan word phonology). Consequently pPearic is probably less than 1000 years old, and this affects its usefulness as a witness language for pAA. Comparative-historical work on Pearic really begins with Headley’s (1985) preliminary pPearic reconstruction, which treated around four hundred etymologies, but did not discuss the register question. The issue was advanced considerably at the 2009 ICAAL meeting in Thailand with presentations by Michel Ferlus and Weera Ostapirat, with Ferlus’ analysis being published in 2011. Building on those works, I have compiled of a comparative Pearic vocabulary and a working model of historical phonology; it is this synthesis that informs the following discussion. 3.10.2 pPearic Phonology The syllable and word templates are reconstructed as follows: monosyllables sesquisyllables/disyllables *Ci(Cm)VCf *(Cp(r/l/N))CiVCf

All stressed syllables are closed. I reconstruct the following segmental inventories: Ci Consonants * p t ʦ c k ʔ b d ɟ g m n ɲ ŋ m̥ n̥ ɲ̊ w l, r j (w̥ ) r̥, l̥ j̊ s h

austroasiatic comparative-historical reconstruction Cf Consonants * p t m n w r, l s

301

c k Ø ɲ ŋ j h

Cp Consonants * p t c k b d ɟ g m r Cm Consonants * w r l j h Main syllable vowels * i ɨ u iː ɨː uː iə e o eː oː ɛ a ɔ ɛː aː ɔː ± v̰ (creak)

The pAA word final glottal stop was lost (as in Khmer, Katuic, and partially in Bahnaric) apparently at a pre-pPearic stage, in connection with the emergence of creaky register (the origin of the later remains obscure). The Ci consonants show no implosive series, as these merged with the regular voiced stops in the pre-pPearic stage, paralleling a similar development in pre-Khmer. Voiceless sonorants are reconstructed to account for a modest number of high register words with sonorant initials, although these may also reflect diverse origins, such as clusters with voiceless initials that have not survived but may be recoverable with more evidence. The Cp inventory shown here is quite small, being an artefact of the limited reconstruction done so far; I expect that there was more complexity in pPearic initials and initial sequences than indicated here. The pPearic vowel system appears to show a single front diphthong *iə, which can be reconstructed on the basis of reflexes in western Chong dialects. Diphthongs are generally common in Pearic, but mostly they can be shown to originate with historical monophthongs, or in loanword vocabulary. Nonetheless, I have found at least seven etymologies which support an *iə reconstruction, all of which correspond to a similar diphthong elsewhere in AA. Strikingly, this asymmetry in diphthongs is paralleled in Palaungic and Khasian.

302

sidwell and rau

The vocalism is also notable for the lack of a mid-central vowel, although I do reconstruct high centrals *ɨ/ɨː; frequently these reflect raised reflexes of a pAA schwa in addition to conditioned reflexes of other proto-vowels. Proto Pearic is also reconstructed as having a creaky register, which is assumed to have arisen in pre-pPearic—I am sympathetic to the view of Ferlus (2011) that creak emerged in connection with initial clusters in pre-pPearic, but that subsequent changes have obscured much of the recoverable details. Breathy register, which exists alongside creak in modern Pearic (creating the unusual four-way register contrast), is taken to be a later development associated with devoicing of stops in late pPearic, and is assumed to have occurred in the context of contact with Khmer, which devoiced stops during the Middle Khmer period. A fundamental problem of Pearic reconstruction is how to account for the presence of both plain and aspirated stops in high register vocabulary in Chong dialects, while Pear of Kompong Thom (KPT) generally shows plain voiceless stops without aspiration. Headley (1985) reconstructed two series of stops for this double correspondence, giving them underspecified values (*p, *t, *c, *k versus *P, *T, *C, *K). Both Ferlus (2011) and Ostapirat (2009) reject the idea of two high series in pPearic. The emerging consensus is that there was a general shift to aspiration in the Chong high series stops, while no such shift occurred in KPT, and voiceless unaspirated stops were mostly reintroduced into Chong by lexical borrowing. Perhaps a third of relevant high series Chong vocabulary shows unaspriated voiceless stops. A number of Khmer and Siamese loans can explain these, but there remains a residue of distinctive Pearic vocabulary represented in the problematic lexicon. The working hypothesis is that dialect mixing reintroduced unaspirated Pearic forms into Chong, at a time when Pear speakers were more numerous and widespread than at present. Additional circumstantial support for the hypothesis of a once-larger Pear speaker community can be found in analysis of loanwords among neighbouring AA groups. For example, the distinctly Pearic etyma *tuːs ‘head’ and *kic ‘small’ were borrowed into West Bahnaric languages (see Sidwell & Jacq 2003). The presence of Pearic loans in neighbouring groups strongly suggests a much more extensive Pearic presence north east of Tonle Sap in past times. The other significant problem for Pearic historical reconstruction is the register system, which in its most elaborated form presents four distinctions, divided into low and high series, as Siripen (2001) characterizes for Chong:

austroasiatic comparative-historical reconstruction Table 4.1

303

Chong registers

high series

R1 [CVC] clear-modal R3 [CV̤ C] breathy

low series

R2 [CVˀC] clear-creaky R4 [CV̤ ˀC] breathy-creaky

The breathy registers straightforwardly arose in association with pPearic voiced intials, much as also happened in Khmer and other languages of the region. The following table of examples helps to illustrate the restructuring of voiced, voiceless and implosive onsets (and other changes discussed in this section). Gloss

pPearic

Chong (Huffman 1985b)

Chung (Choosri 2009)

Samre (Pornsawan 2001)

KPT Pear (Baradat 1941)

‘water’ ‘forest’ ‘two’ ‘resin’ ‘foot’

*daːk *briː *ɟar *ɟar *ɟɨŋ

tʰa̤ ːk pri̤ː paːˀj – –

tɤ̤ ːk pri̤ː pɤ̤ aːˀ – cɯŋ

tâːk prìː pâːr sàr sìɲ

teak [tiek] prî [priː] pierrl [piər] char [car] –

‘thorn’ ‘to weave’ ‘louse’ ‘rat’ ‘hand’ ‘to go’ ‘to bark’

*ɟrlaʔ *taːɲ *ciː *kɔːˀn *tiː *ceːw *ker

(kala̤ ʔ) tʰaːɲ cʰiː kʰɑːˀn tiː cɛːw kɛːw

crlɯ̤ ːˀ tʰaːɲ cʰej kʰɔːˀn tej ceːw kaj

salâː – cʰiː kʰûən tiː ciːw kir

pAA (Shorto#):

*ɗaːk (#274) *briːʔ (#181) *ɓaːr (#1562) *ɟar (#1601) *ɟiŋ/*ɟəŋ (#538I) chrâléa [crəlie] *ɟlaʔ (#205) tanh [taːɲ] *taːɲ (#898) chei [cej] *ciʔ (#39) – *kuən (#1127) ktei [ktej] *tiʔ (#66) – *cəw (#1821) kier [kiər] *(c)kiːl (#1709)

It remains difficult to explain the emergence of creak in Pearic, as it is apparently unconnected to membership of either high or low series. The explanation offered by Ferlus (2011) is that the creaky series arises quite early, due to the effect of initial clusters/sesquisyllables conditioning a more tense

304

sidwell and rau

articulation. More speculatively, he also suggests that this was reinforced by influence from Middle Chinese, which is thought to have gone through an analogous phonological change. This makes some sense in the context of the trans-Indo-China trade route that operated during the First Millennium, passing through Pearic territory near the Gulf of Thailand (so called “Han Trail”). Ferlus’ proposal is difficult to assess and it is possible that another internal explanation will yet emerge. Either way, creak appears to be old in Pearic, and we are forced to reconstruct it as a feature of pPearic, and that is where the matter rests for now. 3.10.3 pPearic Innovations In addition to the distinct phonological history discussed above, lexical innovations unique to Pearic are readily identified and compiled. A selection follows: Gloss

Kasong (Noppawan 2003)

Samre (Pornsawan 2001)

Chong (Huffman 1985)

KPT Pear (Baradat 1941)

items replaced (Shorto#):

‘fish’ ‘moon’ ‘water leech’ ‘chicken’ ‘fire’

mê̤ːw kaːŋ pîː lɛ́ːk ple̤ːw

miːr kaːŋ pîr liək pliːw

me̤ːˀw kaːŋ piːˀw lɛːk pli̤ːw

miel [miəl] kang [kaːŋ] pair [pəir] lék [leːk] phlou [pʰlow]

*kaʔ (#16) *khəjʔ (#1542) *gləw (#1858) *( )ʔiər (#1552) *( )ʔu(ː)s (#1872)

A Swadesh 100 list of pPearic lexical reconstructions is included at the end of this chapter. The list is extracted from the working reconstruction compiled by Sidwell. 3.11 Vietic (Mương-Viet) 3.11.1 Introduction The Vietic branch is extremely asymmetrical in its distribution, reflecting the political development of Vietnam over the past Millennium. The branch consists of Vietnamese with over 80 million speakers, the very closely related Mương dialects (often written simply as Muong) with over a million speakers, plus maybe a dozen very small languages spoken mainly by communities living in the vicinity of the Viet–Lao borderlands. Documentation of the smaller

austroasiatic comparative-historical reconstruction

305

more isolated members is still a pressing matter, but on balance the study of the branch is quite well advanced; for a somewhat dated but useful bibliography see Barker (1993). The Vietic languages have been the subject of significant comparative studies over the past six decades, one can now say that the history of the group is quite well understood, and the proto-lexicon is extensively documented. The most important works include:

·

Haudricourt (1953; 1954): produced ground-breaking papers establishing the AA ancestry of Vietnamese, and framed the reconstruction of Vietic linguistic history since. Barker (1966): presents some 600 Viet-Mương lexical comparisons, illustrating the tonal correspondences between Vietnamese and Mương. Barker & Barker (1970): reconstructs Proto-Viet-Mương finals and vowels on the basis of 210 illustrative comparisons between Vietnamese and Mương. Thompson (1976): attempts to reconstruct Proto-Viet-Mương supported by a consolidated comparative vocabulary of approximately 700 entries with Vietnamese and three Mương dialects. Huffman (1977): provides 100 lexical comparisons between Vietnamese and 18 other AA languages, organized according to phonological/tonogenetic contexts. Sokolovskaia (1978): presents a reconstruction of Proto-Vietic utilizing 27 different sources, creating a comparative lexicon of approximately 650 entries. Ferlus (1975; 1982; 1991; 1992a,b; 1996; 1997; 1998; 2001; 2005): is a series of papers Ferlus has developed a reconstruction of Vietic history. Since 1991 this has included circulating a consolidated Proto-Vietic lexicon in progressively refined drafts.

· · · · · ·

The pVietic reconstruction of Ferlus is the most authoritative work to date, and the text that follows is a synthesis of his results. 3.11.2 pVietic Phonology Broadly speaking, Vietnamese and the Mương dialects are the most innovative Vietic languages, having restructured the pVietic sesquisyllables into tonal CV(C) monosyllables. But the minor languages, especially Sac, Ruc and Arem (the latter non-tonal) are very conservative, and provide fairly complete and reliable witness to pVietic.

306

sidwell and rau

The pVietic syllable and word templates are reconstructed by Ferlus as follows: monosyllables sesquisyllables *CiCmV(Cf)/(ˀ) *Cp-CiV(Cf)/(ˀ)

Additionally, there are various infixed forms reconstructed, such as *t-rn-neːʔ ‘urine’, and some multi-segmental pre-syllables, such as *tr-kɔːt ‘monitor’, *km-ɓoːr ‘pangolin’ etc. which are not readily analysed as multi-morphemic. Ferlus uses the dash symbol (-) notation to generally separate initial clusters, without regard to whether they are mono- or bi-morphemic (or assuming that CV(C) roots are basic); in other words he assumes that any initial clusters may have been analysed as sesquisyllables. I would counter that—ignoring infixed derivatives—voiced stops and *ʧ only occur before vowels and approximants /r, l, w, j/, and they do not appear to be grammatical prefixes. Consequently I would treat most pVietic forms with onsets consisting of stop + approximant as monomorphemic monosyllables, rather than sesquisyllables. The following pVietic segmental inventories are reconstructed by Ferlus: Ci Consonants * (pʰ) (tʰ) s (kʰ) h p t c ʧ k ʔ b d ɟ (ʤ) g ɓ ɗ ʄ m n ɲ ŋ w l, r j Cm Consonants * w Cp Consonants * p t c ʧ k ʔ b ɟ g m r, l s Cf Consonants * p t c k m n ɲ ŋ w r, l j s h

austroasiatic comparative-historical reconstruction

307

Vowels * i u iː uː ə eː (əː) oː a ɛː aː ɔː (ia) (ɨa) (ua)

Aspirates *pʰ, *tʰ, *kʰ are infrequent and generally come from clusters of stop + /h/ and ancient loans. The initial *ʧ (Viet. x) is uncommon but welldocumented in the northern (Viet-Mương) sub-branch. The corresponding sound *ʤ is rare. The vowels given in parentheses appear only in borrowings which are limited to sub-branches. Borrowings give the misleading impression that these vowels may belong to pVietic. 3.11.3 Vietic Tonogenesis and Stopped Finals Perhaps the aspect of Vietic phonology which has attracted most attention is the tone system of Vietnamese and cognate tone systems in the branch, which superficially resemble Thai or Cantonese typologically. So striking is this feature, and so inconsistent with most AA languages, that for half of the 20th century in Europe (and until recently in Vietnam and China) the view dominated that Vietnamese is a Tai or Sinitic language with an AA loan stratum. This view was particularly advocated by Maspéro (1912) and Sebeok (1942), and was only convincingly overturned with the demonstration by Haudricourt (1953; 1954) that the tones emerged from AA features (particularly with reference to Khmu’ data). The six tones of Vietnamese were recognised as emerging from two distinct aspects of syllable structure: a high/low series correlating with onset voicing, analogous to registrogenesis in Mon, Khmer etc., and three further divisions depending on whether the syllable codas are sonorant, stopped or fricative. Somewhat later, scholars noted that this explanation is not quite complete (e.g. Gage 1985); in particular some words with sonorant codas have stopped (sắc and nặng) tones when this is not otherwise expected by syllable structure or external correspondence. Shorto was cognisant of this in the 1960s and 1970s while compiling his PMK reconstruction and posited various proto-forms with post-final glottal stops to account for such etyma (e.g. §1166 *punʔ ‘four’, §1813 *ʔaːwʔ ‘upper garment’ etc.). Coincidentally, it was also noted in the 1980s and onward that the various minor Vietic languages, such as Arem, Ruc, Sac etc. show corresponding glottilized continuant codas (c.f Arem *puənˀ ‘four’, *ʔɛːwˀ ‘upper garment’). These observations have lead to the current synthesis, reflected in, for example, Ferlus (1998) and Thurgood (2002) etc., which recognises a fourth category of finals in Vietic (those with glottal constriction) and forces the reconstruction of the glottalisation feature for pVietic.

308

sidwell and rau

3.11.4 Vietnamese Consonant Restructuring Typologically the Vietnamese consonant system is somewhat asymmetrical, showing a very thorough restructuring of the various consonant series. Unlike most AA languages, which typically continue many pAA sounds with little or no apparent change, the majority of Vietnamese segments have undergone significant reorganisation of phonetic features. Notable aspects of the standard dialect include:

· · · ·

eight fricatives, including a voicing distinction; voiceless bilabial stop present only in loans; no palatal or rhotic approximants; several retroflexes

The inventory of onset consonants, with orthographic symbols in parenthesis, is tabled as follows: (p) ɓ (b) m (m) f (ph) w (v,u)

tʰ (th) t (t) ʈ (tr) c (ch, tr) k (c,k,q) ʔ (Ø) ɗ (d) n (n) ɲ (nh) ŋ (ng, ngh) s (s/x), ʂ (s) x (kʰ) h (h) z (d, gi, r) ʓ (r) ɣ (g, gh) l (l)

The Vietnamese nasals /m, n/ include direct reflexes of the pAA implosives, having merged unconditionally. After the nasalisation of the old implosives, the voiceless stops *p, *t became implosive, somewhat paralleling a similar shift affecting prevocalic stops in Old Khmer. Additionally, the bilabial and velar aspirates became fricatives, as did various initial clusters, although the etymological fricative *s hardened to a stop /t/. The eventual almost complete elimination of initial clusters (Cw onsets are still tolerated) also delivered the retroflex segments. The palatal and rhotic approximants merged to a voiced fricative, although this did not happen in all dialects. Examples of these changes are illustrated in the following table:

309

austroasiatic comparative-historical reconstruction pAA (Shorto#)

gloss

pVietic

*rk[aw]ʔ (#1820) *mp[ɔ]ʔ (#105) *taːɲ (#898) *ɗaːk (#264) *punʔ (#1166) *tɓaŋ (#636) *luən (#1201a) *suk (#467) *kraʔ (#162)

‘husked rice’ *r-koːˀ ‘dream’ *s-poːˀ ‘to weave’ *taːɲ ‘water’ *ɗaːk ‘four’ *poːnˀ ‘shoots’ *-maːl ‘round’ *g-lɔːn ‘hair’ *-suk ‘road, way’ *k-raːˀ

Vietnamese

Mương HoaBinh

Thavung

Sac

Arem

ɣaw²¹

kaːw⁴ paw¹ taːɲ¹ dia⁵ poːn³ vaŋ¹ klɔːn² tʰak⁷ tʰaː³

akoː³

təkoː³ poː³ taːɲ¹ daːk⁷ təɓaŋ¹ tlɔːn² suk⁷ -

ŋkɔːˀ mpɔː taːɲ dæːk puənˀ aɓaŋ tloːn utʰʊk uræːˀ

ɓaw³³ ɗan³³ ‘knit’ nɯək⁴⁵ ɓon⁴⁵ mÏaŋ³⁵ ʈɔn⁴² tɔk͡p⁴⁵ sa²⁴

(tam) daːk⁷ poːn³ baŋ¹ sɔk⁷ kʰəlaː³

A Swadesh 100 list of pVietic lexical reconstructions is included at the end of this chapter. 3.12 Munda by Felix Rau and Paul Sidwell 3.12.1 Introduction The Munda branch consists of about 20 languages, located in eastern and central India and adjacent areas of Nepal and Bangladesh, and is fairly well known to linguistics, with a significant history of dictionaries and grammars being published over the past century and a half, including some very recent modern linguistic grammars such as Nagaraja 1999, Neukom 2001, Peterson 2011 as well as the sketches in Anderson 2008a. Khewarian is the largest subgroup of Munda, both in regard to the number of speakers as well as to the number of languages. The main languages of this group are Santali, Mundari, and Ho, furthermore there are several smaller languages, such as Asuri, Birhor, Korwa, and Turi, all closely related to the three bigger Khewarian languages. Santali and Mundari are probably the best documented Munda languages with Bodding (1929a, 1929b, 1929–1936) as well as Neukom (2001) for Santali and Hoffmann (1903, 1930–1978) as well as Osada (1991) for Mundari. Korku is the most western AA language and is geographically separated from the other Munda languages. Korku is very important from a comparative point

310

sidwell and rau

of view as Zide (1960, 1965, 2008) reports tone for Korku and his low tone seems to reflect crucial phonological properties of pMunda. Unfortunately, tone could not be verified by Nagaraja (1999). Korku and Khewarian are generally thought to form the North Munda branch of Munda. Kharia is a one of the larger Munda languages and is spoken in close proximity to Khewarian, but differs significantly from these languages. It is well described by Pinnow (1959) and Peterson (2008, 2011a, 2011b). Juang is one of the less well documented Munda languages, even though it has received some attention: Matson (1964) and Patnaik (2008). It is often grouped with Kharia as Central Munda, but a lot of evidence contradicts this grouping and Anderson (2001) argues against this subgrouping. Gutob-Remo are two very closely related languages—Gutob (also known as Gadaba) and Remo—spoken in close proximity to Gorum and Gtaʔ. Remo is reasonably well known through Bhattacharya (1968) and Fernandez (1968) while Gutob has received proper attention only in recent years by Griffiths (2008). Sora-Gorum is a subgroup consisting of two geographically separated languages. Sora has been well documented since Ramamurti (1931), Biligiri (1965) and Starosta (1967, 1976, 1992), while Gorum has received less attention: Aze (1973), Zide (1982), Anderson and Rau (2008) as well as Rau (2011a). The virtually unknown, Juray also belongs to this subgrouping. The data in Zide (1982) suggest that it is very similar to Sora. Gtaʔ is the southermost Munda language and differs significantly from other Munda languages in its prosodic structure. Even though it has been studied by Mahapatra (1976), Ghosh (2003) and Anderson (2008b), fairly little is known about this language. Gtaʔ has been grouped traditionally with Gutob and Remo, but there is hardly any data supporting this subgrouping, beyond mere geographical proximity. The Munda branch of AA restructured significantly on phonological, morphological, and syntactic levels in its development after emerging as a distinct branch. Proto-Munda restructured phonologically, eliminating consonant clusters within syllables, losing phonemic vowel quantity, most diphthongs, the glottal phonemes, and the distinct implosive series of pAA. Today, Munda

austroasiatic comparative-historical reconstruction

311

languages differ radically from the eastern AA languages on almost every level of grammar from prosody to syntax. The typological divide is so striking that Donegan and Stampe (2004) characterise it as the extreme of a typological polarity in AA. Being spoken in South Asia, the Munda languages have enjoyed a long history of areal context that is radically different to the rest of the family and this contact situation has often been used to explain the typological differences. However, as Donegan and Stampe (2004) point out, hardly any feature distinguishing the Munda branch from other branches of AA can be explained by contact in a straightforward manner. As one can note from Anderson’s overview in this volume, Munda is marked by extensive verbal morphology, especially suffixing, and rather simple segmental inventories and syllable structure. The process as well as the sequence of changes that led from pAA and pMunda to the current situation is not understood in its entirety, but we believe that crucial parts are reasonably well understood and systematic comparative Munda studies have been neglected in recent years without good reason. Comparative Munda studies were put on a strong foundation by Pinnow (1959), offering a comparative-historical phonological analysis, oriented especially towards Kharia, based on more than 500 etymologies. Pinnow’s results are preliminary and marred by irregularities and unsolved questions, and regrettably Pinnow did not follow through with more in-depth historical Munda studies. Subsequent historical work on Munda by other scholars has discussed aspects of phonological and lexical reconstruction, but on balance has really focussed on verbal morphology. Furthermore, Munda specialists, intensely interested in understanding the origins of verbal morphology, have consistently emphasised bottom-up reconstruction methodology, and this has arguably left them trying to account for such a diversity of data and analytical problems that they have not succeeded in producing a coherent consensus account. Consequently, in the second decade of the 21st century, we are yet to see a published consolidated pMunda lexicon or comprehensive comparative phonology explicitly explaining Munda derived from pAA prototypes. Researchers are reduced to extracting isolated ad hoc and provisional pMunda forms from a selection of papers over 5 decades, while Pinnow’s (1959) tables of phonological correspondences remain the principal reference of its type. There has been, to our minds, a lack of boldness, coupled with an unreasonable fear of premature publication, that have held back the field and retarded possibilities for fruitful cooperation across the field of comparative AA studies. The present writers take a different position. It is our contention that it is reasonable to make the following claims:

312

sidwell and rau

· ·

pMunda lexical root and syllable structure are reasonably well understood and present no serious problems of analysis; pMunda consonantism is very well understood; the correspondences are straightforward and are readily explained by recourse to various simple mergers post-pAA; pMunda glottal phonology has been described with unnecessary complexity; pMunda vocalism is a complex and unsolved problem, principally because length and diphthongisation contrasts were lost generally, followed by further reduction to five-vowel systems in various, although not all, Munda languages. The changes to vocalism created chaotic correspondences, such that purely bottom-up reconstruction has proven to be non-viable. However, we contend that with well supported top-down analyses, many root vowels can be reconstructed with high confidence, and simple notational devices can be employed to deal provisionally with problematic cases.

· ·

Given the above, it is a viable exercise to present a reconstruction of pMunda lexical roots with apparent AA etymologies, and we present here our provisional reconstruction on these principles. Consequently, we discuss below our reconstruction, and include (an incomplete) Swadesh 100 list of pMunda forms in the table at the end of this chapter. 3.12.2 pMunda Phonology PMunda restructured phonologically, eliminating consonant clusters within syllables, generally by shifting syllable boundaries and often without deleting segments. Thus the complex onsets of pAA gave way to a simple pattern of lone or repeated (Ci)V(Cf) syllables making up the phonological word. Consequently, with syllable internal asymmetries largely eliminated, pressure for further segmental restructuring eased and the consonants of pAA are reflected fairly faithfully. An important aspect of the reduction in asymmetry within the syllable is that the pAA ± implosive contrast was lost, reducing the phonation contrast in stops to just two series. A different change, this one apparently areally conditioned, is that initial velar nasals, an otherwise common segment, became dispreferred, and consequently there are many cases of unconditioned palatalization. Strikingly, the oral stop codas in all Munda languages are realised as pre-glottalized, unreleased, or maybe imploded stops. The glottal series of pAA—*ʔ and *h—were lost, but glottals subsequently reemerged (direct correspondneces suggest *h in a moderate number of etyma not inherited from pAA). Consequently, we propose the following pMunda consonant phonemes (the pMunda reconstructions are marked with

austroasiatic comparative-historical reconstruction

313

double ** to indicate their provisional status, with its substantial reliance on ‘top-down’ reasoning): Ci Consonants ** p t c k b d ɟ g m n ɲ ŋ w l, r j s h Cf Consonants ** ˀp ˀt ˀc ˀk m n ɲ ŋ w l j

There is an areal tendency to disfavour the velar nasals, so that there are various non-velar reflexes here and there, or it can be decomposed into a prenasalised voiced velar stop, but the diversity of reflexes argues for straightforwardly reconstructing **ŋ to pMunda. Thus we reconstruct pMunda **siŋi(iˀ) ‘sun’ reflecting pAA *t2ŋiːʔ21 (with apparently regular shift from t2 > s) based on Mundari siŋgi, Santali siɲ, Remo siŋi, Gtaʔ sni. Also, there is a general preference to shift to retroflexion for the voiced apical stops (see ‘water’ etymon below) but again the variation in reflexes suggests simply **d at pMunda. The role of glottal stops in modern Munda and pMunda is has been the source of some problems in the reconstruction of pMunda, but we believe that it is a solved issue. In the first place, we assume a pre-Munda merger in which final glottals—both *ʔ and *h—became phonological zeroes. Etymological *k became [ʔ] in coda position, although in some languages such as Santali, the glottal character of syllable final /k/, represented as k’ in Neukom (2001) is less clear. In some Munda languages such as Kharia, [ʔ] is still a transparent allophone of /g/ in syllable final position. Peterson (2011) orthographically distinguishes from even though he explicitly treats it as an allophone. Thus we find oʔ ‘house’ but og=aʔ ‘house=gen’ and ɲoʔ=te ‘eat=act.prs’ but ɲog=e ‘eat= act.irr’. In other languages such as Gorum, /ʔ/ is a now a separate phoneme that is still mostly in complementary distribution to /g/. To represent the velar character which is still preserved in Santali or Kharia, we represent pAA *k in pMunda as **ˀk. There is another source for glottals in present day 21 PAA reconstructions cited in this section are Shorto (2006) forms.

314

sidwell and rau

Munda languages and this has led to some confusion in the reconstruction of pMunda. We will come back to this below when we are discussing syllable weight and related issues. Another sound change is the loss of pAA *s in coda position. Current evidence suggests that it is lost in all Munda languages and thus is best assumed to have been lost to pMunda already. Evidence for the loss are for example pMunda **rOOˀ ‘elephant’ reflecting pAA *r[]uəs based on Gutob roʔ, Sora rʔa(ː), and Gorum raʔ as well as as **(gə)rE ‘heart, liver’ reflecting pAA *ris based on Kharia gore, Remo gire, Sora əgare, Gorum gare, and Gtaʔ grire. The vowel system is much more problematic, and we acknowledge that what we offer here is both provisional and somewhat at odds with views of other authors. As already mentioned above, it is apparent that quantity and diphthongisation contrasts were lost in the transition to pMunda. However, pAA length left traces in pMunda in regard to syllable weight and a contrast in the conditioning of resyllabification of Munda lexical roots. This does not require us to reconstruct long and short vowels as such for pMunda, as we take the resyllabification as already completed by that stage. However, we do need to distinguish light and heavy syllables in the nominal vocabulary, for which we use a second vowel sign followed by a raised glottal stop **Vˀ. Through this we intend to indicate that the vowels were mostly derived from phonologically long vowels (pAA Vː), but also that vowel length was lost later in all Munda languages and a glottal feature—either a glottal stop or creaky voice—became the default realisation of heaviness in these cases. Typologically, it would appear that Sora is an important—although admittedly indirect—witness to the pMunda vowel inventory. While other Munda languages, consistent with local areal tendency, have reduced their vowel inventories to as few as five members, one is obliged to treat Sora as having nine vowel phonemes: Sora vowels i ɨ u e ə o ɛ a ɔ

Of course, even these nine vowels are not sufficient to account for the complexity of vowel correspondences that one finds in Munda, even within unambiguous Munda sub-groups. But, we can neither throw our hands in the air in resignation nor mindlessly posit a typologically unrealistic proto-vowel inventory that tries to account for every correspondence. Rather, we take the view that the possible pMunda inventory should be modelled in a way that

austroasiatic comparative-historical reconstruction

315

approximates what appears to be the most conservative vocalism attested in the branch. In this context, our best estimation is that there was no quantity or length distinction, and applying this mechanically to the pAA inventory would yield a set of eight monophthongs, only one short of the contemporary Sora inventory. Furthermore, given the areal tendency to reduce vowel contrasts, it would be a reasonable starting point to treat the Sora inventory as the exception which has not undergone significant reductions since pMunda. However, close inspection does indicate that various splits and mergers have occurred in the history of Sora vocalism, so one would not immediately want to posit contemporary Sora values as directly reflecting pMunda vowels without clear external evidence, otherwise underdetermined values must be reconstructed. As it stands, there are many etymologies which appear to be unproblematic, and for which there should be no hesitation in accepting proto-vowel reconstructions offered here. for example, Munda reflexes of pAA *blu:ʔ ‘thigh’ unambiguously indicate pMunda *buluuˀ, although we would be delighted to receive any counter-argument. On the other hand, there are also plenty of ambiguous cases, although even amongst those it is often apparent that, for example, a central vowel is indicated rather than a front or back vowel vowel. For the present we propose a notational solution of using capital letters to represent underdetermined vowel reconstructions, as in the table below. One can also note that we presently adopt **E and **O for all non-high front and back vowels due to the difficulty of distinguishing between mid and low members. Also we presently do not propose a high-central pMunda vowel, as so far it seems that contemporary occurrences of ɨ or similar in Munda are usually readily identified as conditioned variants of other vowels. Consequently we propose the following provisional pMunda vowel inventory, with the caveat that the symbols should be read as representing a set of at least eight phonemic monophthongs. pMunda Vowels ** i/I u/U E ə/Ə O a/A

Proto-Munda also deviated in several significant ways from pAA on the prosodic level. The most apparent change is the reduction of the pAA syllable template to a maximal pMunda syllable CV(Vˀ)C. This and the resulting elimination of consonant clusters is a defining feature of pMunda. Initial clusters preceding an etymologically long vowel are resyllabified and a short copy of the original long vowel is inserted between the two consonants, so that pAA

316

sidwell and rau

C1C2Vː becomes C1V1C2V1V1ˀ in pMunda. This is demonstrated by the correspondences for *bluːʔ ‘thigh’. Note again that the final *ʔ is lost pre-pMunda and that Vˀ indicates syllable weight in our reconstruction of pMunda. This results in the following forms: gloss ‘thigh’

pAA pMunda Santali Korku Kharia Remo Sora Gtaʔ *bluːʔ **buluuˀ bulu bulù bhulu buli bulu bulu

Clusters preceding a short vowel are eliminated by having another vowel insterted between the two consonants. We assume this to be the mid-central vowel.This means that pAA C1C2V becomes C1əC2V in pMunda as can be seen in the reflexes of pAA *d2raŋ ‘horn’: gloss pAA pMunda Santali Kharia Remo Sora Gorum Gtaʔ ‘horn’ *d2raŋ **dəraŋ dereɲ ɖereŋ ɖeruŋ ɖeraŋ/ɖereŋ ɖeraŋ ɖiraŋ

The reflexes of pAA vowel length are most apparent in pMunda in this resyllabification of initial clusters. However, there is an aspect of languages of the Munda branch that requires us to retain a representation of pAA long vowels in pMunda: The minimal free noun in most Munda languages is a heavy monosyllabic word such as such as Kharia tiʔ ‘hand’ or a bisyllabic word of any syllable structure such as Sora bulu ‘thigh’. This phonological restriction on Munda nouns has been called the “bimoraic constraint” by Anderson & Zide (2001 and elsewhere). A direct effect of this restriction on minimal free nouns is seen today in the ways that various Munda languages maintain a distinction between free forms which resyllabify with mechanisms such as reduplication and epenthetic glottal stops versus morphologically bound forms that happily tolerate a light (C)V(N) nominal. In other languages such as Santali, the constraint is not active and Santali allows for CV nouns such as ti ‘hand’. As mentioned before, the loss of vowel length seems to have happened in pMunda already, but syllable weight resulting from long vowels was preserved in several environments in some Munda languages. We represent etymologically long vowels as VVˀ, since we assume that proper vowel length (*Vː) was lost in pMunda, but the heaviness—the second mora if you like—was preserved in some environments. In most languages heaviness is realised by adding a glottal stop, however Gorum shows creaky voice or glottal stop in these environments and Korku low tone. According to our view on pMunda, pAA *tiːʔ ‘hand’ remained a heavy syllable even after the loss of *ʔ and vowel length in pMunda and thus we represent the resulting form as **tiiˀ.

austroasiatic comparative-historical reconstruction

317

For a reconstruction of pMunda and in fact also for understanding modern Munda languages it is crucial to realise that there are two historical sources for glottals in most Munda languages. One set of glottal stops derive from pAA *k via pMunda **ˀk. The glottal stops representing pAA *k and pMunda **ˀk can be seen in the reflexes of pMunda **daaˀk ‘water’ from pAA *ɗaːk. gloss pAA pMunda Santali Kharia Gutob Remo Gorum Sora Gtaʔ ‘water’ *ɗaːk **daaˀk daˀk ɖaʔ ɖaʔ ɖaʔ ɖaʔ ɖaʔ (n)ɖiaʔ

Modern Munda languages possess another set of glottals that cannot be explained by pAA *k. These glottals, their source and even place in the grammar is at the very heart of many problems of the phonological and morphological reconstruction of pMunda. The phenomena involved are Zide’s nominalizing infix -ʔ- (e.g. mentioned in Zide 1966), Korku low tone (Zide 1966), Gorum creaky voice (Aze 1974, Rau 2011b), Zide’s laryngeal X (Zide 1978) as well as probably Pinnows vowel length (Pinnow 1959) and maybe Zide’s glottalised vowels of Gutob-Remo (Zide 1965). As this lists already suggests, the issue has been interpreted as vowel quality, phonation or register, or as a segmental phoneme, as well as a morpheme realised by the phoneme /ʔ/. The issue is far too complex to argue here in full. Suffice it to say that we treat all these instances as reflexes of syllable weight in pMunda represented as **Vˀ in our reconstruction. These go back to pAA *Vː in instances in which they are etymologically connected to the pAA level. The distinction between etymological /ʔ/ deriving from *k via **ˀk and glottals deriving from pAA Vː via pMunda **Vˀ or even from post-pMunda prosodic requirements is highly relevant in some languages and crucial for a successful reconstruction of pMunda. In Gorum, creaky voice is always a reflex of **Vˀ and never of **ˀk, while Gorum /ʔ/ can be a reflex of either. The difference between etymological ʔ ) *kɔ(ː)k ‘neck’ #505 *tk(ɔː)ŋ’ (nape of) neck’ #744 *t1luŋ; *t1luːŋ; *t1luəŋ ‘throat’ #144 *t1miʔ ‘new’ #264 *m[h][ɯə]ʔ; *m[h][ɯə]h ‘evening, night’ #1328 *[c][ɯə]m ‘night’ #1352 *btɔm ‘night’ #2045 *muh; *muːh; *muːs ‘nose’ #943 *ʔət; *ʔəːt; *[ʔ]it ‘used up, finished, lacking’ #1118 *ʔɔn ‘ small in quantity’ #1297 *ʔam ‘not’ #86 *ɗiːʔ; *ɗiːs; *ɗuːʔ; *ɗuːl ‘one’ #1495 *muːj; *muəj; *muːɲ ‘one’

*kɔʔ ‘neck’ *-kɔːŋ ‘nape’ *-lɔːŋ ‘throat’

*tmiːʔ ‘new’ *cəːm ‘night’ *-tam ‘night’

*mɔːh, *muːh, *muːs ‘nose’ *ʔət ‘used up, lacking’ *ʔam ‘not’

*diʔ ‘one’ *muəj, *moːj ‘one’

354

sidwell and rau

(cont.) pMunda

pKhasian pPalaungic pKhmuic

pVietic

pKatuic

pKhmeric

64.

person/ human

**kOrOˀ

*bruː

*ɓiʔ, *ʔiʔ, *ʔuʔ

*k-muʔ, *kmraʔ

*-riː, *ɗeː

*bnuːs, *kuːj

*ʔnak

65.

rain

**gəma

*slap

*sǝlɛʔ, *ɟuŋ

*kmaʔ

*k-maː

*maː, *tɨŋ, *ɓɔː

*pliəŋ

66.

red

**ɟəŋAˀt

*saːw

*roːŋ

*-rVɲ, *thuəɲ

*tɔh, *koːl

*broːŋ, *ksəw

*krhɔːm

67.

road, path

**kOrA

*lntiː

*kraʔ, *ɗeːŋ

*gruəŋ

*k-raːˀ

*krnaː

*gnlɔːŋ

68.

root (of a tree) round (object)

**rEˀt

*traːj, *tʰəːt *pnlun

*riəs

*riəs

*k-riɛs

*rias

*ris

*ʔmər

–

–

–

*wiel, *rwiəl, *rwɔŋ

69.

–

austroasiatic comparative-historical reconstruction

355

pPearic

pBahnaric pMonic

pAslian

Nancowry #Shorto (2006) pMK

Sidwell (ms.) pAA

*kɟɨm

*bŋaːj

*mnus

*səmaːʔ

pajuh

*ʔiːʔ ‘person’ *mraʔ ‘person’

*gmaːˀ

*ʔmɨː

*brəj

*ləsəm

mih

*ŋar

*ɓroːŋ

(OM bərket)

–

ʔak

*kraː

*truːŋ, *trɔːŋ

*trəw, *glɔːŋ

*noːŋ

kaji

*reːs

*riəh

*ris

jiah

*moːl

*(t)wil, *pɔːm

–

*rʔiɛs ~ ʔriɛs –

tual

#2 *ʔiːʔ ‘person’ #60 *ɲaʔ; *ɲah ‘person’ #139 *k[n]muʔ ‘person, human being’ #183 *mraʔ ‘person’ #1455 *[m]ŋaːj ‘person, human being’ #127 *miʔ; *mih; *miːw ‘rain, to rain’ #141 *gmaʔ ‘rain’ #180 *briːʔ ‘sky, rain’ #539 *juːŋ ‘rain, to rain’ #688 *[ɟ]rɔːŋ ‘red’ #788 *[t]huːŋ; *thuːɲ’ red’ #1977a *guːh ‘red’ #1865 *ks[aw]ʔ ‘red’ #162 *kraʔ ‘road, way’ #590 *rɗiŋ; *rɗiəŋ ‘road, track’ #698 *truːŋ; *truəŋ ‘path, road’ #736 *glɔːŋ ‘path, road’ #1927 *ris; *riəs; *rəs; *rʔiəs ‘root’ #1201a *luːn; *luən ‘round, to roll’ #1653 *m[ ]uːr; *m[ ] ər ‘to roll up’ #1794 *wil; *wiːl; *wiəl(ʔ); *wəl; *wal; *wail ‘to turn’

*gmaʔ ‘rain’ *briːʔ ‘sky, rain’

*-saw ‘red’

*kraʔ ‘road, way’ *ɗeːŋ, *ɗiəŋ ‘road, way’ *truːŋ,*trɔːŋ ‘road, way’ *glɔːŋ ‘road, way’ *riəs ‘root’ *wi(ə)l ‘round’

356

sidwell and rau

(cont.) pMunda

pKhasian pPalaungic pKhmuic

pVietic

pKatuic

pKhmeric

70.

sand

**kEˀt

*cʔiap

*smaːc

*kaːc

*t-kaːc, *-rɛh

*skaɲ, *cuah

*ksac

71.

see

**(n)El

*pɛc, *Ɂjiː

*jɔ(ː)t, *jəːʔ

–

*ɗaːm

*taloːj

*jɔl, *ghəːɲ

73.

sit

**kO

*cɔŋ, *skəw

*moːk, *ɗeʔ

*jəːm

*ŋuːj

*tkuː

guj

74.

skin

**usal

*sneːʔ

*n̥ aːm, *huːr

*-puːr

*-taː, *k-roːt

*ʔŋkar

*spɛːk

75.

sleep

**gətiˀc

*tʰiaʔ

*ʔiət

*cih

–

*ɓic

*tiːk

76.

small

–

*-dit, *rit

*knɗiət, *ɓac

*ɲɛʔ

*ɗiːt

*kɛ(ː)t

*tic, *cmaːr

77.

smoke (n.)

**mOˀk

*tdɛm

*təːʔ

*ptoʔ

78.

to speak, say

**sun, **gam, **kaj

*ʔɔŋ, *krɛn

– *law, *rʔ(m)aːh, *ŋɛʔ

*ɗuːp, *k-hɔːjˀ *coːlˀ

*g(rn/ə) (?*psɛːɲ) jɨək *paːj *srtiː

austroasiatic comparative-historical reconstruction

pPearic

pBahnaric pMonic

pAslian

Nancowry #Shorto (2006) pMK

*(g)laːŋ

*braːs

*p-m-tɯl

–

jaja

*daŋ

*saw, *ʦaw

*maj

*daŋ

hew

*kɨl

*ʔwaj

–

–

kɯ̃ ʔ

*-loːˀ

*ʔəkaːr

*cnaːm

*kətɯːʔ

ʔok

*bic

*kuj, *ɓic

*cŋkuj, *dəm,

*tiɛk

tiak

*kic

*-dɛʔ

–

*kanɛ(ː)t

ric, pẽc

*kmaː⁽ˀ⁾s *ʔɲuːj

*jak

–

fus

*ɲaːj

(OM gah)

*ca(ː)l

ʔujola

*-nɔh

357

Sidwell (ms.) pAA

*ksa(ː)c ‘sand’ #874 *ksac; *ksaːc ‘sand’ #1737 *bnt1iːl ‘sand’ #158 *[s]jəːʔ; *[s]ʔəːj *jɔl ‘to see’ ‘to see’ #1696 *jɔl, *ʔɔl ‘to see’ *ŋgu(ː)j ‘to sit #283 *[ ]guk ‘to sit, down’ remain, dwell’ *-kuːʔ ‘to sit, stay’ #1448 *ŋguj ‘to sit down’ #1818 *tkaw[ʔ] ‘to sit, stay’ #1370 *cn1aːm ‘skin’ *huːr ‘skin’ #1568 *cnkuər; *[c] nka[a]r ‘integument’ #1687 *huːr ‘skin’ #1825 *kt(aw]ʔ ‘skin’ #305 *tiːk; *tiək ‘to lie *tiək ‘to lie down, sleep’ down, sleep’ #797 *ɓic; *ɓiəc to lie *ɓic ‘to lie down, sleep’ down, sleep’ *dit, *kɗi(ə)t #59 *ɲ(eː)ʔ ‘small’ ‘small’ #1016 *kɗit; *kɗiːt; *kɗiət ‘small’ #1828 *pt1(əw)ʔ *ptoʔ ‘smoke’ #1425 *hom ‘to speak’ *ŋaːj ‘to speak’ #1457 *(s)ŋəːj ‘to speak’ #1964 *rʔah ‘to say, speak’

358

sidwell and rau

(cont.) pMunda

pKhasian pPalaungic pKhmuic

pVietic

pKatuic

pKhmeric

*ɟhɔːr

79.

to stand

**tənaŋ, **tƏŋgə

*jəŋ

*ɟəːŋ

*dək

*təŋˀ

*tʄɨŋ, *tjɨŋ, *juor

80.

star

–

stone sun to swim

*k-raːw, *k-ɓeː *l-taːˀ, *b-ruːˀ – –

*bntuor, *pkaːj *smaɲ *tməw *tmɔː

82. 83.

**bərƏl, **sərEŋ **siŋi(iˀ) –

*sʔməɲ, *krlaːŋ *cmoʔ, *rʔaːŋ *sŋiːʔ *l̥oːj *ŋɔj

*sməɲ

81.

*smaɲ, *kʰloːr *smaːw

84.

tail

**pata

*tdɔŋ, *-tʰaː

*-taʔ

*hntaʔ, *dɔːŋ

*d/tɔːj

*sɔːj

–

85.

that (dist.)

**han

*taj

*ʔeʔ, *tVj

*niʔ

–

*kiː, *ʔntih

*nɔh

86.

this (prox.) **En

*niː

*gəːʔ, *niʔ, *ʔǝ(h)

*giː, *ʔeːʔ

*-niː

*neʔ, *neː

*nih

87.

thou/you

**(n)Am

*meː

*miːʔ

*ʔmɔː

*miː

*maj

–

88.

tongue

**laːˀŋ

*tnleːc

*-taːk

*hntaːk, *lat

*laːs

*ʔntaːk

*ʔntaːt

*sŋiː *ɟnpaː, *ɟŋiː

*glaː₁ŋ *cŋi/eʔ *luːj

*ʔmɓaːŋ *tŋaj *loːj, *hɛːl(?) *poh

austroasiatic comparative-historical reconstruction

pPearic

pBahnaric pMonic

*taːl

*-jɔːŋ, *-juːŋ, *taːw, *dɨk

*ksɨm

*s(n)lɔŋ, *smaɲ *tməw

359

pAslian

Nancowry #Shorto (2006) pMK

Sidwell (ms.) pAA

*taːw

–

siak

*dək ‘to stand up’ *ɟə(ː)ŋ, *ɟuːŋ ‘to stand’ *taːw ‘to stand’

*s( )maɲ

–

maleca

#332 *(ɗ)ək ‘to stand up, to go up’ #538(Iː) *ɟəŋ; *ɟiŋ; *ɟiːŋ; *ɟuːŋ ‘to stand’ #1824 *taːw ‘to stand’ #888 *smʔaɲ ‘star’

*tmɔːʔ

*təmɔʔ

maŋɛʔ

#146 *t2mɔʔ ‘stone’

*tmɔʔ

*(t/s)ŋiːˀ *mat-tŋaj *tŋaj *heːl *rɛː *ɓiːŋ

– –

hɛŋ kical

*tŋiːʔ ‘sun, day’ *loːj ‘to swim, wade’

*paːs

*tiəŋ

*prtaːʔ

*sə[n]taːʔ ret

*dan

*tɔʔ, *tɔː

*tɛːʔ

–

ʔane

*ʔan

*nɛʔ, *nɛː

*-nɔːʔ

–

ne, nine

#31 *t2ŋiːʔ ‘sun, day’ #1456 *(l)ŋuj ‘to swim’ #1516 *lujʔ; *luːj[ ]; *luəj[ ]; *ləːj[ ] ‘to wade, to swim’ #73 *bt1aʔ; *st1aʔ ‘tail’ #542 *t1iə(ŋ) ‘tail’ #585 *t1ɗɔːŋ ‘stalk, tail’ #65 *t1iʔ; *t1iːʔ; *t1iəʔ ‘that yonder’ #66a *tɔʔ ‘that, there’ #1466 *t1uːj; *t1uəj; *t1aːj ‘that (yonder)’ #26 *ge(ː)ʔ ‘deictic & 3 pronoun’ #91 *niʔ; *nih ‘this’ #92 *nɔʔ; *nɔh ‘this’

*boː

*maj, *ʔih, *ʔaj *lpiət

*beh

*heːʔ

me

#128 *mi(ː)ʔ; *miːh ‘you (sing.)’

*klntaːk

*ləntaːk

kalitak

#320 *nt1aːk, *l(n) t1aːk ‘tongue’

*tmoːˀ

*ktaːˀk

*s(ʔ)maɲ ‘star’

*sntaʔ ‘tail’ *-dɔːŋ ‘stalk, tail’

*tiːʔ ‘that (dist.)’ *tɔʔ ‘that (med.)’

*gi(ː)ʔ ‘3rd person’ *niʔ; * neʔ ‘this (prox.)’ *nɔ(ː)ʔ ‘this; here’ *miːʔ ‘you (sing.)’

*lntaːk ‘tongue’

360

sidwell and rau

(cont.) pMunda

pKhasian pPalaungic pKhmuic

pVietic

pKatuic

pKhmeric

89.

tooth

**gənE

*lmɛɲ

*r̥aːŋ, *kaɲ, *piəŋ

*sraː₁ŋ

*k-saŋ

*knɛːŋ

–

90.

tree

**ɟiːˀ

*dɛːŋ

*khoʔ, *kseʔ

*cʔɔːŋ

*gawˀ, *g/kəl, *k-caːŋˀ

*ʔalɔːŋ

*ɟhəː

91. 92.

two to walk, go

**baːˀr **sEn

*ʔaːr *diʔ

*lʔaːr *həl, *leh

*baːr –

*haːr *d/tiː

*ɓaːr *bier *snɗaːw, *təːr *cɗɨaw

93.

warm/hot

–

*sɁaːc, *cit

*cʔəm, *kVc

*waːr, *pɔːk

*p-ruŋˀ, *ʔatoʔ, *p-ɗ/ruŋˀ *s(a/ŋ) ʔam

*ktaw

94.

water

**daːˀk

*ʔum

*ʔoːm

*ʔom

*ɗaːk

*tɨk

95.

we (excl.)

–

*ʔiː

*ʔjeʔ

*jiʔ

–

*ɗaːk, *ɗəːk –

96.

what?

–

–

–

–

*ʔaməh

–

97.

white

–

*leːʔ

*məh, *mVʔ *blɔːk, *kraːʔ, *-koʔ

*klɔːk

*baːk, *t-lɔːk

*ɓɔːk

*skuː *sɔː

–

austroasiatic comparative-historical reconstruction

361

pPearic

pBahnaric pMonic

pAslian

Nancowry #Shorto (2006) pMK

Sidwell (ms.) pAA

*koːj

*p(ɨ/ə)ŋ

*ŋiak

*ləmoːɲ

kanap

*lmVɲ ‘tooth’ *sraŋ ‘tooth’ *p(i)əŋ ‘tooth’

*neːˀm

*ʔlɔːŋ

*chuːʔ

*ɟəhuːʔ

ʔuɲiha

*baːˀr *ceːw

*ɓaːr *ceː₃w

*ɓaːr *ʔa(ː)r

*ciːp

ʔã soŋ

*tuːˀ

*toʔ

*k-m-taw

*bɨt

taɲ haʔuaj

*daːk

*ɗaːk

*ɗaːk

*b[ə]teːw riak

*hɛːŋ

(*ʔjɨː ?)

*pəj/ *pw(ə)j

*ɟeːʔ

he

*cmpiːˀj

–

*(ɟ-)moːʔ

–

*broːŋ, *pruːs

*(ɓ/b)ɔːk

(OM bətaiŋ)

*bəj[oː]k

cin, cja, ka tĩah

#597 *gnaiŋ; *gniəŋ ‘tusk, eye-tooth’ #709 *sraŋ; *sra[a]ŋ ‘tooth, sharp projection’ #254 *ɟhoːʔ; *ɟhəːʔ; *ɟheːʔ ‘tree, wood’ #491 *t2ʔɔːŋ ‘tree, wood’ #1343 *t2əm; *t2əːm; *t2am ‘plant, to grow; to begin’ #1562 *ɓaːr ‘two’ #462 *swa(ː)k ‘to walk, travel’ #1202 *luən; *lən ‘to go, proceed’ #1821 *cəw ‘to come, go, to return’ #70 *ktuːʔ ‘hot. #897 *t1aɲ ‘hot’ #1307 *(s)ʔəm; *(s) ʔəːm; *(s)ʔuːm; *(s) ʔuəm; *[ ]həm ‘warm’ #274 *ɗaːk ‘water’ #1298 *ʔ(o)m ‘water’ #1 *ʔiʔ; *hiʔ ‘we’ #150 *j(eː)ʔ ‘we’

*ɟhuːʔ, *ɟhiːʔ ‘tree, wood’ *tnəːm ‘trunk, stem, tree’

*ɓaːr ‘two’

*toʔ ‘hot’ *taɲ ‘hot’ *(s)ʔ(a/ə)m ‘warm’ *ɗaːk ‘water’

*ʔiːʔ ‘we (incl.) *ʔjeːʔ ‘we (excl.)’ (*ʄeːʔ ?) #136 *m(o)ʔ; *m(o)h *məh ‘what’ ‘what’ *ɓɔːk ‘white’ 116 *kboːʔ ‘white’ 369a *kɓɔːk ‘white, *-lɔːk ‘white’ grey’ 436 *kl(uə)k ‘white’

362

sidwell and rau

(cont.) pMunda

98.

who?

99.

woman/ wife

100. yellow

pKhasian pPalaungic pKhmuic

pVietic

pKatuic

pKhmeric

–

*m̥ ɛh

*-moh

*ʔeː

*ŋaːj

–

**selA, **kəni

*kntʰaj

*-pən, *knjaʔ

*ka₁n

*muːˀ, *p-keːˀ, *-bəːˀ

*knsaj, *kan, *kntaj *ʔnɗil, *kɗial, *k(m)paj

**saŋsaŋ

*lmit, *stɛm

*rmiːt

*ɟŋaːr

*-ʔɛːlˀ

*raːk

–

austroasiatic comparative-historical reconstruction

363

pPearic

pBahnaric pMonic

pAslian

Nancowry #Shorto (2006) pMK

Sidwell (ms.) pAA

*ʔmih

*buː

(OM mə)

–

ci

*mVh ‘who’

*kɨn

*ʔəkan

*braw (OM *kndɔr ‘wife’)

*kəndah (Semelai kdor ‘female’)

kan

*joːˀs

*dreː₂ŋ

-

-

ŋə

110 *bu(ː)ʔ ‘individual’ 1126 *kan; *kaːn ‘woman, female’ 1163 *pən ‘female’ 1628 *kdɔːr, *kdar ‘woman, wife’ 1052 *rmit; *rmiːt; *rmiət ‘Curcuma species; yellow’

*kan ‘woman, female’ *k(-)duər ‘wife’

*-ŋaːr ‘yellow’

chapter 5

Overview of the Munda Languages Gregory D.S. Anderson 1 Introduction The Munda languages are a group of Austroasiatic languages spoken across portions of central and eastern India by perhaps as many as ten million people total. The Munda peoples are generally believed to represent autochthonous populations over much of their current areas of inhabitation. This is codified in one of the common terms used locally to describe them, adivasi or ‘aboriginal’.

Approximate Distribution of Munda languages

N E P A L

UTTAR PRADESH

N

AR SG TTI

SHTRA

CHHA

ARA MAH

Remo

0 Miles

Santali

BIHAR

D Koda HAN J H A R K Santali Koraku Asuri5 Turi KorwaAsuri WEST 4 2 2 3 1 BENGAL 3 Santali1 4 5 Bhumij 2 3 5 Kharia 1 3 1 Korwa

H

Korku

Agariya

A

ANDHRA PRADESH

150

Gta

Juang ORISSA Gorum

Mahali

So ra

1. Birhor 2. Ho 3. Kharia 4. Mundari 5. Turi

Gutob

INDIAN OCEAN

0 Km 150

Map 5.1

ESH

MADHYA PRADESH

I

GLAD

Agariya

D

BA N

I

Mundari

Location of the Munda languages in India.

© koninklijke brill nv, leiden, ��5 | doi ��.��63/9789004283572_006

overview of the munda languages

365

Originally, Munda-speaking peoples probably extended over a somewhat larger area before being marginalized into the relatively remote hill country and (formerly) forested areas primarily in the states of Odisha and Jharkhand; significant Munda-speaking groups are also to be found in Madhya Pradesh, and throughout remote areas of Chhattisgarh, West Bengal, Uttar Pradesh, Andhra Pradesh, and Maharashtra, and through migration to virtually all areas of India, especially in tea-producing regions like Assam. Of course much of the original Munda-speaking territory was subsequently settled or colonized by Indo-Aryan-speakers and Dravidian-speakers. 1.1 On the Prehistory of the Munda-Speaking Peoples The pre-history of the Munda languages remains obscure. Munda languages constitute the westernmost representatives of the far-flung Austroasiatic linguistic phylum. Two other Austroasiatic groups are found in the presentday territory of India, the Khasian-speaking groups of Meghalaya and the Nicobarese-speaking groups of the Nicobar Islands. The other subgroups of Austroasiatic are all found outside of India, and it is generally believed that the Austroasiatic ancestral language was not to be found in India but rather further to the east. Thus, at some point the ancestors of the Munda-speaking peoples must have migrated westward into the Subcontinent. When, how, and by what path they entered India remains a subject of considerable debate. Indeed, it is not even clear that there was a single migration of pre-Munda speakers, but there may even have been two or more such movements. 1.2 Internal Ρelationships among Languages of the Munda Family Concensus has not yet been reached on the internal relationships of the Munda languages, but several subgroups have been proposed and some of these appear to be sound. It is hoped that further work in comparative Munda grammar and lexicon may shed light on this issue. The northern-, eastern- and westernmost groups of Munda languages are clearly related and appear to fall into two broad groupings. The first of these is the westernmost Munda language Korku which appears to be a sister to the remainder of this subgroup, the large and complex Kherwarian dialect/language chain, the better known varieties of which are Santali, Mundari and Ho. Kherwarian also includes a number of minor varieties as well, e.g. Turi, Asuri, Birhor, Bhumij, Korwa, etc. Korku and Kherwarian together are conventionally known as North Munda. The remaining Munda languages are almost only found in the state of Odisha, which appears to be the epi-center of diversity of the family, although some Kharia speakers are found in Jharkhand, West Bengal and Chhattisgarh as well and some of the southernmost groups like the Gtaʔ or the Gorum

366

anderson

are found in pockets across the border in northern Andhra Pradesh. How each of these non-North Munda languages or subgroups (logically known as South Munda in contrast to North Munda) are related to each other remains a topic of considerable debate. Some languages clearly form subgroups, such as Sora with Gorum/Parenga, or Gutob with Remo/Bonda. The classification of the remaining three languages (Kharia, Juang and Gtaʔ/Didayi) remains an open question (Anderson 2001). I synopsize some of the approaches that have appeared in print below. Although a small number of characteristic features of various higher order taxa within Munda have been offered, how exactly to subgroup the languages in the family and which higher order taxa can be justified remain open questions, and await adequate public demonstration. It is thus best to remain cautious at present. Just to give a sample of the different proposals that have been seriously offered by scholars in the last half century, we can compare those of Pinnow, Zide, Bhattacharya and Anderson.

Approximate Distribution of Munda languages of Orissa 100

G

A

R

Kharia Mundari Kharia Santali Mundari Bhumij Turi Ho Santali Birhor M u n d a r i Ho Kharia Birhor Ho

H

Juang

A

T

T

IS

0 Miles 0 Km 100

WEST BENGAL

JHARKHAND

CH

H

ORISSA

Mahali

Juang

Bhubaneshwar

Birhor Sora

I N D I A Sora Gta? Gutob Remo H Gorum ES D Gta? A PR A R DH AN Map 5.2

Munda languages of Odisha.

S ora

INDIAN OCEAN

Map Area

367

overview of the munda languages

The approaches differ in many details. Both Bhattacharya and Zide unite the three southernmost languages into a single genetic group. Bhattacharya’s approach is based mainly on the typology of the verb, and Lower Munda represents the single most salient and important split in the family as a result. Zide on the other hand, based largely on lexical data, unites this group with the other southern group within the Munda family, viz. Sora-Gorum into an intermediate node (Koraput Munda), echoing the classification of Pinnow (1959). Pinnow (1959) was specifically designed to create comparative lexica and infer previous phonological states of Kharia. It convincingly established Munda as a coherent family and a constituent family of the Austroasiatic phylum as some well-known scholars had relatively recently publicly disputed (e.g., Sebeok 1942). It proposed four coordinate branches of Munda: Kherwarian, Korku, Koraput and Kharia-Juang, see Figure 5.1. Given the obvious areality of these groups and the reliance on purely phonological and lexical materials to model the internal subgrouping or classification on, it comes as no surprise that i) these groups would be inferred by this method and ii) that a careful comparison of the languages as systems might reveal other, equally valid patterning that have to be considered when hypothesizing about the prehistory of the Munda languages. A team of scholars led by Norman Zide in the 1960s collected many materials on the Munda languages, and released a slightly different classification of them, uniting Korku and Kherwarian in a North Munda node; all other languages were considered South Munda, with the same two subgroups. To the South Munda group was added the then newly identified Gtaʔ (Didey) which was placed in the western branch of Koraput Munda (now rechristened GutobRemo-Gtaʔ contrasting with the eastern Sora-Gorum subgroup), see Figure 5.2. Bhattacharya (1975) on the other hand, found that there was a meaningful distinction in the verbal structure of the Gutob-Remo-Gtaʔ group such that their lack of object marking in the verb was such a fundamentally important fact that it must represent the primary split within the family, and this group Kherwarian Munda

Korku Koraput Kharia-Juang

Figure 5.1 Classification of Munda by Pinnow (1959).

368

anderson North Munda

Korku Kherwarian

Munda South Munda

Kharia-Juang Koraput

Gutob-Remo-Gtaʔ Sora-Gorum

Figure 5.2 Classification of Munda by Zide (1969).

Upper Munda Munda

Kherwari-Korku Sora-Parengi Kharia-Juang

Lower Munda Gadaba, Bonda, Didey Figure 5.3 Classification of Munda by Bhattacharya (1975).

was thus postulated to constitute the Lower Munda branch coordinate with the Upper Munda branch consisting of Korku, Kherwarian, Sora-Gorum and Kharia-Juang (Figure 5.3). Apart from more or less ignoring lexical and phonological developments, it also does not correctly split apart the languages of the family according to this one typological parameter, as Kharia shares this feature with the Lower Munda languages but is called Upper Munda by Bhattacharya. Another problem with this is that Gtaʔ has certain archaic features not found in Gutob-Remo (or Kharia), but found in Juang, such as prefixal subject marking (see below). Anderson’s (2001) approach combines lexical data and comparative morphological data, and proposes that of the standard groups, only three appear to be valid both morphologically and lexically: North Munda, Gutob-Remo and Sora-Gorum, and that in the remaining languages one is most likely dealing with a linkage rather than a genetic taxon, since we get different histories of shared innovations when looking at lexical vs. morphological data. Specifically, in terms of verbal morphology, neither Kharia should be united with Juang, nor Didey/Gtaʔ with Bonda (Remo)-Gadaba (Gutob). Thus there may in fact be no support for Lower Munda nor Koraput Munda nor possibly even South Munda per se as usually conceived, or at least the lexical and morphological histories do not appear to be co-terminous.

369

overview of the munda languages Proto North Munda

Korku

Proto-Munda linkage

Later Proto-Munda linkage

Kherwarian

Proto-Sora-Gorum Proto-Juang

Proto-Kharia Proto-Gutob-Remo Later southern Munda linkage Proto-Gtaʔ Figure 5.4 Classification of the Munda languages by Anderson (2001).

1.3 Noteworthy Features of Munda Languages Among the most interesting of linguistic phenomena to be found in Munda languages may be included the highly elaborated systems of demonstratives found in many of them, for example Santali or Gorum. Munda vowel and consonant systems can be quite complex, with different register and secondary articulatory features, many of which are still now in need of description. Another topic of considerable interest are the elaborate and intersecting systems of voice/valence/transitivity, person-marking and tense/aspect that characterize Kherwarian verbal systems. Further, the highly elaborated system of noun incorporation found in Sora push the limits of our understanding of such constructions from a theoretical perspective. The highly developed system of reduplication and expressive formation that characterize most Munda languages also bear mention here. Finally, the interaction of tense/aspect marking and negative operators in negative formations in South Munda Gutob stand out among the most complex of such systems known. Almost all aspects of every Munda language require more analysis before we have an adequate consensus understanding of even their basic features, in particular, syntactic issues and phonetic analysis are in desperate need of further systematic investigation. Studies on topics in the semantics and discourse of Munda languages are practically non-existent. Comprehensive comparative study has not been really possible up to this point either on the lexicon or the grammar, so a more thorough and comprehensive investigation into most historical linguistic issues in Munda also remains a goal for the future. 1.4 Estimated Number of Speakers of Some Modern Munda Languages Far and away the largest Munda language is Santali, which now stands among the official languages of India. The next two largest Munda languages and the only other two with more than a million speakers are also Kherwarian

370

anderson

languages, viz., Mundari and Ho. Only three other Munda languages have more than 100,000 speakers and these are, in order of total number of speakers, Korku, Sora and Kharia. Gutob and Juang both have somewhere between 20,000 and 30,000 speakers, but are locally endangered and thus this number will dwindle in coming decades. Gorum, Remo and Gtaʔ are all endangered languages and have fewer than 10,000 speakers each, indeed Gorum and Gtaʔ have fewer than 5,000. Some of the minor Kherwarian languages are even smaller. (1) Santali 5–7,000,000 Mundari 1–2,000,000 Ho 1,000,000+ Korku 3–400,000 Sora ca. 300,000 Kharia 2–300,000 Gutob < 30,000 Juang < 30,000 Gorum < 5,000 Remo < 6,000 Gtaʔ < 4,000

1.5 Munda in the Austroasiatic Language Family Ever since Pater Schmidt established the relation of the Munda and so-called ‘Mon-Khmer’ languages at the beginning of the last century (1906), researchers of comparative Austroasiatic have been troubled by the apparent typological distance between the majority of the modern Austroasiatic languages on the one hand, and the Munda languages on the other. Some consider the Munda languages to have acquired their often quite complicated structure along with SOV constituent order as the result of diffusion from neighboring non-Munda languages of South Asia, i.e. local Dravidian and Indo-Aryan languages (cf. Donegan 1993; Donegan and Stampe 1983, 2004), in many cases, presumably, independently in the individual Munda languages or sub-groups.1 Others, e.g. Pinnow (1959, 1963), suggest that Proto-Austroasiatic was more like Munda, with rather extensive morphological complexity. There is little doubt that Munda belongs in Austroasiatic despite these caveats. Pinnow (1959) secured the lexical correspondences to anchor Munda languages within Austroasiatic, echoing Logan (1859), Cust (1878), Schmidt 1 Indeed, as Donegan and Stampe (1983, 2004) and Donegan (1993) put it, the morphemic complexity of Munda has resulted from a shift in the ‘rhythmic holism’ of the languages from the Austroasiatic/Southeast Asian type to the South Asian type. However, they admit (1983: 341) that “in fact, the Munda languages are far more agglutinative and polysynthetic than is typical of India”. This is not the space to critique the details of their presentation, which merits a full-length study in its own right, but the prosodic differences between Munda and the eastern Austroasiatic languages has been overplayed, and there is considerable evidence to show that despite obvious differences in the construction of morphosyntactic words in Munda than in most other Austroasiatic languages, there are phono-prosodic word templates or phonological words that are much more consistent with other Austroasiatic subgroups, so that Munda languages are not as aberrant as Donegan and Stampe would have us believe. For some data relating to this, see section 2.

371

overview of the munda languages

(1905, 1906) and Przyluski (1924). Sidwell (2009) is an excellent synopsis of the history of the Austroasiatic language family, and Sidwell (2010) and Sidwell and Blench (2011) offer some new hypotheses. To summarize the debate: does Munda constitute a separate branch at a taxonomic level coordinate with one, with more than one, or with all other recognizable Austroasiatic constituent families? One theory assumes a primacy to the Munda split from the rest of the family, conventionally called Mon-Khmer. This is the approach of Zide (1969) and Diffloth (1989). This theory of a primary bifurcation of Austroasiatic into Munda vs. non-Munda/Mon-Khmer was based mainly on rough global assessments of the overall typology of Munda vs. the other Austroasiatic constituent families (e.g. Diffloth & Zide 1992): Munda being considered synthetic-agglutinative, but the eastern Austroasiatic languages isolating. The coherence of the individual family-level taxa within Austroasiatic are almost never questioned.2 What is debated is how the recognizable taxa may or may not form higher intermediate taxa; this has been approached differently by virtually every researcher who has ever attempted a comprehensive classification of the Austroasiatic languages. A closer inspection of the data reveals the similarities and differences to be both overplayed and obscured by layers of erosion and other ‘Sinospheric’ influences on the eastern Austroasiatic languages and ‘Indospheric’ (and earlier ‘Himalayospheric’) influences on Munda.3 More recent classifications have questioned whether Munda really is that different lexically from other Austroasiatic branches, e.g. Peiros (1998) using a lexicostatistic method. Diffloth (2005) revised his classification to consider Munda to be as Austroasiatic as any group from a lexical perspective. The most recent assessment is that of Sidwell (2009/2010), who situates Munda roundly or at least as an equal constituent within the Austroasiatic complex.

Aslian

Nicobarese

Munda

Khmuic

Bahnaric

Palaung-Waic Khasian

Mangic

Khmeric

Monic

Pearic

Katuic Vietic

Figure 5.5 Sidwell (2009) model of Austroasiatic. 2 Except Nihali once thought to be Austroasiatic is now recognized as an isolate language. 3 Note in this regard that various Nicobarese and Aslian languages which are outside this area are significantly more morphologically-rich than is typical of ‘Mon-Khmer’ languages in Southeast Asia proper.

372

anderson

Nominal morphology in various vestigial forms may be found throughout the AA family. Various components of the Proto-AA nominal system have been inferred, e.g. an infixed nominalizer in *–n-, prefixed syllabic nasals, a prefixal formative in *k(V)-, a prefixed pronominal case formant/augment in *a-, etc. Characteristics of the Proto-Munda verb not of South Asian origin are of particular interest in the reconstruction of the heretofore elusive ProtoAustroasiatic verbal system due to the overall paucity of (especially inflectional) verbal morphology in many Austroasiatic languages, or even basic descriptions of many of the languages. To be sure, verbs in Munda languages can be quite complex morphologically (2): (2) i. Juang ii. Remo ɖi-mi-ɲ-ɛ-pɛ-kia o-sum-oʔ-ki-niŋ give-3.fut-give-fut.ii-2pl-dl caus-eat- past.ii- perf-1 ‘they-2 will give it to you all’ ‘I had caused (s.o.) to eat’ (Pinnow 1960-ms: 131) (Fernandez 1968: 57) iii. Mundari iv. Santali senoʔ-dʒa-n-a-ko dal-e-’t-me-tahɛ̃kan-a-e go.intr-asp-intr-fin-pl beat-asp-tr-2-ipfv-fin-3 ‘they just got down by themselves’ ‘he was beating you’

While the extreme morphemic complexity typical of modern Kherwarian might or might not be attributable to Proto-Munda, and it is generally not characteristic of the remainder of the languages of the Austroasiatic family, it is far from the case that morphemic complexity is entirely alien to the eastern Austroasiatic languages. In a variety of languages, there are forms which consist of stems in combination with two or even three additional morphemes. (3) i. Katu ii. Katu ta-pa-têêŋ ta-pa-gluh recp-caus-work recp-caus-go.outside ‘make each other work’ ‘make each other go outside’ (Costello 1965: 41) (Costello 1965: 41)

[Katuic]

iii. Bahnar iv. Bahnar dʒo-po-lôtʃ dʒo-to-jaih perf-caus-die perf-pass-untied ‘to have killed’ ‘to have been untied’ (Banker 1964: 113) (Banker 1964: 113)

[Bahnaric]

373

overview of the munda languages v. Khasi ja-pɨn-sam-thjaʔ recp-caus-inclin-feel.sleepy ‘together make (others) feel sleepy’ (Nagaraja 1985: 27)

[Khasian]

vi. Nancowry Nicobarese pa-m-um-huaʔ be.afraid-agt-caus-be.afraid ‘one who frightens’ (Radakrishnan 1981: 58)

[Nicobarese]

vii. Kentakbong maʔ-pi-jin-ʔam des-caus-neg-suckle ‘(she) wants to wean (the baby)’ (Haji Omar 1976: 955)

[Aslian]

1.6 Munda Languages in the South Asia Areal Context The Munda languages have without question been profoundly influenced by the other languages of India: for example, the developed system of complex predicates seen throughout the Munda family (Anderson 2007), as well as the shift to subject-object-verb word order both reflect influences of the South Asian Sprachbund, in particular influence from Dravidian (Anderson 2003) and at a later period, Indo-Aryan as well. Nevertheless, a careful comparison of the Munda languages, starting with recent studies by Zide and Anderson (1999, 2001), Anderson and Zide (2001) and Anderson (1999, 2001, 2004), has revealed a considerable amount of old, morphologically realized categories that must be reconstructed all the way back to the Proto-Munda ancestor language. These include, but are not limited to, such categories as causative, reciprocal, negative, a transitive (or ‘active’) and intransitive (or ‘middle’) past, a non-past, a reduplicated habitual, an intransitive imperative, at least one participial construction, as well as a set of subject proclitics and object suffixes. Among other features which are not soundly reconstructible, but nevertheless may still be old in the Munda family, is a limited system of verb-noun stem compounding, that is, a type of noun-incorporation—found in South Munda languages which is lacking in modern North Munda languages, but may be found in archaic ritual language in Ho (cf. Deeney 1979) and also appears to have parallels in other Austroasiatic languages (e.g. Nicobarese or Khasi), see 3 below.

374

anderson

A phonological feature of modern Munda languages that is worth noting here is the near complete lack of word-initial ŋ-, a feature common to many of its sister Austroasiatic languages, but one which is highly marked in the South Asian areal context. While Proto-Munda may well have possessed word-initial ŋ-, it is found only in a small number of words in certain South Munda languages. In Sora, there is at least one lexical item with word-initial ŋ-, while in Remo and Gtaʔ it occurs relatively more frequently, but here preceding a velar stop, and frequently functioning as a syllable nucleus, not an onset (i.e. it is a syllabic nasal). (4)

Sora ŋorəŋ ‘have hairs standing erect, be thrilled’ (Zide 1968: 76)

(5) i. Remo ii. Gtaʔ ŋkuj ŋkui ‘water pot’ ‘water pot’ (Bhattacharya 1975b: 47)

As this last point suggests, Munda languages also differ from many of the surrounding languages of South Asia in possessing initial syllabic nasals, and as described for Santali (Anderson 2000), pre-nasalized stops as well, see above. 2

Overview of the Phonology of Munda Languages

Phonologically speaking Munda languages reflect certain areal norms, but in other ways show features not overly characteristic of South Asian languages. In some ways indeed, Munda languages are reminiscent of their sister languages to the east, despite what has become to believed as a result of the claims of Donegan and Stampe mentioned above. 2.1 Phonemic Inventory of Santali Santali (Ghosh 2008) has a relatively small number of total phonemes, but nevertheless some interesting phonological (and phonetic) phenomena are encountered. The minimal vocalic inventory is as in (6). There are nasalized variants of all the vowels as well. In addition to the basic inventory, there is an allophone of /a/ or phoneme of limited distribution [ə] in both Northern and Southern Santali dialects.

375

overview of the munda languages (6) Santali Vowels i u e ə o ɛ ɔ a

According to Zide (2000: 9), there is a degree of laryngeal tension (phonation type) associated with certain Santali vowels. This gives Santali its characteristic sound. A detailed phonetic study of laryngeal tension in Santali has not yet been done. Santali also has a set of phomemic nasalized vowels. Thus one finds minimal pairs of the following type, contrasting õ and ɔ̃ : (7) Santali arõʈ arɔ̃ ʈ ‘dense’ ‘unsympathetic, filthy’ (Bhattacharya 1975b: 49)

A wide range of diphthongs and triphthongs can be found in Santali. These include ae, eo, ao, ɛo, ɛi, ɛɔ, əi, əu, iu, oe, ɔe ɔi, ui, ea, iə, oa, uə, iəu, eae, oao, etc. (8) Santali iu dʒiu ‘spirit’ (Ghosh 2008: iə niə ‘this very’ 24–25) ĩu nĩuri ‘forest tree’ uə ruə ‘fever’ ea barea ‘two’ oa noa ‘this’ əi ləi ‘to tell’ ɛo hɛo ‘to carry on heap’ əu hiləu ‘to shake’ ae beɽhae ‘round about’ oi moiɖoŋ ‘bald’ oe hoe ‘to be eo heoa ‘to accustom’ ɔe kɔe ‘to ask for’ ui turui ‘six’

tiək’ ‘lead’

iəte ‘because’

ruəɽ ‘to return’ reaɽ ‘cold’ toa ‘milk’ dəi ‘elder sister’ nɛota ‘to invite’ biɖəu ‘to test’ maedʒiu ‘woman’ ɖoi ‘wooden spoon’ hoeo ‘to shave’ eoɽa ‘to wind thread round the spindle’ hɔedak’ ‘thunder shower’ muiguc’ ‘dirty’

uihar ‘fond remembrance’

Large vowel sequences can also be encountered in Santali, e.g. kɔeaeae ‘he will ask for him’, with six consecutive vowels! Many Munda languages show similar systems of vocalism to that in Santali, albeit not permitting the variety of vowel sequences that Santali allows; see Table 5.1.

376

anderson

Table 5.1

Munda vocalism

Santali Mundari Keraʔ Ho Korku Juang Kharia Sora Gorum Remo Gutob Gtaʔ

a a a a a a a a a a a a

i i i i i i i i i i i i

u u u u u u u u u u u u

o o o o o o o o

o o

e e e e e e e e e e e e

ɔ

ɔ ɔ

ɔ ɔ ɔ

ɛ

ã ã ã ɛ ã ? ã ? ? ã ã ? æ/ɛ ã

ə

aː aː aː

aʔ

aː ? ?

aʔ ? ? aʔ aʔ

ə

ṅ

ɨ

ɨ

? a̰

?

nD

à ? ? ?

ʊ ṅ

nD

ṅ

nD

ɪ

Some varieties Contrastive in Expressives only Marginal in system Loanwords May not be phonemic ṅ = syllabic nasal

Vowel harmony in Santali is morphologically triggered. Some affixes show variants dependent on the nature of the vowel(s) in the stem, while others do not. Harmonic (‘weak’) affixes include the locative –rɛ (oɽak’-re vs. hɔr-rɛ) and allative case -tɛ (oɽak’-te vs. hɔr-tɛ) markers (Bodding 1923: 29), or the third singular agreement marker when found enclitic to the word preceding the verb (9). Non-harmonic (‘strong’) affixes include the transitive aorist marker –ke (10). (9) i. Santali ii. Santali iii. Santali ɛɽɛ=jɛ rɔɽ-a səri=je rɔɽ-a ba=e rɔɽa lie=3 speak-fin truth=3 speak-fin neg=3 speak-fin ‘he tells lies’ ‘he tells the truth’ ‘he won’t speak’ (10) Santali

hə̃ɽũ=i dəɽ-ke-t’-a hanuman=3 run.away-aor-tr-fin ‘the hanuman monkey ran away’ (Bodding 1923: 29)

The consonantal system of Santali recognizes five places of articulation, labial, dental, alveolar-retroflex, palatal, and velar (11). Stops are found at all places, both voiceless and voiced/checked series; phonemic nasals are found only at

377

overview of the munda languages

four (ɳ occurs only as an allophone of n before –ɖ). There is no word-initial **ŋ- or **ɽ- in Santali, though these may be initial in word-medial syllables (see below). (11) Santali Consonants p t ʈ tʃ k b d m n s r, l

ɖ dʒ g ɲ ŋ h ɽ j

Many Munda languages use similar consonantal inventories: Table 5.2 Munda consonantism

Santali Mundari Keraʔ Ho Korku Juang Kharia Sora Gorum Remo

Gutob Gtaʔ

p p p p p p p p p p

b b b b b b b b b b

t t t t t t t t t t

d d d d d d d d

tʃ tʃ tʃ tʃ tʃ tʃ tʃ tʃ s d tʃ, ts, s p b t tʃ, ts p b t ʔ tʃ

dʒ dʒ dʒ dʒ dʒ dʒ dʒ dʒ z z, dz

ʈ ʈ ʈ ʈ ʈ ʈ ʈ

ɖ ɖ ɖ ɖ ɖ ɖ ɖ

k k k k k k k k ʈ ɖ k ʈ ɖ k

g g g g g g g g g g

dz, dʒ dʒ

ʈ? ɖ k g m n ɲ

ŋ r ? l dʒ?

ʈ ɖ k g m n

ŋ r ɽ l j?

Red font = in loan words only (?) Green font = different than in other languages

m m m m m m m m m m

n n n n n n n n n n

ɲ ɲ ɲ [ɲ] ɲ ɲ ɲ ɲ ɲ

ɳ ŋ r ɽ ɳ ŋ r ɽ ? ŋ r ɽ ŋ r ɽ? ɳ ŋ r ɽ ɳ ŋ r ɭ ŋ r ɽ ŋ r ɽ ŋ r ɽ ɳ ŋ r ɽ?

l l l l l l l l l l

j j j? j j j j j j j

? w w? w v, w

h h h h h

s s s s s s w h ? s s w,v s

? w

ʔ √ ʔ ʔ ʔ ʔ √ ʔ ʔ √ ʔ ʔ ʔ

c ʔ

h s ʔ Ch

Blue font = being lost in some speakers Violet font = may be vowel/nucleus feature, not segmental one

The northernmost Munda languages have a characteristic preglottalization or ‘checking’ of some obstruents in word- or morpheme-final coda position. Thus Osada (2008) describes Mundari /b/ and /d/ as [ʔb̥ m], [ʔd̥ n]. Phonetically this entails the following articulatory features: the glottis is closed and the

378

anderson

tongue/the lips simultaneously form oral closure, and when the glottal closure is released, this is optionally followed by nasal release and voicing. Note that in Mundari nasal release occurs only in monosyllabic words: /ub/ ‘hair’ [uʔb̥ m] but /udub/ ‘to tell’ [uduʔb̥ ]; /rid/ ‘to grind’ [riʔd̥ n] but /birid/ ‘to stand up’ [biriʔd̥ ]. Kharia, although taxonomically not North Munda, has been influenced significantly by Kherwarian, in particular Mundari. Thus, Peterson (2008) described plosives in coda position as pre-glottalized. Further, the contrasts between [±voiced] and [±aspirated] are neutralized, likewise the oppositions between dental and retroflex plosives have been neutralized in coda position. Thus one finds /ʔb/ [ʔb˺m], /ʔɖ/ [ʔɖ˺ɳ] and /ʔdʒ/ [ʔɟ˺ɲ] in Kharia. Coda-position /g/ becomes [ʔ], except in recent loanwords. Phonetically Peterson describes these sounds in Kharia as follows: the flow of air is suppressed for production of the plosive, the glottis contracts, producing a glottal stop and before the plosive is released, the glottal stop is released, and air flows through the nasal cavity, producing a homorganic nasal. 2.2 Aspects of Munda Phonology in an Austroasiatic Perspective Among the salient characteristics of the Austroasiatic language phylum is what appears to be the recurrent use of prefixes in nouns, realized as fossilized or historical relics, and often within the curious and characteristic ‘minor syllables’ that typify the languages of this phylum. This sesquisyllabic or weakstrong prosodic word structure is a hallmark of eastern Austroasiatic languages but is usually said to be lacking in the Munda languages. A closer inspection of Munda phonology in particular reveals that there appear to be different types of phonological and morphological word patterns existing in parallel that are found with different attendant types of word and phrasal prosody. Indeed, one such pattern is in fact a pronounced weak-strong phonological word prosody found across Munda that is exactly parallel to the word prosody of many other Austroasiatic groups. This is so strong that one must speak of contrasting phonological and morpho-lexical templates or types of words in Munda languages. Note that this older Austroasiatic pattern carries over to lexically unanalyzable and unitary stems with four syllables in Mundari (Osada 2008: 104), which therefore constitute one morpho-lexical word but two phonological word(s) (or word units) simultaneously. Indeed most Munda languages reflect this distinction between phonological and morphological word prosodies.4 The 4 Note that this is not the same as the obvious mismatch between phonological and morphological words attested in Kherwarian languages with respect to the placement of the subject clitics. As exemplified in section 4 below, the subject clitics of Kherwarian either appear at the end of the verb or preferentially enclitic to the word immediately preceding

overview of the munda languages

379

sequence of weak+strong (or minor+major) syllables in many cases define the phonological word prosody of many Austroasiatic languages, e.g. Khasi (12), and is often referred to as a sesquisyllabic structure. (12) Khasi (minor + major) [Rabel-Heymann 1976: 971) W minor- major

LH

This has its analog in Munda languages as well, e.g. in Kharia (Peterson 2008), who describes a low-high prosodic pattern that defines the phonological word in Kharia, which begisns with a low-tone pitch that gradually rises throughout word. If the word is monosyllabic in Kharia, then we find a rising contour: (13) Kharia rotʃhoʔb ‘side’ [ɾɔ̀.tʃhɔ́ ʔb˺m] laŋ ‘tongue’ [lǎŋ]

Sometimes this contrast appears to have been rephonologized in terms of stress assignment. For example, in Santali (Ghosh 2008), stress is always on the second syllable of a word unless it is monosyllabic, irrespective of whether it is an open or a closed syllable. At almost the furthest point away from Santali in the family, we find Plains Remo (Anderson and Harrison 2008), where stress is always on the second syllable. That is, it is in final position in disyllable words, and otherwise second syllables get primary stress, and every subsequent even numbered syllable gets secondary stress. These tendencies reflect the integrity of this old phono-prosodic word template in the history of Munda languages Further, the endurance of the phono-prosodic word template appears to be have been sufficiently strong and pervasive to act to reconstitute a quasisesqui-syllabic word-structure in certain Gtaʔ (Didey) varieties. Thus, there is evidence in Munda that phonological word domains do not overlap or match up entirely with morphosyntactic words. This mismatch may well reflect a historic periodization or layering within Munda, such that phonological words the verb. This type of mismatch is transparent and restricted to languages of the Kherwarian subgroup, lacking for example in the only non-Kherwarian North Munda language Korku. The type of phonological vs. morphosyntactic mismatch under consideration in this section here is found across the languages of the family.

380

anderson

are more consistent with the Austroasiatic model, and reflect a word-level prosodic structure while morphosyntactic words reflect a phrase-level prosody that brings Munda words more in line with the more morphemically developed languages of South Asia.5 Certain other features of individual Munda languages are reminsicent but as of yet not directly tied diachronically to similar features found in other Austroasiatic languages. Such features would include creaky voice in Gorum (Anderson and Rau 2008), glottalized vowels in Sora, or the development of tone in Korku (Zide 2008). 2.3 Dravidian Influence on Munda Phonology As briefly mentioned above, Dravidian languages have had a profound influence on the Munda languages at multiple levels of analysis. This includes, also, in part, phonological influence. The synchronic system of Santali presents an opposition between voiced and voiceless pairs of retroflex stops (e.g. ʈ:t and ɖ:d). It may be that retroflex ʈ in Munda languages in fact reflects Dravidian (and/or later Indo-Aryan) contact influence or was re-inforced by this, even though there are a very small number of stems that are cognate between various individual Munda languages possessing correspondences of retroflex ʈ. However, these are extremely limited (14): (14) i. NM ‘neck’ *hoʈo’g ii. Gorum kaʈi ‘rat sp.’ Ho kaʈia ‘big house rat’ (Bhattacharya 1975a: 84)

Other retroflex sounds are also relatively infrequent in the Munda languages, and are not thought to be part of the phonemic system of the Proto-Munda language. Thus, ɭ is quite rare, ɽ often appears as an intervocalic variant of -ɖ-, and ɳ is also not particularly common. However, like ʈ, there are correspondences of ɳ in the various modern Munda languages in cognate vocabulary items in certain lexical elements that appear to have a Munda origin, and others that do not. Again, the presence of retroflex liquids and nasals may ultimately reflect Dravidian influence in the Munda languages.

5 Thus Donegan and Stampe seem to be focusing on the level of morphological or phrasal prosody and not word-internal or phonological prosody in their studies. One consequence of this is that there is no rhythmic holism that affects all levels of the language as they assume, but rather there are competing levels of rhythm or prosody and different sub-systems related to these.

overview of the munda languages

3

381

Munda Nominal Morphology

The highly important discussion on whether Munda languages recognize distinct parts of speech notwithstanding (see Bhat 1997, Evans and Osada 2005 for an overview of the issues), it is descriptionally convenient to recognize nominal and verbal subsystems morphosyntactically in Munda, and this presentation operates under this assumption. The nominal systems of the various Munda languages present a historically complicated picture. The inflectional and derivational categories realized in the nominal system include animacy, number, case, and a range of denominal and deverbal formants. In addition, Munda languages reflect to varying degrees an archaic and complex system of deriving free-standing forms of nouns from corresponding roots. 3.1 Animate and Inanimate Noun Classes In many Munda languages there is at least a covert distinction between animate and inanimate nouns, primarily reflecting a natural gender classification, that is living entities are animate and almost everything else belongs to the inanimate group. However, as is typical of languages with an animate/ inanimate distinction, there are a small number of culturally important or significant inanimate entities (e.g. Santali ɲindatʃãdo ‘moon’, ipil ‘star’, dʒənum ‘thorn’, ɛ̃rgɔt’ ‘ear-wax’, putul ‘doll’) which belong to the animate class (Ghosh 1994: 40–41). The animate/inanimate distinction in Santali is not marked on non-derived (i.e. root) nouns at all, but rather this is marked morphosyntactically through such means as the differing genitive case forms for animate and inanimate nouns, the lack of locative and allative case forms for animate nouns, or the triggering (or not) of particular types of verb agreement. 3.2 Infixation Santali, like many Munda languages, makes use of a range of derivational infixes to form both deverbal and less commonly denominal nouns as well. Thus, in various stems one finds such derivational infixes as -tV-, -nV-, -mV-, -pV-, and -ɽV-. The infixed formant –tV- (with –V- representing a copy of the root vowel), is a relatively common means of marking abstract deverbal nouns. Note that with polysyllabic stems, the copied stem vowel can either be the vowel of the first vowel or the second vowel of the stem. Compare the forms meaning ‘beginning’ and ‘excess’ below, with the second and first vowel copied, respectively.

382 (15) Santali

anderson i. bɔr ‘fear (v)’ > bɔtɔr ‘fear (n)’ ii. rɔk’ ‘sew’ > rɔtɔk’ ‘seam’ iii. ɲum ‘name (v)’ > ɲutum ‘name (n)’ iv. ɛhɔp’ ‘begin’ > ɛtɔhɔp’ ‘beginning’ v. sarɛtʃ’ ‘exceed’ > satarɛtʃ’ ‘excess’

The pan-Munda derivational infix in –nV- is found in a number of forms, generally with an instrumental type meaning. (16) Santali

i. dʒɔk’ ‘sweep’ > dʒɔnɔk’ ‘broom’ ii. rakap’ ‘rise, ascend’ > ranakap’ ‘up, development’ iii. gotʃ˙’ ‘kill’ > gonotʃ˙’ ‘killing’ iv. dapal ‘cover’ > danapal ‘lid’

South Munda languages also frequently use the instrumental infix in –n-. (17) Sora (Ramamurti 1931)

dʒoʔ ‘sweep’ > dʒənoʔ ‘broom’

The –n- infix is an extremely old feature in the family, found, for example, in virtually every subgroup of Austroasiatic. (18) Khasi (Henderson 1976: 517–8) shong ‘sit, dwell’ > shnong ‘village’ her ‘fly’ > sner ‘feather, wing’ (19) Bahnar (Banker 1964)

pǎr ‘fly’

> pənǎr ‘wing’

(20) Khmer (Huffman 1970: 315)

cuəl ‘rent v.’ siət ‘insert’

> cnuəl ‘rent n.’ > sniət ‘wedge’

(21) Mon (Nai Pan Hla 1976: 906) cù ‘rest’ - hənù ‘resting place’

sāl ‘spread’ > snāl ‘mat’

3.3 Case Prefixes Most Munda languages also make use of nominal case to mark a range of categories. Grammatical object[ive] case was originally only with pronouns, and marked by a prefix in Munda. This state of affairs is reflected in such South Munda languages as Gorum, Gutob, Gtaʔ. In others, this has been extended to include all nouns, for example in Remo or Sora.

overview of the munda languages

383

(22) Gorum maŋ e-niŋ mo-dʒel-l-in you obj-I 2-tell-pst-1 ‘you told (to) me’ (Bhattacharya 1975a: 161) (23)

Gutob o-nom o-maj əbsois-o-niŋ obj-you obj-he show-tr.pst-1 ‘I showed him to you’ (Bhattacharya 1975a: 164)

(24) Gtaʔ me a-niŋ bug-ke he obj-1 beat-pst ‘he beat me’ (Bhattacharya 1975a: 166) (25) Sora kuni a-tarbaɲ-dʒi a-manɖra tija those obj-flower-pl obj-man give.imp ‘give those flowers to the man’ (Bhattacharya 1975a: 162) (26)

Remo a-joŋ-ɖeŋ sun-oʔ obj-mother-refl tell-tr.pst ‘he told his mother’ (Bhattacharya 1975a: 165)

A cognate case prefix (marking dative of pronouns) is found in various other Austroasiatic languages as well, e.g. Pacoh (Bahnaric). (27) Pacoh ʔa-maj ʔa-ɲaŋ dat-2sg dat-1du ‘to you’ ‘to us two’ (Alves 2006: 31)

Note that North Munda languages generally use suffixes in objective/grammatical (and local/directional/instrumental, see below) case functions, as is typical

384

anderson

of South Asian languages. Indeed, in Ho the optional object-marking element –ke appears to be an Indo-Aryan loan (Anderson et al. 2008; Pucilowski 2013). Similar borrowing of an objective case suffix in -ke is also found in certain Gtaʔ varieties (Anderson 2008a). 3.4 Case Suffixes/Enclitics A suffixal or enclitic case element commonly occurring in various Munda languages is a locative or directional element in –bo(ʔ). This is found primarily in South Munda languages, but in various disparate branches. (28) Remo gari-bo road-loc ‘on the road’ (29) i. Gutob aspatal-boʔ ui-gi-nin ɖu-gu hospital-loc go-intr.pst-1 aux-intr.pst ‘I had gone to the hospital’ (Hook 1991: 186) ii. Gutob biroŋ-bo(ʔ) ui-na jungle-loc go-cond ‘if you go to the jungle’ (Bhattacharya 1975a)

Note that in Juang, in addition to directional and locational, semantics, with certain nouns, -bo has acquired instrumental semantics as well. (30) i. Juang ii. Juang bilɔ-bo nui-bo paddy.field-loc river-loc in the paddy field’ ‘to the river’ (Bhattacharya 1975a)

iii. Juang semɔr-bo knife-loc ‘with a knife’

3.5 Santali Demonstratives Santali has a complicated demonstrative system (Zide 1972). Its basic three-way system is a straightforward proximal, distal, remote system coming in animate (-i/kin/ko) and inanimate forms (-a/-akin/-ako) [singular, dual, and plural].

overview of the munda languages

385

(31) Proximal Distal Remote sg dl pl sg dl pl sg dl pl anim nui nukin noko/nuku uni unkin onko/unku hani hankin hanko inan noa noakin noako ona onakin onako hana hanakin hanako ‘this’ ‘these 2’ ‘these’ ‘that’ ‘those 2’ ‘those’ ‘that/those (2) yonder’ ‘these’

Alongside of these are intensive forms (32) (marked by infixation of –k’-), exclusive or identificational focus or ‘just’ forms (33) (marked by a shift of (o/ u>)-i-), as well as perceptual forms which add connotations of ‘things seen’ and ‘things heard’ (34). (32) Intensives nuk’ui ‘this very one’ nik’i ‘just this very one’ nɔk’ɔdʒ ‘this very thing’ (33) anim inan

‘Just’ ‘Just’ Proximal Distal sg dl pl sg dl pl nii nikin neko/niku ini inkin enko/inku nia niakin niako ina inakin inako ‘just this’ ‘just these 2’ ‘just these’ ‘just that’ ‘just those (2)’

(34) ‘Seen’ ‘Seen’ ‘Heard’ Distal Remote Distal sg dl pl sg dl pl sg dl pl ɔnɛ ɔnɛkin ɔnɛko hanɛ hanɛkin hanɛko ɔtɛ ɔtɛkin ɔtɛko ‘that seen’ ‘those (2) ‘that yonder ‘those (2) yonder ‘that heard’ ‘those (2) heard’ seen’

3.6 Munda Numerals Zide (1978) has written about the various sub-systems of numerals in the Munda languages, so little will be offered here. Higher numbers are generally loans, predominantly from local Indo-Aryan varieties, and lower numbers may be loans as well in individual languages. A representative set of numerals from various Santali and Gtaʔ (both archaic in this regard) is offered here.

386

anderson

(35) Santali Gtaʔ ‘1’ mit’ m-mwiŋ ‘2’ bar(ea) mbar ‘3’ pɛ(a) ndʒi ‘4’ pon(ea) ɔ̃ ‘5’ mɔ̃ ɽɛ̃ malwe ‘6’ turui tur ‘7’ eae gu ‘8’ iral tma ‘9’ arɛ/e sɔ̃ tiŋ ‘10’ gɛl gwa

Most Munda languages have a base-10 or combined base-10/base-20 numeral system. Sora has a curious base-12/base-20 system. Thus, in Sora ‘twelve’ is migel (which includes the root for ‘ten’ within it), while ‘thirteen’ is migelboj literally [12-1], ‘fifteen’ is migeljagi [12-3], etc. ‘Twenty’ is bokuri literally [1-20] and 32 is literally [(1-20)-12] or bokuri migel and ‘thirty-three’ is bokuri migelboj [(1-20)-12-1], ‘sixty-five’ is jakuri migeljagi [(3-20)-12-3] and so on. 4

Overview of Munda Verbal Systems

The verbal systems of the Munda languages represent the most complicated and diverse grammatical sub-system. A wide range of categories are realized in the verbal form of the attested Munda languages and the various reconstructed proto-languages. These include such categories as voice, valence, tense, aspect, and mood, subject and object, negation, and affectedness. In addition, a wide range of complex predicate structures are found in the various modern Munda languages. 4.1 TAM, Voice and Valence Categories in Munda The tense-aspect systems of the attested Munda languages present a historically complicated picture. As is the case with many languages from across the globe, the categories of tense and aspect are often intimately connected in the Munda languages; frequently elements are grammaticalized first in a particular aspectual meaning and then shift to more generalized tense functions. In the vast majority of the Munda languages, there is some formal opposition or contrast between two inflectional series, conventionally referred to as transitive/active and intransitive/middle markers. This contrast may be

overview of the munda languages

387

achieved through either separate transitive and intransitive series of tense markers as in the majority of South Munda languages or through a single tense/aspect marker augmented by a consistent marker of (in)transitivity in the North Munda languages. To be sure, the history of tense/aspect markers is one of the most vexing, complex, and outstanding problems in the diachrony of the verbal systems of the Munda language family. See Anderson (2007, Chapter 4) for more details. In North Munda languages the intransitive marker is *-n-. It occupies the same position in the verb template as does the transitivity marker in *-d-, that is, it immediately follows the (tense/)aspect markers. (36) i. Bhumij dʒom-ke-n-a-iŋ eat-asp-intr-fin-1 ‘I was eating’, ‘I ate’ ii. Mundari suku-le-n-a-ko happy-asp-intr-fin-pl ‘they had been happy’

iii. Mundari diku=ɲ itu-a-n-a Hindi=1 teach-ben-intr-fin ‘I have been taught Hindi’

The North Munda intransitive marker in *-n- similarly has reflexes in southern Munda languages, in particular in Sora and Gorum (i.e. in Proto-Sora-Gorum), and differently in Juang. Here, as in the North Munda languages, it is used in a wide range of constructions marking not only intransitive, as well as various types of detransitive categories (passive, middle, reflexive). Note, for example, the varied uses of -n- in the following Sora forms: (37) i. Sora ii. Sora ab-uma-dəm-te-n-a:i tid-dəm-te-n caus-wash-refl-npst-intr-1 beat-refl-npst-intr ‘I’ll get myself washed’ ‘he beats himself’ (Ramamurti 1931: 29) (Ramamurti 1931: 26) iii. Sora so:-te-n-a:i ‘hide-npst-intr-1 ‘I’ll hide myself’ (Ramamurti 1931: 26)

388

anderson

(38) i. Sora ii. Sora bɔndi-lɛ-n-dʒi-a-tiki ran-n-e:ten lock.up-pst-intr-pl-attr-after’ crush-intr-3.pst ‘after he was locked up’ ‘it was crushed’ (Starosta 1967: 228) (Ramamurti 1931: 39) (39) i. Sora ii. Sora al-kaɲ-te-n-dʒi al-ber-te-n-dʒi recp-abuse-npst-intr-pl recp-talk-npst-intr-pl ‘they are abusing each other’ ‘they are conversing with each other’ (Pinnow 1966: 114) (Ramamurti 1931: 61)

Note that as in North Munda languages, the passive often carries a potential meaning as well in Sora, that is, ‘can be Xed’. (40) Sora mo:-te-n swallow-npst-intr ‘it can be swallowed’ (Ramamurti 1931: 26)

The Sora intransitive/detransitive marker (-n-) may even mark actions that are slow, difficult to accomplish, or even progressive or on-going; in other words, categories which cluster around a notion of ‘low transitivity’ (Hopper and Thompson 1980). (41) i. Sora ii. Sora bəru-n ber-te-n ‘hill.field-intr talk-npst-intr ‘cultivate/make a hill-field’ ‘he talks’ (Starosta 1967: 152) (Ramamurti 1931: 26) iii. Sora anin dolba-tɛ vs. he clear.field-npst ‘he will clear the field’ (Starosta 1967: 152)

iv. Sora anin dolba-te-n he clear.field-npst-intr ‘he is clearing the field’ (Starosta 1967: 152)

overview of the munda languages

389

In Juang, the intransitive or class-II past tense/aspect marker is –an, which historically includes the *-n marker: (42) Juang arɔki uru-nɔm-an-ki they red:drink-prog-pst.ii-pl ‘they were drinking’ (Matson 1964)

The transitive/past element is also found in North Munda and in Juang. In Kherwarian languages, the etymological transitive past form has mainly lost its past tense meaning, and now functions primarily as a marker of transitivity, except in Jhanrkhandi Ho as described by Pucilowski (2013), where the original past tense meaning is maintained in certain forms (e.g., those marked by the applicative). (43)

Ho ĩjel aparob-ko=ɲ em-a-ɖ-me-ja feather wing-pl=1 give-appl-tr-2-fin ‘I gave you feathers and wings’ (Pucilowski 2013: 160) {NB: orthography slightly altered from original where j = y}

(44) Santali əgu hɔ’t-ke-’t-ko-a=e bring aux-aor-tr-pl-fin=3 ‘he brought them quickly’ (Bodding 1929: 277)

Several southern Munda languages (Juang, Remo, Kharia, Gutob) have a cognate element functioning as the transitive or class-I/active past form, as in the following Gutob form: (45) Gutob niŋ biɽ-oʔ-niŋ I sow-pst.ii-1 ‘I sowed (it, i.e., the field)’ (Field Notes)

390

anderson

That these two elements are most likely etymologically connected can be seen in such correspondences as the word meaning ‘eye’ in Gutob and Santali. (46) Gutob mɔʔ

Santali mɛ’t

‘eye’

4.2 Subject and Object Encoding Both a set of subject proclitics and object suffixes need to be reconstructed for Proto-Munda, for first and second person. Third person dual/plural forms were invariably marked by suffixes, if at all;6 third singular is always unmarked in southern Munda but marked in Kherwarian. Korku lacks subject marking on verbs, being found only in the locative copula formation with third person non-singular subjects (Zide 2008). For Proto-South-Munda, both subject prefixes and object suffixes may be reconstructed relatively straightforwardly, based on correspondences between Juang and Gorum. (47) i. Juang ii. Juang iii. Juang mɛ-dʒɔ-ki-ɲ ni-kib-tan ba-ama-gito-ke 2-see-pres.tr-1 1pl-do-cond 1dl-neg-sing-pres.tr ‘you see me’ ‘if we do (it)’ ‘we two don’t sing’ (Matson 1964: 35) (Matson 1964: 35) (Matson 1964: 53) (48) i. Gorum ii. Gorum mo-taʔj-iŋ ne-aʔj-t-om 2-give-1 1-splash-npst-2 ‘you gave me’ ‘I will splash you’ (Aze 1973: 249) (Aze 1973: 250)

The North Munda languages present a slightly different and more complicated picture. Object marking is found in all the attested North Munda languages, and presumably was characteristic of Proto-North Munda as well. (49)

Korku kul-kiɲ-bà send-3dl- fin ‘sends them 2’ (N. Zide, field notes)

6 Note that the object suffixes themselves in pre-Proto-Munda may have originally been possessive markers.

391

overview of the munda languages (50) i. Santali Kumbɽəbad-te=ko əgu-ke-‘t-le-a pn-loc=3pl bring-asp-tr-1pl-fin ‘they brought us to Kumbraba’ (Bodding 1929: 208)

ii. Santali iii. Santali laga-ɲ-kan-a=e kɔe-e-a=e drive.away-1-prog-fin=3 ask-3-fin=3 ‘he is driving me away’ ‘he will ask him’ (Bodding 1923: 25) (Bodding 1923: 18)

The correspondences between Proto-South Munda and Proto-North-Munda object suffixes suggest that these should be reconstructed all the way back to the Proto-Munda ancestor language (Anderson 2007). The history of subject marking in the Munda languages is slightly trickier to explicate. One of the noteworthy aspects of subject marking in the Kherwarian languages (and Proto-Kherwarian and perhaps Proto-North-Munda as well) is that the subject markers are preferentially attached to the word immediately preceding the verb; see examples in (51–3). (51) i. Mundari (52) i. Santali hola Ranchi-te=ŋ sen-ke-n-a hɛ̃ iɲ=iɲ tʃala’k-a yesterday pn-all=1 go-asp-intr-fin yes I=1 go-fin ‘yesterday I went to Ranchi’ ‘yes I will go’ (Cook 1965: 228) (Bodding 1929: 58) ii. Mundari ii. Santali ka=ko dʒom-ke-d-a alo=m ləi-a-e-a neg=3pl eat-asp-tr-fin prohb-2 tell-ben-3-fin ‘they didn’t eat (it)’ ‘don’t tell him’ (Osada 1992: 39) (Bodding 1929: 81) (53) Mayurbhanj Ho okonde=m dʒonom-le-n-a where=2 born-tam-intr-fin ‘where were you born?’ (Field Notes [KCN])

Note that this is not a mere Wackernagel second position clitic, as it is always on the word immediately preceding the verb that the subject is indexed on, regardless of that word’s position in the clause. Most striking in this regard is the fact that the immediately pre-verbal subject agreement markers will even appear on an overt subject pronoun, if that happens to be the word in the appropriate position (52i). If there is no preceding word, and optionally in as yet to be determined conditions when there are preceding words, the subject marker may appear enclitic to the verb itself

392

anderson

(54) Mayurbhanj Ho mandi dʒom-ke-ɖ-a=m rice eat-tam-tr-fin-2 ‘you ate rice’ (Field Notes [CMH]) (55) i. Santali ii. Santali ir-ke-t’-ta-e-a=ko sɛn-otʃo-daɽe-a-e-a=ɲ cut-asp-tr-poss-3-fin=3pl go-caus-abil-ben-3-fin=1 ‘they have cut his (paddy)’ ‘I let/made him come’ (Bodding 1929: 182–183)

In emphatic discourse, both strategies may be employed simultaneously, as in the following Ho example: (56) Mayurbhanj Ho abu hotel-te=bu senoʔ-tan-a=bu we hotel-abl=1pl go-prog-fin=1pl ‘we are going to/from the hotel’ [CMH]

However, unlike the indicative where subject was marked prefixally and object suffixally, the order was probably Verb-obj-subj in the Proto-Munda transitive imperative, as this pattern is found in both Sora and Ho for example. (57) Sora (58) i. Ho ii. Ho ti’-iɲ-ba eto-ɲ-me dʒom-e-ben give-1–2pl teach-1–2 eat-inan-2dl ‘give (it) to me (you-pl)’ ‘teach me!’ ‘eat it you-2!’ (Ramamurti 1931: 141) (Deeney 1979: 18) (Deeney 1979: 14)

4.3 Negation in Munda Verbal Morphology Negative in Kherwarian is generally expressed by a preverbal negative particle which serves as host for the subject clitics. (59) i. Santali ii. Santali ba=ko sap’-le-d-e-a alo=m ləi-a-e-a neg=3pl catch-asp-tr-3-fin proh=2 tell- appl-3-fin ‘they did not catch him’ ‘don’t tell him’ (Bodding 1929: 212) (Bodding 1929: 81)

There are two negative prefixes or particles that are commonly found in the South Munda verb systems, namely *a(r)- and *əm-. Both negative markers are

393

overview of the munda languages

used in Gorum and Juang, while phonological changes make it impossible to determine which (i.e., either or both) or whether (i.e., neither) of these are found in Gtaʔ. Gutob and Juang also both show stem reduplication plus a negative copula in the negative present (habitual, durative) or at least in some forms anyway of this. (60) i. Gorum ii. Gorum buboŋ-di or-orʔiŋ-tuʔ or-alam baby-foc neg-walk-npst neg-touch ‘the baby can’t walk’ ‘don’t touch’ (Aze 1973) (Aze 1973) (61) i. Juang aɲ koŋ~koŋ I red~know ‘I don’t know’ (Mahapatra & Matson 1962-ms)

iii. Gorum ambu alam proh touch ‘don’t touch’ (Aze 1973)

ii. Juang dʒena ba-ama-gito-ke dʒena neg.cop 1dl-neg-sing-pres.tr neg.cop ‘we 2 don’t know’ (Matson 1964)

iii. Juang ar-aitog-ki-ki dʒena neg-scratch-pres.tr-pl neg.cop ‘they aren’t/won’t be scratching’ (Pinnow 1960-ms.) (62) i. Gtaʔ ii. Gtaʔ n-ár-aʔtʃoŋ-ke n-á-tʃoŋ 1-neg-feed-pst 1-neg-eat ‘I didn’t feed (s.o.)’ ‘I won’t/wouldn’t eat’ (Mahapatra et al. 1989) (Mahapatra et al. 1989) (63)

iii. Gtaʔ tʃ~tʃoŋ n-á-ɖiŋ red~eat 1-neg-aux ‘I won’t be eating’ (Mahapatra et al. 1989)

Gutob se~ser uraʔ/orodʒ red~sing neg.cop ‘doesn’t sing’ (N. Zide field notes; Griffiths 2008)

Some unexpected quirks in the interaction of negative encoding in various TAM forms are encountered in the southernmost Munda languages. Thus, the prohibitive in Gutob and Remo–and, marked in a formally different manner, in Gtaʔ–consists of the negative prefix combined with the use of a past tense morpheme, e.g. *ar-Verb-gɨ or *ar-Verb-ɔʔ. In addition, the negative forms of

394

anderson

past tenses in Gutob-Remo seems to have used the negative prefix in combination with non-past suffixes, e.g. *ar-Verb-tV. This *ar-Verb-tV is realized as Gutob ar-Verb-to. (64) i. Gutob ii. Gutob iii. Gutob iv. Gutob ar-ser-gu se~ser ar-ɖu-gu ar-ser-a ar-ser-to neg-sing-past.intr red-sing neg-aux-pst.intr neg-sing-neg.fut neg-sing-neg.pst ‘don’t sing’ ‘don’t be singing’ ‘won’t sing’ ‘didn’t sing’ (N. Zide field notes) (N. Zide field notes) (N. Zide field notes) (N. Zide field notes)

4.4 Munda Complex Predicate Structures Complex predicates are pervasive in Munda clause structure, as they are throughout the languages of South Asia. The inflectional and syntactic structure of complex predicates in Munda languages generally show the typical SOV clausal constituent order found in almost all language groups of the region. As such, it is not surprising that most complex predicates in Munda consist of two verbal elements, verb one of which generally has a lexical function and verb two of which usually contributes functional or grammatical semantics to the formation. Such a Verb1+Verb2 formation where one element is lexical and the other functional has been called an auxiliary verb construction [AVC] (Anderson 2006, 2011). The standard pattern found in many Munda languages consists of a construction-specific non-finite dependent form of the verb followed by a finite fully inflected form of the auxiliary verb, and this pattern is found across all of the major language families of the region. The specific form that the lexical verb appears in includes a range of nominalized or adverbial dependent forms called, depending on the language, tradition and construction, infinitive, participle, converb, nominal, supine, etc. (65) Lexical Verb Auxiliary Verb

Munda languages show several characteristic patterns within the aux-headed sub-type of AVC marked by a dependent or non-finite form of the lexical verb. In the Gutob-Remo subgroup (Anderson 2001), some AVCs require the lexical verb to be in a past participle form (that looks like a partially inflected past tense form). (66) Lexical verb Auxiliary Verb

overview of the munda languages

395

This comes in two morphs, differentiated for the two verbal classes found in these languages which are variously described as encoding a middle vs. active or intransitive vs. transitive distinction or more neutrally as class-I vs. class-II, as seen in the following forms from Remo and Gutob. (67) Remo Gutob baɖ-oʔ suŋ-oʔ -niŋ golgolte gai-gi ui-to slap-pst.ii.prtcpl aux-pst.ii-1 smoothly enter-pst.ptcp aux-hab ‘I finished slapping’ ‘smoothly it goes in.’ (Hook 1991: 185) (Fernandez 1983) (69)

Kharia kol-ob-ɲoʔ-ɖom-dhab-na laʔ-ki-kijar recp-caus-eat-pass-quickly-inf ipfv-pst:intr-3dl ‘they two were being fed by each other quickly’ (Malhotra 1982: 357)

In such Munda languages as Gtaʔ and Remo, there is another participle or converb form that serves as a dependent or non-finite form of the lexical verb in other formally distinct aux-headed AVCs in these languages. This non-finite form is marked by C1- reduplication (copy of only the initial consonant) in Gtaʔ (71) and in C1ə-/C1V1- reduplication in Remo (72). (70) Lexical verb AuxiliaryVerb (71)

Gtaʔ tʃ~tʃoŋ n-ɖiŋ-e red~eat 1-perf-fut ‘I will eat’ (Mahapatra et al. 1989)

(72) Remo niŋ nsuɽaʔ su~sum ɖen-t-iŋ I banana red~eat aux-npst-1 ‘I am eating a banana’ [Field Notes; SDM]

Note that there is a cognate auxiliary verb involved in these formations that require the lexical verb to appear in this reduplicated non-finite form in these two Munda languages, despite the fact that the meaning of the AVCs that they are grammaticalized within differs across the two languages, encoding future in Gtaʔ, but progressive in Remo.

396

anderson

Stucturally similar complex verb forms deriving from the fusing of an auxheaded AVC can be found in Juang (74) as well, where the original lexical verb stem appears in the non-finite reduplicated form required by the original AVC. (73) redupl-Verb-tam i -tam j (-subj) < *redupl-Lexical Verb+Auxiliary Verb-tam j -(subj) (74) Juang aiɲ dʒɛ’g~dʒɛ’g-nɔm-an I red~cry-prog-pst.intr ‘I was weeping’ (Pinnow 1960-ms: 122)

Morphologically complex verb forms deriving from AVCs in which the lexical verb appeared in a reduplicated form can be found in Sora and Kharia with a formally but not functionally cognate element, encoding frequentative in Sora, and continuous in Kharia. (75)

Sora (76) gu~gu-lo:-te-n red~call-freq-npst-intr ‘he calls (me) frequently’ (Ramamurti 1931: 28)

Kharia gam~gam-loʔ-ki-maj red~talk-cont-pst.i-pl ‘they kept in talking’ (Malhotra 1982: 145)

Another aux-headed AVC pattern that is a family-level characteristic found in the Munda languages is one in which the construction-specific form of the lexical verb is an unmarked or Ø-marked bare stem form, followed by an inflected auxiliary. Such an AVC is found across the Munda language family, including Gtaʔ or Santali. (77) Lexical Verb Auxiliary Verb (78) i. Gtaʔ tʃoŋ n-læʔ-ge eat-1-prog-pst.i ‘I was eating’ (Mahapatra et al. 1989)

ii. Gtaʔ iii. Gtaʔ tʃoŋ n-á-læʔ-ke knweʔ-ɽæ gweʔ we-ge eat 1-neg-aux-pst wife-3 die aux-pst.i ‘I hadn’t eaten’ ‘his wife had died’ (Mahapatra/Zide tx 2; Hook 1991: 184)

397

overview of the munda languages (79) i. Santali dʒɔm =baɽa-ke-t’-a-ko eat =aux-aor-tr-fin-pl ‘they are done eating’ (Bodding 1929: 277)

ii. Santali əgu =hɔt’-ke-t’-ko-a-e bring =aux-aor-tr-pl-fin-3 ‘he brought them quickly’

In Kherwarian languages, one entire series of inflections, called Series-II by Anderson (2007), likely reflects this kind of pattern historically, where the original lexical verb appeared in an unmarked form (as in the Santali forms above), and the original auxiliary was fully inflected. Such is the case for example in the following forms from Mayurbhanj Ho: (80) i. Mayurbhanj Ho okonde=m dʒonom-le-n-a where=2 born-tam-intr-fin ‘where were you born?’ (Field Notes [KCN])

ii. Mayurbhanj Ho mandi dʒom-ke-ɖ-a=m rice eat-tam-tr-fin=2 ‘you ate rice’ (Field Notes [CMH])

One last Munda complex predicate structure needs to be mentioned here. In Gorum (81) (and Gutob) there are formations where both the auxiliary verb and the lexical verb are found in fully finite forms. (81) i. Gorum (Parengi) miŋ ne-gaʔ-ru ne-laʔ-ru I 1-eat-pst 1-aux-pst ‘I ate vigorously’ (Aze 1973: 279)

ii. Gorum (Parengi) miŋ ne-adaʔ-ruʔ ne-k-ruʔ I 1-thirst-pst:aff 1-aux-pst:aff ‘I was thirsty’ (Aze 1973: 296)

Note that while aux-headed formations are more common than doubled inflection in Gutob, Gorum stands out among the Munda languages for its preference for doubly inflected AVCs of this type. As such formations are found in the South-Central and Central Dravidian languages spoken in this same region where Gorum and Gutob are found (82), in the case of Gutob in the same villages, and originally not in the Munda languages probably, it is likely that this doubled inflectional pattern in AVCs is the result of diffusion or borrowing from Dravidian into Munda (Anderson 2003). (82) Muria Gondi punno-n ata-n know:neg-1 aux:pfv-1 ‘I didn’t know’ (Steever 1997: 290–1)

398

anderson

Indeed, complex predicate structure in Munda languages in general is one domain in which these languages likely do reflect convergence with South Asian areal norms. In addition to the aux-headed pattern with reduplicated or bare stem forms of the lexical verb found in the Munda languages mentioned above and the doubled pattern just exemplified, one other inflectional patterns found in South Asian languages should be mentioned here, which represents one of the two cross-linguistically common types of inflectional splits found in AVCs (Anderson 2006). The first pattern has a structure where negative is marked exclusively on the lexical verb and subject exclusively on the auxiliary. Such a formation is found in Remo (84) but is especially common in Kiranti Tibeto-Burman languages like Camling (83) of Nepal. (83)

Camling mi-tim ŋas-i-e neg-meet aux-1pl-npst ‘we have not met’ (Ebert 2003b: 541)

(84) Remo a-sap ɖen-gi-t-iŋ neg-come prog-pst.ptcp-npst-1 ‘I have not been coming’ (Fernandez 1983)

One type of complex predicate structure found in individual Munda languages that does have analogs in other Austroasiatic languages are the deictic serial verb constructions of the following type found in Gtaʔ, where sequences of two or three verbs concatenated in a nuclear serial structure are common in discourse. Often one verb is ‘go’ which indeed appears in both its full form [we-] and a clipped serial form [e-] in these constructions: (85) i. Gtaʔ t~tar-tʃe mmwiŋ slaʔ alo we-tʃe e-rsiŋ-har-ke red~come.out-ss one tree under go-ss go-assemble-pl-pst ‘they came out and then went under a tree and assembled’ ii. Gtaʔ sela-Mboeʔ huŋ-ɖæ-ke bæ-ke “ɖe bobo muta girl=f child-3.refl-obj` send-pst voc child water.jug e-dʒog-raŋ-la ldʒo læʔ-ke ɖaktʃe basoŋ-ke go-pick-bring-imp field cop-pst quot say-tam ‘he sent his daughter, saying “go, my child, and get the jug and bring it back; it got left back at the fields”’

For more on these formations in Gtaʔ see Anderson (2007, 2008a).

399

overview of the munda languages

4.5 Nominal Combining Forms in Sora Noun incorporation is a characteristic feature of several southern Munda languages, especially Sora. Noun incorporation in Munda usually is a stemforming process uniting a semantically ‘verbal’ or predicative element with a semantically ‘nominal’ argument element into an inflectable verb stem, in the order V+N. The form of the noun that is incorporated is the so-called ‘combining form’, usually the CV(C) root. The actual free-standing form of the noun is derived from this root or ‘combining form’ by one of several different means, including prefixation, compounding, infixation, or reduplication. Each of these are exemplified in the table below (Table 5.3): Table 5.3 Derivational Relationships in Combining Forms and Free-Forms in Sora Full Form

Combining Form

Gloss

Means of Derivation

əleb əsoŋ kinsod kəmbud ontid usal gorzaŋ saŋsaŋ tudʒtudʒ dʒeʔeŋ raʔa

-əleb -soŋ -sod -bud -tid -sal -zaŋ -saŋ -tudʒ -dʒeŋ -ra

‘wild-goat’ ‘dung’ ‘dog’ ‘bear’ ‘bird’ ‘skin’ ‘village’ ‘turmeric’ ‘star’ ‘leg’ ‘elephant’

-ə < *N-(n -ə < *N*kVn*kVn*on*u*gor+ RED RED -ʔ-ʔ-

(Starosta 1992: 85–86; Ramamurti 1931: 69ff.)

For some examples of verbs with combining forms incorporated in Sora and other Munda languages, see below. 5

Overview of Munda Syntax

All of the modern Munda languages show a fairly rigid verb-final order. The comparative method suggests then that APV order should be reconstructed to the Proto-Munda ancestral tongue.

400

anderson

(86) Korku iɲdʒ dukana:-ʈen sa:kar sa~sa:-bà I store-abl sugar red~bring-fin ‘I will bring sugar from the shop’ (Nagaraja 1999: 71)

Indirect/Primary Objects usually precede direct objects or themes in most of the Munda languages: (87)

Remo a-niŋ ɖaʔtɔr a-beʔ-tɔ-nɔ-ki obj-I milk neg-give-fut.ii-2-q ‘will you give me milk or not’ (Bhattacharya 1968)

Despite the dominance of SV/APV order in the modern Munda languages, the question arises as to whether pre-Proto-Munda was in fact AVP ~ VAP as is found in most of its sister language families. This is a result of both external/ comparative and internal evidence. The external evidence comes from the fact that the cognate Austroasiatic languages, like Khasi, Khmer, or Bahnar primarily show AVP order, with VAP order found in other Austroasiatic languages, especially in Nicobarese and some other languages, if these languages have syntactically determined order at all. The internal evidence for possible non-APV order in earlier stages of the Munda language family comes from noun incorporation. Here we find the order of elements VN (88), not NV as might be expected in an original APV language. Noun incorporation is an old feature in Munda, with cognates in other languages, e.g. Nicobarese, where the combining forms of the nouns are cognate (but not the free-forms), and these likewise appear in a VN configuration (89). (88)

Remo Gtaʔ Sora guiti (næŋ) mbæʔsiaʔ gweʔ-ti-ke le:m-si-t-am wash.hand I left wash-hand-pst bow-hand-npst-2 ‘wash hand’ ‘I washed my left hand’ ‘I bow to your hands’

Kharia Juang gutʃte gutʃti wash.hand wash.hand ‘wash hand’ ‘wash hands’ (Anderson 2007)

overview of the munda languages

401

cf. (89) Nicobarese təŋ ‘reach; up to’ təŋtataj ‘reach for’ (cf. təŋ/nta ‘reach at’) (Radhakrishnan 1981: 106)

5.1 Switch Reference in Gtaʔ Gtaʔ has grammaticalized the use of switch reference in its complex clause structure. In addition, Gtaʔ narrative style reflects the areally typical use of head-to-tail linkage, with copying of the finite verb of a preceding clause in a non-finite form at the beginning of the following clause. Both of these systems interact in Gtaʔ (Anderson 2007, 2008a, Anderson and Boyle 2002). In Gtaʔ, the same subject marker is –tʃe(ʔ)7 and the different subject marker –la. Examine their use in the following examples (from Mahapatra and Zide, no date). (90) Gtaʔ Same subject ɖukri hoʔ-ru=hoʔ-ria-tʃe swa e-r’~ro-raŋ-tʃe old.woman weep=echo-ss fire go-red~carry-bring-ss hanɖa-Nɖæ-ne moɽ-ke tʃwar-tʃe aʔ-nswar-bo-ke husband-3.refl-poss corpse-obl dry-ss caus-dry-keep-pst ‘the old woman wept a lot and then made a fire, dried up her husband’s corpse and preserved it’ ŋku gnag-hwaʔ toʔ-tʃe ga-ge tiger door-rope open-ss enter-past ‘the tiger opened the door(-rope) and entered’ (91) Gtaʔ Different subject hɽiŋ oʔɽi=mwa sgwa we-la ɖokra gweʔ=we-ge later.on how.much=year like go-ds old.man die=aux-pst ‘later on, after like several years passed, the man died’ a-mane-la bar.mmwiŋ ŋgir=boeʔ huŋ-ɖæ-ke bæ-ke neg-agree-ds another male child-3.refl-obl send-pst ‘she did not agree and he sent another male child’ 7 The allomorph with a final glottal stop occurs in the texts generally only preceding the enclitic =ka meaning either ‘only’ or simply adding emphasis. The glottal stop allormorph in the cognate element in Remo may also possibly (at least originally) be phonologically conditioned by the presence of a following clitic, as appears to be the case in Gtaʔ. Again, it is hoped that better data and further analysis will clarify this issue.

402

anderson

The combination of switch reference and head-to-tail linkage can be seen in the following examples, taken from five consecutive lines in a single text: (92) Gtaʔ ɖæt ɖak-tʃe knweʔ hanɖa wiŋ.haʔ-har-ke like.that say-ss wife husband quarrel-pl-pst ‘speaking like that, the wife and husband quarreled’ (93) Gtaʔ wiŋ=haʔ-la meʔ-swa-ne knweʔ-ɽæ ɖiɖile hliʔ samboʔ we-ke quarrel-ds one-day-def wife-3 ptcp shoot plant go-pst ‘they quarreled and one day the wife went to the bamboo-plant place’ (94) Gtaʔ hliʔ samboʔ shoot plant

we-tʃe poga ɖiɖile hliʔ-nnia tʃu-biʔ-la go-ss tobacco ptcp shoot-near smear-aux-ds

tæn hliʔ-ke tʃu-biʔ-tʃe wig-ke that shoot-obl smear-aux-ss go-pst ‘she went to the bamboo-plant place, smeared tobacco on the shoots and returned home’ (95) Gtaʔ wig-la hɽiŋ hanɖaN-ɖe pag=liʔ we-ke go-ds afterwards husband-refl break=shoots go-past she went and afterwards, the husband went for bamboo shoots’ (96) Gtaʔ hliʔ pag-tʃe tʃoŋke-la poga sgwa bsæʔ læʔ-ke shoot break-ss taste-ds tobacco like bitter aux-past ‘he broke the shoots and tasted them, they were bitter like tobacco’

6

Munda Lexicon

The reconstructed lexicon of the Munda languages remains a subject of ongoing research. With respect to the lexical makeup of the Munda languages, the core or basic vocabulary shares much in common with other Austroasiatic languages, and this is the foundation for the Austroasiatic language family, already established by Schmidt (1906), and codified by Pinnow (1959). Not surprisingly,

overview of the munda languages

403

there is a siginificant layer of Indo-Aryan loans of relatively shallow time depth but in some languages is so pervasive as to alter core vocabulary, e.g. kin terms. In some languages Dravidian loans are evident, while a very small number of lexical items seem to be shared between Munda and Tibeto-Burman languages, probably reflecting a previous contact period. A preliminary attempt at comparative Munda phonology on a pan-Munda and pan-Austroasiatic scale may be found in Pinnow (1959). However, a careful periodization of Munda loan strata and the developments within Munda itself remains a goal for the future. One of the most characteristic and difficult aspects of, and vexing facts about the Munda comparative lexicon is the tendency for cognate forms of nominal elements to show cognacy only in the root/stem portion, but not in the whole free-standing form of the word. This is true both within and across sub-groups of Munda and of Austroasiatic more generally (Anderson and Zide 2002). Combining forms, e.g. as in (97) for ‘hand’ (which repeats (88) above) are usually the roots from which the semi-cognate free-standing forms are derived, and it is these that are cognate across the languages, while the free forms are not (98). (97) PSM *-ti ‘hand’ i. Remo ii. Gtaʔ iii. Sora gui-ti (næŋ) mbæʔsiaʔ gweʔ-ti-ke le:m-si-t-am wash-hand I left wash-hand-pst.b bow-hand-npst-2 ‘wash hand’ ‘I washed my left hand’ ‘I bow to your hands’ iv. Kharia v. Juang gutʃte gutʃti wash-hand wash-hand ‘wash hands’ ‘wash hands’ (98) free-standing forms for ‘hand’ (< PM *X-ti) Language Sora Gorum Juang Kharia Gutob Remo Gtaʔ

‘hand’ siʔi siʔi iti tiʔ titi titi tti

< *Form < *=ti + *-ʔ< *=ti + *-ʔ< *N=ti < *=ti + *-ʔ< *Redpl-=ti < *Redpl-=ti < *Redpl-=ti

404

anderson

Across the entire Austroasiatic phylum as a whole, including within specific subgroups of Munda, it is often not possible to reconstruct a full, syntactically free-standing form of a noun, but rather only the root that is found in the word (usually *CVC in shape), and further, that root may occur in semantically related families of forms, in compounding structures and as the form that serves for noun incorporation with verbs, and moreover is the element that is typically cognate with other Austroasiatic subgroups. The free-standing form on the other hand is created from the root by some derivational process such as prefixation, infixation, suffixation, or reduplication. This helps the freestanding form of the noun meet a minimal word constraint that the underlying root form fails to meet; see Figure-5.6. W[ord]

µ µ µ

µ

(µ)

V

ʔ]w

w [C-C..]

Figure 5.6 Bimoraic Word Constaint in Proto-Munda and Proto-Austroasiatic.

One attempt at situating the Munda data on the free-forms of nouns within their broader comparative Austroasiatic perspective is Anderson and Zide (2002). Anderson and Zide (2001) approached the topic of verb morphology from the same perspective. Both studies were subsequently combined and later expanded as Anderson (2004) and summarized here. The reflexes of the older prefix system and the minimal word constraint is best preserved when looking at the more southern Munda languages. Kherwarian languages have lost most traces of the prefixation system (some a- initial forms likely reflect a syllabic nasal prefix like Santali alaŋ, and all of the Munda languages appear to have lexicalized prefixes in specific lexemes, for example, *lV- in the word for ‘ear’), however, and many old Munda roots in Kherwarian nouns now appear only in a monomoraic root form (e.g., Ho ti ‘hand’). Some derivational processes are common across the Munda language family such that both the root and the word form can be relatively safely inferred and Proto-Munda forms securely reconstructed (‘turmeric’ and ‘broom’ below for example), but there is often no general agreement in the means of creating the word from the root across the Munda languages. What does emerge are

405

overview of the munda languages

various kinds of patterns that not infrequently correspond as specific traits for the recognizable low-level sub-groups, and thus proto-forms can be proposed, for example, for Proto-Gutob-Remo or Proto-Sora-Gorum. Table 5.4 offers a set of noun correspondences across the Munda language family which can be seen as typical of such correspondence sets. The patterns of word formation that yield the bimoraic free-standing forms of these nouns are abstracted in Table 5.5. The main methods are reduplication (rdpl), n-infixation (-n-), prefixation of a syllabic nasal (N-) or what was a syllabic nasal historically (*N-), one of two *k-initial prefix forms, glottal stop infixation (-ʔ-) or suffixation (-ʔ), or suffixation of –a/-e/-ai/-ae. The forms for ‘turmeric’ (> ‘yellow’) and ‘broom’ are some of the few panMunda etymologies where all subgroups show the same pattern of word formation from the same *CVC root. Note that Gtaʔ is quite distinct in its noun formation processes from Gutob and Remo, and variant forms that coincide with Remo (as in ‘hand’) are some of the most obvious examples of borrowing that Gtaʔ shows from Remo; doublets of this type where one variant matches Remo (and the other usually has a syllabic nasal prefix) are common in Gtaʔ. Further, Kharia and Juang agree in this set mainly in pan-Munda etymologies (‘bear’ is one exception). Further while lexemes or alternate stems not infrequently correspond across North Munda languages, even across different Kherwarian varieties one can find unexpected distribution of the older stems in the modern languages, such as the presence of the otherwise South Munda-looking form leʔe for ‘tongue’ in Ho (or le in Mundari), and the presence of corresponding Table 5.4 Selected Munda noun correspondences Gutob

Remo

Gtaʔ

Kharia

titi susuŋ mɔd, mɔʔ gikil, kilɔ gubɔn sasaŋ sunog

titi tiksuŋ mɔd

tti, nti tiʔ nco *-dʒuŋ m-mwaʔ moɖ

kilɔ gibɛ saŋsaŋ sunug

ŋku gbɛ ssia cnoʔ

Juang

Sora

iti idʒiɲ/ŋ ɔmɔr/d

sʔi dʒʔeŋ mʔɔd, amad kiɽo(g) kiɭɔg kɨna bane/ai banae kəmbud saŋsaŋ sa(ɽa)ŋsaŋ saŋsaŋ dʒonoʔ dʒenɔk dʒənoʔ

*Kharia preserves the old stem in gudʒuŋ ‘wash someone’s feet’

Gorum Kherw

gloss

siʔi dʒiʔiŋ mad

ti ~ tii dʒaŋga mɛt/ɖ

‘hand’ ‘foot’ ‘eye’

kul(a) kibud saŋsaŋ dʒɔnɔ

kul[a] (Mu) bana sasaŋ dʒɔnɔ[k’]

‘tiger’ ‘bear’ ‘turmeric’ ‘broom’

406

anderson

Table 5.5 Word formation processes in Table 5.4 Gutob

Remo

Rdpl Rdpl Rdpl-√ tik-√ Ø Ø

Gtaʔ

Kharia Juang

Rdpl, N- -ʔ N— NØ

*N*N*N-

Sora

Gorum

Kherwari gloss

-ʔ-ʔØ

Ø, ˘ -a Ø

Ø/-ʔ[-] *kV-√1

Ø N*kV-√2 *kV-√2

Ø Ø √1-e/-ai √1-ae

-ʔ-ʔ-ʔ-, a- < *NØ *kən-√3

red

red

red

red

red

red

Ø Ø *kən- √1-a √3/ *kV-√3 red red

-n-

-n-

-n-

-n-

-n-

-n-

-n-

-n-

‘hand’ ‘foot’ ‘eye’ ‘water’ ‘bear’

‘turmeric’ ‘broom’

otherwise North Munda looking alaŋ for ‘tongue’ in Sora; the Remo form appears to have both roots combined. Note that the forms for ‘dog’ show the typical North Munda stem form with –e- vocalism, a lack of prefixes and the suffix –a, as well as characteristic prefixes in Sora (kin-), Remo (gu-), and Gtaʔ (g-) and their characteristic stem reflexes with rounded vowels. The colorcoding here reflects the varied root-allomorphs and derivational processes used to derived the free-forms Table 5.6 ‘dog’ and ‘tongue’ in six Munda languages Ho

Bhumij

Santali

Sora

Remo

Gtaʔ

gloss

seta

seta

seta

kinsor

gusoʔ

gswi/gsui

leʔe

alaŋ

alaŋ

alaŋ

leʔa[ŋ]

nle

‘dog’ < *s√ ʔɖ, *səʔɖ ‘tongue’ < *le[ʔ], *laŋ

To summarize, lexical and phonological data are tied inextricably with word formation processes in Munda. To ignore all three kinds of information, as well as excluding contact between different Munda languages as a viable explana-

overview of the munda languages

407

tion for a given observed homology can only yield partially valid results. At times it can seem like there are almost as many patterns of correspondences as there are correspondence sets themselves. What does become clear is that the obvious subgroups emerge (e.g., Kherwarian or Gutob-Remo). In order to rectify this situation, the only apparent current solution is to get an adequately large data sample and begin anew the comparisons in a step-wise systematic fashion. To be sure, Munda is not alone among subgroups of Austroasiatic in showing regular correspondences across the roots in nouns, but not in the freestanding forms of the nouns themselves with different individual languages utilizing different word-building processes in the creation of the syntactically free-standing noun forms from their associated underlying roots (or possibly reflecting different morphologically related sets of words derived from a common root historically speaking). Virtually all other Austroasiatic subgroups exhibit this kind of partial correspondences when looking at individual lexemes. Thus, the various Nicobarese languages all show the same cognate root for ‘hand’ but embedded within different word-formation templates: (99) Nicobarese words for ‘hand’ (or ‘palm’) (Man 1975 [1888–9]) Central Car Shom Pen Teressa gloss kane-tai el-tiː noai-tiː mɔh-tiː ‘hand’

In Car Nicobarese (Pu), the tiː element occurs with different initial segments, forming a set or family of related words all based on the same basic root: (100) Car Nicobarese (Das 1977: 32) ɛltiː ‘palm of hand’ uktiː ‘back of hand’ kuntiː ‘finger’

How does one explain this seemingly paradoxical situation, where roots are clearly cognate, but it is often difficult to determine how and why full forms derived from such roots will be differentiated in individual languages or subgroups. The roots often appear preceded by a small set of what thus appear to be prefixes, as in the Kensiw forms in (101) or the Bolyu forms in (102). (101) Kensiw (Bishop and Peterson 1994: 176, 178, 185) tə-, ma-, -kɔʔ təkɔʔ ‘pus, sputum’ makɔʔ ‘egg’ (102) Bolyu mɔldʒǎŋ mɔti mɔmau mɔƒǔn cf. ɣəmti ‘eagle’ ‘finger ‘finger’ ‘bamboo’ ‘arm’ (Edmondson 1995: 139–141) mɔ-, ɣəm-, -ti, -mau, -ƒǔn, -ldʒǎŋ

408

anderson

In some cases, one may be dealing with both prefixes and compounds stems, as the root element seems to be able to appear in either position in a prefix-√ structure or in a putative √-√ structure as well, as the following sets of related forms in Bolyu in (103) suggest, specifically the distribution of mat-/-mat and dʒɔŋ-/-dʒɔŋ. (103) Bolyu ɬaimat ‘eye’

mbɔŋmat ɬaimbu ɬaitən mɔtən lɔŋmi lɔŋʔi ‘eyelid’ ‘louse’ ‘melon’ ‘melon seed’ ‘nose’ ‘anus’

matdʒɔŋ dʒɔŋɬəŋ ‘ankle joint’ ‘leg’ (Edmondson 1995: 144–154) ɬai-, mɔ-, lɔŋ-; -mbu, -tən, -mi, -ʔi; mat-, -mat, dʒɔŋ-, -dʒɔŋ; mbɔŋ-, -ɬəŋ

All this together leads one to ask whether there was a system of prefixes/ proclitics that was semantically rich in PAA, perhaps later reinterpreted as opaque, co-lexicalized phonological material used to meet a minimal word constraint in individual subgroups. Although stripped of their semantic bases, the derivational processes were still necessary phonologically for the nouns, even if no longer semantically connectable or predictable. So the irregular correspondences witnessed above may either have once reflected a meaningful semantic contrast in the forms, now lost, and/or simultaneously in specific cases represent later individual, lexeme- and language-/dialect-specific means of resolving a minimal word constraint. At least this is what appears to have happened in the history of Munda, and the varied correspondences presented above suggest it may have been true in other Austroasiatic subgroups as well, which in turn suggests we look to deeper history for an explanation of the observed phenomena. A look through the lexicon of a wide range of Austroasiatic languages reveals recurrent elements that may have reflected a once active system of semantic prefixation or noun classes.8 Thus, ɛl occurs in a number of body parts in Car Nicobarese (104). Other prefixes don’t correlate exactly with obvious semantic classes like this, but nevertheless are common enough in the lexicon of the language, e.g. li-, ta-, ha-, etc. (105). (104) Car Nicobarese (Das 1977: 17) [ɛl]mɛh ‘nose’ ɛlmat ‘color’ ɛlkui ‘brain’ ɛlti ‘palm of hand’ ɛlŋoh ‘chest’ ɛlran ‘sole, hoof’

ɛlwaŋ ‘mouth’

8 Note that similar phenomena are also found in Tibeto-Burman subgroups.

409

overview of the munda languages (105) Car Nicobarese (Das 1977: 31–32, 41, 42) tarul ‘cloud’ tatʃam ‘dew’ tahɯi ‘today’ taaː ‘day after tomorrow’ takɯn ‘thigh’ haniːŋ ‘axe’ hataːm ‘night’ har̃ap ‘evening’ litak ‘tongue’ likɯn ‘nape’ kilap ~ likap ‘gullet’

Bugan is an only recently identified tonal Austroasiatic language of southern China that appears to belong in the Mangic family, with six tones (55, 33, 35, 13, 31, + 0 w/prefixes)) described for the language in the analysis of Li (1996), although this really seems more like a two level tone and three contour tone system of five tones, with the set of elements which are toneless representing putatively archaic prefixes. It is likely that better data on Mangic languages will help shed light on the still opaque semantics that might (have) be(en) associated with these elements. (106) Bugan (Li 1996: 137–139, 141, 144) pə55bop31/pə55bou31 ‘head’ pə55tsɛ35 ‘ear’ pə55tsə ‘placenta’ pə55lai33 ‘tongue’ pə55nuk31 ‘heart’ pə0mã31 ‘nose’ mə0kua31 ‘folk song’ mə33tsa31 ‘hand straw cutter’ tə0qou35 ‘palm of hand’ tə0kɔ33 ‘tiger’ te55taŋ55 ‘moss’

pə55mai35 ‘pupil of eye’ pə0kui55 ‘craftsman’ mə33dou33 ‘firewood knife’ te55kuk31 ‘frost’

Some prefixal elements are semantically clear in Bugan. How widespread such elements are remain a subject for future research to resolve (107). Formally there are two types of such elements, tone bearing ones like pu55-, mu33-, or ɲa33-, and toneless ones like tse0- (108). Whether this represents two diachronic layers of grammaticalization of these elements is unknown at present. (107) Bugan pu- > male animals pu55laŋ31 ‘stallion’ pu55li55 ‘ox’ mu- > plants and mu33paŋ31 mu33tsau33 ‘bitch’ mu33laŋ31 ‘mare’ female animals ‘peach tree’ mu33li55 ‘cow’ > mu33xi31laŋ31 mu33xi31li55 ‘small mare’ ‘small cow’ ɲa33 > diminutives ɲa33tsaŋ55 ‘mouse’ ɲa33li55 ‘calf’ ɲa33pu55li55 ‘small ox’ (108) Bugan tse- > birds tse0qa35 ‘duck’ tse0ŋaŋ35 ‘goose’

410

anderson

7 Summary Munda languages are clearly Austroasiatic in origin and retain many features that reflect this history. However, they also clearly reflect an extensive network of areal contacts and developments at various stages in their history. Because of the great age of the Austroasiatic phylum, and the inferrable history of the Munda languages, it is certain that ongoing investigations into periodicizing the stages in the historical development of the Munda languages will yield considerable insight into still vexing problems in the ancient ethnolinguistic history of the entire South Asian and Southeast Asian regions, in both of which the Munda languages appear to have played significant roles. 8 References Alves, M. 2006. A grammar of Pacoh. Canberra: ANU. Anderson, Gregory D.S. 1999. A new classification of the Munda languages: Evidence from comparative verb morphology. Paper presented at 209th meeting of the American Oriental Society, Baltimore. ———. 2000. Santali. In Encyclopedia of the World’s Major Languages. In Jane Garry and Carl Rubino (eds.). New York & Dublin: H.W. Wilson Press, 623–628. ———. 2001. A new classfication of Munda: Evidence from comparative verb morphology. Indian Linguistics 62: 27–42. ———. 2002. Issues in proto-Munda and proto-Austroasiatic nominal derivation: The Bimoraic Constraint. In Marlys A. Macken (ed.), Papers from the 10th Annual Meeting of the Southeast Asian Linguistics Society. Tempe, AZ: Arizona State University, South East Asian Studies Program, Monograph Series Press, 55–74. ———. 2003. Dravidian influence on Munda. International Journal of Dravidian Linguistics 32.1: 27–48. ———. 2004. Advances in proto-Munda reconstruction. Mon-Khmer Studies 34: 159–184. ———. 2006. Auxiliary verb constructions. Oxford: Oxford University Press. ———. 2007. The Munda verb. Typological perspectives. Berlin: Mouton de Gruyter. ———. 2008a. Gtaʔ. In Gregory. D.S. Anderson (ed.), 682–763. ———. 2008b. Remo (Bonda). In Gregory D.S. Anderson (ed.), 557–632. ———. 2011. Auxiliary verb constructions and other complex predicate types: A functional-constructional approach. Language and Linguistics Compass 5.11: 795–828. ———. (ed.). 2008. The Munda Languages. Routledge Language Family Series. Abingdon & Oxford: Routledge. Anderson, Gregory & John P. Boyle. 2002. Switch-reference in South Munda. In Marlys A. Macken (ed.), Papers from the 10th Annual Meeting of the Southeast Asian

overview of the munda languages

411

Linguistics Society. Tempe: Arizona State University, South East Asian Studies Program, Monograph Series Press, 39–54. Anderson, Gregory D.S. & K. David Harrison. 2008. Sora. In Gregory D.S. Anderson (ed.), 299–380. Anderson, Gregory D.S., Toshiki Osada and K. David Harrison. 2008. Ho and the other Kherwarian languages. In Gregory D.S. Anderson (ed.), 195–255. Anderson, Gregory D.S. & Felix Rau. 2008. Gorum. In Gregory D.S. Anderson (ed.), 381–433. Anderson, Gregory D.S. & Norman H. Zide 2001. Recent advances in the reconstruction of the Proto-Munda (Austroasiatic) verb. In Laurel J. Brinton (ed.) Historical Linguistics 1999. Amsterdam: Benjamins, 13–30. Aze, Richard. 1973. Clause patterns in Parengi-Gorum. In Ronald L. Traill (ed.), Patterns in clause, sentence, discourse in selected languages of India and Nepal, Vol. 1. Norman, Okla: Summer Institute of Linguistics, 235–312. Banker, Elizabeth M. 1964. Bahnar affixation. Mon-Khmer Studies 1: 99–118. Bhat, D.N.S. 1997. Noun-verb distinction in Munda languages. In Anvita Abbi (ed.), Languages of tribal and indigenous peoples of India: The Ethnic Space. Delhi: Motilal Banarsidass, 227–251. Bhattacharya, Sudhibhushan. 1968. A Bonda dictionary. Poona: Deccan College. ———. 1975a. Linguistic convergence in the Dravido-Munda culture area. In International Journal of Dravidian Linguistics 4, 199–214. ———. 1975b. Studies in Comparative Munda Linguistics. Simla: Indian Institute of Advanced Study. Bishop, Nancy & M.M. Peterson. 1994. Kensiw glossary. Mon-Khmer Studies 23: 163–196. Bodding, Paul O. 1923. Materials for a Santal Grammar, Phonetic. Dumka: Santal Mission of N. Churches. ———. 1929. Materials for a Santali Grammar, II. Dumka: Santal Mission. Cook, Walter. 1965. A descriptive grammar of Mundari. Ph.D. Dissertation, Georgetown University. Costello, Nancy. 1965. Affixes in Katu. Mon-Khmer Studies 2: 63–86. Das, A.R. 1977. A study on the Nicobarese language. Calcutta: Anthropological Survey of India. Deeney, John. 1979. Ho Grammar and Vocabulary. Chaibasa: Xavier Ho Publications. Diffloth, Gérard. 1989. Proto-Austroasiatic Creaky Voice. Mon-Khmer Studies 15: 139–54. ———. 2005. The contribution of linguistic palaeontology and Austro-Asiatic. In Laurent Sagart, Roger Blench & Alicia Sanchez-Mazas (eds), The Peopling of East Asia: Putting together archaeology, linguistics and genetics. London: RoutledgeCurzon, 77–80. Diffloth, Gérard and Norman Zide. 1992. Austro-Asiatic languages. In William Bright (ed.), International Encyclopedia of Linguistics, Volume 1. New York & Oxford: Oxford University Press, 137–142.

412

anderson

Donegan, Patricia J. 1993 Rhythm and Vocalic Drift in Munda and Mon-Khmer. Linguistics in the Tibeto-Burman Area 16.1: 1–43. Donegan, Patricia J. & David Stampe. 1983. Rhythm and the holistic organization of language structure. In The Interplay of Phonology, Morphology, and Syntax, Chicago: Chicago Linguistic Society, 337–353. ———. 2004. Rhythm and synthetic drift of Munda. In R. Singh (ed.), Yearbook of South Asian Languages and Linguistics 2004, 3–36. Edmondson, Jerold. 1995. English-Bolyu Glossary. Mon-Khmer Studies 24: 133–160. Evans, Nicholas, and Toshiki Osada. 2005. Mundari: The myth of a language without word classes. Linguistic Typology 9.3: 351–390. Fernandez, Frank. 1968. A grammatical sketch of Remo: A Munda language. Ph.D. dissertation, University of North Carolina. Ghosh, Arun. 1994. Santali: A Look Into Santali Morphology. New Delhi: Gyan Publishing House. ———. 2008. Santali. In Gregory D.S. Anderson (ed.), 11–98. Griffiths, Arlo. 2008. Gutob. In Gregory D.S. Anderson (ed.), 633–681. Haji Omar, Asmah. 1976. The verb in Kentakbong. In Philip Jenner et al. (eds.), 951–970. Hayes, La Vaughn H. 1992. On the track of Austric: part I. Mon-Khmer Studies 21, 143–78. Henderson, Eugénie J.A. 1976. Vestiges of Morphology in Modern Standard Khasi. In Philip Jenner et al. (eds.), 477–522. Hoffmann, Johan. 1950. (1930–1950) Encyclopedia Mundarica, Patna: Government Superintendent Printing. 15 volumes. Hook, Peter. E. 1991. The Compound Verb in Munda: An Areal Overview. Language Sciences 13.2: 181–195. Huffman, Franklin E. 1970. Modern spoken Cambodian. New Haven & London: Yale University Press. Jenner, Philip N., Laurence Thompson and Stanley Starosta (eds.). 1976. Austoasiatic Studies II. Honolulu: University of Hawai‘i Press. Li, Jinfang. 1996. Bugan-a new Mon-Khmer language of Yunnan Province, China. MonKhmer Studies 26: 135–160. Mahapatra, Bijoy & Daniel Matson. 1962. Brief Description of Juang. Unpublished-ms. Puri. Mahapatra, Kageshwar. (with Dobek Pujari and P.K. Panda). 1989. ɖiɖayi (In Oriya). Bhubaneshwar: Academy of Tribal Dialects and Culture, Government of Odisha. Mahapatra, Kageshwar & Norman H. Zide. No date. Gtaʔ Texts. Unpublished manuscript. Malhotra, Veena. 1982. The structure of Kharia: A study of linguistic typology and language change. Unpublished Ph.D. dissertation, Jawaharlal Nehru Univeristy, New Delhi.

overview of the munda languages

413

Man, E.H. 1975. [1888–9]. A Dictionary of the Central Nicobarese Language. Delhi: Sanskaran Prakashak. Matson, Daniel. 1964. A grammatical sketch of Juang. Ph.D. dissertation, University of Wisconsin. Nagaraja. K.S. 1985. Khasi: A descriptive analysis. Pune: Deccan College. ———. 1999. Korku grammar with texts. Tokyo: Tokyo University of Foreign Studies. Nai Pan Hla. 1976. A comparative study of Old Mon Epigraphy and Modern Mon. In Philip Jenner et al. (eds.), 891–918. Osada, Toshiki. 1992. A reference grammar of Mundari. Tokyo: Institute for the Study of Languages and Cultures of Asia and Africa. ———. 2008. Mundari. In Gregory D.S. Anderson (ed.), 99–164. Peiros, Ilia. 1998. Comparative linguistics in Southeast Asia. Pacific Linguistics C-142. Canberra: ANU. Peterson, John. 2008. Kharia. In Gregory D.S. Anderson (ed.), 434–507. Pinnow, Heinz-Jürgen. 1959. Versuch einer historischen Lautlehre der Kharia-Sprache. Wiesbaden: Otto Harrassowitz. ———. 1960. Beiträge zur Kenntnis der Juang-Sprache. Unpublished manuscript. ———. 1963. The position of the Munda languages within the Austroasiatic language family. In H. Shorto (ed.), Linguistic comparison in South East Asia and the Pacific, 140–152. London: SOAS. ———. 1966. A comparative study of the verb in the Munda languages. In Norman Zide (ed.), Studies in comparative Austroasiatic linguistics. (Indo-Iranian Monographs v. 5) The Hague: Mouton, 96–193. Pinnow, Heinz-Jürgen, J. Kullu & P. Kerketta. 1965. Der Wertiger und andere Geschichte in Kharia. Indo-Iranian Journal 9.1: 32–68. Radhakrishnan, R. 1981. The Nancowry Word. Edmonton: Linguistic Research, Inc. Ramamurti, G.V. 1931. A Manual of the So:ra: (or Savara) Language, Madras: Government Press. Schmidt, Pater P.W. (1906) Die Mon-Khmer Völker: Ein Bindeglied zwischen Völkern Zentralsiens und Austronesiens. Archiv der Anthropologie, Braunschweiger 33 [n.s. 5]: 59–109. Shorto, H.L. 2005. A Mon-Khmer comparative dictionary. Canberra: ANU. Starosta, Stanley. 1967. Sora syntax: A generative approach to a Munda language. Ph.D. dissertation, University of Wisconsin. ———. 1992. Sora Combining Forms and Pseudo-Compounding. Mon-Khmer Studies 18–19: 78–105. Zide, Arlene R.K. 1968. A comparative lexicon of Sora and Gorum. Unpublished manuscript. Zide, Norman. H. 1972. A Munda demonstrative system: Santali. In J. Barrau, et al. (eds.) Langues et techniques, nature et société. vol. I Approche linguistique. Paris: Klincksieck, 267–274.

414

anderson

———. 1978. Studies in the Munda numerals. Mysore: CIIL. ———. 2000. Three Munda Scripts. Linguistics of the Tibeto-Burman Area 22: 199–232. ———. 2008. Korku. In Gregory D.S. Anderson (ed.), 256–298. ———. 2001. The Proto-Munda Verb: Some Connections with Mon-Khmer. In K.V. Subbarao and Peri Bhaskararao (eds.), Yearbook of South-Asian Languages and Linguistics-2001. Delhi: Sage Publications, 517–40. Zide, Norman. H. & Gregory D.S. Anderson 1999. The Proto-Munda Verb and Some Connections with Mon-Khmer. In Peri Bhaskararao (ed.), Working Papers: International Symposium on South Asian Languages: Contact, Convergence and Typology. Tokyo: Institute for the Study of Languages and Cultures of Asia and Africa. Tokyo University of Foreign Studies, 400–419.

part 2 Grammar Sketches

∵

Map 1.1

Location of languages represented in the grammar sketches.

Car

Danau

Pnar

Khasi

Mon

Kui

Semaq Beri

Koho Bunong

Sedang

Pacoh

Vietnamese

Northern Aslian

Khmer

Chong

Mlabri

Kammu Dara’ang

Bugan

Section 1 Aslian

․․

chapter 1

Northern Aslian Nicole Kruspe, Niclas Burenhult and Ewelina Wnuk This chapter provides a typological profile of Northern Aslian, a subbranch of the Aslian branch of Austroasiatic comprising a number of closely related language varieties spoken in southern Thailand and northern Peninsular Malaysia. It builds on the authors’ ongoing field research, as well as available sources. The profile is based on data from most known varieties of Northern Aslian, but draws in particular on the Ceq Wong (cwg), Jahai (jhi) and Maniq (tnz) languages, in accordance with the current research focus of each author. 1 Background Northern Aslian is one of the four clades of the Aslian branch of Austroasiatic, as evidenced by historical phonology (Diffloth 1975), conventional lexicostatistics (Benjamin 1976a), and recent phylogenetic studies (Dunn et al. 2011, 2013; Burenhult et al. 2011). The clade encompasses some 8 to 10 speech varieties spoken by minority groups in northern and central Peninsular Malaysia and southern Isthmian Thailand. It is divided into two main branches: (1) Ceq Wong, a geographical isolate in western Pahang state, Peninsular Malaysia; and (2) Maniq/Menraq-Batek (MMB), a group of closely related varieties forming a dialectal continuum from southern Thailand down into northern Pahang. The latter group can be tentatively subdivided into two subgroups, one comprising northerly varieties like Maniq, Kensiw, and Kentaq (which share the term mniʔ or maniʔ for ‘human being’), spoken in southern Thailand and adjacent areas across the border to Malaysia, and one comprising the varieties Jahai, Menriq, and Batek, strung out from the border area to northern Pahang. Jahai and Menriq share a term for human being, mnraʔ, whereas Batek uses the eponymous term batɛk. Table 1.1 provides an overview of the Northern branch of Aslian. Historically, the two subbranches likely separated soon after proto-Northern Aslian branched off from the rest of Aslian, possibly around 1500–2000 BP, and the secondary separation of the Maniq subgroup from Menraq-Batek took place a few centuries later (Dunn et al. 2013:393). All MMB varieties are closely associated with the ethnographic category Semang, mobile subsistence © koninklijke brill nv, leiden, ��5 | doi ��.��63/9789004283572_007

420 Table 1.1

Ethnolinguistic group

Kruspe, Burenhult and Wnuk Overview of Northern Aslian. Thailand data is based on Phaiboon (2006) and Wnuk (2009 2006; Pahang and Terengganu information draws on Lye (1997, 2004, personal communicaBatek are from 2003 and were obtained from the website of the Center for Orang Asli Subgroup Branch

Maniq

Kensiw

Location

Thung Wa, Langu, Manang, and Khuan Don districts of Satun province, Palian district of Trang province, Kong Ra, Tamot and Pa Bon districts of Phatthalung province, and reportedly Khuan Kalong of Satun and Rattaphum district of Songkhla province, Thailand Thanto district of Yala province, Thailand; Baling district of Kedah state, Malaysia

Maniq

Kentaq

Hulu Perak district of Perak state, Malaysia

Tea-de

Weang and Srisakorn districts of Narathiwat province, Thailand Maniq/ MenraqBatek

Jahai

Menriq

MenraqBatek

Batek

Ceq Wong

Hulu Perak district of Perak state and Jeli district of Kelantan state, Malaysia; Srisakorn district of Narathiwat province and Betong district of Yala province, Thailand Gua Musang, Jeli and Kuala Krai districts of Kelantan state, Malaysia

Gua Musang district of Kelantan state, Lipis and Jerantut districts of Pahang state, and Hulu Terengganu and Besut districts of Terengganu state, Malaysia

Ceq Wong

Temerloh and Raub Districts, Pahang state, Malaysia

Northern Aslian

421

field notes); Malaysia data from Kedah, Perak and Kelantan is from a survey conducted by Burenhult 2004tion) and Kruspe (2002; 2008 field notes); Malaysian total population figures for Kensiw, Kentaq, Jahai, and Concerns (http://www.coac.org.my). Sociolinguistic situation

Several nomadic, semi-nomadic, and settled groups living within Thai villages or in their own permanent settlements. The largest group of 54 (as of 2009) is settled in Baan Khlong Tong in Trang. Total number of speakers is above 240 (Wnuk 2009 fieldnotes)—most likely about 300.

Thailand group settled in one village, number of speakers is unknown; Malaysian group settled in one village and numbers 232 speakers; occasional speakers in semi-nomadic mixed Kensiw-Jahai groups along the border. Settled in one village, 157 speakers; close contacts with Kensiw, which is very closely related. Situation unknown; groups possibly still largely nomadic; may be identical to a group referred to by the Jahai as Jdɛk, in which case Tea-de is in contact with Kelantan Jahai. Several settled, semi-nomadic and nomadic groups, with particular concentrations around Lake Temengor in Perak and Sungai Rual in Kelantan; while the Thailand figure is unknown, the number of speakers in Malaysia is 1,843; in Perak largely co-settled with Temiar, and in Kelantan with Menriq. Settled in three villages along the middle portion of the Kelantan river and one village in Jeli district; only in two of these locations do speakers refer to themselves as Menriq; in the other two they refer to themselves as Batek Təh; total number of speakers can be estimated at 250–300; one group is co-settled with Batek Deq, another with Jahai. Several settled, semi-nomadic and nomadic groups, divided into five distinct dialects (Batek Deq, Batek Teq, Batek Nong, and Batek Tanum); the number of speakers is 1,255, although this figure is likely to include some 100 speakers of Menriq who nevertheless refer to themselves as Batek; co-settled with Menriq in one location in Kelantan. A group of Batek Deq live in a resettlement village with the Semaq Beri in Terengganu; c. 26 Batek Teq live in a small resettlement village in Besut, Terengganu. The Batek Tanum are sometimes referred to as Mintil by other Batek. There is no information for Batek Nong. The Eastern group live in one village, with several small family groups living in scattered locations in the Krau Game reserve. Total number of speakers less than 200. The Western Ceq Wong are resident in several mixed villages along with Semai and Temuan (Aboriginal Malay) speakers. The Eastern Ceq Wong report dialectal differences between the two groups. They do not maintain contact with them. Population figure is unknown.

422

Kruspe, Burenhult and Wnuk

foragers and traders. For the most part they form a closely knit network of contact and intermarriage, and they make up a distinct sociolinguistic niche (Benjamin 1985a; Burenhult et al. 2011). Ceq Wong, on the other hand, does not form part of the Semang societal sphere and does not partake in the pattern of contact characteristic of the rest of Northern Aslian. The Ceq Wong practise a ‘mixed’ economic adaptation of foraging with semi-sedentary residence like that of swidden horticulturalists. Our current understanding of linguistic structure in Northern Aslian relies primarily on Burenhult’s 2005 grammar of Jahai, as well as ongoing descriptive work on Ceq Wong and Batek by Kruspe (in progress), Maniq1 by Wnuk (in progress), and Menriq by Burenhult (and more recently by Yager, in progress). Additional materials include an early grammatical sketch of Jahai by Schebesta (1928), an Honor’s Thesis on Kentaq by Asmah (1963), as well as a number of shorter works on Kensiw by Bishop and Peterson (see especially Bishop 1996, and Peterson 1997), and a description of Maniq phonology by Wnuk & Burenhult (in press). Furthermore, various aspects of lexical semantics have been explored in Jahai by Burenhult (2003, 2006, 2008a, 2008b, 2011, 2012), Burenhult & Majid (2011), Levinson & Burenhult (2009), and Majid and Burenhult (2014). Loanwords in Ceq Wong are described in Kruspe (2009a&b). Much of the authors’ own descriptive work has taken place within the framework of major modern language documentation initiatives, especially the Volkswagen Foundation’s DOBES program and the Hans Rausing Endangered Languages Project.2 2

Phonetics and Phonology

2.1 Word and Syllable Structure Northern Aslian languages display word structure typical of the Aslian family. Most words (underived lexemes) tend to be mono-, sesqui- or disyllabic, as in

1 In this chapter ‘Maniq’ refers only to the language spoken by the Maniq in the Banthad mountain range, see Table 1.1. 2 Kruspe’s research was supported by the Research Centre for Linguistic Typology at La Trobe University, the Hans Rausing Endangered Languages Program, and the Volkswagen Foundation (DOBES), Burenhult’s research was supported by the Swedish Research Council (421-2007-1281), the Volkswagen Foundation (DOBES), and the European Research Council (the European Union’s Seventh Framework Programme, Grant agreement n° 263512), and Wnuk’s is supported by Max Planck Gesellschaft.

Northern Aslian

423

Batek Deq toʔ ‘elder sibling’, ploʔ ‘fruit’, and pacɛ̃w ‘Water monitor (Varanus salvator)’. Longer trisyllabic word forms also occur, like Batek Deq tlabas ‘Malayan Sun bear’ and Jahai kmalbɔp ‘a type of vegetable’, however in many cases they appear to bear traces of morphological complexity, probably the result of fossilized processes. The minimal word in Northern Aslian is CV(C), while the maximal word, attested in Jahai, is the derived tetrasyllabic form /C.C.C(V)C.CVC/ as in t-b-tadɔʔ (rel-prog-wait) ‘waiting’, e.g. tmkal t-b-tadɔʔ ‘the waiting man’. There is a preference in Northern Aslian for words to have a closed final syllable. In Jahai and one variety of Menriq final syllables are always closed, but several languages including Ceq Wong, Batek, Menriq and Maniq allow words with an open final syllable. Such words are generally less numerous than those with closed syllables, but they are often high-frequency items, as in Ceq Wong sɑ ‘a little’, and galə ‘cassava’ from Malay ubi benggala ‘potato’ and Maniq wa ‘to go’, ce ‘to dig’, and kabi ‘to be new’. In Northern Aslian languages, the minimal syllable is [CV], and the maximal syllable is usually [CVC], or [CCVC] for Ceq Wong and Maniq. Syllabification proceeds from right to left, and is maximally exhaustive. In Jahai, for example a string CCCVC will be parsed as /CC.CVC/. Sequences of consonants are articulated with the assistance of an epenthetic vowel, so /CC.CVC/ is [CVC.CVC]. The default realisation of the epenthetic vowel is schwa [ə]. There are two variants [i] and [u], conditioned by the segmental features of onset of the syllable to the right. A palatal segment produces [i] as in Ceq Wong sy�ʔ [ɕiyɑʔ] ‘to look around’; and a bilabial, labio-velar or velar segment may result in [u], e.g., lwəc [lu.wəc] ‘to ascend’ and skoʔ [ɕu.koʔ] ‘nail, claw’ (cwg). When the onset of the final syllable is a glottal segment, /h/ or /ʔ/, the epenthetic vowel may be a copy of the final syllable vowel, as thun [tu.hun] ‘year; season’ (cwg). Stress placement is invariably on the final syllable. 2.2 Phoneme Inventory and Phonotactics Northern Aslian languages have rather typical Austroasiatic consonant systems. There is a core set of distinctive places and manners of articulation, shared by all Northern Aslian (and other Aslian) languages described to date and adding up to approximately 20 phonemes. Thus, the points of articulation include bilabial, alveolar, palatal, velar, and glottal. Voiceless stops occur in all of these (/p/, /t/, /c/, /k/, /ʔ/), and voiced stops and nasals in all but the glottal position (/b/, /d/, /ɟ/, /g/ and /m/, /n/, /ɲ/, /ŋ/). Voiceless fricatives occur in alveolar (or palatal) and glottal position (/s/, /h/); approximants in bilabial and palatal position (/w/, /j/). All languages also have a lateral liquid /l/, and most of them have a rhotic /r/, which is realised in a multitude of ways. All

424

Kruspe, Burenhult and Wnuk

Table 1.2

Northern Aslian consonant phonemes (Consonants in parentheses do not occur in all varieties)

p b (pʰ) m

(ɸ)

t d (tʰ) n r l w

c ɟ ɲ

s (z)

j3

k g (kʰ) ŋ

ʔ

h

Northern Aslian languages except Ceq Wong add to this core set a voiceless bilabial fricative /ɸ/ (see e.g. Bishop 1996; Burenhult 2005; Phaiboon 2006). Some varieties also have a voiced alveolar or palatal fricative /z/ (Benjamin 1985b). Voiceless post-aspirated stop initials occur marginally in Kensiw and Maniq, the two northernmost languages of the group (Bishop 1996; Wnuk & Burenhult, in press). The phenomenon appears to be restricted to loanwords of Thai and Malay origin. Most Northern Aslian varieties have a system of contrastive vowel qualities that contains three degrees of height and three degrees of front-to-back. Ceq Wong departs from this typical three-by-three system and replaces central /a/ with an additional height to the front and back series, with /æ/ and /ɑ/, as in /læh/ ‘hort’ and /lɑh/ ‘sibling-in-law’.4 Furthermore, Bishop (1996) proposes for Kensiw a system with five degrees of height for front and central vowels, and four for the back vowels. This highly deviant system displays a level of quality variation that is hardly observed even at the phonetic level in other Northern Aslian languages, and Benjamin (2012) cautiously questions the analysis. Ceq Wong, Kensiw and Maniq supplement their systems with between 2 and 7 diphthongs. In all of Northern Aslian, oral realisations contrast with 3 The Northern Aslian phonemes in Tables 1.2 and 1.3 are represented in accordance with IPA standard. However, throughout this chapter, language examples are represented in the practical orthographies developed for the individual languages in accordance with much Aslian (and Southeast Asian) descriptive tradition. While these orthographies largely adhere to IPA notation, they depart from it in that the voiced palatal stop /ɟ/ is represented as j and the palatal glide /j/ is represented as y. Furthermore, epenthetic vowels are consistently left out. 4 The Ceq Wong vowel /æ/ is represented orthographically as ‘a’.

425

Northern Aslian

a slightly reduced set of nasal counterparts. Like Southern Aslian, but unlike most Austroasiatic languages, Northern Aslian does not have distinctive vowel length. Thus, its systems of contrastive vowel nuclei are comparatively small, the number of vowel phonemes in each language typically ranging between 15 and 20. Table 1.3

i e ɛ (æ)

Northern Aslian oral vowel phonemes (Vowels in parentheses do not occur in all varieties)

ɨ ə

( a)

u o ɔ (a)

2.2.1 Phonotactics In the final syllable, all consonant phonemes are attested in the onset, apart from the bilabial fricative /ɸ/ (Maniq is an exception here, /ɸ/ occurs in the onset realized as [ph]). The coda displays reduced possibilities with voiced stops never occurring in this position. In word final position following an oral vowel, the nasals are preploded [bm, dn, ᶡɲ, gŋ], e.g., [cɕadn] can ‘foot’, [həɲjaɟɲ] həɲyaɲ ‘to stand’, [kajeɡŋ] kayeŋ ‘to hear, to listen’ (tnz). Restrictional possibilities disallow the simultaneous co-occurrence of the rhotic /r/ in both onset and coda position, and nasal vowels are uncommon following a voiced stop. Consonant distribution is further reduced in the onset of non-final syllables, where the bilabial fricative and glides do not occur, and the palatal and velar nasals are also uncommon. The coda in the penultimate syllable is typically either a liquid, or nasal homorganic with the final syllable onset, e.g., tuŋkal ‘male’ (cwg). In some instances it is a nonhomorganic nasal, possibly an archaic infix, e.g. tmkal ‘male’, attested in Maniq, Jahai and Batek. All other phonemes can only occur in the penultimate coda if they are a copy of the final syllable coda (see §3.2). In trisyllabic words, the onset of the penultimate syllable is most often a nasal or sonorant phoneme, while that of the prepenultimate syllable may be a voiced or voiceless stop, or fricative. The prepenultimate syllable is always open. All phonemic vowels occur as the nucleus in the final syllable, with the distributional possibilities reduced in the penultimate syllable, e.g., phonemically nasal vowels are not found in this position. In prepenultimate syllables

426

Kruspe, Burenhult and Wnuk

non-phonemic epenthetic vowels replace phonemic ones, with the occasional exception, as in Ceq Wong kararow ‘sound of walking through forest litter’. The epenthetic vowel is typically [ə], see §2.1. 2.3 Suprasegmentals Northern Aslian languages typically do not make use of suprasegmental phenomena for contrastive purposes. However, there are brief reports of marginal tonal contrast in Jahai (Schebesta 1928) and Kensiw (Bishop 1996), and Hajek (2003) concludes that there is limited tonal activity in these languages. The evidence cited by Hajek is questioned by Wnuk & Burenhult (in press). 3

Word Formation

3.1 Compounding Most compounds are nominal, consisting of two nouns, or a noun and verb, of which the initial is the head. Compounds are usually compositional. Noncompositional compounds, where the meaning cannot be derived, at least synchronically, from the sum of the two parts, are less frequent, e.g., t�m tyəg (liquid banana) ‘final stage of post-partum bleeding’ (Ceq Wong). Compounds are frequent in the names of plants, animals, topographic features and body parts, e.g., Jahai kdek bapaŋ (rodent ?) ‘Prevost’s squirrel’, tɔm bɨʔ (water mother) ‘big stream’, jʔeŋ blɨʔ (bone thigh) ‘femur’, Maniq wac ʔɔs (Beautiful.tree.squirrel fire) ‘Plantain squirrel (Callosciurus notatus)’, sɔk ʔaŋkɛʔ (hair chin) ‘beard’. In Ceq Wong these classifying compounds can elide the head once they are established in discourse. This is not possible with other compounds. In Maniq, it is only possible to elide the head, if the remaining modifier does not have another meaning on its own, e.g. (wac) kukɛh ‘Pallas’s’ squirrel (Callosciurus erythraeus)’ but not *(wac) ʔɔs (ʔɔs ‘fire’). The associative construction is a syntactic construction that produces compound-like constructions with a range of semantic relations, from possessive to benefactive, see §4.4. 3.2 Derivational Morphology The main function of derivational morphology is to transfer words from one form class to another, either between word classes, e.g. nominalisation (§3.2.1), or within them between subclasses, e.g., Aktionsart (§3.2.2). The only valency increasing operation is causativisation; there is little evidence for valency decreasing operations. This appears to conform to the general low relevance of transitivity that characterises the grammar of these Northern Aslian languages.

Northern Aslian

427

Aslian derivational morphology is typically non-linear, although morphology borrowed from Malay tends to be exclusively prefixal. A single derivational process will potentially have a range of phonologically determined allomorphs in order to accommodate the prosodic constraints that determine the shape of the output. The constraints produce a complex interplay between faithfulness to the form of the affix, the prosodic structure of the output, and the general phonotactic constraints of the language. In the most extreme cases, the identity of the original root may be lost, as in the case of Ceq Wong causative. The competition of a fully prespecified prefix pi- ‘caus’ and a constraint requiring a maximally disyllabic output results in the deletion of the nonfinal syllable of disyllabic roots producing a monosyllabic stem, e.g., harən ‘to know’ → pi-rən ‘to make known’. Affixation is described in terms of three criteria: 1) the domain of attachment; 2) the prosodic structure of the output, and 3) the structure of affixes in terms of the degree of phonological prespecification. The criteria apply in parallel operations, and the order of presentation here is not indicative of an ordered series of operations. The domain of attachment is defined as the prosodic unit to which a morphological process applies. The domain is language specific, for instance in Maniq it is always located in the penultimate syllable, as illustrated in (a–c) below, whereas in Jahai there is marginal attachment also to prefinal syllables, as in (d). Ceq Wong patterns like Maniq for indigenous processes, but there is some marginal evidence of attachment to prefinal syllables with borrowed processes. a) Prefixation: kap ‘to bite’ → n-kap (nml-bite) ‘biting’ (cwg); wa ‘to go’ → ʔam-wa (incep-go) ‘to head off’ (tnz) b) Stem modification: haʔũt ‘to stink’ → nahũt (nml.stink) ‘stinking’ (cwg); həɲyaɲ ‘to stand’ → piyaɲ ‘to stand (of objects)’ (caus.stand) (tnz) c) Infixation: pset ‘to be spicy’ → pset (be.spicy) ‘spiciness’ (cwg); cilɔ ‘to hide’ → cilɔ ‘to be hiding’ (hide) (tnz) d) Prefixation: hgik ‘to be afraid’ → p-hgik (caus-be.afraid) ‘to frighten’ (jhi); saʔot → l-saʔot (iter-call.out) ‘to call out repeatedly’ (jhi)

Processes are governed by constraints on the prosodic structure of the output, which must conform to the prosodic well-formedness constraints that operate within the language (§2). These constraints are language specific. Furthermore, different processes may be subject to different output constraints. For instance in Jahai, derivations of disyllabic roots or bases are systematically trisyllabic, whereas in Ceq Wong, the maximal derived form is almost always disyllabic, reflecting a general phonotactic preference for disyllabic structures over

428

Kruspe, Burenhult and Wnuk

trisyllabic ones. Further, the output must conform to the prosodic well-formedness constraints that operate within the language (§§2.1–2.2). Given these constraints, processes of affixation may involve either the creation of a new syllable, or the restructuring of an existing syllable. In some cases, the penultimate syllable is replaced. The creation of a new syllable is demonstrated in a) above, where the creation of a new syllable expands a monosyllabic base to a disyllabic form. Affixation may also result in the restructuring of an existing syllable, for instance through the addition of new material creating a heavy syllable CVC from a light syllable CV as in the Ceq Wong example in c) above. Given that word stress is always fixed on the final syllable it is never affected by affixation. Manipulation of the base to accommodate constraints on syllable structure sometimes results in the violation of prosodic well-formedness. Epenthesis operates to repair this. The epenthetic vowel is usually [ə], e.g., Ceq Wong dias ‘to follow’ → n‑dias [nə‑dias] (nml-follow) ‘following’. Reduplicated bilabial and labio-velar codas may cause the epenthetic vowel to be realised as the high back rounded vowel [u], palatal consonants result in the high front unrounded vowel [i], and (in Jahai) copied glottal codas condition vowel realization as [a]; in Ceq Wong the onset may also cause conditioning, e.g., jok ‘to move residence’ → jk‑jok [ jik‑jok] ‘be moving residence’. Recursive derivation is richly attested in Jahai, where, for example, imperfective forms feed further derivational morphology in the form of progressives, iteratives, continuatives, and nominalisation (Burenhult 2005: 96). However, in Ceq Wong it is generally rare and restricted mainly to the causative stems. Causative stems feed the progressive, and nominalisation processes as shown in the following data: ŋɔ̃ ɲ ‘to be alight’ → c-ŋɔ̃ ɲ (caus‑be.alight) ‘to light’ → ɲi-ŋɔ̃ ɲ (nml.caus-be.alight) ‘lighting’. Affixes also vary as to the degree of phonological prespecification. There are three basic types. The first are fully prespecified, where both the syllabic structure and the phonological content of the affix are fully specified. These prefixes generally have the form of a syllable, like pi- ‘caus’. The second are partly prespecified, where only part of the affix is phonologically prespecified, the remainder consists of empty phonological slots, e.g., nominalising +n+ ‘nml’, which may manifest as either an onset, or coda depending on the prosodic structure of the base. Third, is underspecified affixation, which is essentially reduplication. The affix, or ‘reduplicant’ consists of a series of empty C and V positions conforming to a permissible syllable type, and phonological content is ‘copied’ from the final syllable of the base to which the affix attaches. These 3 types represent the basic building blocks of Aslian morphology. In addition it is possible to have combinations of these, for example, combining a partly prespecified affix, and copying, as in the affix p– ‘be X number’

429

Northern Aslian

where n�̃ y ‘one’ → py-n�̃ y ‘single, each one’. Whatever the permutations and combinations of the units, the output is always constrained in accordance with the phonotactic rules of the specific language. Full reduplication of lexemes is generally rather rare in Aslian. Some examples from Ceq Wong with numerals are n�̃ y n�̃ y (one~red) ‘all together’, limãʔ limãʔ (five~red) ‘in a group of five’, and with a stative verb deriving an adverb, as in ktaʔ ktaʔ (be.loud~red) ‘loudly’. In Jahai nouns may be reduplicated to express plurality or variety. 3.2.1 Deriving Nouns Processes of nominalisation derive verbal nouns that denote a state or manner, e.g., Jahai ʔnay ‘to bathe’ → ʔnay, ‘act of bathing’, unitized nouns from nominal roots, e.g., ktɔʔ ‘day’ → ktɔʔ ‘[unit of] day’, and to a lesser extent, nouns denoting physical entities, e.g., instruments, objects, or location, e.g., cɔm ‘to burn’ → nm-cɔm ‘swidden’. All of these are attested in Jahai and Ceq Wong (Table 1.4); in Maniq the physical entity type is more frequent than state or manner, while unitization is not attested, making the Maniq nominalisation less productive than in other Northern Aslian languages, see Table 1.5. The nominalising affix +N+ ‘nml’ is a partly specified segmental affix whose realisation depends on the structure of the base, and in some languages cooccurs with Coda Copy, the reduplication of the coda. The various realisations in Ceq Wong and Maniq are presented in the tables below. Table 1.4

Nominalisation in Ceq Wong n-

CVC

+

Cr/lVC

+

(C)C.CVC CV. CVC (n.)

CV.CVC (v.) CC.CVC (n.)

Nasal mutation N:

jəʔ ‘to heat’ → n-jəʔ ‘heating’ +

krit ‘to grate’→ n-krit ‘grating’

+

btow ‘be flooded’ → btow ‘flooding’ hlɔt ‘to faint’ → slɔt ‘fainting’ blwəɲ ‘be grue’ → blwəɲ ‘grue’ taləs ‘taro’ → taləs ‘a taro’ +

patuʔ ‘to try’ → m-atuʔ ‘trying’ pryok ‘cooking pot’ → p-n-yok ‘potful’

430 Table 1.5

CV(C)

CV.CV(C)

Kruspe, Burenhult and Wnuk Nominalisation in Maniq n-

nC-

+

+

+

dda (*da) ‘to move across’ → n-da ‘lateral direction’ lɛn ‘to tie a waistband’ → n-lɛn ‘waistband’ kɔŋ ‘to sieve’ → nŋ-kɔŋ ‘sieve’ kɨc ‘to grind tubers’ → kɨc ‘grinding stick’

+

kapa ‘to carry a child’ → kapa ‘cloth used as child carrier’ paʔɛ̃c ‘to rise (of sun)’ → paʔɛ̃c ‘the east’

Each language also has some minor nominalisations. In Maniq there is the marginal presence of a nominalising +m+, e.g. haluh ‘to blowpipe’ → haluh ‘bamboo sp. used for blowpipes’ (bamboo). In Ceq Wong there is a prefix ni- ‘qnml’ that derives measure nouns from a disparate range of word classes including stative verbs, quantity nominals, a locational noun and an aspectual adverb, e.g., jəsɨŋ ‘be long’ → ni-sɨŋ (qnmlbe.long) ‘length’. The habitual agent nominalization ‘anml’ combines a specified segment pn- and nominalisation +N+ of the onset. Monosyllabic roots prefix pn-, as in cɨp ‘to walk’ → pn‑cɨp [pə.nə.cɨp] ‘a wanderer’; disyllabic roots undergo nasal mutation pN-, as in kn�̃ m ‘to urinate’ → p-ŋən�̃ m ‘a perpetual urinator’. This affix is rare, and has not been attested with all root types. The prefix na- ‘incl’ derives a collective numeral ‘all X[num]’ from the indigenous numerals e.g., p�n ‘four’ → na-p�n (incl-four) ‘all four’; while ka- ‘ord’ derives ordinal numbers from both indigenous and Malay loans, e.g., ka-ber (ord-two) ‘second’ . 3.2.2 Deriving Verbs All Northern Aslian languages have productive verbal derivations including the causative, and various aspectual and Aktionsart derivations. Less prevalent

Northern Aslian

431

are processes to derive verbs from nouns. Most of the languages have a means of deriving verbal forms from numerals, and Maniq derives a small class of orientation / body posture verbs from body part terms. 3.2.2.1 Deriving Verbs from Nouns Noun-to-verb derivation is fairly marginal. Jahai derives intransitive property verbs from nouns by means of a borrowed Malay prefix b-, e.g. b-tɔm ‘to have/ use/contain water’ from tɔm ‘water’. In Maniq a different kind of noun-toverb derivation is carried out with the causative morpheme . The process applies to a small group of body part terms to derive orientation / body posture verbs, i.e. hadɛp ‘front side’ → hdɛp ‘to turn the front to s.o.’, kaʔɔʔ ‘back side’ → kyɔʔ ‘to turn the back to s.o.’, khkieh ‘side’ → khkieh ‘to turn the side to s.o.’, katoŋ ‘knee’ → ktoŋ ‘to bend knees’. Ceq Wong, Jahai and Maniq all exhibit processes of deriving verbs from numeral roots. Both Jahai and Maniq derive a verb from the numeral ‘one’ with the causative affix pi- ‘caus’, as in pi-nɛy ‘one by one’, and pi-nay ‘to make s.o./s.th. be on its own’, respectively. In Ceq Wong a prefix p- comprising a prespecified onset in conjunction with, coda copy derives a verb ‘a group of X num’ from the set of four indigenous numerals, e.g., ber ‘two’ → p‑ber ‘be in a group of two’. In Ceq Wong there is some limited zero derivation of verbs from a small subset of meteorological nouns, e.g., kəs ‘afternoon’ → ʔu=kəs ‘be(come) late afternoon’, and some stative verbs that can have a defined endpoint. 3.2.2.2 Deriving Verbs from Verbs Causative All Northern Aslian languages have a productive causative derivation, with pi- ‘caus’ common to all languages, see below, and Asmah 1976: 954–55 for Kentaq Bong. Ceq Wong lacks a productive causative infix unlike Jahai (Burenhult 2005: 108–109), and Maniq, and other Aslian languages (Benjamin 1976b and Semaq Beri, this volume). Instead it patterns like its neighbour, the Aslian isolate Jah-Hut (Diffloth 1976a). Most of the languages exhibit several causative forms, e.g., Batek Teq’s p-, pi- and (Kruspe field notes 2008), but in general one morpheme is more productive than the others. For example of the five Ceq Wong causative prefixes, pi-, p-, c-, s- and ki- ‘caus’, pi- is the only one that is productive. There is a suggestion of a semantic difference between the prefixes, e.g., pi-ʔam ‘to lay down’ compared to p-ʔam ‘to marry’, and pi-bɨt ‘to threaten to burn (s.o.)’ and p-bɨt ‘to heat (s.th.)’. A rare lexicalized form is p-bɨt ‘to reheat (s.th.)’ with an infix as in Batek.

432 Table 1.6

Kruspe, Burenhult and Wnuk Imperfective in Maniq / Progressive in Ceq Wong Prefix CC-

CVC

Infix

+

yek ‘to return’ → yk-yek ‘to be returning’ (tnz) dias ‘to follow’ → ds-dias ‘to be following’ (cwg)

C.CVC

+

slɔk ‘to wear’ → slɔk ‘to be wearing’ (cwg)

CV.CVC

+

bakɛ̃ɲ ‘to be bald’ → bakɛ̃ɲ ‘to be bald’ (tnz) lipat ‘to fold’ → lipat ‘to be folding’ (cwg)

Aspectual/Aktionsart Derivations Imperfective/Progressive Ceq Wong, Jahai and Maniq, and possibly all Northern Aslian varieties, have verbal derivations that invoke partial reduplication, where depending on the root structure, either the root coda, or onset and coda are copied and affixed to the root. This derives an aspectual distinction, like the Imperfective in Jahai (Burenhult 2005: 94) and Maniq, or the Progressive in Ceq Wong, see Table 1.6 below. In both Maniq and Ceq Wong, roots with complex onsets may behave like monosyllables. Maniq, monosyllabic CCVC roots with a complex onset copy the root onset (and coda) into the prefix, e.g., hyɛ̃ɸ ‘to whistle shrilly’ → hyɸhyɛ̃ɸ (ipfv-whistle) ‘be whistling shrilly’. In Ceq Wong roots with a /Cr/ or homorganic nasal plus stop onset pattern with monosyllabic bases, e.g., krit ‘to grate (cassava)’ → kt-krit (prog-grate) ‘be grating’. Curiously, disyllabic Ceq Wong roots with phonemic /a/ in the penultimate syllable lose the /a/ in the progressive derivation. This applies to both indigenous and borrowed roots as in kajəw ‘to take flight’ → kjəw [kuʷ.ˈɟəw] (take.off) ‘to be taking off’, and ʔageh ‘to share out’ → ʔgeh [ʔəh.ˈɡeh] (share.out) ‘to be sharing out’, respectively. In Jahai, imperfectives frequently feed further derivation by means of the borrowed Malay prefix /b-/ to produce progressives (Burenhult 2005: 96–99). A progressive category is also attested in Maniq: here the progressive /mV/ is a partly prespecified affix consisting of m followed by an underspecified vowel. It surfaces as a prefix in monosyllabic bases and as an infix in disyllabic bases. In disyllabic bases, the vowel is a copy of the initial-syllable vowel. In most occurrences, the progressive is accompanied by the imperfective affix (a copy of the final coda), e.g. bas ‘to run’ → ma-bas ‘to be running’ (prog-run), bikãt ‘to cry’ → bikãt ‘to be crying’ (cry). Progressive

Northern Aslian

433

derivation of h-initial disyllabic verbs almost always results in the suppression of the vowel in the initial syllable, although its copy in the form of an epenthetic vowel does surface directly to the right of , e.g. hipɛk ‘to fix, to secure’ → hipɛk [hmipɛk] ‘to be fixing, to be securing’. The progressive applies not only to dynamic, but also auto-causative-derived stative verbs, e.g. ɡəbaʔ ‘to be angry’ → ɡbaʔ ‘to be making oneself angry’ (be.angry ). Iterative Some varieties derive iteratives from dynamic verbs, in all attested cases by means of an affix involving the phoneme /l/. For Jahai, see Burenhult 2005:99–100. The Maniq iterative /l/ is a fully prespecified affix, prefixed to monosyllabic and infixed in disyllabic bases, e.g. wa ‘to walk’ → l-wa ‘to walk repeatedly’ (iter-walk), piŋɔk ‘to sit s.o.’ → pi-ŋɔk ‘to sit s.o. repeatedly’ (caus-sit). Like the progressive, it is frequently accompanied by the imperfective, e.g. cɔk ‘to pierce’ → l-cɔk ‘to pierce repeatedly’ (iter-pierce), ɲup ‘to be soft’ → l‑

ɲup ‘to be soft repeatedly’ (iterbe.soft), cakip ‘to bend one’s head down’ → ca

kip ‘to be bending one’s head down repeatedly’ (bend.head). Iterative signals the notion of temporal multiplicity (repeated action). Ceq Wong exhibits several examples of what appears to be an iterative (+lC+), however it is yet to be established as a productive process, e.g., bɔc ‘to speak’ → lc-bɔc ‘to prattle on’. Ceq Wong has a rare prespecified infix or fed by verbs, including expressive-like verbs that indicates multiplicity or distributed instantiations. It is only attested with sesqui- and di-syllabic roots as in jŋõh ‘appearance of s.o. sitting alone’ → jŋõh ‘appearance of many people sitting (in a house)’; saŋɛ̃m ‘a falling spark from a bark torch’ → saŋɛ̃m ‘many falling sparks’ and bluy ‘to suddenly appear alone’ → bluy ‘to suddenly appear in a group’. The verb cŋdɔ̃ ŋ ‘one person walking carrying something long on the shoulder’ derives cdɔ̃ ŋ ‘many people walking carrying s.th. long on one shoulder’. Again there are too few examples to posit a productive process. This derivation is similar to the Mah Meri distributive infix or , see Kruspe 2010. Distributive Jahai, Menriq and Maniq derive distributives from both dynamic and stative verbs. In Jahai and Menriq, the process involves copying of root consonants in combination with a prespecified vowel /i/, e.g. Jahai ʔik-ʔɛk ‘to distribute’ (from ʔɛk ‘to give’), cil-cɔl ‘to converse, to exchange stories’ (from cɔl ‘to tell’). In Maniq, the distributive derivation involves affixation of a fully prespecfied affix l, e.g. katuʔ ‘to have protruding spots (of monitors, turtles, etc.)’ → katuʔ ‘to have protruding spots (here and there)’ (have.spots). The resulting forms encode that the state or action denoted by the verb is

434

Kruspe, Burenhult and Wnuk

characterized by spatial multiplicity, that is, distribution of the action or state across several locations or participants. In Jahai and Menriq, distributives also form the main strategy of expressing reciprocity, as in miy-mɛy ‘to delouse each other’, tbɔh ‘to hit each other’ (Burenhult 2011; field notes). Inceptive The Maniq inceptive is expressed by ʔam- prefixed to m onosyllabic bases, or infixed in disyllabic bases. It indicates the beginning of a situation or entrance into a state. The disyllabic morpheme is similar to the disyllabic progressive morpheme . However, the two morphemes can normally be distinguished since in most cases the inceptive surfaces in the coda, while m in the progressive is predefined as syllable onset. Both dynamic and stative verbs can be derived with the inceptive, e.g. tiek ‘to lie/sleep’ → ʔam-tiek ‘to begin lying/sleeping’ (iter-lie/sleep), lawɛc ‘to climb’ → lawɛc ‘to begin climbing’ (climb), haŋɛc ‘to be cold’ → haŋɛc ‘to cool down’ (be.cold). Happenstance In Ceq Wong, the prefixation of t- ‘happ’ (from Malay ter-) derives the happenstance, as in t-yow (happ-see) ‘to happen to see’. When affixed to a reduplicated verb it expresses carelessness or lack of thought in engaging in an activity. Continuous aspect The full reduplication or repetition of a verb expresses the continuation of an event over time in Ceq Wong and Maniq and is labelled the continuous aspect. Both dynamic and stative verbs are used in the continuous. The verb is typically reduplicated two or more times. (1)

teʔ cɨp wel, cɨp~cɨp~cɨp~cɨp~cɨp 3du go again go~cont ‘They two went (on) again, walking and walking walking.’ (cwg)

(2) ʔɛʔ hah, ʔɛʔ balɛk~balɛk 3 open.mouth 3 lick.mouth~cont ‘It’s opening its mouth, it keeps on licking itself.’ (tnz)

In Ceq Wong there are two derivations, Intensive repetition/Continuative, and the Intensive state, both of which violate ordinary processes in terms of employing full reduplication at the syllable and word levels, and one with affixation on the left edge of the root. The intensive repetition, or continuative affix s- may be a Malay loan. The intensive state derivation is structurally similar to an expressive derivation in Semai (Tufvesson, pc.).

Northern Aslian

435

The Intensive repetition/Continuative derivation s-(CV)ˈCVC ‘INTNS’ consists of a prespecified prefix s- and total reduplication of the root, regardless of whether it is mono- or disyllabic. The prefix attaches to the left edge of the root, and the reduplicant occurs to the right of the root. The two phonological words constitute a single grammatical word. The process is fed by semantically compatible dynamic, stative and expressive verbs, and marginally by nouns and expresses intensive repetition over a sustained period of time: dor ‘to walk along (a branch, log)’ → s-dor~dor ‘to walk along one after the other’, and lsəm ‘to rain’ → s-lsəm~lsəm ‘to rain on and on’. In the Intensive state derivation, the whole monosyllable, or final syllable of a disyllabic form is reduplicated deriving a compound of two phonological words CVC~CVC, or CCVC~CVC by the repetition or ‘echo’ of the final syllable, as in saluc ‘be smooth’ → saluc~luc (be.smooth~intns) ‘be really smooth’ and jras�k ‘appearance of monitor lizard lying still’ → jras�k~s�k (lie.still~intns) ‘appearance of monitor lizard lying really still’. This process is restricted to stative verbs expressing physical property, and expressive verbs depicting movement. 3.2.3 Other Derivations Some languages in the Northern branch exhibit a process whereby the coda, usually either a glottal stop /ʔ/ or fricative /h/ is either replaced by, or mutates to a nasal, either the bilabial nasal /m/, or the alveolar nasal /n/, depending on the individual language. For example in Menriq and Batek, the personal pronouns have forms with a final /m/ for the irrealis, see §5.1.1.1. The process is at odds with the fact that these are otherwise typical prefixing/infixing languages, there are no other suffixes or enclitics in the language, and the final syllable is usually unaffected by any morphological processes. In Ceq Wong, nasal mutation of the coda is observed in a number of minor word classes, and marks epistemic modality. The roots that feed this process are: the existential verbs, e.g., wəʔ ‘to exist’ → wəm (exist.irr) ‘might be’; demonstratives, e.g., dəʔ ‘this’ → dəm (this.irr) ‘maybe this, and adverbs like yəʔ ‘as well, too’ → yəm (as.well.irr) ‘as well maybe’. A second set in Ceq Wong are the deictic locatives that replace final segments with an /n/, as in pɑh ‘upstream’ and hampɑn ‘upstream place’, see §5.3. Furthermore, the coda of the prefix han- assimilates to a nasal homorganic with the onset of the base. This process has no precedent in Northern Aslian, but a similar occurrence of nasal mutation is attested in Semelai where locative prepositions have a corresponding nominal form where the coda is a nasal

436

Kruspe, Burenhult and Wnuk

segment, see Kruspe 2004. The derived forms are used with a motion verb to designate a goal, as well as being used adnominally. hampɑn hantian hantən haɲyun

haN=pɑ(h)n haN=tia(h)n haN=tə(h)n haN=yu(h)n

loc.up.stream ‘upstream’ loc.down.stream ‘downstream’ loc.up.hill ‘uphill’, ontop ‘on top of’ loc.down.hill ‘downhill’, under ‘underneath’

3.3 Clitics Two types of clitic are identified: true clitics, and cliticising free morphemes. Both kinds are always proclitic to the left edge of a word or phrase, lack stress, and do not have meaning in isolation. True clitics exist only in their clitic form, while the latter are reduced forms of free morphemes. Examples of the former from Ceq Wong are the determiner ʔa= ‘det’ which attaches to demonstratives, and temporal adverbs, e.g., ʔa=nũʔ (det=that) ‘that one’, and from Jahai and Ceq Wong the linking clitic l= ‘id’, and the contrastive or focus marker d= ‘contr/foc’ which takes an NP as the host. In Jahai most prepositions are proclitics (Burenhult 2005: 115). Cliticising free morphemes are minor independent lexemes, that exhibit cliticising tendencies and behave like bound words, attaching to a phrasal constituent. In Ceq Wong multiple clitics of this type may occur in the one form, e.g., pi=nə̃=yəʔ (like=that=too) ‘like that too’ from piʔ ‘like’ and the demonstrative nũʔ ‘that’ proclitic to the adverb yəʔ ‘too’. The pre-verbal subject pronouns (obligatory with dynamic verbs in Jahai and Ceq Wong) are proclitic-like in that they do not receive stress. In Jahai they behave segmentally like free morphemes, but in Ceq Wong some, like third person singular, are reduced. 4

Phrase and Clause Structure

4.1 Basic Clauses 4.1.1 Constituent Order Most Northern Aslian languages have a basic SV/AVP constituent order, but ordering is not fixed resulting in significant flexibility. Moreover, there is also a high rate of ellipsis. There is a basic distinction between Dynamic and Stative clauses (§5.2). In the Dynamic clause, the subject (A or S) is cross-referenced by a bound pre-verbal pronoun in Jahai and Ceq Wong, as shown in (3) and (4). In Maniq only third person subjects are indexed by the number neutral pronoun ʔɛʔ ‘3’.

Northern Aslian

437

(3) ʔap ʔoʔ kap ba=mnraʔ tiger 3s bite goal=person ‘The tiger bit the person’. (jhi) (4) jɔ ʔu=ras kaʔ ʔɔs pn 3s=fall.over loc fire ‘Jɔ fell into the fire.’ (cwg)

In both Jahai and Ceq Wong, the irrealis markers replace the indexical subject pronoun in dynamic clauses. In Jahai the clitic wa= replaces the third person singular index, while ya= replaces all other persons (Burenhult 2005). In Ceq Wong the irrealis markers kun ‘irr’ and mãh ‘irr’ do not encode person. (5)

kun səc kaʔ jəʔ irr abandon loc wife ‘How could (you) just abandon (your) wife?’ (cwg)

Stative clauses typically do not cross-reference the subject, as in the following example. (6) tieʔ pasɛl ground be.dry ‘The ground is dry.’ (tnz)

Examples (7)–(9) illustrate the flexibility of constituent order in Ceq Wong. In the transitive clause both the A and P are post-verbal, as is the S in a single argument clause. Some languages, like Jahai, utilise morphological marking to encode permutations in constituent order (Burenhult 2005: 148–149), while others like Maniq and Ceq Wong do not. (7) ʔu=cəʔ [kuceŋ]A [kuy ʔɑy]P 3sg=eat [cat]A [head game]P ‘The cat ate the head of the game.’ (cwg) (8) ʔiŋ biʔ mɨ̃ʔ, hay=cɨp tut hay 1sg with 2sg 1du.incl=go be.alone 1du.incl ‘You and I, we’ll go together.’ (cwg) (9) pɨk cɑs, pɨk cɑn, pɨk ros be.depleted hand be.depleted foot be.depleted liver ‘The hands were finished, the feet were finished, the livers were finished.’ (cwg)

438

Kruspe, Burenhult and Wnuk

(10) ɡaw pãn ʔey tayes ʔɛʔ law ʔɛk wild.pig also father elder 3 tell give ‘The elder father also taught [a song about] wild pig (to x).’ (tnz)

Core arguments tend to occur following the verb, followed by obliques and adjuncts. (11) yaʔ ʔɛʔ ʔɛk kut ʔey woman 3 give bottle man ‘The woman gave man the bottle.’ (tnz)

High rates of ellipsis result in zero representation of NPs. (12) ʔiɲ kɛn 1sg take ‘I’ll take (it).’ (tnz) (13) sɛ̃p.sɛ̃p kɑʔ pɨk, hadeɲ eventually perf be.depleted be.finished ‘Eventually (it) was finished. (It was) all gone. (cwg)

4.1.2 Negation The various Northern languages utilise different strategies to negate a predicate. Predicate negators are described in §5.5. In Ceq Wong the negator həʔ ‘to neg.exist’ is the main verb, and the negated clause is the complement. The lower clause is always nominalized, and has no subject. Compare the non‑negated and negated clauses in (14). (14) ka=nɑ̃y k ɑʔ ʔiŋ=pi-lɨt, ka=nãy həʔ REL=one perf 1sg=caus-extinguish rel=one neg.exist mi-lɨt nml.caus-extinguish ‘One I already extinguished, (the other) one (I) didn’t extinguish.’ (cwg)

In some languages negation involves irrealis constructions. In Menriq and Batek Deq the pronoun in the negated clause bears the irrealis –m ‘irr’, as in example (15), and in Jahai the negator is followed by the verb with a procliticised irrealis pronoun, as in (16).

Northern Aslian

439

(15) nɛŋ yɛm kjeŋ neg 1sg.irr hear ‘I didn’t hear.’ (Batek Deq) (16) braʔ wa=cɨp neg 3sg.irr=go ‘He didn’t go.’/‘He won’t go.’ (jhi)

4.1.3 Questions In Ceq Wong, polar questions are formed with a question tag, like ʔay ‘q’, or bəh ‘q’; in Jahai with a clause-initial interrogative proclitic ha=. The response usually involves a repetition of the predicate as shown in (17). (17) Q: ʔu=hagoʔ pi.nə.yəm ʔay pɛ̃ʔ ? A: ʔu=hagoʔ pi.nə.yəʔ 3sg=ask like.that.too.irr q pn 3sg=ask like.that.too Q: ‘He would have asked like that wouldn’t (he), Pɛ̃ʔ ?’ A: ‘Yes, he asked like that.’ (cwg)

In Maniq polar questions are marked solely by intonation. (18) naʔ mɔh dɔy ka ŋ yəbac? foc 2sg come when k.o.fruit ‘So will you come in the [yəbac] season?’ (tnz)

Content questions contain a question word (see §5.1.1.2), which in Ceq Wong and Jahai is usually in clause initial position. In Maniq, question words may occur in situ as in examples (20)–(21). (19) hɑ̃n halaʔ ʔu=ŋɔ̃ k? where direction 3sg=live ‘Where does he live?’ (cwg) (20) ʔuʔ ʔɛn kəw, ʔuʔ mɔh cikieʔ kəw ʔɛʔ? 3sg this what 3sg 2sg snap what 3 ‘What (plant) is this, the one you snapped?’ (tnz) (21) mɔh pɔk kaw daʔ.ʔa? 2sg remove prox where ‘Where did you put that?’ (tnz)

440

Kruspe, Burenhult and Wnuk

(22) ʔu=kaloŋ cɔʔ pɛ̃ʔ, soloŋ ləy? 3sg=throw what pn first.night ‘What did he throw on the first night, Pɛ̃ʔ?’ (cwg)

4.1.4 Commands In Northern Aslian the positive imperative clause has the following features which distinguish it from other clause types: a) there is no bound subject pronoun on the verb (in the languages where it is usually obligatory); b) the verb is usually in the bare root form; and c) aspectual marking is restricted to future distinctions. The imperative clause consists minimally of a bare verb, but frequently it contains other elements, as illustrated in(23)–(25). In (25)the verb is ellipsed. (23) tas law! get.up quickly ‘Get up quickly!’ (cwg) (24) mɔh pãn hãw ʔawa 2sg distribute eat many ‘Give the food to the others!’ (tnz) (25) ka=toy sɑ jin nũʔ! loc=that little 2pl that ‘(Move) away there a little you there!’ (cwg)

The verb may be in a derived form as in the nominalized instance from Ceq Wong. (26) Context: The women are grating cassava on lengths of thorny rattan. n-daŋ n-daŋ biʔ cas! nml-watch nml-watch with hand ‘Watching out for (your) hands!’ (cwg)

Additionally there may be a vocative placed at the periphery of the clause as in (27), or a clause-final emphatic particle like Maniq kɛh ‘imp’, and Ceq Wong ʔen ‘imp’, or mɛ̃h ‘emph’ in (28). (27) koʔ pinãŋ tay! seek areca pn ‘Look for betel nut Tay!’ (cwg)

441

Northern Aslian (28) mɔh hãw 2sg eat ‘Eat!’ (tnz)

kɛh imp

In some varieties there is no distinct hortative construction. In Ceq Wong the invitational lah may precede a regular clause with a first person non-singular pronominal index as in the example below; in Maniq the imperative/hortative particle specific to contexts involving going to a place yah is used. (29) lah hay, hay=cəʔ! hort 1du.incl 1du.incl=eat ‘Come, let us eat!’ (cwg) (30) yah hiy kɛn hort 2du.incl get ‘Let’s go get (firewood).’ (tnz) (31) mɔh yah wa, ʔiɲ wa pãʔ 2sg hort go 1sg go also ‘Go, I’m going too.’ (tnz)

Jahai has a set of hortatives procliticised optionally to verbs in imperative clauses (Burenhult 2005: 113). Negative commands are formed by placing a negator before the verb. In Ceq Wong it is the negative verb yən ‘to not want’ which takes a clausal complement introduced by kɑʔ, as shown in (32). Jahai has a unique negator ʔakɛt ‘proh’, while Maniq utilises one of its regular negators, hat ‘neg’. (32) “ʔɑy” ʔu=bət “yən.kɑʔ cəʔ! ʔu=pantaŋ n-wəʔ mhə̃m” game 3sg=say NEG.IMP eat 3sg=taboo nml-exist blood ‘Game”, he said, “Don’t eat (it)! It is tabooed in the presence of blood.’ (cwg) (33) hat ɡɔh neg hit ‘Don’t hit!’ (tnz)

The negative markers may also be used alone as a command as in Ceq Wong yən! (neg.want) ‘Don’t!’

442

Kruspe, Burenhult and Wnuk

4.2 Nonverbal Clauses Nonverbal clauses consist of the apposition of two noun phrases comprising a topic and a comment. Nonverbal clauses fall into three semantic types: 1) identifying/presentative, example (34); 2) ascriptive as in (35), and 3) locative in (36). (34) ʔadiʔ ʔuh mõy pɨŋ younger.sibling 3sg mother pn ‘His younger sibling (is) Pɨŋ’s mother.’ (cwg) (35) bawur blan lɑh, hiʔɔ̃y piʔ ni-nɨ̃ʔ wɔŋ.cɑs stake spear.trap sibling.in.law be.small like mnml-be.big finger The stake of sibling-in-law’s spear trap, (its) smallness (was) like the size of a finger. (cwg) (36) tieʔ daʔ mɛt.katɔʔ, can ʔɛʔ daʔ kapin earth loc sun foot 3 loc upper.side ‘The earth (is) up in the air, his feet (are) up.’ (tnz)

In Jahai identificational clauses, the two NPs are conjoined by the identificational marker l= ‘id’, as in tahanɛn, l=kneh ʔoʔ (pn id=wife 3sg) ‘Tahanan (is) his wife.’ Maniq negates the verbless clause with regular negators, however Ceq Wong has a dedicated nominal negator yaŋ mãh ‘not’, illustrated in example (37). (37) bah palik kayoʔ neg bat Sunda.flying.lemur ‘Sunda flying lemur (is) not a bat.’ (tnz) (38) briʔ ləy dom lwəc raŋkal, yaŋ mãh buac deɲ person real that.IRR ascend ladder not irr Dusky.Langur longer ‘Real people were climbing the ladder, (they were) no longer Dusky Langurs.’ (cwg)

The perfect and continuative aspectuals kɑʔ ‘perf’ and səɲ ‘still’ may occur in the nonverbal clause in Ceq Wong. The subject may host the focus marker d= ‘foc’. (39) kɑʔ kaʔ moy lwɑy perf loc mother soul ‘(Her) soul (was) already with (her) mother.’ i.e. (in her mother’s possession). (cwg)

Northern Aslian

443

4.3 Complex Clauses Investigation into complex syntax in Northern Aslian has yet to be carried out systematically. The following sections note emergent features. Clause combining strategies vary across the languages of the Northern branch, however they can be characterised by a general low level of differentiation and structural complexity. Simple juxtaposition or chaining of both simple and complex clauses is the most common means of combining clauses. For many complex clause types there is no overt indication of the semantic relationship between clauses, and there is a reliance on pragmatics to determine interclausal relationships. While there are some indigenous conjunctions and connectives, the majority are loanwords from Malay, see §5.6. Complementation is discussed in §4.3.3. Serial verb constructions, widespread in Mainland Southeast Asia, are not regularly attested in Aslian languages, although see Maniq, §4.3.4 below. 4.3.1 Clause Chaining Clause chaining simply juxtaposes clauses with no overt indication of the semantic relationship between them, as shown in the following examples of a coordinated clause in (40), and a hypothetical clause in (42) below. Example (41) illustrates a backgrounded clause where a reduced clause without the subject pronoun, recaps the previous clause. (40) pi-cɔm ʔɔs, hɛʔ rbus bap caus-light fire 1incl boil tuber ‘Light a fire (and) we’ll boil tubers.’ (Batek Deq) (41) wek kaʔ hyaʔ, ʔu=daŋ həʔ tbas. return loc lean.to 3sg=see neg.exist slash ‘Returning home, he saw (that they) hadn’t slashed the undergrowth.’ (cwg) (42) koŋ biʔ tuŋkal, braniʔ woman with man be.brave ‘(If I were to go as) a woman with a man, (I would be) brave (enough).’ (cwg) (43) ʔu=kɨh ləw halaw jŋloŋ 3sg=explode bamboo chase ghost ‘The bamboo explodes (and it) chases the ghosts away.’ (cwg) (44) nɛŋ tahoʔ masaʔ kɛn yɛʔ neg know period child 1sg ‘I didn’t know (when) I was a child.’ (Batek Teq)

444

Kruspe, Burenhult and Wnuk

In Ceq Wong in the purposive, relative (§4.3.2), and complement clause constructions (§4.3.3), the dependent clause is nominalised as in (45). (45) sər skɔc! rub be.soft ‘Rub (so it is/becomes) soft!’ (cwg)

4.3.2 Relative Clauses Varieties differ markedly as to how clauses are relativised. Jahai has a dedicated relative marker k= ‘rel’ which is attached proclitically to the initial constituent of a phrase or clause to signal relativisation (Burenhult 2005: 123). In Maniq, relativisation is carried out through juxtaposition of the main clause and the relative clause (see ex. (68)). This is also the typical situation in Ceq Wong, where any juxtaposed modifying clause may function as a relative clause with no overt marking, as in example (46). The verb is typically nominalised, as illustrated in example (47). In some clauses the verb hosts a clitic ka= on the verb as in (48), reminiscent of the Jahai marker k= ‘rel’. Its precise function remains unclear, and it is tentatively glossed as ‘rel’, (see Kruspe in progress). (46) haliʔ.wat [ʔiŋ tək didɑʔ] hadeɲ yəʔ leaf_k.o 1sg pick yesterday be.finished too ‘The leaves I picked yesterday are finished too.’ (cwg) (47) wɔŋ biʔ [rɛʔ n-təs lən] offspring person [recent nml-arrive pst.prox] ‘(He is) the child of the people who just arrived a while ago.’ (cwg) (48) wəʔ jaʔ ka=toloŋ. exist 1du.excl rel=help ‘There were you two who helped (me).’ (cwg)

4.3.3 Complement Clauses A complement clause fills an NP slot of an argument of the verb, either S or P, with a clause. In Ceq Wong, two types of clause are noted. While most verbs, like mɛ̃ʔ ‘to want’ in (49) simply take a normal clause as the complement, the complements of negative verbs are nominalised. Complement taking verbs include verbs of knowledge, ability, and ordering among others.

445

Northern Aslian (49) ʔiŋ=mɛ̃ʔ ʔiŋ=car wɔŋ ʔu=doʔ 1sg=want 1sg=inspect child 3sg=that ‘I want to see that child of hers.’ (cwg) (50) ʔiŋ=hntɨŋ n-daŋ. ʔu=hit sɔk mɨ̃ʔ, ʔu=pər tarəŋ 1sg=be.afraid nml-watch 3sg=burn hair 2sg 3sg=touch lamp ‘I am afraid to look. Your hair will catch fire (if) it touches the lamp.’ (cwg)

In the prohibitive clause in Ceq Wong, həʔ n-kɔm V, the dependent verb V is in the bare root form and not nominalised. However in the negated abilitative use of kɔm, all subsequent verbs are nominalised. (51) həʔ n-kɔm bət kaʔ cɔʔ neg.exist nml-permit say loc name ‘We are not allowed to say the name.’ (cwg) (52) həʔ n-kɔm neg.exist nml-be.able ‘(It) isn’t able to fly.’ (cwg)

ŋapɔy nml.fly

In some instances, the complement clause precedes the main clause, as shown in the following example. (53) ʔu=tapɑt kaʔ talon, payɑh 3sg=hit loc snake be.difficult ‘It was difficult (for him) to hit the snake.’ (cwg)

4.3.4 Multi-verb Constructions Multi-verb constructions are common in Maniq. They are defined as strings of verbs forming single prosodic units. Since the language does not have explicit formal marking of subordination or coordination, the relationships between verbs in such constructions are difficult to determine. Another term, serialverb constructions (SVCs), is frequently used to describe similar constructions in other languages (e.g. Aikhenvald & Dixon 2006). However, the label multiverb constructions is preferred here in order not to exclude the types of verb complexes which are not prototypically associated with that label. The maximal number of verbs attested in a verbal complex in Maniq is four. Multi-verb constructions have not been reported for any other Northern Aslian language. It is possible that their presence in Maniq is a result of contact with Thai.

446

Kruspe, Burenhult and Wnuk

(54) mɔh kɛn kɔs kɨc pidɛŋ sunaʔ 2sg take peel grind show Dioscorea.daunea ‘Show how to peel and grind D.daunea.’ (tnz) (55) taʔɔʔ ʔɛʔ kap pantɛs tiger 3 bite be.quick ‘The tiger bites quickly.’ (tnz)

4.4 Noun Phrases The Noun Phrase (NP) in Northern Aslian has the general structure: (num/q) N (assoc) (VP) (dem). Individual languages place idiosynctratic restrictions on the combinatorial possibilities of the constituents. The head of the NP may be a simple nominal, a compound noun (56), or a nominalised verb as in (57). (56) t�m.bɨr ʔ�y stew game ‘a meat stew’ (cwg) (57) sbaʔ tnl�c shave dart ‘dart shavings’ (cwg)

The numeral precedes the noun it enumerates. In many varieties, but not Maniq, the noun may be in the derived unitised ‘unit’ form, as in Ceq Wong p�n briʔ (four person) ‘four people’. (58) ʔu=t-yow ber n-kleʔ 3s=happ-see two unit-tiger ‘He happened to see two tigers.’ (cwg) (59) ʔɛʔ ma-pɛs kamam kuy 3 prog-delouse two head ‘They are delousing each other (two of them).’ (tnz)

The same holds if a numeral is used with a classifier, e.g., ber hlay nɨs (two sheet mat) ‘two mats’, or n�̃ y dil hyãʔ (one fruit house) ‘one house’ (cwg). The quantifier also precedes the head, as in the example.

Northern Aslian

447

(60) yaŋ mãh dɨl briʔ, hnũn briʔ not IRR some people all people ‘Not some people, all people.’ (cwg)

Nominal modifiers typically occur in an associative construction where the nominal compound [NPhead NP] denotes a specific relationship between two entities. The associative construction is used to express a range of semantic relations, including possession, kin relations, part-whole, object material and so forth. Associatives are distinguished from non-compositional compounds by the fact that each constituent retains its original meaning and may be independently modified; in a non-compositional compound the meaning cannot be derived, at least synchronically, from the sum of the two parts, and the internal syntax is fixed. The associative construction occurs most frequently as a means of expressing simple possession. The nominal head precedes the possessor, which is expressed either by a noun or pronoun: NPhead NPposs. Other relationships are: part—whole kit loŋ (bum basket) ‘base of a back-basket’; entity—purpose srɔk pinãŋ (pouch areca.nut) ‘betel nut pouch’; kin—relationship klək ʔuh (parent-in-law 3sg) ‘her parents-in-law’; entity– source/material ʔɔs ləw (fuel bamboo) ‘fuel derived from bamboo’ and entity– benefactor tandaʔ biʔ ntəs con (sign person nml-arrive be.far) ‘a sign for people arriving from afar’ (cwg). (61) ʔu=daŋ wəʔ [cur [blaw buŋsuʔ]] 3sg=look.at exist pattern blowpipe Last.Born ‘She saw there were the markings of Last-born’s blowpipe.’ (cwg) (62) ʔay wɔŋ wac game child Beautiful.squirrel.(Callosciurinae) ‘the baby Beautiful squirrel (type of) game’ (tnz) (63) kuy bəwac head Pig.tailed.macaque ‘Pig-tailed macaque’s head’ (tnz) (64) hayãʔ mɔh house 2sg ‘your house’ (tnz)

Other modifiers of the head noun include verbs, prepositional phrases (67), or relative clauses. Stative verbs directly modify the head in Maniq and Ceq Wong

448

Kruspe, Burenhult and Wnuk

(65)–(66). In Maniq, noun phrases such as (65) have the same surface structure as simple stative clauses. In Jahai, verbal modifiers in the noun phrase are linked by means of a relativizing prefix t-. Dynamic verbs require relativisation in Maniq (68), or nominalization in Ceq Wong in (69). (65) ɲahuʔ ʔahaw tree be.big ‘big tree’ (tnz) (66) sac rɛʔ flesh be.new ‘the resurrected’ (Lit., new flesh) (cwg) (67) ʔɔs kaʔ cɑs ʔu=doʔ fire loc hand 3sg=that ‘that lighter in his hand’ (cwg) (68) baŋkaʔ ʔɛʔ wa katieʔ pãʔ Asian.forest.scorpion 3 go on.ground be.different ‘baŋkaʔ (the scorpion) which walks on the ground is different (from a baŋkaʔ plant)’ (tnz) (69) rɔŋ biʔ n-cəh sound person nml-fell.tree ‘the sound of someone felling trees’ (cwg)

Demonstratives always fill the final slot in the noun phrase. (70) mãm ʔiŋ dɑʔ food 1sg this ‘my food here’/‘this food (of) mine’ (cwg)

5

Word Classes

Northern Aslian has clearly distinguished word classes. Lexical items must be either morphologically or syntactically derived in order to function in a nonprototypical slot. Some exceptions do exist, and these are predominantly Malay loans, e.g., Ceq Wong clup (from Malay celup) which has both the nominal and verbal meanings ‘dye’ and ‘to dye’, and Jahai topih (from Malay tupi) with the meanings ‘headgear’ and ‘to wear headgear’. Distinct word classes are emerging

Northern Aslian

449

as a typical Aslian characteristic; for a similar situation in Southern Aslian see Kruspe 2004: 94–97 and this volume. However, in Maniq many verbal roots, especially stative verbs may be used as nouns with zero derivation, e.g., hatiek ‘to be long’ used in the subject or object slot stands for ‘the long one’. The potential of Maniq verbal roots to fill subject and object slots co-occurs with a limited use of nominalized verb forms in this language. The open classes are noun, and verb. The closed classes are preposition, pronoun, numeral, demonstrative, adverb, negator, connective/conjunction and interjection. Some closed classes, e.g. pronoun and numeral, are considered to belong in a ‘nominal’ superclass. The status of the word class of expressivetype words remains unclear in Northern Aslian languages, although they would appear to be a subclass within the verbal class, and not constitute a separate major word class as for instance in Semelai (Kruspe 2004), or Temiar (Benjamin 1976b). 5.1 Nouns Nouns typically have concrete entities as their referents, e.g., wɔŋ ‘offspring’, can ‘foot’, ʔɔs ‘fire’, ktɔʔ ‘sky, day’ and sɔk ‘hair’, or abstract concepts such as mniʔ ‘smell’ (cwg) and rway ‘head-soul’ (jhi). Nouns function as the heads of NPs, arguments of predicates, ‘subjects’ of verbless clauses, complements of verbless clauses, complements of prepositional phrases, and as clausal adjuncts. They may be modified by another noun, a demonstrative, stative and dynamic verbs, a quantifying expression or a relative clause. There are numerous processes deriving nouns from verbs. Few productive means of deriving verbs from nouns are identified. Morphology is discussed in §3.2.1 5.1.1 Pronouns and Question Words 5.1.1.1 Personal pronouns Personal pronouns typically have the same functions as nouns, as the heads of NPs, arguments of predicates, and vocatives. Pronouns may only be modified by demonstratives and temporal adverbs, e.g., Batek Deq ʔoʔ ton kɔŋ (3sg that pst.prox) ‘him there before’. Northern Aslian pronominal systems, like those in Central Aslian, distinguish singular, dual and plural categories, and have an inclusive/exclusive distinction in the first and second person. Most varieties collapse second and third person plural. Some varieties make a three-way distinction within second person singular, expressing degrees of intimacy, familiarity, and/or politeness (Burenhult 2005: 83; Bishop and Peterson 1994: 163–95). The third person pronouns may have an animate or inanimate entity as the referent. Table 1.7 shows the fairly typical Northern Aslian system present in Menriq.

450 Table 1.7

Kruspe, Burenhult and Wnuk The Menriq pronominal paradigm

Person

1incl 1excl 2 3

INT mɔh

sg

du

pl

yɛʔ

hɛy yɛh

hiʔ yɛpãh

FAM mɨʔ ʔuʔ

POL bɛʔ

jɨh wɨh

gɨn

Menriq and Batek have special pronominal paradigms for irrealis constructions, which in both languages involve the systematic replacement of the final consonant of each basic pronoun with /m/, e.g. Menriq yɛm ‘1sg.irr’, wɨm ‘3du.irr’, yɛpãm ‘1pl.excl.irr’, etc. (cf. Table 1.7). (71) yɛm diʔ nɨs 1sg.irr make mat ‘I want to make a mat.’ (Batek Deq)

Ceq Wong departs to some extent from the typical Northern Aslian pattern. There is no neutralisation in the second and third person plural, and no distinctions expressing degrees of familiarity or politeness in second person singular. In this respect the Ceq Wong paradigm resembles that of Central Aslian Temiar (Benjamin, 1976b). Table 1.8

The Ceq Wong pronominal paradigm

Person

sg

du

pl

1incl 1excl

ʔiŋ

hay jah

hɛʔ jaʔ ( jaʔ paʔ)

2 3

mɨ̃ʔ ʔuh

juh teʔ

jin gən

Northern Aslian

451

Another feature common to Northern Aslian is the presence of avoidance pronouns. These are used with affinal kinsmen with whom one should not directly engage, for instance, an opposite gender parent-in-law. In Ceq Wong the plural forms, jaʔ ‘1pl’, jin ‘2pl’ and gən ‘3pl’ function as in‑law avoidance pronouns. No distinction is made for number (see also Burenhult 2005: 83 for the Jahai set). Ceq Wong and Maniq have special collective compound forms consisting of a pronoun and kin term or personal name, as in Ceq Wong teʔ toʔ (coll elder. sibling) ‘elder siblings’; teʔ teʔ (coll husband) ‘co-husbands (in a polyandric marriage)’ or teʔ Bəw (coll pn) ‘Bəw and the kids’, where Bəw is the speaker’s wife’s name, and in Maniq wih toʔ kaw (coll elder.sibling here) ‘elder siblings here’; wih ʔa=Pɔ (coll pn) ‘ʔa=Pɔ and the group with him’. In Ceq Wong where the collective is of mixed gender, but there exists a gender specific term, the collective is formed on the female term as in teʔ mõy (coll mother) ‘parents’. Note that the pronoun is the third person dual exclusive teʔ ‘3du.excl’ in Ceq Wong, and the third person dual wih ‘3du’ in Maniq, but the collective may denote more than two individuals. Similar terms can be created with jah ‘1du.excl’ (cwg) and yɛh ‘1du.excl’ (tnz), [jah/yɛh pn], forming first person collectives, e.g., jah yɛ̃ŋ (1du.excl pn) ‘Yɛ̃ŋ and all of us (cwg)’, yɛh ʔay.Wang (1du.excl pn) ‘ʔay Wang and all of us (tnz)’. This does not occur as systematically in Ceq Wong as the forms derived with teʔ 3DU. 5.1.1.2 Interrogative and Indefinite Pronouns The interrogative words make distinctions for thing, person, place, time, manner, quantity, reason and manner, as shown in Table 1.9 below. The interrogative clause is discussed in §4. In Ceq Wong and Jahai, the locative is the only unique form, h�̃ n and lbah ‘where’, respectively. All other forms are composed of a nominal or unanalysable root, and question particle in Ceq Wong, Batek and Maniq, or an interrogative prefix in Jahai. In Ceq Wong the interrogative pronouns are compounds of a generic term and a question particle, as with cɔʔ ‘name’ cɔʔ (ʔay) ‘what’ and, cɔʔ mɛ̃h ‘why’; biʔ ‘person’ and biʔ ʔay ‘who’, cin gayə ‘how’, and the loanword (ba)paʔ ‘how much’ from Malay berapa. Individual languages use different strategies for indefinite pronouns. In Ceq Wong some indefinite pronouns are drawn from generic nouns like mnə ‘thing’ and biʔ ‘person’, while others are special forms like manɛ̃ʔ (manɛ̃ʔ n�̃ m) ‘whoever, anyone, whatever’, or in the case of the indefinite locative pronoun, a demonstrative ʔen n�̃ m ‘everywhere, wherever’. Used in negated clauses, indefinite pronouns function as negative pronouns.

452

Kruspe, Burenhult and Wnuk

Table 1.9

Interrogative pronouns in Batek Deq, Ceq Wong, Jahai and Maniq

Thing

Person

Place

Reason

Manner

Batek Deq

ʔay ləw

ʔoʔ ləw

baʔ ləw

ʔay ləw

–

Ceq Wong

cɔʔ (ʔay) biʔ ʔay

hãn

cɔʔ mɛ̃h

cin (gayə)

Jahai

mɛy

makɛn

lbah

Maniq

kaləw ~ kəw

kuʔ cɔʔ ~ kaləw cɔʔ

ʔa; daʔ ʔa (daʔ); kiʔ ʔa (daʔ)

maʔacin daʔ kaləw ~ daʔ kəw

ʔa naʔ

Time

Quantity

masaʔ ʔay ləw bʁapaʔ pon ʔay

(ba)paʔ

mapuʔ

mɛy siʔ

ʔathiem; katɔʔ ʔa

Maniq uses the noun maniʔ ‘person’ (often reduced to miʔ, which is also the Jahai exponent) as an indefinite pronoun. In addition, it employs the word nkyak (nominalization of yak ‘to be finished, to be exhausted’) to express meanings like ‘everyone’, ‘everything’, or ‘everywhere’. 5.1.2 Numerals, Quantity Words and Measure Words 5.1.2.1 Numerals Numerals consist of two small closed classes: a small set of indigenous numerals, and the borrowed Malay numerals which complement the indigenous set. Ceq Wong is unique in being the only Aslian language outside of the Southern branch to have an indigenous term above three, n�̃ y ‘one’, ber ‘two’, pɛt ‘three’ and pɑn ‘four’ (see Kruspe 2004, Diffloth 1976b). Apart from Kentaq Bong which has bye ‘two’ (Diffloth 1976b: 31), and Maniq kamam (pabi) ‘two’, most Northern languages have indigenous terms only for ‘one’, e.g., Jahai nɛy, and Kensiu and Menriq nay. Batek, however, has no indigenous numerals. Indigenous and borrowed numerals do not co-occur in compounds. Further, there may be language specific restrictions on certain collocations, for instance in Ceq Wong, Malay numerals are used in o’clock expressions, e.g. pukul tigaʔ, or jam tigaʔ ‘three o’clock’, but indigenous numerals are used to enumerate hours, e.g. pɛt ni‑jam (three nml-hour) ‘three hours’. Numerals feed various derivational processes, some of which are unique to the indigenous set, others to both, e.g., Ceq Wong pɛt ‘three’ > pt-pɛt ‘to be

Northern Aslian

453

three’, but limãʔ ‘five’ > limaʔ limãʔ, Jahai dwaʔ ‘two’ > dwaʔ ‘two-and-two’, ‘pairwise’. Numeral derivations are discussed in §§3.2.1 and 3.2.2. Numeral classifiers, common throughout Mainland Southeast Asia, are infrequent in Aslian languages, and when present are confined to a few basic semantic domains. In languages like Jahai and Ceq Wong, unitisation is the preferred strategy when enumerating. The classifier construction would appear to be a loan construction, and the terms in Ceq Wong are invariably Malay loans, or calqued on Malay. See Burenhult (2005: 82) on the dispensability of classifiers in Jahai. In Maniq too numeral classifiers are sometimes used, but are not obligatory. Unitisation is not attested. 5.1.2.2 Quantity and Measure Words Quantity is usually expressed by means of the unitized form of a noun referring to the object which contains the quantified matter, e.g. Jahai hapɨʔ (basket) ‘basketful’, banɨʔ (quiver) ‘quiverful’, ns-bas (unitbas) ‘busful’. Quantity is also expressed by quantifiers, some of which are also stative verbs, e.g., kanɛt ‘be small, little’, dɨl ‘some’, and hnũn ‘all’ (cwg). Save for occasional Malay loans (e.g. batu ‘stone’ for distance) measure words are not much in evidence. However, Jahai has an indigenous term soʔ for cubit, which is unitized when quantified: nɛy nʔ-soʔ (one unit-cubit) ‘one cubit’. 5.1.3 Proper Nouns Personal names come in a variety of forms, and each individual typically has different names for different contexts. Among the Jahai and the Batek, for example, ‘true names’ (or ‘forest names’) are distinguished from ‘village names’ used in the presence of non-indigenous outsiders. Jahai ‘forest names’ are drawn from plant species or products, often fragrant ones, or are nominalisations of actions or properties associated with the named individual (e.g. Cklĩk ‘looking sideways’), while ‘village names’ frequently are Malay-style Arabic names. Some individuals also have humourous nick-names. Teknonymy, i.e. the practice of referring to parents by the names of their children, is widespread among the Batek (Lye 2004; Tengku Intan et al. 2012). Endonyms other than the indigenous term for ‘human’ or ‘people’ are often lacking or are at least not in frequent use; most names of ethnic groups in administrative use are exonyms. Thus, Northern Aslian groups refer to themselves as either maniʔ, mnraʔ, batɛk, or briʔ, all being local terms for human/ people, usually with additional notion of ‘indigenous person’. The Ceq Wong use the term biʔ ‘person, body’ when naming other ethnic groups, e.g., biʔ jhɑy ‘Western Ceq Wong’ and biʔ lɔk ‘Chinese’.

454

Kruspe, Burenhult and Wnuk

Place names remain an understudied field in Northern Aslian. Jahai place names (described briefly in Burenhult 2008a) refer to catchment areas rather than habitations or distinct features of the landscape and almost invariably have mythical origins. Although phonologically well-formed Jahai words (typically monolexemic), they are also to a large extent semantically opaque and etymologically obscure, which suggests that they are extremely archaic. Others are analysable and then frequently draw their meaning from foraged plant and animal species or objects of material culture, believed to have been left behind or forgotten by the creation beings of the mythical past in the named area. Some place names display unusual morphological processes of reduplication. The following selection of Jahai place names denotes small catchment areas in the major watershed of Mendelum (Jahai Mɲjlom, a tributary of the Perak river east of the Temengor Lake): Hnuyal Pnitcɨt Silsyul Pɨn Srmjam Canlon Barʔɔp Ksiŋ ʔayam Rimrajɛ̃m Sysgɔy Pnslɨs Hmkrɛ̃k Kadɔy

[name of creation being, unknown etymology] [name of creation being, unknown etymology] [uncertain etymology, possible derivation of syul ‘king cobra’] [a type of tree with fragrant flowers] [name of creation being, unknown etymology] [a type of plant with fragrant leaves used for ritual decoration] [a type of plant] ‘wild lemongrass’ (Cymbopogon sp.) ‘chicken’ [name of creation being, unknown etymology] [uncertain etymology, possible derivation of sgɔy ‘to vaporise’] [name of creation being, unknown etymology] [name of creation being, unknown etymology] ‘lantern fly’ (Pyrops candelarius)

5.2 Verbs Verbs denote actions and states, and function primarily as predicates, determining the argument structure of the clause. Verbs may be modified by adverbs, and reduplicated to express intensity. They may also function as nominal modifiers, see §4.4. Verbs feed the following derivations: nominalisation, causativisation, and various aspect and aktionsart distinctions. Another category that some languages like Ceq Wong and Jahai either mark on the verb, or adjacent to it, is the irrealis. Two subclasses of verbs are identified: Dynamic, and Stative, see §5.2.1. Syntactic transitivity appears to be of low relevance and most verbs may be used in transitive or intransitive contexts without any formal marking. The

455

Northern Aslian

transitive/intransitive distinction appears to be of minimal relevance to the marking of grammatical relations, see §5.2.2. Causativisation is the only clearly attested valence-changing derivation, see §3.2.2. 5.2.1 Dynamic and Stative Verbs Dynamic verbs are those which denote an action, process, or human propensity. Dynamic is defined here according to Mathews (1997) as a verbal label denoting an action or process as opposed to a state, and not as a superordinate aspectual label. Some examples of dynamic verbs are: Ceq Wong səc harən

‘to abandon’ ‘to know’

hit hak

‘to burn’ ‘to abort’

Batek Teq hɨr ‘to enter’ cah ‘to fell’

ywek ‘to take home’ cmadɔk ‘to burp’

Maniq kɔp ɡamɔs

sa cadɛk

‘to bathe’ ‘to live’

‘to descend’ ‘to puncture’

Stative verbs denote states, including existence and non-existence, and properties such as dimension, physical property, value, age, distance, qualification, speed, and possibly quantification and colour. Included are complex descriptive forms embodying sensual responses to visual, aural, and tactile stimuli, typical of the expressive forms found in other branches of Aslian. Some examples of stative verbs are: Ceq Wong

pɨk bcoʔ

‘be used up, finished’ ‘be fat’

rɛʔ swət

‘be new’ ‘be red’

Batek Teq

bhəʔ kdɛk

‘be white’ ‘be bitter’

cŋlɛŋ braʔ

‘be clear’ ‘to not exist’

Maniq

tasɛk haɲɛp

‘be ripe’ ‘be dark’

ʔamɛt ley

‘be far’ ‘to exist’

Stative verbs also exhibit a number of morphological and syntactic peculiarities which distinguish them from dynamic verbs. For example, in Ceq Wong, stative verbs function attributively in their root form, as in wɔŋ hiʔɔ̃y (offspring be.small) ‘a small child’, whereas dynamic verbs require nominalisation biʔ n-təs (person nml-arrive) ‘people who arrived’ or biʔ n-prun (person

456

Kruspe, Burenhult and Wnuk

nml-clear.swidden) ‘people clearing a swidden’. In Ceq Wong, a subset of stative verbs may host a bound pronoun, deriving a dynamic inchoative verb: ‘become V’. There is no change in valence. Some of the verbs which are attested with this marking are: bək ‘to be full’ → ʔu=bək ‘to become full, fill, (intr.)’, soʔ ‘be rotten’ → ʔu=soʔ ‘to rot’, and badon ‘to be old’ → ʔu=badon ‘to become older’. In Central Aslian and the Southern Aslian language Semelai, expressives constitute a separate open word class along with nouns and verbs (Kruspe 2004 for Semelai). In Northern Aslian they are not formally distinct, and are treated together with stative verbs. Attested forms in Ceq Wong include kləh ‘appearance of someone slashing something’, baprɨp ‘appearance of people collapsing in a heap’, blŋkuŋ ‘be pot-bellied’, cher ‘something red seen in the distance’, cacɔ̃ s ‘to sound like oil splattering’. Some expressive verbs use vowel alternation to express a subtle change in meaning, a strategy exclusive to this subclass, e.g., gərəbɔk ‘sound of something being thrown away landing’, and gərəbuk ‘sound of something being thrown to someone and caught’, but this is not systematic. In Maniq several features associated with expressives— e.g., rich sensory meanings conveyed in compact forms, iconic form-meaning correspondences—are found in stative verbs. These verbs, however, are not formally distinct from other verbs in a way that would substantiate positing a separate expressive class. Instead of an expressive category as word class, Maniq has a depictive mode of speech. It is clearly distinct from the regular mode of speech since it is marked with special intonation, often has a performative character and is accompanied by iconic gesture. It frequently involves total reduplication and lengthening. The depictive mode is not restricted to a special class of items, but may be used with various types of words and utterances. Depictive utterances are usually introduced by the irrealis marker ya (often with lengthened final vowel, yaaaa). (72) haliʔ ʔɛʔ, cawãc yaa cawãc~cawãc~cawãc~cawãc leaf 3 be.striped.lengthwise irr be.striped.lengthwise~red ‘The leaf is striped lengthwise: cawãc cawãc . . .’ (tnz) (73) ʔɛʔ pak may, yaa pak may pak may (. . .) 3 hit shoulder irr hit shoulder hit shoulder ‘He hit (his) shoulder: pak may . . .’ (tnz)

5.2.2 Intransitive and Transitive Verbs Intransitive verbs take a single argument, usually either an agent or experiencer. Some motion verbs may express the goal or location as a bare NP

Northern Aslian

457

without a preposition. Intransitive verbs may have their valence increased by causativisation, see §3.2.2. The agent becomes the patient, and the new argument is introduced as the agent. Transitive verbs typically have two arguments, an agent (A) and a patient ℗. The patient typically lacks morphosyntactic marking. In both Jahai and Ceq Wong a P may be preceded by a preposition, its presence suggesting reduced affectedness of the object. In Ceq Wong the P marker is the locative preposition kaʔ ‘loc’, but instrumental biʔ ‘ins’ is also used. At present, there is little evidence to distinguish core and oblique relations syntactically. Therefore recipients, benefactives, addressee of speech verbs, and so forth are simply treated as obliques. In general oblique relations require an obligatory preposition, (see Prepositions §5.4), however in Maniq this is not the case as shown in the example here. (74) ʔɛʔ yek haluh ɡalɛs ʔɛk wɔŋ 3 return blowpipe carry.on.back give child ‘He returned from hunting, brought (a catch) and gave (it) to the child.’ (tnz)

5.2.3 Existential Verbs There are existential verbs meaning ‘to be, exist’, e.g., Ceq Wong and Batek Teq wəʔ, and Maniq and Jahai weʔ. Maniq has a second existential verb ley. There are also negative existential verbs ‘to not be, not exist’, Ceq Wong həʔ, Jahai and Batek Teq braʔ and Maniq hɨc. The negative existential forms also function as the verbal negator in some languages, see §4.1.2. (75) kɑʔ cɨp con laŋ dɑh. wəʔ kleʔ! neg.imp go be.far fut.prox niece.voc exist tiger ‘Don’t go far now, Niece! There are tigers.’ (cwg) (76) hɨc sɔk neg hair ‘There is no hair.’ (tnz)

5.3 Locational and Directional Words Categories of spatial deixis vary greatly across Northern Aslian varieties as to their form and function. All of them boast elaborate multi-term demonstrative systems, i.e. demonstrative paradigms with additional distinctions beyond the cross-linguistically prevalent two or three term distance-based systems. Jahai and Menriq have particularly rich paradigms of distinctions, illustrated by Jahai in Table 1.10.

458

Kruspe, Burenhult and Wnuk

In addition to the common distance- or accessibility-encoding distinctions, some languages have forms which encode elevation of the referent in relation to the deictic centre (‘there.up’, ‘there.down’); in Ceq Wong, Jahai, and Menriq (and possibly other varieties) there is a form dedicated to referents which are perceived with senses other than vision. Jahai and Menriq share a pair of possibly unique oppositions which encode the angular location of referents outside the speech dyad (Burenhult 2008b). Jahai (and arguably all Northern Aslian languages) have some distinctions which specialize in managing the attention of the addressee, e.g. drawing attention to a new referent, changing the attention from one referent to another, and confirming that attention is on the correct referent (see Table 1.10). In some languages the demonstratives interact with sets of locatives and directionals (see the Ceq Wong examples below). The Ceq Wong demonstratives are listed and describe in turn below. dɑʔ ‘this’ nũʔ ‘this’ doʔ ‘that’ dih ‘that’

dɑy ‘here’ toy ‘yonder’ ʔen ‘there’

The proximal demonstrative dɑʔ ‘this’ is used in the location of the speaker. The medial demonstrative doʔ ‘that’ is used to locate something away from the speaker, possibly in the location of the addressee, but if not, pertaining to shared knowledge. Doʔ co-occurs with directional adverbs, and in locative constructions. (77) daŋ tət doʔ, chər beʔ ŋ�� c look upstream that red.distant rice.plant just ‘(He) looked upstream there, nothing but the ripe red rice. (cwg)

Nũʔ ‘that’ is used for non-situational reference, frequently with temporal nouns and expressions, and in particular for managing referents in discourse as in (78). It combines with the proximal past lən ‘prox.pst’ in anaphoric function. nũʔ has a reduced form nə̃(ʔ). (78) ʔu=wek kuy ʔanɛ̃k nũ=lən kaʔ hyãʔ 3sg=return head nephew that=pst.prOx loc house ‘The nephew’s head returned to the house.’ (cwg)

Perceptual modality

Spatial frame of reference

Accessibility

Supercategory

Subjacent ʔnɛʔ

ʔuyih

Superjacent ʔitɨh

ʔɲɨʔ

ʔadeh

Speakeranchored

Addresseeanchored

ʔũn

ʔaniʔ

ʔon

Addresseeanchored

Speakeranchored

Addresseeanchored

tnɛʔ

tuyih

titɨh

tɲɨʔ

tadeh

tũn

taniʔ

ton

Confirms that addressee’s attention is on referent

Attention-managing properties in exophoric use

Perceived only through its emissions, not through direct visual or tactile experience of its inherent physical properties (heard, smelled, felt in the skin or body)

Located below speech situation (underneath, downhill, or downstream)

Located above speech situation (overhead, uphill, or upstream)

Located outside addressee’s side of speech perimeter

Located outside speaker’s side of speech perimeter

Inaccessible to addressee (unfamiliar, unestablished, unidentified, unattended to)

Draw addressee’s attention to referent

Shifts addressee’s inaccurate attention to correct referent

Inaccessible to speaker (distal, imperceptible, unreachable, inapproachable, etc.)

Accessible to addressee (familiar, established, attended to, manipulated)

Accessible to speaker (proximal, perceptible, reachable, approachable, etc.)

tə̃h

Speakeranchored

ʔə̃h

Referent characteristics in exophoric use

Oppositions Adverbial Nominal

Emission– perceptible

Elevation

Exterior

–Access

+Access

Parameter

Table 1.10 The Jahai demonstrative system

Northern Aslian

459

460

Kruspe, Burenhult and Wnuk

Dih that is used for something perceived nonvisually, that is either heard, smelled, felt, or thought of. (79) ʔu=hm�ŋ prow, “dih ʔah” ʔu=bət 3sg=listen sound.leaping that.sns ah 3sg=say ‘He heard the sound of leaping through the canopy, “Ah there!” he thought (to himself).’ (cwg)

The function of a fifth term, toy ‘yon/yonder’ is unclear as it behaves both as an adnominal and an adverbial, as in hyãʔ ʔiŋ toy ‘my house across there.’ toy may be modified by doʔ ‘that’. It is used for something within or conceived of as within view, but away from the speaker and hearer. In addition to the set of contrastive demonstratives are the presentatives nĩn ‘this’ used to accompany a manual demonstration, and hnõʔ ‘this here in my hand’. The form nĩn is cognate with the Jah Hut demonstrative nin ‘this’ (Kruspe 2002 field notes). Dɑʔ ‘this’ and doʔ ‘that’ may take a determiner proclitic, as in ʔa=dɑʔ ‘this one’. All the demonstratives, apart from dih have forms with a final m indicating uncertainty or lack of certainty, as in dɑm ‘that one?’ The third person singular pronoun may be bound to the demonstrative as in ʔu=doʔ ‘him there’. The regular Ceq Wong demonstratives do not encode distinctions for elevation as in Jahai, but this may be achieved by employing deictic directional adverbs and related derived locative terms that encode distinctions such as ‘uphill’ and ‘downstream’. (80) lər ja=jok tiap dɑʔ so 1pl.excl=move.residence downstream this ‘So we moved downstream here.’ (cwg) (81) hyãʔ ʔuh hampɑn house 3sg LOC.upstream.LOC ‘his house upstream’ (cwg) (82) ʔiŋ=cɨp ka=tiah 1sg=go loc=downstream ‘I went downstream there.’ (cwg)

Ceq Wong has a set of adverbs that encode direction, anchored to topography of the land, or watercourse. Table 1.11 illustrates the directional adverbs and related locatives. See Semaq Beri, this volume, for a similar set of adverbs.

461

Northern Aslian Table 1.11 Locational and directional terms in Ceq Wong Adnominal

Goal

Locative

Adverb

‘the place X’

pɑh ‘upstream’ tiah ‘downstream’ təh ‘uphill’ yuh ‘downhill’ toy ‘across’

ka=pɑn ka=tiah (doʔ) ka=tən ka=yuh (doʔ) ka=toy doʔ

hampɑn hantian hantən also ‘on top of’ hanyun ‘underneath’ hantoy ‘yonder’

tət tiap nũl dɨt/nũt toy

ni-tət ni-nũl ni-nũt

(83) ʔiŋ cɨp tət 1sg go upstream ‘I went upstream.’ (cwg)

The terms nũl ‘upward’ and nũt/dɨt ‘downward’ are also used for strict verticality as in bɑy nũt ‘to dig vertically (for tubers)’ and in the following example. (84) lər ʔu=juwek dɨt yəʔ then 3sg=bring.home down too ‘Then she brought him down home (to the earth)’. (cwg)

The locative nominal forms, e.g., hantian ‘upstream’, are derived by prefixing han- to a spatial deictic base, and mutation of the coda consonant of the base to the alveolar nasal /n/, see §3.2.3. Maniq has a set of 6 demonstrative pronouns distinguishing referents based on distance and a broadly conceived notion of accessibility (both physical and epistemic). Several forms specialize in the management of addressee’s attention. ʔɨ̃h ʔɛn ʔun ʔum ʔɔm hiyun

accessible and usually proximal to speaker, presentative, attention drawer accessible to both speaker and addressee inaccessible or new to addressee, attention drawer inaccessible and usually distant from speaker, often out of sight distant in space or time, employed as intensifier, e.g., ʔamɛt ʔɔm ‘very far’ (ʔamɛt ‘to be far’) identification of referent established in discourse, attention confirmer

462

Kruspe, Burenhult and Wnuk

In most situations, more than one pronoun is appropriate. For instance, ʔɨ̃h and ʔɛn can both refer to proximal objects and in most contexts they appear to be interchangeable. However, speakers show a preference for ʔɨ̃h in situations involving pointing or demonstration of an object/feature, often involving object manipulation. (85) (picking up various organs while butchering a monitor lizard) ʔɨ̃h rus this liver ‘This is the liver.’ (tnz)

Demonstrative pronouns may be employed adnominally and pronominally. In addition, they occur in verbless clauses where they function as the predicating element. (86) hɨc yak hayãʔ ʔɛn neg finish house this ‘This house is not finished.’ (tnz) (87) ʔiɲ pɔk.key ʔun 1sg dislike that ‘I don’t like/want that (one).’ (tnz) (88) ʔɛn basiŋ this Dusky.Langur ‘This is a Dusky leaf monkey.’ (tnz)

All demonstrative pronouns may also be employed as locational demonstratives. In this function, they normally occur with locative prepositions daʔ or kiʔ (apart from hiyun). (89) hɨc ɡɔ̃ p kiʔ ʔɛn neg Hog.badger loc this ‘There are no Hog badgers here.’ (tnz)

Hiyun is a specialized pronoun used to confirm that the addressee’s attention is on the referent. It is also used when referents established in discourse become identified in space, as in the example below.

463

Northern Aslian

(90) (A group of Maniq is watching a movie where Batek men are climbing to get petai. There’s a brief argument between the Maniq as to what the Batek are actually getting. In the final scene a group of Batek men stand in a row presenting bunches of petai—it is now clear that those who thought it was petai were right. Several people utter simultaneously) Hiyun here ‘Here.’

5.4 Prepositions Northern Aslian languages have numerous prepositions expressing distinctions like location, direction, source, comitative, instrumental, contrast, equation, and possession. Prepositions form a phrasal constituent with nominals, sometimes in the form of free morphemes, sometimes as proclitics. They may express subcategorised complements of the verb, or circumstantial roles. A distinction needs to be made between ‘true’ prepositions and locative or relational nominals, whose spatial semantics sometimes partly subsume the topological information encoded in some prepositions and render them redundant. For example, in Jahai relational nouns like klɛŋ ‘inside’ and krpiŋ ‘upper side’ sometimes pre-empt the use of the locative- and goal-marking prepositions (Burenhult 2012: 25). Prepositions from three languages are presented in Table 1.12. All three languages tend to have simple monosyllabic forms for most prepositions. Both Ceq Wong and Maniq collapse the locative and goal distinctions, while Ceq Wong also has only the one form for both the comitative and instrumental. In Jahai, the locative and instrumental are expressed by the same term. Jahai lacks a possessive preposition, while Maniq lacks the contrastive/focus marker d=. Ceq Wong collapses the locative and goal into a single preposition kaʔ ‘loc’. An alternative preposition taŋ ‘loc’ is also used. Table 1.12 Prepositions in Ceq Wong, Jahai and Maniq Loc

Ceq Wong Jahai Maniq

Goal

kaʔ k=/ka=

Source

Com

lah ba=

daʔ/kiʔ/nɨŋ

Ins

biʔ

Poss

Contr

Similarity

nɑ̃ʔ

d=

piʔ

cn=/can=

ʔalɔʔ

ka=

–

d=

pn=

nataŋ

koʔ

(kiʔ)

(kɔŋ)

–

h ay

464

Kruspe, Burenhult and Wnuk

(91) gən=blawan kaʔ tɛʔ kaʔ tam kaʔ tɛʔ 3pl=fight loc ground loc water loc ground ‘They fought on the ground, in the water, (back) on the ground.’ (cwg)

A set of terms that encode distinctions for elevation, see §5.3, combine with the locative preposition. ka=pɑn ka=tiah (doʔ) ka=tən ka=yuh (doʔ) ka=toy doʔ

loc=up.stream ‘upstream’ loc=down.stream ‘downstream’ loc=up.hill ‘above, uphill’ loc=down.hill ‘below, downhill’ loc=there.lateral ‘yonder’

Source is expressed by lah ‘from’. lah is also used for topological space, for example lah kit (from bottom) ‘at the base’. High frequency motion verbs, like cɨp ‘to go’ and wek ‘to return’, which encode direction are frequently used without a preposition to introduce the goal or source. The comitative and instrumental are both expressed by biʔ ‘with’. The possessive preposition nɑ�ʔ ‘poss’ occurs in a construction N nɑ�ʔ N, where the former N is the possessum and the latter is the possessee, paralleling the associative construction. The prepositions kaʔ, taŋ ‘loc’ and biʔ ‘com/ins’ are also used to optionally introduce P arguments, and to encode the recipients of verbs of transfer, and the addressee of verbs of communication. Maniq has three locative prepositions: daʔ, kiʔ and nɨŋ. The semantically general daʔ and kiʔ combine with relational nouns without restrictions, e.g. daʔ kapin ‘at the top’ (kapin ‘upper side’), kiʔ kaʔɔʔ ‘at the back’ (kaʔɔʔ ‘back’). Nɨŋ is restricted to ‘on’ and ‘in’-types of relations hence its combinatorial properties with relational nouns are limited, e.g. nɨŋ nɛy ‘inside’ (nɛy ‘inside’). As in Jahai, the use of relational nouns may pre-empt the use of prepositions. All locative prepositions are also used to encode goals. Both location and goal may be introduced without any locative elements, especially if spatial information is encoded in the verb. (92) ʔanciʔ ʔɛʔ tul cɔ̃ ŋ tubers 3 be.in basket ‘Tubers are in the basket.’ (tnz)

Instruments and possessors normally occur without prepositions. However, it is possible to use them. The possessive preposition kɔŋ is a loanword from Thai.

Northern Aslian

465

Also several words contain the locative *ka- which is no longer used productively in Maniq, e.g., katieʔ ‘on the ground’ and kahɨp ‘forest’. The locative meaning of these words is partially bleached since they are often introduced by other locatives, e.g. daʔ katieʔ ‘on the ground’. 5.5 Negators The individual languages vary as to the number and type of negators. In Ceq Wong there is only one negator, the nominal negator yaŋ (mãh) or ʔayɑŋ ‘not’, that does not have a primary function in another word class, while in Maniq there are numerous negators, including hɨc, bah, hat ‘neg’ and ʔayaŋ ‘not’, and in Batek Deq there is nɛŋ ‘neg’ and ʔayaŋ ‘not’. Jahai has indigenous braʔ, occasionally replaced by borrowed Malay bokan. In Jahai, Maniq and Ceq Wong, the negative existential verb doubles as the verbal negator, see §4.1.2. 5.6 Conjunctions/Connective words Northern Aslian languages each have a handful of connectives, often Malay loans. For example, six connectives are identified in Ceq Wong, the conditional mɛ̃n ‘if’, kalaw/ʔilaw ‘if’, the causal lah ‘because, on account of’, sampay ‘until’, and the discourse connectives lər ‘so then’, and ləpas ‘after’ from Malay lepas ‘after’. Maniq uses the following conjunctions: takɨh ‘after’, kasɛɲ ‘before’, kaŋ ‘when, at the time of’, ʔɔʔ pãʔ ‘or, and (between clauses)’, hat ‘or (in questions)’. 5.7 Adverbs Most Northern languages have a small class of lexical adverbs expressing aspect, manner, time, degree and frequency. Aspectual adverbs tend to occur in pre-verbal, or clause initial position, e.g., Maniq bɔʔ ‘just’, whereas temporal, manner, degree and frequency adverbs tend to be post-verbal in a modifying role, like Maniq hapay ‘only, just’, hiyɛn ‘soon’ and kiyaʔ ‘again’. Stative property verbs complement this word class and function as modifiers to verbs. In this case they usually follow the verb. Some adverbs are grammaticalised verbs, for example the Ceq Wong and Batek Teq aspectual rɛʔ ‘just’ signaling the recent change of state or completion of an activity, which is related to the stative verb rɛʔ ‘be new’, and Maniq kabi ‘just’ related to kabi ‘to be new’. The Ceq Wong adverb balek ‘again as before’ to repeat an activity is from a Malay verb balik ‘to return’. (93) bɔʔ miʔ tɔ ʔam-ŋɔk just Maniq marry incep-sit ‘When someone (Maniq) has just gotten married, they begin to live (together).’ (tnz)

466

Kruspe, Burenhult and Wnuk

(94) mɔh dɔy kabi kaw 2sg come new prox ‘You’ve just come.’ (tnz) (95) kamaʔ.baprɨp gən=hlɔt balek collapse.in.heap 3pl=faint as.before ‘Collapsing in a heap, they fainted again as before.’ (cwg)

5.8 Particles Aslian languages generally have few particles, a feature common also to the languages of the Northern branch. Clause final particles include question tags as in Ceq Wong bəh ‘q’, assertive validation particles teh ‘asrt’ used where a speaker asserts certainty about the facts, and emphatic markers like mɛ̃h ‘emph’. (96) ʔuh, pi=nũm teh, ʔu=bət buŋsuʔ 3sg like=that.IRR asrt 3sg=say youngest.born ‘He, just like that I’d say, Youngest Born spoke.’ (cwg) (97) The speaker invites someone to share his tobacco: njɔt mɛ̃h! smoke emph ‘Go ahead and smoke (it)!’ (cwg)

Burenhult (2005) refers to unstressed pre-verbal subject-marking pronouns in Jahai as ‘subject particles’. 6

Lexical Semantics

6.1 Lexicalisation Principles Northern Aslian languages display (as do other Aslian languages) a general principle of lexicalisation which involves highly specific meanings encoded in distinct monolexemic forms. The pattern is particularly pervasive in verbs. Actions or states are systematically conflated with semantic dimensions like location, direction, patient, instrument, manner, matter, and so on, and compressed into unanalysable lexemes. This results in a remarkably rich, semantically fine-grained, and culturally entrenched inventory of verbal lexemes. For example, the basic set of Jahai motion verbs conflate motion with the geophysical feature on which the movement occurs, as well as the directionality

Northern Aslian

467

of movement on the feature in question: rkruk ‘to move along a large watercourse’, cɨk ‘to move across a large watercourse’, tigil ‘to move along a hillside’, jɔh ‘to move up a hillside’, gɛs ‘to move down a hillside’, gəw ‘to move along the crest of a ridge’, etc. (Burenhult 2008a: 193–195). Similarly, in the domain of placement, Jahai verbs of donning conflate attachment with the identity or material characteristics of the attached object as well as the bodily location of attachment: lɛn ‘to don a loincloth’, bəl ‘to don a string around one’s waist’, tɛl ‘to don a headband’, caduk ‘to don an oblong object in one’s hair’, etc. (Burenhult 2012: 34–35). In the domain of physical separation (cutting and breaking), Maniq verbs may encode features of the object, the instrument, as well as the manner of destruction: kaʔɛt ‘to cut with a bladed instrument’, giet ‘to cut a non-rigid object’, cel ‘to cut hair’, tis ‘to break by pulling, to pluck’, coh ‘to hit with an instrument’, ɡɔh ‘to hit with hand’, cadɛk ‘to puncture’, bakah ‘to crack’, dahaʔ ‘to snap a long object’, and tɛs ‘to sever a rigid object’. Furthermore, most Northern Aslian languages divide up the domain of ingestion into lexically distinct verbal categories which encode general classes of food, e.g. Jahai gey ‘to eat starchy food’, hɛ̃w ‘to eat vegetables’, but ‘to eat ripe fruit’, muc ‘to eat meat’ (Levinson & Burenhult 2009). Among stative verbs, verbs of olfaction form a particularly conspicuous set. Jahai, Maniq and Menriq, for example, each has a set of at least a dozen abstract smell verbs which encode a complex array of parameters to express distinct odour qualities, e.g. Jahai plʔeŋ ‘to smell raw (of e.g. fresh fish, meat, egg, stagnant water, and iron)’, pʔus ‘to smell musty (of e.g. mushrooms, old dwellings, decaying vegetation, coffee, dried spices, and stale food)’, cŋɛs ‘to have a stinging smell (of e.g. petrol, smoke, paint, bat droppings, and roots of some wild gingers)’ (Burenhult & Majid 2011; Majid & Burenhult 2014; Wnuk & Majid 2014). Also impressively elaborate are lexica related to other perceptual domains. For instance, Maniq has a rich color and visual surface vocabulary encoding chromatic properties along with background information on the object, such as reflectance, stage of growth of a plant, spatial arrangement, e.g. yəhəy ‘to be shiny red (of animal eyes at night)’, bayel ‘to be light green and sometimes slightly red (of young leaves)’, hlawãk ‘to have distributed patches of white (of wing tips of the great hornbill, Buceros bicornis)’. Analogous semantic specificity is evident also in some nominal domains. For example, in Jahai body part terminology, monolexemic forms systematically denote fine-grained anatomical detail, e.g. nus ‘upper lip’, tnɨt ‘lower lip’, bliŋ ‘upper arm’, prbɛr ‘lower arm’, blɨʔ ‘upper leg’, gor ‘lower leg’, etc. Jahai lacks terms for higher-level anatomical categories like lip, arm and leg and instead prefers to lexicalize body parts at a finer level of granularity.

468

Kruspe, Burenhult and Wnuk

6.2 Foraging and Specialized Ritual Vocabulary in Ceq Wong Ceq Wong has an extensive set of avoidance terms, unparalleled among the Northern languages. Avoidance terms are used when a speaker is under a taboo against naming an entity or an activity, for example when planning or undertaking a hunt, or foraging, or when staying out in the forest. The terms name animal and vegetal entities, activities, and typical possessions that one would take into the forest. Naming the entity would jeopardise a successful outcome, and depending on the type of taboo, result in an attack by a tiger (kleʔ), snake (talon), or centipede (kʔɛp). For instance, using the normal term for rattan, instead of ʔɑy l=ʔac war (game id=intestine) will result in the tiger killing the perpetrator and draping his/her intensines in the forest like rattan. Some examples are: bajaw l=sənjɨg (game id=breathe) ‘flying squirrels’, ʔɑy l=mɛ̃d kleʔ (game id=eye tiger) ‘dammar (lamp)’, ʔɑy l=daraŋ mɔ̃ h (game id=hole nose) ‘bamboo’, ʔɑy l=lɲhũɲ (game id-fragrance.of.Agila.wood) ‘Agila wood’. ʔɑy l=lhɛŋ talon (game id=saliva snake) ‘water, river’, and ʔɑy l=kuy kleʔ (game id=head tiger) ‘cooking pot’. The following example illustrates the use of the terms with bajaw l=lhɛŋ ‘river’ and bajaw l=purpur ‘to bathe’. The speaker interchanged ʔɑy with bajaw. (98) ʔiŋ mɛ̃ʔ cɨp kaʔ bajaw.l=lhɛŋ, ʔiŋ mɛ̃ʔ bajaw.l=purpur 1sg imm go loc river 1sg imm bathe ‘I’m going to the river, I want to bathe.’ (cwg)

Another example of specialised vocabulary related to activities in the forest is the set of terms used for a first night in a new camp. Normal terms must be replaced with avoidance terms to repel attack by a tiger, e.g., majaw ‘to fish in the evening’ instead of pgam ‘to fish’, and caləs ‘to hunt’ instead of the usual verb pndap ‘to hunt with a blowpipe’. An example is present in the accompanying text in §7. 7 Text An excerpt from the Ceq Wong traditional narrative Citərə tɑʔ jaŋlũt5 ‘The Story of Elder Two-headed Worm’, told by KP, and transcribed by Kruspe with the assistance of JM. The bracketed text contains commentary from the conarrator, KJ. 5 Citərə tɑʔ jaŋlũt, Ceq Wong Corpus, DOBES archive, Max-Planck Institute for Psycho linguistics, Link: http://hdl.handle.net/1839/00-0000-0000-0018-CAC2-B.

Northern Aslian

469

ʔu=cɨp ktɔʔ nũʔ pndap buac.6 pndap buac, 3sg=go day that hunt.blowpipe Banded.Langur hunt.blowpipe Banded.Langur ʔu=yow ploʔ prɛh.7 ʔu=daŋ “hoy” ploʔ prɛh lbat. sʔɛ̃ŋ, 3sg=see fruit Perah 3sg=look.at wow fruit Perah be.heavy be.black

gɨm. lwəc. ʔu=lwəc, ʔu=kwɑt. ʔu=yey pɛt nulas. be.partially.ripe. go.up 3sg=go.up 3sg=hook.down 3sg=carry three nml.piece ‘He went that day blowpiping Banded Langur. While he was hunting Banded Langur, he saw Perah fruit. He looked, “Oh! the Perah fruits were plentiful”. (Some were) black and some were partially ripe. (He) climbed (it). He climbed up and hooked the fruits down. He took three fruits.’ ʔu=wek kaʔ hyãʔ, ʔu=bət kaʔ jəʔ. “ʔa= dɑm” ʔu=bət “ploʔ prɛh. 3sg=return loc house 3sg=say loc wife det=this.irr 3sg=say fruit Perah hndɑʔ hɛʔ balek, pi=hndɑʔ yən.kɑʔ bət kaʔ teʔ.mõy!” [mɛ̃ʔ tomorrow 1pl.incl go.back tomorrow neg.imp say loc coll.parents want puas.] mɛ̃ʔ n-cəʔ tut. ʔah gən ktɔʔ nũʔ n-ləp, kɨŋ hyãʔ, satisfied want nml-eat be.alone ah! 3pl day that nml-camp.out build lean.to ʔu=lwəc prɛh. lpas n-kɨŋ hyãʔ, lwəc. ʔu=kwɑt mɛ̃h. 3sg=go.up Perah after nml-build lean.to go.up 3sG=hook.down emph ‘He went home and told his wives, “Look at this here”, he said, “Perah fruit. Tomorrow we’ll go back, tomorrow. Don’t tell our parents!” [Wanted to eat (his) fill.] (He) wanted to eat alone. Ah, next day they (went) camping out in the forest, made a lean-to, and he climbed the Perah tree. After making the lean-to, (he) climbed (it). He hooked the fruit down.’ gən=kal mɛ̃h teʔ jəʔ, gən=kal mɛ̃h, teʔ jəʔ lən, 3pl=dehusk emph 3du wife 3pl=dehusk emph coll wife pst.prox

6 Presbytis femoralis. 7 Elateriospermum tapos. The fruit is classified under the Tiger taboo, and the avoidance name is ʔɑy l=pɛt.

470 lwhyãw piʔ gushing.out like

Kruspe, Burenhult and Wnuk mhə̃m. blood

ʔu=bɨŋ. ʔitɔŋ 3sg=be.formed like

toʔ ʔuh lən ʔu=kal elder.sibling 3sg pst.prox 3sg=dehusk

tuhwɑʔ ʔu=kal ʔuh. “ʔah” ʔu=bət kaʔ teʔ be.ripe 3sg=dehusk 3sg ah! 3sg=say loc 3du

jəʔ. “dɑʔ laŋ wek kaʔ hyãʔ” ʔu=bət kaʔ teʔ jəʔ, “bat wife this FUT.PROX return loc house 3sg=say loc coll wife do cɔc. mɨ̃ʔ laŋ” ʔu=bət kaʔ jəʔ ka=nɑ̃ y “bat lawot. ʔiŋ dɑʔ boil.peeled 2sg FUT.PROX 3sg=say loC wife REL=one do boil.whole 1sg this laŋ mndap hntəʔ FUT.PROX nml.hunt accompaniment

buac” ʔu=bət. Banded.Langur 3sg=say

‘The wives dehusked the fruit. The wives there dehusked the fruit, (and the contents) came gushing out like blood. The elder sibling dehusked the fruit and the contents were firm, as if they were ripe. She dehusked (them). “Ah”, he said to his wives, this (when) we get home, this”, he said to his wives, ‘boil (it) peeled. You now”, he said to the other wife, “boil (it) whole. I will go hunting now to get Dusky Langur as an accompaniment”, he said.’ “nũm mɨ̃ʔ laŋ” bət teʔ jəʔ “yən wəʔ punɛ̃n!”. “hah” ʔu=bət that.irr 2sg F UT.PROX say coll wife NEG.IMP exist taboo what.say? 3sg=say “braʔẽn, hɛʔ kbɨs”. ʔu=cɨp mndap. [ ɲaləs mɛ̃h, hɛʔ=bət who.cares 1pl.incl die 3sG=go nml.hunt nml.hunt emph 1pl.incl=say hɛʔ.] 1pl.incl “That, you there”, said the wives, “Don’t (speak like that)! It’s taboo.” “Hah, as if,” he said, “Who cares (if) we die”. He went blowpipe hunting. [ ɲaləs is (what) we say.] ʔu=cɨp mndap. [rɛʔ tibaʔ cɔʔ ɲaləs.] tibaʔ katɛʔ, ʔu=yow tupay. 3sg=go nml.hunt be.new arrive name nml.hunt arrive in.forest 3sg=see squirrel ʔu=hapit, 3sg=clamp

ʔu=pasah. 3sg=singe.off

lpas after

ʔu=pasah, ʔu=cəʔ 3sg=singe.off 3sg=eat

Northern Aslian ʔu=sapuʔ mhə̃m kaʔ haliʔ klawəŋ. 3sg=wipe blood loc leaf wild.yam

471 ɲĩk.ɲĩk hɑɲ kleʔ. look.tiger.mouth mouth Tiger

‘He went blowpipe hunting. [(If we) have just arrived in the forest, the name (for hunting) is (substituted by) ‘ɲaləs’]. Reaching the forest, he saw a squirrel. He clamped (it) (and) singed off the fur. After he singed off the fur, he ate the squirrel. He wiped the blood on a klawəŋ yam leaf.8 (In the distance), the tiger’s mouth was quivering with the look of a mouth about to bite a transgressor.’9 lpas nũʔ ʔu=cɨp. lpas cɨp, kɑʔ con sɑ masuk cŋdoŋ jhũʔ after that 3sg=go after go perf be.far a.little pass.under be.leaning tree tkam kleʔ kaʔ hŋkɨŋ. tkam kleʔ “ʔəw!” karaw. cɔʔ mãh pounce tiger loc back pounce tiger sound.scream scream what irr wəʔ ka=wəʔ kap~kap kleʔ. exist rel=exist bite~cont tiger ‘After that he walked on. After (he had) already gone quite far, (he) passed under a leaning tree. The tiger pounced on his back. The tiger pounced, “Aargh!” he screamed. As if there was anything left, the tiger biting and biting (into him).’

8

Bibliography

Aikhenvald, Alexandra Y. & R.M.W. Dixon (eds.) 2006. Serial verb constructions: a crosslinguistic typology. Oxford: Oxford University Press. Asmah, Haji Omar. 1963. Bahasa Semang: Dialek Kentakbong. Unpublished honours thesis. University of Malaya. ———. 1976. The verb in Kentakbong. In: Philip N. Jenner, Laurence C. Thompson & Stanley Starosta (eds), Austroasiatic Studies, Part II, Honolulu: University Press of Hawaii, 951–970.

8 The blood can be seen to this day on the red-tinged underside of the leaf of this species of yam (Araceae). 9 The tiger hasn’t appeared yet. In the distance, he is aware that someone has transgressed a taboo. In this case, the man has committed punɛ̃n by first not telling his kin about the fruit, secondly by not using the taboo term, and thirdly by eating the squirrel without sharing.

472

Kruspe, Burenhult and Wnuk

Benjamin, Geoffrey. 1976a. Austroasiatic subgroupings and prehistory in the Malay Peninsula. In: Philip N. Jenner, Laurence C. Thompson & Stanley Starosta (eds), Austroasiatic Studies, Part I, Honolulu: University Press of Hawaii, 37–128. ———. 1976b. An outline of Temiar Grammar. In: Philip N. Jenner, Laurence C. Thompson & Stanley Starosta (eds), Austroasiatic Studies, Part I, Honolulu: University Press of Hawaii, 129–187. ———. 1985a. In the long term: three themes in Malayan cultural ecology. In Karl L. Hutterer, A. Terry Rambo & George W. Lovelace (eds.), Cultural Values and Human Ecology in Southeast Asia University of Michigan: Center for South and Southeast Asian Studies, 219–278. ———. 1985b. On pronouncing and writing Orang Asli languages: a guide for the perplexed, Part 1. Orang Asli studies newsletter, vol. 4, Hanover, NH: Department of Anthropology: 4–16. ———. 2012. The Aslian Languages of Malaysia and Thailand: An Assessment. In Peter Austin & Stuart McGill (eds.) Language Documentation and Description, London, SOAS. Vol. 11, pp. 136–230. Bishop, Nancy. 1996. A preliminary description of Kensiw (Maniq) phonology. MonKhmer Studies: 227–254. Bishop, Nancy & Mary M. Peterson. 1994. Kensiw glossary. Mon-Khmer Studies 23:163–195. Burenhult, Niclas. 2003. Attention, accessibility, and the addressee: the case of the Jahai demonstrative “ton”. Pragmatics 13: 363–379. ———. 2005. A Grammar of Jahai. Canberra: Pacific Linguistics. ———. 2006. Body part terms in Jahai. Language Sciences 28: 162–180. ———. 2008a. Streams of words: hydrological lexicon in Jahai. Language Sciences: 30.2/3:182–199. ———. 2008b. Spatial coordinate systems in demonstrative meaning. Linguistic Typology, 12(1), doi:10.1515/LITY.2008.032: 99–142. ———. 2011. The coding of reciprocal events in Jahai. In Nicholas Evans, Alice Gaby, Stephen C. Levinson and Asifa Majid (eds.) Reciprocals and semantic typology, Amsterdam: John Benjamins, 163–176. ———. 2012. The linguistic encoding of placement and removal events in Jahai. In Anetta Kopecka and Bhuvana Narasimhan (eds.) Events of putting and taking: a crosslinguistic perspective. Amsterdam: Benjamins, 21–36 Burenhult, Niclas & Asifa Majid. 2011. Olfaction in Aslian ideology and language. Senses & Society. 6.1:19–29 Burenhult, Niclas, Nicole Kruspe & Michael Dunn. 2011. Language history and culture groups among Austroasiatic-speaking foragers of the Malay Peninsula. In N.J. Enfield (ed.), Dynamics of human diversity: The case of mainland Southeast Asia. Canberra: Pacific Linguistics, 257–277.

Northern Aslian

473

Diffloth, Gérard. 1975. Les langues mon-khmer de Malaisie: classification historique et innovations. Asie du Sud-est et Monde Insulindien 6 .4: 1–19. ———. 1976a. Jah-Hut: an Austroasiatic language of Malaysia. In Nguyen Dang Liem (ed.), South-east Asian linguistic studies 2. Canberra: Research School of Pacific and Asian Studies, Australian National University (C.42), 73–118 ———. 1976b. Mon-Khmer numerals in Aslian languages. Linguistics (Special issue: Austroasiatic number systems) 174: 31–8. Dunn, M., Burenhult, N., Kruspe, N., Tufvesson, S. and Becker, N. 2011. Aslian linguistic prehistory: a case study in computational phylogenetics. Diachronica 28. 28.3: 291–323. Dunn, Michael, Nicole Kruspe & Niclas Burenhult. 2013. Time and place in the prehistory of the Aslian languages. Human Biology, 85: 383–399. Hajek, John. 2003. Evidence of tonal activity in Aslian. Mon-Khmer Studies: 59–66. Kruspe, Nicole. 2004. A grammar of Semelai. Cambridge Grammatical Descriptions. Cambridge: Cambridge University Press. ———. 2009a. Loanwords in Ceq Wong, an Austroasiatic language of Peninsular Malaysia. In: Martin Haspelmath & Uri Tadmor (eds), Loanwords in the world’s languages: a comparative handbook of lexical borrowing, Berlin: Mouton de Gruyter, 659–685. ———. 2009b. Ceq Wong vocabulary. World Loanword Database. http://wold.clld.org// vocabulary/26. ———. 2014. Semaq Beri. This volume, 475–516. ———. 2010. A dictionary of Mah Meri as spoken at Bukit Bangkong. (Oceanic Linguistics Special Publication no. 36). Honolulu: University of Hawaii Press ———. (In progress.) A grammar of Ceq Wong. Levinson, Stephen C. & Niclas Burenhult. 2009. Semplates: a new concept in lexical semantics? Language 85: 153–174. Majid, Asifa & Niclas Burenhult. 2014. Odors are expressible in language as long as you speak the right language. Cognition: 130.2: 266–270 Matthews, P.H. 1997. The concise Oxford dictionary of linguistics. Oxford: Oxford University Press. Peterson, Mary M. 1997. Kensiw grammar: basic declarative, interrogative and imperative propositions. Thammasat University: TU-SIL-LRDP. http://msealing.info/digidata/495.992/b07428.pdf. Phaiboon, Duangchan. 2006. Glossary of Aslian languages: the northern Aslian languages of southern Thailand. Mon-Khmer Studies 36: 207–224. Schebesta, Paul. 1928. Grammatical sketch of the Jahai dialect, spoken by a Negrito tribe of Ulu Perak and Ulu Kelantan, Malay Peninsula. (Trans.) C.O. Bladgen. Bulletin of the School of Oriental Studies: 803–826.

474

Kruspe, Burenhult and Wnuk

Tengku Intan Suzila Tengku Sharif, Mohd Yusri Mohamad Noor and Badli Esham Ahmad. 2012. Language change or preference: the Batek Aslian names. Proceedings of the 7th Malaysia International Conference on Languages, Literatures, and Cultures, 2012. www.fbmk.upm.edu.my/micollac/proceedings Wnuk, Ewelina. (in progress). Semantics of verbs of perception in Maniq (PhD dissertation), Manuscript in preparation. Wnuk, Ewelina & Niclas Burenhult. (in press). Contact and isolation in hunter-gatherer language dynamics: Evidence from Maniq phonology (Aslian, Malay Peninsula). Studies in Language. Wnuk, Ewelina & Asifa Majid. 2014. Revisiting the limits of language: The odor lexicon of Maniq. Cognition 131.1: 125–138 Yager, Joanne. (in progress). Northern Aslian linguistic prehistory: tracing the effects of contact. MA thesis, Lund University.

chapter 2

Semaq Beri Nicole Kruspe 1 Background Semaq Beri (szc) belongs to the Southern branch of Aslian (Diffloth 1975, Benjamin 1976, Dunn et al. 2011), along with Semelai (Kruspe 2004a, 2004b), Mah Meri (Kruspe 2010, 2011, in progress) and Temoq. There are approximately 2400 Semaq Beri speakers occupying a large area in the Malay Peninsula, north of the Pahang River, and east of the Tembeling River. The exonym Semaq Beri, a calque on the Malay Orang Hutan (person forest) ‘forest person’, was bestowed by colonial authorities when the Semaq Beri came under their attention in the 1950s. Prior to this they had been labeled Sakai (Skeat and Blagden 1906) or Jakun (Evans 1920). Like numerous other Aslian groups, historically they did not have an endonym that named the group as a whole. The Semaq Beri have sometimes been erroneously referred to as Semelai, pace Needham (1974). While this is the name with which some of them associate themselves, the ethnonym Semelai has long been established with a different Southern Aslian group who live predominantly in southwestern Pahang. In the north, ‘Semelai’ is the ethnonym used by the Batek for their Southern Aslian neighbours (Kruspe fieldnotes). The Semaq Beri are a heterogeneous group, in the south displaying the more typical attributes of the Southern Aslian grouping—combining the collection of forest produce for trade and low-level horticulture—while in the north engaging in mobile foraging like their Northern Aslian neighbours. Despite this heterogeny, they identify themselves as a unified group with kin in far-flung areas, and provide ethnohistorical motivations for their current distribution. This paper deals specifically with the speech variety of the northern Semaq Beri of Terengganu. Although they have not been included in the Semang ethnographical sphere, they live like their Batek neighbours combining hunting and foraging with the occasional collection of forest produce for trade, or waged labour (Kuchikura 1987). Their society is egalitarian and the conjugal family is the only persistent social unit. There is a weak patrifiliative bias, but no fixed pattern of post-marriage residence. Strict cross‑sex avoidance is maintained with affinal kin, and also between parents and children, and between siblings after the onset of puberty.

© koninklijke brill nv, leiden, ��5 | doi ��.��63/9789004283572_008

476

Kruspe

Exogamous marriage forms an integral component of the foraging mode, and the Semaq Beri maintain close social relations across their extensive territory with other groups of Semaq Beri. The northern group also have close relations with the neighbouring Batek, resulting in inter‑Aslian bilingualism absent elsewhere in the Southern branch. Northern Semaq Beri exhibits both lexical and grammatical influence from North Aslian, which is less prevalent in southern varieties (Kruspe fieldnotes). The Semaq Beri have remained largely unknown both linguistically and ethnographically. Kuchikura’s research on subsistence ecology (1987) provides the only ethnographic research of northern Terengganu Semaq Beri. The earliest linguistic record of Semaq Beri is in Blagden’s “Comparative vocabulary” (Skeat and Blagden 1906: 503, 507–764). Evans (1915, 1920) provides the first ethnographic materials and wordlist from the people he named the Tekai River Jakun. No further linguistic materials were published until Endicott’s 1975 paper. Based on comments from Diffloth, he reports that there were a number of words loaned from Jah-Hut (an isolate to the west) and Ceq Wong (a Northern language to the southwest of the Jah-Hut). The authors of a brief report on Semaq Beri syntax noted dialect variation between the two groups they surveyed (Nik Safiah Karim and Ton bt. Ibrahim 1977). Semaq Beri shares considerable lexicon with Semelai, but they are mutually unintelligible. The major lexical differences in Semaq Beri are the presence of Northern Aslian loanwords; some cognates shared with Mah Meri, but not Semelai, and some loanwords possibly from the isolate Jah Hut. The borrowings from Northern Aslian are of interest as they exhibit closer resemblance to Ceq Wong (Kruspe, unpublished database), a language with which present day Semaq Beri has no contact, more so than the Batek language spoken in the same settlement. Overall, Semaq Beri has a lower loanword rate from Malay than either Semelai or Mah Meri, particularly in the domains of kinship and ethnobiology, making it particularly valuable for historical reconstruction. In terms of syntax and derivational morphology, Semaq Beri has both similarities and differences with Semelai and Mah Meri. Semaq Beri exhibits infixing coda reduplication like Semelai but unlike Mah Meri, while Semaq Beri has a causative infix with a nasal segment, attested in fossilised Mah Meri forms (Kruspe 2010), but not present in Semelai. Semaq Beri exhibits a typical Southern Aslian pronominal system §5.1.1.1, but one unusual feature in the system is a gender distinction, ja ‘2sg.f, female’, and heʔ ‘2sg.m, male’, not found elsewhere in Aslian. The male term is clearly cognate with Mah Meri hiʔ ‘2sg’, however the etymology of the feminine form remains unclear. This chapter is based on the author’s ongoing research. Research is conducted at a resettlement community of approximately 300 people, both

Semaq Beri

477

Semaq Beri and Batek, in Terengganu.1 The settlement was established in the late 1970s when the communities were encouraged to settle permanently. Both Semaq Beri and Batek are spoken in the settlement, but Semaq Beri is dominant. Malay is spoken with varying degrees of proficiency. The people subsist by collecting forest produce for trade, supplemented by local foraging for food. 2 Phonetics/Phonology Consistent with other Southern Aslian languages, Semaq Beri has both aspirated voiceless stops, and a lack of vowel length. Atypically, it has phonemic diphthongs, which are otherwise unattested in Semelai and Mah Meri (Kruspe 2004; 2010). Typical of most Aslian languages, Semaq Beri lacks a tone or register distinction. Stress is always on the final syllable, and there is no notable secondary stress. 2.1 Syllable and Word Structure The minimal syllable in Semaq Beri is [CV]; the maximal syllable is [CCVC]. Sequences of contiguous vowels in closed syllables [CVVC] are treated as diphthongs, and represented phonemically as /CVC/, e.g., ɡuən ‘to fetch water’, rather than /CˈGVC/, where G is a glide, e.g., *ɡwən. There are however, also instances where a sequence [CVVC] is interpreted as /CGVC/, where G is either /w/ or /y/. The analyses depend on the position of word stress, and morphological processes, although in some instances its status is ambiguous. The minimal word is CV, e.g., ci ‘louse’, typical of Southern Aslian. Semaq Beri also has words of the so-called ‘sesquisyllabic’ type, e.g., cpə̃ŋ [cə.pə̃ ŋ] ‘frog (k.o.)’, where an initial sequence of two consonants is articulated with the assistance of an epenthetic vowel, as well as disyllabic words with a phonemic vowel in the initial syllable, e.g., ʔatɛ ‘earth, ground’. There is a strong tendency for roots to be maximally disyllabic. Most trisyllabic words exhibit evidence of morphological complexity, although these are often not the product of synchronic derivations, e.g., gruŋlaŋ ‘screw worm’ or krbəh [kə.rə.bəh] ‘vine (k.o)’. Syllabification proceeds from right to left, in accordance with the maximal exhaustion principle and taking into account phontactic restrictions (see §2.2.3), e.g., ʔamrəɲ ‘body louse’ syllabifies as [ʔam.rəŋ], and sblɛ̃k [ɕə.bə.lɛ̃k] ‘to be straight’. A closed syllable can never occur to the left of an open syllable *CC.C.CV(C). There are several ethnobiological names, and 1 Research was supported by the Max-Planck Institute for Evolutionary Anthropology, Leipzig (1999–2000), the Volkswagen Foundation DOBES language documentation initiative, (2007– 2010), and currently by the Bank of Sweden Tercentenary Foundation.

478

Kruspe

marginal recent loans that violate all phonotactic constraints, e.g., kawaw togroh [to.groh] ‘kind of bird’ and ʔeksiden [ʔek.ɕi.dẽn] ‘car accident’. The epenthetic vowel that occurs between consecutive consonants in sesqui- and tri-syllabic word forms is usually realized as [ə], but may be influenced by the following consonant if it is a labio-velar or palatal segment, e.g., [u] in twiŋ [tuˈwigŋ] ‘a type of headache resulting from breaking a food taboo’ or [i] in the environment of a palatal. This process is also observed across morpheme boundaries, e.g., between the unstressed pronominal index and the following verb where kɛ wɛn (3sg NEG.want) ‘He doesn’t want (to).’ may be realized as [kuˈwɛdn]. Epenthetic vowels do not occur in onset clusters of a nasal and homorganic stop, e.g., ɲsəc [ɲɕəic] ‘flesh’. This initial is dropped in morphological processes. Sequences of a stop plus liquid may be pronounced as a cluster, e.g., plit [ˈplit] ‘night’. 2.2 Phonemes and Phonotactics 2.2.1 Consonants Semaq Beri has 23 consonant phonemes (see Table 2.1 below), including the typical Southern Aslian series of aspirated voiceless oral plosives at four places of articulation (see Kruspe 2004 for Semelai, and 2010 for Mah Meri). These phonemes are infrequent, with the palatal aspirate the least frequent of all. Table 2.1

p pʰ b m

w

t tʰ d n r l

Consonant phonemes

c cʰ ɟ ɲ s

k kʰ ɡ ŋ

ʔ

h

j2

2 The Semaq Beri phonemes in Tables 2.1 and 2.2 are represented in accordance with IPA standard. However, throughout this chapter, language examples are represented in the practical orthographies developed for the individual languages in accordance with much Aslian (and Southeast Asian) descriptive tradition. While these orthographies largely adhere to IPA notation, they depart from it in that the voiced palatal stop /ɟ/ is represented as j, and the palatal glide /j/ is represented as y. Furthermore, epenthetic vowels are consistently left out.

479

Semaq Beri

The fricative /s/ is the post-alveolar fricative [ɕ]. The least stable consonant is the rhotic /r/ which is also realized as a weak approximant or as the velar lateral approximant [ʟ] in coda position, perhaps influenced by neighbouring Batek, e.g., srdɔk [ɕəɹdɔk / ɕəʟdɔk] ‘to wear’, and tər [təɹ / təʟ] ‘to arrive’. Nonphonemic preaspiration of the liquids /l/ and /r/ is often attested across syllable boundaries where /CV(N).C/ → [CV(N).ʰC], e.g., paloŋ [pa.ʰloŋ] ‘thatch’. 2.2.2 Vowels Semaq Beri has both monophthongs and diphthongs. There are 10 oral monophthongs, of which eight have a phonemic nasal counterpart, e.g., sɔk ‘to climb up to retrieve blowpiped arboreal game’, and sɔ̃ k ‘to suck’. Due to the presence of unpredictable nasal spread, compare mɔ̃ t ‘eye’ and mɔt ‘to smoke’, all nasalised vowels must also be treated as phonemically nasal. However some, like the mid-close /ẽ/ and /õ/ only ever occur as the result of rightward nasal spread. Table 2.2 Monophthong vowel phonemes

i e ɛ

ĩ (ẽ) ɛ̃

ɨ

ɨ̃

ə

ə̃

a

ã

u o ɔ a

ũ (õ) ɔ̃ ɑ̃

The vowel /a/ is realized as [æ] when followed by /y/, as in spay [ɕəpæj] ‘be new’. There is a tendency for the rounded back vowels to be slightly fronted and unrounded in rapid speech so that they approximate [ʊ, ɤ and ʌ], e.g., moy [mɤj] ‘mother’. In addition to the high central vowel /ɨ/, and the lower /ə/, some words have a mid-close central vowel [ɘ]. In the absence of any minimal pairs, the phonemic status of this vowel remains unclear. Semaq Beri also has seven phonemic oral diphthongs /iɛ, ia, iə, uɛ, uə, ua, uɑ/, e.g., lawɑk hapiɛʔ ‘Sepia River’ and srsuər ‘to walk along a riverbank’, and two nasal counterparts /ĩã/ and /ũɛ̃/, e.g., lksĩãk ‘to have fat’. Diphthongs are restricted to the final syllable, and are infrequent. Some [VV] sequences exhibit mutable behavior, the initial [V] varying between a vowel and a glide on different occasions. Similar mutability is noted for Jah-Hut (Diffloth 1976), and Ceq Wong, (Kruspe et al., this volume). While the Semaq Beri diphthongs are unique in Southern Aslian, it remains unclear whether they represent a Semaq Beri retention or innovation, or they

480

Kruspe

have been borrowed with the diphthong in situ from Northern or Central Aslian languages, for example, Semaq Beri /pantuay/ and Ceq Wong /bantuay/ ‘Malayan Pangolin’. 2.2.3 Phonotactics 2.2.3.1 Consonants Final syllable Consonant phonemes have the greatest distribution in the onset of the final syllable with all 23 consonants occurring in this position. Few restrictions are noted for onsets, although voiced or aspirated stop onsets almost never co-occur with a nasal vowel. The voiced, and voiceless aspirated oral stops are not attested in coda position. Voiceless oral stops are checked in final position, while nasal stops are prestopped following an oral vowel, e.g., ʔn-ʔen [ʔən.ʔedn] ‘to be coming’. Non-final syllables In disyllabic words, all consonants, apart from the aspirated voiceless stops may occur as the onset of the penultimate syllable. For sesquisyllabic words this is more constrained, and in addition to aspirated stops, nasals and glides are unattested, with the exception of a few nasal + homorganic stop onsets, see §2.2.1. In trisyllabic words, the onset of the penult is usually a nasal or liquid, often the result of fossilized infixation, while the onset of the antepenultimate syllable is usually a stop or fricative. The coda in the penultimate syllable is either a nasal or liquid, sometimes homorganic with the final onset, kmpən [kəm.pədn] ‘wife’. In general, most codas appear to result from fossilized morphological processes, typically copies of the final syllable coda as in ʔanwɔn ‘to be bare (of a tree-trunk)’ or klʔtuəʔ ‘bee (k.o.)’, or an infixed glottal stop, e.g., blaʔŋɨ̃t ‘bruised’. The prepenultimate syllable is always open. 2.2.3.2 Vowels All vowels occur freely in the final syllable. The most frequently occurring phonemic vowel in the penultimate syllable is /a/, as in talŋɛ̃l ‘be white (on a black background)’. The vowel /ɑ/, ‘true’ phonemic nasal vowels, i.e., those that do not occur adjacent to a nasal consonant, and diphthongs are all absent. Phonemic vowels are rare in prepenultimate syllables. 3

Word Formation

Aslian languages stand out from the majority of the Austroasiatic languages of Mainland Southeast Asia in terms of their complex and productive morphological processes. Semaq Beri has a range of derivational processes,

Semaq Beri

481

and these are discussed in §3.2, following a discussion of the less significant process of compounding. 3.1 Compounding Typical of Aslian languages, ‘compounding’ as a means of word formation is of low frequency, see Kruspe et al., this volume. Where compounds exist, they are always nominal—the initial nominal is the head—and have the syntax of noun phrases, for example some compounds consist of a noun and a property verb, e.g., ʔay blŋɨt (game be.dark) ‘the class of dark-fleshed game’, and in particular are lexicalised associative constructions, e.g., suk mɔ̃ t (hair eye) ‘eyelash, eyebrow’, see §4.3. Most compounds are transparent, but some require recourse to traditional knowledge, e.g., ʔay ʔwʔaw (body IPFV-stand) ‘rainbow’. The head of these compounds cannot be elided and retain lexical integrity. Coordinate compounds like moy ʔibɛʔ (mother father) ‘parents’ are rare. Compounds that are calqued on Malay are frequently hybrid compounds where one Malay term is replaced by a Semaq Beri term, e.g., ʔəc lalat (excreta fly) from Malay tahi lalat (excreta fly) ‘freckle’. Many compound-like nouns typically consist of a ‘classifying’ term, and a specifier, and are commonly used in ethnobiological nomenclature, topographic terms, and exonyms. These ‘classifying’ terms include jlʔmɔ̃ ʔ ‘vine’, boŋãʔ ‘flowering plant’, tijɔ ‘snake’, kawaw ‘bird’, ʔay ‘game’, lawɑk ‘stream’, ʔatɛ ‘ground, soil’, smãʔ ‘person’, e.g., boŋãʔ taŋaʔ cɔ (flower stair dog) ‘kind of flower’, lawɑk lwəc ‘Luit River’, and ʔatɛ bmbɨm (earth ?) ‘montane soil’. The modifying element of this type of compound may be used without the class noun if the referent is established in the discourse. 3.2 Derivational Morphology Semaq Beri is an exclusively prefixing/infixing language, and derivational processes are fed both by indigenous and loaned roots. Unlike Semelai (Kruspe 2004), there are few morphological processes borrowed from Malay. Typical of Aslian languages, the form of the affix in Semaq Beri is dependent on the syllabic structure of the root. The domain of attachment is always the penultimate syllable. Indigenous processes are driven by considerations of syllable weight and phonotactic wellformedness of the output form. This results in non-linear affixation, so that a morpheme will have allomorphs underspecified with respect to the syllabic structure of the root. The marginal loaned morphological processes do not adhere to this and simply prefix at the left edge of the root. Morphological processes derive forms that are maximally trisyllabic. Derivational processes may feed further derivations, for example, nominalisation of imperfective verb stems, e.g., broh (blowpipe)

482

Kruspe

‘blowpiping’ from broh (blowpipe) ‘to be blowpiping’. Other rare examples are also noted, as in clɔn ‘to walk backwards’ possibly derived from the causative stem clɔn ‘to turn one’s back’, rather than the root clɔn ‘back’, and infixation of in pa-kʰɛʔ ‘to pass information (to a third party)’ from the causative pa-kʰɛʔ (caus-know) ‘to inform, explain (to s.o.)’. Affixes may be underspecified with respect to form, phonetic content, or both, for example, both full and partial reduplication are attested, and reduplication may coccur with infixation as a single process. Prespecified segments are typically drawn from nasals and liquids /m, n, l, r/, for example, the nominalising morpheme +n+ ‘nml’. Both full and partial reduplication are attested in Semaq Beri. Productive partial reduplication of segmental phonemes is the most common form of reduplication; less common is reduplication of lexemes. Coda copy (Kruspe 2004, Burenhult 2005) is a process whereby the coda of the final syllable is reduplicated, and infixed into the empty coda position of the penultimate syllable. As in other Aslian languages (Kruspe et al., this volume), in Semaq Beri this formal process has multiple functions, dependent on the word class of the root. Nominal roots derive verbs, see §3.3.1, whereas verbal roots derive an aspectual distinction, see §3.3.2. Frequent low productivity of morphological processes suggests nonproductive processes, however viewed from the perspective of the very specific semantics of Aslian lexemes, it can be argued that this low productivity can be accounted for by the fact that derivations are semantically driven, and therefore often only applicable to small domains within the lexicon. Epenthetic vowels of morphophonetic origin are the same as those described above for syllabification in §2.2.3, except that reduplicated glottal codas may result in an epenthetic vowel [a]. An exception is noted for Malay loans with schwa in the penultimate syllable, where an infixed /h/ does not give rise to an epenthetic vowel [a], but retains schwa. Compare Semaq Beri bhroh [baroh] ‘to be hunting with a blowpipe’ and the Malay loan blah [bəlah] ‘to be preparing’, and not the expected form [balah]. 3.2.1 Deriving Nouns Nominalisation The nominalisation of verbs involves an underspecified affix +n(C)+ that has four different allomorphs depending on the syllabic structure of the root, see Table 2.3. Deverbal nouns usually name an activity or state. They function like typical nominal heads, see §5.1. Occasionally, verbal nouns name agents or entities, e.g., gar ‘to want’ → nr-gar ‘admirer, tləp ‘to spread out’ → tləp ‘mat’, jmɔ̃ h

483

Semaq Beri Table 2.3 Nominalisation in Semaq Beri

CVC

n-

bək ‘to tie’ → nk-bək [nək.bək] ‘act of tying’

C.CVC

glət ‘to swallow’ → glət [ɡən.lət] ‘act of swallowing’

CV.CVC

caduəh ‘to resprout’ → caduəh [cə.na.duəh] ‘act of sprouting’

CVC.CVC

n

haʔgiʔ ‘to share’ → naʔgiʔ [naʔ.giʔ] ‘act of sharing’

‘to call’ → jmɔ̃ h ‘name’, tarek ‘to pull’ → tarek ‘zipper’ and cilas ‘to comb’ → cilas ‘comb’. Agent nominalisation Some agent or instrument nouns are derived from monosyllabic verbs by the prefixation of pn- or pna- ‘a.nml’, e.g., cɔ̃ c ‘to pierce’ → pn-cɔ̃ c ‘skewer’; sɑk ‘to probe for (game)’ → pn-sɑk ‘probe’, and gəh ‘to not share’ → pna-gəh ‘someone who is stingy’, and cʰək ‘to plant in the ground’ → pna-cʰək ‘lean-to pole’. There is no comparative derivation for sesqui- or disyllabic roots, which simply infix ‘nml’, as described above. Unitisation A process that is formally the same as the nominalisation of verbs (see Table 2.3) produces ‘unitised’ or count nouns from common nouns. They function in enumerative constructions and bear some resemblance to the numerical classifiers found in other MSEA languages, e.g., syã ‘lean-to’ → syã as in mar syã ‘two lean-tos’. Other examples are: moy ‘mother’ → ny-moy ‘a mother’, plit ‘night’ → plit ‘a night long’ and kolɛh ‘tumbler’ → kolɛh ‘a tumblerful’. (1)

ʔah gi kʰom marem? limãʔ, nãm tahon, gi lompot filler 3pl sit how.many five six year 3pl flee ‘Ah they stayed for how long (now) . . .? Five, six years, (and then) they fled.’

The human collective affix +rʔ+ +COLL+ derives collective nouns from a small set of age-class nouns.3 While unattested elsewhere in Southern Aslian, a

3 baʔi ‘alone’ is the only non-age term to feed this derivation: baʔi, as in gi kʰom br-bar gi baʔi (3pl sit be-two 3pl alone) ‘They are sitting the two of them on their own’.

484

Kruspe

s imilar but more productive process is found in some Northern and Central Aslian languages, see Kruspe et al., this volume. The affix consists of an underspecified morpheme +rʔ+, e.g., gayaʔ ‘grandmother’ → gayaʔ ‘group of grandmothers’. If the penultimate onset is a nasal or glottal segment, +r+ simply replaces it, as in knaya ‘pubescent female’ → kaya ‘group of pubescent females’ and ʔipəʔ ‘male’ → r-ipəʔ ‘group of males’. (2) hɛ swak gadəʔ tahĩʔ sakɛʔ 1incl go woman seek pandanus ‘We will go (us) women to look for pandanus.’

Note that in compounds, the collective is marked on the modifier, not the head, if the modifier is a human noun, as in smãʔ tohaʔ → smãʔ tohɑʔ ‘group of adults’, or kaʔkɔn r‑ipəʔ ‘group of male children’. However if the modifier is a property verb, the collective must be marked on the head, e.g., kaʔkɔn mahɛ̃t (child be.small) ‘group of small babies’. Diverse plural Word-level reduplication of nouns expresses either a range of possible referents from the domain of intended referents, or a lack of specificity of the intended referents. The distinction is determined contextually. Although it expresses plurality, it does not express number alone, but rather focuses on diversity. (3) nɔ̃ ʔ gop~gop balɑʔ, ʔah təʔ kiriʔ kanãn gi now Malay~red be.many filler that left right 3pl ‘Now there are lots of Malays all over, ah, to the left and right of them.’

Numerals There are two derivations restricted to the indigenous numerals two, three and four, see §5.1.2.1 for numerals. An ordinal derivation prefixes nato the root, e.g., na-mar ‘second’. The collectivizing derivation prefixes pna-, as in pna-mar ‘both’, and pna‑mpon ‘all four’. There are no equivalent derivations for loaned numerals. Demonstratives and Locative nominals The proximal demonstrative nɔ̃ ʔ ‘this’ is reduplicated to derive the pronominal or temporal nʔ-nɔ̃ ʔ ‘this thing here; now’. The distal demonstrative təʔ derives the pronominal dʔtəʔ ‘thing there’. The directional adverbs ʔaləŋ ‘above, uphill’, and ʔacoʔ ‘downward, downhill’ derive locative nominal forms, replacing the /ʔ/ onset with d-, e.g., daləŋ ‘place uphill’ and dacoʔ ‘place downhill’. Nominal attributive forms are derived

Semaq Beri

485

by replacing the onset with n-, e.g., dlɔŋ naləŋ təʔ  (tree nml.uphill that) ‘that tree uphill there’. The corresponding locative nominals for upstream and downstream appear to be unrelated to the adverbials, but maintain the same forms, an initial d-, as in dapon ‘place upstream’ and datɔʔ ‘place downstream’, and attributive forms napon ‘upstream’ and natɔʔ ‘downstream’. An alternatively analysis would posit replacive syllabic prefixes da- and na- for these two marginal derivations. 3.3 Deriving Verbs 3.3.1 Deriving Verbs from Nouns There are few processes for deriving verbs from nouns, and those attested generally apply to only a small set of roots. The same formal processes are attested as for verbal morphology: an infix +mC+, Coda copy, and causativisation, see §3.3.2. There is also the marginal presence of b- ‘have’ from Malay ber-. In addition to the productive processes described below are some rare derivations like psum (nest) ‘to make a nest, cocoon’ from psum ‘nest, cocoon’, and kraymɔy ‘to be forming fruit’ which appears to have a similar structure *kmɔy, but no root *kmɔy is attested. Infixing +m+ (attr) derives a verb expressing possession of a physical attribute or kin. All roots attested so far are either body or plant parts, or kin relations. For example, monosyllabic suk ‘hair’ → mk-suk ‘to have hair’, sesquisyllabic lpɛh ‘juvenile tuber’ → lpɛh ‘to have juvenile tubers’, and disyllabic jareʔ ‘finger, toe’ → jareʔ ‘to have digits; be a fingerwidth’ and ʔiwãʔ ‘child’ → m-iwãʔ ‘to have a child, give birth’. Coda copy The infixation of a copy of the root coda is fed by nouns naming the stages in the growth of fruit, e.g., boŋãʔ ‘flower’ → boŋãʔ ‘to be in flower’, and potek ‘bud’ → potek ‘to be in bud’, names of implements, e.g., jareŋ ‘seine net’ → jareŋ ‘to fish with a seine net’, and samoy ‘flute’ → samoy ‘to play the flute’, and other nouns, e.g., bereŋ ‘side’ bereŋ ‘be on the periphery’. The derivation is fed mainly by Malay loanwords. The indigenous numerals two to four derive forms expressing the number of people in a group, ‘to be X people’ also by Coda copy, e.g., mar ‘two’ → br-bar ‘be two’. The initial /m/ of the numerals three and four is elided, e.g., mpɛʔ → pʔ-pɛʔ ‘be three’. Similar derivations are found in other Southern languages, e.g., Semelai (Kruspe 2004), as well as in the wider Aslian family, see Kruspe et al., this volume. Borrowed numerals above five are used in conjunction with a unitized noun, e.g., limãʔ smãʔ (five person) ‘five persons’.

486

Kruspe

Causativisation The derivation of nouns with the causative morpheme ‘caus’ is infrequent, and has only been attested with sesquisyllabic roots, e.g., clɔn ‘back’ → clɔn ‘to turn o.’s back’, and kʰləm ‘odour’ → kʰləm ‘to emit an odour’. Possessive In describing the absence of a property, some speakers use a construction beh b-N, where the noun is prefixed with b- ‘have’, e.g., beh b-kʰləm (neg.exist have-odour) ‘(It) has no odour’. The prefix, probably borrowed from Malay ber-, only occurs in this marginal construction. 3.3.2 Deriving Verbs from Verbs Semaq Beri verbs have inherent lexical aspect. Derivational processes introduce distinctions in aspect and aktionsart. The Imperfective +(C)C+ ‘ipfv’ is derived by coda copy +C+. Both transitive and intransitive verbs feed the derivation. This is for the most part an optional aspectual derivation that conveys information about the internal structure of an event. This derivation has high frequency as it is used in the formation of the negative, see §4.1.2. (4) kɛ gh-gəh kweh 3sg ipfv-not.share biscuit ‘He isn’t sharing his biscuits.’

Monosyllabic roots prefix CC- consisting of a copy of the onset and coda of the root, e.g., ʔaw ‘to stand’ → ʔw-ʔaw [ʔuw.ʔaw] to be standing. All other roots simply infix the reduplicated coda into the penultimate syllable coda position, as in jʔɔh ‘to drink’ → jʔɔh [jah.ʔɔh], blac ‘be smooth’ → blac [bic. lac], dagaŋ ‘to visit’ → dagaŋ [daŋ.ɡaŋ] and panas → panas [paɕ.naɕ]. For the realisation of epenthetic vowels, see §3.1. Causativisation There are numerous causative affixes in Semaq Beri, and their individual productivity is yet to be established. The causative derives verbs with an additional argument, introduced in the agent role, from dynamic and stative verbs. (5)

smãʔ.tohɑʔ gi pa-kʰɛʔ ʔəɲ person.be.old 3pl caus-know 1sg ‘(My) elders told me.’

(6) cur tbiŋ hanɛ̃h teh ʔəɲ!  pour be.full FUT.PROX tea 1sg   ‘Pour my tea full now!’

Semaq Beri

487

Causative verbs are used for direct or controlled manipulation of an activity. To facilitate someone to do something, or facilitate a state, a periphrastic construction with the verb ʔãɲ ‘to carry’, is used, e.g., hɛ ʔãɲ jlaŋ nɛ̃ŋ (1incl carry be.long pst.prox) ‘We made (it) longer just now’. Monosyllabic roots select from a range of prefixes pa-, paʔ- or pan-, e.g., cwəh ‘to ascend’ → pa-cwəh ‘to raise (s.th.) up’; dəm ‘to lie down’ → pan-dəm ‘to lay (s.th.) down’; hũk ‘to bleed from the nose’ → pan-hũk ‘to cause (s.o.) to bleed from the nose’. Sesquisyllabic roots infix e.g., ghɛ̃t ‘be sweet’ → ghɛ̃t ‘to sweeten’. The nasal assimilates to the onset of the final syllable, e.g., kbəs [kbəs] ‘to kill’. There is variation, with some speaker’s exhibiting a preference for before prepalatal onsets, e.g., kloʔ ‘be near’ → kloʔ ‘to draw near’. Disyllabic roots infix the nasal , e.g., bukɑʔ ‘to open’ → bukɑʔ [buŋkɑʔ] ‘to open (s.th)’. The infix ‘to have an inherent attribute X’ (attr) is attested infrequently, and so far only with sesqui- and di-syllabic roots. It is composed of an infix and a copy of the root coda , e.g., jadiʔ ‘to become’ derives jadiʔ ‘to be able to transform bodily’. It functions to distinguish an activity as having an inherent trait, like a shaman’s ability to transform into an animal, or someone’s characteristic trait. (7)

beh kɛ kaləŋ neg.exist 3sg sook ‘He doesn’t sook.’

A small subset of stative property verbs (see §5.2.1) feed derivations expressing degree. Comparative (COMPAR) Some property verbs derive a comparative form through the infixation of : e.g., jlaŋ ‘be long’ → jlaŋ ‘be longer’. Other verbs include jruh ‘be tall’, jlɛ̃ʔ ‘be short’ and kloʔ ‘be near’. Intensive (INTNS) Many property verbs express the intensive by infixing , e.g., mahɛ̃t ‘be small’ → mhɛ̃t ‘be really small’, and pgus ‘to smell musty’ → pgus ‘to smell really musty’. Some roots combine this with vowel alternation to /ɛ̃/ or /ə/, e.g., pagɑh ‘be big’ → pɡɛ̃h ‘be really big’, and sdãc ‘be cool’ → sdɛ̃c ‘be really cold’, bdɑm ‘be slow’ → bdəm ‘be a really long time’. Coda mutation The verbs kanɛ̃t ‘be little, few’ and mahɛ̃t ‘be small’ derive the intensive through the mutation of the final consonant, whereby /t/ becomes /s/, as in kanɛ̃s and mahɛ̃s ‘be tiny’. Although not unique in Aslian (see Kruspe et al., this volume), structurally this is highly unusual as the final consonant is generally the most stable part of the root.

488

Kruspe

Continous/Distributive ‘CONT’ Root reduplication of verbs provides an aspectual distinction expressing continuous aspect for dynamic verbs as in (8), and a distributive reading for stative verbs. Stative verbs are reduplicated twice; dynamic verbs up to five times. (8) kɛ tahĩʔ~tahĩʔ~tahĩʔ~tahĩʔ, beh dapɑt 3sg seek~cont neg.exist succeed ‘She searched and searched and searched, (but) didn’t didn’t find her.’

3.4 Clitics There are three clitics in Semaq Beri, which are all proclitic to noun phrases: the focus marker d= ‘foc’, the referential ʔi= ‘ref’, which creates a reference term from personal names, see §5.1.3, and the quotative hn= ‘quot’. The locative preposition has a reduced cliticising form when it occurs with a demonstrative, e.g., ha=nɔ̃ ʔ ‘here’, from haʔ nɔ̃ ʔ (loc this). 4

Phrase and Clause Structure

4.1 Simple Clauses 4.1.1 Constituent Order The combination of a high rate of ellipsis of NPs and non-fixed constituent order makes it irrelevant to propose a basic constituent order at clause level, and any patterns can only be noted as tendencies. The constituency of Semaq Beri clauses depends on the type of verb in the predicate. Two classes of verb are identified—Dynamic, and Stative—the distinction cross-cuting the classes of transitive and intransitive verbs, see §5.2.1. In clauses with a dynamic verb, defined as those involving an event or change of state, the verb hosts an obligatory pronoun indexing the A of a transitive clause, or the S of the intransitive clause. The pronoun is syntactically bound: it is always adjacent to the verb, and no other material may intervene, other than the imminent aspect marker gaʔ ‘imm’. The high rate of zero representation of NPs means the pronominal is often the only representation of the argument in the clause. This pronoun is absent in the Stative verb clause. This alignment pattern is not attested elsewhere in Southern Aslian, but is common in Northern Aslian, see Kruspe et al., this volume. The subject NP (A or S) may occur in pre- or post-verbal position. There is no formal marking on the preverbal subject, but the post-verbal NP is optionally preceded by nãʔ ‘agt’, compare (9) and (11). Elsewhere, nãʔ is the comitative or

Semaq Beri

489

instrumental preposition, see §5.5. There is no morphological marking of the object NP (P), nor is it cross-referenced on the verb, see (9). (9) gayit kɛ yɔk hɛ tiger 3sg take 1incl ‘A tiger will take us.’ (10) gi cwəh pna-mar moy.ʔibɛʔ kɛ 3pl ascend incl-two parents 3sg ‘Her parents both got out.’ (11) kɛ rɛɲ nãʔ kmõr  3sg gnaw agt grub ‘The grubs ate away (at it).’

Both the subject and object may be in pre-verbal position: (12) tlkɔʔ tʰi, hantuʔ kɛ kltɑs haʔ mɔ̃ t nail hand ghost 3sg flick loc eye ‘The ghost flicks its fingernails into (our) eyes.’

The single argument in a stative clause has no morphosyntactic marking, is usually post-verbal as in (13), but may also precede the verb (14), or be elided. (13) kanɛt ʔa y be.small body ‘(Its) body is small.’ (14) kɛ jlaŋ 3sg be.long ‘It is long.’

Adverbial modifiers tend to immediately follow the verb. Constituents with clausal scope, such as adverbials, prepositional phrases or temporal nouns, occur in initial or final position in the clause. Example (15) shows a temporal noun in initial position, and a locative phrase following the verb. (15) mũy tahon kɛ kʰom haʔ ʔəɲ, ʔarɛh bayek kɛ one year 3sg sit loc 1sg only.then be.good 3sg ‘He stayed with me for one year, and only then was he healed.’

490

Kruspe

Topics or afterthoughts precede or follow the core clause in the extraclausal Left- or Right-hand position. In the Lefthand position the topic NP hosts a demonstrative, usually nɔ̃ ʔ ‘this’ or təʔ ‘that’ (16), while in Right-hand position, the topic may optionally take the pro-clitic focal marker d= ‘foc’, see (17). Afterthoughts lack either of these features. (16) ʔah təʔ, ʔay hɛ sɨl nɔ̃ ʔ, hɛ tlaɲ, hɛ loh ah that meat 1incl k.o.tortoise this 1incl transgress 1incl live ‘So, our game this Giant Brown tortoise (that I’m talking about), (even if) we transgress the taboo, we will live.’ (17) kɛ jadiʔ ʔikan.blidaʔ, d=tarɛs.tɔm nɔ̃ ʔ 3sg become fish.k.o. foc=kingfisher.k.o. this ‘It tranforms into a kind of fish, this Kingfisher (I just mentioned).’

4.1.2 Negation The verbal negator is the negative existential verb beh ‘to not exist’, and the negated predicate is a complement of the matrix verb. This pattern is unattested elsewhere in Southern Aslian, but is a common strategy in Northern Aslian (Kruspe et al., this volume). Dynamic verbs are in the imperfective form. The subject is optionally represented in the dependent clause. (18) ʔah gaʔuʔ təʔ nɛ̃ŋ, beh kɛ jn-jon ah elder.sister that pst.prox neg.exist 3sg ipfv-give ‘Ah, the elder sister didn’t give (it).’ (19) beh ʔn-ʔen gaʔ bri neg.exist ipfv-go goal forest  ‘(He) didn’t go to the forest.’ (20) beh payah ʔãɲ hntɔr gop neg.exist be.difficult carry sound Malay   ‘It isn’t difficult to speak Malay.’ (21) beh pnãh crɛt jalɨw neg.exist ever eat.meat< ipfv > boar ‘(We) have never eaten boar.’

The aspectual adverbs dɔl ‘still’, pnãh ‘ever’ and lɑc ‘perf’ may occur in negated clauses, meaning ‘not yet’, and ‘never’.

491

Semaq Beri (22) ʔah ah

təʔ beh crɛt, beh pnãh gaʔ that neg.exist eat.meat neg.exist ever imm

crɛt dɔl eat.meat still ‘Ah, (we) don’t eat (it), (and we) would never eat yet (it).’

4.1.3 Questions Polar questions have a clause final particle, bəh ‘q’ expressing uncertainty. There is no equivalent to ‘yes’ or ‘no’, and responses to polar questions include a partial repetition of the question. (23) Q: kɛ smaɲ rajaʔ, “ʔiwãʔ ja pagɑh bəh?” A: “pagɑh lɑc” 3sg ask king child 2sg.f be.big q be.big perf Q: ‘The king asked, “Is your child big (yet)?”’ A: “She’s already big.” (24) gi lompot bəh, beh bəh? hn=kɛ 3pl flee q neg.exist q quot=3sg ‘“Did they flee, or not?” he asked himself.’

Content questions replace the constituent being questioned with an interrogative word, see §5.1.1.3. The interrogative may be clause initial as in (25), or substitute the questioned word in situ. (25) marem krɨŋ dlɔŋ? limãʔ krɨŋ how.many trunk tree five trunk ‘How many trees? Five trees’.

4.1.4 Commands Imperatives use the simple verb root; dynamic clauses lack the pronominal index. The emphatic particle lah (borrowed from Malay) may occur clause finally in imperatives and hortatives, as in (29). The addressee is optionally included as a vocative in clause-initial or final position. Temporal adverbs like haʔ yɛm ‘first’ follow the verb as in (28). In the Semaq Beri socio-cultural context, commands have weak force, and there is no expectation of compliance. (26) gɑŋ teh! boil tea ‘Make (some) tea!’

492

Kruspe

(27) ʔen yɔk! go fetch ‘Go (and) get (it)!’

There is no special construction for the hortative, and a normal clause with the first person inclusive hɛ ‘1incl’ is employed. (28) teh hɛ, jʔɔh teh təʔ haʔ.yɛm! tea 1incl drink tea that first (Here’s) our tea. Let’s drink the tea first! (29) hɛ carɔŋ manɛ̃k lah!  1incl thread bead hort ‘Let’s thread beads!’  

4.1.4.1 Negative Imperative The imperative negator is jeʔ baʔ ‘NEG.IMP’ followed by a simple verb root. The construction is probably borrowed from Northern Aslian, and the negator jeʔ is possibly cognate with Jahai yiʔ ‘to refuse, reject’; jeʔ is not attested as a verb in Semaq Beri, or elsewhere in Southern Aslian. Compare the Pahang Semaq Beri imperative negator bəy with Semelai bɔy ‘NEG.IMP’!. The negator jeʔ ‘Don’t!’ may be used alone. Gentle commands used to dissuade children utilise the negative verb wɛn! (neg.want) ‘Don’t!’, lit. ‘(You) don’t want (to)’. (30) jeʔ.baʔ

rɲjɑh

mɔ̃ k təʔ! like that ‘Don’t grasp it like that!  Stand here!’

NEG.IMP grasp

ʔaw stand

tən from

nɔ̃ ʔ! this

(31) jeʔ.baʔ

jʔɔy samãʔ hn=kɛ NEG.IMP make same quot=3sg “Don’t make (it) (in the) same (place)”, they said.

Negative imperatives may be formed with a negated clause: (32) jeʔ.baʔ

beh

kʰɛʔ

NEG.IMP neg.exist know

‘Don’t not know!’

Semaq Beri

493

4.1.5 Nonverbal Clauses Equative, ascriptive and identifying clauses do not contain a verb, and consist of two NPs juxtaposed. The ordering of the NPs is not fixed. The complement may be an NP (33), a prepositional phrase (34), or a directional adverb. Semantically compatible aspectual markers like the perfect lɑc ‘perf’ and imminent gaʔ ‘imm’ may co-occur in the clause as in (35). (33) nɔ̃ ʔ ʔiʔɛʔ kɛ nɔ̃ ʔ this elder.brother 3sg this ‘This (is) the elder brother of him here.’ (34) nɔ̃ ʔ, haʔ dəŋ ha=nɔ̃ ʔ d=kampoŋ nɨ̃y this loc house loc=this foc=encampment rem.pst ‘Here, at the settlement here (was) their former encampment.’ (35) kʰoy kʰoy ʔika lɑc, hateʔ klantɔ̃ r head head fish perf tail Giant.Millipede ‘The head (was) already a fish’s head, the tail (was still) a Giant Millipede.’

4.2 Complex Clauses Complex clauses typically consist of two types, those that are composed of independent sub-clauses, and those that are composed of two or more verbs, one of which lacks the features of a main clause verb. Complement clauses, where a clause fills a core argument position, A, S or P, in a higher clause are also attested (§4.2.2). 4.2.1 Clause Concatenation Clauses are combined by simple juxtaposition as in (36)–(40), with no overt conjunction, and the semantic relations between the clauses are determined pragmatically. Each event is discrete, the subject is indexed on each verb if it is dynamic, and operators such as aspectuals, negation and so forth only have scope over each sub-clause, as in (37). Concatenated clauses are used for coordination, simultaneous events, the conditional and the counterfactual. (36) [hɛ yɔk mahãm hɛ təʔ], [hɛ kasɔc haʔ kɛ] [1incl take blood 1incl that] [1incl throw loc 3sg] ‘We take our blood (and) we throw (it) at him.’ (37) [kɛ cɨk sumuʔ], [beh kbəs] [3sg stab rhinoceros] [neg.exist dead] ‘He stabbed the rhinoceros (but) it didn’t die.’

494

Kruspe

(38) [kɛ pyər], [kɛ bɑy takop] [3sg sing] [3sg dig tuber] ‘She sang (while) she dug tubers.’ (39) [hɛ knãʔ hantuʔ nɔ̃ ʔ], [lyar mɔ̃ t] [1incl incur ghost this] [be.wild eye] ‘(When/if) we are afflicted by this ghost, (our) eyes (look) wild.’ (40) [ʔaleʔ] [hɛ yɔk], [hɛ crɛt], [beh ʔapaʔ] [be.long] [1incl take] [1incl eat.meat] [neg.exist what] ‘(If it is/After) a long time (and) we catch it (and) eat it, it doesn’t matter.’

Purposive, and periphrastic causatives clauses are typically multiverb constructions where the two verbs are construed as representing a single event. The initial main verb bears a pronominal index if dynamic, subsequent verbs are simply concatenated in the bare form, and the arguments occur after the two verbs, as in(41)–(43). No constituent, including adverbs, may occur between the verbs. (41) gi ʔen kɨm smãʔ 3p go hold person ‘They’ve gone to heal someone.’ (42) ʔəɲ gaʔ ʔen broh tupay 1sg imm go shoot.blowpipe squirrel ‘I am going to blowpipe squirrels.’ (43) ʔəɲ ʔãɲ swak ja 1sg carry walk 2sg ‘I’ll take you walking (about).’ (44) ʔəɲ kɨm ʔɔ̃ ɲ 1sg hold keep ‘I’ll take (it) for keeping.’

Multiverb constructions may also function as adverbial clauses, expressing manner. Example (45) demonstrates the lack of distinction between this and a purposive reading. Some verbs that function as manner adverbials are sɛc ‘do surreptitiously’, ʔyot ‘(do) again’, and roh ‘completely, to the full extent’. All

Semaq Beri

495

modifying verbs, except for sɛc, follow the main verb. In adverbial function, it does not bear stress as it would when functioning as a full verb. (45) moy kɛ ʔyot tahĩʔ kɛ mother 3sg return seek 3sg a. ‘(Her) mother went home to look for her.’ b. ‘(Her) mother went back to searching for her.’ (46) kaluʔ kuciŋ kɛ laŋkah, kɛ ʔyot loh if cat 3sg climb.over 3sg return alive ‘If a cat climbs over (the corpse), (it) will come back to life.’ (47) kɛ ɲca gəh 3sg eat.carbohydrate not.share ‘He ate without sharing.’

4.2.2 Complement Clauses Some verbs may take a clausal complement in an argument slot, in place of an NP. With the verbs of desire like gar ‘to want, to like’ and wɛn ‘to not want’ the complement fills the P argument slot as in (48)–(49). Example (52) has a complement is the S slot. Complement-taking verbs include verbs of desire, achievement, ability, cognition, speech, and knowledge, (48) kɛ gr-gar kɛ. kɛ gar kʰɛʔ 3sg ipfv-want 3sg 3sg want know ‘She wants (me to). She wants to know.’ (49) ʔəɲ btʰɔŋ swak~swak gaʔ bri 1sg afraid walk~cont goal forest ‘I am afraid to keep walking in the forest.’

The complement clause usually follows the matrix clause, but it may also precede it: (50) kɛ, kɨŋ kɛ kʰɛʔ  3sg make.shelter 3sg know  ‘Her, she knows (how) to make a shelter.’

496

Kruspe

The negative verbs beh ‘to not exist’, and wɛn ‘to not want’ take the imperfective form of dynamic verbs in the complement as in (51). (51) gi wɛn cʔɔŋ 3pl neg.want roast ‘They don’t want (it) roasted.’

Potential or hypothetical events contain the imminent aspect marker gaʔ ‘imm’ in the dependent clause. This marker is homonymous with the goal preposition, see §5.5. (52) ʔah təʔ payah hɛ gaʔ ʔaleh filler that be.difficult 1incl imm change ‘So, it would be difficult for us to change (it).’

4.3 Noun Phrases Noun phrases (NPs) function as the arguments of verbs, subjects and complements of nonverbal clauses, the complements of prepositions, and clausal adjuncts. The constituent order of the noun phrase is: (quant) Head (assoc) (attrib) (dem). Attributives may be stative property verbs in the bare root form, dynamic verbs in either the imperfective or nominalised form, a prepositional phrase, or a whole clause, the functional equivalent of a relative clause. Some examples of NPs follow:

mũy smãʔ (one person) ‘one person’ syã ʔaleʔ   (lean.to be.old) ‘an old lean-to’ cnrəŋ btək (path grow.over) ‘an overgrown path’ sala cʔũm təʔ nɛ̃ŋ (leaf wrap that prox.pst) ‘that leaf wrapper (from) just now’ caraʔ nm-kʰom smãʔ tohɑʔ nɨ̃y  (manner nml-live person be.old rem.pst) ‘the way of life of the people long ago’ tɔm haʔ syɔk gajah (water loc footprint elephant) ‘water in the elephant’s footprint’

Nouns may be coordinated with the preposition nãʔ ‘com’, or sapɔʔ ‘com’, the latter coordinates two entities of the same kind, e.g., gadəʔ sapɔʔ gadəʔ ‘a woman with a woman’.

Semaq Beri

497

There is no distinct relative clause construction in Semaq Beri. The modifying clause is simply juxtaposed following the head like a normal attributive, as shown by the bracketed clauses in ex. (53)–(54), and the relationship is determined pragmatically. (53) kaluʔ kɛ gar ʔiwãʔ hɛ, ʔiwãʔ hɛ [kɛ gr-gar] kɛ if 3sg want child 1incl child 1incl [3sg ipfv-want] 3sg gaʔ jwɑl, beh kʰɔm hɛ gaʔ jon imm sell neg.exist be.able 1incl imm give ‘If they want one of our children, (and) they are going to sell our child that they want, we wouldn’t give (it).’ (54) haʔ syã ʔaleʔ kɛ, syã nɨ̃y kɛ,  syã [ ɲji~ɲji]  loc lean.to be.old 3sg lean.to rem.pst 3sg lean.to [be.ill~cont] ‘In (their) old camps, their traditional camps from long ago, camps (where/in which they) were continually ill’ 

An associative construction [NP head NP mod] is used to express a range of semantic relations, including possession, kin relations, part-whole, entitysource, entity-purpose, and benefactive, as in (55). The semantic relationship between the constituents is determined contextually. (55) gi yot tən gahroʔ, gi gɔʔ lew rakit gi 3pl return from k.o.Agila.wood 3pl fell bamboo raft 3pl ‘They were on the way back from (seeking) Agila wood, (and) were felling bamboo for a raft for themselves.’

The difference between an associative construction and a compound noun is that in an associative construction each constituent in the construction retains its individual meaning Most compositional compounds in Semaq Beri are associative constructions. NPs may host the focus marking clitic d= ‘foc’, and the quotative clitic hn= ‘quot’. The quotative follows a statement and marks it as direct speech, or thought. (56) lɑc kɛ jaɲiʔ lɑc, lapan.ratus ʔəɲ  bayar hn=kɛ perf 3sg arrange perf eight hundred 1sg  pay quot=3sg ‘He had already promised, “I’ll pay $800”, he’d said.’

498 5

Kruspe

Word Classes

Semaq Beri has clearly distinguished word classes. Lexical items must be morphologically derived in order to function in a non-prototypical position. The two open classes are noun and verb, and the closed classes are preposition, adverb, negator, connective, interjection, pronoun, numeral, and demonstrative. The latter three belong in the nominal superclass. The status of expressive type words remains unclear. 5.1 Nouns Nouns function as the heads of NPs, which in turn function as arguments of predicates, subjects and complements of nonverbal clauses, and complements of prepositional phrases. They may also occur as unattached units in the form of preverbal topics, vocatives, or afterthoughts. Nouns may be modified by another noun, a demonstrative, a verb, a prepositional phrase, a quantifying expression or an attributive clause, §4.3. Nouns derive a unitised form that is used in the enumeration of nouns, and as measure terms, see §3.2.1. No productive means of deriving verbs from nouns has been identified, see §3.2.2. Nouns expressing spatial relations may be used in a locative function, replacing prepositions as in (57), or co-occuring to provide greater specificity as in (58) and (77). These include karom ‘underside, underneath’, dayal ‘above’, kloc ‘inside’, kupar ‘in front’, kruw ‘behind’, and bodypart nouns like clɔn ‘back’, kʰoy ‘head’ and jɔŋ ‘foot’. (57) kɛ kʰom kloc bri 3sg sit inside forest ‘He stayed inside the forest.’ (58) kɛ kʰom dayal ɡtʰu mrbaw 3sg sit above bark Merbau ‘It lives on the bark of the Merbau (tree).’

An important means of placing an event in time is to encode it on the nominal rather than in elsewhere in the clause; for example with nɛ̃ŋ (prox.pst) ‘just now’, nɨ̃y (rem.pst) ‘back then’. (59) moy ʔəɲ nɨ̃y, daʔ ʔiwãʔ ʔəɲ pn-pon nɨ̃y, ʔarɛh mother 1sg rem.pst exist offspring 1sg be-four rem.pst only.then

499

Semaq Beri

kɛ jon 3sg give ‘Back then my mother, back (when) there were four children of mine, only then did she allow (me to eat it).’

5.1.1 Pronouns and Question Words 5.1.1.1 Personal Pronouns The Semaq Beri pronominal paradigm has a basic singular/plural distinction for all three persons, with the addition of an inclusive/exclusive distinction for first person. Typically for a Southern language, it lacks the dual distinction of Northern and Central Aslian, see Kruspe et al., this volume. Unusually for an Aslian language, there is a gender, but no number distinction in the second person. Table 2.4 Semaq Beri personal pronouns

1 2 3

incl excl f m

sg

pl

ʔəɲ

yɛ

kɛ

hɛ

ja heʔ

gi

There is no gender-neutral second person plural form; the singular pronouns ja and heʔ are simply juxtaposed. Semaq Beri lacks a third person possessive form like =hn and han ‘3poss’ found in Semelai (Kruspe 2004) and Mah Meri (Kruspe 2010) respectively. Second person pronouns are only used with persons in the same, or descending generations, but not with affinal kin. The third person plural pronoun gi 3pl replaces the second person singular pronoun to address a sibling-in-law. The third person singular kɛ ‘3sg’ is also used for inanimate referents as in (60), and with meteorological verbs as in (61). (60) gtʰu nɔ̃ ʔ, kɛ hã, nɔ̃ ʔ hɛ risik wɛn skin this 3sg this.here this 1incl remove discard ‘This skin, it here, this we remove and discard.’

500

Kruspe

(61) beh kɛ panãs, tdoh neg.exist 3sg be.hot be.overcast ‘It’s not hot, (it) is overcast.’

Non-human and inanimate referents take the singular index, regardless of number, neutralising the distinction. In (62), the plurality is expressed by the quantifier hamɔ̃ ʔ ‘some’. (62) kɛ jadiʔ baliəw hamɔ̃ ʔ  3sg become Malayan.giant.frog some ‘Some become Malayan giant frogs.’

The third person plural gi ‘3pl’ is used as a collective pronoun with kinship, social unit, or ethnic group terms, (63)–(64). For similar constructions in Northern Aslian, see Kruspe et al. this volume. (63) gi ʔiwãʔ ʔəɲ, gi cnũʔ ʔəɲ  3pl child 1sg 3pl grandchild 1sg ‘All my children (and) my grandchildren.’ (64) dɔl gi klamĩn masə dɔl still 3pl married.couple time still ‘At this time the couple are both still living.’ (lit. still a married couple)

A plural pronoun modified by a reduplicated numeral, produces a synthetic dual or trial pronoun as in the following example: (65) gi swak gi br-bar gaʔ bri  3pl go 3pl be-two goal forest  ‘The two of them went to the forest.’

Pronouns may be modified by locative and temporal demonstratives, e.g., gi təʔ nɛ̃ŋ (3pl that PST.PROX) ‘them just now’. The personal pronouns also function as bound indexical pronouns in the verb complex, see §4.1.1. 5.1.1.2 Demonstratives Demonstratives form a small closed class, see Table 2.5. They function primarily as deictic determiners of noun phrases in either the situational or discourse context, and may themselves be modified by temporal adverbs. The demonstratives

501

Semaq Beri

occur as the complement of the locative preposition haʔ ‘loc’, as in ha=ʔin (loc=that) ‘there’, and the goal preposition gaʔ ‘goal’. The adverbial demonstratives only cooccur with gaʔ ‘goal’, as in ga=hnɔ̃ ʔ (GOAL=hither) ‘hither’. Table 2.5 Semaq Beri demonstratives Adnominal

Nominal

Adverbial

Manner adverb

nɔ̃ ʔ ‘this’ təʔ ‘that’

nʔ-nɔ̃ ʔ dʔ-təʔ

hnɔ̃ ʔ

di-nɔ̃ ʔ di-təʔ

ʔin ‘that’ ʔɛn ‘that’

hnĩʔ

hnɛ̃ʔ ‘this/that’

The proximal demonstrative nɔ̃ ʔ ‘this’ locates an entity at the deictic centre. In discourse, nɔ̃ ʔ functions to mark a current topic, as in (67). nɔ̃ ʔ ‘this’ has derived forms nʔnɔ̃ ʔ ‘that one’, and the manner adverbial di-nɔ̃ ʔ ‘like that, in that way’. (66) sput ʔəɲ nɔ̃ ʔ lah blowpipe 1sg this asrt ‘This (is) my blowpipe.’ (67) ʔah təʔ ci nɔ̃ ʔ, hɛ bɛŋ, knãʔ suc tʰi haʔ.yɛm filler louse this 1incl seek must wash hand first ‘So, these lice, (after) delousing, we must wash our hands first.’

The medial demonstrative təʔ ‘that’ is used contrastively with nɔ̃ ʔ ‘this’ in the immediate location of the speech event. For example ʔɔ̃ ɲ ha=təʔ! (put loc=that) ‘Put it there!’ (pointing to a location near the addressee, away from the speaker), or in the directive in (68). (68) ja kʰom ha=təʔ, ʔəɲ kʰom ha=nɔ̃ ʔ 2sg.f sit loc=that 1sg sit loc=this ‘You sit there (and) I’ll sit here.’

502

Kruspe

təʔ is particularly frequent in it’s anaphoric function in discourse, and in temporal expressions, e.g., smãʔ təʔ nɛ̃ŋ (person that pst.prox) ‘that person (I mentioned/was present) before’. təʔ has derived forms dʔtəʔ ‘that one’, and the manner adverbial di-təʔ ‘like that, in that way’. (69) ʔalaʔ ɲsəc. təʔ hɛ paliŋ jagaʔ nʔ-nɔ̃ ʔ taboo flesh that 1incl most look.after nml-this ‘The flesh taboo. That (one) we watch out for the most now.’

The demonstrative ʔin ‘that’ contrasts directly with nɔ̃ ʔ ‘this’ and is used for something which is present in the speech context, but which is located away from both the speaker and hearer. It is also used in the temporal expression losaʔ ʔin (day.after.tomorrow that) ‘day after tomorrow’. The demonstratives təʔ ‘that’ and ʔin ‘that’ co-occur with the directional adverbs (§5.3). (70) ʔin jlʔmɔʔ that vine

‘That (is) a vine’.

ʔɛn ‘that’ is used for remote locations, or the furthest location from the deictic centre, as in dacoʔ ʔɛn dɔl (downhill.place that still) ‘further downhill (right at the bottom)’. The vowel of the distal terms ʔin and ʔɛn may be elongated to accentuate the distance, as in [ʔiːːːn] ‘way there’ and [ʔɛːːːn] ‘way, way there’. The demonstrative hnɛ̃ʔ ‘this, that’ is used for something heard, smelt, felt, thought of, or remembered, but is not visible. Similar distinctions are found in Mah Meri (Kruspe 2010), and in Ceq Wong and Jahai, see Kruspe, et al., this volume. Hnɛ̃ʔ is also used for distant locations as an alternative to ʔin ‘that’, for example, the mouth of the Terangganu River, a location outside of Semaq Beri territory may be referred to as tamaʔ trŋanõʔ hnɛ̃ʔ (Kuala Terengganu that) ‘Kuala Terengganu there’. The adverbial demonstratives hnɔ̃ ʔ ‘hither’ and hnĩʔ ‘thither’ are usually used in conjunction with the preposition gaʔ ‘goal’, the directional adverbials and verbs of orientation or motion where no preposition is required. They are employed where the focus is on the movement rather than the location. (71) hɛ kʰom hnĩʔ kʰom hnɔ̃ ʔ 1incl sit here sit there ‘We lived (moving camp) here (and) there.’

503

Semaq Beri

5.1.1.3 Interrogative, Indefinite and Negative Pronouns There are four interrogative pronouns mʔay ‘what, which’, ʔŋkɔʔ ‘who’, hnãn ‘where’ and marem ‘how many’. The manner pronoun is a compound mɔ̃ k hnãn (like where) calqued on Malay macam mana ‘how’. There are no interrogatives for time or reason. There are no distinct indefinite or negative pronouns, other than ʔam ʔay ‘something’. Usually a generic noun is employed like smãʔ ‘person’ or ʔay ‘thing’, and the indefiniteness determined pragmatically. Negative pronouns are expressed by the indefiinte pronoun in a negated clause: (72) samõʔ.bri kʰɔm hɛ dəm, beh ʔam.ʔay  morning be.able 1incl sleep neg.exist something ‘In the morning we can sleep. There’s nothing (of concern).’ (73) tapiʔ smãʔ beh kʰɛʔ  but person neg.exist know  ‘But no-one knew.’

The interrogative hnãn ‘where’ is often used in rhetorical questions as in (74), a construction possibly calqued on the Malay use of mana ‘where’. (74) hnãn gi gaʔ jon? where 3pl imm give ‘As if they would they give them (up)?’

5.1.2 Numerals, Measure Words and Quantity Words 5.1.2.1 Numerals The variety of Semaq Beri described here has indigenous Austroasiatic numerals up to four.4 The remaining numerals to ten, and higher basic units are all loanwords from Malay. mũy ‘one’ mar ‘two’ mpɛʔ, ‘three’ mpon ‘four’

limãʔ ‘five’ nãm ‘six’ tujuh ‘seven’ lapan ‘eight’ smilan ‘nine’

← ← ← ← ←

lima ‘five’ enam ‘six’ tujuh ‘seven’ lapan ‘eight’ sembilan ‘nine’

4 Some speakers retain knowledge of msɔŋ ‘five’, but it is associated with another dialect, and not used by this group.

504

Kruspe puluh ‘tens unit’ blas ‘base of ten’ ʔatus ‘hundred’ ʔibuʔ ‘thousand’

← ← ← ←

puluh ‘tens unit’ belas ‘base of ten, from 11–19’ ratus ‘hundred’ ribu ‘thousand’

The indigenous numerals form hybrid compounds with loan numerals, e.g., mũy blas (one ten) ‘eleven’, mpɛʔ puluh (three ten) ‘thirty’ and limãʔ ʔatus mar puluh (five hundred two ten) ‘five hundred and twenty’. This pattern is also found in Mah Meri (Kruspe 2010), but contrasts with the other Southern Aslian languages, including the Jabor dialect of Semaq Beri (Kruspe fieldnotes 2007) and Semelai (Kruspe 2004), and Aslian in general, see Ceq Wong (Kruspe et al., this volume), Jahai (Burenhult 2005), Jah Hut (Diffloth 1976). When used to enumerate, numerals precede the noun. The noun is nominalised, as in mar plit (two night) ‘two nights’, or mũy n-ibɛʔ (one father) ‘one father’. 5.1.2.2 Measure and Quantity Nouns Measures form part of the numeral phrase, and along with Quantifiers like rem ‘every’ and mər ‘all’ precede the head like numerals, e.g., rem bŋãh (every daytime) ‘all day long’. Other quantity words follow the head, such as hamɔ̃ ʔ ‘some’, the verbs balɑʔ ‘be many’ and kanɛt ‘be few’, and measure terms like gantaŋ ‘a measure of rice’, and ʔeka ‘acre’, e.g., mũy gantaŋ bras ‘one measure of rice’. The quantifier nhroh (nml-finish) ‘all’ may precede or follow the head as can be seen with nhroh smãʔ (all body) ‘the whole body’, compared to ʔipəʔ nhroh (male all) ‘all males’. 5.1.3 Names and Terms of Address Each Semaq Beri individual has multiple names. A personal name, jmɔ̃ h ɲsəc (call flesh) ‘flesh name’, is derived from the environment, for example the toponym of one’s birthplace, or a plant growing in the locality of one’s birthplace. This naming strategy is not attested among other Southern Aslian groups, but is found among the Northern Aslian Ceq Wong, and neighbouring Batek groups. People will also have one or more nicknames, and an official Malay name. The latter is used freely with non-Semaq Beri as these names are considered to be external to Semaq Beri culture. Once a person has children, a teknonym is adopted, moy / ʔibɛʔ X ‘mother/ father of X’ based on the name of the eldest surviving child. Teknonyms are transient, for example when a child takes a spouse; the name of the next child becomes the formative. The term gayaʔ ‘grandmother’ is also occasionally used in teknonyms.

Semaq Beri

505

Taboos prescribe against disrespectful behaviour toward people in ascending generations. This includes not only addressing anyone in an ascending generation with their personal name, or a second person pronoun, but even uttering a personal name to someone else. The naming taboos operate between children and their elder kinsmen, and reciprocally across all three generational levels of affinal relations. A kin term or teknonym is used instead. The naming taboo also applies to the entity from which the name was taken. For example, a man whose brother-in-law is named after a river never utters the river’s name, even when giving directions, replacing it with complex circumlocutions. This taboo persists after the death of the individual, and people continue to use circumlocutions. Speakers have numerous anecdotes relating how this has lead to permanent lexical change. Personal names, or address terms are permitted with consanguineal kin and non-kinsmen in collateral and descending generations. When used as reference terms, personal names take a gender-neutral clitic ʔi= ‘ref’, e.g., ʔi=Dayaŋ ‘Dayang’. For a description of Semaq Beri kinship, see Kruspe (2014, ms). 5.2 Verbs Verbs function primarily as predicates, determining the argument structure of the clause. The unmarked neutral verb is the most frequently occurring form. There is no grammaticalised tense, and clauses are interpreted according to context. 5.2.1 Dynamic and Stative verbs Two major subclasses of verbs are identified: Dynamic verbs and Stative verbs. The fundamental difference between dynamic and stative verbs is that the former are cross-referenced in the verb complex, whereas the latter are not, as illustrated in the two clauses in (75), see §4.1. (75) kɛ jadiʔ tadek, jlaŋ hateʔ kɛ 3sg become Long.tailed.macaque be.long tail 3sg ‘He became a Long-tailed Macaque (and) his tail was long.’

Dynamic verbs are those which denote an action, process, or human propensity. Dynamic is defined here according to Mathews (1997) as a verbal label denoting an action or process as opposed to a state, and not as a superordinate aspectual label. Dynamic verbs include, e.g., ŋɨ̃y ‘to ford’, kɨm ‘to hold’, jon ‘to give’, katam ‘to cut in lengths’, kbəs ‘to die’, mksɛk ‘to be envious’, gor ‘to burn’, kʰɛʔ ‘to know’ and biŋliŋ ‘to listen’.

506

Kruspe

Stative verbs denote states, including existence and non-existence, and properties such as dimension, physical property, value, age, distance, qualification, speed, quantification and colour, e.g., but ‘be decayed’, bcoʔ ‘be fat’, baŋat ‘be quick’, lɛy ‘be real’ and surɑt ‘be red’. They are distinguished from dynamic verbs by their ability to directly modify nominals in their root form, e.g., ʔiwãʔ mahɛ̃t (offspring be.small) ‘a small child’, whereas dynamic verbs require nominalisation, and negated stative verbs are not required to be in the imperfective. 5.2.2 Intransitive and Transitive Verbs Transitivity appears to be of low relevance, and most verbs may be used in transitive or intransitive contexts without any formal marking. Causativisation is the only valence-changing process, see §3.3.2; there are no valencedecreasing derivations, other than nominalisation which produces deverbal nouns, §3.2.1. Both intransitive and transitive verbs feed the imperfective, and causative derivations, where semantically plausible, while only stative intransitive verbs feed derivations like the intensive. Intransitive verbs take a single argument, typically either an Agent or Experiencer. Some intransitive verbs of motion with inherent direction like ʔyot ‘to return’ or jok ‘to break camp’ do not require a preposition to encode the goal of motion. (76) kɛ lpas treŋanõʔ 3sg left.for Terengganu.River ‘He has left for the Terengganu River.’

Transitive verbs have two arguments, typically an Agent and a Patient. The patient of verbs expressing non-prototypical transitive events, like verbs of perception or knowledge, may encode the object with the locative haʔ ‘loc’ as in (79)–(80). (77) kɛ yəs mɔ̃ t haʔ karom kmɛ̃k 3sg put eye loc underneath armpit ‘It puts (its) eye(ball)s under (its) armpits.’ (78) gi jagaʔ smãʔ kbəs, gi pa-mɔ̃ c nãʔ gi 3pl watch.over person dead 3pl caus-bathe agt 3pl ‘They watch over the dead person (and) they bathe (them).’

Semaq Beri

507

(79) ʔəɲ knãl haʔ ja 1sg know loc 2sg.f ‘I know you.’ (80) kɛ nɔ̃ t haʔ ʔəɲ 3sg look.at loc 1sg ‘She looked at me.’

Semantically ditransitive verbs, like verbs of transfer or speech, have an additional argument that may be introduced by the preposition haʔ ‘loc’. (81) kɛ smãɲ haʔ ʔəɲ mamãʔ kɛ 3sg call loc 1sg parent’s.younger.brother 3sg ‘He calls me his uncle.’

There is no evidence to support an analysis of the recipient or addressee as a syntactic indirect object, and they are treated as oblique NPs. On the rare occasion that all three arguments are represented, the recipient NP follows the theme like a locative adjunct, however the recipient may precede the object as in (83). The recipient is often encoded as a possessor or benefactor in the object NP, see (84). (82) kɛ jon lapan ratus haʔ ʔəɲ 3sg give eight hundred loc 1sg ‘They gave me 800 (ringgit).’ (83) ʔəɲ jon ja bras 1sg give 2sgf rice ‘I gave you rice’. (84) ʔah təʔ ʔəɲ jon mãm gi filler that 1sg give starch.food 3pl ‘So I gave (them) their food.’

5.2.3 Existential Verbs There are two existential verbs, daʔ ‘to exist’ and beh ‘to not exist’, which belong to the subclass of stative verbs. beh ‘neg.exist’ also functions as the verbal negator, see §4.1.2. The adverb ŋɑ̃c ‘any’ occurs in negative existential clauses to express complete nonexistence.

508

Kruspe

(85) haʔ jabor, haʔ smbiŋ beh ŋɑ̃ c smãʔ klantan. pahaŋ daʔ loc Jabor loc Lembing neg.exist only person Kelantan Pahang exist ‘At Jabor, at Lembing, there aren’t any Kelantan people at all. There are (people) from Pahang.’

5.2.4 Auxiliary Verbs Ability and the permissive are expressed with the verb kʰɔm ‘be able, to incur’. As a main verb, it is preceded by a pronominal index (86). As a modal verb, the index is absent as in the permissive and prohibitive clauses in (87)–(88). (86) kɛ, kɛ kʰɔm kʔɔr gatiʔ 3sg 3sg incur dart too ‘Him, he was struck by a blow-pipe dart too.’ (87) kʰɔm hɛ jmɔ̃ h bndaʔ ʔasiŋ be.able 1incl name thing other ‘We are allowed to name other things.’ (88) beh kʰɔm gaʔ jmɔ̃ h neg.exist be.able imm call ‘One is not allowed to name (it).’

Obligation is expressed by the verb knãʔ ‘to have to, must’ from the Malay kena ‘to incur’, see example (67). 5.3 Directional Adverbs There are two sets of directional adverbs based on axes of verticality, one mapped in relation to verticality in the land, and the other to watercourses. They typically post-modify verbs of motion, e.g., ʔaŋkit ʔaləŋ ‘Take (it) uphill!’, may co-occur with the adverbials hnɔ̃ ʔ ‘hither’ and hnĩʔ ‘thither’, ʔaləŋ hnĩʔ ‘uphill to there’, derive locative nominal forms, e.g., daləŋ ʔin ‘uphill place there’, and attributive forms that post-modifies nominals, e.g., dəŋ naləŋ ʔin ‘that uphill house above’. The directional adverbs present economy in the language, for instance, if someone has said they are going to swak ʔaləŋ ‘go uphill’, they will take a landbased path, whereas swak ʔanap ‘go upstream’ indicates that they will follow a watercourse. Land-based vertical path ʔaləŋ ‘upwards, uphill’ ʔatʰaʔ ‘downwards, downhill’

Riverine path ʔanap ‘upstream’ ʔatet ‘downstream’

Semaq Beri

509

(89) ʔaləŋ! paŋkah jruh!  upward throw be.high ‘Upward! Throw it (up) high!’ (90) bək haʔ nɔ̃ ʔ, lipat ʔatʰaʔ  fasten loc this fold downwards ‘Fasten it here, (then) fold it downwards!’ (91) ʔarɛh kɛ ʔyot  kɛ wɛn kakas ha=nɔ̃ ʔ, kɛ just.then 3sg return 3sg discard possessions loc=this 3sg low ʔatet pass down.stream ‘As soon as he came home, he left his bags here (and) went downstream (to his in-laws’ house)’.

5.4 Adverbs There is a small class of lexical adverbs expressing manner, aspect, time, degree, frequency and direction. Stative verbs may also postmodify other verbs, effectively functioning as adverbs, e.g., baŋat ‘be quick’ and ʔayɔn ‘be slow, long (of time)’, see §5.2 above. The imminent aspect marker gaʔ ‘imm’, punctual ʔarɛh ‘only then’, experiential pnãh ‘ever’, continuative dɔl ‘still’ and the perfect lɑc ‘perf’ are exclusively adverbs. While gaʔ has a fixed position immediately before the main verb (see (88)), ʔarɛh ‘only then’ is always clause initial, e.g., ʔarɛh kɛ mən (just. then 3sg be.ripe) ‘Only then is (it) ripe’. The adverbs dɔl and lɑc may occur pre- or post-verbally (92), or simultaneously in both positions. The punctual and imminent aspect markers occur together to express the prospective, ʔarɛh gaʔ gi jʔɔy (pros 3pl make) ‘They were just about to make (it).’ The preverbal aspectual, spay ‘recent, just now’, is from the verb spay ‘be new’. (92) kɛ tohɑʔ lɑc, beh dapɑt dɔl 3sg be.old perf neg.exist achieve still ‘He’s already old (and) he still doesn’t get it.’

Other frequently occurring temporal adverbs are the future proximal hanɛ̃h (fut.prox) as in grək hanɛ̃h! (fall fut.prox) ‘You will fall now!’, past proximal nɛ̃ŋ (PST:PROX), and the unanalysable haʔ yɛm (LOC ?) ‘first, before’. Temporal nominals may function as adverbial modifiers at the clause level, as in (93), in which case they tend to occur in clause initial or final position.

510

Kruspe

(93) ʔəɲ swak ʔəɲ bŋãh 1sg go 1sg daytime ‘I go in the daytime.’

Adverbs of degree include stret ‘completely’, as in ponhəl stret ‘completely white’, and ʔamãt ‘very’ and paliŋ ‘most’, both Malay loans. The latter precedes the verb. There is no dedicated comparative or superlative construction. The correlative is expressed with the verb kɔy ‘to follow’, kɔy ʔaleʔ, kɔy balɑʔ (follow be.long follow be.many) ‘the longer the time, the more’. Adverbs of manner—wel ‘again, too’, matəʔ ‘just, only’, gatiʔ ‘too, likewise’— occur in clause final position. Adverbs of frequency include kadaŋ ‘sometimes’, jaraŋ kaliʔ ‘seldom’, direk continuously, snaʔluʔ ‘always’ and snɔ̃ n ‘year after year’. All except the last one are loanwords. 5.5 Prepositions Prepositions form a phrasal constituent with an NP, and function to express circumstantial roles, including location, goal, source, comitative, the instrumental and similarity. Some also encode a subcategorised argument of the verb. A distinction is made between ‘true’ prepositions, and locative nominals, which may function like prepositions. Prepositional phrases typically occur in post-verbal position (94), but they may also occur in clause initial position (95). Prepositions are monosyllabic, and are not derived from other parts of speech. (94) ʔəɲ gaʔ dəm haʔ bri 1sg imm sleep loc forest  ‘(I’m) going to sleep outside.’ (95) haʔ hɛ nɔ̃ ʔ, batew beh loc 1incl this Siamang neg.exist ‘At us here, there are no Siamang.’

Locative haʔ ‘loc’ locates an entity or activity. It is used with adjuncts and subcategorised arguments of the verb such as recipients, see §5.2.2. (96) pa-kʰom haʔ tuŋkuʔ! caus-sit loc hearth.stone ‘Set (it) on the hearth!’

Semaq Beri

511

(97) ʔəɲ smur haʔ kʰoy kɛ  1sg spray loc head 3sg ‘I sprayed (it) on her head.’

The locative preposition can combine with a demonstrative pronoun to create a locative demonstrative. The final glottal stop is inaudible, e.g., haʔ ‘loc’ and təʔ ‘that ⇒ hatəʔ ‘there’. The locative preposition may also combine with a relational noun, see §5.1. Goal gaʔ ‘goal’ expresses the goal (98), or path of motion. In examples (99)–(100) the focus is on the path rather than on the endpoint. (98) kɛ kɔy gaʔ moy ʔatʰaʔ 3sg follow goal mother downstream ‘He followed after his mother downstream.’ (99) ʔəɲ gaʔ cʔəc gaʔ tɔm 1sg imm shit goal river ‘I’m going to shit at the river.’ (100) hɛ kɔy gaʔ bri, plɔm balɑʔ  1incl follow goal forest forest.leech be.many  ‘(If) we follow (a path) through the forest, (there) will be many leeches.’

g  aʔ ‘goal’ is a formative in the following demonstrative adverbs: gahnɔ̃ ʔ ‘hither’ and gahnĩʔ ‘thither’, which are used for non-specific goals. Source tən ‘from’ encodes the source. It is also used to express spatial relations projected from a specified point of reference, e.g., ʔiwãʔ tən nɔ̃ ʔ (child from this) ‘the child on this side’. (101) gi hmos tən kʰoy kɛ, tən jɔŋ kɛ, haʔ gnɔ̃ s kɛ 3pl blow from head 3sg   from foot 3sg loc heart 3sg ‘They blew (the incense) (on her) from her head, from her feet, and on her heart.’

Comitative, Instrumental nãʔ ‘com, ins’ expresses both the comitative (102), and the instrumental (103). nãʔ is also optionally used to cross-reference post-verbal agent NPs, and to coordinate nouns, see §4.1. (102) kɛ ʔyot nãʔ ʔibɛʔ 3sg return com father ‘He went home with (his) father.’

512

Kruspe

(103) gi ʔisiʔ~ʔisiʔ~ʔisiʔ~ʔisiʔ təʔ, gi bək nãʔ kayen 3pl fill~cont that 3pl bind ins sarong ‘They put (his intestines) back in, put (them) back in, (and then), they bound (the wound) with a cloth.’

Similarity The preposition mɔ̃ k ‘like’ co-occurs with nouns and demonstratives expressing equivalence or similarity to an object, or a state of affairs. (104) kɛ jʔɔy mɔ̃ k bawoŋ 3sg do like k.o.catfish ‘It looks like the Baung catfish.’

5.6 Negators The negative existential verb beh ‘to not exist’ functions as the predicate negator, see §4.1.2. The negator ʔiway ‘not’ negates non-verbal clauses, nominals, as in ʔiway gadəʔ ‘not a girl’, and is used to reject a proposition as in (105). (105) kɛ kɔy kreŋ, ʔiway kɛ kɔy but 3sg follow be.dry neg 3sg follow be.decayed ‘(It) becomes dry, (it) isn’t that it becomes rotten.’

5.7 Connectives Almost all connective words are borrowed from Malay. Complex clauses tend to be simply juxtaposed, and the relationship is determined pragmatically. The connectives, which are not used by all speakers, are: tapiʔ ‘but’, the conditional kaluʔ ‘if’, ʔasal ‘if, when’, concessive ʔalaw/walaw ‘even though’, and discourse connectives ʔasal ‘so, in the event’, təʔ pon ‘even so’, ʔarɛh ‘at that point, as soon as’, and sampay ‘until, to the point that’. 5.8 Particles and Interjections Typical of Aslian languages, there are few particle words, see Kruspe et al., this volume. Clausal particles generally occur in final position in the clause, like bəh ‘q’ which marks a clause as a question (see §4.1.3), and the emphatic and imperative lah ‘emph’, a loanword from Malay. (106) daʔ lah, tən bri kɛ kʰom exist ASRT from forest 3sg sit ‘(He) was there. He was sitting toward the outside.’

Semaq Beri

513

Interjections are a minor word class and include the interrogative hãh ‘what did you say?’, the hesitation or self-correcting ʔanũʔ ‘umm’, and the placeholder ʔam ʔay ‘whatever’. (107) nɨ̃y jmatan dlɔŋ kloc tɔm, ʔanũʔ dayal tɔm təʔ nɨ̃y rem.pst bridge wood inside river hes above river that rem.pst ‘Once there was a wooden bridge in the river, umm, over that river once’

5.9 Expressives Another set of lexemes that typically provide detailed descriptions about properties are expressives, e.g., jrahəc ‘appearance of s.th. (like a ghost or Langur) perched above’. They are phonotactically and morphologically unremarkable, typically trisyllabic, but also include sesquisyllabic and reduplicated monosyllabic forms, e.g., jrahɨk ‘appearance of a ghost: bare bones, and empty eye sockets’, pltɔ̃ s ‘to have very large eyes’, and sãw sãw ‘appearance of foliage struck by lightening’. They exhibit structurally regular, but so far unanalysable morphological processes. For example, krcwɛ̃c ‘appearance of a Slow Loris’ curling movement; a snake’s zig-zagging path’ and kraʔwɛ̃c ‘curly, of hair’ suggest a root *kwɛ̃c, or *wɛ̃c, as in the nominal compound lpəc wɛ̃c (alimentary.canal ?) ‘small intestine’. The status of this word class is unclear. They remain a topic of ongoing research. 6

The Lexicon

Semaq Beri has a rich lexicon with monolexemic forms expressing fine-grained semantic distinctions. Verbs may incorporate detailed information about the object, e.g., crɛt ‘to eat meat’, mamãh ‘to eat raw vegetal food’. Some verbs incorporate adverbial information like path or manner, e.g., kcbəc ‘to pole-fish walking along a river bank’ and pgəm ‘to pole-fish sitting on the bank’, and buŋ ‘to dig a small deep vertical hole the length of one’s forearm’, gur ‘to dig laterally the length of one’s arm’, and hawil ‘to dig a large hole while sitting in it’. 7

Sample Text

The speaker SP is applying poison to blowpipe darts and talks about knowledge transfer, learning how to make blowpipe darts, and the danger of mishandling them. Recorded 17 July 2008.5 5 An excerpt from recording SZCDV080719_3, DOBES archive, Max Planck Institute for Psycholinguistics. Transcribed by the author.

514

Kruspe

hɛ broh basɛŋ, pagɑh hɛ bɛs, hɛ 1INCL shoot Dusky.langur be.big 1INCL apply.poison 1INCL broh tupay, mahɛ̃t. hɛ bɛs timãʔ təʔ, tupay shoot.blowpipe squirrel be.small 1INCL apply.poison as.far.as that squirrel bayek lɑc. ʔah basɛŋ . . . basɛŋ mɔ̃ k nɔ̃ ʔ cukop lɑc. be.good perf filler Dusky.langur Dusky.langur like this be.enough perf ʔah təʔ beh kʰɔm hɛ yəs pagɑh. ʔah təʔ slɛmɛ̃ʔ gatiʔ filler that neg.exist be.able 1INCL put be.big filler that always too hɛ jʔɔy ʔay nɔ̃ ʔ~nɔ̃ ʔ. kaluʔ beh hɛ blajar, beh hɛ 1INCL make thing this~red if neg.exist 1INCL learn neg.exist 1INCL kʰɛʔ, ʔah təʔ, hɛ blajar btol, hɛ kʰɛʔ. ʔah təʔ ʔibɛʔ ʔəɲ know filler that 1INCL learn be.true 1INCL know filler that father 1sg kɛ blajar, ʔəɲ kʰɔm blajar. ʔah təʔ kɛ cɔ̃ c blɔr, ʔəɲ jʔɔy 3sg learn 1sg be.able learn filler that 3sg pierce weight 1sg do ʔəɲ cɔ̃ c . kɛ klos blɔr kɛ, ʔəɲ klos. ʔah kɛ thur, ʔəɲ thur. 1sg pierce 3sg trim weight 3sg 1sg shape filler 3sg sand 1sg sand ʔah təʔ kɛ bɛs, kɛ cʔɔŋ, ʔəɲ cʔɔŋ, kɛ bɛs, ʔəɲ filler that 3sg apply.poison 3sg roast 1sg roast 3sg apply.poison 1sg bɛs. ʔah təʔ kaluʔ beh hɛ kʰɛʔ, kɛ hoŋ gaʔ apply.poison filler that if neg.exist 1INCL know 3sg fall goal smãʔ hɛ gtah hanɛ̃h hɛ bul. ʔah təʔ human.body 1INCL sap fut.prox 1INCL be.poisoned filler that hɛ bul. smãʔ tən hnãn 1INCL be.poisoned person from where

pon beh ʔəɲ kʰɛʔ even neg.exist 1sg know

lah. kɛ gaʔ broh tupay, kɛ emph 3sg imm shoot.blowpipe squirrel 3sg

ʔen go

haʔ pisaŋ loc banana

kɛ. kɛ ʔen haʔ pisaŋ nɛ̃ŋ, kɛ yəs ha=nɔ̃ ʔ kɛ. 3sg 3sg go loc banana pst.prox 3sg place loc=this 3sg

Semaq Beri

515

kɛ. yəs ha=nɔ̃ ʔ, cɔ̃ c. cɔ̃ c, beh daʔ mũy menẽt, kɛ kbəs. 3sg place loc=this pierce pierce neg.exist exist one minute 3sg die beh dan kɛ gaʔ broh tupay. ʔah təʔ. neg.exist experience 3sg imm shoot.blowpipe squirrel filler that (When) we blowpipe Dusky Langur, we apply a really big amount of poison. (When) we shoot squirrels, a little. We apply the poison this much, (and) it’s good for squirrels. Ah the Dusky Langur . . . (For) the Dusky Langur, like this (it) is already enough. Ah we can’t put a lot. Ah, so always just like this, we’ve done this thing. If we don’t learn, we won’t know (how to do it). Ah, if we learn properly, we’ll know. My father learnt (and so) I was able to learn. Ah (when) he skewered the weight on (to the shaft), I’d do (it), I’d skewer (it onto it). He trimmed the weight, (and) I trimmed. Ah, he sanded (it) (and) I sanded. He applied the poison and heated (it over the fire) (and) I heated (mine). He applied the poison, (and) I applied the poison. Ah, if we don’t know how, (and) the poison falls onto our body, we’ll be poisoned. Ah, we’ll be poisoned. There was a person, from where I don’t know, (and) he wanted to blowpipe squirrels. He went to a banana plant. He went to a banana plant, where’d he put (the quiver) here (like this). He’d put (it) here. Pierced! It pierced (him) and there wasn’t one minute (and) he was already dead. He didn’t get to blowpipe squirrels, you see.

8 Bibliography Benjamin, Geoffrey. 1976. Austroasiatic subgroupings and prehistory in the malay peninsula. In Philip N. Jenner, Laurence C. Thompson and Stanley Starosta (eds.) Austroasiatic Studies Vol. 1 (Oceanic linguistics, Special Publication, 13.) Honolulu: University of Hawai’i Press, 37–128. Burenhult, Niclas. 2005. A grammar of Jahai. Canberra: Pacific Linguistics. Diffloth, Gerard. 1975. Les langues mon-khmer de Malasie: classification historique et innovations. Asie du sud-est et monde insulinde, 6.4: 1–19. Diffloth, Gérard. 1976. Jah-Hut: an Austroasiatic language of Malaysia. In Nguyen Dang Liem (ed.), South-east Asian linguistic studies 2. Canberra: Pacific Linguistics, 73–118. Dunn, Michael, Niclas Burenhult, Nicole Kruspe, Sylvia Tufvesson & Neele Becker. 2011. Aslian linguistic prehistory: A case study in computational phylogenetics. Diachronica, 28: 291–323. Endicott, Kirk M. 1975. A brief report on the Semaq Beri of Pahang. Federated Museums Journal (new series), 20: 1–23. Evans, I. J. N. 1915. Notes on some aboriginal tribes in Pahang. Journal of the Federated Malay States Museum, 5.4: 192–219.

516

Kruspe

———. 1920. Further notes on the Aboriginal tribes of Pahang. Journal of the Federated Malay States Museum, 9.1: 16–33. Kruspe, Nicole. 2004a. A grammar of Semelai. Cambridge Grammatical Descriptions. Cambridge: Cambridge University Press. ———. 2004b. Adjectives in Semelai. In R. M. W. Dixon and A. Y. Aikhenvald (eds.) Adjective Classes: a cross-linguistic typology. Oxford: Oxford University Press, 283–305. ———. 2010. A dictionary of Mah Meri as spoken at Bukit Bangkong. (Oceanic Linguistics Special Publication no. 36). Honolulu: University of Hawaii Press. ———. 2011. Reciprocals in Mah Meri. In Evans, Nicholas, Alice Gaby, Stephen C. Levinson and Asifa Majid (eds.), Reciprocals and Semantic Typology. Amsterdam: John Benjamins, 149–162. ———. 2014ms. Semaq Beri kinship terminology: a preliminary account. ———. In progress. A grammar of Mah Meri. Kruspe, Nicole, Niclas Burenhult and Ewelina Wnuk. 2014. Northern Aslian. (this volume, 419–474). Kuchikura, Yukio. 1987. Subsistence ecology among Semaq Beri Hunter-gatherers of Peninsular Malaysia. Hokkaido Behavioral Science Report Series E, No. 1. Sapporo: Hokkaido University. Needham, Rodney. 1974. Some ethnographic notes on Semelai in Northen Pahang. Journal of the Malaysian Branch of the Royal Asiatic Society, 47: 123–129. Nik Safiah Karim and Ton Binti Ibrahim. 1979. ‘Semoq Beri: some preliminary remarks.’ Federated Museums Journal, New Series, 24: 19–38. Skeat, Walter William, and Charles Otto Blagden. 1906. Pagan races of the Malay Peninsula. 2 Vols. London: McMillan & Co.

section 2 Monic

․․

chapter 3

Old Mon Mathias Jenny and Patrick McCormick 1 Background Old Mon (autonym rmeñ), together with Old Khmer, is one of the two Austroasiatic languages for which we have written records going back furthest in time. Inscriptions in Old Mon first appear in the sixth century AD, and records of the language, with some gaps, exist through to the present day. While the number of inscriptions in Old Khmer is greater than that of Old Mon, for Old Mon we have several lengthy and substantial texts, such as the Shwezigon and Kyanzittha inscriptions of Pagan (late 11th or early 12th century). Because inscriptions are the sole source of information for the Mon language until quite recent centuries, most scholars tend to treat them as a unit without breaking them into Old Mon and Middle Mon. While such a division is inevitably arbitrary, this chapter deals only with the Old Mon period, which ends around the thirteenth century AD. Beginning with the Lamphun inscriptions of the thirteenth centuries, inscriptions after this date start to offer evidence of some of the sweeping sound changes that differentiate Old from Modern Mon, reflected in part by orthographical changes. We provide here a brief overview of the state of research related to the Old Mon language and its speakers before turning to a description of the language, its word and clause formation, and sample texts. The Old Mon language is associated with the two modern nations of Thailand and Burma (Myanmar). While today the bulk of Mon speakers live in Burma and Mons tend to think of Burma as their ancestral homeland, in fact the earliest Old Mon inscriptions appear in what is now Central Thailand in the sixth century. Over the course of the next centuries, Old Mon inscriptions then appear in northern Thailand in places such as Lamphun during the eighth and ninth centuries, and later northeastern Thailand in the tenth and eleventh centuries. Despite occasional claims to the contrary, the Old Mon language does not appear in Burma until the eleventh century. When it does, it covers a widelydispersed area from the kingdom of Pagan (ninth-thirteenth-centuries AD) in the north, down through Lower Burma and beyond to Tavoy (Dawei) along the Tenasserim coast. Precisely because Old Mon inscriptions are so widely

© koninklijke brill nv, leiden, ��5 | doi ��.��63/9789004283572_009

520

Jenny and McCormick

dispersed, possibly also into what is now Laos, this means that not all these areas were necessarily Mon speaking. Speakers of local Austroasiatic or other languages may not have had writing, or wanted to use a supra-local, culturally or religiously prestigious language such as Old Mon. Such a scenario may explain the otherwise unexpected presence of the Old Mon language in Pagan. As is the case for Old Khmer, much of the contents of the Old Mon corpus concerns various donations and offerings to temples and the like. Unlike for Old Khmer, however, there is little that offers us a glimpse of statecraft that would help us understand the nature of contemporaneous Mon-speaking society. Indeed, the Shwezigon and Kyanzittha’s Palace Inscriptions tell us much more about life in Pagan, presumably a Burman society. In what is now Thailand, there appear to have been numerous Mon city states, commonly lumped together under the name Dvāravatī (Wales 1969). It is important to keep in mind that this term may represent a modern notion of a unified past, rather than a coherent society based on large-scale agricultural production similar to Angkor. For Burma, a lively debate has arisen regarding the nature of Mon-speaking societies of Lower Burma and Central Thailand during the first millenium. A generation of colonial-era scholars, working in the nineteenth and twentieth centuries, argued that a large Mon state rivalling Pagan and Angkor existed in Lower Burma during the first millennium AD, with its capital at Thaton (Harvey 1925, Guillon 1999). Recently, historian Michael Aung-Thwin (Aung-Thwin 2005) has argued that the written evidence for a state in Thaton at that time is based on later additions to the written record beginning in the eighteenth century, and that the archaeological evidence is equivocal at best. Firm evidence for large Mon polities in Burma, according to Aung-Thwin, appears only later during the second millennium, and given the proximity of Mon-speaking towns to the sea, these polities were based on trade, not large-scale agriculture. In any case, outside of Pagan and Upper Burma, little sustained archaeological work has been done in Burma, especially in Lower Burma. Our state of knowledge, especially in terms of the state of the Old Mon inscriptional corpus, remains largely the same today as it was in the early 1970s, when Harry Shorto, father of ‘Mon Studies’, published his A Dictionary of the Mon Inscriptions: From the Sixth to the Sixteenth Centuries, a work which covers both Old and Middle Mon and which provides the most complete references to preceding works. In the intervening decades, however, developments in the study of Austroasiatic languages, together with continuing research on the Old Mon inscriptions, not to mention in the wider field of linguistics, have allowed us to refine the work of an earlier generation of scholars. Shorto’s work itself was based on the pioneering scholarship of figures such as Charles

Old Mon

521

Blagden, Taw Sein Ko, Charles Duroiselle, Georges Cœdès, and Gordon Luce, all of whom were closely involved in the archaeological exploration of Burma and Siam in one way or another. The corpus of Mon inscriptions was published in five volumes starting in 1919 as Epigraphia Birmanica, which provides pictures of the original inscriptions together with transcriptions and translations. In Burma, these transcriptions have been reprinted in Chit Thein’s Mun Kyauksa Paungyout [Collection of Mon Inscriptions] (Chit Thein 1965). While scholars inside Burma, including Nai Pan Hla, have continued to study Mon Inscriptions, few new inscriptions have come to light there in recent decades. Similarly, because of difficulties of access, few foreign scholars have worked directly with Old Mon in the country. This is not to say that nothing has happened since Shorto. Both Gérard Diffloth (Diffloth 1984) and Michel Ferlus (Ferlus 1984) have worked on the reconstruction of the phonology of the Mon language from the time of Old Mon through the present. Ferlus has reconstructed the sound changes of the Mon language from its oldest to its modern forms. Diffloth has studied the Nyah Kur language, still spoken by perhaps a little over a thousand people primarily in Chaiyaphum province in Thailand. Nyah Kur appears to be an offshoot of Old Mon, although Nyah Kur forms suggest descent from a close, common predecessor to both Old Mon and Nyah Kur. Both Diffloth and Ferlus take into account Nyah Kur data in their analyses. Speakers of Nyah Kur have not been in contact with Mon speakers for probably centuries and do not consider themselves Mon, yet their language preserves features of Old Mon that later forms of the Mon language do not. Meanwhile, Christian Bauer (1990) has written about possible Old Mon loanwords in Thai and about how spelling variations in the Old Mon corpus may suggest dialectal differences. Thai scholars and scholars working in Thailand have continued to expand our knowledge of the Old Mon language, inscriptions, and contemporaneous society. As part of a series on the inscriptions of Thailand, Kongkaew Wiraprachak (Wiraprachak 1986) of the National Museum in Bangkok has printed a compendium of all the previously published work on the Mon inscriptions of that country. Uraisri Varasarin (Varasarin 1988) has written about several newly-discovered Old Mon inscriptions from northeast Thailand. Prapod Assavavirulhakarn and Peter Skilling (2006–2007) have argued that at least one inscription, previously considered to be Old Mon, is in fact in an Indic language. Prapod and Skilling remind us of the need for caution in attributing everything old in Thailand to the Mons, a tendency which finds many parallels in the scholarship of Burma. Most recently, Hunter Watson (Watson 2014), an MA student in the Faculty of Archaeology at Silpakorn University in Bangkok, has prepared an inventory and compendium of the Old Mon inscriptions of

522

Jenny and McCormick

Thailand. In the process, Watson has discovered several dozen artifacts with Mon inscriptions in Chiang Mai, Nakhon Sawan, Khon Kaen, Mahasarakham, and Kalasin, all of which appear to have been otherwise undocumented. Among a wider audience, many scholars are interested in comparing Old Mon and Old Khmer. A glance at a family tree of the Austroasiatic languages shows that the two languages are not particularly close, although they do have a clear family resemblance. For example, throughout its history, Khmer has allowed more diverse initial consonant clusters and has had a more complex system of vowels than Mon. While Old Mon had a rich set of initial consonant clusters and presyllables, many of these have simplified and eroded into the modern language. A nice comparison is the Old Mon infixed form tuṁwāy [təmwaj] ‘gift, offering,’ whose cognate in pre-Angkorian Khmer is taṅhvāy [ɗəŋwaːj]. The modern reflexes are Mon kəwaj compared with Khmer tɑŋwaːj. Despite the long-term contact that historical and archaeological evidence suggests must have existed between speakers of the two languages in what is now Thailand, there is not much evidence of either direct borrowings between the languages (but Bauer 1992 provides speculation on the topic), nor of the replication of syntactic patterns between them. Nor is there evidence that the speech of the region was a ‘creole’ or some kind of mix of Mon, Khmer, and Thai (despite Khanittanan 2004). Following long-term contact patterns over the past several centuries, Mon has replicated many features of Burmese syntax (McCormick and Jenny 2013), whereas Thai and Khmer have formed a similarly close dyad (Huffman 1973). The following sections give an overview of the main phonological and grammatical characteristics of the Old Mon language, mainly based on the epigraphic corpus found at Pagan. The language of the Old Mon inscriptions differs in many points from Modern Mon (see the chapter on Modern Mon in this volume), both in phonology and grammar. Weak syllables have become (even more) reduced, derivational affixes have lost their productivity, a number of functional words have been replaced, and the clause structure has become more similar to the structurally very different Burmese in the case of Mon in Burma, or Thai in Thailand. 2 Phonology Old Mon is written in a script of Indian origin, in which the sounds of the language can be represented only with limitations. The actual pronunciation of Old Mon words can be deduced from spelling variants as well as from comparison of modern Mon dialects and the closely-related other Monic language, Nyah Kur (Shorto 1971, Diffloth 1984). The correspondence of the

Old Mon

523

orthography with the pronunciation is thus not always straightforward. In the following sections, Old Mon is represented in the traditional Indic transliteration system as applied in the Epigraphia Birmanica and by Shorto (1971), with one exception: the glottal stop is written with the IPA symbol ʔ, rather than with ’. The spelling of many words varies, sometimes within the same inscription. In the descriptive text of this chapter, not all variants are given, especially if they are easily recognizable. The Old Mon spelling of the published inscriptions is generally retained. The phonemic form is given when the assumed pronunciation is not directly evident from the orthography. Here the general practice of using IPA symbols of this volume is followed, deviating in some cases from Shorto’s (1971) transcription. 2.1 Word and Syllable Structure Old Mon words generally consist of one syllable, which may be preceded by a weak syllable. The onset of the main syllable consists of a simple consonant or a cluster of no more than two consonants. Weak syllables generally only allow simple consonant onsets. The nucleus of the main syllable can be any vowel. Weak syllables have the neutral central vowel ə as nucleus, or, alternatively, syllabic nasals or r (according to Diffloth’s (1984) reconstruction). Only one consonant is allowed in the coda of any syllable. Old Mon words obligatorily end in a consonant. Open syllables only occur as weak syllables and in grammatical morphemes, not in major lexical categories like verbs and nouns. All content words have the minimal structure CVC. This restriction also applies to early Indic loans, such as pūjāw ‘worship’, from Pali pūjā ‘worship’ and dewatāw ‘god, deity’ from Pali devatā ‘deity’. Orthographic open syllables with a written short vowel are pronounced with a final glottal stop, as in pa [paʔ] ‘do’.1 With the influx of larger numbers of loans from Pali and Burmese, and the loss of some final consonants in Middle Mon, words ending in open syllables become common after Old Mon. Polysyllabic words in Old Mon are mostly either loans from Sanskrit or Pali, or compounds. The word and syllable structure of Old Mon can be summarized as follows, where C represents any consonant, V any vowel, and c one of the possible final consonants. (Cə(c)-)C(C)Vc, for Dvāravatī Old Mon possibly also CN̩ -CCVc.

1 The only Dvāravatī Old Mon noun reconstructed as ending in a long vowel is *ʔədaː ‘duck’ (not attested in the inscriptions; see Diffloth 1984: 68), which is an apparent loan from Khmer or another (eastern) Austroasiatic language. A few Pali loanwords retain final open syllables, such as kulo ‘kinsman’.

524

Jenny and McCormick

2.2 Phoneme Inventory Based on a comparison of Old Mon and Nyah Kur dialects, Diffloth (1984) reconstructs the following phonemes for Dvāravatī Old Mon, the common ancestor of Nyah Kur and Mon, which he takes to be a stage preceding the oldest documents of the language. 2.2.1 Consonants Table 3.1 Consonants

Voiceless stop Voiced stop Implosive Nasal Fricative Approximant

p b ɓ m w

t d ɗ n s r, l

c ɟ

k g

ɲ

ŋ

j

ʔ

h

Shorto (1971: xiiiff) has an additional series of prenalized stops in his phonology of Old Mon as represented in the inscriptions. His analysis is based on such spelling variants as ʔba, mba and ʔamba ‘father’, which all probably represent [mbaʔ], with the initial monophonemic, rather than a cluster, as shown by its morphological behavior. The glottal stop must be considered a full consonant phoneme in Old Mon in all positions. The initial glottal stop is more than just a vowel support, as shown by forms like sʔār ‘will go’, from ʔār ‘go’, where the irrealis prefix s- would otherwise expected to attach directly to the vowel, yielding a non-attested form *sār. The orthographic representation as used in the transliteration differs from the phonemic representation in the following cases (see Table 3.2).2 Old Mon allows a number of initial consonant clusters. As suggested by the development in Modern Mon and Nyah Kur, some of these clusters, especially of the type stop+stop and stop+nasal, may actually have been pronounced with an epenthetic (non-phonemic) schwa, as in tkaʔ [tkɔʔ~təkɔʔ] ‘island’, 2 The character ḅ does not appear in the Indian scripts and was invented by the Mon to represent the implosive ɓ. It was originally the letter b with a dot added, but in the modern writing it looks like w with a dot in the middle. The Indian retroflex ḍ was used to represent implosive ɗ. This is the only orthographic retroflex consonant that appears in indigenous words and is consistently distinguished from its dental/alveolar counterpart.

525

Old Mon Table 3.2 Orthography and pronunciation—consonants Orthography

Pronunciation

ṅ c j ñ y ḍ ḅ s; ṣ (after k)

ŋ c ɟ ɲ j ɗ ɓ s

while others, such as stop+glide, represent real clusters, as in kloñ [kloɲ] ‘make, do’ and twāñ [twaɲ] ‘village’ (see Diffloth 1984: 304ff). Orthographic aspirated stops (kh, ch, ṭh, th, ph) occur in Old Mon, but they are phonologically clusters of the type stop+h, rather than simple phonemes, at least in indigenous words. This analysis is supported by the fact that morpheme boundaries may occur between the stop and h, as in phāp ‘feed’ (also spelled p-hāp), causative of hāp ‘eat’, that infixes are inserted between the stop and h as in buhic ‘frighten’, causative of phic ‘be afraid’, and that aspirated stops do not occur in clusters.3 Orthographic voiced aspirated stops (gh, jh, ḍh, dh, bh) occur in Indic loanwords and were probably pronounced as clusters of voiced stop+voiced ɦ, or as voiced aspirated stops bʱ, dʱ, etc. This is supported by the fact that these initials develop into the same register as voiced initials in later stages of the language and retain the aspiration. In Modern Mon, they are pronounced as aspirated voiceless stops with second (breathy) register, as in phɛ̀əsa ‘language’ from bhāsā. Pre-Old Mon appears to have had clusters containing voiced stops and h-, probably pronounced [ɦ] in this context. These clusters are rendered by voiceless aspirated stops in the orthography, but voicing appears when infixes are added, as in thic [thɤc] ‘be good, proper’, attributive duṁhic ‘good, proper’, and thatta4 [thɔt] ‘severe, strong’, nominalized dirhat [dərhɔt] ‘vigor, power, strength’. In these cases the Modern Mon 3 Clusters of the type Ch-glide appear in Middle Mon, and in Indian and Burmese loans the aspirated stops are probably to be considered monophonematic. 4 This spelling reflects the usage of Haribhuñjay (Lamphun), which doubles the final consonants rather than adding the vowel silencer virāma above the final consonant.

526

Jenny and McCormick

reflexes often show first register (pointing to an original voiceless initial) in the base form and second register (pointing to an original voiced initial) in the derivate. The case of chāy ‘be beautiful’ with the derived noun cirhāy ‘beauty’ shows that the voicing is not a general morphophonemic process applying to C+h clusters with a derivational infix. In final position, only the voiceless stops, nasals, fricatives, and approximants are found. Voicedness distinctions are obviously neutralized in the coda, and implosives are excluded from this position. In line with areal tendencies and the situation in Modern Mon and Nyah Kur, final consonants were probably non-released. The spelling of Indian loanwords retains the voiced stops in final position, but the pronunciation is probably the same as the corresponding voiceless stop. 2.2.2 Vowels The vowel system of Old Mon is asymmetrical, as not all vowels occur with all finals. The same situation also holds in Modern Mon (see chapter Mon in this volume). The following vowels are considered phonemic by Diffloth (1984: 298ff), who also takes vowel length as distinctive in Dvāravatī Old Mon (Table 3.3). In addition, syllabic nasals and r occur as nucleus of weak syllables, the former generally homorganic with the following consonant. Table 3.3 Vowels (Dvāravatī Old Mon, following Diffloth 1984)

i iː e eː ɛ ɛː ia

ɯ ɯː ə əː a aː

u uː (o) oː ɔ ɔː ua

In his work, Shorto (1971: xviiff) analyzed the vowel system differently, providing an analysis that does not include phonemic length distinction. Because of the continued salience of his work, we reproduce his scheme here. In addition to the vowels listed below, a central neutral vowel ə occurs in weak syllables. Yet another way to understand the Old Mon vowels is to look at their representation in the native Indic writing system, which orthographically distinguishes only five vowels (a, i, u, e, o) and two diphthongs (ai, au), and could not represent the vowel system of Old Mon adequately. This has led to a rather complex system of interplay between written vowel and final consonant.

527

Old Mon Table 3.4 Vowels (Epigraphic Old Mon, following Shorto 1971)

i e

ø [ɤ]5 a ai

ɯ

u o ɔ

The following table (3.5) summarizes the spellings of Old Mon vowels, but see Shorto for an earlier analysis (Shorto 1971: xv11 ff). Table 3.5 Orthography and pronunciation—vowels ( following Shorto 1971) Pronunciation

Orthographic rendering

i u e ɤ

i, ī6 u, ū e; e, i (before h) a, i, u, e, ei (not before palatals); i, ī, e (before palatals), o (in the word poy ‘we’) u, ū, o, uo (before velar and laryngeal final) o ā (not before laryngeals and velars); a (before laryngeals and velars) o (before velars, ʔ, h, and w); a (before dentals and labials); a, e (before palatals); e, eai (before y)7 ā, e (before velars)8 a, i, u (in weak syllables)

ɯ o a ɔ ai ə

5 Shorto uses ø to transcribe the central vowel. Based on the areal context and the development in the later stages of the language, we prefer the transcription ɤ, which seems more likely to represent the actual sound. 6 It is not clear whether length was contrastive in Old Mon. Marking of length may represent an adaptation of a foreign writing system to Mon phonology and may not have marked an actual phonemic contrast. 7 From Middle Mon onwards, the vowel ɔ is written with a symbol of the same shape as final ʔ or m (the Indic ṁ or anusvāra) before velar finals. 8 Possibly the actual pronunciation of this diphthong was a low or mid-high vowel followed by a palatal offglide, as is still the case in Modern Mon varieties, where orthographic kyāk is pronounced as kjac [ʨʲaʲc ~ ʨʲaʲk].

528

Jenny and McCormick

In Middle Mon, the vowel ɤ is represented by the digraph traditionally transliterated as ui.9 This spelling was probably introduced by the Burmese to represent a central vowel ɤ and later taken over by the Mon to write a similar vowel sound. The fact that the digraph for the central vowel occurs in Old Burmese from the first dated documents but does not appear in Mon until the Middle Mon period two centuries later suggests a Burmese, rather than Mon origin of this innovation.10 The Indic diphthongs ai and au are not used in Old Mon orthography.11 A number of sound changes separate Middle and Modern Mon from Old Mon. These developments are described by Shorto (1971), Diffloth (1984), and Ferlus (1986). 2.3 Suprasegmentals The devoicing of initial stops which led to the arising of register distinction in Modern Mon had not yet occurred in Old Mon. There is therefore no indication of phonemic voice quality or pitch distinctions. Word (and possibly phrase) stress appears to have been iambic as shown by the reduction of initial syllables, as in Old Mon bihār [bəhar] ‘monastery’, which becomes bhā [bha] in Middle Mon. 3

Word Formation

Word formation in Old Mon is mainly based on derivational morphology, with a number of productive prefixes and infixes. Compounding also occurs, in some cases resulting in shortening of one part of the compound. Reduplicated forms, apart from partial reduplication as infix support, appear less frequently in the inscriptions than in the modern language, which may be due to the genre of the language recorded rather than to an actual difference in the language 9 A transliteration as iu is probably more appropriate, also representing the actual order of writing and spelling in Burma. Alternatively, the digraph could be rendered by a single symbol in transliteration, parallel to o, which in Indic writing is composed of e and ā. The rendering as ɤ or ɯ would probably be closest to the actual pronunciation in Old Burmese and Old Mon. 10 There are a few words where the digraph may be used in 12th century Old Mon inscriptions, but the reading is not certain. 11 The one exception is the rare spelling treaiy or treai for the more common trey [trɔj] ‘noble, exalted, holy’.

529

Old Mon

structure. Reduplication in Modern Mon is frequently found in colloquial and poetic style, rather than in heavily formalized language as found in the inscriptions. 3.1 Derivational Affixes Old Mon uses prefixes and infixes, most commonly on verbal bases, to express different notions, such as nominalization, attributive, causative, among others. The function of some affixes is not entirely clear. Commonly found are the following affixes. Table 3.6 Derivational affixes Affix

Function

context

Base

Derivate

pa-, pu-, p-u-m-uṁ-um-irCred-irr-n-in-11 -w-uw-

caus caus attr attr attr nml nml nml nml nml nml nml

simple initial initial cluster simple initial initial cluster labial initial initial cluster simple initial initial h simple initial initial cluster simple initial initial labial

das ‘be’ graṅ ‘know’ goṅ ‘be brave’ jnok ‘be big’ pa ‘do’ jnok ‘be big’ das ‘to be’ hāp ‘eat’ sāl ‘spread, lay out’ gruṅ ‘laugh’ til ‘to plant’ pa ‘do’

p-das ‘create’ graṅ ‘tell, explain’ goṅ ‘brave’ jnok ‘big’ pa ‘doing, who does’ jnok ‘extent’ d-das ‘existence’ r-hāp ‘food’ sāl ‘mat’ gruṅ ‘laughter’ til ‘cultivable land’ pa ‘deed’

That the derivational processes were productive in Old Mon is shown by their applicability to borrowed roots, as in smis ‘residue, exception’, attributive of the verb sis ‘to remain’ from Sanskrit śiṣṭa ‘remaining’. Some irregular affixes occur, such as the puas nominalizer in pumic ‘wish, desire’ from the verb root mic ‘to wish, want’. Combinations of two infixes are also found in cirna ‘food’ from ca ‘to eat’, which combines the nominalizing infixes -ir- and -n-, and juṁwin ‘present’, derived from jin ‘to make over, give’ by application of the nominalizing infixes -in- (-uṁ- before labials) and -w-.

530

Jenny and McCormick

Frequentatives, denoting continuous or repeated actions are formed from what appears to be causative bases by the addition of the infix -in- or before labials, -uṁ-. Of unclear function are the rare infixes -i- and -ir-, which Shorto (1971: xxiv) labels “frequentative, but their function is far from clear”. Examples of this derivation type given by Shorto (1971: xxiv) are kirloḥ ‘penetrate’ from an unattested root *kloḥ ‘penetrate’,12 and biḅat ‘test, try’, from ḅat ‘to match’, with reduplication of the root initial as infix support. The form tirṅey ‘daily’ looks like a frequentative formation of this type from the nominal base tṅey ‘sun, day’. Diffloth (1984: 263ff) gives an overview of derivational affixes of Dvāravatī Old Mon, reconstructed in the basis of comparison of epigraphic Old Mon with Nyah Kur. This reconstructed system differs in a few points from the documented stage of Old Mon, though there are no fundamental differences. 3.2 Compounding Compounding of nouns is common in Old Mon, involving both indigenous material and Indic loanwords. The latter may have been borrowed as compounds into Mon, as they usually exhibit the head-final structure typical for Pali and Sanskrit. In compounds made up of indigenous lexemes, or of indigenous words mixed with loan words, the distinction between lexical compounds and phrases is not always clear, and it may be the case that most noun-modifier constructions are phrases rather than compound nouns. Borrowed compounds are exemplified by the following. Compound form: mhāther, mahāther ‘senior monk’ bodhisat ‘Bodhisatva, Buddha-to-be’

Component Elements: Pali mahā ‘big’, thera ‘monk’ Pali bodhi ‘enlightenment’, satta ‘being’

Indigenous compounds (or phrases) are the following. Compound form: Component Elements: kon-brāt-tḅow ‘young sugar-cane kon ‘offspring’, brāt ‘banana’, tḅow banana’ ‘sugar-cane’ kon-lwūt ‘young daughter’ kon ‘offspring’, lwūt ‘virgin’ ñaḥ-brow ‘woman’ ñaḥ ‘person’, brow ‘female, woman’ 12

The base verb is attested in literary Mon as kloh, kluih, Spoken Mon klɑh ‘penetrate, pierce, understand’.

Old Mon

531

Hybrid compounds consisting of indigenous and borrowed material are the following. Compound form: Component Elements: tṅey-ʔādittawār ‘Sunday’ Old Mon tṅey ‘sun, day’, Pali ʔādittawār ‘Sunday’ kyāk-srī, kyāk-śrī ‘goddess of fertility Old Mon kyāk ‘holy object’, Sanskrit and luck; glory’ śrī ‘glory’ sṅi-dān ‘alms-hall’ Old Mon sṅi ‘house’, Pali dānaṁ ‘offering’

In one case, a noun phrase has become lexicalized and lost its transparency synchronically, though the non-contracted form also appears in the inscriptions: triley, tirley (and other variant spellings) [tərlɔj] ‘my lord’, from tralaʔ, trilaʔ (and other spellings) ‘lord, master’ and ʔey ‘1sg’. The full form tralaʔ ʔey ‘my lord’ is found sporadically. Verbal compounds are equally difficult to distinguish from verb concatenations (serial verb constructions), and are treated here as phrasal, rather than lexical. Some collocations may have been frequent enough to become lexicalized, but the boundaries are not clear and the material available in the corpus is too limited to reach a decision in this respect. 3.3 Reduplication Full reduplication is very rare in the Old Mon inscriptions, but was presumably a feature of the spoken language, as shown by its frequency both in Modern Mon and Nyah Kur. In any case, reduplication is a widespread areal feature. As seen above, partial reduplication of a simple initial occurs as support for those infixes that require onsets consisting of two consonants. 4

Phrase and Clause Structure

Old Mon is basically head-initial, that is, in most cases modifiers follow the element they modify. A number of quantifiers precede the noun they quantify. 4.1 Simple Sentences The constituent order in simple sentences is SV/AVP. Omission of known or retrievable arguments is frequent, and objects may be fronted for pragmatic reasons (topicalization, focusing). The alignment of grammatical relations is accusative, that is, the single argument of intransitive clauses and the agent of

532

Jenny and McCormick

transitive clauses are treated identically and can be subsumed under the term ‘subject’. 4.1.1 Intransitive Clauses In intransitive clauses, the subject usually precedes the verb. 4.1.2 Transitive Clauses Though the unmarked word order is AVP, object-initial order is not uncommon and subject-final word order is found in some instances. (1) yaṅ ḍik top servant

suṁʔur trūs puma bad male attr.do

sanraṅ guṁloṅ act attr.many

ma rel

sak tim tarla neg.exist know lord ‘bad male servants13 who do things which the lord does not know’ (Shwezigon, EB i)14

4.1.3 Ditransitive Clauses The recipient in ditransitive clauses, if overtly expressed, is normally marked by the oblique preposition ku ‘to’ or the dative ta ‘to’. The verb kil, kul, kel [kɤl] ‘give’ often is added to the main predicate to express benefactive a relation, also if the recipient is not overtly expressed. 4.1.4 Imperatives The imperative can be formed by the verb alone, without any marking. The addition of the focus marker da, originally a shortened form of the verb das [dɔs] ‘to be’, is frequent and is probably used to make the imperative less direct. (2) smiṅ dewatāw kuṁ rmiṅ da. king god 2 hear foc ‘Hear, king of gods!’ (Shwezigon, EB i)

The permissive/jussive verb ʔor ‘let’ can be used to introduce imperatives directed to the second or third person. This appears to be used especially when the predicate is a non-volitional, non-control verb. 13 Translations of this term range from ‘slave’ to ‘servant’ to something like ‘bonded laborer’ or ‘underling’. We have translated the term as ‘servant’ throughout. 14 The sources of the examples are given by the name of the inscription and, where applicable, the number in the Epigraphia Birmanica (EB).

Old Mon

533

(3) ʔut bnaḥ ʔor graṅ da. all 2pl let know foc ‘May all of you know!’ (Shwezigon, EB 1)

The preverbal prohibitive particle laḥ [lah] is used to express negative imperatives directed to the second person. If the subject is overtly expressed, it appears before the prohibitive particle. (4) tarley laḥ kḍas.mleʔ crihāy kirmun goḥ my.lord proh reject splendor kingship medl ‘Do not reject the splendor of kingship, my Lord.’ (Shwezigon, EB i) (5) tarley dūṅ rirʔāc ʔey da, laḥ pa cit cʔāṅ. my.lord receive nml.beg 1sg foc proh do heart other ‘Accede to my request, my Lord; do not be contrarily disposed.’ (Shwezigon, EB i)

If the prohibitive is directed to a person other than the addressee, laḥ ‘don’t’ is combined with the permissive/jussive ʔor ‘let’, which occurs before the subject of the clause, which is syntactically subordinate to laḥ ʔor ‘don’t let’. In the following sentence the P argument of the subordinate clause is fronted and marked as topic by yaṅ. (6) yaṅ ñirñāc kyek trey mettey top NML.see holy noble pn laḥ ʔor ḍeh goʔ proh let 3 get ‘Don’t let him get sight of the holy Buddha Metteya.’ (Myazedi)

4.1.5 Questions Due to the mostly narrative nature of the texts found in the inscriptions, not many questions occur in Old Mon. Two markers are apparently used for questions, namely -tā, which is suffixed to the final word of the question, and the sentence final particle yo for polar and content questions, respectively. The former probably originates in a long ā attached to the final word of the sentence without an initial glottal stop, allowing it to merge with the preceding word. Possibly the preceding final consonant was lengthened. (7) ʔabhiprāy tarla gaḥ ci cmat-tā. royal.utterance lord say contr true-q ‘Is it true what you said, Lord?’ (Shwezigon, EB i)

534

Jenny and McCormick

The few instances of content questions show dislocation (frequently fronting) of the interrogative element in cleft structures. No interrogatives in situ seem to occur in the inscriptions, but this may be due to the scarcity of the data as well as to the formal genre of the texts. (8) mu het man tirlaʔ gruṅ yo. what reason rel lord laugh q ‘Why (is it that) you laugh, lord?’ (Shwezigon, EB i)

The analysis of interrogative sentences as syntactically biclausal cleft sentences is supported by the possibility of fronting the subordinate clause. (9) ma tirla pa kirʔim woʔ ci mu het yo. rel lord do nml.smile prox contr what cause q ‘That you smiled (like) this, my Lord, what is the reason?’ (Shwezigon, EB i)

4.1.6 Non-Verbal Predicates Old Mon has a small number of copulas, besides allowing nominals to appear as predicates. The equative copula, das ‘be, become’ is a full lexical verb, exhibiting the normal verbal morphology, like the irrealis sdas ‘will be’ and the nominalized form dirdas ‘existence’. (10) ma das tarlaʔ ḍūṅ ʔarimaddanapūr rel be lord city pn ‘who is the lord of Arimaddanapura’ (Phayre Museum, EB v)

The existential verb nom ‘there is, exist, have’ is in some sentences in Old Mon used like a transitive verb, with the possessor appearing as A argument, and the possessee as P argument. (11) ñaḥ ma ʔan yiryuk dirhat guṁloṅ snom yiryuk dirhat. person rel few vigor strength attr.many irr.exist vigor strength ‘Those who lack vigor and strength shall have vigor and strength.’ (Shwezigon, EB i)

Otherwise, the possessee appears as an oblique argument. (12) ma nom ku yiryās jirku rel exist obl nml.bright body ‘who is of bright appearance’ (Shwezigon, EB i)

Old Mon

535

In negative expressions, the existential nom ‘exist’ is replaced by the negative existential sak ‘there is not, not exist’, sometimes together with the verbal negator kaḥ ‘not be so’. (13) sak het kyāk buddha tarley guṁloṅ sikʔim neg.exist cause holy Buddha my.lord attr.many irr.smile ci kaḥ sak. contr neg neg.exist ‘It does not happen that the Buddhas smile without a reason.’ (Shwezigon, EB i)

4.1.7 Negation Negation is achieved mainly by means of the negative verb kaḥ ‘not to’, which itself can take the irrealis prefix s-, skaḥ ‘will not’. Though kaḥ generally is restricted in its applicability to verbs, it also occurs with numerals, especially moy ‘one’. The expression kaḥ moy, literally ‘not one’ often means ‘many, diverse’. (14) kaḥ moy lṅim kaḥ ḅār lṅim sak goʔ na slros. neg one thousand neg two thousand neg.exist get ins irr.count ‘not one thousand, not two thousand, but uncountable’ (Shwesandaw, EB viii)

The other negator is the negative verb sak ‘not to be, have, happen’, which often occurs pleonastically with kaḥ. This negator appears clause initially, before the subject. (15) tirta toʔ tluṅ sak ḍeḥ kil lop sṅi. heretic pl come neg 3 give enter house ‘The heretics come (but) they don’t let them enter the house.’ (Kubyaukgyi)

4.2 Complex Sentences 4.2.1 Complement Clauses Complement clauses are not overtly marked with a complementizer. They are often preceded by a verb of speaking with the deictic adverbial row woʔ ‘thus, in this manner’, where woʔ has cataphoric function. Complement clauses are often followed by the corresponding anaphoric phrase row goḥ ‘like that’ with the medial demonstrative. The matrix verb may appear both before and after the complement clause.

536

Jenny and McCormick

(16) yuk suṁḅup.tey smāñ row woʔ “ma tirla pa kirʔīm lift joined.hands ask manner prox rel lord do nml.smile woʔ ci prox contr

mu het yo. sak het kyāk what cause q neg.exist cause holy

buddha tarley guṁloṅ sikʔim ci kaḥ sak”, Buddha my.lord attr.many irr.smile contr neg neg.exist row goḥ tarley ʔānan smāñ da manner medl my.lord pn =ask foc ‘He lifted his joined hands and asked (like this), “That you smiled, my Lord, why was that? Without reason, Buddhas do not smile.” Like that did the Lord Ā nanda ask.’ (Shwezigon, EB i)

4.2.2 Attributive Clauses The relativizer in Old Mon is ma [mə] or mun, min [mɤn], occurring clauseinitially in the attributive clause. While ma occurs with all types of relative expressions, mun, min seems to be commonly used with relativized objects and oblique roles. (17) ñaḥ ma ʔan ḍīk person rel few servant ‘those who have few servants’ (Shwezigon, EB i) (18) pun dān ma smiṅ pa merit donation rel king do ‘the acts of merit and charity which the king performed’ (Shwesandaw, EB viii) (19) ḍey sthān ma kaḥ sak kṣīw mun tarley gawaṁpatither in place rel neg neg shake rel lord.1sg pn

croḥ.ptāw ci maintain contr ‘in a place which cannot be shaken, which the lord Thera Gavampati maintains’ (Shwezigon, EB i)

(20) ḍek mun jun ta kyāk servant rel make.over dat holy ‘the servants which he made over to the shrine’ (Myazedi)

Old Mon

537

In some instances, the relativizer occurs directly before the verb, rather than at the beginning of the clause. This word order becomes increasingly common in Middle Mon and is used almost exclusively in modern Literary Mon. (21) smiṅ daddharāja dewatāw guṁloṅ ma siw king pn god attr.many rel attend ‘King Daddharāja whom the gods attend.’ (Kubyaukgyi)

4.2.3 Adverbial Clauses Nominal heads are frequently used to introduce adverbial clauses, like the noun row ‘manner, as, like’ in accord clauses, kāl ‘time’ for temporal clauses, and het ‘cause, reason’ for causal clauses. (22) row [min kyek buddha tirley byādes goḥ] tluṅ das ʔut mleḥ. manner rel holy Buddha my.lord foretell medl come be all exactly ‘All came to pass as the Lord Buddha had foretold.’ (Myagan, EB iii)

Purposive clauses are often introduced by the instrumental marker na. (23) smiṅ dhammarāja myil ʔabhiprāy ʔor sūl lār king pn promulgate royal.utterrance let write deposit girluṅ guṇa jirku ma das na sʔor mñaḥ nml.many favor body rel be ins irr.let person guṁluṅ graṅ. attr.many know ‘King Dhammarāja issued an order to write the sum of his virtues, so that the people would know (about them).’ (Shwesandaw, EB vi)

Conditional clauses are introduced by clause-initial yal [yɔl] ‘if, when’. (24) yal kcit sak ñaḥ ma yām. cond die neg.exist person rel weep ‘If they die, there will be no one to weep for them.’ (Kubyaukgyi)

4.2.4 Coordination and Chaining A frequent means to describe sequences of events is the expression blaḥ goḥ ‘after that, then’, literally ‘end that’. It occurs between sentences or clauses, or at

538

Jenny and McCormick

the beginning of a new paragraph, indicating a temporal link to the preceding text. As linker between clauses, blaḥ ‘end, that which precedes or is finished’ is also used. (25) tarley guraṅ kel ʔin blaḥ tarley cis tūn my.lord caus.know give pn end my.lord descend return jetawanamāwihār pn ‘The Lord informed Indra and then he went back down to the great Jetavana monastery.’ (Shwezigon, EB i) (26) blaḥ goḥ rmeñ klaṁ ḅār cwas turow ma ciṅgin end medl Mon hundred two ten six rel carry.on.shoulder klas slūy klas ti ta goḥ suṁ ku jar copper jar earth pl medl incl obl glik kuchoṁ buṁteṅ, blaḥ goḥ . . . waist.cloth jacket attr.white end medl ‘then 126 Mons, carrying copper vessels and earthen vessels on their shoulders, and wearing white waist cloths and jackets, then . . .’ (Kyanzittha’s Palace, EB ix)

In the early San Sung inscription (Lopburi, 7th century), the connector khāḥc, khāḥp ‘and then, further’ is used to enumerate identical or similar events. The spelling and reading of this form, which does not appear elsewhere, is not clear: (27) khāḥc puṇya cāp ʔinda, ḍek mun jun ta kyāk. coord merit title pn servant rel make.over dat holy ‘And further the merit of cāp Inda, the servant(s) which he donated to the pagoda/temple/Buddha.’ (San Sung)

4.3 Syntax and Pragmatics Fronting of objects (and other constituents) for pragmatic reasons is frequent in Old Mon. The fronted object is optionally marked by the preposed topic marker yaṅ.

539

Old Mon

5

Word Classes

While Shorto distinguishes four major word classes in Old Mon, namely nouns, pronouns, verbs, numerals, and particles, here we suggest the division of lexical classes based on semantic and morpho-syntactic criteria into nominals, verbs, prepositions and adverbs, and particles. 5.1 Nominals Nominals are taken here to include nouns, pronouns, demonstratives and interrogatives, and quantifiers. There is some overlap in the use of demonstratives and interrogatives, especially with adverbs, as some nominals can be used in adverbial function. 5.1.1 Nouns Nouns in Old Mon can be divided into common nouns, referring to persons, objects, or abstract notions (types), and proper names, referring to specific persons or places (tokens). The former can be modified by nouns (possessive or specifying), attributive verbs, demonstratives, and quantifiers, while the latter usually do not take modifiers, except for non-restrictive attributive expressions. Common nouns take postnominal modifiers, which may be attributive verbs or nominals. If the postnominal modifier has a personal referent, it is interpreted as possessive, otherwise descriptive or in some other way associated with the head noun. (28) glik sek thar loincloth color gold ‘a gold-colored loincloth’ (Shwezigon, EB i)

While modifiers normally follow the noun, some precede the modified noun. This is especially true of the interrogative moʔ, mu ‘what’, as in moʔ kāl ‘what time, when’. If a preposition is also present, it appears between the prenominal modifier and the noun. (29) moʔ what

ḍey kāl in time

yo ma kyāk buddha tarley goḥ q rel holy Buddha my.lord medl

culoḥ tuḅaḥ lār kantar dirdas gna.smiṅ reveal show deposit origin nml.be king

540

Jenny and McCormick śrī.tribhuwanādityadhammarāja goḥ culoḥ tuḅaḥ pn medl reveal show ‘At what time was it that the Lord Buddha revealed the origin and coming into existence of King Śrītribhuwanādityadhammarāja?’ (Shwezigon, EB i)

The plural of common nouns can be expressed optionally by the postposed plural marker toʔ or its weak form ta, or the attributive form of the verb gluṅ ‘be many’, guṁluṅ ‘many’, which also occurs after the noun, as in ñaḥ toʔ ‘the people’ and dewatāw guṁluṅ ‘the gods’. The former indicates inclusive or complete plurality, while the latter marks less determined plurality and occurs more frequently in the corpus. As toʔ/ta is infrequent in the texts, the exact difference between the two plural markers is not clear. In Modern Mon, the corresponding forms mark associative plural (tɔʔ) and general plural (həlɒ`ɲ), respectively, the former also marking the plural of personal pronouns. Many proper names, especially those of kings and clergy, are of Indian origin, often Sanskrit in the case of the former, Pali the latter. Place names may be Mon, although the meaning of many is not known. 5.1.2 Pronouns Old Mon has two types of personal pronouns, namely pronouns with number distinction in the first and second persons, and pronouns without number distinction in the second and third persons. According to Shorto (1971: 240), the first person plural pronoun is exclusive of the addressee. As there is no corresponding inclusive form in the corpus, it is not clear whether poy is indeed exclusive, or that there are simply no instances of inclusive use due to the limitations of the corpus. Pronouns with number distinction Table 3.7 Personal pronouns 1

1st 2nd

sg

pl

ʔey beḥ

poy bnaḥ

The following forms can be either singular or plural, the latter marked by the postposed pronominal plural marker tʔeḥ.

541

Old Mon Table 3.8 Personal pronouns 2

2nd 3rd 3rd

sg/pl

pl

kuṁ ʔja ḍeḥ

kuṁ tʔeḥ ʔja tʔeḥ ḍeḥ tʔeḥ

The 3rd person pronoun ḍeḥ ‘he, she, it, they’ frequently occurs resumptively to refer to a full NP subject, even within the same clause. It can be used with both human and non-human referents. Pronouns can be combined with numerals with the addition of the quasiclassifier ñaḥ ‘person’: kuṁ tʔeḥ ñaḥ ḅār ‘you two’, literally ‘you pl person two’. In addition to the above listed pronouns, honorifics and titles can be used in the place of pronouns, such as tirla ‘lord’ for the second person to show reverence. The corresponding first person pronoun is sometimes the honorific ʔey-ḍik, literally ‘I (your) servant’, though the unmarked ʔey ‘I’ is also used in this context.15 The noun jirku ‘body’ can be used as a reflexive pronoun, bound either by the subject of the same clause or the subject (overt or understood) of a preceding clause, frequently the main clause. The latter use is called ‘long-distance reflexive’ (see e.g. Huang 2000: 90ff) and is commonly found in other languages of Southeast Asia. The reflexive jirku ‘body, self’ in the following sentence refers to the ‘son of the queen’, not the ‘king’. (30) kon gna.kyek ma ʔimoʔ rājakumār goḥ mirnas guṇ offspring queen rel name pn medl remember favor ma smiṅ ʔiñcim jirku. rel king feed body ‘The son of the queen, who was called Rajakumar, remembered the favors the king had bestowed on him.’ (Myazedi)

15

An example of a prince talking with the king is found in the text in section 7.

542

Jenny and McCormick

5.1.3 Demonstratives and Interrogatives The demonstratives attested in Old Mon are the proximal woʔ ‘this’ and the medial goḥ ‘that’, which is also used to mark anaphoric reference and topical elements. The distal teʔ and proximal ʔanoʔ ‘here’, which occur in Middle Mon inscriptions and Modern Mon, were probably part of the Old Mon demonstrative system, though they are not attested in the Old Mon corpus. Cognates occur in Nyah Kur and the forms have been reconstructed for Dvāravatī Old Mon (Diffloth 1984: 146f) as *tɛɛʔ and *-nɔɔʔ, respectively. Interrogatives are moʔ, mu ‘what’ and combinations of nouns with the general interrogative marker lhāw ‘which’. The latter is attested only once in the inscriptions and the reading uncertain (the l and ā are barely readable). There is a potentially corresponding form in Modern Mon, lɑ ‘which’, the common interrogative word, both adnominal and adverbial. The uncertain reading, its absence from Middle Mon and Nyah Kur, and the irregular sound correspondence with Modern Mon lɑ, make this item highly speculative. In Old Mon it appears to have pronominal function. (31) lhāw rūp wises traley yo. which appearance distinctive.quality my.lord q ‘Of what sort is the distinctive quality of our Lord?’ (Shwezigon, EB i)

More commonly found is the interrogative mu, moʔ, which can occur as an argument or as a modifier with nouns. In this function it frequently appears in prenominal position, suggesting a cleft structure. The expression mu het ‘why’, literally ‘what cause’ is probably to be analyzed as ‘what (is the) cause’, mu kāl ‘when’, literally ‘what time’ as ‘what (is the) time’. This analysis is further confirmed by the use of the relative marker ma [mə] ‘which, that’ in interrogative expressions. (32) mu het man tirlaʔ gruṅ yo. what reason rel lord laugh q ‘Why (is it that) you laugh, lord?’

5.1.4 Quantifiers The Old Mon numerals are the following.

543

Old Mon Table 3.9 Numerals

moy ḅār pi pan mṣūn, musūn

one two three four five

turow duṁpoḥ, duṁpoh diñcām, dincām dincit cas

six seven eight nine ten

Multiples of ten are formed by combining the units with cwas, a derived form of cas ‘ten’. ‘Five’ after tens is shortened to sūn. Other indigenous terms for numerals are klaṁ ‘hundred’, lṅim ‘thousand’. Higher numbers are Pali or Sanskrit loans: lak, lakṣa ‘ten thousand’, koṭ/koṭi ‘hundred thousand’. The word for ‘zero’ is not attested in the corpus. Old Mon numerals follow common nouns, but precede measure words. The latter include also time expressions such as tṅey ‘sun, day’, kintu ‘moon, month’, and cnām ‘year’, and classifier-like nouns like sāk ‘kind, type’. (33) nor gnis pan yiryās turow sāk ḍeḥ tīt. abl eye.tooth four nml.bright six kind 3 exit ‘From his four eye-teeth light of six colors issued.’ (Shwezigon, EB i)

The quantifier ʔut ‘all’ preceeds the quantified noun. (34) row goḥ da ʔut yiryās turow sāk goḥ ḍeḥ manner medl foc all nml.shine six kind medl 3 ʔār pṅoʔ ʔut dis cas blaḥ. go directly all direction ten end ‘In this manner did all the six-colored rays speed to all ten directions.’ (Myagan, EB iii)

The quantifier jāp ‘every’ precedes the noun it quantifies. The derivate jmāp ‘every’ with the attributive infix suggest a verbal origin of this lexeme. (35) jāp ḍūṅ jāp rās jāp twāñ jāp every city every district every village every

544

Jenny and McCormick

ñaḥ grip person forest ‘every kingdom, every district, every village, all the people of the forest’ (Shwesandaw, EB viii)

In some contexts, jāp ‘every’ functions like a pronoun. (36) pirgin kil jāp tuy offer give every finish ‘reverently offering it to each (of them)’ (Kyanzittha’s Palace, EB ix)

Two verbs can function as quantifiers, namely gloṅ ‘be/have much’ and ʔan ‘be/have little, few’. They occur mainly as quantitative predicates, but can also appear in attributive function, the former in this case formally marked as attributive: guṁloṅ ‘attr.many’. (37) ñaḥ ma ʔan ḍik sgloṅ ḍik, ñaḥ ma ʔan person rel little servant irr. many servant person rel little jlow ci sgloṅ jlow ox contr irr.many ox ‘Those who have few servants will have many servants, those who have few oxen will have many oxen.’ (Shwezigon, EB i)

5.2 Verbs Verbs in Old Mon can be distinguished from other word classes in that they can take the irrealis or future prefix s-, si-. The form si- occurs before the initials /s, k/, s- elsewhere. This prefix is still present in modern Literary Mon, but its function is in the spoken language expressed by other markers.16 It can be taken as the only inflectional affix in Old Mon, as it expresses a combination of tense, aspect, and probably modality. The exact function is not clear, and we use the general label ‘irrealis’ here. Shorto (1971) labels this prefix ‘hypothetical’, while Bauer (1991), based on the use in the frescoes and plaques, argues 16

The frequency with which these prefixed s- forms are found in the nineteenth and early twentieth century Literary Mon texts originating in Siam, where Mons have been bilingual in Thai for many centuries, suggests that some kind of reinforcement was in operation related to the Thai irrealis marker ca£ ʔ, dialectally siʔ. Modern dialects of spoken Mon in Thailand have similar forms.

545

Old Mon

for a label ‘ingressive’ for the occurrences in the glosses accompanying the images on the frescoes and plaques. Verbs are also the most common bases for derivation, though a few instances of derivation from other word classes are found. Verbs can be divided into actions and states, the latter often being translation equivalents of adjectives. Apart from functioning as main predicates, some verbs also occur as (partly) grammaticalized function words, indicating modality, directionality, aspect, and others. The verb kil ‘give’, for example, appears before the main verb with causative function, after the main verb with benefactive function. The literal meaning may still be present, though this is not always the case. (38) ʔor coṅ pnāṅ cḍow sak kul plit. let burn candle lamp neg give extinguished ‘He orders them to light candles and lamps and not let them go out.’ (Shwesandaw, EB viii) (39) ḅas kil sacchu. pluck give fruit ‘He plucks fruit for (him).’ (Ananada) (40) tarley gawaṁpati guraṅ kel smiṅ ʔin. my.lord pn caus.know give king pn ‘The Lord Gavampati informs Indra.’ (Shwezigon, EB i)

The verb duʔol ‘abandon’ has perfective function, indicating that an activity is or will be carried out to the end. There may be a notion of irreversibility involved in this use. (41) ʔut dussīl ma dmas marsa na sīl duṃhic ma all vice rel like stench ins virtue attr. goo rel dmas marmal like fragrance

goḥ medl

gna.smiṅ king

śrī.tribhuwanādityadhammarāja pn

goḥ spliñ duʔol. medl irr.destroy abandon ‘King Śrī Tribhuvanādityadhammarāja shall make all vice dissolve away before right virtue, as a stench before fragrance.’

546

Jenny and McCormick

The verb lār ‘deposit, place’ is mostly used as postverbal aspectual marker, indicating perfective or resultative aspect. It expresses that an act is carried out to its end and the result persists. (42) ciṅ turow lṅim ma dak lār tirṅey elephant six 1000 rel tie deposit freq.day ‘the six thousand elephants that are tied daily’ (Nagayon fresco)

Other verbs that function as secondary verbs are the directionals ʔār ‘go, away from point of reference’, tlūṅ ‘come, towards point of reference’, dmoṅ ‘stay, ongoing state’, tuy ‘finish, completed act’, goʔ ‘get, can, may’, and many more. As Old Mon allows the concatenation of verbs in a single predicate, it is not always evident whether a verb actually has a grammatical function or retains its lexical meaning. Often, both may be the case at the same time. 5.3 Prepositions and Adverbs Old Mon has a number of prepositions that cannot be linked to lexical nouns or verbs: ku ‘oblique’, ta ‘dative’, ḍey ‘in, inside, into’, and na ‘instrumental’. The functions of the first three overlap to some extent, as does the form suṁ ‘to (the presence of), together with’, which is probably connected with the inclusive marker suṁ, as in suṁ ʔut ‘all’. The locative han, hin ‘locative’ is probably shortened from the noun hān ‘place’ and competes with ḍey ‘in, inside, into’, which replaces it in the modern language. (43) woʔ puṇya ju smac.yu.grāḥ ḍek mun jun ta kyāk. prox merit title pn servant rel make.over dat holy ‘This is the merit of ju Smac Yu Grah, the servant(s) which he donated to the pagoda.’ (San Sung)

The translation of the word nor given by Shorto is ‘space, time, bounded by initial term, from, after, than’. It covers the functions of an ablative preposition, both spatial and temporal, and also marks the standard in comparative constructions. (44) cis nor tāwatiṅ descend abl pn ‘He descended from Tāvatiṁsa heaven.’ (Shwezigon, EB i) (45) tarley ma prey nor ʔut sat guṁloṅ my.lord rel noble abl all being attr.many ‘the Lord who is nobler than all (other) beings’ (Shwezigon, EB i)

547

Old Mon

Other words with prepositional and adverbial function are probably nominal: ʔantūl [ʔəntul, ntul] ‘top, upper part, head, on, above’, krow, kindrow [krow, kəndrow] ‘space/time behind, after’, and kinta [kəntaʔ] ‘space/time before, in front of’. (46) kon padumāwatī musun klaṁ ma dmoṅ ʔantul pkāw dirkas offspring pn five hundred rel stay top flower lotus ‘the five hundred sons of Padumāvatī who are seated on lotus flowers’ (Kubyaukgyi)

5.4 Sentence Particles Due to the nature of the inscriptional material, not many discourse particles are found in the Old Mon corpus. Besides the question particles yo and tā for content and polar questions respectively, there is the focus marker da, daʔ [daʔ] derived from the copula das ‘to be, become’, which also marks sentence focus. In addition, particles such as gaṁ [gɔm] ‘more, further’, which in sentence final position means ‘also, too’ and can soften imperatives. A similar function is also expressed by the additive particle kuṁ [kɤm] ‘also, even’, which can also occur in sentence final position with a mirative notion. The particle ci has an emphatic or contrastive force and occurs in clause initial position or after the element(s) contrasted. 6

Semantic Fields

As the Old Mon inscriptions that have survived all contain royal or religious texts (the two not being readily distinguishable), it is evident that the recorded vocabulary contains a large number of lexemes relating to religious and state affairs. A large part of this official lexicon is derived from Pali and Sanskrit. Typical borrowed terms related to Buddhism are the following. dhar ‘law, doctrine’ dān ‘offering, alms’ dhāt ‘relic’ bihār ‘monastery’

Sanskrit dharma ‘law, doctrine’ Pali/Sanskrit dānam ‘offering’ Pali/Sanskrit dhātu ‘element’ Pali vihāra/bihāra ‘monastery, place for retreat’

Terms for state affairs borrowed from Pali and Sanskrit include the following. ʔekarāj ‘king’ rās ‘district’

Pali/Sanskrit ekarājā ‘king, sole ruler’ Sanskrit rāṣṭra ‘realm, people’

548

Jenny and McCormick

Apart from the borrowed words pertaining to the state and religion, Mon also has a number of indigenous terms. Especially interesting among these is the word kyāk, kyek ‘holy being’, which is the normal term used to refer to the Buddha. The use of an indigenous term to denote the Buddha is also found in Burmese (pʰəjà, Old Burmese purhā), but otherwise not widespread in Southeast Asia. Indigenous terms relating to the state are ḍūṅ, ḍuṅ ‘city, area subject to a city, country, kingdom’, an important concept in many states of Southeast Asia, twāñ ‘village’, and smiṅ [smɤŋ] ‘king, prince, ruler, god’. 7

Sample Glossed Text

Excerpt of the Mon face of the Myazedi inscription (1112 AD, Pagan) śrī namo buddhāya śrī sās glory reverence Buddha.dat glory religion ‘Glory! reverence to the Buddha! Glory!’

kyek holy

buddha Buddha

tirley my.lord

kuli ʔār moy lṅim turow klaṁ ḅār cwas diñcām cnām tuy last go one thousand six hundred two ten eight year finish When the religion of the Lord Buddha had lasted for 1628 years ḍey ḍūṅ ʔarimaddanapur woʔ smiṅ śrītribhuwanādityadhammarāj das. in city pn prox king pn be in the city of Arimaddanapura [Pagan], Śrī Tribhuwanādityadhammarāja became (king). gna.kyek smiṅ goḥh moy trilokawaṭaṁsakādewī ʔimoʔ. queen king medl one pn name One of the king’s queens was named Trilokavaṭaṁsakādevī. kon gna.kyek goḥh rāja.kumār ʔimoʔ. offspring queen medl pn name The son of that queen was was named Rājakumār. smiṅ goḥh kil ḍik pi twāñ ku gna.kyek goḥh. king medl give servant three village obl queen medl The king gave three villages of servants to the queen.

Old Mon

549

kāl gna.kyek goḥ cuti ʔār ʔut kiryā gna.kyek goḥh time queen medl die go all apparel queen medl When the queen died, all the queen’s possessions ku ḍik pi twāñ goḥ smiṅ tun kil ku kon gna.kyek obl servant three village medl king return give obl offspring queen and all three of the villages of servants, the king gave to the son of queen, ma ʔimoʔ rājakumār goḥ. rel name pn medl who was named Rājakumār. smiṅ goḥh kmin ḅār cwas diñcām cnām tuy kāl smiṅ goḥ ʔjey king medl reign two ten eight year finish time king medl sick The king reigned for 28 years, and when he became sick, ñan scuti kon gna.kyek ma ʔimoʔ rājakumār goḥ near irr.die offspring queen rel name pn medl approaching death, the son of queen who was named Rājakumār mirnas guṇ ma smiṅ ʔiñcim jirku. remember favor rel king feed body remembered the favors the king who had nourished him. kindaṁ kyek thar moy ʔār tuḅok smiṅ munas row woʔ build holy gold one go offer king inform manner prox He cast a golden Buddha image and went to offer it to the king, telling him, kyek thar woʔ ʔey pa raṁpoʔ tirla holy gold prox 1sg do portion lord “This golden Buddha image I have made on your behalf, my Lord. ḍik pi twāñ ma tirla kil ku ʔey goḥh, servant three village rel lord give obl 1sg medl Those three villages of servants which you gave me, ʔey-ḍik kil ku kyek woʔ. tirla ʔanumodanā da 1sg-servant give obl holy prox lord approve foc I give to his image. May you approve, my Lord.”

550

Jenny and McCormick

kāl goḥ smiṅ sḍik.gap.pumas. thic ʔār, thic ʔār smiṅ pa sādhukār. time medl king irr.pleased good go good go king do approval [Then] the king was very pleased and gave his approval, saying “well done, well done.” kāl goḥh tirla poy mhāther ticār muggaliputtatissather time medl lord 1pl senior.monk venerable pn Then [in the presence of] our lord, the Senior Monk, the venerable Muggaliputtatissather, ticār sumedhapaṇḍit venerable pn the venerable Pandit Sumedha, ticār brahmapāl ticār brahmdiw ticār son ticār venerable pn venerable pn venerable pn venerable the venerable Brahmapāl, the venerable Brahmadeva, the Venerable Son, saṅghasenawarapaṇḍit pn and the venerable Saṅghasenavarapaṇḍita, kinta tirla ta goḥ smiṅ cut ḍek han ti. in.front lord pl medl king put water loc soil in front of these lords, the king poured water on the earth. blaḥ goḥ kon gna.kyek ma ʔimoʔ rājakumār goḥ end medl offspring queen rel name pn medl After that, the son of the queen, who was named Rājakumār, ket kyek thar goḥ thāpanā kandaṁ guoh cloṅ thar woʔ. take holy gold medl enshrine build cave spire gold prox took the golden image and enshrined it, building this cave with the golden spire. kāl busac kyek guoh woʔ kon gna.kyek goḥ ket time dedicate holy cave prox offspring queen medl take When he dedicated this image and cave, the queen’s son brought sakmunalon moy twāñ rapāy moy twāñ ñaḥh.gir.ʔuy moy twāñ pn one village pn one village pn one village from the villages of Sakmunalom, Rapāy, and Ñaḥ Gir ’Uy,

Old Mon

551

ʔut ḍik pi twāñ goḥ. all servant three village medl all the servants.’

8 References Archaeological Survey of Burma. 1919–1937. Epigraphia Birmanica, being lithic and other inscriptions of Burma [5 vols]. Rangoon: Government Printing Press. Assavavirulhakarn, Prapod & Peter Skilling. 2006–2007. New readings of early Indiclanguage inscriptions from Nakhon Si Thammarat, southern Thailand. Bulletin d’études Indiennes 24–25: 11–24. Aung-Thwin, Michael. 2005. Mists of Rāmañña. Honolulu: University of Hawai’i Press. Bauer, Christian. 1992. Thai etymologies. In Mon-Khmer Studies 18–19: 254–261. ———. 1991. Old Mon s-. In Jeremy H.C.S. Davidson, ed. Austroasiatic languages. Essays in honour of H.L. Shorto. London: SOAS, 241–253. ———. 1990. Numismatics, dialectology and the periodization of Old Mon. In MonKhmer Studies 16–17: 155–76. Chit Thein, ed. 1965. Mun kyauksa paungyout [Collection of Mon inscriptions]. Archaeological Division, Ministry of Culture [of Burma]. [In Burmese]. Diffloth, Gérard. 1984. The Dvaravati Old Mon language and Nyah Kur. Bangkok: Chulalongkorn University Printing House. Ferlus, Michel. 1984. Essai de phonétique historique du môn. In Mon-Khmer Studies 12:1–90. Guillon, Emmanuel. 1999. The Mons: a civilization of Southeast Asia. Bangkok: The Siam Society Press. Harvey, G. E. 1925. History of Burma: from the earliest times to 10 March 1824, the beginning of the english conquest. London: Longmans, Green & Co. Huang, Yan. 2000. Anaphora: A cross-linguistics study. Oxford: Oxford University Press. Huffman, Franklin. 1973. Thai and Cambodian—a case of syntactic borrowing? In Journal of the American Oriental Society, 93:4:489–509. Khanittanan, Wilaiwan. 2004. Khmero-Thai: The Great Change in the History of the Thai Language of the Chao Phraya Basin. Paper from the Eleventh Annual Meeting of the Southeast Asian Linguistics Society, Tempe AZ. McCormick, Patrick and Mathias Jenny. 2013. Contact and convergence: the Mon language in Burma and Thailand. In Cahiers de Linguistique Asie Orientale 42:2:77–117. Shorto, Harry L. 1971. A dictionary of the Mon inscriptions from the sixth to the sixteenth centuries. London: Oxford University Press.

552

Jenny and McCormick

——— 1956. Notes on Mon epigraphy. In Bulletin of the School of Oriental and African Studies, University of London. 18:2:344–352. Varasarin, Uraisi. 1988. Les inscriptions mônes decouvertes dans le nord-est de la Thaïlande. In Premier Symposium Franco-Thai: La Thailande des debuts de son histoire jusqu’au xv siecle. Bangkok: Silpakorn University, 196–211. Wales, Horace Geoffrey Quaritch. 1969. Dvaravati, the Earliest Kingdom of Siam (6th to 11th Century AD). London: Quartich. Watson, Hunter. 2014. Careuk phasa Mon boran nai prathet Thai. [Old Mon inscriptions in Thailand]. unpublished manuscript. Bangkok: Silpakorn University, Department of Oriental Languages, Faculty of Archaeology, 1–17 [in Thai]. Wiraprachak, Kongkaew. 1986. Careuk nai prathet Thai lem 2: akson Pallawa akson Mon phuttasatawat 12–21 [Inscriptions of Thailand, vol. 2: Pallava and Mon Alphabets, 12th–21st Centuries of the Buddhist Era]. Bangkok: National Library [in Thai].

chapter 4

Modern Mon1 Mathias Jenny 1 Background Mon (mnw) belongs to the Monic branch of the Austroasiatic languages, together with its close cousin Nyahkur, which is spoken in a few villages in central and northeastern Thailand. With a documented history going back to the sixth century, Mon is one of the earliest written languages of Southeast Asia.2 The Mon apparently occupied a vast area in central Thailand, spreading to northern and northeastern Thailand, to the Malay peninsula and into southern Myanmar at the height of its expansion, a period referred to in the literature as Dvāravatī. Through the loss of sovereign nationhood to the Burmese empire, first probably in the 11th century, then again in the 16th and 17th centuries, Mon has also lost its political and economic status, a status it could never regain while resisting widespread Burmanization, a process that started early in the 15th century and became even stronger after the end of the British colonial era in the mid 20th century. Mon is today spoken by an estimated population of nearly a million, mainly in southern Myanmar, with a number of communities in central Thailand. In Myanmar, Mon language use seems to be stable, in spite of widespread bilingualism in Mon communities. Mon teaching is officially allowed in a number of Mon schools offering an all-Mon curriculum from primary school through high school. In Thailand, the language is in a less secure situation, with most full speakers aged over 60. In most Mon communities in Thailand the language shift to Thai has been all but completed, though there are substantial numbers 1 This chapter is based on different language sources, which are indicated after each example. The main sources used are: cl_ck_ca—short story Chan lon, chan kwaeh, chan awt published in The Future of Mon magazine, Mudon, Mon State, 2011 (written text); KM—interview with two young men from Kanni, Karen State (audio); MKP—Mon story Mi Kon Plem by Ven. Candimacara, Moulmein, Mon State, 2001 (written text), NM_frog01—frog story recorded by a woman in Sangkhlaburi, Thailand (audio), WW2_nc/mn—interview about World War 2 in Monland with an elderly couple and their son, Kawdot, Ye, Mon State (audio). Examples with no source indication are from elicitation or field notes and have been rechecked with native speakers. 2 For a description of Old Mon, see the chapter Old Mon in this volume.

© koninklijke brill nv, leiden, ��5 | doi ��.��63/9789004283572_�10

554

Jenny

of semi-speakers to be found in many places. The Mon varieties of Myanmar and Thailand are largely mutually intelligible, though both groups have taken large numbers of loanwords from their respective national language. Apart from lexical influence, strong influence (of varying degrees) from Burmese and Thai can also be seen in the structure of all Mon dialects. Especially Mon dialects in Thailand often resemble relexified Thai rather than original Mon. In Myanmar, the influence is less obvious, though it affects the syntax as well as semantics of the language. The present grammar sketch is based on Mon dialects spoken in Myanmar and the language used in modern Mon publications, mainly from Myanmar. In spite of its cultural and historical importance, Mon has not received much attention from the linguistic community. Old and Middle Mon inscriptions are available in editions and English translations dating back to the beginning of the 20th century (the Epigraphia Birmanica, Taw Sein & Duroiselle 1919 etc., Blagden 1936), as well as a number of re-editions published more recently in Myanmar. The earliest grammatical descriptions and dictionaries of Mon (Stevens 1896, Haswell 1901, Halliday 1955) date to the late 19th and early 20th century, as do the first anthropological description (Halliday 1917) and translations of legendary and historical texts (e.g. Halliday 1923). Nai Pan Hla published a number of academic and popular articles on Mon from the late 1950s, including editions of Mon inscriptions and literature and a Mon textbook (Nai Pan Hla 1988–89). In the 60s of the 20th century Mon studies made a re-entry into the Western academic circle, thanks to the publication of Shorto’s dictionary of Modern Spoken Mon (1962) followed by a dictionary of Old and Middle Mon (1971). These publications were accompanied by a number of studies in Mon culture and history by Shorto, but it was not until 1999 that a comprehensive monograph was published on Mon culture and history (Guillon 1999). In 2003, South published a comprehensive account of the recent history and present situation of the Mon people in Myanmar. Ferlus (1986) treats the development of Mon phonology and Guillon (1976) gives an account of some syntactic features. A comprehensive grammar of Mon was written by Bauer (1982) but remains unpublished, and comparative work of Mon and Nyahkur was published by Diffloth (1984) and Huffman (1990). Sakomoto compiled an extensive dictionary of a Mon variety in Thailand in two volumes (Sakomoto 1994, 1996), the usefulness of which lies also in its numerous examples in Mon (with Japanese translations). In 1997 Nai Tun Way published the so far most comprehensive English-Mon dictionary, followed in 2000 by his Mon-English dictionary. Jenny (2005) is a monograph on the verb system of Mon, but covers also the phonology and other aspects of the language. The first international Mon Studies Conference, held in Bangkok in 2007, brought together for the

Modern Mon

555

first time international scholars working on all aspects of Mon history, culture, and language. The proceedings of this conference were published in book form in 2011 (McCormick, Jenny & Baker 2011). Apart from these major publications, a number of papers appeared in different journals, and dictionaries and grammatical descriptions were published in Myanmar and Thailand in the respective national language. Mon language and culture has also been the topic of a number of theses completed in universities in Thailand and abroad, most of which remain unpublished. 2 Phonetics/Phonology Mon does not have an official standard variety and the dialects vary greatly, mostly in phonology and vocabulary. The written form dates back to Middle Mon and is therefore only partly adequate to the modern spoken language. The variety presented in this chapter is a super-dialectal form with a constructed phonology based on Mon dialects spoken in southern Myanmar from Kayin State down to Ye Township. The details of the phonology used here are laid out in Jenny (2005).3 2.1 Word/Syllable Structure Native words in modern spoken Mon consist mostly of one syllable, often preceded by a weak presyllable with reduced phonetic possibilities. Every syllable must begin with at least one consonant and can end in at most one consonant. Clusters of two consonants are permissible in syllable initial position, with a restricted set of possible medial consonants, namely /j, r, l/ and /w/. The first consonant in a cluster is always a velar or labial stop, i.e. /k, kh, p, ph/, but not all medials occur with all initials. The complete list of possible initial clusters is given in table 4.1. Additional clusters, such as /mj-/ are found only in Burmese loanwords. The cluster /kj/ merges in some, /khj/ in most dialects with /c/ and /ch/ respectively. /khw/ merges with /hw/ as [f] in some speakers’ pronunciation. Otherwise there is no tendency to simplify initial clusters in Mon. 3 A few changes have been made to the phonology developed in Jenny 2005: The palatal glide is here spelled as /j/ (instead of /y/), in accordance with IPA standard. The sound /ɕ/ used in Jenny 2005 is here analyzed as an allophone of /c/ before the rhyme /-iəʔ/. /ʃ/ has been replaced by /ɕ/, as this is closer to the actual pronunciation in Mon (and in Burmese, which is the most common source of words containing this sound in Mon). The rhymes /ɛŋ/ and /ɛk/ in light register syllables have been changed to /aŋ/ and /ak/ respectively, and /aŋ/ and /ak/ in both registers to /ɒɲ/ and /ɒc/.

556 Table 4.1

kj khj pj phj

Jenny Consonant clusters

kr khr pr phr

kl khl pl phl

kw khw

Only one of a limited set of consonants is found in final position: k, c, t, p, ŋ, ɲ, n, m, j, h, ʔ. Presyllables allow only the neutral vowel /ə/ and a limited set of consonants. No closed presyllables are found in the colloquial speech, and initial clusters are rare and restricted to loanwords. In many dialects, especially in the southern parts of the Mon speaking area, the presyllables tend to be further reduced, so that only /ʔə, hə-/ and /pə-/ are commonly found. In more formal speech and the northern dialects, also /kə-/ and /tə-/ are commonly found in complementary distribution, namely /tə-/ before velar consonants and /kə-/ before all other main syllable onsets. The maximal syllable structure of native Mon words is (cə)C(C)V(C) 2.2 Phoneme Inventory (Consonants and Vowels) and Phonotactics The consonant inventory of Mon includes the following phonemes (table 4.2), some of which are found only in loanwords, mostly from Burmese. One remarkable feature of Mon is that aspiration occurs not only with voiceless stops, but also with nasals and liquids. The only real voiced stops in Mon are the two implosives, /ɗ/ and /ɓ/. Aspirated /hw/ is frequently pronounced as [f], which is absent as a phoneme from Mon. The alveo-palatal fricative /ɕ/ occurs only in loanwords, mostly from Burmese. The pronunciation varies between [ɕ] and [ʃ]. The syllable /ciəʔ/ is variously pronounced as [ɕiəʔ], [ʃiəʔ], [ʨiəʔ] or [cɕiəʔ] in different dialects, both in inherited and borrowed words. The aspirated stops were clusters stop+h in Old Mon, but are real aspirated stops in modern Mon, counting as simple phonemes.4 All final consonants are pronounced unreleased. The velar finals /k/ and /ŋ/ are often palatalized after /ɤ/, so that klɤŋ ‘come’ is pronounced as [klɤʲɲ˺]. In some dialects, especially in the northern parts of the Mon speaking area in Myanmar, palatal finals are 4 In Old Mon, infixes could be inserted between the stop and h, as between other consonants in a cluster (see Bauer this volume on Old Mon). In modern Mon, the aspirated consonants are treated as one phoneme and cannot be split.

557

Modern Mon Table 4.2 Initial consonants

voiceless unaspirated stops voiceless aspirated stops implosives fricatives nasals aspirated nasals trill approximants aspirated approximants

labial

alveolar

palatal

velar

glottal

p ph [pʰ] ɓ

t th [tʰ] ɗ s n hn [n̥ ] r l hl [l̥]

c [cɕ, ʨ] ch [cɕʰ, ʨʰ]

k kh [kʰ]

ʔ

m hm [m̥ ] w hw [w̥ ]

ɕ ɲ hɲ [ɲ̊ ]

ŋ

h

j

pronounced as velar, though with a marked palatal off-glide. In some Thailand varieties, palatal finals are usually realized as dentals, with palatalization of the vowel. The word for ‘elephant’, coɲ, is thus pronounced as [cɕoʲŋ˺] in northern Mon dialects of Myanmar, but as [cɕɤn˺] in Thailand varieties. In some speakers’ pronunciation, stops and the fricative /s/ are partially voiced in some contexts. This seems to be more common in urban speech in people fluent also in Burmese and is probably due to Burmese influence, rather than a retention of original voicedness. The status of /ʔ/ as a consonant phoneme is established by the fact that in some (pragmatically marked) contexts, the vowel before final /ʔ/ can be lengthened, mostly for emphatic or other discourse connected reasons. Final glottal stop is therefore not predictable as always and only occurring after short vowels. Secondly, initial /ʔ/ produces light register, i.e. it counts as voiceless onset. Vocalic onsets would be expected to count as voiced and therefore produce the heavy register (see 2.3.1). Mon has a rich inventory of vowels and diphthongs. Not all vowels occur with all finals or in both registers, and many vowels tend towards diphthongization, for example /ɔ/ is often realized as [ɔo̯ ] or [ɒɔ̯ ], /i/ is pronounced variously as [ɪ], [ɨɪ̯], [ɯɪ̯]. /o/ in open syllables approaches [ʊ] or [ou̯ ], while in closed syllables it is sometimes diphthongized to [oɔ̯ ~ʊɔ̯ ]. Before the palatal finals /c/ and /ɲ/, there is an audible palatal offglide after the vowel, so that phɔc ‘fear’ is pronounced as [pʰɔʲc˺], kwaɲ ‘sweets’ as [kwaʲɲ˺]. The central unrounded vowel /ɤ/ is often pronounced with a palatal off-glide in open syllables and before the glottal finals /h/ and /ʔ/. In closed syllables with final

558

Jenny

Table 4.3 Vowels front

high mid mid-low low Diphthongs

i e ɛ iə eə ɛə

a uə oə ɔə

central

back

ɤ ə

u o ɔ

ɒə ao

ɒ

nasal or stop, it is frequently lowered to [ʌ]. The neutral vowel /ə/ occurs only in presyllables. Its realization approaches [ɐ] in most contexts; before or after palatal consonants it is pronounced higher, close to [ə] or [ɪ]. The vowel qualities vary greatly among the dialects. The following vowel distinctions are taken to be phonemic in Mon (see Table 4.3). Vowel length is not phonemic in Mon, but in most cases depends on the nature of the final consonant. Open syllables are always pronounced long, as usually are syllables ending in a nasal or approximant. Syllables ending in /ʔ/ or /h/ are pronounced with a short vowel or diphthong. Vowels before a final stop are usually about halfway between long and short. Of the diphthongs, /iə/ can be followed by a final consonant, namely /k, ŋ/, or /ʔ/, the latter only after initial /c/. /ao/ and /ɒə/ can be followed by /ʔ/ or /h/. The diphthongs, /ɒə, oə/ and /ao/ occur only in light register syllables, /eə/ and /ɛə/ only in heavy register syllables. 2.3 Suprasegmentals 2.3.1 Registers Mon has two clearly distinct registers (phonation types), called sa ‘light’ and sɒ̀ ɲ ‘heavy’ in Mon, but usually labeled ‘clear’ and ‘breathy’ voice in western descriptions (see Thongkum 1988, 1990 for phonetic studies of registers in Mon). The light register regularly occurs in words with original voiceless initial consonants and the implosive (glottalized) ɓ and ɗ (as well as ʔ and h) and the aspirated nasals and approximants. The heavy register, which is pronounced with breathiness and low pitch, occurs after originally voiced initials. As all voiced stops were devoiced at some point in the history of Mon, the

559

Modern Mon

sole distinction today is in the different registers. Old Mon in modern Mon thus becomes tò with the heavy register, while remains to with the light register. In the transcription used here, the heavy register is indicated by a gravis accent above the main vowel of the syllable, while the light register is unmarked. Presyllables are neutral in terms of register, usually pronounced in the same register as the main syllable. 2.3.2 Prosody The main stress of words and phrases usually falls on the last syllable: hɒəʔ ˈpèh ‘your house’ ɗɔə ˈhɒəʔ sakkəˈrat ‘era, year’ ciəʔ ˈpɤŋ

‘in the house’ ‘eat rice’

If the phrase ends in an auxiliary or particle, the latter is often unstressed: ˈklon mɔ̀ ŋ ‘is working’ ˈhɒm na

‘speak on’

In a few cases stress is used to differentiate meaning. One important case in point is the medial demonstrative kɔ̀ h, which is stressed when used as demonstrative, but unstressed when functioning as topic marker: rɔ̀ ə ˈkɔ̀ h ‘that guy’ ʔəˈpa kɔ̀ h ‘as for father’

There is a marked downdrift at the end of utterances, mostly declarative, but also in some types of questions. The focus marker raʔ, when occurring in sentence final position is usually pronounced with high pitch and often lengthened vowel, i.e. it retains its full stress. A rising intonation at the end of a clause indicates that the utterance is not finished. This is a common means in narratives, building up suspense throughout a paragraph which is concluded by a lower intonation. 2.4 Phonology and Orthography Mon is written in an Indian based Abugida script. The modern orthography goes back to the Middle Mon period, a stage of the language before the general devoicing of stops. Register distinctions are therefore indicated by voicedness distinctions. The Mon consonant inventory consists of 35 letters, arranged in the traditional Indian way in groups of stops and nasals from velar to labial, and two groups of non-classified letters, as laid out in table 4.4. The Indian retroflex consonants are used to spell Indian loanwords as well as inherited words in the

560

Jenny

Table 4.4 Consonants

က kaʔ စ caʔ ဋ taʔ တ taʔ ပ paʔ ယ jɛ̀ʔ ဟ haʔ

ခ khaʔ ဆ chaʔ ဌ thaʔ ထ thaʔ ဖ phaʔ ရ rɛ̀ʔ ဠ laʔ

ဂ kɛ̀ʔ ဇ cɛ̀ʔ ဍ ɗaʔ ဒ tɛ̀ʔ ဗ pɛ̀ʔ လ lɛ̀ʔ ၜ ɓaʔ

ဃ khɛ̀ʔ ၛှ chɛ̀ʔ ဎ thɛ̀ʔ ဓ thɛ̀ʔ ဘ phɛ̀ʔ ဝ wɛ̀ʔ အ ʔaʔ

ၚ ŋɛ̀ʔ ည ɲɛ̀ʔ ဏ naʔ န nɛ̀ʔ မ mɛ̀ʔ သ saʔ ၝ ɓɛ̀ʔ

case of and , for implosive /ɗ/ and light register /n/ respectively. The consonants are given with their transliteration as well as phonological value. Consonants always have an inherent vowel, namely /a/ for light and /ɛ̀/ for heavy register orthographically open syllables. In presyllables, the inherent vowel is realized as /ə/. Other vowel values are obtained by addition of final consonants and vowel signs, which together determine the pronunciation of the syllable. If a consonant is to be pronounced without inherent vowel in syllable final position, it receives the ‘killer’ (həcɒt). Initial clusters are spelled with special combining forms of the second consonant in the cluster. Due to historical development of the language, in some cases the first consonant in a cluster has disappeared, sometimes leaving a trace in the register of the syllable. These combining forms are presented in table 4.5. The doted circle indicates the position of the first consonant in a cluster. Table 4.5 Combining forms of consonants Full form

Combining form

Value

Example

ၚ န မ ယ ရ လ ဝ ဟ

◌၇ ◌ ၞ ◌ ၟ ◌ျ ြ◌ ◌ၠ ◌ွ ◌ ှ

ŋ n m j r l w h

တဲ၇ တၞံ သၟိင် ဖျာ ြခာ ကၠ ကွာ မှာ

ŋu nɔm hmoɲ phja khra klaʔ kwa hma

561

Modern Mon Table 4.6 Vowel signs

အ a အဲ ◌ဲ ay

အာ ◌ာ ā အဴ ◌ဴ au

ဣ ◌ိ i အံ ◌ံ aṁ

ဣဳ ◌ဳ ī အး ◌း aḥ

ဥ ဥ ◌ု ◌ူ u ū အို ◌ ို ui (iu)5

ူ

ဨ ဩ ေ◌, ◌ဵ ေ◌ာ e o

Table 4.6 gives the vowel signs of Mon, both their independent (syllable initial) and combining forms, with the transliteration values. The independent vowel signs are used mostly in Indian loanwords and do not normally occur in inherited vocabulary. The dotted circle in table 4.6 indicates the position of the consonant preceding the respective vowel in pronunciation.5 3

Word Formation

3.1 Compounding Mon makes regular use of compounding to form new words. Generally, a modifying element follows a head. The modifying element can be nominal or verbal. In some cases, compounds have undergone univerbation, resulting in non-transparent formations resembling derivations. One well-known example of this process is ləkjac ‘monk’, from an earlier kəlaʔ-kjac ‘lord of the Sacred Being’. Regular compounds are expressions like hɒəʔ-ɗac ‘bathroom’ (lit. ‘house-water’), kon-rɔ̀ ə ‘assistant, follower’ (lit. ‘son-companion’), həkɒppənan ‘commander’ (lit. ‘cover-war’). Compounds taken from Burmese usually show reversed order of the elements, according to Burmese word formation rules. An example is pɤ̀-kjɤ̀k ‘commander’ from Burmese bo-ʨʰouʔ [bo.ʥouʔ] (lit. ‘forces-cover’), the source of the Mon calque həkɒp-pənan. Other types of nominal compounding consist of a general element indicating the semantic class of the expression, followed by a specifying element. Examples are ɗɤŋ-sem ‘Thailand’ (lit. ‘land-Thai’), sɔt-prèə ‘coconut’ (lit. ‘fruitcoconut’). Phonologically and syntactically these compounds are lexical compounds, rather than phrases consisting of a noun and a modifier. In other cases, 5 There is some discussion about the value and order of elements of the digraph , which occurs since Middle Mon, probably representing a central vowel /ə/ or /ø/.

562

Jenny

none of the two parts of the compound serves to modify or specify the other, they both together make up the meaning of the expression, as in mìʔ-mɛ̀ʔ ‘parents’ (lit. ‘mother-father’) and ɗɤŋ-kwan ‘country, land’ (lit. ‘city-village’). Verbal compounds are frequent in Mon, and the boundary between lexicalized compounds and serialized verbs is not always clear. One example of a lexicalized compound is khjɒt-plɒt ‘certain’ (lit. ‘die-extinguished’). Often, two verbs with similar meanings are put together to modify or intensify the meaning, though the semantic nuances expressed by these compounds are not always clear, as in thiəŋ-həjaʔ ‘think’ (lit. ‘think-think’). The noun cɒt ‘heart, mind’ is used in combination with verbal elements to express feelings. The resulting compounds are used like verbs, as mìp-cɒt ‘happy’ (lit. ‘comfortableheart’), ʔon-cɒt ‘sad, disappointed’ (lit. ‘few-heart’). In other cases cɒt precedes the verb, as in cɒt-klèʔ ‘short tempered’ (lit. ‘heart-short’). In these cases the wordhood of the compound is less clear and the expressions are better seen as phrases, at least syntactically. 3.2 Derivational Affixes Old Mon had an elaborate system of derivational affixes, including pre- and infixes. This process has been all but lost in modern Mon, leaving only one partly productive prefix. Old affixes are still found in lexical forms, but these are lexicalized and often not transparent. Examples are sɛ̀ʔhàn ‘love’ from chan ‘to love’ and həlɒ̀ ɲ ‘quantity’ from klɒ̀ ɲ ‘many’, both by infixation. 3.2.1 Deriving Nouns Nouns can be derived from verbs mainly by lexical compounding. The most commonly used morphemes to derive nouns are pəriəŋ and pərao ‘story, affair’. More formal are hətɔ̀ h mə ‘the state/fact of’ and pəwaʔ mə ‘the fact/act of’, which contain the nominalized forms hətɔ̀ h (from tɔ̀ h ‘be’) and pəwaʔ (from paʔ ‘do’), and the relativizer mə, which is rare in the colloquial language. The colloquial referential marker ɓɛ̀ʔ can be used to form nouns from complex expressions, functioning like a head of an unmarked relative clause in many cases, as in ɓɛ̀ʔ pèh hɒm kɔ̀ h ‘what you said’ (lit. ‘thing-you-say-that’). The form sɛ̀k/sak, which does not occur as free morpheme, derives gerundive-like forms from verbs, meaning ‘thing to V’, as in sɛ̀k-ciəʔ ‘something to eat’, sɛ̀k-ʔa ‘place/ reason to go’ and sɛ̀k-hɒm ‘something to tell’. The prefixes ʔə- and ʔiʔ- are found in some nominalizations of verbs, though this process is not productive. As ʔə- is a common deverbalizing prefix in Burmese, its use in Mon is also spreading with large numbers of Burmese loans. For some speakers, this seems to have led to the prefix being applied to new forms as well, so that one occasionally hears ʔəchan ‘love, darling’ from

563

Modern Mon

the verb chan ‘love’, obviously influenced by Burmese ʔəʨʰiʔ ‘love, darling’ from ʨʰiʔ ‘to love’. The prefix ʔiʔ- is attached also to nouns, most frequently (but not exclusively) denoting female referents. This can be seen in the forms ʔiʔnàj ‘aunt (older than parent)’, which contrasts with ʔənàj ‘uncle (older than parent)’, both probably connected with nàj ‘mister’. 3.2.2 Deriving Verbs Whereas older stages of Mon had ways to turn verbs into nouns, there are no regular processes to derive verbs from other categories. Some roots can be used both as verbs or as nouns, like kəpɔʔ ‘dream’ and chan ‘love’. If the root can not be used verbally, a dummy verb like paʔ ‘do’, nùm ‘exist’ or tɔ̀ h ‘be’ is used to make a non-verbal element predicative. For causative verb derivation see section 5.2.5 below. 3.2.3 Deriving Other Parts of Speech Systematic derivation exists for a small number of lexemes, namely the demonstratives nɔʔ ‘this, proximal’, kɔ̀ h ‘that, medial’, and tɤʔ ‘that, distal’ and the interrogative lɒ ‘which’. These bases are attached to nouns, and can take three different prefixes to form pronouns and adverbs, as shown in table 4.7. In colloquial Mon, the pronominal forms undergo a sound change, whereby the prefix ʔiʔ- is inserted after the initial consonant of the stem, resulting in the palatalization of the initial. Adverbs can be derived from verbs periphrastically, especially in formal Mon. The common construction used is pùə mə=V ‘in a V way’. The word pùə does not occur as a free word but only functions as an adverbial head in the literary or formal language. The adverbial head pɔ̀ t forms language (and other nationality related) adverbs, as in pɔ̀ t ʔaŋkəlòc ‘in English’. Table 4.7 Derivations of demonstratives and interrogatives Base Pronoun

nɔʔ ‘this’

ʔiʔ-nɔʔ ~ ɲɔʔ ‘this one’ Adverb 1 locative ʔə-nɔʔ ‘here’ Adverb 2 manner hə-nɔʔ ‘like this’

kɔ̀h ‘that’

tɤʔ ‘that (over there)’ lɒ ‘which’

ʔiʔ-kɔ̀ h ~ cɔ̀ h ʔiʔ-tɤʔ ~ cɤʔ ‘that one’ ‘that one’ ʔə-kɔ̀ h ‘there’ ʔə-tɤʔ ‘there’ hə-kɔ̀ h ‘like that’ (hə-tɤʔ)6

ʔiʔ-lɒ ~yɒ ‘which one’ ʔə-lɒ ‘where’ hə-lɒ ‘how’

6 The form hətɤʔ is not in common use. A periphrastic construction like pɤ̀m tɤʔ ‘that manner’ is used instead.

564

Jenny

3.3 Reduplication A special kind of derivation is reduplication, which is used in Mon mainly with verbs, though reduplicated forms of other categories occur as well. Reduplicated forms usually have intensifying function, as in ʔa prɔh-prɔh ‘go quickly!’ and ciəʔ klɒ̀ ɲ-klɒ̀ ɲ ‘eat a lot!’. When used attributively, stative verbs are usually reduplicated, often with an intensifying, sometimes also downtoning, effect: hɒəʔ hnòk-hnòk ‘a (rather/very) big house’. In connection with interrogatives, reduplicated verbs express indefinites, as in (1). (1) pɤ̀m lɒ ɗɛh hɒm~hɒm, ʔuə hùʔ pətɛh pùh. manner inter 3 speak~red 1sg neg believe neg ‘However he speaks, I don’t believe him.’

Reduplication in Mon is rarely used to indicate plurality of nouns. One example is given in (2), where the noun plàj ‘young man’ is reduplicated. (2) plàj~plàj pɔn kɔ̀ h kɤ̀ʔ na pɤŋ kwì young.man~red four medl get caus.go cooked.rice wrap nù mì.kon.plɛm. abl pn ‘The four young men got the rice wrapped (in banana leaves) from Mi Kon Plem.’ (MKP)

Also pronouns can be reduplicated to express generic reference, as in (3). (3) ɲèh~ɲèh ʔuə~ʔuə klùə klɤŋ cɒp phɛ̀ə roŋ. person~red 1sg~red gather come arrive monastery asrt ‘Everyone will come and gather at the monastery.’ (MKP)

The numeral mùə ‘one’ is reduplicated to indicate a non-specific referent, as in example (4). (4) mənìh mùə~mùə human one~red ‘someone’

Modern Mon

4

565

Phrase and Clause Structure

4.1 Simple Sentences The basic word order of Mon is SV/AVP, though pragmatically based mutations are frequent in main clauses.7 Retrievable or known arguments are often omitted in a sentence, and topical or focused elements can be fronted. Syntactically, S and A behave similarly in most of constructions, so that one may speak of ‘subject’ in Mon, while ‘object’ as set comprising P and T, or P and G, is more restricted (see Jenny forthc. a.). Example (5) shows a transitive sentence with both arguments appearing as full NPs and unmarked word order. (5) kalaʔ kɔ̀ h mìʔ.kon.plɛm kɔ̀ h pətɛt kəsɔp raʔ. time medl [pn medl] A caus.exit [nml.think.over]P foc ‘Then Mi Kon Plem made her plans.’ (MKP)

Frequently the P argument is fronted to clause-initial position for pragmatic reasons, leading to PAV word order, as in (6). In this case, there is no resumptive pronoun in the postverbal position of the clause. (6) mənìh prèə kɔ̀ h ɲèh kok.khao ciəʔ mìʔ.lìm.cənaj raʔ. [human female medl] P [person]A call eat pn foc ‘The woman was called Mi Lim Canay.’ (MKP)

The word order APV also occurs, as in (7). This can be seen as double fronting, first of the P argument to the clause-initial position, then the A to a pre-clause position. This position, also called ‘left-detached position’, is separated from the main clause by an optional intonation pause and the argument appearing in it can be overtly expressed in the clause by a resumptive pronoun. (7) ʔəpa ʔuə (kɔ̀ h), ʔərɛ̀k ( ɲèh) hùʔ sɤŋ, ɓɔk [Father 1sg medl] A [liquor]P [person]A neg drink [cigar]P ( ɲèh) hùʔ sɤŋ. [person]A neg drink ‘My father, he doesn’t drink, nor does he smoke.’ 7 The constituent order in dependent clauses is in many cases restricted to the basic order.

566

Jenny

Interrogative Sentences Polar questions, both positive and negative, in Mon are formed by addition of the interrogative particle ha at the end of the sentence. The intonation can rise or fall in polar questions, with nuances in pragmatic differences. The same morpheme ha is also used to form alternative questions (‘or’), which in Mon are expressed by two parallel polar questions. In tag questions (‘isn’t it?’) the defective copula siəŋ ‘be so’ is used, formally as (hùʔ) siəŋ ha ‘isn’t it so’, which is colloquially usually shortened to siəŋ or seʔ/sɛʔ. In this case, this particle is normally pronounced with a high or high-rising pitch Examples are given in (8) to (11). (8) pèh ciəʔ pɤŋ toə jaʔ ha. 2 eat cooked.rice finish nsit q ‘Have you eaten?’ (9) pèh ket pɤŋ ha, kwaɲ ha. 2 take cooked.rice q sweets q ‘Do you want rice or a snack?’ (10) ɗɛh ʔa phja, hùʔ siəŋ ha. 3 go market neg be.so q ‘He has gone to the market, hasn’t he?’ (11) pèh hɒm ʔərè sem lèp, seʔ. 2 speak language Thai able tag ‘You speak Thai, right?’

The answer to a polar question normally consists of the main predicative element of the question, optionally followed by a particle like raʔ ‘focus’ or lè ‘emphatic’. For negative answers, the verb of the question is used with the negator hùʔ. If a non-verbal element is the main predicate asked about, the non-verbal negation hùʔ siəŋ ‘it is not so/the case’ is used following the nonverbal predicate. Content questions contain an interrogative word (wh- word) and end in the particle rao (pronounced ro/ru/rɔ in many dialects). This particle is often

Modern Mon

567

dropped in colloquial speech, but is regularly kept in formal or literary style. As in polar questions, the intonation may either rise or fall towards the end of a content question. For the position and use of interrogative forms see section 5.1.2. below. Example (12) illustrates the form of a simple content question. (12) ʔəwao hnòk ʔɒt kɔ̀ h klon mɔ̀ ŋ mùʔ rao. older.brother big all medl do stay what q ‘What does your oldest brother do?’ (KM)

Imperative Sentences For imperatives, the bare verb without any particle can be used, though this sounds often abrupt and rude. The downtoner ɲìʔ ‘a little bit’ is often added at the end of a command for politeness. Other sentence final particles, such as cɔm ‘try’ and lè ‘right now, just, immediately’ add an urgent notion to a command. Mon does not normally use indirect speech acts to express commands. Direct imperatives, usually modified by some downtoning or emphatic particle, are the most common way to utter a command or request. Hortatives end in the sentence particle coʔ ‘let’s’. The S or A argument of all kinds of imperatives may or may not be overtly expressed. Negated commands use the prohibitive marker paʔ (formal ləpaʔ) ‘don’t’ in preverbal position. This is a reduced form of literary Mon , which goes back to the Old Mon prohibitive particle combined with the verb ‘do’. The phonetic reduction resulted in the prohibitive marker becoming homophonous with the verb ‘to do’. No ambiguity arises from this homophony, though, as the prohibitive marker is always followed by a verb and cannot occur on its own, while the verb paʔ ‘do’ is never followed by a verb. 4.2 Complex Sentences There are different means to combine clauses into larger units, resulting in complex sentences. Clauses can be juxtaposed to each other, without formal means marking the connection. This is the normal way of coordinating clauses in Mon. Subordinate clauses can be marked in different ways, or occur without any overt marking, depending on the semantics of the clause linkage, as will be illustrated in sections 4.2.2 to 4.2.4. Shared arguments in conjoined clauses can be omitted, with S/A serving as pivot in some contexts. In (13), only the A of the first clause is overtly expressed, the P is omitted. In the second clause, which consists of the verb only, the roles are reversed. In (14) the S of the first clause does not appear in the second clause, which has no overt arguments.

568

Jenny

The A and S arguments of the second clause are coreferent and can both be omitted. If the S of khjɒt ‘die’ was different from the A of phɔc ‘fear’, it would normally be overtly expressed. (13) ɗɛhi ket Øj hmaʔ Øj ket Øi. 3 take restr take ‘If she wants me, I’ll take her (otherwise I won’t).’ (KM) (14) ɓaŋ-kja kɔ̀ h tao Øi phɔc Øi khjɒt. ship-wind medl burn fear die. ‘The airplane was burning and we were afraid that we would die.’ (WW2_nc/mn)

4.2.1 Coordination and Chaining In narratives and in spontaneous speech, a kind of chaining is common, whereby a linking element toə ‘finish’ or toə-teh ‘and then’ (lit. ‘finish-topic’) occurs between the two clauses, the order of which is always iconic sequential, as in (15). (15) kraoh pèh həlɒ̀ c kətɒ nù kəmot toə kəliəŋ.cao klɤŋ male 2 shake rise abl fire finish return come kəpac hɒəʔ pèh raʔ. side house 2 foc ‘Your husband got up from the fire and came back to your house.’ (MKP)

Frequently the comment of a sentence is repeated in the following sentence as topic, linked to a new comment by toə teh or teh as in (16). (16)

ɗɛh pɤ̀ mɔ̀ ŋ ŋèə kɔ̀ h ɗɔə pəlaŋ kɔ̀ h. pɤ̀ mɔ̀ ŋ ŋèə 3 watch stay frog medl loc bottle medl watch stay frog

kɔ̀ h toə teh, ŋuə klùʔ teh ɗɛh toc. ɗɛh toc hloɲ medl finish top day dark top 3 sleep 3 sleep asleep

Modern Mon

569

. . . ʔa. ɗɛh toc hloɲ ʔa teh go 3 sleep asleep go top ‘They watched the frog in the jar. As they watched the frog and it got dark, they went to bed. They fell asleep. They fell asleep and then . . .’ (NM_frog01)

The inclusive particle sɒm ‘both x and y, all of, the whole of’ is used to coordinate elements of different levels, lexical, phrasal, and clausal. The construction pattern is sɒm X sɒm Y, with X and Y standing for any two constituents on the same syntactic level. Examples (17) to (19) illustrate the use of sɒm with different constituent types. In (19), the construction with sɒm (mə) introduces an adverbial clause. (17) poj kɤ̀ʔ nɛ̀ŋ mənìh cɤŋ mɔ̀ ŋ mɔ̀ ŋ noŋ, 1pl get caus.come human complete stay stay asrt sɒm rɔ̀ ə kwoɲ.kwɒc sɒm ʔəca.wòn sɒm mənìh khep sok. incl friend play.music incl doctor incl human cut hair ‘We got people of every profession, musicians, doctors, and barbers.’ (WW2_nc/mn) (18) mɔ̀ ŋ hmah~hmah sɒm nòɲ tɛ̀h kìʔ sɒm tɛ̀h tɤ̀ŋ pjìʔ stay gratis~red incl waist hit hurt incl hit receive poison

kɔ̀ h tup mɔ̀ ŋ kɒ hə-tɔ̀ h ʔuə raʔ noŋ. medl same stay obl nml-be 1sg foc asrt ‘My situation is like one who does not do anything (wrong) and has to suffer pain and poison.’ (cl_ck_ca)

(19) sɒm mə=pjɤ̀ thaŋ sɒm mə=ɗoŋ nɛ̀ʔ cɒt mə=klèʔ incl rel=hungry thirsty incl rel=tired ins heart rel=short

hùʔ mòc kɒ hwɔh raʔ. neg des give answer foc. ‘Being both hungry and thirsty, and tired, he was angry and did not want to give an answer.’ (MKP)

570

Jenny

4.2.2 Relative Clauses Modern colloquial Mon has all but lost the inherited relativizer, which occurs in formal style as mɛ̀ʔ, mə=. In Old Mon this particle introduced relative clauses, but in Middle Mon it shifted to the immediate pre-verbal position and cliticized to the verb, merging with the attributive form (formed by a labial-nasal prefix or infix in Old Mon and Middle Mon). In the formal language, the relativizer mə= regularly occurs before the verb, not in clause initial position. Verbs can occur in attributive function without overt marker in spoken Mon, which may have facilitated the loss of the relative marker. Spoken Mon is therefore left with no overt relativizer in most relative clauses, and relative clauses are attached after the head noun they modify, often ending in the medial demonstrative kɔ̀ h, which marks an expression as referential rather than predicative. Example (20) illustrates the form of relative clauses. (20) kɤ̀ʔ tɛ̀h ɓɛ̀ʔ kon ŋèə həkaoʔ kla£ j kɔ̀ h. get hit ref offspring frog [body seek medl] REL ‘He got the little frog he was looking for (by coincidence).’ (NM_frog01)

The referential marker ɓɛ̀ʔ can be used as dummy nominal head of a relative expression, as in (21). (21) ɓɛ̀ʔ seh kɔ̀ h ɗɛh pɔn. [ref remain medl] REL 3 shoot ‘The ones that remained, they shot.’ (WW2_nc/mn)

In natural speech, the relativized functions are mainly restricted to core relations (S, A, P) and some oblique functions. More complex relative clauses are generally avoided and juxtaposition of independent clauses is preferred. 4.2.3 Complement Clauses Complement clauses usually occur before the matrix verb and are either unmarked or take the medial demonstrative kɔ̀ h, marking the clause as nonpredicative. It is not always clear whether kɔ̀ h has scope over the whole clause or only over the NP preceding it, as in example (24) below. The alternative order with the complement clause following the matrix is found in more formal style. In this case, the complement clause is sometimes introduced by the verb-turned-subordinator kɛ̀h ‘say; that’. The two types are illustrated in (22) and (23), respectively.

571

Modern Mon (22)

mənìh plàj paʔthəmaʔ kɔ̀ h [human young.man first medl

cùt caɲ kənɛ̀ŋ.kənò bone chicken lie.across

kɛ̀ŋ mɔ̀ ŋ ɗɔə kɔʔ toə khjɒt təŋɛ̀ə mɔ̀ ŋ stuck stay loc throat finish die lie.on.back stay

kɔ̀ h kɤ̀ʔ chɤ ket raʔ. medl] COMP get find take foc ‘She found out that the first of the young men had a chicken bone stuck in his throat and had died, lying on his back.’ (MKP)

(23) ɲèh thiəŋ.həjaʔ kɛ̀h rɔ̀ ə həkaoʔ khjɒt ʔa jaʔ. 3H think say [friend body die go nsit] COMP ‘He thought that his friend was dead.’

Complement clauses can function as S or P of the matrix clause. Examples (22) and (23) above show P-complements, (24) an S-complement. (24) kɤ̀ʔ pɔŋ.phak ʔa mùə.han kom.kaoʔ kao plàj [get associate go together with older.brother young.man

kɔ̀ h. tɔ̀ h hùʔ maǹ raʔ. medl] COMP be neg win foc ‘It is impossible that I will stay together with you, my brother.’ (MKP)

Reported speech is a special kind of complement clause and covers both reported declarative and interrogative utterances. The only difference between direct and indirect reported speech in Mon is that in the former the pronouns remain unchanged, while in the latter personal pronouns are shifted. The reported speech can precede or follow the matrix verb, as seen in (25) to (27). If the reported speech clause follows the matrix, it may optionally be marked by the clause initial subordinator kɛ̀h. There is thus no syntactic difference between reported speech and other kinds of complement clauses. In example (25) with a first person as matrix and subordinate subject, there is no difference between direct and indirect reported speech, while (26) is direct, (27) indirect reported speech.

572

Jenny

(25) ʔuə hɒm ʔuə kəliəŋ mɔ̀ ŋ phɛ̀ə həmɛ̀ə plɔn noŋ. 1sg speak [1sg return stay school Burma again asrt] REP.SP ‘I said that I was going to go back to the Burmese school.’ ‘I said: “I am going back to the Burmese school”.’ (KM) (26) ʔəmè ʔuə hɒm pèh ʔa paʔ həpoj. mother 1sg speak [2 go do novice]REP.SP.drct ‘My mother said: “you go and get ordained as a novice”.’ (KM) (27) ɗɛh hùʔ klɤŋ pùh [3 neg come neg] REP.SP.INDR ‘He said he wouldn’t come.’

ɗɛh hɒm. 3 speak

Free quotations without overt matrix clause often end in the verb kɛ̀h ‘say’ as reportative marker, which is also used as clause-initial subordinator, as seen above. This usage is more common in the modern written style than in colloquial Mon, as in example (28). (28) pɤŋ hwaʔ tom lɔ̀ swak mənìh plàj pɔn cooked.rice curry cook deposit for human young.man four kɔ̀ h lɛ cut lɔ̀ kjìʔ noŋ kɛ̀h. medl add put deposit poison asrt rep ‘She had put poison in the rice and curry for the four young men, it is said.’ (MKP)

4.2.4 Adverbial Clauses Adverbial clauses can express a wide range of functions, and in Mon there is no unified structure associated with this category. The most explicit way to form adverbial clauses is by means of a semantically full head noun, to which a relative clause is added. This construction is common for causal and temporal clauses, as well as for clauses indicating manner. In some cases the head noun has lost its categoriality and undergone semantic bleaching and phonetic reduction. This happens to indigenous lexemes and loan words, such as hɒt ‘reason > because’ from Pali hetu ‘reason, cause’ and kalaʔ ‘time > when, while’ from Pali kāla ‘time’. The latter is usually abbreviated to laʔ, with the first syllable dropped, and translated as ‘when, as’. In this case, the former noun

573

Modern Mon

does not take a preposition anymore, as is possible with less grammaticalized elements. Compare the temporal clauses in (29) and (30) with two different lexemes meaning ‘time’. (29) (*ɗɔə) laʔ cəpan tɒn lɔ̀ k.həplaŋ tɤʔ, cəpan klɒʔ na loc when Japan ascend pn dist Japan cross caus.go

ɗan tɒc tɤʔ. path land dist ‘When the Japanese came up to Lok Hapleng, they crossed that path there.’ (WW2-nc/mn)

(30) plàj paʔthəmaʔ mùə kɔ̀ h ɗɔə ʔəkhoɲ ŋuə plɒt young.man first one medl loc time day extinguished wɛ̀ətìʔ prap klɤŋ hɒəʔ mìʔ.kon.plɛm. at.beginning approach come house pn ‘When it got dark, the first of the young men came to Mi Kon Plem’s house.’ (MKP)

Conditional clauses are either introduced by jɔ̀ (raʔ) ‘if (it is that)’, where the focus marker raʔ still shows traces of its origin as copula (Old Mon ‘to be’). To enforce the conditional function, the clause can end in the topic marker teh (in formal style mə=kɛ̀h ‘saying this’). The clause initial conditional marker is frequently omitted, resulting in only the clause final teh expressing the conditional meaning. Similarly to kɔ̀ h in complement clauses, it is thus a clause final marker that seems to replace an original clause initial subordinator, though its origin as topic marker is very different from a real subordinator. Examples (31) to (33) illustrate the conditional clauses. (31) jɔ̀ ʔuə hùʔ chɤ klɤŋ ɲèh mùə hùʔ chɤ. if 1sg neg find come person one neg find ‘If I hadn’t found it, no one would have found it.’ (WW2_nc/mn) (32) pɤ̀m.praŋ pèh nɔʔ jɔ̀.raʔ appearance 2 prox if

wətɒə tɛm klɤŋ chɤ teh, pn know come find top

574

Jenny ɗɛh chanɗu noŋ, pèh thiəŋ ha. 3 pity asrt 2 think q ‘Do you think Wati would feel pity for you if she knew about your condition and came to see you?’ (cl_ck_ca)

(33)

pèh rao wətɒə, ɗɔə kon.kəmɔʔ mɔ̀ n.sɒc mùʔ nùm mɔ̀ ŋ kɔ̀ h 2 rtop pn loc heart pn what exist stay medl

məkɤ̀ʔ tɛm teh, hi cɔm rɔ̀ ŋ kwìəŋ.mòt ɗɛh mùə.tɔə kɒm. des know top look try look eye 3 once also ‘If you want to know what is in Mon Saik’s heart, Wati, just look into his eyes once.’ (cl_ck_ca)

Purposive clauses usually precede the matrix clause and are introduced by ɲɔ̀ ŋ lit. ‘like, similar’ or swak (hwak) ‘for’, the latter being more formal. Alternatively, purposive clauses remain unmarked, in which case the preverbal auxiliary kɤ̀ʔ is usually present (see below, section 5.2.7). Examples of purposive clauses are given in (34) and (35). In (35), the two types of purposive constructions are used in one complex sentence, once with same subject (regularly introduced by swak), once with different subject (regularly introduced by ɲɔ̀ ŋ). (34) ɲɔ̀ŋ ʔiʔ-tèʔ prèə kɤ̀ʔ pɔɲ-cɒt, kao plàj as nml-younger.sister female get full-heart older.brother young.man nɔʔ kəlaŋ ciəʔ ʔərè ʔiʔ-tèʔ prèə noŋ. prox listen eat language nml-younger.sibling female asrt ‘I will obey everything you say, so that you can agree (with my offer), my dear sister.’ (MKP) (35) swak kɤ̀ʔ phap.pəciəʔ kaoʔ mənìh pla£ j paʔthəmaʔ kɔ̀ h, for get caus.eat dat human young.man first medl

Modern Mon

575

mənìh həlɒ̀ ɲ ɲɔ̀ŋ kwɤ̀ʔ ɲàt, kɒ mɔ̀ ŋ ɗɔə human attr.many as neg.get see give stay loc kənao ɗɔə hətɔ̀ m toc toə kɒ ciəʔ raʔ. mosquito.net loc room sleep finish give eat foc. ‘She let the first of the young men stay under the mosquito net in the bedroom and let him eat there, in order that that she could entertain him so that others would not see him.’ (MKP)

Concessive clauses precede the matrix and are introduced by ɓɒn (raʔ) and usually end in kɒm lɛ. Clauses can be introduced by prepositions to express different adverbial functions, including temporal and manner. No nominalization of the clause is necessary in these cases, as shown in (36) and (37). (36) ɗɔə kɒ hətɛt ɗɒp kɔ̀ h hətɛt ɗɒp kwɤ̀ʔ. loc obl caus.exit head medl caus.exit head neg.get ‘When he tried to pull out the head, he couldn’t pull it out.’ (NM_frog01) (37) rèə sac wùʔ mìʔ.kon.plɛm kɔ̀ h sɒm kɒ hùʔ manner type prox pn medl incl obl neg hɲa.həɲɛ̀ʔ kɛ̀h tao raʔ. act.modestly say stay foc ‘Mi Kon Plem spoke like this without having any modesty.’ (MKP)

4.3 Syntax and Pragmatics The structure of Mon sentences is to a significant extent determined by pragmatics. Sentences are usually built according to the topic-comment structure, and the topical (or non-predicative) part of a sentence usually receives the medial demonstrative kɔ̀ h as topic marker. The main predicative part or focus of the sentence is marked by postposed raʔ, which can mark narrow or broad focus scope, including sentence focus. The focused part of a clause can be fronted, in which case the marker raʔ is quasi obligatory, and kɔ̀ h on the presupposed or topical part of the clause may or may not be present. The structure XP kɔ̀ h XP raʔ is especially frequent where the predicative part of the utterance is non-verbal. In this case the focus marker raʔ regularly marks the predicative part. In the written language more regularly than in spoken Mon, raʔ is used to

576

Jenny

mark the end of an utterance (see Jenny 2006 for a more detailed account of the functions of raʔ). In terms of information structure, the demonstrative kɔ̀ h is frequently used to mark topical elements that have been introduced to the discourse and are active or accessible, or have cataphoric importance. In this function, kɔ̀ h can occur after other demonstratives; it is not stressed, and regularly pronounced in low pitch. It can be lengthened to create an intonational slow down effect, and it may be followed by an intonational pause. Other particles used to mark topics are lɛ (colloquially sometimes ʔiʔ), which often occurs after reactivated or additional topics, and teh, which has stronger contrastive effect and is also used as clause final conditional marker and sequential linker between clauses. The content question sentence particle rao can be used to mark topics, especially resumptive topics in negated expressions. Examples (38) to (40) illustrate the use of the different pragmatic particles. (38) hmoɲ ʔələwìʔ kɔ̀h [king pn medl] TOPIC

nùm kɒ krɒə.cɔ̀ h hnòk tao [exist obl glory big stay

pùə.mə.lòn raʔ. exceedingly foc] COMMENT ‘This king Alawi was of great glory.’ (MKP) (39) ʔe, ɗɒc klɤŋ la. ɗɒc klɤŋ la kɔ̀h toə teh, filler ride come donkey ride come donkey medl finish top la kɔ̀h thɒʔ thɒʔ hə-ʔɒt. donkey medl discard discard adv-all ‘They came riding on donkeys. When they came riding on donkeys, they just got rid of those donkeys.’ (WW2_nc/mn) (40) rɔ̀ p tɛ̀h naɲ chuʔ. rɔ̀ p tɛ̀h naɲ chuʔ teh catch hit branch wood catch hit branch wood top həjaʔ tɛ̀h naɲ chuʔ. tɔ̀ h mɔ̀ ŋ naɲ chuʔ rao hùʔ siəŋ. think hit branch wood be stay branch wood rtop neg be.so ‘He got hold of a tree branch. As he got hold of a tree branch, he thought it was a tree branch. But it wasn’t a tree branch.’ (NM_frog01)

Modern Mon

5

577

Word Classes

5.1 Nominals Nominals are a distinct word class in Mon, showing category syntactic behavior distinguishing them from verbs and other categories. The phonological shape of nouns is not a good indicator of category membership, though some regularities are apparent (see Bauer 1982: 300, 367), namely word shapes ʔiʔ__ and ʔə__ are restricted to nouns. The latter is problematic in many (especially southern) dialects, where the presyllables kə-, tə- and ʔə- merged into ʔə-, resulting in a substantial number of verbs beginning with ʔə-. Syntactically, nouns can be followed by a possessive expression (noun or pronoun) or a determiner (demonstrative, interrogative), and only nouns can be directly associated with a numeral, which either precedes or follows them (see 5.1.3 below). 5.1.1 Common Nouns Common nouns in Mon can be of different morphological shapes. Besides simple monomorphemic nouns, like chuʔ ‘wood’, hɒəʔ ‘house’ and kon ‘son, daughter’, there are morphologically derived nouns, such as kəlon ‘work’ and həjɒ̀ ɲ ‘life’, by infixation from klon ‘do, make’ and kjɒ̀ ɲ ‘live’ respectively. The phonological shape of a noun is not always a firm indicator of its morphological complexity. Monosyllabic nouns are always monomorphemic, but sesquisyllabic nouns may be derived (bimorphemic) or simple (monomorphemic). Examples of the latter are həmɛ̀ə ‘Burma, Burmese’, hətaj ‘rabbit’, and həlɛ̀ʔ ‘thorn’. Loans from other languages are generally well integrated into the phonological system of Mon, such as cɒt ‘heart, mind’ from Pali citta and mənìh ‘human, man’ from Pali manussa, phja ‘market’ ultimately from Persian baz9ar (Old Mon) and ka ‘car’ from English car (via Burmese kà). Polysyllabic nouns are mostly loans, such as p£aɲsəkɤk ‘movie, cinema’, ultimately from English bioscope, and mìjətha ‘train, railway’ from Burmese mì-jətʰà. Number is not obligatorily indicated in Mon. The postpositional plural particle tɔʔ can be added to mark a definite or complete plurality, as in rɔ̀ ə tɔʔ ‘the friends’. The plural marker is obligatory only in connection with a demonstrative (other than kɔ̀ h), as the demonstratives nɔʔ and tɤʔ have a singularizing effect. The word rɔ̀ ə is thus underspecified in terms of number and definiteness and can be translated as ‘a/the friend, (the) friends’, but rɔ̀ ə nɔʔ is only ‘this friend’. For ‘these friends’, the plural marker must be added: rɔ̀ ə tɔʔ nɔʔ. The combination tɔʔ nɔʔ is usually shortened to tənɔʔ ‘these’. The plural marker also occurs as nominalized form ʔiʔ-tɔʔ, often pronounced cɔʔ and used as 2nd person plural pronoun. The referential marker ɓɛ̀ʔ can precede any noun, often pleonastically, to emphasize the fact that it refers to a concrete or known entity. This function is

578

Jenny

partly shared with the medial demonstrative kɔ̀ h, which is also used to mark a constituent as topical or presupposed (see 4.3 and 5.1.5). The use of ɓɛ̀ʔ with human referents, including personal names, especially of high social status, is considered rude by many speakers, but occurs rather frequently in natural dialogue. Numerals always follow common nouns, as in rɔ̀ ə ɓa ‘two friends’. No classifier is needed to combine common nouns with numerals, though in some cases a classifier may be used (see 5.1.3 below). The numeral mùə ‘one’ is frequently used like an indefinite article following a noun. In this case it is unstressed and usually pronounced in a low pitch. As seen above, the medial demonstrative kɔ̀ h has functions similar to a definite article, indicating that the referent of a noun is accessible. 5.1.2 Pronouns Mon has a rather straightforward system of personal pronouns, with only little in terms of politeness distinctions. A separate plural form exists only for the first person, other plural pronouns are formed by adding the plural marker tɔʔ to the singular form. While the first person pronoun ʔuə is commonly used, second person pronouns are usually avoided in polite speech and replaced by kinship or professional terms. Third person pronouns can be freely omitted if the referent is accessible or retrievable, though they may also occur pleonastically together with a full NP, as seen in example (41). (41) ʔəpa ɗɛh jəmùʔ kjɛ.làj, ʔəmè ɗɛh jəmùʔ tan.ɲòʔ. father 3 name pn mother 3 name pn ‘Father’s name is Kyae Lay, mother’s name is Than Nyunt.’ (KM)

The common form ɗɛh is usually replaced by the more polite ɲèh (lit. ‘person’) in educated or formal speech when referring to humans, especially of some status, but it is not considered impolite in informal conversation by most speakers. The use of ɗɛh for non-human referents is rare in the spoken language, zeroanaphora or full NPs being preferred in these cases. The following table shows the commonly used pronouns in colloquial and formal Mon. Special forms are used in the first person when addressing monks, namely ʔuə-ɗoc lit. ‘I servant’ and poj-ɗoc lit. ‘we servants’. The derogatory second person form ɓɛ̀ʔ is identical to the referential marker, which also serves as dummy nominal head. Its use is widespread, though considered rude by many speakers. The formal pronoun mənɛ̀h is restricted to the literary language and formal contexts, such as public

579

Modern Mon Table 4.8 Basic personal pronouns

1sg 1pl 2 3

ʔuə poj ɓɛ̀ʔ ɗɛh

pèh ɲèh

mənɛ̀h

speeches. A polite form for the second person which is hardly used in the colloquial language is kəlaʔ-ʔuə ‘my lord’ or kəlaʔ-poj ‘our lord’. A special pronoun (or pronominal use) is the word həkaoʔ ‘body’, which is also used as reflexive pronoun. This pronominal həkaoʔ can be used in subject, object, or possessor function and can refer to a non-overtly expressed antecedent. The antecedent usually is the speaker or cognizer of the situation described. This phenomenon has been described as ‘long distance reflexives’ in the linguistic literature, and is similar to logophoric pronouns (see Huang 2000: 172ff). The following examples illustrate different uses of the reflexive həkaoʔ, as subject and possessor in (42) and as object in (43). (42) hə-tɔ̀ h raʔ həkaoʔ kəpɔʔ ɲàt kɤ̀ʔ ciəʔ kwaɲ kwì kɔ̀ h nml-be foc body dream see get eat sweets wrap medl cəphɔn kɔ̀ h lèə həɓah kɒ kəlaʔ hɒəʔ həkaoʔ raʔ. while medl tell show obl master house body foc ‘As a matter of fact, she told her husband that she had dreamed that she ate a snack wrapped in banana leaves.’ (MKP) (43) hə-tɔ̀ h məlɤ̀ klɔt chan mɔ̀ ŋ həkaoʔ kɔ̀ h paʔ pətɔm nù nml-be pn steal love stay body medl do beginning abl

ʔəkhoɲ lɒ lɛ mɔ̀ n.sɒc khjoɲ həjaʔ ket hùʔ tɛ̀h raʔ. time inter add pn consider think take neg hit foc ‘Mon Sak had no idea since when Maloe had been in love with him.’ (cl_ck_ca)

580

Jenny

Reciprocal events are expressed by the complex form ɲèh-həkɔʔ, made up from ɲèh ‘person’ and həkɔʔ ‘group, friend, companion’. This pronoun never occurs in S or A function. Interrogative pronouns include mùʔ ‘what’ and ɲèh-kɔ̀ h ‘who’. The latter is a lexicalized phrase originally meaning ‘this (known) person’, which came to be used as interrogative by a rare semantic change. In present day Mon it is prosodically different from ɲèh kɔ̀ h ‘that one (we have been talking about)’ and in some varieties, especially in Thailand, the interrogative has acquired a distinct phonetic form, namely ɲèh-kòə (sometimes shortened to ʔiʔkòə) ‘who’ versus ɲèh kɔ̀ h ‘that one’. Syntactically ɲèh-kɔ̀ h functions like a normal noun, occurring in preverbal position as S or A argument, post-verbal when P or oblique, and it combines with prepositions. On the other hand, the position of mùʔ is less fixed. It frequently occurs clause initially, sometimes reduplicated, also when functioning as P, but not as oblique argument with a preposition, in which case it regularly follows the verb. Frequently it occurs twice in the same clause, once clause initially, and once in situ, i. e. after the verb. Both mùʔ and ɲèh-kɔ̀ h can occur with reduplicated verbs to achieve an indefinite reading (see 3.3. above). For negative indefinites, mùʔ is used with negated verbs more regularly than ɲèh-kɔ̀ h. The notion ‘no one’ is usually expressed by ɲèh mùə ‘one person’ together with a negated verb. Examples (44) to (46) illustrate the use of the interrogative pronouns, example (47) the indefinite use of the negated interrogative. (44) lòc nɔʔ kɔ̀ h ɲèh.kɔ̀h khju lɔ̀ rao. text prox medl who write deposit q ‘Who wrote this letter?’ (45) paʔ mùʔ~mùʔ rao, ɗɔə càt kɔ̀ h. do what~red q loc performance medl ‘What do you do in this theater?’ (KM) (46) ʔaŋkəlòc kɔ̀ h mùʔ ɗɒc klɤŋ mùʔ. English medl what ride come what ‘What did the English ride when they came here?’ (WW2_nc/mn)

581

Modern Mon

(47) hùʔ tɛm, mùʔ hùʔ tɛm, krɤk neg know what neg know Chinese ‘They didn’t know it, they didn’t know anything, the Chinese.’ (WW2_nc/mn)

5.1.3 Numerals, Measure Words, and Quantifiers Mon numerals follow the decimal system, with amounts higher than ten composed regularly of lower numerals. The only exception to the regularity is that in compounds with pəsɔn ‘five’, pəsɔn loses its presyllable and the combining form coh ‘-ty’ appears as ʔiʔ. Indigenous expressions exist for exponents of ten up to one thousand. 10,000 and 100,000 are Indian loans, the latter being the highest basic numeral in common use. The normal way to say one million is cɔh kɒt lit. ‘ten hundred-thousands’. Table 4.9 gives the basic numerals, including Mon script, together with some compound forms. Most common nouns in Mon do not require the use of a classifier in connection with numerals. The one exception is həkaoʔ ‘body’, which is regularly used when counting members of the Buddhist clergy such as həmɒə ‘monk’ and həpoj ‘novice’. The omission of the classifier in this context is deemed ungrammatical, though expressions like həpoj ɓa ‘two novices’ are heard in colloquial style. Sporadically, some objects are used with mɛ̀ʔ ‘seed, small round thing’ as general classifier, and plants take nɔm ‘tree’ as classifier. If a classifier is used, the word order is always N NUM CLF, as in hɒəʔ mùə mɛ̀ʔ ‘one house’. If numerals are used with personal pronouns, the quasi-classifier ɲèh is obligatorily Table 4.9 Numerals

sunɲaʔ mùə ɓa pɒəʔ pɔn pəsɔn kərao həpɔh həcam həcit cɔh

0 1 2 3 4 5 6 7 8 9 10

၀ ၁ ၂ ၃ ၄ ၅ ၆ ၇ ၈ ၉ ၁၀

cɔh-mùə cɔh-sɔn ɓa-coh ɓa-coh-pɒəʔ klɔm pɔn-klɔm ŋìm kərao-ŋìm-həcam-klɔm həcit-ŋìm-pəsɔn-klɔm-pɔn-ʔiʔ-sɔn lak kɒt

11 15 20 23 100 400 1000 6,800 9,545 10,000 100,000

၁၁ ၁၅ ၂၀ ၂၃ ၁၀၀ ၄၀၀ ၁၀၀၀ ၆၈၀၀ ၉၅၄၅ ၁၀၀၀၀ ၁၀၀၀၀၀

582

Jenny

used, as in poj ɲèh ɓa ‘we two, the two of us’. Its position is always between pronoun and numeral, unlike real classifiers, which always follow the numeral. Measure words, which include time expressions such as ŋuə ‘day’, hətaoʔ ‘month’, and hnam ‘year’, differ from common nouns in that they follow numerals, rather than preceding them, as in mùə ŋuə ‘one day’, ɓa hətaoʔ ‘two months’, pɒəʔ hnam ‘three years’. Some lexemes can function either as common noun or as measure word, such as pəlaŋ ‘bottle’ and khok ‘cup, glass’. Measure words can be nominal (hnam ‘year’) or derived nominals (hərìp ‘arrow’s flight, instance’ from krìp ‘run’). In some cases non-derived verbal bases appear as measure words, such as mùə kok ‘distance of one call’. Other quantifiers are rɤ̀h or ʔərɤ̀h ‘every, each’ (from the verb rɤ̀h ‘count’; in formal Mon kəmɛ̀p/cəmɛ̀p), which precedes the noun, and təŋɤ̀ ‘some, various’, which follows the noun or occurs as nominal form on its own. Both can be reduplicated to reinforce the meaning. The form rɤ̀h ‘every’ can also precede a verbal expression, giving a reading ‘each time V happens’. The interrogative quantifier mùʔ-ciʔ ‘how much, how many’ functions syntactically like a numeral, following common nouns and preceding measure words. An instance of the latter is also mùʔ-ciʔ kəlɔ̀ ʔ ‘how long’, lit. ‘how much duration’. The verb klɒ̀ ɲ ‘be much, numerous’ combines with nouns in two ways. It can follow the noun, as in hloə klɒ̀ ɲ ‘much money’, or it can function as a predicate with an oblique object marked by kɒ ‘with, by’, as in klɒ̀ ɲ kɒ ŋuə ‘many days’. The verb ʔɒt ‘be all’ can be used as quantifier in its base form or as derived (adverbial) həʔɒt ‘all’. The former can function as a quasi-transitive verb and take the quantified noun as P argument, while the latter always occurs after the noun. To express wholeness or completeness, Mon uses the verb klɒh ‘penetrate’, as in klɒh ŋuə ‘the whole day’, or the form ʔəlùm (from Burmese lòun ‘round, whole’) in combination with the numeral mùə ‘one’, as in ʔəlùm mùə kwan ‘the whole village’. 5.1.4 Names and Terms of Address Mon traditionally don’t have family names. The normal form of address is by first name, usually preceded by an honorific like nàj ‘mister’, mìʔ ‘madam’ or mɛ̀m ‘young man, boy’. Mon names can be either made up of inherited vocabulary, like kao-mɔ̀ n ‘Mon flower’, used both for males and females, or of Indian or Burmese loans. Many names contain frequently recurring elements like rɔ̀ t ‘jewel’ (Pali ratana), ʔɔŋ ‘win’ (from Burmese ʔaun), kao ‘flower’, mɔ̀ n ‘Mon’ and others. Professionals are usually addressed as ʔəca ‘teacher’, non-intimate acquaintances as rɔ̀ ə ‘friend’. Kinship terms are used in many contexts and are by no means restricted to biological relatives. It is in most cases polite to address another person as kao ‘older brother’ ʔiʔci ‘aunt (younger than parents)’ or kon

Modern Mon

583

‘son, daughter’. Also expressions like həjùʔ ‘old person’ (from pjùʔ ‘be old of age’) are considered polite and commonly heard. These forms of address frequently also replace personal pronouns, especially in the second person. 5.1.5 Nominal Modifiers Nouns can be modified by different elements, including determiners (demonstratives, interrogatives), verbs, and nouns. Modifiers regularly follow the noun they modify. This is true for the demonstratives nɔʔ ‘this; proximal’, kɔ̀ h ‘this, that; medial’, and tɤʔ ‘that; distal’, as well as for the interrogative lɒ ‘inter’ and attributively used verbs, which can occur in the base form or in lexicalized attributive form (klɒ̀ ɲ ‘be much’ ~ həlɒ̀ ɲ ‘much’). Nominal and pronominal possessors follow the possessum in most cases. An exception are interrogative modifiers, such as mùʔ ‘what (kind of)’ and ɲèh-kɔ̀ h ‘whose’, which frequently precede the noun they modify, as in mùʔ hwaʔ ‘what kind of curry’ and ɲèh-kɔ̀ h kon ‘whose child’. Class nouns or class terms indicate the category to which a nominal expression belongs. The class term nɔm ‘tree, plant, trunk’ is used with plant names, kao ‘flower’ with flowers, kaʔ ‘fish’ with fishes, ɗɤŋ ‘city, town, country’ with names of towns and countries, etc. The combination of a class term and a specific name can be seen as constructions where the specific term modifies the class term. These class terms are to be distinguished from classifiers, although the same lexemes are used in some cases (see section 5.1.3 above). 5.2 Verbs Verbs in Mon can be defined syntactically as lexemes that can be preceded by personal pronouns and can be negated by hùʔ ‘not’. A verb can be the sole constituent of a complete utterance. Translation equivalents of adjectives form a subcategory of verbs in Mon with some differences from other stative verbs. Among these differences is the possibility of comparison with the ablative preposition nù between verb and standard. The pattern of comparative constructions is illustrated in example (48). (48) hɒəʔ ɗɛh hnòk nù hɒəʔ ʔuə. house 3 big abl house 1sg ‘His house is bigger than mine.’

Active verbs that describe a volitional act do not necessarily include the result of the activity, which can be expressed by a second verb in a core serialized structure. So the verb ciəʔ ‘eat’ does not imply that the food was actually eaten,

584

Jenny

but only that the A tried to eat something. The result can be negated, as in ciəʔ hùʔ ceh ‘eat not go down’, or ‘unable to eat’. Similarly, the causative həcɒt ‘kill’ does not imply that the P actually died, so the expression həcɒt hùʔ khjɒt ‘(she) killed (him but he) didn’t die’ is perfectly grammatical and not semantically anomalous in Mon. For a more complete survey of the verb system of Mon, see Jenny 2005. 5.2.1 Intransitive Verbs In intransitive clauses, the S argument generally precedes the verb. In a few cases, the word order VS is common or compulsory, especially in presentational expressions (see 5.2.6 below) and constructions where the S is actually predicative rather than the verb itself, which is topical and presupposed. Compare the following examples with the intransitive verb seh ‘be left, remain’. Sentence (49) makes a statement about the ‘rice’, while (50) is about the ‘left overs’. In the latter case, the presupposition is that something is left, the assertion is that this thing is ‘rice’, in contrast to ‘curry’. (49) pɤŋ seh mɔ̀ ŋ ɲìʔ thɔ̀ raʔ. cooked.rice remain stay little only foc ‘There’s a little bit of rice left.’ (50) seh mɔ̀ ŋ chaʔ pɤŋ, hwaʔ ʔɒt ʔa jaʔ. remain stay just cooked.rice curry all go nsit ‘Only rice is left, the curry is all eaten up.’

The distinction between intransitive and transitive verbs in Mon is not always clear-cut. Directed motion verbs (see 5.2.4) can take a noun referring to a location as direct argument, as in ʔa phja ‘go to the market’, tɒn phɛ̀ə ‘go (up) to school’, and cao hɒəʔ ‘return home’. A subcategory of the transitive verbs form stative quality verbs, which in many cases correspond to adjectives in other languages. They are distinguished from other stative verbs in that they enter comparative constructions of the type SUBJECT QUALITY COMPARATIVE-MARKER STANDARD. The ablative marker nù ‘from, since’ is regularly used as comparative marker. The superlative is expressed by ʔɒt ‘all’ after the quality verb (see Jenny forthc. b)

Modern Mon

585

5.2.2 Transitive Verbs In transitive clauses, the unmarked word order is AVP, though PAV and APV also occur. There are no instances of PVA. If the P argument is fronted, it usually receives the medial demonstrative kɔ̀ h as topic marker. Alternatively or additionally, the additive marker lɛ can be added after the fronted P (see 4.1 above). Secondary verbs like directionals, aspectuals, and some modals occur between V and P, while others follow the P. In some cases there is a difference in meaning between A V V2 P and A V P V2, as illustrated in examples (51) and (52). (51) ɗɛh rɔ̀p kɤ̀ʔ kaʔ. 3 catch get fish ‘He caught a fish.’ (‘He succeeded in catching a fish.’) (52) ɗɛh rɔ̀p kaʔ kɤ̀ʔ. 3 catch fish get ‘He can catch fish.’ (‘It is possible/allowed for him to catch fish.’)

A number of verbs obligatorily take a generic object if no specific object is expressed or given in the context. This generic object is usually fixed, as in ciəʔ pɤŋ ‘eat’ (lit. ‘eat cooked rice’), sɤŋ ɗac ‘drink’ (lit. ‘drink water’), həton lòc ‘learn’ (lit. ‘learn text’), and pɔ̀ h lòc ‘read’ (lit. ‘read text’). No object is expressed if it is already known, mentioned or visible, which is true also for the generic object after its first occurrence. Generic objects are also used with verbs that don’t readily take other (specific) objects, such as hum ɗac ‘bathe’ (lit. ‘bathe water’). 5.2.3 Ditransitive Verbs Mon does not have many ditransitive verbs that take two unmarked objects. The only clear example is kɒ ‘give’, which occurs in the pattern A kɒ G T, that is, the indirect object (recipient) precedes the direct object (theme), as seen in (53). (53) ʔəmè kɒ kon kwaɲ rɤ̀h ŋuə. mother give offspring sweets every day ‘The mother gives her child sweets everyday.’

586

Jenny

The verb kɒ as V2 has benefactive meaning, turning transitive verbal predicates into ditransitive predicates. The order of arguments is normally A V kɒ G T, though some speakers apparently prefer A V T kɒ G. In the latter construction, kɒ is better analyzed as preposition ‘to, for’, as seen above. The difference in meaning between examples (54) and (55) is minimal. (54) mìʔ ràn kɒ kon kwaɲ. mother buy give offspring sweets ‘The mother bought her child sweets.’ (55) mìʔ ràn kwaɲ kɒ kon. mother buy sweets obl offspring ‘The mother bought sweets for her child.’

In other cases, only the benefactive construction is accepted, at least in Myanmar Mon varieties. Example (56) shows the preferred pattern in Thailand Mon, while in Myanmar Mon only (57) is considered grammatical. (56) həɓah lòc tɤʔ kɒ ʔuə. show text dist obl 1sg ‘Show me that book.’ (57) həɓah kɒ ʔuə lòc tɤʔ. show give 1sg text dist ‘Show me that book.’

The T argument is frequently fronted, especially when definite or topical, resulting in the pattern T A V kɒ G, obscuring the distinction between benefactive and prepositional kɒ, as in example (58). (58) lòc kɔ̀ h rɔ̀ ə ràn nɛ̀ŋ kɒ ʔuə. text medl friend buy caus.come give/obl 1sg ‘This book, my friend bought for me.’

Modern Mon

587

5.2.4 Directional, Aspectual and Modal Verbs Verbs describing the manner of movement, like kwac ‘walk’ and pɔ ‘fly’ are not directed and cannot occur with a noun indicating the goal of movement. In order to make the expression directed, a directional verb has to be added immediately after the verb. The main directionals are tɒn ‘ascend, move up’, ceh ‘descend, move down’, lùp ‘enter’, and tɛt ‘exit’. In addition there are the orientational verbs ʔa ‘go’ indicating a movement away from the center of interest, and klɤŋ ‘come’ a movement towards the center of interest. The directionals have corresponding causative forms, viz. pətɒn, phjeh, plop/həplup, and pətɛt respectively, with the suppletive causative orientational verbs na ‘take away’ and nɛ̀ŋ ‘bring here’. If the main verb is transitive and the object is moved by the activity described in the main verb, any directionals in the same verb complex must take the causative form. This depends only on the semantics of the main verb, not on its morphological form as causative. Examples (59) and (60) illustrate the use of basic and causative directionals, respectively. (59) klɒ kɔ̀ h həɗiəŋ krìp ʔa teh, hətum ceh ʔa ɗɔə ɓi. dog medl chase run go top fall descend go loc river ‘As the dog ran after the deer, it fell down into a river.’ (NM_frog01) (60) ʔe ɗɔə kɒ ɗɛh həpak ceh kɔ̀ h ʔaŋkəlòc eh loc obl 3 stab descend medl English kəpɒc phjeh na kɔ̀ h khjɒt ʔɒt.tah. dash caus.descend caus.go medl die completely ‘Now when [the airplane] came crashing down, the English struck it down, they died all.’ (WW2_nc/mn)

The use of the causative or non-causative directional affects the reading of constructions involving a potential movement of the object. The former expresses a directional sense, while the latter expresses aspect, as seen in the difference between examples (61) and (62). (61) ɗɛh hɒm na kɒ rɔ̀ ə. 3 speak caus.go obl friend ‘He told his friend.’ (‘the words moved away from him’)

588

Jenny

(62) ɗɛh hɒm ʔa ʔərè mùə nəɗi. 3 speak go language one hour ‘He went on speaking for an hour.’

Some verbs can express (or stress) the movement of either the actor or the patient. In the former case, the intransitive directional is used, in the latter the transitive/causative, as examples (63) and (64) illustrate. (63) ɗɛh pɛ̀k ʔa ka. 3 follow go car ‘He drove the car (away from here).’ (64) ɗɛh pɛ̀k na klèə. 3 drive caus.go cow ‘He drove the cattle (away from here).’

A large number of secondary verbs are used in Mon to express not only directional, but also aspectual and modal or manner functions. In many cases, the verbs can not be clearly assigned to one of these categories, and the function expressed depends on the context as well as the meaning of the main semantic verb. These V2s occur immediately after the main verb as what has been called root or nuclear serialization. The negation marker hùʔ occurs before the whole verb complex. In most cases, they also have full semantics and can stand alone as main or sole predicate of a clause. Table 4.10 lists the main V2s with their full verb meaning and most common functions. Table 4.10 Secondary verbs (post-verbal) Form Full verb meaning

Grammatical functions

cɒʔ cɔm

‘put, place’ ‘try, feel’

cut

‘put, place, insert’

‘completely’, perfective ‘try to, do and look what happens’, imperative, conative ‘do with an impact on so./sth., completely affecting so./sth.’

589

Modern Mon Form Full verb meaning

ciəʔ kɒ ket kɤ̀ʔ lɔ̀ mɔ̀ ŋ tɛ̀h thɒʔ wɔ̀ ɲ ʔa

Grammatical functions

‘eat, consume’

‘do for one’s own benefit, internally directed, completely’ ‘give’ benefactive, purposive (with following clause) ‘take’ ‘do for/by oneself’ ‘get’ ‘can, may, be able, succeed’ ‘deposit’ ‘do and leave’, preparatory, perfect, resultative ‘stay’ ‘be doing’, imperfective ‘hit, come into contact’ ‘do by accident, inadvertantly, not intentionally’ ‘discard’ ‘do without further consideration, right away, completely’ ‘play’ ‘do without having serious intentions, for fun’ ‘go’ ‘keep doing’, continuous

A number of V2s occur in clause final position, that is, after the object (if present). The negation marker hùʔ occurs either before the main verb or before the V2, depending on the scope of the negation. These V2s include the modals kɤ̀ʔ ‘get’ > ‘can, have a chance to’, lèp ‘skilled’ > ‘be able, know how’, ma£ n ‘win > be (physically or mentally) capable; possible (epistemic)’, and toə ‘finish’ > ‘have done’. The verb tɛ̀h ‘hit, come into contact with’ can also occur as member of this class, meaning ‘do correctly’. 5.2.5 Causative Verbs The inherited causative morphemes, namely p- prefix and vocalic infix are no longer fully productive in modern Mon, though a few recent Burmese loans take the prefix pə- to form causatives, such as pəɕɤk ‘trouble, bother’ from ɕɤk ‘be troubled, unclear’ (Burmese ɕouʔ ‘disturbed, unclear; disturb’; same form for causative meaning) and pətuʔ ‘make fake, disguise’ from tuʔ ‘fake’ (Burmese tú ‘fake’; no corresponding causative form). In some varieties the prefix hə- has been generalized as causative morpheme, but this is also hardly productive. A recent Burmese loan that takes this prefix is pjɤk ‘fall off, break apart, be ruined’ (Burmese pjouʔ ‘fall apart, break off’, with the causative pʰjouʔ ‘break off’), the causative form of which is colloquially həpjɤk ‘ruin’. The normal way to form causatives is by a periphrastic construction involving the verb kɒ ‘give’. This can function as permissive or jussive causative auxiliary before the main verb, or it can be combined with a verb specifying the

590

Jenny

causing act, such as hɒm ‘speak’, həkɔn ‘order, command’. The permissive/jussive causative kɒ can be used only with human (or human-like) causers. In other cases, paʔ kɒ ‘do give’ is used, though in colloquial Mon causative constructions with non-human causers are normally replaced by causal expressions. The animacy of the causee is irrelevant to the choice of the causative construction, as is the degree of control retained by the causee. While lexical causatives usually involve direct causation, periphrastic constructions of the type V kɒ ‘are underspecified in this respect. In both cases the caused action can come about or not, that is the success of the causation is not part of the semantics of the verbal expression. The form kɒ is also used to indicate a change of subject, as in məkɤ̀ʔ ʔa ‘(I) want to go’ versus məkɤ̀ʔ kɒ ʔa ‘(I) want (you) to go’. Examples of causative constructions are given in (65) and (66). (65) ʔəŋàn mùə ŋuə ɗɛh kɒ, ɗɛh kwɒ klon raʔ, quota one day 3 give 3 neg.give make foc ɗɛh kɒ cù raʔ ʔiʔ-kɔ̀ h ɗɛh kɒ cù. 3 give rest foc nml-medl 3 give rest ‘They gave us (the money) for one day’s work quota, then they wouldn’t let us work anymore, they let us rest; they let us rest then.’ (WW2_nc/mn) (66) kja pɒc paʔ kɒ taj lɤ̀m ʔa. wind blow do give hut destroyed go ‘The wind destroyed the hut.’

A few verbs can be used either as causative or non-causative without any overt marking. These include mat ‘close, be closed’, pɔk ‘open, be open’, and həton ‘teach, learn’. For a detailed account of causatives in Mon see Jenny 2005: 113ff. 5.2.6 Copulas Mon has two copulas, namely tɔ̀ h ‘be something’ and nùm ‘be somewhere, exist’. The latter overlaps in translations with mɔ̀ ŋ ‘stay, remain, be placed’, which occurs only with volitional subjects. The existential copula nùm can be used on its own, meaning ‘there is’. It is defective and always replaced in the negative by hùʔ mùə or həmùə lit. ‘(there’s) not one’. It is also used to express predicative possession, where the possessor appears with the locative

Modern Mon

591

preposition ɗɔə or as post-nominal possessor. Examples (67) and (68) illustrate the patterns of predicative possession. (67) ɗɔə ʔuə ka nùm/həmùə. loc 1sg car exist/neg.exist ‘I have/don’t have a car.’ (lit. ‘at my place there is/isn’t a car.’) (68) ka ʔuə nùm/həmùə. car 1sg exist/neg.exist ‘I have/don’t have a car.’ (lit. ‘my car exists/doesn’t exist.’)

The equative copula tɔ̀ h can be omitted in colloquial speech. The common way to form equative expressions is by marking the subject and predicate complement as topic/non-predicative and focal/predicative respectively (see 4.3). In negated expressions, tɔ̀ h is usually replaced by hùʔ siəŋ ‘it is not so’. 5.2.7 Preverbs and Auxiliary Verbs A number of secondary verbs occur before the main verb to express modal and aspectual functions. These include kɤ̀ʔ ‘get > caused event with cause in background; get to, have a chance to’, tɛ̀h ‘hit > have to, must’, chak ‘connect > continuous’, kiəŋ ‘have ever; exeriential’ (with the negative hùʔ kiəŋ ‘never’), thɔc ‘should, ought’. The form məkɤ̀ʔ (negative hùʔ mòc) occurs only as preverb with desiderative function, as in məkɤ̀ʔ ʔa ‘I want to go’, hùʔ mòc ʔa ‘I don’t want to go’. The verb tɛ̀h ‘hit’ can also occur before a complete clause with a passive-like meaning. The expression ʔuə tɛ̀h klɒ kit can be translated as ‘I was bitten by a dog’, but there is no demotion in Mon of the actor klɒ ‘dog’. A more literal translation would be ‘I came into contact with a dog that bit (me).’ The underlying P argument (= derived S/A) is obligatorily omitted in this construction. The subordinate clause is therefore always reduced, which suggests a certain degree of grammaticalziation of this construction. 5.2.8 Negation The general negation marker in Mon is the preverbal particle hùʔ. This negator cannot occur on its own and is only used to negate verbal predicates. It usually has narrow scope over the verb immediately following it. In a number of frequent verbs with velar initial, the negation marker is realized as labial infix,

592

Jenny

as in kwɒ (kɒ ‘give’), kwɤ̀ʔ (kɤ̀ʔ ‘get’), kwet (ket ‘take’), and khwɒh (khɒh ‘good’) (see Jenny 2003). Negation can be reinforced by the sentence final marker pùh, which in colloquial speech can occur as sole negation marker in some cases. Examples (69) and (70) illustrate the use of the direct negation with narrow scope. (69) pèh ʔa kwɤ̀ʔ pùh 2 go neg.get neg ‘You cannot go.’ (70) pèh hùʔ ʔa (lɛ) kɤ̀ʔ. 2 neg go add get ‘You can stay here.’ (‘It is possible for you not to go.’)

If the focus marker raʔ occurs in a negated expression, the most common reading is ‘not anymore’, indicating a change of state or plans, as in (71). (71) ɗɛh hùʔ klɤŋ raʔ. 3 neg come foc ‘He is not coming anymore.’ (‘he changed his plans’ or ‘he used to come, but not any longer’)

To negate non-verbal expressions, the postposed form hùʔ siəŋ ‘it is not (the case that)’ is used. This form is also used for sentence negation. It’s use is illustrated in (72) and (73). (72) həkaoʔ nɔʔ kɔ̀ h ʔəca kəsao lɛ hùʔ siəŋ. body prox medl teacher nml.write add neg be.so ‘She (knew that she) was no writer.’ (cl_ck_ca) (73) ŋìm-pɒəʔ-klɔm-pɔn həkah raʔ lə-kɔ̀ h, həkah kɔ̀ h thousand-three-hundred-four break foc temp-medl break medl

Modern Mon

593

ɗɛh lùp kɤ̀ʔ ʔə-tɤʔ hùʔ siəŋ pùh. 3 enter get loc-dist neg be.so neg ‘In 1304 (the town of Panga) fell, back then; when it fell, it wasn’t that they could get in there.’ (WW2_nc/mn)

5.3 Prepositions and Adverbs Mon makes use of a small number of basic prepositions that have no obvious nominal or verbal origin to indicate spatial, temporal, and other relations. Prepositions combine directly with nouns, pronouns and demonstratives. In some cases, prepositions are also used to introduce adverbial clauses (see 4.2.4). Basic prepositions include nù ‘from, since; ablative’, ɗɔə ‘in, at; locative’, sɒm (kɒ) ‘(together) with; comitative’, ʔəraŋ ‘for; benefactive’, and kɒ ‘with, by, for, to; oblique’. In formal language, swak ‘for’, kom-kaoʔ ‘together with; comitative’, kaoʔ ‘dative’, and nɛ̀ʔ (kɒ) ‘by means of; instrumental’ are also used. The preposition kɒ is homophonous with the verb kɒ ‘give’, with which it shares a number of functions, such as benefactive. Its functions cover both literary kaoʔ ‘dative’ and kɒ ‘give’, making it a general oblique preposition, which is also used in combination with other prepositions, without modifying their function or semantics. Examples are ɗɔə (kɒ) hɒəʔ ‘in the house’ and nù (kɒ) phɛ̀ə ‘from school’. Other spatial and temporal expressions use verbal or nominal lexemes, such as cɒp ‘arrive’ > ‘up to, until’, ʔətao ‘top part’ > ‘on (top of)’. The preposition həm£an (nù) ‘because (of)’ originates in a noun, but is synchronically only used as preposition. Also kərao ‘behind, after’ goes back to a noun in Old Mon meaning ‘space behind’. Similarly hətaʔ ‘in front of, before’ in modern Mon has only grammatical meaning. With human objects, the locative preposition ɗɔə is used to express predicative possession (see 5.2.6). Locative (and allative) expressions with human objects use the noun hənàj ‘place’, as in klɤŋ hənàj ʔuə ‘come to me’, mɔ̀ ŋ hənàj mìʔ ‘stay with mother’. In formal Mon, the relator noun cərìəŋ ‘vicinity’ is used in this function. Adverbial expressions in Mon are used for spatial, temporal and other functions. They occur in different positions in the clause, most commonly after the verb and object(s). Adverbial particles include kɒm ‘too, also’, nɛm ‘yet, still’, kla ‘before doing something else, first’, which occur after V and P, and the clause (or sentence) initial particles nù klaŋ ‘just now, a moment ago’, ləkɔ̀ h ‘now, then’ (shortened from kalaʔ kɔ̀ h ‘that time’), hətɔ̀ h raʔ ‘this being so’, ʔiʔ-kɔ̀ h (kɔ̀ h) ‘therefore’, and others. Other adverbs are flexible in their position within the clause, depending on the pragmatics of the utterance. These include həmùh ‘now’, ʔənɔʔ ‘here’, and others. Interrogative adverbs are chəlɔʔ ‘when’, which normally is in clause initial position when asking about future events and post-verbal when referring to

594

Jenny

past events. The interrogative mùʔ paʔ or mùʔ tɔ̀ h ‘why’ (lit. ‘what do’ and ‘what be’ respectively) always occurs clause initially, while ʔəlɒ ‘where’, həlɒ and pɤ̀m lɒ both ‘how’, are clause initial or post-verbal. 5.4 Sentence Particles A number of particles are used in sentence final position to express different functions. Among these are the emphatic lè ‘emphatic; really, right now, at once, at last’, noŋ ‘assertive, assumptive’, nah ‘emphatic; soliciting consent’, jaʔ ‘new situation, situation after a change of situation’, nɛm ‘still, yet’, phɤh ‘still’, ha ‘interrogative’, pùh ‘negation’, and others. The first three of these, lè, nah, and noŋ can occur after other particles, as in ʔa raʔ nah ‘I’m leaving now, OK’ or ɗɛh hùʔ ciəʔ pùh lè ‘he didn’t eat at all’. The particle noŋ is often used to translate the Burmese future marker mɛ, though it’s basic function is not temporal. Rather, it is an assertive marker, indicating that the speaker is sure about the truth content of his utterance (see Jenny 2006 for details). The particle ʔao can be added to the end of an utterance in conversation for politeness. It is also used to answer or acknowledge a call or request. 5.5 Ideophones and Euphonic Expressions Especially in the literary style, Mon makes extensive use of ideophones describing not only sounds, but also other sensations. These ideophones are partly conventionalized, leaving room for spontaneous formations according to some rather loose rules concerning consonant and vowel qualities conveying different sensations, rather than meanings. Examples of ideophones with their approximate meanings are given in (74). (74) poɲ~poɲ-pɤŋ~pɤŋ ‘sound of drums’ phjaj~phjaj ‘(smile) faintly, lightly’ phjiə~phjiə ‘(flash) sparkingly’ tɛ̀ŋ~tɛ̀ŋ ‘with a creaking, grating noise’ tìn-thin~tìn-thin ‘with a drumming noise’ phji~phji-phja~phja ‘with a flashing light, like lightening’.

Euphonic compounds and elaborate expressions are a common feature of Southeast Asian languages. The former usually consist of a lexical word and a meaningless element which shares some of the phonetic features of the lexical element, the latter are often made up of four syllables, showing vowel rhyme and/or alliteration, and lexical parallelism. These expressions are frequent in literary and poetic language, especially proverbs and sayings, as well as in

595

Modern Mon

prefabricated speech. The use of idiomatic euphonic expressions is considered good language by Mon speakers and is the mark of a good writer. Examples of euphonic compounds are given in (75), elaborate expressions in (76). (75) Euphonic compounds prùʔ-prɛ̀ʔ ‘sounds, noises’ həkaoʔ-həko ‘body, appearance’ ɲìʔ-ɲɛ̀ʔ ‘a little bit’ kəre-kərot ‘lament’ həne-ke ‘vegetables’

prùʔ həkaoʔ ɲìʔ kərot həne

‘make sound, noise’ ‘body’ ‘little’ ‘lament, complain’ ‘vegetables’

(76) Elaborate expressions and idioms cɒt kjɒɲ, rùp kjɛ. ‘(If the) heart is beautiful, the body is pretty.’ chan ɗat, ɗuh kat. ‘love the sweet, hate the harsh’ ɗɒp kaʔ, hətaʔ sùm. ‘(The) head (is a) fish, (the) tail (is a) snake.’ tɤ̀ həpɔh, sɔ̀ h həcam. ‘seven mountains, eight ravines’ (many obstacles) tɛm həkaoʔ, ʔəhmaoʔ sa. ‘(if you) know yourself, the duty (becomes) light.’

6

Semantics Fields

In this section, only a few examples of semantic fields of Mon that are relevant in the areal context are presented. As can be expected, Mon exhibits a large and detailed vocabulary for tropical flora and fauna, including both edible and non-edible species. One striking feature of the Mon vocabulary is that there are no general terms for example for lizards and ants, but only specific names of different species, while birds, fishes, and plants have general class terms which are used as class nouns prefixed to the specific names (see 5.1.5). As traditional rice cultivating culture, Mon has detailed terminology for rice and related fields. The common words for rice are sɒʔ ‘rice in the field’, haoʔ ‘uncooked rice grains’ and pɤŋ ‘cooked rice’, which also serves as generic object for ciəʔ ‘eat’. In addition, there is the word pəlɔɲ for ‘sticky rice’, formally a nominalized form of klɔɲ ‘oily’. The vocabulary of consuming consists basically of two terms, with ciəʔ ‘eat’ covering the consumption of solid food, including betel chewing, and sɤŋ ‘drink’ being used for the consumption of liquids and medicine, as well as smoking tobacco. Mon has many compounds involving cɒt ‘heart, mind’ (from Pali citta ‘mind’) expressing feelings and traits of a person’s character. The noun cɒt

596

Jenny

‘heart, mind’ can occur either as first or second element of the compound, formally thus as subject or object of the phrase. Some examples of these so called psycho-collocations (Matisoff 2004) are given in (77). (77) Psycho-collocations pre-cɒt ‘content, appeased, comforted’ mìp-cɒt ‘happy’ ʔon-cɒt ‘disappointed’, sad’ hloɲ-cɒt ‘take pleasure’ cɒt-klèʔ ‘short tempered’ cɒt-klòɲ ‘easy going, patient’ cɒt-kətao ‘distressed’ cɒt-lɤ̀m ‘discouraged, disappointed’ cɒt-pɤ̀n ‘firm of purpose’ cɒt-hnòk ‘quarrelsome’

pre ‘smooth, appeased’ mìp ‘comfortable’ ʔon ‘few’ hloɲ ‘asleep’ klèʔ ‘short’ klòɲ ‘long’ kətao ‘hot’ lɤ̀m ‘broken, destroyed’ pɤ̀n ‘strong, firm’ hnòk ‘big’

Kinship terms are widely used to address and refer to non-relatives and are an important factor of Mon society. There is some borrowing of kinship terms from Burmese (bold in table 4.11), but basically inherited vocabulary is used. A gender distinction is made from older siblings upwards, but not for people younger than the speaker. There is some dialectal variation in the terminology, and a general trend to simplify the system can be observed. The following table lists the most common kinship terms. Table 4.11 Kinship terms kon (kon-)cao tèʔ ʔəwao, kao ʔəmaʔ, ɓɔ̀ ə ʔəmù ʔiʔci, sɒə ʔəpa, mɛ̀ʔ, ʔiʔta, baʔ jàj, mìʔ, ʔəmè ʔənàj ʔiʔnàj

son, daughter grandchild younger sibling older brother older sister uncle (younger than parent) aunt (younger than parent) father mother uncle (older than parent) aunt (older than parent)

ʔənòk, pa-nòk, lɛ̀ʔ ʔinòk, mìʔ-nòk, pèə pa-nɛ̀k, cɛ̀ʔ mìʔ-nɛ̀k, cùʔ lɛ̀ʔ-cùʔ, ʔəcùʔ ʔiʔcùʔ kon-kao-kon-tèʔ kon-mɛn mɛ̀ʔsaʔ mìʔ-mɛ̀ʔ, jàj-ʔəpa

grandfather grandmother great-grandfather great-grandmother granduncle grandaunt cousin (older or younger) niece, nephew family, relations parents

Modern Mon

7

597

Sample Glossed Text

World War II in Monland (extract), interview with Nai C. 74 years, Mi Ng., 74 years, and Nai O., 40 years, from Kaw Dot village, Ye Township, Myanmar, recorded in 2001. (1) pɤ̀ŋ kjìʔ ceh ɗɤŋ ɗɛh. bomb poisonous descend land 3 ‘The poisonous (= nuclear) bomb fell on their land.’ (2)

ceh ɗɤŋ ɗɛh tɤʔ, ʔe ceh ɗɤŋ ɗɛh toə teh descend town 3 dist filler descend town 3 finish top

cəpan kɔ̀ h kjaʔ siəŋ, cəpan kɔ̀ h kjaʔ teh, ʔe ɓɛ̀ʔ Japan medl defeated be.so Japan medl defeated top eh ref ʔəjaŋ kɔ̀ h ɗɛh hətah kəpac ʔaŋkəlòc plɔn. ɗɛh pn medl 3 turn.around side English again 3 hətah kəpac ʔaŋkəlòc kɔ̀ h toə kɤ̀ʔ ɓɒt ɓa hnam turn.around side English medl finish get aprx two year

ɗɛh lɒ̀ c klɤŋ pəŋaʔ plɔn. kon rɔ̀ ə ɗɛh khjiəŋ 3 arrive come pn again offspring friend 3 Chin

hə-ʔɒt. sɔttɒəʔ ɗɛh nùm, kon rɔ̀ ə ɗɛh khjiəŋ hə-ʔɒt. adv-all strength 3 exist offspring friend 3 Chin adv-all ‘It fell on their land. Well, it fell on their land and then the Japanese lost (the war), right? The Japanese lost and that Ayang turned around to join the English again. When we had turned around and joined the English for about two years, he came to Panga again. His people were all Chin. They were very strong. His people were all Chin.’

598

Jenny

(3) laʔ ʔəkhoɲ kɔ̀ h ʔəca ʔɔŋ.thon tɔʔ pɔk lɔ̀ when time medl master pn pl open deposit tɔ̀ p jaʔ ʔəpa. armed.unit nsit father ‘At that time, had Aung Tun and his people already set up their unit, father?’ (4)

pɔk jaʔ lə-kɔ̀ h kɔ̀ h. ʔuə kɔ̀ h ŋuə cu ʔuə open nsit temp-medl medl 1sg medl day upright 1sg cao ciəʔ pɤŋ həjèh. kɤ̀ʔ lɔ̀ mɛ̀ʔ kɔm return eat cooked.rice morning get deposit seed bullet ɓa.ʔiʔ.sɔn pat lɔ̀ ɗɔə nòɲ kɔ̀ h toə ʔuə kɔ̀ h twenty.five put.around deposit loc waist medl finish 1sg medl tɛt klɤŋ nù tɤʔ ɓɛ̀ʔ kon rɔ̀ ə ʔəjaŋ ɓɛ̀ʔ khjiəŋ exit come abl dist ref offspring friend pn ref Chin tɔʔ kɔ̀ h ɗɛh rɔ̀ p kɤ̀ʔ ʔuə kɔ̀ h teh, ʔuə hɒm ʔərè pl medl 3 catch get 1sg medl top 1sg speak language

həmɛ̀ə lɛ ɗɛh hùʔ tɛm, ʔuə hɒm ʔərè khjiəŋ kɔ̀ h Burmese add 3 neg know 1sg speak language Chin medl

lɛ hùʔ tɔ̀ h. ʔuə həɓah hə-nɔʔ teh ʔuə mɔ̀ ŋ hənàj add neg be 1sg show adv-prox top 1sg stay place

kɔ̀ h ʔuə hɒm, lə-kɔ̀ h ɗɛh pəlɔŋ na hə-cɒp medl 1sg speak temp-medl 3 send caus.go caus-arrive

Modern Mon

599

ʔa, cɒp hɒəʔ ʔɤkkəthaʔ ʔɔŋ.thon teh, ʔəjaŋ kjo ceh arrive house chairman pn top pn peek descend go

ʔe ʔɤkkəthaʔ ʔɔŋ.thon hɒm kɔ̀ h, ʔe kon rɔ̀ ə hɒəʔ eh chairman pn speak medl eh offspring friend house

nɔʔ raʔ, ɗɛh həlɛ̀h na lè. prox foc 3 release caus.go emph ‘They had already set it up. I went back to have my morning meal around lunch. I had got twenty five bullets which I put around waist and I walked out from there. Now those people of Ayang, those Chins, they caught me. When I spoke Burmese they didn’t understand me, and I didn’t speak Chin either. I gestured like this, saying that I stayed there. Then they took me to the house of chairman Aung Tun, and Ayang looked down. Well, the chairman Aung Tun said that I was one of his people, so they let me go.’

8 Bibliography Bauer, Christian. 1982. Morphology and Syntax of spoken Mon. Ph.D. dissertation, SOAS, University of London. Blagden, C.O. 1936. Epigraphia Birmanica vol. 4. (edited by U Mya). Rangoon: Government Printing and Stationery. Diffloth, Gérard. 1984. Dvaravati Old Mon and Nyah Kur. Monic Language Studies vol. 1. Bangkok: Chulalongkorn University. Ferlus, Michel. 1986. Essai de phonétique historique du môn. Mon-Khmer Studies 12: 1–90. Guillon, Emmanuel. 1976. Some aspects of Mon syntax. In Philip N. Jenner, Laurence C. Thompson & Stanley Starosta (eds.), Austroasiatic Studies. Part 1. Honolulu: The University Press of Hawaii, 407–21. ———. 1999. The Mons. A civilization in Southeast Asia. Bangkok: Siam Society. Halliday, R. 1917. The Talaings. Rangoon: Government Printing. ———. 1923. Lik smin asah. The story of the founding of Pegu and a subsequent invasion from South India. With Englsih translation, notes, and vocabulary of undefined words. Rangoon: American Baptist Mission Press. ———. 1955 [1922]. A Mon-English dictionary. Rangoon: Ministry of Union Culture. Haswell, J.M. 1901. Grammatical notes and vocabulary of the Peguan language. 2nd edition edited by Edward O. Stevens. Rangoon: American Baptist Mission Press. Huang, Yan. 2000. Anaphora. A cross-linguistic study. Oxford: Oxford University Press.

600

Jenny

Huffman, Franklin E. 1990. Burmese Mon, Thai Mon, and Nyah Kur: a synchronic comparison. Mon-Khmer Studies 16–17: 31–84. Jenny, Mathias. 2003. New infixes in spoken Mon. Mon-Khmer Studies 33: 183–94. ———. 2005. The verb system of Mon. Zurich: ASAS. ———. 2006. Mon raʔ and noŋ; assertive particles? Mon-Khmer Studies 36: 21–38. ———. forthc. a. Grammatical relations in Mon—syntactic tests in an isolating language. In Alena Witzlack-Makarevitch & Balthasar Bickel (eds.) Handbook of grammatical relations. Amsterdam & Philadelphia: Benjamins. ———. forthc. b. Comparative, similative, and equative constructions in Mon—form, function, and development. In Yvonne Treis & Martine Vanhove (eds.) Similative and equative constructions: A cross-linguistic perspective. Amsterdam & Philadelphia: Benjamins. Matisoff, James. 2004. Areal semantics—is there such a thing? In Saxena, A. (ed.), Himalayan languages. Berlin & New York: Mouton de Gruyter, 347–93. McCormick, Patrick, Mathias Jenny & Chris Baker. 2011. The Mon over two millennia. Monuments, manuscripts, movements. Bangkok: Institute of Asian Studies, Chulalongkorn University. Nai Pan Hla. 1988/89. Introduction to Mon language. Kyoto: Kyoto University Press. Nai Tun Way. 1997. The modern English-Mon dictionary. Bangkok: Tech Promotion and Advertising. ———. 2000. The modern Mon-English dictionary. Bangkok: Tech Promotion and Advertising. Sakomoto, Yasuyuki. 1994. Mon-Japanese dictionary. Tokyo: University of Foreign Studies. ———. 1996. Japanese-Mon dictionary. Tokyo: University of Foreign Studies. Shorto, Harry L. 1962. A dictionary of spoken Mon. London: Oxford University Press. ———. 1971. A dictionary of Mon inscriptions from the sixth to the sixteenth centuries. London: Oxford University Press. South, Ashley. 2003. Mon nationalism and civil war in Burma. The golden sheldrake. London: RoutledgeCurzon. Stevens, Edward O. 1896. A vocabulary, English and Peguan, to which are added a few pages of geographical names. Rangoon: American Baptist Mission Press. Taw Sein Ko & Charles Duroiselle (eds.). 1919, 1921, 1928. Epigraphia Birmanica vols. 1–3. Rangoon: Government Printing and Stationery. L.-Thongkum, Theraphan. 1988. Another look at the register distinction in Mon. In C. Bamroongraks et al. (ed.), The International Symposium on Language and Linguistics. Bangkok: Thammasat University, 22–51. ———. 1990. The interaction between pitch and phonation type in Mon: phonetic implications for a theory of tonogenesis. Mon-Khmer Studies 16–17: 11–24.

section 3 Pearic

․․

chapter 5

Chong Suwilai Premsrirat and Nattamon Rojanakul 1 Background Chong (cog) is a minority language of Thailand; it belongs to the Pearic group of Austroasiatic. Pearic languages, which include Cha-ung or Saoch, Chong, Kasong, Samre, and Pear extend across eastern provinces of Thailand, Chanthaburi and Trat, and western provinces of Cambodia such as Batdambang, Pursat and Kampot. Due to the rapid decrease in mother tongue transmission, now it is difficult to determine the exact number of Chong speakers. Different studies present different numbers of Chong speakers in Thailand, ranging from merely 500 (Grimes 2000) to about 4,000 (Premsrirat et al. 2007). The latter source indicates that, today, the largest group of Chong speakers is in Khao Khitchakut district of Chanthaburi province, although the Chong speaking ability varies a great deal. The younger generation, especially those under 30 years old, cannot speak Chong. Researches on Chong language in Thailand in the past consisted mostly of descriptive studies of phonological and syntactic features (Phonology: Suphannaphaibun 1982; Huffman 1985; Phonology and Syntax: Charoentham 1987; Luuamkhan 1991; Phaetphithak 1996), instrumental analyses of ‘register’ from a phonetic point of view (L-Thongkum 1991; Edmondson 1996), and a comparative study on Pearic dialects conducted by Martin in 1974. The most extensive field research on Chong population and occupation is the work by Weber (1976), which gives a broad view of settlement of Chong people in Chanthaburi province. The oldest account of these peoples dated back as far as the ancient Khmer Empire of 14th century. A Chinese diplomat at the Khmer court about 700 years ago, referred to ‘Chuang’ as uncultured people who were enslaved and sold in the town (Chou 1987: chapter 9): Settlement Areas: From various sources, for instance Pallegoix (1853), Seidenfaden (1967), Weber (1976), and Kenikanon (1979), it can be assumed that the Chong had lived in the hinterland, in contrast with coast area, of Chanthaburi for centuries. That area is distinct for its rugged landscape with small patches of flat plains here and there which are not suitable for wet rice cultivation. But

© koninklijke brill nv, leiden, ��5 | doi ��.��63/9789004283572_�11

604

Premsrirat and Rojanakul

their Soi Dao Mountains, from where Chanthaburi River originates, has a reputation for its abundance of flora and fauna. Most of these mountains have now become forest reserves, a wildlife sanctuary and a national park. In Chong village is called suk, a loan from Khmer sruk ‘town’, with slight change in meaning. Often, the Chong villages are called suk phrii ‘forest village’, possibly to emphasize their environs. Chandler (1996) proposed that ancient Khmer communities could be classified into 3 categories: towns or sruk, riceproducing villages around sruk, and forest villages. Martin (1974) described the Pear of the Cardamom Mountains, an affiliate of Chong in Cambodia, as cultivator-gatherer who live both on subsistence cultivation and forest goods gathering. We may deduce that Chong communities in Thailand had developed from small forest villages of the same sort. 1.1 Chong Speakers’ Struggle for the Survival of Their Language Apart from the documentation and description of Pearic languages by linguists, there are the reactions from grassroots communities for language revitalization programs with guided cooperation from linguists. Language Revitalization Programs are an attempt to add new linguistic forms or social functions to embattled minority languages with the aim of increasing the languages’ uses or users” (King, 2001: 23). The Chong Language Revitalization Program (CLRP) began with cooperation between Chong elders and Mahidol linguists with strong community motivation and commitment. Financial support was received from the Thailand Research Fund (TRF) as community-based research conducted by the speakers themselves. The technical support as well as spiritual and emotional support was received from the Research Institute for Languages and Cultures of Asia (formerly Institute of Language and Culture for Rural Development (Mahidol University-Linguists and education experts). The Chong Language Revitalization Program (CLRP) is composed of orthography development, literature production, curriculum development, teaching Chong as a subject in school (by native language speakers), and a Chong community learning center for public at large. The curriculum for teaching Chong as a subject has been developed. In this way, Chong students are able to use Chong as well as Thai (official language) in an educational atmosphere. They are proud of their part in Chong Language Revitalization Program (CLRP). The preparation of teaching materials is in accord with the cultural calendar and localized content. Teachers were selected from among the people in the community. The teacher selection criteria are based on proper pronunciation of Chong, dedication to the CLRP, and acceptance by the Chong community.

605

Chong

2 Phonetics/Phonology This section is a synthesis of what preceding linguists have published on Chong phonology, and of my own experience with Chong speakers in Khlong Phlu area. Most examples are from my own field notes. Otherwise references are given. 2.1 Words/Syllable Structure Most words in Chong consist of monosyllabic and disyllabic words, the latter mostly being iambs that can be divided into minor and major syllables. In mono-morphemic disyllables the minor syllables have a short schwa-like vowel (sesquisyllables). Additionally there are various disyllables transparently derived from affixation and compounding—while the first syllable in these is not stressed the vowel quality remains important. Trisyllabic words exist but are rare, and all are formed by compounding a monosyllable with a sesquisyllable. Monosyllabic words CV(C) CCV(C) tiː ‘hand’ prəː ‘use’ pɔː ‘hold’ pʰri̤ː ‘forest’ wa̤ ːj tiger’ plɔ̰ːk ‘mud’ huəc ‘whistle’ boːt ‘younger sibling’

Disyllabic words Cə.CV(C) CCə.CV(C) Cə.CCV(C) CəC.CV(C) kʰənḛːw ‘children’ prəʔoː ‘yesterday’ kəpʰlə̤ːŋ ‘gun’ caŋkriw ‘butterfly’ pataː ‘below’ kəmluː ‘betel’ biŋbaːŋ ‘spider’ kənaːj ‘elephant’ ca̤ mkʰɨ̤n ‘female’ kənṵəc ‘rope’

Trisyllabic structures CVC. Cə.CV(C) luk.kəmoːj ‘salt’

CVC. Cə.CCV(C) luk.kəpʰrik ‘chili sauce’

606

Premsrirat and Rojanakul

2.2 Phoneme Inventories 2.2.1 Consonants The following occur as the initial consonants:

Plosive

Nasal Trill Fricative Approximant

Bilabial

Alveolar

Palatal

Velar

Glottal

p b ph m

t d th n r s l

c ch

k kh

ʔ

ɲ

ŋ

(f) w

h

j

The following occur as main syllable codas:

Plosive Nasal Trill Approximant

Bilabial

Alveolar

Palatal

Velar

Glottal

p m

t n r l

c ɲ

k ŋ

ʔ

w

j

h

2.2.2 Vowels Vowel length is contrastive in stressed syllables, and all vowels can occur as short or long; the full vowel inventory is as follows: Closed Closed—mid Mid Open-mid Open

i iː e eː ɛ ɛː

ɨ ɨː ə əː a aː

u uː ɔ ɔː o oː

607

Chong

2.3 Suprasegmentals Chong is a non-tonal language, like other Pearic languages, Chong has a typologically unusual register system with four register contrasts (Suphannaphaibun 1982; L-Thongkum 1991; Edmondson 1996). Pitch and contour are often part the register distinctions, but tone alone is not phonemic in the language. First register: “normal-clear voice” or sǐang pòkàtì or ‘normal voice’ in ThaiChong. It is usually accompanied by a relatively high pitch and more open or on-gliding vowel as in boːt ‘youger sibling’, tɔŋ ‘house, home’ and kəjaːŋ ‘tortoise’. Second register: “creaky voice”, or sǐang bìːp ‘a squeeze voice’ in Thai-Chong. It is accompanied by high rising-falling pitch and more open vowels as in kʰə̰ːm ‘scented’, kʰənḛːm ‘tree’. Third register: “deep-breathy voice”, or sǐang tak ‘big voice’ in Chong. It is accompanied by low pitch and raised vowel as in lṳːc ‘steal’, kəwa̤ ːj ‘tiger’ and kəla̤ ːŋ ‘ear’. Forth register: “breathy—creaky voice (breathy followed by creaky)”, or sǐang tàm kratuk ‘low-jerky voice’. It is accompanied by a high-falling pitch and raised vowel as in cʰɔ̤ːˀŋ ‘people and name of a language and ethnic groups’. For the Thai loan words in Chong, the five Thai tones on each words ( l̄ evel, ̀ low, ̂ high-falling, ́ high, and ̌ low-rising) are normally kept. However, it is noticeable that in connected speech, the tone seems to be neutral or inconsistent. At the same time, the Chong low-breathy register appears in some words such as ba̤ ːn ‘home’, ŋɔ̤ʔ ‘rambutan’, or sa̤ j ‘put in’. Stress and intonation in Chong The main stress of words and phrases usually falls on the last syllable. The normal sentence intonation falls at the end of the sentence. However, for questions, the pitch is raised at the last syllable. 3

Word Formation

3.1 Compounding 3.1.1 Nominal Compounds There are various types of nominal compounds, the most common are nominal attributive formations (head followed by attribute, which can be a noun or verb), and items that are typically used with a specifier or class noun (such as fruits, animals, tools etc.) in which the class noun functions as head and is followed by the species or common term. In attributive compounds, the

608

Premsrirat and Rojanakul

modifying lexeme can stand in any semantic relationship to the head, such as possessive, agent, patient or general associative. The two types are not always clearly distinct, and the specifier compounds can be seen as a subclass of attributive compounds, with the specifying element modifying a (semantically underspecified) head noun. Attributive compounds: (1)

nāːj tɔŋ owner house ‘head of family’

(2) tʰa̤ ːk cʰaː water eat ‘drinking water’

Specifier compounds: (3) pʰliː pʰriːŋ fruit black.plum ‘black plum’

Another type of compounds are additive, as seen in the following example: (4) kʰeːn coː child grandchild ‘descendant’

3.1.2 Verbal Compounds There are many two-syllable verbal compounds, typically formed by concatenating two semantically related verbs to produce a somewhat semantically richer or more specific meaning, further reinforced by the more rhythmically appealing two syllable structure. Additionally there are forms where the second element is a noun functioning as a generic object. Verb-verb examples: (5) tʰɔː cʰaː do eat ‘to earn’

609

Chong (6) cʰàp rûəp catch gather.up ‘to capture’

Verb-noun examples: (7) tʰɔː sɛː do field ‘to do the rice field’ (8) klam tʰa̤ ːk soak water ‘to soak’

3.2 Derivational Affixation 3.2.1 Deriving Nouns Apparently modeled on the equivalent Thai construction, the Thai loan kʰwāːm can be used as a nominalizing prefix to derive nouns from verbs. E.g.: (9) kʰwāːm-pʰoʔ nml-dream ‘dream’

The prefix /pa-/ ‘side’ can be added to a preposition, demonstrative, or a verb to form a locative expression (adverb or noun), as seen in the following examples. dɨŋ ‘on’ heːn ‘that’ reʔ ‘in’ tʰɛ̤ːw ‘other’ nih ‘where’ kə̰ːt ‘low’ cʰaː ‘to eat’

→ padɨŋ ‘above’ → paheːn ‘there’ → pareʔ ‘inside’ → patʰɛ̤ːw ‘elsewhere’ → panih ‘where’ → pakə̰ːt ‘below’ → pacʰaː ‘a place for food’

The instrumental infix /-an-/ functions as a nominalizer. It is added to a transitive verb to derive instrument nouns, as illustrated by the following examples. kʰɛːp ‘to grill’ kəːj ‘to prop’

→ →

kʰanɛ̰ːp ‘pointed stick used for grill’ kʰanəːj ‘pillow made from wood’

610 kɔ̀ ːk kʰe̤ːt

Premsrirat and Rojanakul ‘to carry on shoulders’ ‘to comb’

→ →

kanɔ̰ːk ‘carrying pole’ kʰane̤ːt ‘comb’

3.2.2 Deriving Verbs The causative prefix /ma-/ combines with intransitive verbs to derive transitive verbs as in (10) hoːc ‘die’ mahoːc ‘to kill’

3.3 Reduplication Full reduplication in Chong is frequently employed to express emphatic, distributive, or plural meanings. In the case of compound nouns, both parts can be reduplicated separately. The reduplicated expression usually functions as nominal (‘adjective’) or verbal (‘adverb’) modifier. ŋa̤ ːˀj~ŋa̤ ːˀj deːw~deːw kʰeːn~kʰeːn-coː~coː kic~kic nḛːw~nḛːw

‘very far’ ‘really’ ‘descendant’ (kʰeːn ‘child’, coː ‘grandchild’) ‘very small’ ‘children’

(11) ma̤ ʔ prɔ̄ ːj~prɔ̄ ːj rain drizzle~red ‘drizzling rain’ (12) kʰeːn kic~ kic children small~red ‘The little children’

4

Phrase and Clause Structure

4.1 Simple Sentences In terms of structural clause, a clause consists of a predicate (usually a verb phrase) plus noun phrases functioning as subject, object, recipient, goal, instrument, etc. A clause is a minimum sentence, just as a verb phrase is a minimum clause. The basic word order in Chong is SV/AVP. Peripheral segments may precede or follow the nucleus, these are considered to be additional elements which provide more details, such as temporal setting, location setting,

611

Chong

instrument, beneficiary, etc. Time and location or circumstantial setting is found in the sentence periphery. A verb phrase consists of the main verb, describing actions or states, plus optimal elements modifying the nature of action. It can be schematized as follows: Chong Verb-Phrase: preV [1]

preV [2]

negation modal

preV [3] preV [4]

modal/ aspect

main negation Verb

postV [1]

postV [2] postV [3] postV [4] postV [5]

directional modal

modal

negation intensifier

4.1.1 Transitive Clause The transitive clause consists of a subject, a predicate and an object, the object being required. Some stative verbs, such as pʰḛːm ‘be angry’ can take a direct object, as seen in example. E.g.: (13) dak kāmlāŋ tàt jāːŋ 3h prog rive rubber.tree ‘He is riving rubber trees (for making a resin).’ (14) pʰəj tap tʰuh me̤ː-wa̤ ːj 3 bite breast mother-tiger ‘They bite the mother tiger’s breast.’ (15) cákcàn pʰḛːm sṵːc si cicada angry ant disc ‘The cicada is angry (is angry at the ant) an ant.’

4.1.2 Intransitive Clause The intransitive clause consists of a subject and a predicate, with no direct object. A bare noun may follow an intransitive verb, usually with adverbial function, as seen in example. (16) kəpaːw hiːt buffalo disappear ‘A buffalo disappears.’

612

Premsrirat and Rojanakul

(17) wa̤ ːj pʰi̤c ŋiʔ tiger sleep noon ‘A tiger sleeps at noon.’

4.1.3 Ditransitive Clause A ditransitive clause normally contains two objects. It consists of a subject, a predicate, a direct object and an indirect object. The normal word order is AVTG. (18) ʔuːɲ ʔɨt kātāŋ boːt father give money younger.sibling ‘Father gives money to my younger brother.’ (19) siŋ pɔ̂ ːn kʰâːw boːt pn feed cooked.rice younger.sibling ‘Sing feeds rice to his younger brother.’

4.1.4 Existential Clause The existential verb ʔiːn1 is used to introduce new reference to a discourse. The S argument regularly occurs postverbally, as in the following examples. (20) ʔiːn təhǎːn pʰo̰ ːn nak get soldier four clf ‘There are four soldiers.’

The existential expression can be modified by an adverbial phrase, as in example , with a reduplicated verb or a prepositional phrase respectively. The adverbial modifier normally occurs in the final position of the existential clause. (21) ʔiːn pʰliː pʰriːŋ mo̤ ːˀj nḛːm cʰə̤n~cʰə̤n get fruit black.plum one clf near~red ‘There is one black plum tree nearby here.’

4.1.5 Non-verbal Predicates Not every sentence or clause necessarily contains a verb as predicate. A nominal element can take this function without a copula, though the general copula pɛn ‘to be’, as in example (22), and the identificational copula kʰɨ̄ː ‘be something,

1 ʔiːn functionally acts as a main verb meaning ‘to get’, ‘to exist’, ‘to have’ or modal which can mean ‘to be able to’, ‘to be possible to’.

Chong

613

be the same as’, as in example (23), (both obvious loans from Thai) are used in Chong. If no copula is used in a clause, the relation of the nominals is determined by their semantics and the context. (22) tɔŋ láŋ tak pɛn tɔŋ pʰûː.jàj house clf big be house village.headman ‘The big house is the village headman’s house.’ (23) kʰâːw.lɨ̌əŋ.mɔ̤ŋ.bāj kʰɨ̄ː kʰâːw lɨ̌əŋ.tāː.hɛ̂ːŋ na.lɛ Rice.Lueng.Mong.Bai idf rice Lueng.Tahaeng disc ‘Lueng Mong Bai rice is (like) Lueng Ta Hang rice.’ (24) nām də̄ːm Ø kəwa̤ ːj name former (be) tiger ‘Its former name is Tiger.’ (25) wa̤ ːj Ø reʔ duːŋ həːj tiger (be) inside well nsit ‘The tiger is in the well.’

4.1.6 Double Subject Clauses Chong clauses can contain what looks like two subjects, a construction described in the literature as ‘double-subject construction’. This clause type consists of a topical NP, about which a statement is made which itself consists of a subject and a predicate. The main predicate is therefore an n-V combination. (26) cʰɔː cʰǎn cʰoːp plak dog 1sg leg broken ‘My dog has broken legs’ (27) tɔŋ heːn kəbuj luʔ house ana roof decay ‘That house has a pierced roof.’

4.1.7 Imperative Clauses The imperative clause is used in direct speech to command somebody to do something or to forbid him/her to do something. Structurally, the subject is presented by addressing terms or the name of person if the speaker wants to focus on the subject. Frequently, the subject position is left empty in imperative expressions.

614

Premsrirat and Rojanakul

The degree of imperative clause depends on the situational context; it can be either a direct command, or a polite request indicated by a clause initial or final particle. Intonation plays an important role in the formation of imperatives. There are different types of imperative clauses: imperative, adhortative, and prohibitive. Imperative Imperative clauses vary from a strong command to a mild request. Normally, a downtoning sentence final particle is added in a mild imperative clause, with a falling intonation additionally indicating a polite request. A strong imperative is usually uttered with a rising intonation and often has no a final particle. (28) ceːw le̤ːŋ ŋa̤ ˀːj~ ŋa̤ ˀːj ceːw go play far~red go ‘Go and play far from here!’ (29) kâːw cḛːn ʔan step come prox ‘Come here!’ (30) kʰɨŋ kəpáw ʔɨt mɔ̤ŋ hold bag give com ‘Hold my bag!’ (31) ʔoːc kʰoː kɛː cḛːn ʔɨt mɔ̤ŋ na take mortar 2sg come give com disc ‘Bring me your mortar, please.’

Prohibitive The prohibitive is commonly indicated by a negative word /ma̰ ːj/ ‘Do not’ before main verb. This clause type expresses a strong command or compulsion to someone without a final particle. Discourse particles are therefore frequently added to downtone the command. (32) ma̰ːj cʰaː kʰɔ̌ ːŋ puk lɔː proh eat thing rotten disc ‘Do not eat the rotten food!’ (33) ma̰ːj ʔɨt ceːw nih ŋa̤ ˀːj lə̄ːj proh give go where far disc ‘Do not let it go far away, please.’

Chong

615

4.1.8 Interrogative A question word is required in this clause type. Usually, an interrogative clause has a pattern similar to other clause types, in the order of AVP, but the differences in the question marker position is added either preceding or following the basic clauses. There are two subtypes of interrogative clause, that is polar questions and content questions. They differ in the types of question words occurring in them. Polar question The polar question clauses are structured as a statement plus the addition in final position of the yes/no question markers ʔih doː, ʔih, rɨː taj, ʔih daːj or hɔʔ hɛː which may be interchanged depending on the context. The subject of interrogative clauses is normally subtracted when it is understood by speakers, especially if it is the addressee. E.g.: (34) cə̄ʔ ʔâj -pʰāj ʔih wa meet addr-pn q disc ‘Did you see Phai?’ (35) ʔiːn lɔ̤ːŋ ʔihdoː get husband q.nsit ‘Do you have a husband?’ (36) hǐw plɔŋ hiːt hungry cooked.rice q ‘Are you hungry (yet)?’ (37) kʰûː ʔan sǒm saː kɔ̂ ː rɨːtaj couple prox suit recp tcl q ‘This couple is suitable, isn’t it?’

The following examples show the use of different question markers with the same nucleus clause for the answer ‘yes’ or ‘no’. The question marker ʔih doː is used to ask whether a certain (expected) event has come about by the time of speaking, while ʔih is used for questions about the event itself. (38) hɔːp plɔŋ ʔihdoː eat cooked.rice q.nsit ‘Have you already eaten?’

616

Premsrirat and Rojanakul

(39) hɔːp plɔŋ ʔih eat cooked.rice q ‘Do you want to have a meal?’

Content question Content question words generally occur in situ, that is, they occupy the position in the clause where the element asked about would occur. No sentence final question marker is needed in content questions. The most common interrogatives are the following. mih nih, panih piʔ, kəpiʔ moːjʔih cʰiːw tʰɔːʔih jaŋʔih naʔih

‘who’ ‘where’ ‘what’ ‘how many, how much’ ‘how many, how much’ ‘why’ ‘how’ ‘when’

(40) mih rɨh pʰlēːŋ ʔan həːp who sing song prox disc ‘Who sings this song?’ (41) ceːw kaːn mih ləpʰa̤ ːŋ go ceremony who pn ‘To whose ceremony are you going at Lampang?’

The interrogatives nih or panih ‘where’ and naʔih ‘when’ usually occurs clause finally, the usual position for adverbial expressions. (42) cḛːn càːk nih come from where ‘Where are you from?’ (43) cʰɔː tap tōŋ nih dog bite loc where ‘Where did the dog bite you?’

Chong

617

(44) wa̤ ːj tʰáːm wə̤t cʰaː panih tiger ask search eat where ‘A tiger asked “where will you find the food?’ (45) klàp naʔih ʔan lɔː return when prox disc ‘When will you go back?’

The interrogative piʔ or kəpiʔ ‘what’ can occur as subject or object. Frequently it functions as copula complement, in which case the copula ‘to be’ is usually omitted. It also occurs as nominal modifier. (46) dak ʔiːn pʰi.tʰiː piʔ 3h get ceremony what ‘What ceremony is it?’ (47) ʔiːn piʔ hɔːp plɔŋ get what eat cooked.rice ‘What do you have for your meal?’ (48) pʰoʔ kəpiʔ pʰuː dream what you ‘What did you dream (last night)?’

Also the other question words, cʰiːw ‘how many’, tʰɔː.ʔih ‘why’ or jaŋ.ʔih ‘how to’ occupy the position where the element they ask about would occur. The latter two are transparent compounds consisting of the interrogative marker and the verb ‘do’ and the Thai loan ‘manner, kind, type’, respectively. (49) ʔiːn cʰiːw tʰáŋ get how.many bucket ‘How many buckets are there?’ (50) ja̤ ːm tʰɔː.ʔih cry why ‘Why do you cry?’ (51) cʰḭːm.kuːk cḛːn jaŋ.ʔih owl come how ‘Owl, how do you come?’

618

Premsrirat and Rojanakul

4.2 Complex Sentences 4.2.1 Coordination, Chaining, and Subordination Clauses can be coordinated with a conjunction or a prosodic pause. E.g.: (52) cʰǎn ceːw wə̤t mɛʔ tɛ̀ː mɛʔ kɨj ʔih 1sg go meet grandmother but grandmother stay neg ‘I went to meet my grandmother but she was not there.’ (53) lūŋ tʰɔː sǔən || tʰɔm kɨj tɔŋ kʰilɛ uncle do garden aunt stay house disc ‘My uncle is planting garden and my aunt is staying at home.’

Two or more independent clauses can be juxtaposed without any linkers or conjunctions. They may be two main clauses or one of them may be a subordinate clause. The relationship between the clauses may imply a contrastive, conditional, causative, purposive or other sentence types. E.g.: (54) rɔ̤ːk mɔ̤ŋ lɛːk kɔ̂ ː le̤ːŋ tʰa̤ ːk wic || tʰo̤ ːŋ ʔiːn cʰṳːc mat egret com chicken tcl play water more rabbit get flesh all ‘The egret and the chicken eat only soup (but) the rabbit get all flesh.’ (55) tìt me̤ːˀw || tìt kamlɔːŋ || cʰap sàj kʰɔ̂ ŋ klàp tɔŋ catch fish catch eel seize put.in creel return house ‘(If they) can catch fishes or eels, (then) we take it home.’

There is a very high frequency of chained clauses found in Chong procedure texts, relating the sequential relationship of events or states without any linkers. An elliptical sentence is common in this sentence type. A deletion of both subject and object generally occurs when they are mentioned in the topic or the aforementioned statement. The sentence boundaries are indicated by intonation breaks. The following examples illustrate the use of chained clauses and sentences in a short connected text. (56) klam nom.cuk ʔoːc kʰoː cḛːn klam tʰa̤ ːk soak rice.noodle take rice come soak water ‘To make Thai rice noodle, firstly, take rice, and soak in the water.’ (57) lɛ.kɔ̂ ː pʰɔʔ tɛ̀ː pʰḛːw kɔ̂ ː ra̤ ːp then when since three tcl wash ‘Then, after three (days), take it to wash.’

Chong

619

(58) pʰḛːw pʰo̰ ːn kʰrɛ̰ːŋ pɨ̀əj kɔ̂ ː prɔːŋ three four night decompose tcl sieve ‘Leave it for three or four nights, when it decomposes, then sieve it.’ (59) prɔːŋ ʔoːc cḛːn jīː prɔːŋ ʔoːc tʰáp sàj wo̤ k sieve take come squash sieve take put.on put.in cloth ‘After sieve it, then take it to squash, and overlay by cloth.’ (60) pʰɔʔ dak pʰah kɔ̂ ː ʔoːc tʰáp sàj wo̤ k when 3h dry tcl take put.on put.in cloth ‘When it dries, then, take it and roll it up with cloth.’ (61) lɛːw.kɔ̂ ː ʔoːc cḛːn pân pân ʔɨt klōm then take come mold mold give round ‘Then, take it out and mold it round.’

A sequential sentence can express the word /sat/ ‘finish’ when one action is already done, then it may use the topic-comment linker kɔ̂ ː ‘then’ to conjoin the next clause. (62) tʰɔː hâːŋ sat pih sat || kɔ̂ ː tâŋ sǎːn.tʰīəmtāː do shooting.stand finish what finish tcl build temporary.shrine ‘After doing the shooting stand, we, then, build the temporary shrine.’ (63) || kɔ̂ː kʰɔːt mṵːc ʔɔːɲ tcl bind ghost keep ‘(They) bind the corpse (with the tree).’

4.2.2 Complement Clauses Complex sentences can consist of a matrix and one or more subordinate clauses. One possibility is to have a matrix verb combined with the complementizer wâː (borrowed from Thai wâː ‘say’) or lok, and followed by the complement clause, as in the following examples. The complement clause can be assertive or interrogative. The subject of the matrix clause may or may not be coreferent with the subject of the complement clause, and it may be omitted in the complement clause. (64) hɛːŋ bɔ̀ ːk wâː ca tʰɔː kaːn mōŋ.kʰōn ʔah.lɔː 1pl tell sub fut do work auspicious disc ‘We tell him we will have an auspicious ceremony.’

620

Premsrirat and Rojanakul

(65) tāː.tʰěːn sǎm.tʰɔ̄ ːŋ kʰūj lok dak pɛn mɔ̄.rə̤j kɛŋ sham.monk pn talk sub 3h be fortune.teller smart ‘A sham monk in Samthong temple boasts that he is an excellent fortune teller.’ (66) pʰɨ̂ən tʰǎːm lok ʔiːn piʔ wəj friend ask sub get what disc ‘Friends ask that what it happened.’

In natural speech, no subordinator may be present and the complement clause paratactically adjoined to the matrix clause. (67) kəwa̤ ːj dak pʰoʔ Ø ʔiːn cʰaː cʰǎn nán tiger 3h dream get eat 1sg disc ‘A tiger dreams it eats me.’ (68) wa̤ ːj tʰǎːm Ø wə̤t cʰaː panih tiger ask search eat where ‘A tiger asks where you will find the food today.’

Also verbs of perception and cognition can function as matrix verbs of complement clauses. (69) mɛʔ tʰa̤ ŋ cʰɔː coːk ceːw tih grandmother see dog run go dist ‘My grandmother sees a dog running out there.’ (70) dak tɔ̰ːŋ mṵːc taː-cʰɨə lɔŋ pʰri̤ː 3 fear ghost addr-pn lost forest ‘He is afraid that Mr. Chue gets lost in the forest.’

4.2.3 Adverbial Clauses Adverbial clauses can be linked by a conjunction. The conjunctions can express various semantic relationships, including cause, condition, purpose, etc. (71) pʰɔʔ cʰə̤n fàŋ || tʰo̤ ŋ kɔ̂ ː hac pʰluc ceːw tih when near bank rabbit tcl jump jump.rapidly go dist ‘When it nearly comes to the bank of the river, they rapidly jump out.’ (72) tʰâː me̤ː ʔuːɲ dak cḛːn || cʰǎn kɔ̂ː tɔ̂ ŋ ceːw ʔih if mother father 3 come 1sg tcl must go neg ‘If their parents come, I do not go with them.’

Chong

621

Clauses or sentences are frequently connected by the conjunctions lə̄ːj ‘then’ or lɛ́wkɔ̂ ː ‘then’, both loans from Thai, or the sentences are juxtaposed without overt coordination marker. The reading is depends on the context and can include sequential, causal, conditional, among others. (73) kʰeːn pʰəj hǐw cʰǎn lə̄ːj tʰɨm plɔŋ ʔɨt chidren 3 hungry 1sg cons cook rice give ‘Children are hungry, then; I cook for them.’ (74) mɛʔ ceːw wə̤t mɔ̌ ː mɔ̌ː nə̤t grandmother go see doctor doctor make.an.appointment ‘(My) grandmother goes to see the doctor as the doctor had made an appointment.’

4.2.4 Relative Clauses A relative clause occurs next to the head noun it modifies, with or without a relative marker. The relativizer tʰîː is a loan from Thai. The relativized function is usually subject or object. (75) nḛːw kʰɨŋ lɔːŋ piːŋ pɛn kʰeːn tāː-kʰîkìət child hold banana ripe be child addr-pn ‘The child who holds a ripe banana is Mr. Khikiet’s child.’ (76) nḛːm.ʔuːt ʔuːɲ kat prəʔoː pɛn nḛːm.ŋɔ̤ tree father cut yesterday be rambutan.tree ‘The tree that father cut down yesterday is a rambutan tree.’ (77) wo̤ k tʰîː can ʔɨt cḛːn prəː ʔiːn ʔih jɔʔ cloth rel pn give come use get neg disc ‘(I) cannot use the cloth Jan gave (me).’

4.2.5 Serialization Verb serialization is common in Southeast Asian languages. Two or more verbs may be used in a close-knit sequence without conjunctions in a single slot in a clause. Some of serial verbs have acquired idiomatic meaning in Chong and can be considered lexicalized. It is not always clear whether a string of verbs actually describes a single event or a sequence of events. (78) kənaːj kɔ̂ ː pʰi̤c tʰaʔ nḛːm.ʔuːt elephant tcl sleep crash tree V1 V2 ‘An elephant lay down and crashed a tree.’

622

Premsrirat and Rojanakul

(79) pʰəj hǎn cḛːn sûː si 3 turn come fight disc V2 V3 V1 ‘It (the snake) returned and encountered me.’ (80) pʰuː kɔ̂ ː lɔ̄ ːŋ rɨh ʔiːn ŋɛ̤ː dūː 2sg tcl try sing get listen look V1 V2 V3 V4 V5 ‘You should try to sing that song.’. (81) tʰɔː cʰaː kə̀ːt wə̤t kāmnə̀ːt ʔiːn tʰəna do eat born seek born get disc V1 V2 V3 V4 V5 V6 ‘(Wish) you make a good living.’

In some cases, an action verb may be reduplicated to describe what the actor continuously does for a while. (82) coːk coːk coːk ceːw paro̤ ːŋ run run run go farmhouse.grange ‘(He) keep(s) running to the farmhouse grange.’

It is noticeable that the verb hiːt ‘lost’ can appear in the final position of a clause or sentence, functioning as a directional or aspectual modifier rather than a main verb. (83) siː pʰəj ceːw patʰɛ̤ːw hiːt snake 3 go other.places lost ‘That snake already disappeared to another place.’

Two or more clauses can be merged into a single sentence with shared arguments. The first sentence commonly takes an action verb, which causes of the verb of the second clause. Frequently, the patient (object) of the first verb functions as subject of the second. (84) siː toːk cʰɨ̤m hoːc snake peck person die ‘The snake strikes someone dead.’

Chong

623

(85) pʰəj ʔoːc tʰo̤ ŋ pʰəj ceːw pat 3 take rabbit 3 go leave ‘He takes a rabbit away.’

4.3 Syntax and Pragmatics The structure of the Chong sentence is, to a certain extent, determined by pragmatics. They are sometimes built according to topic-comment structure. The focused or topical constituent of the sentence may be fronted. (86) ɲa̤ ː dak kɔ̂ ː sàj wild.yam 3 tcl put.in ‘A wild yam, they also put in.’ (87) plɔŋ hɛːŋ ʔoːc ce:w wic cooked.rice 1pl take go more ‘Rice, we bring more.’ (88) ʔaːw pʰro̤ ːŋ me̤ː ʔoːc ceːw həːj blouse white mother take go nsit ‘The white blouse, mother already took it.’

4.4 Noun Phrases Noun phrases have noun or noun-like elements as their head, which normally fill in the subject, object, or peripheral position. The nominal phrase may consist of a simple nominal phrase (noun phrase, pronoun phrase) or a nominal compound (additive, appositional, among others). A noun phrase consists of a noun as its head noun, and additionally slots for Quality, Possessor, Number, Unit, Demonstrative, Definiteness, and Location. Each slot may be filled by classes such as nouns, adjectives (quality verbs), prepositional phrases, numbers, classifiers, demonstrative pronouns, possessive expressions, class markers or an element such as a relative clause. The head, with can be a single noun or a compound, obligatorily appears in the first position; the modifiers obligatorily occur after the head. In some cases a classifier occurs between the head noun and the modifier. The relation of the modifying element to the head is usually determined by the semantics of the elements, and the modifying elements can themselves be complex, consisting of a modified and a modifier. Another possibility of complex noun modifiers are relative clauses (see 4.2.4).

624

Premsrirat and Rojanakul

(89) cʰɔ̤ːˀŋ tak human big ‘an adult’ (90) kəjaː tūə kic scorpion clf small ‘a small scorpion’ (91) tɔŋ heːn house ana ‘That house’ (92) tɔŋ láŋ tak sǐː ŋə̤j house clf big color red ‘the big red house’ (93) ʔaːw pʰro̤ ːŋ kʰɛ̌ːn pʰɔ̄ ːŋ blouse white sleeve swell ‘the white blouse with puffy sleeves’

Apart from that a common noun phrase expression in original Chong expression such as me̤ː—ʔuːɲ ‘mother and father’, for some people ʔuːɲ—me̤ː ‘father—mother’, which is influenced by Thai pʰɔ̂ ː-mɛ̂ː is used instead. Other examples are ‘siblings’ boːt—lɨŋ ‘younger.sibling—elder.sibling’ occurs alternatively with lɨŋ—boːt which is similar to Thai expression pʰîː— nɔ́ ːŋ ‘elder sibling—younger sibling’, mə̤t—ŋa̤ ːj ‘face of a person’ occurs alternatively with ŋa̤ i—mə̤t ‘face—eyes’ which is similar to Thai nâː— tāː ‘face—eyes’. 5

Word Classes

Chong words may consist of one free morpheme (a simple word) or a combination of two or more morphemes (a complex word). In complex words and compounds, the head determines the category of the whole word (or expression). Word classes in Chong include nominals (nouns and pronouns), demonstratives, verbs (including copulas, auxiliaries and adjective-like verbs), adverbs (intensifier, negator, temporal, spatial), numerals, classifiers, prepositions, conjunctions, and particles. These can be distinguished by their syntactic behavior and function.

625

Chong

5.1 Nouns Nominals in Chong include common nouns, proper nouns, measure nouns, and pronouns. 5.1.1 Common Nouns and Proper Nouns Common nouns and proper nouns usually refer to concrete or abstract entities and occur as arguments in clauses and complements of prepositions. Nouns can optionally be modified by quantifiers and demonstratives to express plurality and definiteness. In some cases, nouns obligatorily combine with a classifier. (94) ma̤ ʔ prɔ̄ ːj~prɔ̄ ːj mɔ̤ŋ rain drizzle~red com ‘The rain drizzles.’ (95) ŋiʔ jəːŋ sun high ‘The sun rises.’

In the following sentence, tʰa̤ ːk ‘water’ is a common noun and tɨj functions as a measure noun. (96) luːm tʰa̤ ːk cʰaː mo̤ːˀj tɨj request water eat one bowl ‘Give me a bowl of water, please.’

5.1.2 Pronouns The pronouns of Chong are either loans from Thai, as in the first and second person singular, or indigenous forms. All three persons distinguish singular and plural. The following table shows the personal pronouns in Chong. Polite

1st singular 1st plural 2nd singular 3rd

cʰǎn kɛː dak

Impolite

hɛːŋ

ʔiɲ pʰuː pʰəj

626

Premsrirat and Rojanakul

To indicate plurality, normally the Thai loan mùː ‘group’ is added before the noun or pronoun. Sometimes the word pʰûək (also a Thai loan meaning ‘group’) is used. The word hɛːŋ or mùː hɛːŋ are used to mean ‘we’. Personal pronouns can be modified by nɔ̤ːˀn ‘self’, while caŋ.nɔ̤ːˀn ‘oneself’ itself takes the place of a reflexive pronoun. Apart from the personal pronouns, a number of other words taking the place of pronouns occur in Chong. These include interrogative and indefinite pronouns, such as: kʰi ‘who’ mih ‘anyone’

5.1.3 Demonstrative There are three degrees of distance from the speaker in Chong demonstratives. They are proximal ʔan ‘this, here’, medial tʰən ‘that, there’ and distal tih ‘there (far away)’. (97) ʔan kəpiʔ wa prox what disc ‘What is this?’ (98) tɔŋ dak kɨj tōŋ tih paŋa̤ j house 3h stay loc dist far ‘His home is over there.’

The word heːn ‘that one’ is normally used to anaphorically refer to a point in time, of an action or an object mentioned, as in the following examples. It does not necessarily refer to a specific entity, but rather expresses a remoteness from the speech event. (99) wa̤ ːˀŋ ʔiːn seː kʰom he:n want get rattan clf ana ‘(He) needs that clump of rattan.’ (100) ʔi:n piʔ cʰaː ʔih na samǎj he:n get what eat neg disc period ana ‘There was not much food to eat in the old days.’

5.1.4 Quantifiers: Numerals and Measure Words Numerals and other quantifiers are used to express number and amount of nominals. Nouns usually require the use of a classifier when they are quantified,

627

Chong

though the classifier can be omitted in some cases. Classifiers may be borrowings from Khmer (nak), Thai tūə, lêm, pʰɨ̌ːn), or indigenous forms. The following are the cardinal numbers of Chong. mo̤ ːˀj ‘one’ pʰa̤ ːˀj ‘two’ pʰḛːw ‘three’ pʰo̰ ːn ‘four’ pʰram ‘five’

kətɔːŋ ‘six’ ra̤ ːj mo̤ ːˀj ‘eleven’ kənṳːj ‘seven’ ra̤ ːj pʰa̤ ːˀj ‘twelve’ kətiː ‘eight’ ra̤ ːj pʰḛːw ‘thirteen’ cʰa̤ ːj ‘nine’ ra̤ ːj ‘ten’

Other quantifiers include the following, some obvious loans from Thai. mat ‘all’ (Thai?) tʰáŋ ‘inclusive, all of’ (Thai) tʰúk ‘every’ (Thai) kʰrɨ̂ŋ ‘half’ (Thai) tɛm ‘full’ (Thai) lɔː ‘many’ ʔuːc ‘few’ nə̤c ‘some’ wic ‘more’

The composition of numerals follows the usual Southeast Asian pattern, with lower orders before higher orders indicating multiples, and lower orders following higher orders addition. The ordinal numbers are regularly formed by adding the Thai loan tʰiː in front of the cardinal numeral. The order within the numeral phrase is regularly n-num-clf. If no classifier is present, the numeral may either precede or follow the noun. Classifiers in common use include the following. clf Used with nak people tūə animals lêm a candle, pen, knife or book, or bar-like niʔ a morsel of food pəŋ a flat shaped, paper-like shape pʰɨ̌ːn a broad flat utensils and clothes cʰak seeds and a seed-like things dɔːŋ instance of action

628

Premsrirat and Rojanakul

Measure words take the place of a classifier, that is, they occur after the numeral or quantifier. Frequently, but not always, measure words occur without overt head noun. Common measure words include the following, many of which are loans from Thai. kʰɨ̂ːp a unit of length equal to an estimated distance from the tip of the thumb to the tip of the index finger raːt a unit of length equal to two meters haːt a unit of length equal to the estimated distance from the elbow to the extremity kʰìːt a unit of weight equal to 100 grams kìlōː kilogram méːt metre kʰuːc a piece of a firewood or things kluh a chunk of fish kʰûː a pair of persons or things (101) me̤ːˀw mo̤ ːˀj tūə pʰa̰ ːˀj.seː baːt fish one clf twenty baht ‘A fish costs twenty baht.’ (102) duːŋ mo̤ ːˀj kʰûː coconut one pair ‘a pair of coconuts’ (103) jaːŋ ʔiːn pʰɨ̂ən lɔː turtle get friend much ‘The turtle has a lot of friends.’ (104) kəcʰɨ̤m cʰɔ̤ːˀŋ suk tʰúk tɔŋ person human village every house ‘all villagers’

The following examples show a numeral without classifier which may occur in pre- or postnominal position. (105) jāːŋ.lóp pʰa̤ːˀj eraser two ‘two erasers’

Chong

629

(106) pʰa̤ːˀj cʰɔː two dog ‘two dogs’

Numerals can be modified by elements such as the approximative sàk/sák ‘just, about’, kɨ̀əp ‘almost’ or the additional ʔìːk ‘more’, all loans from Thai, which appear before the cardinal number as shown in examples and. (107) luːm tʰa̤ ːk cʰaː sàk mo̤ːˀj tɨj request water eat aprx one bowl ‘Give me a bowl of drinking water.’ (108) hǒːn ʔi ̀ːk mo̤ːˀj nak astrologer more one clf ‘another astrologer’

5.1.5 Names and Term of Address People in the family are normally addressed by kinship terms or only by names for younger people. The kinship terms are also used to address non-kins by estimating the relative age. Common kinship terms that are frequently used are the following. tāː maternal grandfather (Thai) ja̤ ːj maternal grandmother (Thai) tʰɔm paternal / maternal elder siblings ʔɔʔ father (informal) boːt younger siblings lɨŋ elder siblings kʰeːn child kʰen.lɛːŋ daughter coː grandchild cʰuː great grandson, great granddaughter kləː friend

Apart from kinship terms, some people are addressed by professional titles such as ʔaːcaːn ‘teacher, professor’ (Thai) and mɔ̌ ː ‘doctor’ (Thai). A Chinese loan céː ‘elder sister’ is used for addressing a non-Chong elderly woman, as in the past a number of Chinese immigrants settled in this area.

630

Premsrirat and Rojanakul

5.2 Verbs Chong has different classes of verbs which can be distinguished based on their syntactic behavior. It should be noted that the distinction between transitive and intransitive verbs is syntactically not always clear-cut, as many, but not all, verbs combine with a restricted set of direct objects. Verbs usually function as main predicative element in a clause. Unlike nouns, verbs can be directly negated by the pre- or postverbal negator ʔih or the preverbal negator mâj, the latter a borrowing from Thai. Descriptive (or quality) verbs are a subclass of verbs that are often translation-equivalents of adjectives. Their syntactic behavior is similar to that of other verbs, and they can occur in attributive as well as predicative function. Descriptive verbs are frequently reduplicated when they function as nominal or verbal modifiers. Unlike other verbs, descriptive verbs are also used in comparative constructions, which employ the comparative marker kwàː, a loan from Thai kwàː ‘go beyond, surpass, be more than’. (109) pʰəj cʰālàːt kwàː dak na 3 clever compar 3h disc ‘It is smarter than others.’

There is no grammaticalized marker for the superlative, which is rather expressed by a degree marker, as in the following example. (110) cʰɨ̤m tʰən mɔʔ cat person medl beautiful very ‘That girl is very beautiful.’

5.2.1 Modals, Directionals, and Aspectuals Chong has a large number of pre- and postverbal auxiliaries and secondary verbs expressing different notions such as modality, directionality and aspect. The main preverbal modals are the following, which include many loans from Thai. tɔ̂ ŋ wa̤ ːˀŋ klâː cā

‘must’ (Thai) ‘want’ ‘dare’ (Thai) ‘will’ (Thai)

kʰə̄ːj ‘used to’ (Thai) ʔiːn ‘to be able to’ (get, have, exist) màŋ ‘might’

Chong

631

In Chong, a main verb of motion clause is frequently followed by a directional verb such as ceːw ‘go’, cḛːn ‘come’ or klàp ‘return’, which indicates a direction of action or moving. (111) nḛːw~nḛːw coːk cḛːn càːk tɔŋ child~red run come from house ‘The children run from their home.’ (112) mùː pʰəj də̄ːn klàp tɔŋ saː pʰa̤ ːˀj nak PL 3 walk return house rec two clf ‘Two of them walk back to their house together.’

Directionals also occur with transitive verbs. In this case their position is after the object of the main verb. (113) dak nām siː cḛːn tʰɨ̌ŋ tɔŋ 3h lead snake come arrive house ‘He brings a snake to his home.’

The object and the destination can both be omitted when the object is referred to in the preceding clause or broader context and the destination is uncertain or backgrounded. (114) mān hɔːt ceːw 3.nhum drag go ‘It drags (him) away.’

The clause final particle həːj indicates that a situation change has occurred and the new situation has fully come about. It is glossed here as nsit (‘new situation’). (115) cʰǎn kɔ̂ ː tʰɔː nɔ̤ːˀn rɨəj həːj 1sg tcl do self continuously nsit ‘I often do it by myself now.’ (116) ma̰ ːj kròːt cʰǎn həːj proh angry 1sg nsit ‘Don’t be angry at me anymore, please.’

632

Premsrirat and Rojanakul

5.2.2 Adverbs Adverbs modify clauses, sentences, or verbs and may themselves be modified for degree, limitation, comparison, emphasis, or negation. (117) cʰǎn tʰúk lɨə̌ːkə̄ːn həːj 1sg suffer extremely nsit ‘I’m suffering awfully now.’

Onomatopoetic expressions or ideophones are frequently used as adverbs. They can imitate a sound or other impression. (118) pʰəj hac proːm ləːj 3 jump ideo disc ‘It jumps to the water, prom!’ (119) pʰɔʔ kəwa̤ ːj huːt kʰrɨːt when tiger sip ideo ‘When the tiger sips, it goes khrut!’

Sentential adverbial expressions can be temporal or locative, as seen in the following examples. (120) wə̤n.ʔan cʰǎn cə̄ʔ siː cōŋʔāːŋ today 1sg meet snake king.cobra ‘Today I met a King cobra.’ (121) cʰǎn ceːw tʰúk ʔaːw tʰúk ʔaːw 1sg go every day every day ‘I go (to school) every day.’ (122) tʰi.ʔan kɔ̂ː cʰɔ̤ːˀŋ suk kɔ̂ ː ceːw wə̤t tāː.tʰěːn then tcl huma n village tcl go seek sham.monk ‘Then, the villagers went to meet the sham monk.’

The similative markers mɨ̌ən ‘same’ or jàŋ ‘like, as’, both Thai loans, can introduce an adverbial expression. (123) pʰəj cḛːn mɨ̌ən mɛ̄ːw niːlɛ 3 walk same cat disc ‘It walks like a cat.’

Chong

633

(124) nḛːw~nḛːw tɔ̂ ŋ cʰaː jàŋ cʰḭːm.kəcìp child~red must eat like long.tailed.tailorbird ‘Children should eat like a long-tailed tailor bird.’

5.3 Prepositions Prepositions in Chong often transparently originate in (relator) nouns or (directional) verbs. They are used to express semantic relations between entities, including spatial and temporal, and usually form adverbial phrases. In many cases these prepositional words are loans from Thai. (125) loːc kɨj kʰâːŋ tɔŋ barking.deer stay side house ‘A barking deer is beside the house.’ (126) cʰǎn ceːw tāːm wa̤ ːj noːt 1sg go follow field pn ‘I walked along the Not field’

A few prepositions in Chong don’t have obvious lexical sources. Among these are mɔ̤ŋ ‘with, together with’ indicating the instrument and other oblique roles, and also functioning as adverbial ‘also, together’, and reʔ and nāj, both meaning ‘in’, the latter a loan from Thai. Another form, kìəw.kāp ‘about’ is another loan from Thai, where the first element is a verb meaning ‘be concerned, be hooked up’ and the second a preposition meaning ‘with’ of unknown lexical origin. (127) hɛːŋ kɔ̂ ː ceːw mɔ̤ŋ ro̤ t dak 1pl tcl go com car 3h ‘So we came together by his car.’ (128) dak cḛːn ɲa̤ ːj mɔ̤ŋ kʰɨ̤n 3h come tell com wife ‘He came to tell his wife.’ (129) siː tak reʔ kʰom seː ja̤ ːj snake big inside clump rattan big ‘A big snake is in a big clump of rattan.’

The noun kʰɔ̌ ːŋ ‘thing’ (from Thai) may be used to indicate possessive relationships. It can also be used as a lexical noun.

634

Premsrirat and Rojanakul

(130) pān.jāː kʰɔ̌ːŋ kətʰo̤ ŋ intelligence poss rabbit ‘the intelligence of the rabbit’

5.4 Conjunctions Chong has a number of conjunctions linking clauses. Some conjunctions can also function as prepositions, such as pʰrɔ́ ʔ ‘because’, linking phrases rather than clauses. Clause linkers are the subordinating tʰâː ‘if’, pʰrɔ́ ʔ or pʰɔʔ ‘because’, tâŋ.tɛ̀ː ‘since’, cōn ‘until’, and tɔ̄ ːn ‘when’, all obvious loans from Thai. Coordinating conjunctions are tɛ̀ː.wâː ‘but’, rɨ̌ː ‘or’, and lə̄ːj ‘so, therefore’, also of Thai origin. 5.5 Final Particles Chong is very rich in sentence particles expressing a wide range of meanings. Many of these particles are original Chong, while others are Thai loans. Original Chong particles include the following, the exact functions and meanings of which are not clear. They are assertive or affirmative particles, presumably with varying degrees of emphasis, certainty and speaker involvement: jɔʔ, jaŋ.kʰi, kʰi.lɛ, ʔa.lɔː, həːp, ha.kɔː, cə.kʰi, hɔʔ. (131) cʰǎn wǎj ʔih jɔʔ 1sg able neg disc ‘I’m not able [to do this].’ (132) kaːn taː-lan kʰimaŋ work addr-pn disc ‘It might be Mr. Lan’s work.’ (133) tʰâː tʰi̤w dak pʰa̤ ːˀj lōː kɔ̂ ː pʰram.seː nu if buy 3h two kilogram tcl fifty disc ‘If I buy them, it costs twenty baht a kilo, right.’

In addition, the following Thai sentence particles are widely used in Chong. lɛ nə si nɛ ləːj

‘affirmative particle’ ‘interrogative particle’ (yes/no question) ‘imperative particle’ (implies a mild command) ‘emphatic assertive particle’ ‘emphatic particle’

635

Chong tʰə tʰəːt maŋ

‘persuasive particle’ (implies a mild command) ‘persuasive particle’ (implies a mild command) ‘suggestive particle’ (implies an uncertain answer)

5.6 Negation The negator ʔih ‘not’ may occur either in preverbal or postverbal position, or it may appear in both positions in the same clause. (134) pʰəj ʔih hoːc ʔih 3 neg die neg ‘It does not die.’ (135) cʰǎn prəː ʔih jɔʔ 1sg use neg disc ‘I do not use it.’

5.7 Vocatives Vocatives refer to a personal name, often combined with a kinship term, or a pronoun, put before or after the sentence. They occur independently before or after a sentence. They are used for starting the conversation or addressing a person. Sometimes they co-occur with an exclamation particle such as ʔəj ‘hey’. (136) lūŋ-raːn ceːw nih sɨ̌ə addr-pn go where pn ‘Where does Uncle Ran go, Sueʔ’ (137) ʔɔʔ ɲa̤ ːj nítʰāːn sák mo̤ ːˀj rɨ̂əŋ si grandfather tell folktale aprx one clf disc ‘Grandfather, could you tell a folktale (for me)?’ (138) kɛː ʔə́j wə̤n.ʔan cə̄ʔ siː cōŋ.ʔāːŋ lɛ 2 exclam today meet snake king.cobra disc ‘You, (I) met a King cobra today.’

5.8 Exclamation Exclamations are sentence particles expressing the speaker’s mood, feeling, and attitude towards a situation or event. They usually occur at the initial position of a sentence.

636

Premsrirat and Rojanakul

(139) ʔâw cḛːn jaŋ.ʔan la surp come how disc ‘Eh! How did you come?’

In the following two examples, the speaker’s attitude is expressed not only by the sentence initial exclamation particles, but also by the choice of the adverbials, in both cases loans from Thai. The expression kwàː.cà ‘until (the time comes to)’ implies a long time before an expected or hoped for event occurs, while dīː~dīː ‘well, happily’ expresses disappointment that a pleasant state of affairs has been interrupted. (140) ʔóːj kwàː.cà cak ʔiːn saː exclam until shoot get rec ‘Ouch! At least we could shoot (it).’ (141) mɛ̌ː cʰǎn kam.láŋ pʰoʔ dīː~dīː exclam 1sg prog dream well~red ‘Ah! I was just having a good dream (and now you woke me up)!’

6

Semantics Fields

Even though Chong is heavily influenced by Thai, it still demonstrates the Austroasiatic characteristic of fine-grained distinctions in some fields of the lexicon. Some important semantic fields are presented here. There are many words for ‘rice’ as in haːj haːj kakɔˀ haːj baːj haːj buːɲ haːj kənoːˀj krɔ̰ːk haːj kəkʰoː kʰó (kə)miːp kʰoː .pʰoːj plɔŋ kɔ̰ːc

‘paddy rice’ ‘unhusked rice’ ‘hill rice’ ‘pressed new rice’ ‘millet’ ‘ear of paddy’ ‘husked rice’ ‘glutinous rice’ ‘unglutinous rice’ ‘cooked rice’ ‘rice—gruel’

Apart from that special verbs are used for ‘to eat rice’ hɔːp plɔŋ and ‘to cook rice’, thɨm plɔŋ whereas to eat other kinds of food such as soup, noodles, fruit,

Chong

637

water, including betel chewing is cʰaː, as in cʰaː lɔːŋ ‘eat banana’, cʰaː tʰaːk ‘drink water’ and cʰaː màːk ‘chew-betel nut’. Kinship terms are widely used to address and refer to relative and nonrelatives. Through some words are borrowed from Thai, there still are many terms of Austroasiatic origin. pùː ‘paternal grandfather’ (Thai) ja̤ ː ‘paternal grandmother’ (Thai) tāː ‘maternal grandfather’ (Thai) ja̤ ːj ‘maternal grandmother’ (Thai) kəmraː ‘father/ mother’s younger siblings’ tʰɔm ‘paternal / maternal elder siblings’ ʔuːɲ ‘father’ ʔɔʔ ‘father’ (informal) ʔuːɲ li̤əŋ ‘step-father’ ʔuːɲ jaː mɔːk ‘father-in-law’ me̤ː ‘mother’ kəkʰɨn ‘wife’ kʰɨn tʰɛ̤ːw ‘other wife’ kʰɨn pʰla̤ ː ‘new wife’ kʰɨn cʰo̰ j ‘minor wife’ kʰɨn kʰənḛːm ‘first wife’ kəlɔ̤ːŋ ‘husband’ boːt ‘younger siblings’ lɨŋ ‘elder siblings’ lɨŋ kəlɛ̤ːŋ ‘older sister’ lɨŋ kəmlɔh ‘older brother’ lɨŋ kəhaːj ‘elder sibling of wife/husband’ lɨŋ pəsaː ‘brother/ sister-in-law’ kʰeːn ‘child’ kʰeːn ca̤ mkʰɨ̤n / kʰeːn lɛ̤ːŋ ‘daughter’ kʰeːn ca̤ mlɔ̤ːŋ ‘son’ kʰeːn huə.piː ‘first child’ (Chong + Thai) kʰeːn kədaːj ‘middle child’ kʰeːn pʰa̤ w ‘the last child’ kʰeːn pʰloh ‘twins’ kʰeːn kəma̤ ːŋ ‘orphan’ kʰeːn pəsaː ‘son-in-law’ kʰeːn boːt / kʰeːn lɨŋ ‘cousin’ kʰeːn cʰuː ‘descendants’ cʰuː ‘niece / nephew’

638

Premsrirat and Rojanakul

coː coː pəsaː coː ca̤ mlɔ̤ːŋ pəsaː

7

‘grandchild’ ‘grandchild’s husband’ ‘grandson’ ‘son/daughter-in-law’

Glossed Text

Fragrant Agar Wood is an oral story describing the Chong people’s everyday live as forest goods gatherers 40 years ago. This story was narrated in 2011 by Mr. Chian Phanphai, 73 years old, for use in a Chong writing system guidebook by the Royal Institute, Thailand. ʔuːt mahɔːm2 wood agar.wood kəmraː -cʰom dak pɛn cʰɨ̤m wə̤t ʔuːt mahɔːm tɔːk. dak na ʔoːc addr-pn 3h be person search wood agar.wood sell 3 fut take ceːw tɔːk pʰɔ̤ʔ ʔiːn prak tʰi̤w kəpiʔ tʰi̤w luk cʰaː mo̤ ːˀj nɨ̤m~nɨ̤m. go sell when get money buy what buy salt eat one year~red kəmraː -cʰom dak wə̤t ʔuːt mahɔːm ʔih kʰah cʰiw dɔːŋ tɔ̀ ː addr-pn 3h search wood agar.wood neg know how.many times to cʰiw dɔːŋ. wə̤n mo̤ ːˀj rādūː.pʰraŋ dak kraːk sàbīaŋ ceːw how.many times day one dry.season 3h carry food.stuffs go caŋnɔ̤ːˀn~ caŋnɔ̤ːˀn rə̤j ʔuːt mahɔːm ceːw rɨ̤əj~rɨ̤əj. pʰɔ̤ʔ tʰa̤ ŋ self~red look wood agar.wood go continuously~red when see ʔuːt mahɔːm kəmraː -cʰom dak kɔ̂ ː wa:ŋ sabiaŋ ʔɔːɲ. həːj dak wood agar.wood addr-pn 3h tcl put.down food.stuffs keep nsit 3h

2 ʔuːt mahɔːm in Chong or krìtsanǎː tree in Thai is a kind of large evergreen tree of the Aquilaria Crassna species, found mostly throughout Southeast Asia. The core of Agar wood has been used for medicinal products, perfume and incense, especially for exporting to countries in the Middle East.

639

Chong

kɔ̂ ː ʔoːc kətaːn bɔːŋ nḛːm mahɔːm rə̤j. tɛ̀ː kɔ̂ ː ʔih ʔiːn kɛ̀ːn tcl take axe pierce tree agar.wood look but tcl neg get core ʔih lə̤j kəmraː-cʰom dak kɔː.lə̤j ce:w tɔː wic. dak ceːw paːk neg disc addr-pn 3h cons go cont more 3h go go.up nɔ̤ːŋ cuh hup tʰi̤ŋ pʰri̤ː lɨ́k ceːw riə̤j~riə̤j. mountain go.down valley enter forest deep go continuously~red kəmraː-cʰom dak ceːw ka̰ ːc ŋa̤ ːj jaŋ.ʔih dak kɔ̂ ː ʔih kʰah tūənɔ̤ʔn addr-pn 3h go near far how 3h tcl neg know self kətuːc kəmraːc-cʰoːm dak kɔ̂ ː ʔɨt kʰə̰ːm ʔuːt mahɔːm. həːj kɔ̂ ː nose addr-pn 3h tcl give fragrant wood agar.wood nsit tcl tʰ̤aŋ ʔuːt mahɔːm nḛːm tak.poː jə̤.jɛ̤ ceːw mat kəmraː-cʰom dak see wood agar.wood tree big plentiful go all addr-pn 3h kɔ̂ ː ʔoːc kətaːn bɔːŋ ʔuːt mahɔːm rə̤j wic. həːj dak kɔ̂ ː tcl take axe pierce wood agar.wood look more nsit 3h tcl tʰa̤ ŋ kɛ̀ːn mahɔːm. wə̤n tʰə̤n kəmraː-cʰom dak lə̤j ʔiːn see core agar.wood. day medl addr-pn 3h cons get mahɔːm kraːk klàp tɔŋ jə̤ kwàː agar.wood carry return house much compar

Free translation

tʰúk ʔaːw. every day

Fragrant Agar wood ‘Uncle Chom earns his living by searching for the fragrant Agar wood. He takes it to sell and buy enough preserved shrimp paste and chili to eat for several years. Uncle Chom goes looking for the fragrant Agar wood most of the time. One day in summer time, he takes some food and goes deep into the forest looking for the fragrant Agar tree. When he finds the wood, he puts down the food and, with a Pulu axe, digs into the trunk of the tree but is unable find the core of the fragrant wood. He goes up and down hills, into the deep forest. He does not know how far he has gone. Then he smells the fragrance of the Agar wood. He sees several big Agar trees. He uses his Pulu axe to dig into the trunk of the tree and find the fragrant core. He then goes home with a lot more fragrant Agar wood than before.’

640

Premsrirat and Rojanakul

Bibliography Charoentham, Sirikarn. 1987. Chong language, Thung Ta-In village, Chanthaburi province. M.A. thesis, Silapakorn University. Chou, Ta-Kuan. 1987. The customs of Cambodia (translated by J. Gilman d’Arcy Paul). Bangkok: Siam Society. Edmondson, Jerold A. 1996. Voice qualities and inverse filtering in Chong. Mon-Khmer Studies 26: 107–116. Kenikanon, Kannika et al. 1979. Chong. Bangkok: Bannakij. [in Thai] L-Thongkum, Theraphan. 1991. An instrumental study of Chong registers. In J.H.C.S. Davidson (ed.), Austroasiatic Languages, Essays in honour of H. L. Shorto. School of Oriental and African Studies, University of London, 141–160. Luuamkhan, Saifon. 1991. Chong language, Nam Khum 1 village, Khlong Phlu subdistrict, Makham district, Chanthaburi province. M.A. thesis, Silapakorn University. Martin, Marie A. 1974. Remarques générales sur les dialectes Pear. Asie du Sud-Est et Monde Insulindien 5.1: 25–37. Pallegoix, Jean-Baptiste. 1853. Mémoire sur la mission de Siam. Beaune: [s.n.]. Phaetphithak, Kunwadee. 1996. A description of the Chong language in Klong Seng village, Borrai district, Trat province. M.A. thesis, Silapakorn University. Premsrirat, Suwilai et al. 2007. Prasopkaan kaanfunfoo phaasaa Chong nai prathet Thai koranee phaasaa Chong changwat Chanthaburi. [Experiences of Chong language revitalization in Thailand—the case of Chong in Chanthaburi Province.] Chiang Mai: The Thailand Research Fund Regional. Seidenfaden, Erik. 1967. The Thai peoples. Bangkok: Siam Society. Suphannaphaibun, Surekha. 1982. Rabop Siang nai Bhaasaa Chong Muu Baan Takhian Thong, Tambon Takhian Thong, Amphur Makham. M.A. thesis, SinakharintharaWirote University. Weber, Karl E. 1976. Ethnographic notes of the Chong population in Chanthaburi province in southeast Thailand. Thailand Research Project Diffusion of Innovation Coast Hinterland—Continuum Preliminary. Office of Senior Research Fellow of the South Asia Insitute of Heidelberg University of German Culture.

section 4 Khmeric

․․

chapter 6

Old Khmer Paul Sidwell 1 1 Background Old Khmer, a living language for eight centuries, is attested from the early 7th century A.D. to the first decades of the 15th century. It is the direct ancestor of all Modern Khmer (Cambodian) dialects, which together form the Khmeric branch of Austroasiatic. The language is recorded in more than a thousand inscriptions—on stelæ, walls, lintels, and some bronze objects— found in Cambodia, Thailand, Vietnam and Laos. A good many inscriptions are bilingual, consisting of Sanskrit texts followed by complementary texts in Old Khmer. Most of the inscriptions are legal documents—royal directives, charters, deeds, conveyances—couched in legalistic form and employing a chancery idiom cultivated by an educated élite. Because of their limited concerns (texts consist largely of declarative statements, lists of property, slave names etc.) the texts afford only a partial representation of the language; for example no 2nd person pronouns are attested. Also, many stelæ are weathered and/or broken, rendering them partially unreadable. As a whole, these factors restrict the available data and scope of discussion. The inscriptions show two distinct historical phases: pre-Angkorian (A.D. 612 to 802), and Angkorian (A.D. 802 to the abandonment of Angkor in 1431–2). Strikingly the degree of standardization apparent in the earliest pre-Angkorian inscriptions indicates that the language had been written well before 612. In this sketch no attempt is made to separate pre-Angkorian and Angkorian examples, as the linguistic differences are quite minor, arguably no more significant than may have been found with the language at any particular time, at least it is not clear the extent to which the apparent differences are local orthographic variants or are really phonological. The clearest difference is in the diphthongs, as Jacob (1993b: 358) explains: 1 This chapter is mostly based on an earlier publication by this author together with Philip Jenner (Jenner & Sidwell 2010). The examples have been glossed throughout with the help of the Dictionary of Angkorian Khmer (Jenner 2009a) and the Dictionary of Pre-Angkorian Khmer (Jenner 2009b).

© koninklijke brill nv, leiden, ��5 | doi ��.��63/9789004283572_�12

644

Sidwell

The only fundamental difference between the Angkorian dialect and the pre-Angkorian lies in the occurrence of two extra diphthongs in the vowel system. These are: (i) a vowel written ya or yya, but never yā (being thus distinguished from īə which is written ya, yya, and yā in the Angkor period). (ii) a vowel written va, but never vā or vo (being thus distinguished from ūə which is written va, vā, and vo in the Angkor period, as in pre-Angkor). Ferlus (1992) reconstructs a pre(-Old)-Khmer contrast /ie – ia/ and /uo – ua/, partly justified by these orthographic indications. However, writing systems characteristically lag in the representation of sound changes, so it is not at all clear when this apparent phonological contrast ceased to be active in the vernacular language. Our approach here is to characterize the phonology—with a high degree of confidence—as it was in the latter Old Khmer period, transcribing the diphthongs ya, va and their varients as /iːə/ and /uːə/. Syntactic analysis of Old Khmer shows that the default ordering of constituents is subject + verb + optional direct object + optional indirect object, with modifiers following their head, rather typical among Austroasiatic languages. Phonologically it is significantly different from its modern descendants, having not restructured vowels or devoiced consonants (see the chapter on Khmer in this volume for comparison). It is apparent that from the first pre-Angkorian inscriptions to the last of the Angkorian the underlying grammar is relatively unchanged, not particularly surprising for a chancery idiom. Such developments in exposition as can be recognized have little or nothing to do with syntax but relate to subject-matter, accessions to the lexicon, and stylistic devices as authors experiment with the resources of the language. Finally, this sketch ends with a partial bibliography, including internet resources which are now greatly improving one’s capacity to work with this language which is otherwise fixed in stone and metal inscriptions. 1.1 Writing System Old Khmer is written in a system brought from South India, and is the direct ancestor of the writing system used for modern Khmer, and for that reason its symbols and organization are not reviewed here. It may be of interest to note that in 2002 UNESCO in Cambodia published a four volume set of transcriptions produced with their own purpose made Old Khmer fonts, one pre-Angkorian and five Angkorian variants. For a complete tabling see UNESCO 2002. And for more on Old Khmer transcription orthography, see the Introduction to

Old Khmer

645

Jenner & Pou 1980–81). For this sketch, however, we utilize exclusively roman transcription. The writing system may be termed a conservative one in that, like Mon (with which it had extensive contact.) it has preserved the conjunct consonant symbols by which ligatures (digraphs) are formed for the representation of consonant clusters. This contrasts with Thai and Lao, for which the writing system was adapted several hundred years later. To the epigraphist whose task it is to read the inscriptions as they appear on stone the inscriptions offer a range of problems. Many are badly weathered or otherwise damaged, resulting in frequent lacunæ in the texts, while a certain proportion was engraved by careless lapicides. Under the best of conditions, the student’s understanding of a text is often attended by uncertainties. These uncertainties will not be touched upon in what follows, our concern being to say what needs to be said to describe the mechanics of the language. The fit of the Indic writing system to Old Khmer as a once-living language is generally good but no better than the fit of the roman alphabet to modern English or French, and thus can offer no more than an imperfect representation of the spoken words. Idiosyncrasies of the orthography (discussed further in the introductions to Jenner’s (2009) pre-Angkorian and Angkorian dictionaries) include: – the doubling of consonant symbols following r (’ācāryya for Sanskrit ācārya); – the replacement of b by v (vuddha for buddha); – vacillation in presyllables between anusvāra (-ṃ) and the four nasal consonants (ṅ, ñ, n, m); – the reinforcement of visarga (-ḥ) by final h (vraḥh ~ vraḥha for vraḥ); – indecision in the representation of consonant clusters (kryal ~ karyyal ‘crane’); – the doubling of certain final consonant symbols (prasapp for prasap) for a motive as yet unclear; – and, in Pre-Angkorian as well as Angkorian, vacillation in representing the prior member of ligatures by an unaspirated or an aspirated stop symbol (kvas ~ khvas). Of special interest are a number of loans from Sanskrit which show final -e where final -a would be expected (kalpe for kalpa). Another group comprises forms with initial ’/ʔ/, difficult to explain by recourse to solely internal evidence, exemplified by ’nak ~ ’anak /nak/ ‘person’ and ’seḥ ~ aseḥ ~ seḥ /seh/ ‘horse’. The correspondence of the vowel symbols to their phonological actuality is poor. Ambiguity is noted in the case of written a, which may represent /ɔː, ɔ, aː,

646

Sidwell

a/, written ā /aː, a/, written o /oː, ɔː, ɔ/, and above all written e and ē /eː, e, ɛː, ɛ, ɤː, ɤ/. The two diphthongs of Angkorian are by post-initial ya /iːə/ and va /uːə/, e.g.: ’yat /ʔiːət/ ‘to be lacking’, chkvat /cʰkuːət/ ‘to be crazed’. On the other hand, the correspondence of the consonant symbols to their phonological counterparts is generally close. Ambiguity is pretty much confined to p and t before a vowel, where it is impossible to know whether these were realized as [p] and [t], or as implosive [ɓ] and [ɗ]. According to his analysis of loan word phonology, Ferlus (1992: 83) dates the glottalization of prevocalic labial and dental stops to the 9th and 10th centuries, corresponding to the early Angkorian period. Consequently, we treat prevocalic p, t as indicating implosive [ɓ] and [ɗ], which also follows the tendency apparent in the modern language. Final written s is regularly rendered /h/, consistent with modern Khmer, as we have no firm idea when the two fricatives in this position merged (the contrast remains robustly maintained in most cognate languages). The ten aspirate consonant symbols transliterated kh, gh; ch, jh; ṭh, ḍh; th, dh; ph, bh are used before a vowel in all stages of Khmer. Care must be exercised in interpreting these same aspirates when they occur as the prior member of clusters. In this case the aspiration, usually shown in Angkorian forms, is subphonemic; Pre-Angkorian cke contrasts with Angkorian chke ‘dog’ only orthographically, both representing /cʰkɛː/. 2 Phonetics/Phonology On the basis of the written evidence we present in this section a reconstructed provisional phonology of late Angkorian Khmer, presented in a broad phonological transcription, in standard IPA symbolization, probably very close to a strictly phonemic transcription. We can be fairly confident that the phonology is well understood; written Khmer has been used in comparative reconstruction for over a century already (e.g. Schmidt 1905 to Shorto 2006), the typology of consonant and vowel restructuring that occurred in various Austroasiatic languages during the previous millennium is well studied (e.g. Haudricourt 1965, Ferlus 1979, Huffman 1985) and there is now a large and growing body of Khmer epigraphic analysis, especially since Cœdès (1953–64). Following this discussion of phonetics and phonology, all textual examples are given in the roman transliteration (according to traditional Indological conventions), reflecting the actual spellings in the sources; and care should be taken to recognize that spellings are quite variable.

Old Khmer

647

2.1 Word/Syllable Structure The structure of the vernacular word is similar to modern Khmer and many other Austroasiatic languages, having monosyllabic, sesquisyllabic (subdissyllables in Jenner & Sidwell 2010) and disyllabic forms. The monosyllable has the canonical form CV(C). In open syllables V is necessarily long; in closed syllables it may be long or short. This monosyllable may function as a full word and the base of derived forms. Sesquisyllables have the canonical form CCV(C). On the phonetic level these range from monosyllables with initial clusters (thmo ~ thma /tʰmɔː/ ‘stone’) to disyllables with juncture vowels (radeḥ ~ rddeḥ /r ədeh ~ rɔdeh/ ‘cart’), when the juncture may take on timbre. Sesquisyllables may also function as secondary bases of derivatives. The disyllable, invariably iambic, consists of a stressed monosyllable or sesquisyllables plus an unstressed presyllable of the shapes Crə- and Cəɴ-. In these formulæ C continues to represent any allowable consonant. This is followed in the first case by /-rə/, in the second case by /-əɴ/, in which ɴ represents a nasal frequently but not always assimilating to the initial of the main syllable. Examples: (1) travāṅ /trəbaŋ/ ‘reservoir’ cramuḥ /crəmʊh/ ‘nose’ prasir /prəsiːr/ ‘writing’ caṅvāt /cəŋwat/ ‘delimited terrain’ kañjuḥ /kəɲɟʊh/ ‘dung’ randaḥ /rəndah/ ‘lightning’ paṃroḥ /ɓəmrɔh/ ‘trash’

The above are the characteristic word forms of vernacular Old Khmer. Other words, relatively few in number, show these same forms grouped into compounds, which may show alliteration or are derived by reduplication. Examples: (2)

slā mlū ‘arecanut [and] betel leaf’ sruk sre ‘villages [and] ricefields’ vagam vagoṃ ‘beads, rosary’ hiṅ huṅ ‘bullfrog’

The structure of loanwords generally follows that of the source language, though assimilation to Khmer phonology is not infrequent, and a good many loans from Sanskrit and Prākṛta conform to the canonical forms of vernacular Khmer. A common, although far from universal pattern, especially in the

648

Sidwell

assimilation of Indic loans, is that source forms with multiple syllables in which the final syllable is open, have the final vowel dropped, such that the onset of the ultima becomes the coda of the preceding syllable. For further discussion of loan assimilation in Khmer see, for example, Gorgoniev (1966), Haiman (2011). Examples: (3) kapāla /kəɓaːl/ ‘head’< Sanskrit kapāla ‘cup, dish; skull, cranium’ kuṇḍala /kəndɔːl/ ‘ring’ < Sanskrit kuṇḍala porāna /ɓuraːn/ ‘ancient’ < Sanskrit and Pāli purāṇa ʼagāra ~ ʼgāra /ʔəgaːr/ ‘building’ < Sanskrit agāra

2.2 Phoneme Inventory and Phonotactics 2.2.1 Consonants The Old Khmer consonants are tabled as follows: Table 6.1

Old Khmer consonants

Initials

p b ɓ m w

t d ɗ n r, l s

Finals

c ɟ

k g

ɲ j

ŋ

ʔ

p m w

t n r, l

c ɲ j

k ŋ

h

h

All of the initials occur before any vowel nucleus—especially due to the extensive Indic contribution to the lexicon—although there are restrictions elsewhere. In final position the stops are all voiceless and unreleased, and we assume that final /s/ does not occur, having shifted to /h/ at some stage. Note also that the Austroasiatic final glottal stop was lost generally, creating open syllables; phonetic final glottal stops in Modern Khmer typically reflect etymological final /k/. 2.2.2 Vowels The vowel inventory has both long and short members, plus the two long diphthongs of Angkorian: /iːə/ and /uːə/. Note also the frequent occurrence of the

649

Old Khmer Table 6.2 Old Khmer vowels

iː eː ɛː iːə

ɯː ɤː aː

uː oː ɔː uːə

ɪ e ɛ

ɤ a

ʊ o ɔ

short schwa [ə], we regard this as the unstressed allophone of /ɔ/ occurring in presyllables and Indic loans. There was probably a tendency, as in the modern language, for both /uː/ and /oː/ in open syllables to be realized as as [ɯːw] and [ɤːw] respectively. CC clusters found only in initial position exhibit great variety. The first member may not be any of /ʔ, ɓ, ɗ, n, ɲ, ŋ, h, w, j/, whereas all twenty-one of the consonants may be found in pre-vocalic position. Examples of the single consonants as well as consonant clusters will be found in abundance in all that follows. Juncture between the prior and latter member of clusters may be close or open. Close juncture is here unmarked while juncture is marked by subphonemic /ʰ/ or /ə/: (4) phlū /pʰluː ~ pʰlɯːw/ ‘road’ manuṣya /mənʊh/ ‘man, human’ ph’van /pəʔuːən/ ‘younger sibling’ saṃkhyā /səŋkʰjaː/ ‘count, reckoning’

2.2.3 Suprasegmentals It is apparent that Old Khmer lacked any tonal or phonation type contrasts, only later in the Middle Khmer period did the tense/lax (head/chest) register distinction develop. Given the nature of the inscriptional record it is not possible to say anything useful about the prosodic features of the language. 3

Word Formation

3.1 Compounding Compounding as a word formation strategy was clearly present, and may have been quite productive in the vernacular language, but is only sparsely attested in the chancery idiom of Old Khmer, where Sanskrit and Pali forms were often

650

Sidwell

utilized for linguistic elaboration. For some examples of attested compound forms see remarks under §2.1 above). 3.2 Derivational Affixes Discussions of affixation in Old Khmer include Bauer (1988), Jacob (1993a,b), and You Sey (1976). It is clear that affixation is important in Old Khmer, especially for derivation. The following table, based on Jacob (1993a,b), summarizes the situation. The nominalizing -mn- apparently originates in multiple affixation (e.g. pvas ‘to enter holy orders’, phnvas ‘holy orders’, paṃnvas ‘holy orders’) although it already appears to behave as a single morpheme by the Old Khmer period. Table 6.3 Old Khmer derivational affixes

Nominalizing

affix

examples

pN-, ’N-

jāv ‘to barter’ > pamjāv ‘bartered goods’ ruṅ ‘to be big, mature’ > ʼaṃruṅ ‘size, extent, area’ pvas ‘to enter holy orders’ > phnvas ‘holy orders’ sre ‘riceland’ > sanre ‘ricefield’2 pre ‘to use’ > pamre ‘servant’ cer ‘to transgress’ > cmer ‘transgressor’ jvan ‘to offer’ > jaṃnvan ‘offering’ duñ ‘to buy’ > daṃnuñ ‘purchase’ tvaṅ ‘coconut’ > thpūṅa ‘head, lump; south’ car ‘to plant in a row’ > c(h)par ‘flower garden, plot’ tval ‘to fall’ > phtval ‘to fell, bring down, overthrow, topple’ ket ‘to be born’ > paṅketa ‘to birth, beget, topple’ slāp ‘to die’ > saṃlāp ‘to cause to die’ cren ‘to grow’ > caṃren ‘to cause to grow’ pān ‘to hold, keep’ > ʼaṃpān ‘to arrest, obtain’ poñ ‘Courtesy title for males’ > kpoñ ‘elder sibling’ tai ‘female’ > kantai ‘wife’

-n-m-mn-pCausative

p-m-

Verbalizing3 Proper name

(’N-) (kN-)?

2 Jacob (1993a) treats this example, and one other word, under a separate category: ‘Quantifier’. 3 Jacob (1993a: 24) notes that the evidence for the ‘verbalizing’ prefix is “slender”.

Old Khmer

651

Morphological Passive A limited number of NPs consisting of a head + genitive dependent, when attributive to a head, can be interpreted as passivized by virtue of their position. In this case the genitive dependent may be said to function as the agent. It will be noted that most of these genitive dependents are derivatives by infix /‑əmn- ~ -əɴ-/. Examples: (5) sre pradāna poñ lon riceland gifted title pn ‘Riceland gifted by the poñ Lon’ (6) kñuṃ ’aṃnoy kurāk hvār ta vraḥ slave given title pn link holy ‘slaves given by the kurāk of Hvār to the divinity’ (7) . . . prasap sre daṃnuñ chloñ vidyāmaya . . . abut field purchased title pn ‘. . . [it] abuts on the ricefield purchased by the chloñ Vidyāmaya’; (8) jvan bhūmi valaya jaṃnau ’āgneya mat ta offer land enclosed purchased southeast entrance link haraṇasthāna thmo quarry stone ‘[he] has offered up a piece of enclosed land purchased [by himself] southeast of the entry into the stone quarry’

3.3 Reduplication Within the corpus one finds numerous examples of reduplication of initial consonants, producing various derivatives. E.g. *krek /krɤːk/ ‘to stir’, > kakrek /kəkrɤːk/ ‘to quake’. 4

Phrase and Clause Structure

Old Khmer is verb-centered, allowing a proposition to be expressed minimally by a bare verb, constituting the simplest autonomous clause or sentence. Clauses are concatenated or embedded to create more complex sentences. Because the inscriptions are mostly concerned with the delimitation of land,

652

Sidwell

inventories of property, slave assignments and other lists, most of them show a mixture of full sentences and nonsentential strings, with the latter outnumbering the former. In most cases the nonsentential strings correspond to the predicates of full sentences, reduced in order to serve as captions. Full sentences in the inscriptions are mostly of the declarative type, plus a few which are optative, while imperatives and interrogatives are rare. 4.1 Simple Sentences The minimal sentence is the autonomous clause. Basic word order is SV/AVP, and arguments are frequently omitted. (9) prāsāda khvuṅ temple high ‘the temple is high’ (10) ’ryāṃ lā elder.sibling depart ‘[my] older brother took his leave’ (11) paṃnvas cya slā monks eat areca ‘the monks were chewing arecanut’

For equational clauses: (12) neḥ mandira mratāñ prox palace lord ‘this (is) the lord’s palace’

Clauses may be expanded with prepositional phrases: (13) ’ryāṃ ’añ mok ’aṃvi nagara elder.sibling 1sg come from capital ‘my older brother came from the capital’ (14) saṅ padaḥ nau le bhnaṃ build house loc upon hill ‘[he] built a house on the hill’

Old Khmer

653

Adverbs and adverbial phrases are widely used to fill out a basic clause: (15) īlū ’ryāṃ ’añ saṅ padaḥ now elder.sibling 1sg build house ‘my older brother is now building a house’ (16) masāñnakṣatra mratāñ saṅ mandira year-of-serpent Lord build palace ‘the lord built a palace in the year of the Serpent’

Questions Questions are extremely rare in the corpus, but from what is available it is apparent that they can be formed with or without an interrogative, such as ’avi ‘what’, which occurs in situ, according to its syntactic function. (17) ’agāra noḥ thāp nu ’āśrama ’agāra ’avi ○ prāsāda building dist near ins āśrama building what temple ‘What building is that next to the āśrama? [It’s] a temple’ (18) vnaṃ khvuṅ dāp ○ khvuṅ hill high low high ‘Is the hill high or low? [It] is high’

Ditransitives Old Khmer texts are frequently concerned with the giving/granting/offering property, either as a result of purchase or sale or as taxation or ritual propitiation, although these are typically expressed in passive constructions (see examples under §3.2). Where we have straightforward ditransitives of the ‘give’ type, the direct object normally comes directly after the verb, and is followed (with or without intervening material) by the indirect object in one (or more) oblique prepositional phrase. For example: (19) . . . kamrateṅ kaṃtvan ’añ viṅ oy prasāda bhūmi . . . high.lord maternal 1sg again give favour land sratāc ṛdval ta loñ vasudeva pn pn link title pn ‘. . . My High Lord in the maternal line again gives the tracts in Sratāc [and] Ṛdval as a royal grant to the loñ Vāsudeva . . .’

654 4.2 4.2.1

Sidwell

Complex Sentences Coordination, Chaining, Subordination

Equational Sentences Equational sentences are those in which the subject is equated or identified with the predicate, the two being jointed without a linker or with a copular verb or the anaphoric copula syaṅ. Examples without a linker: (20) pitara noḥ brāhmaṇa father dist brāhmaṇa’ ‘his forefathers (were) brāhmaṇa’ (21) neḥ dravya nu jau bhūmi noḥ prox valuables ins buy land dist ‘these (were) costly goods for acquiring the said tract’ (22) sruk bhūmi khñuṃ neḥ nai tapasvīndrāśrama village land slave prox belong pn ‘this village and land [and these] slaves (are) the property of the Tapasvīndrāśrama’

With copula syaṅ: (23) gi noḥ sre noḥ syaṅ man oy ta vraḥ idf dist village dist cop rel give link holy kaṃmrateṅ ’añ śivaliṅga high.lord 1sg pn ‘The said ricefields are what [he] has given to My Holy High Lord of the Śivaliṅga’

Compound Sentences Two or more coordinate but otherwise autonomous clauses may be linked to form a compound sentence, without a conjunction. Examples: (24) dep saṅ gośāla duk thpal ta gi saṅ chpār next build stable assign herd link idf build garden ‘[They] next built a cow stable, assigned a herd to it, [and] made a garden’ (25) saṅ gol nu 925 śaka cat caṃnat 1 build boundary.markers ins 925 era arrange designate 1 ‘[He] set up boundary-markers [on it] in the Śaka year 925 [and] laid out a settlement’

655

Old Khmer (26) cāmpa cāp phsok śata vyar jvan ta Champa seize prisoner hundred two offer link

kamrateṅ jagat vrai.lvac Lord people pn ‘. . . ., Champa took two hundred prisoners of war, and offered [them] up to the High Lord of the World at Vrai Lvac’

The absence of coordinate clause conjunctions sometimes makes it difficult to distinguish a compound sentence from a sequence of two or more simple sentences. The question can usually be resolved in the case of parallel constructions, but may be insoluble elsewhere. Punctuation is rarely reliable, but in the following case the repeated ri (topic marker) is decisive: (27) riy bhūmi ta cval ta caṅvat ’āyatta ta ’nak top country link enter link terrain assign link person ta thve rājakāryya ○ riy vraḥ liṅga ’āy link make royal.service top holy linga at vnur.sramo ’āyatta teṅ su pn assign.authority title pn ‘The land within [these] bounds—[it] shall be under the authority of one who carries our royal service and family head. The holy liṅga at Vnur Sramo—[it] shall be under the authority of the teṅ Su’

Subordinate Clauses A minimal sentence may function (with or without marking) as a subordinate clause, linked to a preceding or following main clause to form a complex sentence. Various types of subordinate clause may be distinguished: the temporal clause, the conditional clause, the clause of purpose. Examples: (28) vrai vraḥ bhāga 1 pakṣa khnet forest holy portion 1 group fortnight.of.waxing.moon ti mratāñ khnar oy thlai mās liṅ ti pass lord sanctuary give price gold measure goal pul ta borrow link

vraḥ kamrateṅ ’añ ’āy vrac holy Lord 1sg give pn

656

Sidwell ‘One section of sanctuary forest [reserved] for the fortnight of the waxing moon was given [him] by the lord of the sanctuary for 10 liṅ of gold, borrowed from My Holy High Lord at Vrac’

(29) ’nak sruk oy slā ’aṃlū ta bhikṣugaṇa ○ cya person village give areca betel link monks eat ‘The villagers gave the monks some areca and betel. [They] are eating [it]’

Relative Clauses Relative clauses are those which are either introduced by a relativizer or modify a noun or NP head without such marking. Pre-Angkorian favors the relativizer, which is usually dispensed with in Angkorian. Example: (30) saṅ khñuṃ jmaḥ khdic ’añ jvan ta vraḥ śivaliṅga build 1sg name slave 1sg offer link holy śivaliṅga ‘[he] made [me] the pious gift of a slave named Khdic, whom I offered up to the holy śivaliṅga’

Embedded Clauses Embedded clauses are subordinate clauses that occur within clauses on a higher level. Example: (31) man vraḥ pāda nirvvāṇapada kridā vala pi when holy foot pn raise force purp

’nak tok vraḥ ’āy bhadrapattana nu stuk.ransi○ person extract holy at pn ins pn

man vraḥ svey rājya chnāṃ 2 guḥ steṅ.añ śivācā when holy partake throne year two only title pn

ryya sthāpanā vraḥ noḥ ta nai santāna viṅ top set.up holy dist link property family again ‘When His Majesty Nirvāṇapada raised an army against those who had overthrown the images at Bhadrapattana and Stuk Ransi when the royal one had been on the throne only two years, the steṅ ’añ Śivācārya set up the said images belonging to his family again’

Old Khmer

657

Compound Complex Sentences A sentence may be both compound and complex, i.e. it may consist of two or more clauses, one or more of which contains a subordinate clause. Examples: (32) dep vraḥ kamrateṅ ’añ vidyāspada jā then holy high.lord 1sg pn be

kanmvay mātṛpakṣa ⸋⸋⸋⸋⸋ sthāpanā vraḥ nephew/niece maternal.line ⸋⸋⸋⸋⸋ set.up holy

bhagavatī thve ’āśrama ta bhūmi ⸋⸋ śiva pn make āśrama link ground ⸋⸋ pn ‘Later, My Holy High Lord of Vidyāspada, who was a nephew in the maternal line of ⸋⸋⸋⸋⸋, set up an image of Bhagavatī [and] founded an āśrama on land belonging to ⸋⸋ śiva’

Cleft Sentences Cleft sentences are common in Angkorian Khmer. They open with the dummy subject gi ‘this is’ (identifier, idf), the zero copula, and its complement, in the following example introduced with subordinating conjunction ta: (33) gi neḥ sre ta roḥ gi neḥh phoṅ idf prox field link manner idf prox pl tel kuruṅ maleṅ oy ta vraḥ rel king pn give link holy ‘It (is) these fields aforesaid which the kuruṅ of Maleṅ has given to the divinity . . .’

Passives In addition to the morphological passive, a periphrastic passive is widely used. This is marked by preposition ti (glossed pass here) followed by an optional agent, then followed by a transitive verb. In its full form the prepositional phrase representing the agent is acceptable before the verb. Examples: (34) sruk sre ta ti mratāñ oy ta vraḥ village field link pass lord give link holy ‘village [and] ricefields which were given by the lord to the divinity’

658

Sidwell

(35) gi santāna ti vraḥ pre trā dau nā kanmyaṅ.paṃre idf family pass holy use approve go place Corps.of.Pages’ ‘members of the family were bidden by the royal one transfer to the Corps of Pages’

4.3 Syntax and Pragmatics 4.3.1 Topicalization Topicalization is much favored in the inscriptions. Topicalized NPs are commonly but not always introduced by the prepositions ri ~ ri e, e, or nau ~ nau ru. Commonly topicalised NPs serve as the direct object, brought up to the sentence head. Occasionally they are the subject, and are followed by an anaphoric pronoun, or a genitive NP. Example of a genitive topic: (36) nau ta yokk neḥ ta roḥ neḥ ti pre top link take prox link manner prox goal order kāp thpvaṅ chop head ‘Of those who take these aforesaid—[the executioner] shall be ordered to cut off [their] heads’

Here, preposition nau marks the topic. It opens with headless ta subordinating a relative clause consisting of a transitive verb (yok) with its direct-object NP: neḥ ta roḥ neḥ, referring to previously mentioned sruk and slaves. This topic is followed by a normal clause opening with a passivized verb (ti pre), its subject unexpressed, and continuing with its transitive complement (kāp) with its direct object (thpvaṅ). 4.3.2 Multi Verb Predication As in the modern language, verbs may occur in series without being marked as to function. These series are of two main types: (a) a second verb is a sequel in time to the one before it, and (b) a second verb is the complement of the one before it. The first is exemplified by the much favoured dau ‘go’+ a verb of purpose: (37) dau oy śapata go give swear ‘to go out [and] administer the oath / go forth to administer the oath’

Old Khmer

659

And the second type: (38) svaṃ leṅ ’añ ’aṃpān jā braḥ buddha beg let 1sg achieve be holy Buddha ‘[I] pray to let me get to be a holy Buddha’

4.4 Nominal Phrases Nouns may be grouped into phrases of three types: the noun phrase (NP), the prepositional phrase (PP), and the adverbial phrase (AdvP). 4.4.1 Noun Phrases Within NPs three structures can be distinguished. Head + Genitive Modifier In this type the head noun is modified by a noun, pronoun or NP in a genitive relationship: padaḥ ’añ ‘my house’; padaḥ pitā ’añ ‘my father’s house’. It must be understood that this case relationship is of a general kind, often indistinguishable from the ablative (‘from’) or instrumental (‘by’): vraḥ śāsana dhūli vraḥ pāda ‘a royal directive of (from, by) His Majesty’. Head + Appositional Modifier It often happens that the modifier is co-referential with its head; in NPs referring to royalty this apposition can be carried to some length. For example: (39) dhūli vraḥ pāda dhūli jeṅ vraḥ kaṃmrateṅ ’añ dust holy foot dust leg holy high.lord 1sg śrī sūryyavarmmadeva pn ‘dust of the royal feet, dust of the feet, my High Lord Holy One, Śrī Sūryyavarmmadeva’

Above the NP dhūli vraḥ pāda ‘dust of the royal feet (pāda)’ consists of a noun head (dhūli) modified by the genitive NP (vraḥ pāda), in which the head (vraḥ) is modified by appositional pāda. This much is modified by an appositional gloss, namely the NP dhūli jeṅ ‘dust of the feet ( jeṅ)’. The foregoing is followed by a genitive NP (vraḥ kaṃmrateṅ ’añ śrī sūryyavarmmadeva), in which the head (vraḥ) is now modified by another appositional NP (kaṃmrateṅ ’añ). The whole is then modified by another appositional NP (śrī sūryyavarmmadeva), in which śrī is the head and sūryyavarmmadeva is in apposition to it.

660

Sidwell

Head + Verbal Modifier NPs with verbal modifiers are particularly useful. In padaḥ dhaṃ ‘a big house’ the head is modified by the stative verb dhaṃ ‘to be big’. Such verbal attributes may be compared to relative clauses since the same NP may be equivalent to ‘the house which is big’. Thus, padaḥ tūc mandira: ‘a house like a palace’ or ‘a house which is like a palace’. It is essential to understand that the optional relative pronoun appearing in the translation is brought in to answer the requirements of English. Note moreover that in all such cases the modifier may be introduced by the linker ta, marking its subordination to its head: padaḥ ta dhaṃ, padaḥ ta tūc mandira. In padaḥ ’añ duñ or padaḥ ta ’añ duñ, ‘the house which I bought’, we see that the head noun may be introduced with the linker ta. The same thought can be expressed without the personal pronoun: padaḥ duñ or padaḥ ta duñ ‘the house which [I] bought’. 4.4.2 Prepositional Phrases Prepositional phrases consist of a preposition head followed by its complement: ’aṃbi bhnaṃ ‘from the hill’, nau le bhnaṃ ‘on the hill’. In puṇya nai mratāñ śrī satyāśraya ‘the pious work of the lord Śrī Satyāśraya’, the preposition nai marks the genitive NP mratāñ śrī satyāśraya. As in English, prepositional phrases may occur without an expressed preposition. In vraḥ karuṇā ta paramapavitra ‘a royal command of the Supremely Sacred One’, the unmarked noun paramapavitra is genitive (or instrumental, or ablative) but may optionally be subordinated by ta. In sre kurek saṃreṅ ‘a ricefield belonging to the kurek of Saṃreṅ’ the genitive NP kurek saṃreṅ is neither marked nor subordinated except by position. 4.4.3 Adverbial Phrases Adverbial phrases are common: īlū neḥ ‘at this time, now’; roḥ neḥ/noḥ ‘in this/ that manner, so, thus’. Examples: (40) ’nak hau kamrateṅ śivāśrama roḥha noḥha ’nau person call high.lord pn manner dist stay ‘folk continued to call [him] the High Lord of the Śivāśrama as before’ (41) ’nak ta roḥha neḥha paṅket santāna ’āy sruk kuti person link manner prox beget lineage 1sg village pn ‘The persons aforesaid begat the line in the village of Kuṭi’

661

Old Khmer

5

Word Classes

Old Khmer word classes are readily established on the basis of syntax and semantics. The major (or open) wordclasses are verbs, the nouns, and adverbs, all further divisible into subclasses; the minor (or closed) classes include conjunctions and prepositions. The question of whether Khmer has a class of adjectives is an old one; in our analysis the words in question are treated as stative verbs. Frequently one and the same lexical form may be assigned to more than one wordclass. A given wordform may function now as a noun, now as a verb, while other function variously as a conjunction and a preposition. For example: (42) ’aṃcas varṣa chnāṃ tap praṃ piy age season year ten five three ‘to be aged eighteen years’,

where ’aṃcas, although usually a noun, is here clearly a stative verb ‘to be aged’. 5.1 Nouns Typical of Austroasiatic languages, the noun is unmarked for number, gender or case. Nouns are divisible into several subclasses as described below. 5.1.1 Common Nouns Common nouns are indefinite or non-particularized nouns denoting a broad range of objects and beings. Some examples: (43) dūk /duːk/ ~ dvak /duːək/ ‘boat’ tanlāp /ɗənlap/ ‘caddy’ kalpa /kal/ ‘age, eon’ calācala /cɔlacɔːl/ ‘instability’ thmiṅ /thmɪŋ/ ‘string instrumentalist’ puṇya /ɓʊn/ ‘good, virtue, merit’

lṅāc /ləŋaːc/ ‘sunset’ vas /bɔh/ ‘snake’ śīlā /sɪlaː/ ‘stone, rock’ sākṣi /saksiː/ ‘eyewitness’ huduga /hʊdʊk/ ‘large drum’ ’ādhvā /ʔadhwaː/ ‘road, way’

Common nouns may be optionally pluralized in several ways. Most commonly, a noun may be modified by adverbial phoṅ /phɔːŋ/ ‘as a group’. Example: (44) ge dau naraka nnuv gotra phoṅ 3pl go hell ins family pl ‘They shall go to hell along with members of [their] family’

662

Sidwell

Another common pluralizer is the 3pl pronoun ge ‘they’ occurring as the head of an NP. Example: (45) ge vraḥ phoṅ tel vraḥ kaṃmratāṅ ’añ 3pl holy pl rel holy high.lord 1sg

śrī jayahvarmma oy ta poñ īśvaracita Śrī Jayavarman give link title pn ‘Images which My Holy High Lord Śrī Jayavarman gave to the poñ Īśvaracita’

A noun may also be associated with a noun or verb denoting wholeness or completeness, such as ’aṃpall ‘all’, guḥ ‘fully’. Examples: (46) . . . pi jvan dravya ta ’aṃpall neḥh . . . intend offer goods link all prox ‘. . . with a view to offering up all these costly things’ (47) ri ’nak ta ñyāṅ paṃre is khe pramvāy guḥ . . . top person link cause serve end month six fully ‘Those who endeavor to serve [him] for only six months in all . . . ’

An effect of the pluralizers mentioned above is to convert a mass noun into a count noun. Thus bhūmi ‘land’ is also ‘tract of land’, while sabhā ‘council, court’ is also ‘member of a council’. This is particularly common with mass nouns denoting persons: kula ‘family’, also ‘members of a family. Also a common pluralizer gaṇa ‘class, series, set’, occasionally used in the sense of ‘unit of a set’. Examples: (48) valaya mvay thṅai-ket sruk enclosure one sun-appear village ‘a piece of enclosed land east of the village’ (49) canlyak thmi tap hat yo tap.prām cloth new ten measure measure fifteen ‘ten lengths of new cloth for the lower garment, [one] hat fifteen yo’ (50) is kulapakṣa phoṅ stac prakop saṃpat nu krama ○ all family pl supreme endowed riches ins rank ‘On all members of [their] families His Majesty was pleased to confer riches and rank’

663

Old Khmer (51) gaṇa prāk . . . gaṇa laṅgau . . . group silver group copper ‘articles of silver . . . articles of copper . . . ’

5.1.2 Pronouns Personal Pronouns Old Khmer inscriptions shows a paucity of personal pronouns, with titles, kinship terms, and repeated personal names frequently used rather than pronouns. In fact, no second person pronouns are found at all. The personal pronouns that are attested lack any gender or inclusive/exclusive distinctions. In the third person singular, the identifying marker gi /gɯː/ appears to take the place of a pronoun. It is not clear from the inscriptional material whether this constitutes a real reanalysis in Old Khmer, or whether the identifier just happens to be compatible with the contexts where a third person singular pronoun would be expected. Alternatively, the original function could have been pronominal, with the identificational function arising secondarily. The attested pronouns are as follows: Table 6.4 Old Khmer personal pronouns Singular

1 3 3DIM

Plural

’añ /ʔaɲ/ yeṅ /jɤːŋ/ gi /gɯː/ ge /geː/ va /waː/

The form vā ~ va ~ vah /waː/, glossed here as 3DIM (diminutive) (antecedent of Modern Khmer wiːə) is a pronoun used to refer to animals, things, or very young children, indicating familiar or contemptuous meaning, and is also used as a courtesy title for male commoners. Demonstrative Pronouns The demonstrative pronouns include the proximal neḥ /neh/ ‘this’ and the distal noḥ /nɔh/ ‘that’, referring to both animate and inanimate referents. These usually occur as dependents on a noun head, although noḥ is used occasionally as a third-person possessive. The form gi is frequent used as the dummy

664

Sidwell

subject of cleft sentences (glossed here as idf ‘identifier’), as a modifier in NPs, and as a locative. Various examples of gi: (52) gi neḥ sre ta roḥ gi neḥh phoṅ idf prox field link manner idf prox pl tel kuruṅ maleṅ oy ta vraḥ rel king pn give link holy ‘It (is) these fields aforesaid which the kuruṅ of Maleṅ has given to the divinity . . .’ (53) gi nā ge tel kaṃvoṅ doṅ ge pitāmātā ge idf place 3pl rel occupy and 3pl parents 3pl ‘This (is) the place where they shall forever dwell, both they [and] their ancestors’ (54) gi ti sthāpanā ’āy bhadrapattana idf pass set.up at pn ‘this was set up at Bhadrapattana’ (55) gi ta pradhāna ta kule idf link head link house ‘He (is) the one who (was) head of the house’ (56) rūpyabhājana karap gi silver.vessel lid 3sg ‘a silver vessel [and] its lid’

And an ambivalent use of gi: (57) neḥ kaṃmrateṅ ’añ ta pvan gi vraḥ prox high.lord 1sg link four idf holy

jananī kaṃmrateṅ ’añ ancestress high.lord 1sg ‘These four My High Ladies (were) the royal forebears of My High Lord’

Finally, gi is often reinforced by postposed neḥ or noḥ: (58) gi neḥ sre ‘the this ricefield → this ricefield’ gi noḥ puṇya ‘the those pious works → those pious works’

665

Old Khmer

Relativizers Several relativizers are in use: tel /ɗɛːl/ ‘who, whom, which; where; when’, man /mɔn/ ‘who, which’, kāla /kaːl/ ‘when’, nā /naː/ ‘who, whoso; what, whatever’, maḥ ~ muh /mɔh ~mʊh/ ‘he who; that which’. Pre-Angkorian favors tel, grammaticalised from the homophonous noun ‘the same’ and stative verb ‘to be the same, constant’. Examples: (59) gi neḥ tel mratāñ oy ta poñ . . . idf prox rel lord give link title ‘It is this that the lord has given the poñ . . .’ (60) kñuṃ ta man kloñ ’aṃmṛta oy ta slave link rel title pn give link ‘slaves whom the kloñ Amṛta has given to the divinity . . .’

vraḥ . . . holy

A characteristic of Old Khmer, particularly Angkorian, is the ellipsis of relative pronouns. In this case the dependent clause is unmarked. Example: (61) ’āyoga moy ti ta gi kuṭī paṃnvas cya slā ornament one goal link idf cell cleric eat areca ‘one ornament for the cell where the clerics partake of arecanut’

Reflexive Pronoun Angkorian khlvan /khluːən/ ‘body, person; self’, is found a few times as a reflexive pronoun. Example: (62)

5.1.3

rūpa vraḥ kamrateṅ ’añ śrī dharaṇīndrapaṇḍita cār khlvan image holy high.lord 1sg pn carve self ‘image of My Holy High Lord Śrī Dharaṇīndrapaṇḍita, graven by himself’ (probably meaning ‘. . . graven at his behest’)

Measure and Quantity Words

Numerals Two sets of numerals are in use, one vernacular, the other Sanskrit, both decimal. These are:

666 Indigenous lower numerals moy /moːj/ ~ mvāy /muːəj/ ‘one’ ver /beːr/ ~ vyar /biːər/ ‘two’ pi /ɓiː/ ‘three’ pon /ɓoːn/ ~ pvan /ɓuːən/ ‘four’ praṃ /pram/ ‘five’ praṃ moj ~ praṃ mvāy ‘six’ praṃ ver ~ praṃ vyar ‘seven’ praṃ pi ‘eight’ praṃ pon ~ praṃ pvan ‘nine’ tap /ɗɔp/ ‘ten’

Sidwell Sanskrit derived lower numerals eka /ʔɛːk/ ‘one’ do /doː/ ~ dvā /dwaː/ ‘two’ tri /triː/ ~ trayo /trəyoː/ ‘three’ catur /cəɗʊr/ ~ catvāri /catwariː/ ‘four’ pañca /ɓaɲ/ ‘five’ ṣaṭ /sat/ ~ śaṣṭi /sah/ ‘six’ sapta /sapɗaː/ ‘seven’ ’aṣṭa /ʔasɗaː/ ‘eight’ nava /nɔːp/ ‘nine’ daśa /dɔh/ ‘ten’

Note that the indigenous forms ‘6–9’ are additive compounds, while tap ‘10’ appears to be a loan from Chinese (Pou & Jenner 1973). From ‘eleven’ to ‘nineteen’ the numerals follow the pattern of moy dan tap ‘one and ten’, ver dan tap ‘two and ten’, pi dan tap ‘three and ten’, in which the dan is pre-Angkorian doṅ ~ daṅ /ɗɔŋ/ ‘and’, showing assimilation to the dental initial of tap. ‘Twenty’ is expressed by bhai /bhɤj/, one of a small set of collective quantifiers. ‘Forty’ is 2 bhai, ‘fifty’ is 2 bhai 10. Not all of these forms are attested. The only term for ‘hundred’ is Sanskrit śata /sɔːt/. Note that these numerals are strictly cardinal, no ordinals being attested. Numeral Classifiers Numeral classifiers are not much in evidence. This may be because the language as a whole used them sparingly, or it may reflect the chancery idiom alone. The chief classifiers are ’nak /nak/ for persons, tem /ɗɤːm/ for trees, ’anle /ʔənlɤː/ for places, sāt /sat/ for beasts or beings, and pre-Angkorian sanre /sənrɛː/ ~ sare /srɛː/ for ricefields. The positon is variable, coming before or after the numeral. Some examples: (63) ’aṃcās ’nak pvan elder person four ‘four elders’ (64) teṃ slā 4 tree areca four ‘4 areca palms’

Old Khmer

667

(65) cpar 1 ti bhāga ta vraḥ kuṭī ’anle 2 garden one goal part link holy cell place two ‘one garden for (the use of) the two holy cells’ (66) sre moy sanre field one field ‘one ricefield’ (67) kantai pi sāt woman three being ‘three women’

5.1.4

Names and Terms of Address

Titles and Honorifics Titles and honorifics occur with great frequency in the inscriptions. The two most important are śrī (‘radiance, glory’), and vraḥ (‘holy’), the latter is used as a headword before the titles and names of divinities or personages partaking of divinity such as kings, other royalty, and high-ranking ecclesiastics. All titles and names following vraḥ in the same NP are in apposition to it. The inventory of true titles is not great but at present cannot be ordered convincingly into a hierarchy; nor in some cases can civil and ecclesiastic titles be discriminated. The following comments are open to correction as new knowledge is developed. For divinities, at the top of the hierarchy are kaṃmrateṅ jagat ta rājya ‘the Sovereign High Lord of the World’ and unqualified kaṃmrateṅ jagat ‘High Lord of the World’, both applied to divinities of special pre-eminence. The latter is often found without vraḥ before it. With a few lesser divinities, probably of local origin, the title kpoñ /kəɓɔɲ/ is found in pre-Angkorian. It appears to be a derivative by prefix /k-/ of poñ /ɓɔːɲ/ ‘elder sibling’ and to refer to females. For the sovereign is usually alluded to by reference to his feet (pāda, jeṅ), ritually as well as physically his lowest part. Taking reverence is a step further, he is often alluded to by the dust or dirt clinging to the bottoms of the royal feet. Thus among the full royal titles is, for example, dhūli vraḥ pāda dhūli jeṅ vraḥ kaṃmraten ’añ śrī sūryyavarmmadeva, where dhūli jeṅ is a gloss on dhūli vraḥ pāda. All this is literally ‘dust of the royal pāda (feet), [which means] dust of the jeṅ (feet), of My Holy High Lord Śrī Sūryavarmadeva’. On the other hand, a common equivalent of the foregoing is abbreviated to vraḥ pāda ‘the royal

668

Sidwell

feet’, which by convention is usually rendered ‘His Majesty’. Even simpler is vraḥ alone, ‘the royal one’. Sovereigns in general are referred to as kaṃmrateṅ phdai karom ‘High Lords of the Earth’, in which phdai karom ‘the lower surface’ designates ‘land’. More often vraḥ kaṃmrateṅ ’añ, ‘My Holy High Lord’, is used of the sovereign, exalted ecclesiastics, and divinities. Kaṃmrateṅ /kəmmrəɗɛːŋ/ ‘high lord’, is a form with many variants. Below kaṃmrateṅ comes this mratāñ itself, apparently a civil title only. Below the mratāñ come two titles which appear to be borne by members of the royal family: loñ /loːɲ/ for males and the teṅ just cited for females. Among ecclesiastical titles figure steṅ /sɗɛːŋ/ ~ steñ /sɗɛːɲ/, and its secondary augment Angkorian kaṃsteṅ /kəmsɗɛːŋ/, neither of which has a satisfactory equivalent translation. Below the royalty and ranking clergy, finally, we have pre-Angkorian poñ /ɓɔːɲ/ ‘elder brother’ and Angkorian vāp /baːp/ ‘father’, borne by members of the landowning gentry. The common kloñ ~ khloñ /khloːɲ/ ‘chief’, the less common chloñ /chloːɲ/ (the latter probably ecclesiastical) and the uncommon ’aṃteṅ /ʔəmɗɛːŋ/ are three titles that have not been assigned a place in the hierarchy. Below the true titles mentioned above are a few labels functioning as headwords usually followed by names in apposition to them. Most of these identify the sex, age or status of the individual named. Commonest are the sex-markers used with the names of slaves: for males si /siː/ and vā /waː/, for females ku /kuː/ and tai /ɗɤj/, to which ’me ~ me /meː/ ‘mother’ must be added. Agemarkers and status-markers include ji ’tā ~ ji tā /ɟi taː/ ‘grandfather’ for elderly males and ji tvan /ɟi ɗuːən/ for elderly females. Pre-Angkorian has a ci and one or two others about which we should like to know more. Proper Nouns Proper nouns are definite, particularized nouns naming specific persons, places, and objects or concepts such as months and days of the week. They raise no grammatical problem. Examples: cke ~ chke ‘slavename’ śrī vāgīśvarapaṇḍita ‘personal name’ garuḍa ‘Garuḍa, the vehicle of Viṣṇu’ sahasrayajña ‘name of a major sacrifice’ ’āṣāḍha ‘the eighth lunar month’ vṛhaspati ‘Jupiter’

Old Khmer

669

5.1.5 Interjections A minor wordclass is the interjection, attested by just ten occurrences of hai /hɤj/ ‘Oh’, e.g.: (68) hai ta jā kuruṅ ta vṛkṣa phoṅ Oh link be king link tree pl ‘Oh [thou] who art king of trees!’ (69) hai sādhu satparusa phoṅ Oh holy.man good pl ‘Oh holy men [and] good!’

5.2 Verbs 5.2.1 Intransitive Verbs The intrasitives are a subclass which consists of active verbs that cannot take a direct object. Examples: (70) ter /ɗɤːr/ ‘to walk’ rat /rɔt/ ‘to run’ ’aṅgvay /ʔəŋguːəj/ ‘to sit’

Many verbs show transitive as well as intransitive functions. These must be distinguished from transitive verbs with direct objects unexpressed, such as: gvar /guːər/ ‘[he] plots [it]’. 5.2.2 Transitive Verbs The transitives are active verbs that do take a direct object, and includes those after which the direct object is unexpressed. Examples: (71) soca nirṇṇaya ‘to issue a decision’ vināśa caṃnāṃ ‘to do damage to a foundation’ rāja jvan dravya ‘the king offered up riches’ kvan nāṃ dau smau ‘the boy drove [it = a buffalo] to grass’

5.2.3 Ditransitive Verbs There are many examples of ditransitives in Old Khmer, especially benefactives with oy /ʔoːj/ ‘give’ which take indirect object in the form of an unmarked prepositional phrase, or prepositional phrase subordinated by the general linker ta. Ta is virtually obligatory after verbs of giving, marking the indirect object. Similarly there are purposives with saṅ ‘build, set up, erect; repay’.

670

Sidwell

(72) poñ uy oy kñuṃ ai ta kpoñ kamratāṅ ’añ title pn give slave top link title high.lord 1sg ‘The poñ Uy has given [the following] slaves to the kpoñ My High Lady’

oy is also employed to create periphrastic cuasatives, e.g.: (73) oy śapata ’anak ta sruk pvan give swear person link village four ‘to administer the oath to men of four villages’.

5.2.4 Stative Verbs This subclass corresponds to the adjectives of other analyses and, as the term indicates, describes various states or qualities. Examples: cas /cah/ ‘to be old’; dhaṃ /dhɔm/ ‘to be big’; so /sɔː/ ‘to be white’. Stative verbs are not always easy to distinguish from passives (if we accept that there is a useful distinction to be made in the case of Old Khmer). 5.3 Prepositions Prepositions are a large but closed class of grammatical items marking a variety of relationships: directionality, place in space and time, cause and effect, instrumentality, and possession. They introduce an obligatory complement, usually a noun or NP, occasionally a verb. They may be simple or compound. Following is a selection. 1. 2. 3. 4. 5. 6. 7.

karoṃ /kroːm/ ‘below, beneath, under’, grammaticalised from the homophonous noun ‘bottom’. kaṃluṅ /kəmlʊŋ/ ‘inside, in, within’, grammaticalised from the homophonous noun ‘hollow space, cavity; interior’. kroy /kroːj/ ‘behind, after, next, later’, grammaticalised from the homophonous noun ‘back, rear’. krau /krɤw/ ‘outside (of), out of; beyond’, grammaticalised from the homophonous noun ‘outside, exterior’. ti /ɗiː/, probably /ɗɪ/ in speech: ‘in the direction of: to, toward, on’, indicating a goal, used commonly with points of the compass. tem /ɗɤːm/ ‘on the basis of: by, with; on, over’, grammaticalised from the homophonous noun ‘trunk, stock; base, beginning’. dau /dɤw/: accusative and dative ‘to’ + outward goal (away from the speaker), evolved from the homophonous transitive verb meaning ‘to go to’. The same form occurs as an adverb.

Old Khmer

8.

9. 10. 11. 12.

13. 14. 15. 16. 17. 18.

671

nā /naː/ from the nominal sense of ‘place’, develops the sense of ‘at a point in space’, functionally equivalent to prepositional ideas ranging from ‘at, in, on; during’ to ‘before, in the presence of, at the sanctuary of (a divinity), at the court of (a sovereign)’ to ‘of or belonging to, assigned to (a rank)’, on to ‘on the subject of, concerning, relating to’. nu /nuː/ ‘with’ in three senses: comitative, instrumental, oppositive. nai /nɤj/ ‘of, belonging to’, evolved from the homophonous noun ‘property, possession’. nau1 /nɤw/ ~ nau ru /nɤw ruː/: untranslatable, topicalizes a following NP. Cf. ri. nau2 /nɤw/ locative ‘in, at; on’, showing grammaticalization from the homophonous verb ‘to be in or at, remain’. Note nau le vnaṃ, literally ‘on top of the hill’, i.e. ‘on the hill’, nau kantāl sre, literally ‘in the middle of the ricefields’, i.e. ‘in or among the ricefields’, nau kaṃluṅ vrai, literally ‘in the interior of the forest’, i.e. ‘in the forest’. mok /mɔːk/ accusative and dative ‘to’ + inward goal (toward the speaker), evolved from the homophonous transitive verb meaning ‘to come to’. The same form occurs as an adverb. ri /riː/, probably in the stream of speech untranslatable, topicalizes a following NP. Cf. nau. ru /ruː/ ‘as, like’, evolved from the homophonous noun ‘way, manner’. loḥ /loh/ ~ lvoḥ /lʊəh/ ‘to, as far as, until’, evolved from the homophonous verb ‘to reach, gain’. ai /ʔɤj/ ~ ’āy /ʔaːj/ is a locative preposition: ‘at, in; by, beside; near’, probably by grammaticalization of the homophonous noun meaning ‘side, quarter; this side, here’. ’aṃvi /ʔəmbiː/ is an ablative preposition: ‘from’, probably evolved from a noun meaning ‘source’.

5.4 Adverbs Like the verb and noun, the adverb has no distinguishing form but is recognized by its syntactic function. It is a heterogeneous class consisting of lexical items which modify verbs, other adverbs, and clauses. Some adverbs are lexically adverbial; others show the grammaticalization of verbs and nouns. Partcularly important are the following two: (74) īlū /ʔiluː/ ‘the present time → at the present time’ roḥ /rɔh/ ‘way, manner → in the manner of’

672

Sidwell

A small number of adverbs are clause or phrase modifiers. Among these the most frequent are (75) guḥ /gʊh/ ‘only, completely’ sot /soːt/ ‘in addition’ ukk /ʔʊk/ ‘also; along with’

An occasional adverbializer is leṅ /lɛːŋ/, from grammaticalization of the homophonous verb ‘to let, allow’. Example: (76) ge dār rājabhaya ’aṃvi ihaloka lvoḥ 3pl request royal.punishments concerning world until

paraloka leṅ nānāprakāra ○ other.world adv various ‘they shall suffer royal punishments from this world to the next in many different ways’

Of special interest and not always easy to recognize is a class of nouns which are adverbialized by virtue of occurring directly after verbs. Examples: (77) prājñā siddhividyā wisdom lore.of.supernatural.power ‘wise in the lore of supernatural power’ (78) mratāñ khloñ śrī madhurapandita lord chief pn

tāc broken

santāna lineage

’yat lacking

kule ley . . . kinsman onward ‘The chief lord Śrī Madhurapaṇḍita, [his] family extinct (at an end with respect to [his] family) [and] with no more living kin, . . .’ (79) phsaṃm tap pvan ṅan laṅgau jyaṅ vyar liṅ tap assemble fourteen weigh copper measure two measure ten ‘Total, fourteen [pieces] weighing in copper two jyaṅ ten liṅ’

Old Khmer

673

Nu3 /nuː/ ‘now, on this date, in this year’, occurring at the head of a passage directly after a dateline. The form occurs only a few times in pre-Angkorian, but 220 times in Angkorian, including 65 cases in a cleft sentence. Example: (80) 815 śaka pi ket vaiśākha nu . . . . 815 age three fortnight.of.waxing.moon sixth.lunar.month now . . . . ‘Śaka 815, [day] three of the fortnight of the waxing moon of [the month of] Vaiśākha . . .’

5.5 Conjunctions Conjunctions are of two kinds: those which link nouns or NPs and those which link subordinate clauses. Coordinate clause conjunctions are not attested. Nearly all slavelists avoid the conjunction but divide successive names either by the daṇḍa (|) or a figure 1 or the small circle (○). 5.5.1 The Noun Conjunction When it occurs, conjunction nu ‘with, and’ poses no problem of interpretation. For example: (81) . . . taṃrvac vraḥ-kuti nuv khloñ ’nak vraḥ-kutiy . . . . . . inspector holy-cell ins chief person holy-cell ‘. . . The inspector of holy cells and the chief of staff of holy cells . . .’.

5.5.3 Clause Conjunctions Following in alphabetic order is a selection of conjunctions that introduce subordinate clauses. 1.

2. 3. 4. 5.

ka /kɔː/ is rare in Old Khmer but begins to be used to link clauses, the second of which is usually consequent upon the first. It evolves by grammaticalization from the intransitive verb meaning ‘to come into being, happen’. Example: kāla /kaːl/ [Sanskrit kāla ‘time’]: ‘at or in the time that, when, while’. ta /ɗɔː/ optional subordinating conjunction, glossed as link here. Examples abound in Old Khmer of ta occurring without an expressed head but followed by a noun or verb. daha ~ doḥ /dɔh/ ‘if, on condition that, in the event that’. nā /naː/ grammaticalized from the nominal sense of ‘place’ and develops the sense of ‘at a point in time or place’, functionally equivalent to ‘at the time that, when’ and ‘at the point where’.

674

Sidwell

nu /nuː/, probably /nʊ/ in speech, optional ‘and’ linking the constituents of an NP. 7. pi /ɓiː/, probably /ɓɪ/ in speech, ‘to intend, to come to pass, in order that’. 8. man1 /mɔn/, probably /mən/ in speech, a general conjunction with two main meanings developing out of ‘at the time that’: ‘when, while; then’ and ‘for, because, since’. 9. man2 /mɔn ~ mən/ introduces a clause of indirect discourse. 10. loḥ /loh/ ~ lvoḥ /lʊəh/ ‘up (down) to the time that, until; as long as; to the extent that’, evolved from the homophonous verb ‘to reach, gain’. 6.

6

Glossed Text

To give the reader the flavor of continuous narrative, there follows several lines of the Vằt Saṃroṅ Stone (K.956/2°), assigned to the Śaka 9th century (A.D. 878–977). neḥ gi roḥ kaṁnet santāna yeṅ. prox idf manner NML.born lineage 1pl ‘This [is] the manner of the origin of our line.’ ta gi rājya ta vraḥ pāda stac dau parameśvara link idf reign link holy foot roy go supreme.lord ‘During the reign of His Majesty who was pleased to go to the Parameśvara ’ji yeṅ piy toy mātṛ-pakṣa loñ lakṣana jā ancestor 1pl three follow mother-line title pn be three of our forebears in the maternal line [were] the loñ Lakṣana senāpati kantāl vala tmāl ’āy vraḥ.caṁpak teṁ general center power pn at pn original who was general of the army’s center at Tmāl in former Vraḥ Caṁpak teṅ ’yak jā devī kaḷṣṭa ’āy bhāvapura teṅ pit teṅ pn be queen remove at pn title pn the teṅ ’Yak, who as a queen was removed to Bhavapura; [and] the teṅ Pit, ti oy ta khloñ vala kamrateṅ ’añ ta pass give link chief power high.lord 1sg link who was given to the commandant My High Lord the

675

Old Khmer vraḥ ’ji vraḥ pāda kamrateṅ ’añ ta holy ancestor holy foot high.lord 1sg link royal grandfather of His Majesty who was pleased to go to

stac dau roy go

śvaraloka ta jmah loñ haradharmma ta trvac paṁcāṁ pn link named title pn link inspect guard the Īśvaraloka, who bore the name loñ Haradharma [and] who had inspected the guards ’aṁvi kāla vraḥ pāda stac dau parameśvara. begin time holy foot roy go Parameśvara. from the time of His Majesty who was pleased to go to the Parameśvara (Heaven).’

7 Bibliography Cœdès, George. 1937–1966. Inscriptions du Cambodge. Paris: École française d’ExtrêmeOrient, 8 vols. [transcriptions and translations of texts] Gorgoniev, Ju. A. 1966. The Khmer language. Moscow: Nauka. Haiman, John. 2011. Sesquisyllabicity in Khmer. In Sophana Srichampa, Paul Sidwell and Kenneth Gregerson (eds.), Austroasiatic Studies: papers from ICAAL4. MonKhmer Studies Journal Special Issue No. 3. Dallas, SIL International; Salaya, Mahidol University. Canberra: Pacific Linguistics, 147–161. Jacob, Judith M. 1993a. The Structure of the Word in Old Khmer. In David A. Smyth (ed.), 1–17. ———. 1993b. Prefixation and infixation in Old Mon, Old Khmer and Modern Khmer. In D. A. Smyth (ed.), 18–26. Jenner, Philip & Paul Sidwell. 2010. Old Khmer grammar. Canberra: Pacific Linguistics. Jenner, Philip & Saveros Pou. 1980–1981. A lexicon of Khmer morphology. Mon-Khmer Studies 9–10: 1–517. Jenner, Philip. 2009a. A Dictionary of Angkorian Khmer. Canberra: Pacific Linguistics. ———. 2009b. A Dictionary of Pre-Angkorian Khmer. Canberra: Pacific Linguistics. Martini, François. 1957–1958. La distinction du prédicat de qualité et de l’épithète en cambodgien et en siamois. Bulletin de la Société Linguistique de Paris 63.1: 295–305 Pou, Saveros. 1989. Nouvelles inscriptions du Cambodge. Paris: École française d’Extrême-Orient. [with plates] ———. 1992. Dictionnaire vieux khmer—français—anglais. Paris: CeDORECK. ———. 2001. Nouvelles inscriptions du Cambodge II et III. Paris: École française d’Extrême-Orient, 2001. [volume II was originally published in 1996 but due to printing errors republished in this volume; with plates]

676

Sidwell

———. 2004. Dictionnaire vieux khmer—français—anglais. Paris: L’Harmattan. [2nd enlarged edition] Pou, Saveros and Philip Jenner. 1973. Some Chinese loanwords in Khmer. Journal of Oriental Studies 11.1: 1–90. Smyth, David A. (ed.). 1993. Cambodian Linguistics, Literature and History. London: SOAS. UNESCO. 2002. Ancient inscriptions of Cambodia, Silacaryk buran rapas Kambuja, Pracast Kambuja. Phnom Penh, UNV/UNDP Project, UNESCO. (OCLC 223340556) You Sey. 1976. Some Old Khmer affixation. Mon-Khmer Studies 5: 85–95.

chapter 7

Modern Khmer Walter Bisang 1 Background Khmer (also Cambodian) (khm) is the official language of Cambodia. It has some 12.3 million speakers1 in Cambodia, where it is the mother tongue of about 90% of the population. Together with the Surin dialects in Northeastern Thailand and in the Mekong Delta region of Vietnam (where they are known as the Khmae Kraom or Kiangkleang Khmer, and Kiengang Khmer by the Vietnamese), there is a total of some 14 million speakers of Khmer. Typologically, Khmer is a verb-medial language. Its noun-phrase structure is predominantly head-initial. Phonologically, it is characterized by its rich vowel inventory and by the iambic or sesquisyllabic structure of its words. Compared to other eastern Austroasiatic languages, Khmer morphology is very rich in terms of its number of prefixes and infixes (monosegmental nasals and consonants as well as the polysegmental infix -vmN-/-vN-). At the same time, it has not developed a fully-fledged morphological system and its morphology does not express inflectional categories (e.g. tense-aspect, person, number). Another typological characteristic of Khmer is the high relevance of pragmatic inference. Its grammar allows the omission of arguments (radical pro-drop), tense-aspect markers, adverbial subordinators and various other grammatical markers. In addition, even highly grammaticalized markers can express various grammatical functions (multifunctionality; cf. §4.3). Khmer is a comparatively well-described language. Since the last century, several grammatical descriptions have been published: Maspéro (1915), Gorgoniev (1963, 1966), Huffman (1967), Jacob (1968), Ehrman (1972), Khin (1999) and Haiman (2011). There are the monolingual Khmer dictionary Chuon Nath (1967) and the following English-based dictionaries: Jacob (1974), Headley et al. (1977) and Huffman and Proum (1978). Moreover, there are several course books and readers: Huffman (1970a, b), Sacher and Nguon (1985) and Smyth (1995).

1 Ethnologue on Northern Khmer: http://www.ethnologue.com/show_language.asp?code= khm.

© koninklijke brill nv, leiden, ��5 | doi ��.��63/9789004283572_013

678

Bisang

Since Khmer is associated with a long history of Buddhism and Indian influence in general, an important part of its vocabulary (and some of its morphology) has Sanskrit and Pali roots. In spite of this, the present description of Khmer will only address Austroasiatic issues. Khmer has its own Devanāgari-based script. This chapter only uses the romanization system of Jacob (1968) as far as it reflects morphophonological facts (cf. §2.2). Purely orthographic annotations will be omitted. 2 Phonetics/Phonology 2.1 Word and Syllable Structure, Phonotactics Since Henderson (1952), Khmer is well-known for its canonical iambic or sesquisyllabic (Matisoff 1973) word structure. Thus, a word consists either of one syllable with the structure as in (1) or of two syllables with a reduced or short first syllable as in (2). Henderson (1952) uses the terms major syllable and minor syllable for the two syllable types. (1) Structure of major syllables: C1(C2)VC3 or C1(C2)VV(C3)2 (2) Structure of minor syllables (Huffman 1967: 45): C1 – (C2) – V – (C3), C1: every possible C, but only p, t, c, k, s if there is a C2. C2: only r V: every possible short vowel, but only ə, a, o, ɔ if there is a C2. C3: every C except b, d, r.

Major syllables consist of an onset with one or two consonants. Every syllable starts with a consonant. Syllables do not start with a vowel, there is always a glottal onset [ʔ]. In C1C2 clusters, the set of consonants occurring in C1 are p, t, c, k, ʔ, s, d, m and l. C2 is open to all consonants of the phonological system. Table 7.1 from (Huffman 1967: 24) shows the C1C2-clusters that can be found in Khmer words:

2 V stands for a short vowel, VV for a long vowel. Huffman (1967) also mentions two loan words with the structures CCCVC and CCCVVC: stha:n ‘place’ and lkhaon ‘theatre’.

679

Modern Khmer Table 7.1

C1C2-clusters in Khmer

C1

C2 p − + + + + − − − +

p t c k s ʔ d m l

t + − − + + − − + −

c + − − + − − − + −

k + + + − + − − − +

ʔ + + + + + − − + +

b − + + + + − + − +

d + − + + + − − + −

m − + + + + − − − +

n + + + + + − − + −

ɲ + − + + + − − + −

ŋ + + + + + − − − +

v − + + + + + − − +

j + + + + + − − − −

l + + + + + − − + −

r + + + + + − − + −

s + − − + − − − + −

h + + + + − − − + +

Minor syllables minimally have the structure CV. The only possible C2 in onset clusters is r. The rhyme always consists of a short vowel (V) and can be followed by a monoconsonantal coda (C3). 2.2 Phoneme Inventory (Consonants and Vowels) The consonant inventory of Khmer consists of the following 18 segments:3 Table 7.2 Khmer consonants

/

p b (f) m

v

t d s n l r

c

k

ɲ

ŋ

j

ʔ h

/

The labiodental fricative /f/ only occurs in loanwords. 3 All consonants can occur in the syllable onset. In the coda, we only find the nasals plus /p/, /t/, /c/, /k/, /ʔ/, /h/, /l/ and /j/.

680

Bisang

The two voiced occlusives /b/ and /d/ are implosives. The segment /r/ is an alveolar trill. The segment /h/ is realized as a glottal fricative in syllable-initial position (CV) and as an uvular or pharyngeal fricative in syllable-final position (VC). The segment /l/ is an alveolar lateral that is realised as a retroflex in syllable final position and as a voiceless alveolar in C2 of a C1C2 syllable-onset cluster. In syllable final position, the unvoiced occlusives /p/, /t/, /c/, and /k/ remain unreleased. The palatal segments /c/ and /ɲ/ additionally induce fronting and heightening of the preceding vowel nucleus. Thus, khoːc ‘go bad, go wrong’ is pronounced [khoːic/]. The romanization introduced by Jacob (1968) reflects the Indian writing system that distinguishes between aspirated and unaspirated occlusives (in Khmer: k vs. kh, c vs. ch, t vs. th, p vs. ph). Unfortunately, this system does not fully reflect the phonological system of Khmer. If the Ch sequences are immediately followed by the vowel nucleus in the writing system, they stand for a consonant cluster consisting of C1 = /p, t, c, k/ plus C2 = /h/ (as in khɤŋ ‘angry’, phɔːŋ ‘dust, powder’). If after h another consonant shows up in the writing system, h stands for aspiration as a phonetic effect in a consonant cluster of the type C1C2. Thus, khpùəh ‘high’ stands for the C1C2 cluster /kp/, pronounced [khp].4 The writing system also has v in syllable-final position in oːv and iəv. Its basic phonetic effect is that of adding a weak lip rounding to the nuclear vowel. Thus, phloːv ‘way’ is pronounced [pləːu]. Haiman (2011: 2) analyses oːv as a diphthong. Khmer is known for its rather complex vowel system. The exact number of vowel phonemes is still subject to some controversial discussion, not least because of considerable cross-dialectal variation. This chapter integrates the register-based distinctions that are part of Jacob’s (1968) romanization into a phonological system that takes up insights from Huffman (1967: 240–249), Ehrman (1972: 8–9) and Haiman (2011: 1–2). For that purpose, Table 7.3 presents the vowel segments in their phonological structure on the first line. The second line shows how these phonemes are represented by Jacob (1968; no accent on the vowel stands for first register, the gravis accent [`] for second register, cf. §2.3):

4 This is reflected by the behaviour of the infix -vmn-/-vN-. If this infix, realized here as -ɔm-, is infixed into a C1C2 cluster in which C2 is /h/, C2 remains: kɔmhɤŋ ‘anger’ from khɤŋ ‘angry’. If it is inserted into a C1C2 cluster in which the h-sound is only a phonetic effect it does not show up in the infixed form. Thus, the infixed form of khpùəh ‘high’ is kɔmpùəh ‘height’.

681

Modern Khmer Table 7.3 Khmer monophthongs

/i, iː/ ɪ̀, ɪ̀ː /e, eː/ e, eː, èː /ɛː/ ɛ̀ː

/ɯ, ɯː/ ɯ̀ , ɯ̀ ː

/ə, əː/ ɤ, ɤː, ɤ̀ː /a, aː/ a, aː

/u, uː/ ù, ùː /o, oː/ o, oː, òː /ɔː/ ɔ̀ ː /ɑ, ɑː/ ɔ, ɔː

In addition to the above monophthongs, there are the following diphthongs, presented in the same way as in Table 7.4: Table 7.4 Khmer diphthongs

/iə/ iə, ɪ̀ə, ɪ̀ːə /ɛə/ ɛ̀ə

/ae/

ae

/ɯə/ ɯə, ɯ̀ ə

/ɤɯ/ ɤ̀u /aə/ aə /aɯ/ au

/uə/ ùə, uːə, ùːə /ɔə/ ɔ̀ ə

/ao/ ao

2.3 Suprasegmentals Khmer is not a tone language. It has a stress system based on loudness and pitch. In sesquisyllabic words, the stress is on the major syllable. At the clause level, declaratives are expressed by a falling intonation contour. Questions can have rising contour if the speaker is respectful but falling contour is also possible if the status of the speaker is equal or higher.

682

Bisang

The Khmer writing system distinguishes between two sets of consonants whose pronunciation is phonetically identical in Khmer. Since Henderson (1952), these two sets are termed “first register” and “second register”. Each register is associated with a specific pronunciation of the vowel symbols. Thus, the vowel symbol that stands for aː with the first register t represents ɪ̀ːə with the second register t (cf. taː ‘grandfather’ vs. tɪ̀ːə ‘duck’). The following list represents the relevant syllable nuclei in pairs of first and second register: (3) Register pairs: ɔː — ɔ̀ ː ɔ — ùə, ù aː — ɪ̀ːə e — ɪ̀ ɤ( j) — ɪ̀ː a — ɛ̀ə, ɔ̀ ə

ɤ ɤː o oː uːə aə

— — — — — —

ɯ̀ ɯ̀ ː ù ùː ùːə ɤ̀ː

ɯə iə eː ae ao au

— — — — — —

ɯ̀ ə ɪ̀ə èː ɛ̀ː òː ɤ̀ u

From the perspective of the underlying Indian writing system, the two sets of consonant symbols stand for unvoiced vs. voiced, i.e., */p, t, c, k/ vs. */b, t, j, g/. Thus, the register differences in the vowels (3) are compensations for the loss of this opposition. Initially, that difference was phonetically realized by the opposition clear vs. breathy voice. However, breathy phonation is no longer phonemic in the standard pronunciation. The effect of the two registers only shows up in the high number of vowel phonemes. 3

Word Formation

3.1 Compounding Compounding is a central part of Khmer word formation. The main techniques are modification, parallel constructions, pseudo-compounds and noun-verb constructions. Modification is widely used with nouns. As in syntax, the head comes first and is followed by a second element that specifies it. In bɔntùp-kèːŋ ‘bedroom’, the head bɔntùp ‘room’ is modified by the verb kèːŋ ‘sleep [familiar]’. In other compounds, the specifying element is a noun as in bɔntùp-tɯ̀ k [room-water] ‘bathroom’ or even a VP as in bɔntùp-tətùːəl-phɲɪ̀əv [room-receive-guest] ‘living room’ (Ehrman 1972: 48). This type is particularly productive if the head noun is a class noun, i.e., a noun which denotes rather general or abstract concepts (Bisang 1996). The class noun nɛ̀ək ‘man’ forms agent nouns:

Modern Khmer (4)

683

nɛ̀ək-rɔ̀ əm [man-dance] ‘dancer’ nɛ̀ək-srae [man-rice-field] ‘farmer’ nɛ̀ək-kɔndaːl [man-middle/center] ‘intermediary’ nɛ̀ək-cap-trɤj [man-catch-fish] ‘fisherman’

In parallel constructions, the components are not in a hierarchical relation. In some instances, the components are antonyms as in sok-tùk [happiness/wellbeing-unhappiness/suffering] ‘state of health’. In most cases, the components are near synonyms or share important semantic properties. The components can be nouns or verbs: (5)

ʔoːpùk-mdaːj [father-mother] ‘parents’ khao-ʔaːv [trousers-dress/blouse/shirt] ‘suit, clothes’ cap-phdaəm [catch/begin-begin] ‘begin, start’ kɪ̀ːəp-sɔŋkɔt [pinch-press down] ‘oppress’

The formation of words consisting of two components is extremely productive and even leads to pseudo-compounds, i.e., compounds in which one component cannot occur alone as with -kaːh in prak-kaːh [silver/money-ø] ‘money’. Finally, the combination of stative verbs with nouns produces compound expressions denoting states and conditions that together behave like intransitive verbs. Particularly productive are compounds with the noun cɤt ‘heart, feeling, mind’, which also occurs in transitive expressions. (6) thŋùən-trəciək [heavy-ear] ‘hard of hearing’ srùːəl-mɔ̀ ət [easy/comfortable-mouth] ‘easy to say’ cɪ̀ːə-phnɛ̀ːk [be well/healthy-eye] ‘sound of eye/able to see’ peɲ-cɤt [full-heart] ‘be satisfied’ khoːc-cɤt [go bad/go wrong/break down-heart] ‘[emotionally] devastated’ cap-cɤt [catch-heart] ‘take a liking to, fall in love with’ tùk-cɤt [put away/keep-heart] ‘trust (v.), to have confidence in’

3.2 Derivational Affixes Historically, Khmer has a relatively rich derivational morphology that consists of prefixes and infixes (Jenner and Pou 1982, Bisang 1992: 447–472). In general, morphology is not productive and there is a tendency in spoken Khmer to use simple words with no affixation.

684

Bisang

Khmer morphology is strictly integrated into its iambic or sesquisyllabic word structure as presented in (1) and (2). The following word structures are generated by Khmer morphology (Jenner and Pou 1982):5 (7)

a. CV(C) b. CCV(C) c. CəCV(C) or CəCCV(C) d. CrəCV(C) or CrəCCV(C) e. CvNCV(C) or CvNCCV(C) [v = /ɔ, ùə, ù/, N = /m, n, ŋ, ɲ/, homorganic nasal]

The structures in (7a, b) stand for monosyllabic words, those in (7c–e) for iambic or sesquisyllabic words. The indications in (7e) show that Khmer morphology does not exhaust the full potential of vowel nuclei and nasal codae of the minor syllable. The nasal N (m, n, ŋ, ɲ) is usually homorganic with the consonant that follows it. From the perspective of its semantics, Khmer affixes are characterized by (i) the scarcity of functions they express and by (ii) their multifunctionality. Jenner and Pou (1982; Bisang 1992: 457) list 28 affixes. This number is in strong contrast to the three main functions expressed by Khmer morphology. The first two functions are nominalization and causativization/transitivization. The third function is specialization, a term created by Jenner and Pou (1982) that covers all instances in which the meaning of the affixed form is lexically more specific than the meaning of the base. Given the limited number of functions, each of them is expressed by quite a few affixes. The following examples illustrate the most important affixes that express nominalization (8) and causativity/transitivity (9) (for specification, cf. (10)). (8)

k-: s-: m-: N-: -b-: -m-: -n-:

baŋ ‘to screen, to shade/cover sth.’ pɪ̀ːən ‘pass over, traverse’ hoːp ‘eat’ baoh ‘to sweep’ rɔ̀ əm ‘to dance’ sòːm ‘ask’ kɪ̀ːəp ‘squeeze, apply pincers’

→

k-baŋ ‘screen, movable curtain, veil’

→ → → → → →

s-pɪ̀ːən ‘bridge’ m-hoːp ‘food’ ʔɔm-baoh ‘brush, n.’ rəbam ‘dance, n.) smòːm ‘beggar (someone who asks)’ khnɪ̀ːəp ‘pincers’

5 The representation in (7) neglects vowel length in the major syllable (cf. (1) for that purpose).

685

Modern Khmer

(9)

-vmn-: bvN-: kvN-: svN-:

dam ‘plant, v.’ tùk ‘put away, keep’ cas ‘old’ bɔ̀ ːk ‘peel, strip of bark or skin’

→ → → →

dɔmnam ‘plant, n.’6 bɔntùk ‘cargo, load’ kɔɲcas ‘old man [derogative]’ sɔmbɔ̀ ːk ‘shell, husk, bark, skin’

p-: prə-: crə-: -vmn-: bvN-: svN-:

dac ‘break, be torn apart, intr.’ kaət ‘be born, arise, happen’ mùcː ‘sink, immerse oneself’ daə ‘walk’ rɪ̀ən ‘learn’ rùːəp-rùːəm ‘unite, consolidate, intr.’

→ → → → → →

phdac ‘break, separate, tr.’ prəkaət ‘cause, bring about’ crəmùcː ‘immerse, tr.’ dɔmnaə ‘process’ bɔŋrɪ̀ən ‘teach’ sɔŋrùːəp-sɔŋrùːəm ‘reconcile, restore’

Most Khmer affixes are multifunctional, i.e., one and the same affix can express more than one function. The concrete meaning of an affix is a matter of the lexicon, i.e., it depends on the base with which it occurs. The following list presents affixes which can express (i) nominalization, (ii) causativization/ transitivization and (iii) a third function, which is specialization (p-, bvN-, -vmn-/-vN-) or reciprocity (prə-): (10) Affix p-

Function (i) (ii) (iii)

Base lèːŋ ‘play’ cɔ̀əp ‘to stick, be joined’ haəm ‘swollen’

Base + Affix ph-lèːŋ ‘music’ ph-cɔ̀ əp ‘attach firmly’ ph-haəm ‘swollen, pregnant’

prə-

(i) (ii) (iii)

cheh ‘catch fire’ kaət ‘be born, arise’ deɲ ‘follow, pursue’

prə-cheh ‘wick’ prə-kaət ‘bring about’ prə-deɲ ‘pursue e.o., compete’

bvN-

(i) (ii) (iii)

vèc ‘to parcel up’ baek ‘break [intr.]’ lèːŋ ‘play’

bɔŋ-veːc ‘package, n.’ bɔm-baek ‘break [tr.]’ bɔŋlèːŋ ‘entertain [by playing music]’

lʔɔː ‘beautiful’ slap ‘die’

lùmʔɔː ‘beauty, embellishment’ sɔmlap ‘kill’

-vmn-/ (i) -vN (ii)

6 The form -vmn- occurs with basic syllables of the form CV(C), whereas -vN- is used with C1C2V(C). With C1C2VC-bases, the infix occurs between C1 and C2 (cf. Huffman 1967). The vowel v is ɔ in the first register and ù or ùə in the second register. The nasal is homorganic with C2.

686

Bisang

Haiman (1998, 2011: 76–82) and Haiman & Ourn (2003) offer an explanation for this remarkable combination of functional limitations and multifunctionality that is based on the sesquisyllabic structure of Khmer words and on processes of reduced pronunciation in rapid speech. 3.3 Reduplication Khmer has partial reduplication as well as total reduplication. Partial reduplication is used in derivational morphology and in various forms of alliteration and rhyme. In derivational morphology, the first consonant of the base is repeated. A neutral vowel ə is inserted between the two consonants. In most cases, this type of reduplication expresses repetition or intensification: (11) Jacob (1968: 179, 188–190): thɛ̀ək ‘kick’ → keh ‘scratch’ → baoh ‘sweep, brush’ → ɲɔə(r) ‘shake, tremble’ →

tə-thɛ̀ək ‘to kick repeatedly’ kə-keh ‘to scratch constantly’ bə-baoh ‘to brush or caress lightly and frequently’ ɲə-ɲɔə(r) ‘tremble, dither’

Alliteration, rhyme and chiming are used very frequently in Khmer. In alliteration, the syllable onset (C or CC) or the onset plus the nuclear vowel (CV(V) or CCV(V)) are repeated. Rhyming is defined by the repetition of the nucleus plus, if present, the coda (-VC, -VV, -VVC). Chiming is characterized by the repetition of the onset (C or CC) and the coda (C). (12) Alliteration: rùːəp-rùːəm ‘join, unite, consolidate’, khɔ̀ ət-khɛ̀əŋ ‘prevent [from: khɔ̀ ət ‘forbid, stop’, khɛ̀əŋ ‘prevent, stop’]’, thŋùən-thŋɔ̀ ː ‘heavy, serious, grave’, srənok-srənaːn ‘easy, comfortable’, lhec-lhiəŋ ‘be indistinct [of sound]’. (13) Rhymes: mɛ̀ːn-tɛ̀ːn ‘really’, slòːt-bòːt ‘nice, kind, agreeable’, rɪ̀ːəj-mɪ̀ːəj ‘mentally confused’, cɯ̀ ən-lɯ̀ ən ‘make progress’ [‘speedy’]’, caeŋ-maeŋ ‘snooty’. (14) Chiming Jacob (1968: 189): srəmoːm-srəmaːm ‘to be hairy to the point of untidiness [srəmoːm ‘hairy, bearded’, srəmaːm ‘sloppy, dirty’]’, kaeŋ-kaoŋ ‘insolent, impolite, pretentious [kaeŋ ‘angular, angle of foot or arm’, kaoŋ ‘rude, impolite’]’, kɔntreːk-kɔntraːk ‘to hang in tatters’.

Modern Khmer

687

Finally, there is quite an impressive inventory of words, which only occur in the form of total reduplication: (15) Total reduplication (Jacob 1968: 189): ŋoːv-ŋoːv ‘to make a pleading and whining sound’, khvɤ̀ːc-khvɤ̀ːc ‘limp, hobble’, pdaom-pdɑom ‘to bow and scrape’, ʔɔntrɤ̀ːt-ʔɔntrɤ̀ːt ‘to step on tiptoe, walk stealthily’.

4

Phrase and Clause Structure

4.1 Simple Sentences Khmer basic word order is SV/AVP as is illustrated by the following example: (16) Simple sentence, AVP ( Jacob 1968: 59): thvɤ̀ː mhoːp. a. mdaːj mother make food ‘Mother is making the meal.’

Simple sentences of the type AVP with overt A and P are not very frequent in any type of text, spoken and written. Since arguments that can be inferred from context can be omitted without any overt trace of their properties (radical pro-drop), utterances like (16b-d) are perfectly grammatical in a context in which the omitted argument is known or can be retrieved from context7 (cf. §§4.3, 6.2): (16) b. mdaːj thvɤ̀ː c. thvɤ̀ː mhoːp. d. thvɤ̀ː mother make make food make ‘Mother makes [the meal].’ ‘[She] is making the meal.’ ‘[She] makes [it].’

The word order in (16a) also applies to the copula. In this case, the subject is in the S position and the predicate complement appears in the object position. There are two copula forms in Khmer. One of them is the verb cɪ̀ːə ‘be’, the other one is the identificational particle kɯ̀ ː ‘be’. The former has characterizational

7 Note that prior knowledge or retrievability are important. Empty argument positions do not take the function of indefinite pronouns. Thus, (16c) does not mean ‘someone is making the meal’.

688

Bisang

function (17), while the latter is used for identification (18). It is also possible to form equational predications by the mere juxtaposition of two nouns (19). (17) Haiman (2011: 212): phɪ̀ːəsaː cɪ̀ːə kaː sɔmdaeŋ cɤt kùmnɯ̀ ːt krùp language be nml express heart thought all ‘Language is the expression of all emotions and ideas.’ (18) Jacob (1968: 141): nɪ̀ːh kɯ̀ ː ʔvɤj? — prox idf what ‘What is this?—It’s coral.’

nɪ̀ːh prox

kɯ̀ ː idf

jaːŋ. kind

phkaː-thmɔː. flower-stone

(19) Jacob (1968: 141): kɔ̀ ət mèː-maːj. 3 widow ‘She is a widow.’

Khmer has a sentence-initial topic position. The topic can be a subject, an object or an adjunct. In (20), it is an object: (20) Haiman (2011: 211): phtɛ̀əh thmɤj nɪ̀h kèː khan cɪ̀ːə bɤj house new prox 3 divide be three ‘This new house, they divided into three rooms.’

bɔntùp. room

Topics can be additionally marked by constructions like [cɔmnaek (ʔae) TOP (vèɲ)], [cɔmpùəh (ʔae) TOP (vèɲ)], [rɪ̀ː (ʔae) TOP] and [TOP ʔae neh].8 8 The markers cɔmnaek and cɔmpùəh are formed by the infix -vmn/vN- (cf. example (10) in §3.2). The first marker is derived from the verb caek ‘divide’ and also functions as a noun with the meaning of ‘section, part, division’. The second marker is based on the preposition chpùəh ‘towards’. In its derived from cɔmpùəh, it is also used as a preposition with a broader functional range that goes beyond spatial orientation ‘towards, vis-à-vis, with regard to’. Similarly, the markers ʔae, vèɲ and neh stand for spatial orientation with a functional extension to topicalization. ʔae is a locative preposition (cf. §5.3). vèɲ occurs after verbs of movement and means ‘back’ as in tɤ̀u vèɲ [go back] ‘go back’. It also means ‘again, instead’ (cf. example (64)). In discourse, it is used with the meaning of ‘on the other hand, coming

689

Modern Khmer

The information provided by a simple verb can be extended by additional markers in the preverbal and in the postverbal position. The information expressed in these positions is provided by tense-aspect-mood markers (TAM), direction markers/directional verbs (DIR) and adjuncts of various types (locatives, instrumentals, comitatives, etc.). The schema in (21) indicates the slots in which these functions are expressed. (21) TAM1—Verb—DIR—adjuncts—TAM2

Many of these markers are verbs which are reanalysed as grammatical markers due to their position relative to the main verb (cf. verb serialization and grammaticalization, Bisang 1996). The markers expressing directions are always verbs (directional verbs; §5.2.4). Adjuncts can be introduced by verbs or by prepositions (cf. §5.3). Verbs in adpositional function are called coverbs (cov). Many TAM markers are also derived from verbs (cf. (23) below, §4.3 on baːn ‘come to have’ and §5.2.7). The following example with the main verb phɲaə ‘send’ illustrates the schema in (21) with a TAM marker, a directional verb and a coverb: (22) kɔ̀ ət baːn phɲaə prak mɔ̀ːk 3 get send money come ‘He was able to send money to me.’

ʔaoj give

khɲom. 1

In addition to the sentential extensions given in (21), two other structures of simple sentences need to be mentioned, i.e., the resultative construction and the ‘take’-construction. The resultative construction (Ehrman 1972: 21–23) consists of two verbs, V1 and V2. The second verb of the construction (V2) belongs to a closed class of verbs that express a result reached by V1 (also cf. “completive verb” in Huffman 1967: 171–175 and “second position main verbs” in Jacob 1968: 119). In comparison to Chinese resultative constructions, the number of V2 is relatively limited (less than 20; Bisang 1992: 401–402). Some of the most common V2 are given in Table 7.5:

back to the point of an argument’. The marker nèh is a deictic meaning ‘this, here’. Finally, rɪ̀ː only has the function of a topic marker.

690

Bisang

Table 7.5 Some resultative constructions V2

Resultative construction (V1–V2) Meaning of V1–V2

khɤ̀ːɲ ‘see’

rɔ̀ ːk ‘look for’—khɤ̀ːɲ nɯ̀ k ‘rack one’s brain’—khɤ̀ːɲ deɲ ‘chase’—tɔ̀ ən

‘find’ ‘realize, come to a conclusion’ ‘chase down, chase successfully’

sdap ‘listen’—lɯ̀ ː rɪ̀ən ‘learn’—ceh kɯ̀ t ‘think’—tròːv / khoh smaːn ‘guess’—tròːv / khoh

‘listen intentionally’ ‘have learned, know by learning’ ‘think right/wrong’ ‘guess right/wrong’

tɔ̀ ən ‘be in time, catch up with’ lɯ̀ ː ‘hear’ ceh ‘know’ tròːv ‘be right’, khoh ‘be wrong’

The resultative construction is the source of many postverbal tense-aspectmood markers (on preverbal markers, cf. §5.2.7). The most prominent ones are listed here: (23) baːn ‘be able, succeed in (come to have, cf. §4.3 on its function in the preverbal position)’; kaət ‘be physically able (be born, arise, happen)’; cɔp ‘final completion of an action’; srec ‘temporary completion of an action’; rùːəc ‘successful completion of an action, physical ability (finish, accomplish)’; haəj ‘completion of an action, change of state (perfect), beginning of an action’.

The ‘take’-construction is formed by the verb jɔ̀ ːk ‘take’ plus a non-obligatory noun phrase plus an additional non-obligatory directional verb that specifies the direction of the take-action away from the speaker (tɤːu ‘go’) or towards her/him (mɔ̀ ːk ‘come’): (24) jɔ̀ ːk ‘take’—(NP)—DIR—Verb

This is illustrated by the following example: (25) Khmer (Bisang 1992: 436): daə sɔmdau tɤːu walk towards go

tùː-tɯ̀ k-kɔːk cupboard-water-congealed

baək, open

691

Modern Khmer

jɔ̀ːk tɯ̀ k-kroːc mɔ̀ːk cak kaev pɪ̀ː take water-lemon come pour glass two ‘[She] went to the fridge, opened it and poured lemon-juice into two glasses.’

The structures discussed so far have always been presented in the function of affirmative declaratives. In the remainder of this section, it will be briefly illustrated how they are negated and how they are combined with other illocutionary forces. Negation is expressed by the marker mɯ̀ n, which immediately precedes the verb (including stative verbs) or the preverbal TAM markers. A more colloquial alternative is ʔɔt. In written style, we also find pùm. The scope of the negation can be narrow (limited to the verb) or it can go beyond the verb up to the clause. The negative form of the verb mɪ̀ːən ‘exist, there is, have’ is k-mɪ̀ːən ‘have not, there isn’t’. For nouns, Khmer uses a special negative form of the copula cɪ̀ːə, which is mɯ̀ n mɛ̀ːn cɪ̀ːə ‘not to be [neg be.true be]’. The copula kɯ̀ ː cannot be negated. If resultative constructions are negated, the negative marker must be between V1 and V2: (26) lòːk smaːn mɯ̀ n tròːv. 2 guess neg right ‘You didn’t guess right.’

In addition to the preverbal negators, there is the clause-final negator tèː, which either occurs alone in colloquial informal speach or together with one of the preverbal negators. In the latter case, it may have a certain emphatic effect. (27) mdaːj mɯ̀ n thvɤ̀ː mhoːp mother neg make meal ‘Mother is not making the meal.’

(tèː). neg

Polar questions are minimally expressed by rising (or falling) intonation (cf. §2.3). The most common marker of polar questions is the marker tèː, which is also used in negation. It is clearly distinguished from negation if there is a sentence-final high pitch. The question marker tèː can be combined with additional sentence-final particles to form more specific questions. The combination of rɯ̀ ː ‘or’ (§4.2.1) followed by tèː (rɯ̀ ː tèː) has the same meaning as tèː alone. In the opposite order, tèː rɯ̀ ː marks questions which presuppose an affirmative answer. If a negative answer is expected the sequence tèː dɤŋ [neg know] is used. Alternative

692

Bisang

question words are ʔɔt ‘negation (colloquial)’ and nɤ̀u ‘stay, be at’.9 Both of these question words can also be combined with rɯ̀ ː ‘or’. If the word rɯ̀ ː ‘or’ occurs alone in clause-final position with high pitch the question is less forceful and more tentative. Imperatives are unmarked in their most simple and straightforward form. The subject may be present. Negative imperatives are marked by kom (28). If there is a subject it precedes kom. (28) Huffman (1970b: 174): kom dak mtèːh craən pèːk proh put chili much too.much ‘Don’t put too much chili on it.’

nɤh. emph

There are various ways for expressing imperatives more explicitly. The marker coː is used to introduce orders to addressees of inferior rank. The sentencefinal particle sɤn marks the necessity of immediate action. For more polite orders, the verb soːm ‘ask, ask a favour’ is used, sometimes in combination with ʔɔɲcɤ̀ːɲ ‘invite’ in the meaning of ‘please’. Finally, the directional verbs tɤ̀u ‘go’ and coh ‘descend’ are used sentence-finally as invitational markers. Example (29) combines invitational coh with the verbs soːm ‘ask’ and ʔɔɲcɤ̀ːɲ ‘invite’: (29) Bisang (1992: 439): soːm taː ʔɔɲcɤ̀ːɲ tɤːu ask grandfather invite go ‘Please, you go and thank him.’

ʔɔːkùn thank

kèː 3

coh. imp

4.2 Complex Sentences 4.2.1 Coordination and Chaining The most common markers of coordination are haəj-nɯ̀ ŋ or nɯ̀ ŋ ‘and’ and rɯ̀ ː ‘or’. Disjunctive coordination in the sense of ‘but’ is expressed by the clauseinitial markers tae, pɔntae and kɔː pɔntae. Moreover, Khmer has a rich inventory of symmetric or parallel constructions in which both of the coordinated clauses are marked by the same clause-final conjunction. These conjunctions are kɔː-baːn or kɔː-daoj ‘either . . . or (emphasizes alternative possibilities)’ and kɔː-mɪ̀ːən ‘either . . . or (in a context of listed definite alternatives [mɪ̀ːən ‘exist, there is, have’])’. 9 nɤ̀u ‘stay, be at’ in final position means ‘yet?’. In response to the question haəj rɯ̀ ː nɤːu? ‘(action completed) already?,’ nɤːu means ‘not yet’.

693

Modern Khmer (30) Huffman (1970b: 191): jɤ̀ːŋ tɤ̀u ɲam baːj kɔː-baːn, tɤ̀u 1pl go eat rice tcl-get go ‘We can either go eat or go to a movie.’

mɤ̀ːl see

kon film

kɔː-baːn. tcl-get

The unmarked juxtaposition of clauses, which is extremely common in Khmer, can also express coordination: (31) Jacob (1968: 145): vɪ̀ːə cèː vaj haːn koːn. 3 rebuke beat restrict children ‘He rebukes, beats and restricts the children.’

The unmarked juxtaposition of clauses is frequently used in chaining. In this case, the individual events are arranged in their temporal order. The following example consists of three clausal units, each consisting of more than one verb (the units are separated by | in the glosses). The first two verbs form a directional construction, the third and the fourth verb form a resultative construction and the last unit is a ‘take’-construction (cf. §4.1): (32) Bisang (1992: 435): tɤ̀ːp stùh tɤ̀u deɲ cap jɔ̀ ːk mɔ̀ ːk then jump.up go | pursue catch | take come ‘[She] then jumped up, caught [the duckling] and hugged [it].’

ʔaop. hug

Situation: A young lady is missing her duckling that was taken by one of her lovers. She is looking for it and then sees the duckling. As in the case of coordination, there are many overt markers for sequences of events. The most common ones are clause-final haəj and rùːəc, which express completion (cf. (23)). In the initial position of the following clause, we find tɤ̀ːp ‘then’. (33) Gorgoniev (1966: 155): lòːk.pùː ʔoc baːrɤj rùːəc, tɤ̀ːp bɔɲcɛ̀ək. uncle light cigarette finish then explain ‘The uncle will light a cigarette and [only] then he will give an explanation.’

4.2.2 Relative Clauses Relative clauses follow their head nouns and are optionally introduced by the relative marker dael. The conditions that determine the use of dael are not fully

694

Bisang

understood. It seems that it is likely to occur in long relative clauses or in relative clauses which are separated from the head noun by another modifier (34). The function of the head noun itself remains unmarked within the relative clause in most instances ((34) and (35)) but it is possible to have an overt pronoun in more colloquial contexts (36): (34) Relative clause with subject coreference, separated from head noun: khɲom mɯ̀ n skɔ̀ əl srɤj pɪ̀ː nɛ̀ək nɪ̀h [dael nɪ̀jɪ̀ːəj 1 neg know woman two cl prox rel talk

knɪ̀ːə recip

nɤ̀u mùk saːlaːrɪ̀ən]. be.at front school ‘I don’t know these two women who are talking in front of the school.’ (35) Relative clause with object coreference (Khin 1999: 548): mɛ̀əthjòːbaːj [dael chmùːəɲ kɔpbaːh coːl-cɤt praə means rel trader cotton like-heart use ‘the means [of transport] that the cotton traders like to use most]’

cɪ̀ːəŋ-kèː] most-3pl

(36) Relative clause with coreferential pronoun (Haiman 2011: 315): ʔaːɲɪ̀ː [dael khɲom ʔɔt cɔːŋ vɪ̀ːə tèː. pn rel 1 neg tether 3 neg ‘[the water buffalo] Anji, whom I had not tethered . . .’

The unmarkedness of the head noun within the relative clause does not depend on argumenthood. Head nouns with adjunct or possessor status can remain equally unmarked within the relative clause: (37) Khin (1999: 293–294): a. mənùh [dael khɲom nɪ̀jɪ̀ːəj] man rel 1 talk ‘the man about whom I talk’ b. phtɛ̀əh [dael khɲom kùəh house rel 1 knock ‘the house at whose door I knock’

tvɪ̀ːə] door

4.2.3 Complement Clauses Khmer has the following three complementizers, each of them is also used as a verb: thaː ‘say, quotative marker’, cɪ̀ːə ‘be’ and ʔaoj ‘give, causative marker,

695

Modern Khmer

benefactive marker’. Due to these other functions, they contribute their own meaning. In addition, there are many contexts in which their status oscillates between complementizer and verb. The marker thaː is employed with verbs of speaking, thinking, knowing and perception. The complementizer thaː marks its clause as factual (Haiman 2011: 304–305). (38) Jacob (1968: 100): krùː prap khɲom thaː kmeːŋ nùh prəlɔːŋ cɔ̀ əp. teacher tell 1 say boy dist sit.for.examination pass ‘The teacher told me that the boy will pass the examination.’

The complementizer cɪ̀ːə is used with the verb dɤŋ ‘know’ in its negated form and if there is no overt embedded subject: (39) Haiman (2011: 305): mɯ̀ n dɤŋ cɪ̀ːə mɔ̀ ːk pɪ̀ː haet neg know be come from reason ‘[I] don’t know where this comes from at all.’

ʔvɤj what

sɔh. at.all

The marker ʔaoj ‘give’ made its way into the complementizer function from its causative function (cf. §5.2.5). It occurs with verbs that also have a causative meaning (e.g. bɔŋkɔ̀ əp ‘order’, prap ‘tell’, soːm ‘ask s.o. to do sth.’ and nɔ̀ əm ‘take a person to do sth.’) and with some other verbs such as cɔŋ ‘want’ and prɔ̀ ːm ‘agree’. The function of ʔaoj clearly differs from thaː inasmuch as the proposition it introduces is not seen as a fact (compare (40) with (38)): (40) Bisang (1992: 442): kɔ̀ ət prap tɤ̀u 3 tell go

kmeːŋ boy

ʔaoj give

jɔ̀ ːk take

tɤ̀u go

cùːn cɔmpùəh give to

nɛ̀ək-srɤj bɔndoːl. person-woman pn ‘He told the boy that he take [the parcel] and deliver it to Miss Bondol.’

4.2.4 Adverbial Clauses The expression of adverbial subordination reaches from no marking at all (also cf. §4.2.1) up to the use of very elaborate adverbial subordinators. Since the adverbial subordinators are presented in more detail in §5.4, this section will only offer some examples. In the first two examples that illustrate c onditionality

696

Bisang

(41) and causality (42), the adverbial subordinator occurs clause-initially. In some cases, the clause-initial marker is further specified by an additional clause-final marker (43). Finally, some markers occur in the main clause. One of them is baːn-cɪ̀ːə. In (44), it occurs without any marking in the other clause. In other instances, the other clause can have a causal conjunction. (41) Haiman (2011: 316): baə ʔaoj daə taːm thɔ̀ əmmedaː, prəhael mɯ̀ n if give walk follow normal perhaps neg ‘If [they] had made me walk normally, I might not arrive at all.’

dɔl arrive

tèː. neg

(42) Huffman (1970b: 115): khɲom mɯ̀ n tətɯ̀ k pɔnmaːn tèː, prùəh khɲom mɪ̀ːən ʔaːv-phlɪ̀əŋ. 1 neg wet how.much neg because 1 exist coat-rain ‘I didn’t get so wet, because I have a raincoat.’ (43) Jacob (1968: 99): tùəh koːn even.if child

khɲom kaː 1 marry

kɔː-daoj, tcl-follow

khɲom 1

nɯ̀ ŋ fut

nɤ̀u stay

phtɛ̀əh nɪ̀h cɪ̀ːə dəraːp. house prox be always ‘Even if my daughter were to get married I should always live in this house.’ (44) Huffman (1970b: 218): khɲom ʔɔh baːrɤj haəj, baːn-cɪ̀ːə 1 be.out.of cigarette nsit get-be ‘I am out of cigarettes, so I’m going to the market.’

khɲom 1

tɤ̀u go

phsaː. market

4.3 Syntax and Pragmatics Khmer grammar is characterized by the vast space it leaves to pragmatic inference. This is manifested in the extent to which information on grammatical categories can be omitted and in the multifunctionality of its markers even if they are highly grammaticalized (Bisang 2009, 2011). The omission of information expressed by grammatical markers is mentioned in §4.1 on radical prodrop (16b–c), in §5.2.7 on the lack of obligatory TAM marking and in §4.2 on the option of dropping coordinative markers, complementizers and adverbial markers in a given context (juxtaposition). For that reason, only multifunctionality will be discussed in this section.

Modern Khmer

697

Multifunctionality is a significant property of many grammatical markers. In many instances, the different functions are clearly determined by different syntactic positions. Thus, the verb ʔaoj ‘give, allow/let’ can have the functions of a dative/benefactive marker (22), a causative verb (§5.2.5), a complementizer (§4.2.3) and an adverbial subordinator marking manner or purpose (§4.2.4). The verb troːv ‘go along the correct path, hit the target’ and its five functions of necessity, conformity, detrimental, passive and touch has recently been discussed by Thach & Paillard (2009). In other instances, the interpretation only depends on context. A good example is the verb baːn ‘come to have’ (Bisang 1996, 2011, Enfield 2003). The verb baːn occurs preverbally and postverbally. In its preverbal position, it can have the three functions of (i) ability and permission, (ii) the truth or factuality of an event in contrast to a given presupposition and (iii) past (Bisang 2011). Thus, a simple sentence like the following is open to no less than the following three interpretations (45 a–c): (45) khɲom baːn tɤ̀u phsaː. I get go market a. ‘I was able/allowed to go to the market.’ (ability/permission) b. ‘I went to the market.’ (past) c. ‘I did go to the market.’ (against the presupposition that I didn’t.)

4.4 Noun Phrases Khmer noun phrases are prominently head-initial. Thus, stative verbs, numerals and numeral-classifier constituents [num+cl], demonstratives, possessors and other nominal modifiers as well as relative clauses follow the noun (for relative clauses, cf. §4.2.2). Stative verbs (cf. §5.2.1) in the function of modifiers cannot be strictly distinguished from relative clauses, since relative clauses may also follow their head without overt marking. In fact, the relative marker dael may be used in cases of contrast. (46) Noun plus stative verb: phtɛ̀əh thom house big ‘a big house’

Numerals and the constituent consisting of a numeral plus a classifier (cf. §5.1.3) normally follow the noun (47). As Haiman (2011: 150–151) reports, [num+cl] constituents may also precede it.

698

Bisang

(47) baːrɤj cigarette ‘two cigarettes’

pɪ̀ː two

daəm clf

Demonstratives (nɪ̀ːh ‘this’ and nùh ‘that’) follow the noun: (48) phtɛ̀əh nùh house dist ‘that house’

Possessors may follow their head without overt marking or with the possession marker rəbɔh, which is derived by the infix -b- (cf. (8)) from the verb rùəh ‘live, be living, remain’. As a noun, it means ‘belonging, thing, object’: (49) phtɛ̀əh (rəbɔh) ʔoːpùk-mdaːj house poss father-mother ‘the house of my parents’

khɲom 1

In the case of multiple determination, there are various word-order options. There is a general tendency for shorter constituents to occur before longer ones. An exception is the demonstrative, which always concludes the noun phrase. In general, Huffman’s (1967: 214) word-order rule (50) is still very pertinent. It is illustrated by (51). (50) Noun—single lexeme attribute (often a stative verb)—numeral (+classifier)— phrasal attribute (possessor, relative clause)—demonstrative (51) Huffman (1967: 214): bɔːŋ.srɤj lʔɔː bɤj nɛ̀ək rəbɔh older.sister pretty three clf poss ‘those three pretty older sisters of mine’

5

khɲom 1

nùh dist

Word Classes

5.1 Nouns 5.1.1 Common Nouns Khmer nouns consist of a major syllable or a minor syllable plus a major syllable. Apart from the derivational affixes discussed in §3.2, they show no morphology. Thus, there is no inflectional marking of categories like number

Modern Khmer

699

(plural), case, definiteness or gender. Plural in the sense of collective plural can be expressed optionally on nouns denoting humans by the preposed nominal element pùːək ‘group’ as in pùːək-krùː ‘the teachers [presupposed as forming a group]’. Indefinite plural is optionally marked by the postnominal markers khlɑh ‘some’, definite plural by tɛ̀əŋ-laːj ‘all’ as in krəbɤj khlah ‘some buffalos’ and krəbɤj tɛ̀əŋ-laːj ‘the buffalos [all of a presupposed set]’, respectively. Finally, plurality can be expressed by reduplication, either by the repetition of the noun or by the repetition of its modifier (koːn toːc-toːc [child-small-RED] ‘small children’. Case is expressed by word order (cf. §4.1) or by prepositions and coverbs (§5.3). Definiteness can be marked optionally by the demonstrative nùh ‘that’, indefiniteness is expressed by the numeral mùːəj ‘one’ plus classifier (if required). Gender is only expressed lexically by nominals that follow the main noun—there is no grammatical category of gender in Khmer. The nominals proh ‘male’ and srɤj ‘woman’ are used for humans, while animals can be specified for gender by chmòːl ‘male’ and ɲɪ̀ː ‘female’: (52) smiən-proh [secretary-male] ‘a male secretary’ tɪ̀ːə-ɲɪ̀ː [duck-female] ‘a female duck’

There is a special subset of nouns which are used very frequently as elements of the word class of pronouns or in the function of pronouns. This group includes kinship terms (53) and some other nouns (54): (53) mdaːj ‘mother’, ʔoːpùk ‘father’ prəpùən ‘wife’, pdɤj ‘husband’ bɔːŋ ‘elder sibling’,10 pʔoːn ‘younger sibling’ jɪ̀ːəj, cɪ̀ː-doːn ‘grandmother’, taː, cɪ̀ː-taː ‘grandfather’ mɪ̀ːə ‘uncle, younger than either parent’ pùː ‘uncle, even younger than either parent’ ʔom ‘uncle, older than either parent’ mɪ̀ːŋ ‘aunt, younger than either parent’ (ʔom-srɤj ‘elder sister of a parent is rarely used pronominally’) cau ‘grandchild’ (54) khɲom ‘I, slave [today lòːk-kèː is the common word for ‘slave’]’, lòːk ‘lord, monk, sir’, nɛ̀ək ‘man, human being’, srɤj ‘woman’ in lòːk-srɤj ‘Mrs, madam, Lady’ and nɛək-srɤj ‘Mrs, madam’, nɪ̀ːəŋ ‘young lady, Miss’.

10

The kinship term bɔːŋ ‘elder sibling’ was used for ‘comrade’ under the Khmer Rouge.

700

Bisang

5.1.2 Pronouns and Question Words Personal pronouns: There are only few words which exclusively index person in terms of personal pronouns. As pointed out in §5.1.1, kinship terms and some other nouns are also used pronominally. The following list (adapted and extended from Haiman 2011: 184) is of a prototypical nature, i.e., the number of words that can be used pronominally is larger. The factors that determine the selection of person markers are age, social status/rank, monkhood and royalty. Plural is distinguished from the singular form only in the first person. (55) 1sg

khɲom ‘I’, speaker is acting humble or polite ʔaɲ ‘I’, speaker is arrogant or on intimate terms with addressee ʔaːtmaː ‘I’, speaker is a monk tùːl-bɔŋkùm ‘I’ speaker addressing royalty,

1pl:

jɤ̀ːŋ ‘we’

2

nɛ̀ək ‘you’, addressee is younger or of lower status nɛ̀ək-srɤj ‘you’, addressee is a woman of middle class ʔaeŋ ‘you’, addressee is of much lower status lòːk ‘you’, addressee is older or of higher status lòːk-srɤj ‘you’, addressee is a married female of higher status

3

kɔ̀ ət ‘he’, referent is respected other vɪ̀ːə ‘he’, referent is disrespected other kèː ‘he’, referent may be any other prɛ̀əh-ʔɔŋ ‘he’, referent is royalty

If nouns are used pronominally they do not stand for a single value of person. All kinship terms mentioned under (53) can express first, second or third person, depending on who is addressing or talking about whom in a given speech act. Moreover, the pronominal use of kinship terms is not strictly limited to relatives. Generally speaking, persons with whom a certain familiarity is established by the situation may also be addressed by these terms. A good example of the use of kinship terms in pronominal function is found in the last four lines of the folk tale presented in §7. In this story, the tiger uses the noun taː ‘grandfather’ for referring to himself as ‘I’ and cau ‘grandchild’ for referring to the little girl in the function of the addressee (for some more details,

Modern Khmer

701

cf. Jacob 1968: 156–170, Ehrman 1972: 92–98, Khin 1999: 281–292, Haiman 2011: 183–196). Other pronouns: The reflexive pronouns are khluːən, ʔaeŋ and khluːən-ʔaeŋ ‘self’. The reciprocal marker is knɪ̀ːə ‘each other’. It follows immediately after the verb. Question words: Khmer has the following question words: (56) Noun phrases: Location: Time (Present and future): Time (Past): Quantity: Manner: Reason:

nɛ̀ək-naː ‘who (lit. man-which)’, ʔɤj/ʔvɤj/sʔɤj ‘what’, naː ‘which’ naː ‘where’, nɤ̀u-naː ‘where [cov-where]’, ʔae-naː ‘[prep-where]’ ʔɔŋkal ‘when (colloquial)’, kaːl-naː ‘when [time-which]’ pɪ̀ː ʔɔŋkal ‘when (colloquial) [from when]’, pɪ̀ː-kaːl-naː ‘when [from-time-which]’; ponmaːn ‘how much, how many’; mec ‘how’, jaːŋ-mec ‘how [manner-how]’, doːcmdec ‘how [be.like-how]’; mec ‘how come, why’ plus many more elaborate forms: mdec, mec-kɔː, mdec-kɔː, haet-ʔvɤj-baːn-cɪ̀ːə ‘why, for what reason [reason-what-then]’.

The question words standing for noun phrases are located in situ. They can optionally be marked for singular with mùːəj ‘one’ or khlɑh ‘some’ for singular vs. plural as in naː-mùːəj ‘which one’ and ʔvɤj-mùːəj ‘what (singular)’ vs. naːkhlah ‘which ones’ and ʔvɤj-khlah ‘what (plural)’. If the question word mec occurs in an adverbial position, it means ‘how’. In clause initial position, it typically means ‘why’. 5.1.3 Measure Words and Quantity Words Khmer has measure words as they are found in all languages of the world (e.g. for capacity, weight, space as well as for groups of things) and it has numeral classifiers, a special class of words which are used with numerals. Numeral classifiers individuate nominal concepts in order to make them countable (Bisang 1999). Word order in numeral-classifier constructions is [noun-numeralclassifier] (cf. (47) and (51)). Compared to other Southeast Asian languages, the inventory of numeral classifiers is modest, some are listed below:

702

Bisang

(57) For humans: nɛ̀ək (ordinary people), prɛ̀əh-ʔɔŋ (kings and members of the royal family), ʔɔŋ (clergy), rùːp (people from a philosophical point of view, characters in books); For most animals and for books: kbaːl ‘head’; For fish: kɔntùj ‘tail’; For elephants: khsae ‘rope’; For trees and cigarettes: daəm ‘trunk’; For houses/buildings: khnɔːŋ ‘back’ (cf. Thai lǎŋ ‘back’ in the same function); For crops and seeds: krɔ̀ əp ‘seed, pill’; For sheets (of paper): sɔnlɤk [infixed form of slɤk ‘leaf’]’.

The status of the classifier is relatively weak, i.e., it is not obligatory except for humans. For non-humans, the form with no classifier crùːk pɪ̀ː [pig two] ‘two pigs’ is more common than the one with classifier crùːk pɪ̀ː kbaːl [pig two cl] ‘two pigs’. 5.1.4 Names and Terms of Address The use of kinship terms for addressing people has been described in §5.1.2 on pronouns. In addition to kinship terms, proper names and titles can be used for the same purpose. Titles representing the relative social position are lòːk ‘Mr’ in contexts of formal address (people of higher status, strangers) and nɛ̀ək ‘Mr’ in more familiar contexts. For women, the terms lòːk-srɤj ‘Mrs, madam’, nɛ̀ək-srɤj ‘Mrs’ and and nɪ̀ːəŋ ‘Miss’ are used. Proper names follow the title as in lòːk sɔmrɤt ‘Mr. Somret’. There is a tendency to use somewhat more familiar terms with women than with men. Thus, nɛ̀ək-srɤj may be equal to lòːk in a relatively neutral situation (e.g. addressing a stranger in the street). The noun lòːk-srɤj ‘Mrs, madam’ is used with married women of higher status. 5.2 Verbs 5.2.1 Intransitive Verbs The group of intransitive verbs consists of stative verbs, other non-dynamic verbs with a patient-like argument and dynamic intransitive verbs. Stative verbs are the only intransitive verbs that can be combined with the particle nɑh ‘very’ (Huffman 1967: 164). Khmer has a large number of stative verbs, among them thom ‘big’, toːc ‘small’, lʔɔː ‘good, nice, beautiful’ and ʔaːkrɔk ‘bad’. Color words also belong to this category: khmau ‘black’, sɔː ‘white’, krəhɔːm ‘red’, bajtɔːŋ ‘green’, khiəu ‘blue’ and lɯ̀ əŋ ‘yellow’. Gradable stative verbs form their comparatives with cɪ̀ːəŋ ‘than’: kɔ̀ ət khpùəh cɪ̀ːəŋ khɲom [he tall than I] ‘He is taller than I’. The superlative is expressed by cɪ̀ːəŋ-kèː

Modern Khmer

703

[than someone/anyone]: kɔ̀ ət khpùəh cɪ̀ːəŋ kèː [he tall than someone/anyone] ‘He is the tallest.’ The directional verbs mentioned in §5.2.4 can take the goal noun phrase without additional marking as in tɤ̀u phsaː [go market] ‘go to the market’, mɔ̀ ːk phsaː [come market] ‘come to the market’ or laəŋ phsaː [go.up market] ‘go up to the market’. Normal word order with intransitive verbs is SV. Non-dynamic verbs can also have VS as in cheh phtɛ̀əh [be.on.fire house] ‘The house is on fire’, rəbaək tvɪ̀ːə [be.open door] ‘The door is open’ and dùːəl dɔːp [fall.over jar] ‘A jar fell over’ (Jacob 1968: 109; 1978: 97). This order is also possible in some idiomatic sequences as in dɔl maoŋ [arrive hour] ‘It’s time!’. 5.2.2 Transitive Verbs With transitive verbs, word order is SV/AVP if both arguments are expressed (cf. (16a)). If there is only one argument in the subject position of a transitive verb, it can either be an agent if it is animate (58a) or the patient if it is inanimate (58b).11 In the latter case, we get a passive-like construction. (58) Jacob (1978: 98): a. ʔoːpùk sɔŋ. father build ‘Father builds [it, e.g. a house].’ b. phtɛ̀əh sɔŋ lɤ̀ː cɔntùəl. house build on stilt ‘The house is built on stilts.’

Intransitive verbs, such as pèɲ ‘(be or become) full’ can occur in what looks like transitive constructions, as in (59a), but the actual semantic roles of the arguments are not specified. While srae ‘rice field’ occupies the typical position of the P argument, it can also be seen as location (‘water is full in the rice field’). The normal intransitive use of the same verb is shown in (59b) (59) Jacob (1978: 95, 102): a. tɯ̀ k pèɲ srae. water full rice.field ‘The water is filling the rice field.’ 11 Since fronting into the topic position is possible, the nouns in (58a, b) can also be interpreted as topics.

704

Bisang b.

pɛ̀ːŋ nɪ̀ːh pèɲ cup prox full ‘This cup is full.’

haəj. nsit

5.2.3 Ditransitive Verbs There is a small set of ditransitive verbs. The most common ones are ʔaoj ‘give’ and its more polite equivalent of cùːn ‘give, offer’. The other ditransitive verbs are: phɲaə ‘send’, bɔŋrɪ̀ːən ‘teach’, bɔŋhaːɲ ‘show’ and cɔŋʔol ‘to point at with the finger’. There are strict word-order rules. The agent is in the preverbal position. In the postverbal position, the patient/theme precedes the recipient, the structure is generally AVTG. (60) khɲom ʔaoj luj 1 give money ‘I give him money.’

kòət. 3

5.2.4 Directional Verbs Khmer has six directional verbs, which can be divided into three pairs consisting of two verbs with opposite meaning (Gorgoniev 1963: 100–108; 1966: 192– 199). The verbs of the first pair (V1) express vertical movement, those of the second pair (V2) describe container-related movement and those of the third pair (V3) refer to horizontal movement to or from the speaker or a presupposed reference point. Table 7.6 Khmer directional verbs V1

V2

V3

coh ‘go down’ laəŋ ‘go up’

ceɲ ‘go out from, leave’ coːl ‘enter, move into’

tɤ̀u ‘go’ mɔ̀ ːk ‘come’

Directional verbs can be used as independent verbs, as directional verbs denoting the direction of the main verb (61) and as coverbs (cf. (69)). (61) kɔ̀ ət kraok laəŋ. 3 get.up move.up ‘He is rising / getting up.’

705

Modern Khmer

Individual verbs can have up to three directional verbs whose word order strictly follows the one indicated in Table 7.6: (62) Bisang (1992: 403): kɔ̀ ət jɔ̀ ːk ʔɤjvan coh ceɲ mɔ̀ːk. 3 take luggage move.down move.out come ‘He takes the luggage down and out [of a room situated higher than the speaker and he moves it towards the speaker].’

5.2.5 Causative Verbs In addition to its non-productive derivational causative morphology, Khmer has two widely used causative verbs, i.e., ʔaoj ‘give, let’ and thvɤ̀ː-ʔaoj [makegive/let] ‘make that’. The verb thvɤ̀ː-ʔaoj is the most common causative verb. It can be used with any type of verb. In the following example, it is used with cɔŋ ‘want’: (63) Bisang (1992: 441): thvɤ̀ː-ʔaoj khɲom ceh-tae cɔŋ nɪ̀ːjɪ̀ːəj cɔŋ khɤ̀ːɲ heːŋ ʔaeŋ. make-give 1 constantly want speak want see pn 2 ‘[This] made me constantly want to talk and to see you, Heng.’

The verb ʔaoj ‘give, let’ presupposes a causee that has potential control over an event. It is thus limited to dynamic verbs. The specific interpretation of control depends on context. In (64), the causee has the ability to remove the boxes but does not do so for whatever reasons. (64) Bisang (1992: 440): ʔaː cej, ʔaeŋ exclam pn 2

doːcmdec how

kɔː tcl

ʔaoj give

thlɛ̀ək fall

prəʔɔp box

ʔɔh all

mɯ̀ n rɤ̀ːh laəŋ veɲ? neg collect move.up again ‘Chey, why did you drop all the boxes without recollecting them?’

5.2.6 Existential Verbs The existential verb is mɪ̀ːən ‘exist, there is, have’ (negative: kmɪ̀ːən ‘there is not’). While this verb exhibits AVP order in possessive constructions, it exhibits VS, or verb-first order in existential constructions. It can additionally be preceded by a topic:

706 (65) Haiman (2011: 208): krɪ̀ːə nùh mɪ̀ːən pdɤj prəpùən time dist exist husband wife ‘Once, there was a poor poor couple.’

Bisang

kɔmsɔt destitute

tùrəkùət. wretched

5.2.7 Preverbs and Auxiliary Verbs Khmer has a rich inventory of preverbal forms that express tense/aspect categories and various modal meanings (on postverbal markers, cf. §4.1). Some preverbal forms occur without additional marking, others are marked either by -tae or -cɪ̀ːə (for the later, cf. the copula in (17)). Some of these elements are also used as independent verbs, others only express a grammatical function. The following list is roughly divided into the two big categories of tense/aspect and modality. An additional category is provided for negated forms. If the preverbal marker is related to a lexical verb or noun, the meaning of that verb/ noun is added in brackets. In some cases, the suffixes -tae and -cɪ̀ːə are optional. This is indicated by brackets. (66) List of preverbals in Khmer: a. Tense/aspect: nɯ̀ ŋ ‘future’ stɤ̀ː-tae (nɯ̀ ŋ) ‘on the point of’ tɤ̀ːp-tae ‘have just, just a moment ago (tɤ̀ːp ‘then, next’)’ thlɔ̀ əp ‘habitual past (be used to)’ taeŋ-tae ‘usually, habitually’ kɔmpùŋ(-tae) ‘action in progress at reference time’ ceh-tae ‘keep on, always (know)’ dael ‘experiential aspect’ nɤ̀u(-tae) ‘still, emphasis of incompletion (live, be at)’ kan-tae, rɯ̀ t-tae ‘get, incremental development of a state of affairs (kan ‘hold’, rɯ̀ t ‘draw up tight’)’ cɯ̀ t ‘be close to, almost (be close)’ cap ‘start/begin to’ chùp ‘cease to’ b. Modals cɔŋ ‘want’ baːn ‘possibility, obligation, past, factivity (come to have; cf. §4.3)’ ʔaːc ‘be able, have the power to’ tròːv(-kaː) ‘have to, obligation’ cambac ‘be necessary, have to’ kùːə(-tae) ‘should (kùːə ‘proper, suitable’)’

707

Modern Khmer mùk-tae/mùk-cɪ̀ːə ‘probably (mùk ‘face, front’)’ prəhael-cɪ̀ːə ‘perhaps’ kùəŋ-tae ‘certainly (kùəŋ ‘stay, sit, be located [of monks]’)’ c. Negated forms: mɯ̀ n-dael ‘never (negation of experiential aspect)’ mɯ̀ n-tɔ̀ ən ‘not yet (tɔ̀ ən ‘catch up with, be in time with’)’ mɯ̀ n-soːv ‘hardly, not very [with stative verbs]’ mɯ̀ n-bac ‘don’t need to (cf. (66b) cambac)’

In addition to tense/aspect and modality, some preverbals quantify the action in the sense that they state to what extent an event has been realized or to what extent its arguments have been involved up to reference time: sot-tae ‘all, without exception’, tɔ̀ əl-tae ‘all, without exception; until’, craən-tae ‘mostly, nearly all (craən ‘many, much’)’, sɤŋ(-tae) ‘almost all, practically’ and krɔ̀ ən-tae ‘only just, hardly, barely (krɔ̀ ən ‘enough’)’. 5.3 Locational and Directional Words Location and direction are expressed by coverbs, prepositions and relational nouns. The verb for location is nɤ̀u ‘be at, live at’. It expresses locatives (67) as well as movement towards a location if it is combined with a verb of movement (68). Directions are expressed by directional verbs (cf. §5.2.4). In their coverbal function, directional verbs mark that the action of the main verb moves towards their complement noun phrase (69). Other coverbs used for expressing spatial constellations are taːm and daoj ‘follow, go along’ with the coverb meaning of ‘along’, chlɔːŋ ‘cross, go through’ with the meaning of ‘across’ (70) and dɔl ‘arrive’ with the meaning of ‘up to, as far as’. (67) Huffman (1967: 186): krəbɤj deːk nɤ̀u kraom phtɛ̀əh. buffalo lie stay under house ‘The buffalo are lying under the house.’ (68) Huffman (1970b: 158): dak ʔɤjvan nìh nɤ̀u knoŋ put luggage prox stay in ‘Put this luggage into the room, too!’ (69) kɔ̀ ət baːn phɲaə sɔmbot 3 get send letter ‘He sent a letter to [his] friend.’

tɤ̀u go

bɔntùp room

phɔːŋ. too

mɯ̀ t-sɔmlaɲ. friend

708

Bisang

(70) kɔ̀ ət dɤk-nɔ̀ əm sroːv chlɔːŋ stɯ̀ ŋ. 3 transport rice cross river ‘He transported the rice across the river.’

If the coverb dɔl ‘arrive’ is used in (69), it is clear that the letter has reached its destination. With the coverb ʔaoj ‘give’, the same action is described from the perspective of the recipient. A second set of locational and directional words has nominal properties and is called ‘relational nouns’. In some instances, relational nouns are synchronically still used as full nouns as in the case of mùk ‘face, front, kind’. In other cases, they only have reduced nominal properties. A list of relational nouns is given in (71), followed by an example: (71) lɤ̀ː ‘on, over, above’, kraom ‘under’, knoŋ ‘in, within’, kɔndaːl ‘in the middle of’, cɔnlɔh ‘between, interval’, krau ‘outside, apart from’, mùk ‘in front of’, kraoj ‘behind’, cùmveɲ ‘around’. (72) hɤp nùh nɤ̀u kraom box dist stay under ‘The box is under the bed.’

krɛ̀ː. bed

Relational nouns can be preceded by the nominal element khaːŋ ‘side’ without any significant change of meaning: khaːŋ-lɤ̀ː, khaːŋ-kraom, khaːŋ-knoŋ, khaːŋkrau, khaːŋ-mùk and khaːŋ-kraoj. Khmer has two locational/directional prepositions. They are neither related to verbs nor to nouns. One of them is ʔae ‘at, to’, the other is pɪ̀ː ‘from, since’. In contrast to the coverb nɤ̀u ‘at, to’, the preposition ʔae tends to refer to locations further away from the speaker/center of interest (Jacob 1968: 67). The preposition pɪ̀ː expresses movement away from something. Its additional meanings are ‘since’ and ‘about [talk about]’. It is also used for referring to the material something is made of. Coverbs, prepositions (prep) and relational nouns (RELN) can be combined according to the following word-order rule illustrated by (74): (73) V cov prep RELN NP (74) Huffman (1970b: 393): khɲom sɔŋ nɤ̀u kɔndaːl phùːm. saːlaː school 1 build stay middle village ‘My school is built in the middle of the village.’

Modern Khmer

709

5.4 Conjunctions/Connective Words The structure of Khmer adverbial subordinators is often very transparent and shows a remarkable degree of elaboration. Thus, many temporal clauses are introduced by the nominal elements kaːl ‘time’ and pèːl ‘time’, sometimes additionally marked by the relative-clause marker dael. The verb dɔl ‘arrive’ also means ‘until’ and ‘when [in the future]’ and it is part of the more complex adverbial subordinator dɔl-kaːl-naː [arrive-time-which.ever] ‘when at some time in the future’. The following list presents the most important adverbial subordinators (for the complemenmtizers, cf. §4.2.3, for some examples, cf. §4.2.4): (i)

Conditionals (cf. example (41)): baə ‘if’; kaːl-baə ‘if, whenever’, baə-prəsɤn-cɪ̀ːə/prəsɤn-baə ‘if by any chance’, kaːlnaː-baə/ ‘if ever, whenever’, lɤ̀ːk-tae/kom-tae ‘unless, if not’. (ii) Concessives and concessive conditionals (43): tùəh, tùəhbɤj, tùəh-cɪ̀ːə ‘although, even if’, dbɤt, dbɤt-tae ‘although’. (iii) Temporal clauses: ‘when’-clauses: kaːl, kaːl-dael, pèːl, pèːl-dael ‘when’, kaːl-naː ‘when at any time’, lùh ‘at the time when’, dɔl ‘when [in the future]’; Simultaneity: knoŋ-pèːl-dael ‘while, during the time that [inside-time-rel]’, nɤ̀u-pèːl-dael ‘while, during the time that [be.at-time-rel]’; Before: mùn-nɯ̀ ŋ ‘before [before-with]’, tùmrɔːəm/tùmrɔːəm-dɔl ‘before’; After: kraoj/kraoj-pɪ̀ː ‘after’; Until: dɔl ‘until’. Other forms are: tɔ̀ əl-tae, tɔ̀ əl-dɔl, tɔ̀ əl-tae-dɔl ‘until’; Since: taŋ-pɪ̀ː ‘since’. (iv) Cause (cf. (42)): prùəh/pɪ̀ː-prùəh ‘because’, dbɤt ‘because’. The marker baːn-cɪ̀ːə ‘the reason why, that is why’ is used as follows: (i) in the first clause, followed by prùəh/pɪ̀ː-prùəh (ii), in the second clause with or without prùəh/pɪ̀ː-prùəh in the second clause. (v) Purpose: The most common marker is formed from the verb ʔaoj ‘give, let’. Its negative forms are mɯ̀ n-ʔaoj and kom-ʔaoj ‘so that . . . not, so as not to’. Other markers are: daəmbɤj, daəmbɤj-ʔaoj ‘so that’ and daəmbɤj-mɯ̀ n-ʔaoj ‘so as not to’.

5.5 Particles Khmer has a rich system of particles. The present section will discuss sentencefinal particles, the particles corresponding to English yes and no, and the very frequently used narrative discourse particle kɔː.

710

Bisang

The sentence-final particles expressing negation, polar questions and imperative particles have been presented in §4.1. In addition to these functions, sentence-final particles express the following functions: (75) a. Concerning the facticity of an event: The event may be true: mɤ̀ːl / mɤ̀ːl-tɤ̀u ‘perhaps’ [as a verb, mɤ̀ːl means ‘look (at)’] Emphasis of the truth: mɛ̀ːn-neh / məneh ‘really!’ [as a verb, mɛ̀ːn means ‘true’]. Emphasis of negation: sɔh ‘at all’, laəj ‘at all [cf. Thai ləəj]’ and sɔh-laəj. As Thai ləəj, Khmer laəj is also used in affirmative contexts. b. Addition: phɔːŋ ‘too, as well’ (and as a hortative, e.g.: cuəj phɔːŋ ‘help!’) dae ‘too, also, even so’. This particle differs from phɔːŋ inasmuch as it can also express the truth of a state of affairs against contrary presupposition (overlap with (75a)). saot ‘moreover, in addition’. c. Repetition (Jacob 1968: 102): tìət ‘further, again; do something yet another time/once more’; vèɲ ‘back again, return; tracing the same path over again’. d. Surprise: The particle taə, especially in combination with the negator tèː in tèː taə, expresses that an event does not correspond to what the speaker expected.

Khmer has two particles corresponding to English yes. The particle cah is used by female speakers, the particle baːt by male speakers. The negator tèː is employed by both genders for negative answers. The narrative discourse marker kɔː is one of the most frequent words in Khmer with a wealth of different functions. Since it is not possible to describe all of them in detail, only a small sketch based on Haiman (2011: 328–342) will be given. In his view, it prototypically marks “foregrounded clauses which move the narrative forward” (Haiman 2011: 329). This consequential function is shown in (76), in which kɔː introduces a foregrounded clause that is preceded by background information: (76) Haiman (2011: 329): nɤ̀u pèːl jɤ̀ːŋ stay time 1pl

daə walk

mɔ̀ ːk go

dɔl arrive

phùːm village

mùːəj, one

kɔː tcl

srap-tae mɪ̀ːən khjɔl pjùh jaːŋ khlaŋ. suddenly exist wind storm adv strong ‘While we were walking towards a village, suddenly a strong wind blew up.’

711

Modern Khmer

The relation between the two sequences separated by kɔː is not necessarily temporal nor is it logical. Often, what is asserted in the clause marked by kɔː can be understood as the unexpected opposite that does not follow from the preceding clause. This is the inconsequential function of kɔː. In that function, kɔː often expresses universal concessive conditionality (‘no matter what, kɔː . . .; irrespective of the truth of the preceding clause, the kɔː clause is true). If the parallelism of inconsequential sentences is maintained and their inconsequential meaning function is reduced, the next function of kɔː is reached, i.e., its function of expressing parallel events: (77) Haiman (2011: 333): slap, mɯ̀ n rɛ̀əksaː kɔː slap. rɛ̀əksaː kɔː treat tcl die neg treat tcl die ‘If you treat it you die, and if you don’t treat it you die.’

In addition to the functions of consequential, inconsequential and parallel events, kɔː also operates as a topic-changing discourse marker (cf. Engl. anyhow) and as a marker of conformity (Engl. too) or alternative options (Engl. or). Finally, it is also used as an exclamative with the meaning of ‘how X!’, sometimes supported by ʔɤj ‘what?’: (78) Haiman (2011: 336): kɔː ʔɤj sʔaːt tcl what clean ‘How pretty!’

6

mleh! how.much

Semantics and Pragmatics

6.1 Specialised Vocabulary Politeness is not limited to the use of different pronominals. It also shows up in the selection of verbs. Thus, there is a set of verbs whose selection depends on the social situation in which it takes place and the social status of the person who enacts them. As an example, the concept of eating can be expressed by various verbs, depending on the social situation and on who is eating: sɪ̀ː ‘eat (animals)’, ɲam/ɲam-baːj ‘eat (in the family)’, hoːp ‘eat (rural, in the family)’, pɪ̀ːsaː ‘eat (polite, of others than speaker)’, tətùːəl tɪ̀ːən ‘eat (polite, of oneself)’, bɔrəphòːk ‘eat, have a meal (formal)’, chan ‘eat (of monks)’, saoj ‘eat (of royalty)’. For the action of drinking, there is the special verb phɤk ‘drink’ at the level of family and animals. The more polite forms are identical to the verbs for

712

Bisang

eating. Thus, the distinction between eating and drinking converges into the general notion of ‘consumption of food/drink’ at higher levels of politeness. Some other concepts of actions whose expression depends on the actor and the situation, are ‘come’, ‘go’, ‘sleep’ and ‘give’: (79) a. come: mɔ̀ ːk ‘come (familiar, unmarked)’, ʔɔɲcɤ̀ːɲ mɔ̀ ːk ‘come (formal, of respected persons)’, nìmùən mɔ̀ ːk ‘come (of monks), jɪ̀ːəŋ mɔ̀ ːk ‘come (of royalty)’; b. go: tɤ̀u ‘go (familiar, unmarked)’, ʔɔɲcɤ̀ːɲ tɤ̀u ‘go (formal, of respected persons)’, nìmùən tɤ̀u ‘come (of monks)’, jɪ̀ːəŋ tɤ̀u ‘go (of royalty)’; c. sleep: deːk ‘sleep (familiar, unmarked)’, kèːŋ ‘sleep (familiar, mostly of children)’, tətùːəl tɪ̀ːən dɔmneːk ‘sleep (formal)’, sɤŋ ‘sleep (of monks)’, phtùm ‘sleep (of royalty)’; d. give: ʔaoj ‘give (familiar, unmarked), cùːn ‘give (formal)’, prəkèːn ‘give (to monks)’, thvaːj ‘give (to royalty)’.

For royalty, there is a special royal vocabulary which is of limited importance nowadays. 6.2 Language Use Language use in Khmer is characterized by the high importance of economy and the pragmatic inference of grammatical categories on the one hand and by a very rich system of explicitly marking grammatical categories and politeness on the other hand. Both aspects of language use have been described in several of the above subsections. The economy-side of Khmer shows up in radical pro-drop (§4.1) and the omission of various grammatical markers that can be inferred from context (e.g. unmarked juxtaposition of clauses in §4.2 and the non-obligatory marking of tense-aspect). The multifunctionality of certain grammatical markers contributes to economy by using one and the same marker for various purposes (cf. the functions of the verb baːn ‘come to have’ in §4.3 and the verb ʔaoj ‘give’ in §§4.1, 4.2.3, 4.2.4, 5.2.5). While the grammatical system allows the production of very short and economic utterances, Khmer has developed extensive means for explicitness (tense/aspect, three complementizers and an extensive system of adverbial subordinators, an elaborate system of pronouns and the pronominal use of kinship terms and terms of address).

713

Modern Khmer

7

Sample Glossed Text

rɯ̀ əŋ mənùh lòːp story human greed The story of greedy people kaːl pɪ̀ː prèːŋ nɪ̀ːəj, mɪ̀ːən kɔmaːrɤ̀j m-nɛ̀ək ʔaːjù time from ancient on.far.side exist daughter one-cl age ‘A long time ago, there was a daughter at the age of 14

dɔp.bùːən fourteen

chnam. mdaːj praə ʔaoj tɤ̀u cɪ̀ːk dɔmloːŋ. kmeːŋ nùh year mother order give go dig sweet.potato child dist years. [Her] mother ordered her to dig for sweet potatoes. After the girl [lit.: child, rɪ̀ːəp prepare

cɔːp spade

rùːəc achieve

srec, finish

ceɲ go.out

daə walk

tɤ̀u go

rɔ̀ ːk look.for

dɔmloːŋ. tɤ̀u sweet.potato go unspecified for gender] prepared [her] spade, she went out to look for sweet potatoes. dɔl prèj thom mɪ̀ːən dɔmboːk mùːəj mɪ̀ːən rùːŋ crɤ̀u. arrive forest big exist hill one exist hole deep She went to the big forest [where] there was a hill with [having] a deep hole. kɔmaːrɤ̀j nùh tɤ̀u cɪ̀ːk dɔmloːŋ nɤ̀u cɤ̀ːŋ dɔmloːŋ nùh, rəboːt daughter dist go dig sweet.potato stay foot hill dist slip.out When the daughter was digging for sweet potatoes at the foot of the hill, phlae cɔːp crùh thlɛ̀ək tɤ̀u knoŋ rùːŋ nùh. nɪ̀ːəŋ nùh blade spade fall.off fall go inside hole dist girl dist the blade of the spade slipped out [of her hands] and fell into that hole. The girl mɯ̀ n dɤŋ thvɤ̀ː doːc-mdec kɔː neg know do how tcl didn’t know what to do, so she cried out:

sraek cry.out

prəkaːh proclaim

thaː comp

“lòːk-naː cùːəj jɔ̀ ːk phlae cɔːp ʔaoj khɲom baːn, khɲom nɯ̀ ŋ mister-whoever help take blade spade give 1 get 1 fut “Whoever can help to take the blade of the spade and give it to me, I will

714 sɔːŋ repay

Bisang kùn good.deed

lòːk.” mister

sraek cry.out

tae only

ponnɔh mɪ̀ːən like.that exist

khlaː cah tiger old

mùːəj dɔmbav one wound repay him [his] good deed.” After [she] cried like that, an old tiger with a wound kbaːl rùj baeŋ dɔŋkoːv coh kɔkrec mɔ̀ ːk pɪ̀ː knoŋ prèj, head fly lay.eggs grub move.down wriggle come from inside forest on [his] head into which flies laid [their] grubs was wriggling down out of the forest prap tɤ̀u nɪ̀ːəŋ nùh thaː “nɪ̀ːəŋ, taː jɔ̀ ːk ʔaoj baːn phlae tell go woman dist comp girl grandfather take give get blade and said to the girl: “Girl, I can take the blade cɔːp nùh, tae baə taː jɔ̀ ːk baːn, krɔ̀ ən-tae spade dist but if grandfather take get only.just of the spade for [you], but only if I can get you personally

nɪ̀ːəŋ girl

ʔaeŋ self

rɔ̀ ːk caj ʔaoj taː baːn haəj. taː mɯ̀ n bac look.for louse give grandfather get nsit grandfather neg worthwhile at looking for lices on my behalf. I am not worthy jɔ̀ ːk kùn pɪ̀ː cau take good.deed from grandchild to take a good deed from you.”

tèː.” neg

8 Bibliography Bisang, Walter. 1992. Das Verb im Chinesischen, Hmong, Vietnamesischen, Thai und Khmer. Tübingen: Gunter Narr. ―――. 1996. Areal typology and grammaticalization: Processes of grammatical ization based on nouns and verbs in East and mainland South East Asian languages. Studies in Language, 20.3, 519–597. ―――. 1999. Classifiers in East and Southeast Asian languages: Counting and beyond. In: Gvozdanovic, Jadranka (ed.), Numeral Types and Changes Worldwide. Berlin: Mouton de Gruyter, 113–185.

Modern Khmer

715

―――. 2009. On the evolution of complexity—sometimes less is more in East and mainland Southeast Asia. In: Sampson, Geoffrey, David Gil and Peter Trudgill (eds.), Language Complexity as an Evolving Variable. Oxford: Oxford University Press, 34–49. ―――. 2011. Grammaticalization and typology. In: Narrog, Heiko and Heine, Bernd (eds), Handbook of Grammaticalization. Oxford: Oxford University Press, 105–117. Chuon Nath. 1967. Vacənaːnùkrɔm khmae / Dictionnaire cambodgien. 5ème édition. Phnom Penh: Institut bouddhique. Ehrman, Madeleine. 1972. Contemporary Cambodian: Grammatical Sketch. Washington D.C.: Foreign Service Institute. Enfield, Nicholas J. 2003. Linguistic Epidemiology. Semantics and Grammar of Language Contact in Mainland Southeast Asia. London: RoutledgeCurzon. Gorgoniev, Ju. A. 1963. Kategorija glagola v sovremennom kxmerskom jazyke [The Category of the Verb in Modern Khmer]. Moskva: Izd. vozdoc&noj literatury. ―――. 1966. Gramatika kxmerskogo jazyka [Khmer Grammar]. Akademija Nauk SSSR, institut narodov Azii. Haiman, John. 1998. Possible origins of infixation in Khmer. Studies in Language 22, 597–617. ―――. 2011. Cambodian. Khmer. Amsterdam and Philadelphia: John Benjamins. Haiman, John and Ourn, Noeurng. 2003. Creative forces in Khmer. Iwasaki, Shoichi et al. (eds), South East Asian Linguistics Society XIII. Canberra: Australian National University, Research School of Pacific Studies, 157–164. Headley, Robert K. Kylin Chhor, Lam Kheng Lim, Lim Hak Kheang and Chen Chun. 1977. Cambodian-English Dictionary, 2. Vols. Washington, D.C.: The Catholic University Press. Henderson, Eugénie. 1952. The main features of Cambodian pronunciation. Bulletin of the School of Oriental and African Studies 14, 149–174. Huffman, Franklin E. 1967. An Outline of Cambodian Grammar. Cornell University. PhD Dissertation. ―――. 1970a. Cambodian System of Writing and Beginning Reader. New Haven and London: Yale University Press. ―――. 1970b. Modern Spoken Cambodian. New Haven and London: Yale University Press. Huffman, Franklin E. and Im Proum. 1978. English-Khmer Dictionary. New Haven and London: Yale University Press. Jacob, Judith M. 1968. Introduction to Cambodian. London: Oxford University Press. ―――. 1974. A Concise Cambodian-English Dictionary. London: Oxford University Press. ―――. 1978. Some observations on Khmer verbal usages. Mon-Khmer Studies 7: 95–109.

716

Bisang

Jenner, Philipp and Pou, Saveros. 1982. A Lexicon of Khmer Morphology. Honolulu: University of Hawaii Press. Khin, Sok. 1999. La grammaire du Khmer moderne. Paris: Éditions You-Feng. Lewitz, Saveros Pou. 1967. La dérivation en cambodgien moderne. Revue de l’école nationale des langues orientales 4: 65–84. Maspéro, G. 1915. Grammaire de la langue khmère. Paris: imprimerie nationale. Matisoff, James A. 1973. Tonogenesis in Southeast Asia. In Larry Hyman (ed.) Occasional papers in Linguistics 1: 72–95. Sacher, Ruth & Nguon Phan. 1985. Lehrbuch des Khmer. Leipzig: VEB Verlag Enzyklopädie. Smyth, David. Colloquial Cambodian. A Complete Language Course. London: Routledge. Thach, Joseph Deth and Paillard, Denis. 2009. Description de tròːv en Khmer contemporain. Cahiers de linguistique asie orientale 38: 71–124.

The Handbook of Austroasiatic Languages Volume 2

Grammars and Language Sketches of the World’s Languages Mainland and Insular South East Asia Editor Paul Sidwell (Australian National University)

The titles published in this series are listed at brill.com/gswl

The Handbook of Austroasiatic Languages VOLUME 2

Edited by

Mathias Jenny and Paul Sidwell

LEIDEN | BOSTON

Library of Congress Cataloging-in-Publication Data The handbook of Austroasiatic languages / Edited by Mathias Jenny and Paul Sidwell. p. cm. Includes bibliographical references and index. ISBN 978-90-04-28295-7 (hardback : alk. paper) — ISBN 978-90-04-28357-2 (e-book) 1. Austroasiatic languages—Handbooks, manuals, etc. I. Jenny, Mathias, editor. II. Sidwell, Paul, editor. PL4283.H36 2014 495.9’3—dc23 2014033059

This publication has been typeset in the multilingual “Brill” typeface. With over 5,100 characters covering Latin, IPA, Greek, and Cyrillic, this typeface is especially suitable for use in the humanities. For more information, please see www.brill.com/brill-typeface. issn 2352-9342 isbn 978-90-04-28295-7 (set hardback) isbn 978-90-04-28750-1 (vol. 1) isbn 978-90-04-28752-5 (vol. 2) isbn 978-90-04-28357-2 (e-book) Copyright 2015 by Koninklijke Brill nv, Leiden, The Netherlands. Koninklijke Brill nv incorporates the imprints Brill, Brill Nijhoff and Hotei Publishing. All rights reserved. No part of this publication may be reproduced, translated, stored in a retrieval system, or transmitted in any form or by any means, electronic, mechanical, photocopying, recording or otherwise, without prior written permission from the publisher. Authorization to photocopy items for internal or personal use is granted by Koninklijke Brill nv provided that the appropriate fees are paid directly to The Copyright Clearance Center, 222 Rosewood Drive, Suite 910, Danvers, ma 01923, usa. Fees are subject to change. This book is printed on acid-free paper.

Contents volume 1 Acknowledgements ix List of Maps x Abbreviations xi Notes on Contributors xiv

Part 1 General Chapters 1 Editors’ Introduction 3 Mathias Jenny and Paul Sidwell 2 The Austroasiatic Languages: A Typological Overview 13 Mathias Jenny, Tobias Weber and Rachel Weymuth 3 Austroasiatic Classification 144 Paul Sidwell 4 Austroasiatic Comparative-Historical Reconstruction: An Overview 221 Paul Sidwell and Felix Rau 5 Overview of the Munda Languages 364 Gregory D. S. Anderson

part 2 Grammar Sketches Section 1 Aslian 1 Northern Aslian 419 Nicole Kruspe, Niclas Burenhult and Ewelina Wnuk

vi 2 Semaq Beri 475 Nicole Kruspe

Section 2 Monic 3 Old Mon 519 Mathias Jenny and Patrick McCormick 4 Modern Mon 553 Mathias Jenny

Section 3 Pearic 5 Chong 603 Suwilai Premsrirat and Nattamon Rojanakul

Section 4 Khmeric 6 Old Khmer 643 Paul Sidwell 7 Modern Khmer 677 Walter Bisang

volume 2 Section 5 Bahnaric 8 Bunong 719 Becky Butler 9 Kơho-Sre 746 Neil H. Olsen

Contents

Contents

10 Sedang 789 Kenneth Smith and Paul Sidwell

Section 6 Katuic 11 Kui Ntua 837 Kees Jan Bos and Paul Sidwell 12 Pacoh 881 Mark J. Alves

Section 7 Vietic 13 Vietnamese (Tiếng Việt) 909 Marc Brunelle

Section 8 Khmuic 14 Kammu 957 Jan-Olof Svantesson and Arthur Holmer 15 Mlabri 1003 Kevin Bätscher

Section 9 Mangic 16 Bugan 1033 Jinfang Li and Yongxian Luo

Section 10 Palaungic 17 Dara’ang Palaung 1065 Sujaritlak Deepadung, Ampika Rattanapitak and Supakit Buakaw

vii

viii 18 Danau 1104 Aung Si

Section 11 Khasian 19 Standard Khasi 1145 K.S. Nagaraja 20 Pnar 1186 Hiram Ring

Section 12 Nicobarese 21 Car Nicobarese 1229 Paul Sidwell Аppendix 1266 Index of Languages 1320 Index of Subjects 1326

Contents

section 5 Bahnaric

․․

chapter 8

Bunong Becky Butler 1

Language Background

Bunong (cmo) is an Austroasiatic language spoken in eastern Cambodia and in the Central Highlands of Vietnam. There are a number of variants of the language name, the two most widely used being Phnong and Mnong. The former is used by the Bunong in Cambodia as well as ethnic Cambodians. The latter, Mnong, is used by ethnic Vietnamese to refer to the Bunong although it is rarely used by the Bunong themselves as an ethnonym. However, it is the most commonly used variant in formal citations of the language. Bunong is a South Bahnaric language and can be broken down into three dialect groups: Central, Southern, Eastern (Bequette 2008, Lewis 2013). In addition to these three dialects of Bunong, South Bahnaric also includes the languages Kraol, Koho, Maa, Stieng and Chrau (Lewis 2013). Central Bunong, the dialect on which this chapter focuses and which has received the most linguistic attention (with the exception of Blood’s 1966 thesis on the Eastern dialect), is spoken by about 70,000 people worldwide, with about 32,500 speakers in Vietnam, 37,500 speakers in Cambodia and the rest in diasporic communities. The language has a “developing” status, meaning that it is used by some but not widely spread or sustainable (Lewis 2013). Speakers of Central Bunong are divided by the political border between Cambodia and Vietnam, and each community uses a different writing system. In Vietnam, the Bunong script is Romanized, like the Vietnamese script, while in Cambodia, Bunong is written with a Khmer-based script. Nonetheless, the spoken dialect is still intelligible across the border. The earliest printed works on Bunong language are a series of primers and teaching materials made for public consumption, most of which were undertaken by Richard Phillips in the 1960s and 1970s in conjunction with Bunong consultants and co-authors. In terms of linguistic analysis, there has not been much work since that point (cf. Vogel and Filippi 2006 for a full list of references). Phillips (1973a) and (1973b) are short publications on Bunong phonology and syntax respectively. A volume by Vogel and Filippi (2006) contains a series of language lessons to teach Bunong with a short introduction to Bunong grammar, and Vogel (2006) presents a fairly comprehensive grammatical sketch of Bunong along with an analysis of Bunong poetry. Finally, Bequette © koninklijke brill nv, leiden, ��5 | doi ��.��63/9789004283572_014

720

Butler

(2008) presents an analysis of narrative discourse in Bunong. A BunongKhmer dictionary containing a short introduction to Bunong phonology was also recently published (Phaen et al. 2012). This chapter relies heavily on Vogel (2006) and Bequette (2008). For more in depth linguistic analysis, please refer to those sources. 2 Phonetics/Phonology 2.1 Word/Syllable Structure All words in Bunong are either monosyllabic or disyllabic. Monosyllables must be heavy, either with a long vowel or a final consonant (limited to voiceless stops, nasals and liquids) or both. They can have either simplex or complex onsets. A syllable consists minimally of a consonant onset and a vowel. The glottal stop ʔ predictably occurs in the onset if no other consonant is present and is not represented in the transcription applied in this chapter. In disyllabic words, final syllables allow the same consonants and vowels as monosyllables. Penultimate syllables, however, have reduced set of vowels and consonants. In fact, the only vowel allowed in the penultimate syllable is [ə]. In the Khmerbased orthography, this vowel is always rendered as ‹a›, and in the Romanized script used in Vietnam, this vowel is not written at all although it is invariably produced in actual speech. Penultimate syllables are always light and open. As presented by Bequette and Phaen et al. (2012), there is a mutually exclusive relationship between sequences of penultimate- and final-syllable onsets in disyllabic words (C1ə.C2VC) and monosyllabic complex onsets (C1C2VC). In other words, C1 in monosyllabic complex onsets may be any consonant in the inventory of the language, but in simplex onsets of penultimate syllables, C1 must be one of /p pʰ cʰ k r l/. In addition, C2 in complex onsets is limited to /r l w j/. A schema of the possibilities is given in the following table. C1C2VC

C1ə.C2VC

C1

C2

C1

v

C2

any consonant

r l w j

p pʰ cʰ k r l

ǝ

any consonant, except liquids or glides

721

Bunong

In addition to this standard type of disyllable, Bunong also has a small set of disyllabic words in which allow more vowels in the penultimate syllable as well as nasal codas. However, these words are extremely rare. Maximal word shape in Bunong is CVN.CVC. 2.2 Phoneme Inventory 2.2.1 Bunong Consonants The consonantal inventory for Central Bunong is given in the table below. There are a number of discrepancies in the consonantal inventory among previous phonological descriptions. In addition to the obstruent types listed in the table below, Vogel and Filippi (2006) and Bequette (2008) also include a series of plain voiced stops. However, aside from implosives, all stops in Bunong are now voiceless, at least in the dialect of Bunong spoken in Vietnam. In addition, Phillips (1973) includes a palatal and velar implosive that Vogel and Filippi (2006) and Bequette (2008) do not. Finally, Vogel and Fillippi (2006) include a palatal fricative that Bequette (2008) does not, and Bequette (2008) includes preglottalized glides that Vogel and Filippi (2006) do not. Not all consonants possible in all positions. In particular, aspirated stops, implosives and prenasalized never occur word-finally. This restricted set is indicated by the shading on the table.

Voiceless unaspirated stops Voiceless aspirated stops Implosives Prenasalized stops Fricatives Nasals Liquids Glides

Lab.

Alv.

Pal.

Vel.

Glot.

p pʰ ɓ mp

t tʰ ɗ nt

k kʰ (ɠ ) ŋk

ʔ

m

n l, r

c cʰ (ʄ ) ɲc ç ɲ

w, ʔw

ŋ

h

j, ʔj

2.2.2 Bunong Vowels As with the consonants, sources differ on the number of vowels in Bunong’s inventory, but there are approximately 12 to 18 monophthongal vowels and one diphthong (aə) in Bunong. Although Phillips (1973a), Vogel and Filippi (2006)

722

Butler

and Bequette (2008) all present a slightly different inventory of monophthongs, each suggests three contrastive vowel heights and three degrees of backness. i:/i e:/e ɛ:/ɛ

ɨ:/ɨ ə:/ə a:/a

u:/u o:/o ɔ:/ɔ

Bequette (2008) claims that vowel length is contrastive for central and back vowels but not for front vowels. In contrast, Vogel and Filippi (2006) propose that there is a length distinction for front vowels as well; however, no minimal pairs are provided. Phillips (1973a) suggests that a length distinction exists only for central vowels, and that instead there are two front lax vowels ([ɪ] and [ɛ]) and two back lax vowels ([ʊ] and [ɔ]). It is possible that both Phillips (1973) and Bequette (2008) are accurate and that the differences between their accounts represent either a change over time or a difference in dialects. However, there is no doubt that shorter vowels in Bunong are also produced in a more centralized and lax manner, so that both sources are reporting a very similar system, such that the former is presenting them in a more phonetically descriptive way while the latter is providing their phonological representations. 2.3 Suprasegmentals 2.3.1 Stress Words in Bunong are either monosyllabic or disyllabic. In disyllabic words, the main stress always falls on the final syllable. 2.3.2 Register Bunong has two registers, which derive from a historical voicing contrast on word-initial stops. This voicing contrast was lost such that there are no longer voiced consonants in Bunong (aside from implosives), but the contrast has been maintained on vowels. Despite this historical change, the register distinction is still orthographically expressed on word-initial stops instead of following vowels for both the Khmer-based script and the Romanized script. I refer to vowels following historically voiceless stops as Register 1 and vowels following historically voiced stops as Register 2. Register is manifested in a number of ways in Bunong and is contingent on the phonological height of the vowel as demonstrated in the following table. In general Register 1 vowels are produced lower in the mouth than R2 vowels and are modal whereas Register 2 vowels are breathy. However, pitch and diphthongization differ depending on vowel height.

723

Bunong

High and Mid Vowels

Low Vowels

Register 1

Register 2

Lowered in Mouth Modal Lower Pitch Diphthong Lowered in Mouth Modal Equal Pitch Monophthong

Raised in Mouth Breathy Higher Pitch Monophthong Raised in Mouth Breathy Equal Pitch Diphthong

At this point, there is no agreed-upon convention for rendering the register contrast in the Roman script. Vogel (2006) maintains the original voicing contrast on initial stops, but includes below them a breathy symbol (↔) to indicate the register change. Bequette (2008) writes them as voiceless stops with a breathy symbol underneath, and Phillips (1973a, b) simply maintains the original voicing contrast on the stops without any breathy marker. Although breathiness is not necessarily the most salient cue to register, the breathy symbol is useful as a notational device. In this chapter, I have opted to render historically voiced stops as voiceless but to include the breathy symbol under the following vowel instead of the initial consonant since that is now where the register contrast lies. Examples of each notation are given below for the word ‘water’. Orthography

Source

dak d͍ aːk t̤ak ta̤ ːk teak

Phillips 1973a, b Vogel 2006 Bequette 2008 present chapter IPA

Note also in the examples above that vowel length is rendered differently by different authors. Bequette (2008) marks short vowels while leaving long

724

Butler

v owels unmarked. In contrast, Vogel (2006) marks long vowels while leaving short vowels unmarked. I follow Vogel (2006) in this regard. 3

Word Formation

3.1 Compounding Both noun and verb compounding are common in Bunong. Nouns may be combined with other nouns and verbs with other verbs. 3.1.1 Nouns (1)

kʰɔː aw pants shirt ‘clothes’

(2)

meʔ mpə̤ː mother father ‘parents’

3.1.2 Verbs Verbs are usually compounded with like-meaning verbs. (3)

cʰoːŋ eat.rice ‘eat’

cʰaː eat

(4)

kʰit om die finish ‘die’ (Phillips 1973b: 131)

Verbs can also be compounded with adjectives (although note that these categories are not always transparently distinct since adjectives often function as stative verbs). In these types of compounds, whichever element is first dictates the classifier for the compound. Syntactically, these expressions cannot be clearly distinguished from phrases and may not be lexicalized compounds is all cases. (5)

ŋəj mbrəj speak slow ‘speak slowly’ (Phillips 1973b: 132)

(6)

wɛh waɲ good look ‘good looking’

3.2 Derivational Affixes Bunong is largely an isolating language and has very little productive derivational morphology.

Bunong

725

3.2.1 Deriving Nouns Verbs and adjectives can be made into nouns by preceding them with the nominalizer naw. (7) rawe > naw rawe ‘worry (v)’ > ‘worry (n)’ (8) reh > naw reh ‘live’ > ‘life’

3.2.2 Deriving Verbs Vogel (2006) lists three prefixes by which new verbs are derived: təm-, r- and n-. The latter is better interpreted as a causative prefix and is addressed in Section 5.2.5 below. It is not entirely clear that the first, təm- is actually a prefix instead of a preverbal particle. It does seem to change the meaning of some verbs suggesting it is more than a simple reciprocal. See Section 5.2.7.1 for more on təm-. (9) ŋəj > təmŋəj ‘say, speak’ > ‘argue, debate’

The prefix r- is less controversial and transforms intransitive verbs into transitive causative verbs: (10) pi̤c > rpi̤c ‘sleep (INTR)’ ‘put to bed (TR)’ (11)

kə̤p pi̤c 1sg sleep ‘I lie down/sleep.’ (Vogel 2006: 173)

(12) kə̤p rpi̤c kɔːn ci̤ː 1sg caus.sleep child ill ‘I put my sick child to bed.’ (Vogel 2006: 173)

3.3 Reduplication There are two types of total reduplication in Bunong, and each is used for intensification. The first reduplicates the adjective. The second reduplicates the noun being modified and takes the form Noun-Adjective-Noun-Intensifier.

726

Butler

(13) teːp kwɔŋ ~kwɔŋ ŋan ri beard big ~red true dist ‘that really big, long beard’ (Bequette 2008: 27) (14) ce̤ːŋ kaː ŋan kaː ŋeːn ‘become fish true fish ints ‘(He) became a real fish!’ (Bequette 2008: 27)

4

Phrase and Clause Structure

4.1 Simple Sentences The basic order of the declarative Bunong sentence is SV/AVP. While most sentences have at least a subject and a verb, subjects can be omitted, depending on the context. An object is not required for a sentence to be considered grammatical. Any nominal subject or object can be replaced by a classifier if the noun is already known from context or from earlier in the discourse. (15) paŋ kɛ̤h rəwɛh 3sg exist elephant ‘He has (an) elephant(s).’ (Vogel 2006: 110)

Imperative sentences do not usually take an overt subject, and the imperative marker follows the verb and object. In the following example, the object is implied by the verb. (16) cʰoːŋ cʰaː hɔm ta naj eat.rice eat ipfv loc medl ‘Eat that [rice] there!’ (Bequette 2008: 48)

Simple interrogative sentences begin with the question word followed by the verb and then the subject. This order is distinct from a number of related languages like Khmer and Vietnamese, in which question words are not fronted. See Section 5.1.1.2 for more information on interrogatives. (17) mɔːh ca̤ n kʰan.aj tɛ̤ what do 2pl.f contr ‘What on earth are you all doing?’ (Bequette 2008: 36)

727

Bunong

4.2 Complex Sentences 4.2.1 Coordination, Chaining and Subordination Sentences, the most integral part of which is the verb, can be coordinated either with coordinating conjunctions or without an overt conjunction, which usually implies a sequential action. One of the most commonly used coordinators is the phrase ce̤h ri, which I gloss as clf dist. It can be used as a simple coordinating conjunction ‘and’ or as a temporal marker meaning ‘after that’. (18) taŋ ce̤h ri hear clf dist ‘hear and see’

cʰaə see

(19) wɛh moːlah mhik good or bad ‘be good or bad’ (Phillips 1973b: 132)

(20) ɲup paŋ cʰoŋ klup təm ntrṳŋ catch 3sg carry throw in pen ‘Catch it, carry it over and throw it into the pen.’ (Phillips 1973b: 132)

When embedding questions, no connector words are necessary but the question is apparent by the presence of the question word. (21) paŋ iriː oːp cə̤t mɔːh naw maj 3sg foc ask more what nml 2sg.m He (the bull owner) asked further, ‘What is your problem?’ (Bequette 2008: 79)

4.2.2 Relative Clauses Relative clauses follow the nouns or pronouns they modify, and make use of the relativzer iː ‘that, which’. (22) kə̤p rəwat rawɛːh iː cʰaː reːp ta nɛʔ 1sg buy elephant rel eat herb loc there ‘I bought the elephant that is eating grass there.’ (Vogel 2006: 92) (23) paŋ iː cap ntro̤ ːk ŋkwɔŋ ri rəlac 3sg rel graze cow male dist argue ‘That one who grazed a bull argued. . .’ (Bequette 2008: 49)

Objects may be relativized as well, although not so commonly as subjects. The following example shows an object that is relativized and fronted.

728

Butler

(24) rəpuː iː lan cʰrɛh iriː nɨh cʰaː rəɲeː buffalo rel too.late slash foc former eat adv ‘The water buffalo that (they) had already butchered, (a large group) devoured!’ (Bequette 2008: 50)

4.2.3 Complement Clauses Complement clauses differ from relative clauses in that they serve as the argument of a predicate. According to Bequette (2008), with verbs of speech and cognition, a complementizer lah ‘say’ is required, but with verbs of perception or desire, no complementizer is necessary. However, the example she provides (the second below) includes the relative clause marker iː, where the complementizer would otherwise be. (25) kə̤p ntoːn lah ntro̤ j maj ŋan 1sg suspect say fiancée 2sg.m true ‘I suspected that she was really your fiancée.’ (Bequette 2008: 50) (26) ce̤h ri taʔ iː uː paŋ ca̤ n clf dist request rel grandmother 3sg make ‘Then (he) requested that his grandmother make him a crossbow.’ (Bequette 2008: 50)

na crossbow

4.2.4 Adverbial Clauses There are several types of adverbial clauses in Bunong. Because Bunong does not inflect verbs, time is often conveyed through such clauses, which may be located in a variety of locations within the sentence. (27) naːr jaː kə̤p kṳʔ taː jaː day prox 1sg stay loc prox ‘Today, I stay here.’ (Vogel 2006: 112) (28) kə̤p kṳʔ taː jaː naːr jaː 1sg stay loc prox day prox ‘I’m staying here today.’ (Vogel 2006: 112)

Adverbial clauses may be introduced by an optional subordinating conjunction, such as lah ‘say; if’, kɛ̤h ‘exist’, tət ‘arriving’ or ce̤h (ri) ‘after’. (29) lah aj ŋkɔ̤ːn aj ɔn ta̤ ːk wiː . . . say 2sg.f cook 2sg.f put.aside water rice.water. . . ‘If you cook rice, you put aside the rice water for me. . .’ (Bequette 2008: 51)

729

Bunong (30) ce̤h cʰaː plɔn çak ri ɲut lah . . . after eat eggplant type dist pn say . . . ‘After (she) ate that eggplant, Nhut said. . .’ (Bequette 2008: 51)

4.3 Syntax and Pragmatics The notion of the topic is extremely important in Bunong discourse. Bequette has written extensively on the subject, and states the following: Bunong exhibits several characteristics of topic-prominent languages (Li & Thompson 1976: 466), such as the lack of passive constructions, the absence of dummy subjects (e.g. it), the frequent occurrence of double subjects, and the basicness of the topic-comment sentence structure. Whether a language is subject-prominent or topic-prominent has a significant impact on participant reference. Li and Thompson (1976: 461) describe several characteristics of topics, three of which are particularly salient for participant reference in Bunong. These characteristics are: 1) topics occur sentence initially, 2) topics must be definite, and 3) topic is a discourse notion. Since topics occur in the sentence initial position and must be definite, this means that unactivated participants are normally introduced postverbally before they can become topic. Since topic is a discourse notion, one has to look beyond the sentence level to understand topic reference. Bequette 2008: 63

This use of topics in lieu of subjects is apparent in Bunong by the lack of overt subjects in discourse. Because subject drop is not permitted if the topic has not been introduced previously, this is a discourse phenomenon not a sentence level syntactic phenomenon. Following is an example of a sentence with a dropped subject, which had been introduced in the previous discourse. (31) raweː pa̤ ːr pɛː naːr Ø pe̤c moː raɨːj worry two three day sleep neg unconscious ‘Worrying two, three days, [he] did not sleep well.’ (Bequette 2008: 65)

4.4 Noun Phrases Modifiers of all types (nouns, adjectives and verbs) must follow the nouns they modify. (32) kaː ratɔŋ fish smelt ‘smelt-fish’ (Bequette 2008: 28)

730

Butler

(33) cʰah ce̤h back.basket small ‘small back-basket’ (Bequette 2008: 29) (34) pṳː nhum person suck ‘traditional healer’ (Bequette 2008: 29) (35) rəwɛh waː kə̤p elephant uncle 1sg ‘my uncle’s elephant’ (Vogel 2006: 29) (36) rapaj nɨm rabbit emph ‘the rabbit himself’ (Bequette 2008: 34) (37) ca̤ ːj paŋ nɨm house 3sg emph ‘her own home’ (Bequette 2008: 35)

Bequette (2008) lays out the maximal Bunong noun phrase in the following way: (DETERMINER) NOUN (EMPHASIS) (NUMBER) (DEMONSTRATIVE). Each component besides the noun is optional. Only one of each element is permitted, except for the demonstrative, of which there can be multiple. 5

Word Classes

Vogel (2006) argues that word classes in Bunong are flexible, and that one word may have several different grammatical functions. For example, he suggests that wɛh ‘good’ can be a verb, an adjective or an adverb, providing the examples below. Of course, the actual grammatical structure of these sentences is a matter of interpretation. Adjectives that seem to be taking the role of verbs might be adjectives following a null copula since there is some evidence that a null copula exists in Bunong (cf. Section 5.2.9). Alternatively, words like wɛh may just be stative verbs that can be used as a main predicate or as a verbal or nominal modifier. (38) rəwɛh jaː wɛh ŋan elephant prox good true ‘That elephant is very good.’ (Vogel 2006: 17)

731

Bunong (39) kə̤p kɛ̤h rəwɛh wɛh tṳː po̤ ːk 1sg exist elephant good one clf ‘I have a good elephant.’ (Vogel 2006: 17) (40) paŋ mprəː wɛh ŋan 3sg sing good true ‘He sings very well.’ (Vogel 2006: 17)

5.1 Nouns 5.1.1 Pronouns and Question Words 5.1.1.1 Pronouns In addition to the basic pronouns listed in the table below, Bunong also permits the use of kinship terms as informal pronouns, e.g. koɲ ‘young uncle’, kɔːn ‘child’, uː ‘grandmother’, etc. Apart from the first person, there are no specific plural forms of the personal pronouns. Plural referents are marked by adding kʰan before the singular pronoun. Singular Feminine Masculine

Plural Feminine Masculine

kə̤p

1st person

pɔ̤n (inclusive) heːn (exclusive) hɛː (polite)

2nd person

3rd person

aj

maj

paŋ

kʰan aj kʰan maj kʰan aj maj kʰan paŋ

pṳː (indefinite, person)

A possessive suffix rɛ or pɛ̤, whose form is likely conditioned by the phonetic environment, may also be added to pronouns, though not to common nouns. Suffixed pronouns are often phonologically reduced, as well. The form paŋ-rɛ in example (43) is often shortened to prɛ.

732

Butler

(41) kə̤p-pɛ̤ 1sg-poss ‘mine’ (42) aj-rɛ 2sg.f-poss ‘yours (feminine)’ (43) paŋ-rɛ 3sg-poss ‘his/hers’

Pronouns can be made reflexive by adding the word eŋ although this word is not affixed directly to the pronoun. (44) iː joːh paŋ ri laːh eːŋ det sister 3sg dist say refl ‘The older sister said to herself. . .’ (Bequette 2008: 32)

5.1.1.2 Question Words Bunong has four main question words: mɔːh ‘what’, mɨm ‘which’, ah ‘where’, and ta̤ h ‘how much, how many’. mɔːh ‘what/which/why’: (45) a. mɔːh ca̤ n kʰan.aj tɛ̤ what do 2pl.f contr ‘What on earth are you all doing?’ (Bequette 2008: 36) b. haj koɲ mɔːh naw rəwe m-rɛ naj hey younger.uncle what matter worry 2sg.m -contr medl ‘He asked, “Hey uncle, what are you worrying about?” ’ (Bequette 2008: 175) c. mɔːh cɔːʔ maj what long.time 2sg.m ‘What took you so long?’ (Bequette 2008: 185) d. mɔːh ɲcwat paŋ tɨːm oj tɨːm pla naj what run 3sg origin morning origin euph medl ‘Why is he running at this early morning hour?’ (Bequette 2008: 181)

Bunong

733

mɨm ‘what/which’: (46) a. paŋ iː cap ntro̤ k mɛ oːp Ø mɨm kɔːn ntro̤ k 3sg rel graze cow female ask which child cow ‘The one who grazed the cow asked (the bull owner), “Which calf?” ’ (Bequette 2008: 110) b. mɨm naw maj ri koːɲ which matter 2sg.m dist younger.uncle ‘What (is) your problem, Uncle?’ (Bequette 2008: 183) c. mɨm təːŋ lah kɔːn ntro̤ k paŋ craw which res say child cow 3sg mir ‘Why did (he) say (it was) his calf?!’ (Bequette 2008: 178) ah ‘where’: (47) a. maj ah han haʔ koːɲ 2sg.m where go disc younger.uncle ‘Uncle, where are you going?’ (Bequette 2008: 182) b. ah ca̤ t han koːɲ where direction go younger.uncle ‘Where are you going, Uncle?’ (Vogel and Filippi 2006: 31) c. ah han koːɲ where go younger.uncle ‘Where are you going, Uncle?’ (Vogel and Filippi 2006: 33) ta̤h quantity question (48) a. maj ta̤h kɛ̤h nam tɛ̤h 2sg.m how.much exist year born ‘How old are you?’ (Vogel and Filippi 2006: 50) b. ta̤h cɔ̤ʔ how.much duration ‘How long (is the duration of X)?’ (Vogel and Filippi 2006: 93)

734

Butler c. ta̤h lɛʔ cɔ̤ʔ how.much complete duration ‘In how long/how long until (X)?’ (Vogel and Filippi 2006: 93)

5.1.2 Measure Words and Quantity Words Bunong has a decimal number system. Numbers one through ten are given here. The numbers represent the common Bahnaric system, with praːm ‘five’ borrowed from Khmer. mwaːj/tṳː pa̤ ːr pɛː pwaːn praːm praːw piːc pʰaːm cʰin cɨ̤ːt ‘one’ ‘two’ ‘three’ ‘four’ ‘five’ ‘six’ ‘seven’ ‘eight’ ‘nine’ ‘ten’ For numbers 11–19, the number ‘10’ is followed by numbers ‘one’ through ‘nine’, connected with maː ‘with’. (49) 12: 10 + 2 cət maː ten with ‘twelve’

pa̤ ːr two

Multiples of ten are formed with the numbers ‘one’ through ‘nine’ immediately preceding the word for ‘ten’. (50) 30: 3 tens pɛː cət three ten ‘thirty’

For numbers 20 and above which are not multiples of ten, these two are combined. (51) 32: 3 tens + 2 pɛː cət three ten ‘thirty two’

maː with

pa̤ ːr two

When counting in Bunong, classifiers must be used. A list of classifiers is given in the table below, based on Vogel (2006).

735

Bunong Classifier Noun class

Classifier Noun class

əʔ po̤ ːk hɛj nuç təːm ŋkɔ̤r təʔ mləm ŋka̤ r ɲcar

rɔːt rnaʔ ɲoŋ ntrɔːŋ ɓaː/mɛː rɲuaŋ ntuk ntrṳ loːk

non-specific, for non-human entities lit. head, for modes of transportation one or two people only three or more people elongated items lit. sleeve, for handheld tools lit. time certain animals and fruits grains and certain fruits length for cloth and rope

long houses families, houses for families rivers, bodies of water passages of history or text large wild animals adult male boars place, location handful, usually for plants fields or plots of land

Classifiers differ depending on the category of object being counted. Classifiers follow numerals, although numerals can precede or follow nouns. (52) pa̤ ːr po̤ːk rəwɛh two clf elephant ‘two elephants’ (Vogel 2006: 26) (53) pɛː əʔ jar two clf chicken ‘two chickens’ (Vogel 2006: 26) (54) kə̤p kɛ̤h rəwɛh wɛh tṳː po̤ːk 1sg exist elephant good one clf ‘I have a good elephant.’ (Vogel 2006: 17)

While some classifiers are specific to the category of items they refer to, there is also a generic classifier: əʔ, which can also be compounded with the numeral ‘one’ mwaj to form məʔəʔ. (55) ti pa̤ ːr əʔ arm two clf ‘(your) two arms’ (Bequette 2008: 36) (56) ti məʔəʔ arm one.clf ‘(your) one arm’

736

Butler

Nouns can at times behave as classifiers when another noun—already known from previous context—is omitted. In the following example, cṳl ‘fish basket’ serves this function. (57) kɛ̤h Ø ŋan tṳː cṳl pa̤ ːr cṳl exist [fish] true one fish-basket two fish-basket ‘(They) got many (fish)—one to two baskets full!’ (Bequette 2008: 37)

Bequette (2008) also proposes a set of classifiers used for humans. However these classifiers can never be used with the word ‘person’ but instead must always be used in lieu of it. Therefore, it is unclear that these are actually classifiers but instead might be variations on the word ‘people’, which differ in the singular, dual and plural. The phrase pṳː nuç may also be used as a generic term for ‘people’. (58) tṳː hwɛː ‘one person’ pa̤ ːr hɛː ‘two people’ pɛː nuç ‘three people’

5.1.3 Names and Terms of Address As mentioned in Section 5.1.2.1, non-family members may be addressed by kinship terms if the age and status relations are appropriate. Generally, people are spoken about by referring to their family members instead of by name directly. In particular, once a person has a child, they are often referred to as mother or father of that child, in lieu of their given name. While not so common in Cambodia, Bunong people in Vietnam often have a Y- (for males) or H- (for females) prefixed to their names. This is likely due to contact with Ede language groups, who do this as well. 5.2 Verbs Phillips (1973b) states that the Bunong verb phrase minimally comprises a head verb, with optional preverbal, auxiliary and postverbal elements, as well. Bequette (2008) augments this with a verbal object occurring directly after the verb, such that the verb phrase is: (Preverbal) (Auxiliary) Verb (Object) (Postverbal). Adverbs may also occur in the post-verbal slot. The maximal verb phrase is represented as follows: (PreV)

(Aux)

Verb

(Obj)

(PostV) (Adv)

Bunong

737

The main verb may consist of just one verb, or it may also be a chain of serial verbs, which often convey a sequence of events. (59) Ø

kṳc kɔ̤m cʰaː Ø ɲan roast cook eat really ‘They really roasted, cooked and ate (the fish).’ (Bequette 2008: 43)

(60) kə̤p ntum ntiːʔ ŋəj tro̤ ːn 1sg learn study speak Khmer ‘I am learning/learned to speak Khmer’ (Vogel 2006: 115)

5.2.1 Intransitive Verbs Intransitive verbs—by definition—require only one argument. (61) wa krak kṳʔ tɨ̤m old.uncle pn sit only ‘Uncle Krak just sat.’ (Bequette 2008: 45) (62) rəpaj pi ̤c ta rəlaː ca̤ ː rabbit sleep loc meadow grass ‘The rabbit slept in the meadow.’ (Bequette 2008: 45)

5.2.2 Transitive Verbs In transitive clauses, the subject generally precedes the verb while the object follows it. (63) mpə̤ʔ kə̤p coːŋ kə̤p father 1sg pick up 1sg ‘My father picks me up.’

Bequette (2008) argues that the verb cʰoːŋ ‘eat’ is implicitly transitive because it can only take one object, namely pjaŋ ‘rice’. However, while this verb can appear in a serial verb construction with cʰaː, the more general term for ‘eat’, a sentence in which cʰoːŋ appears without an overt object or as part of a serial verb is not licit. 5.2.3 Ditransitive Verbs Ditransitive verbs in Bunong are often formed by the use of a direct object noun phrase denoting the theme and a locative prepositional phrase indicating the recipient.

738

Butler

(64) ŋit ən tap ta ɲcʰot pn give egg loc pn ‘Ngit gave an egg/eggs to Nchot.’ (65) an kaː ta trom kla̤ h ri put fish loc inside pot dist “. . . put the fish inside that pot.” (Bequette 2008: 46) (66) ŋkɔːc ma koːɲ ɲut ri tell to younger.uncle pn dist ‘. . . told Uncle Nhot that . . .’ (Bequette 2008: 47)

5.2.4 Directional Verbs Verbs that entail directionality are most explicitly covered in Phillips (1973), who divides these verbs into five types given in the table below. Deposit

Carry

Motion

Take

Transfer

an ‘give, put’

coːŋ ‘carry with two hands’ tui ‘carry on the shoulder’ nca̤ ʔot ‘carry with one hand’

lɛw ‘move’

sɔk ‘take’

wət ‘transfer a short distance’ ncṳn ‘transfer far’

tə̤ːm ‘put down’ ceʔ ‘put into’ prap ‘put away’

Any of these verb types can occur independently. They can also occur in the following combinations: Carry – [Motion, Take or Transfer] (67) coːŋ hiːp lɛw kaləː ŋiːh carry box move top house ‘carry the box upstairs’ (Phillips 1973: 133) Motion—[Take or Transfer] Deposit—[Motion, Transfer, Carry-Take or Take-Carry] (where Carry-Take is a serial verb construction)

Bunong

739

5.2.5 Causative Verbs A nasal prefix n- may be added to create causatives. Its phonetic realization is dependent on the following consonant. (68) kʰət > ŋ-kʰət ‘die’ ‘to kill’

5.2.6 Existential Verbs Bunong has two verbs that can be used as existentials: kɛ̤h ‘exist’, which in some contexts is the translation equivalent of ‘have’, and pṳt ‘be there, exist’. These verbs are always sentence-initial because the existence is presupposed, and the subsequent subject is asserted. (69) kɛ̤h cum ɔk ŋan taː jaː exist animal many true loc prox ‘There are a lot of animals here.’ (Vogel 2006: 111) (70) moː pṳt krṳː neg exist teacher ‘There is no teacher.’ (Vogel 2006: 111)

Phillips (1973b) notes that in most cases the presence of kɛ̤h does not affect the meaning of the sentence: (71) pṳː raɲ kɛ̤h rɓun leʔ person old exist meeting q ‘Was/is the old man at the meeting?’ (Phillips 1973b: 131) (72) pṳː raɲ rɓun leʔ person old meeting q ‘Is the old man coming to the meeting?’ (Phillips 1973b: 131)

5.2.7 Preverbs and Auxiliary Verbs 5.2.7.1 Preverbs There are five pre-verbal negators: moː, lic, eː, lɨj and pwa̤ j. In addition, moː is accompanied by an additional element oh, which occurs after the entire verb phrase. moː pṳt naw raweː oh neg exist nom worry neg ‘(I) do not have any worries.’ (Bequette 2008: 42)

(73) Ø

740

Butler

There is one pre-verbal reciprocal: təm. (74) ce̤h ri Ø təm ralaːc ŋan clf dist recp argue true ‘After that (the two sisters) argued with each other.’ (Bequette 2008: 42)

In addition, there are several preverbal particles: Particle

Significance

maː ci̤h tə̤j ce͍͍ŋ lɔ̤r lic laŋ ntoː (maː) pə̤j toːl

a potential or intended outcome a completed process the speaker has completed some task not previously possible the speaker has the necessary means to complete an action precede an incomplete action progressive, continuous almost, about to surprise, suddenly

(75) kə̤p ci ̤h haːn 1sg finish go ‘I just got home.’

ta loc

(76) kə̤p pə̤j haːn ta 1sg almost go loc ‘I’m about to go home.’

ŋiːh house

ŋiːh house

5.2.7.2 Auxiliary Verbs Bunong has several auxiliary verbs, which also appear in the pre-verbal position, which are listed in the following table. Most of these can also serve as main verbs, as well. The table below is based on both Phillips (1973b) and Bequette (2008).

741

Bunong Bunong

Auxiliary gloss

Independent gloss

Bunong

Auxiliary gloss

Independent gloss

mɔp hom

ever continue, still

meet remain

uc cʰam

about to do helpfully

kɛ̤h kṳ

accomplished remaining in a state in the process of going

exist be at, sit

mhɛː lɛʔ

move, lead, bring

waj

can, be able to repeat dare

win return brave

nan an

recently completed, finished customarily, typically, habitual on time cause

want treat with medicine new all

lɛw

tɨ̤j pləː ca̤ ɲ

—

— give, put

(77) ɔh lɛʔ kʰət younger.sibling complete die ‘The younger sister has died.’ (‘Younger sister is completely dead.’) (Bequette 2008: 43)

5.2.8 Postverbals Bunong has three post-verbals: tɨ̤ːm, ‘simply, only,’ lɛʔ ‘complete’ and ŋan ‘true’. The latter two can occur with intensifier hoː ‘very’. (78) cʰaː klɛʔ nta̤ ŋ hoː lɛʔ eat tuber continually very complete ‘All (I) had to eat were tubers.’ (Bequette 2008: 44) (79) tə̤ːl ŋan kə̤p meː tired true 1sg voc ‘Totally sick (of them) I was!’ (Bequette 2008: 44)

In addition, Bunong also has an optional emphatic post-verbal negator oh, which cannot occur if a preverbal negator is absent. (80) maj lɨj hom ŋklɔn ma kə̤p 2sg.m proh still think about 1sg ‘Don’t keep thinking about me!’ (Bequette 2008: 44)

oh neg

742

Butler

5.2.9 Copula Verbs Bunong does not have an overt copula. Instead, the juxtaposition of noun phrases with other noun phrases can be assumed to be separated by a null copula. Translation equivalents of adjectives are encoded as stative verbs, and therefore do not require a copula to be used as predicates. (81) waː krak moː ram old.uncle pn not happy ‘Uncle Krak is not happy.’ (Bequette 2008: 45) (82) lah ntri̤ː kɔːn ntro̤ ːk naj Ø kɔn ntro̤ ːk say like.that child cow medl child cow ‘Therefore, that calf is someone else’s calf.’ (Bequette 2008: 47)

pṳː person

(83) lah maj ntɛːr lɛʔ cʰum pri̤ Ø klaːc maj say 2sg.m roar all animal forest afraid 2sg.m ‘If you roar (and) all the forest animals are afraid (of) you. . .’ (Bequette 2008: 48) (84) ce̤h ri jɨːh Ø ram kɛ̤h kraːç clf dist emph happy exist squirrel ‘After that, wow, (he) was happy (that he) got the squirrel.’ (Bequette 2008: 48)

5.3 Locational and Directional Words Bunong has a set of demonstrative words that can function as modifiers when following a noun phrase or as pronouns when standing alone. There is also a set of locational words indicating directions of up, down and ‘over there’. These only modify nouns and do not function as pronouns themselves. Demonstratives: jaː proximal (this or here) naj medial (that or there) ri distal (that or there) (85) paŋ haːn ta ri 3sg go loc dist ‘He goes over there.’ (86) cʰim ri bird dist ‘that bird’

743

Bunong

Locatives: tiː up there tɔː over there chuː down there (87) ŋiːh tiː house loc ‘that house up there’

(88) ŋiːh cʰuː house loc ‘that house down there’

5.4 Conjunctions/Connective Words Bunong uses a number of subordinating conjunctions, which are used in adverbial clauses. The most common are lah ‘if’ (literally ‘say’), ( jɔr)/ (tə)lah ‘because’ and ce̤h (ri) ‘after’. (89) lah aj ŋkəːn aj ən ta̤ ːk wi say 2sg.f cook 2sg.f put water rice.water ‘If you cook [rice], you put aside the rice water. . .’ (Bequette 2008: 51) (90) ŋit moː haːn kaːn jɔrlah rəwɛh caɲ pn neg go work because elephant bad ‘Ngit didn’t go work today because the elephant is bad.’ (91) ce̤h cʰaː pla̤ ŋ çak ri ɲut lah after eat eggplant type dist pn say ‘After eating that eggplant, Nhut said. . .’ (Bequette 2008: 51)

6

Sample Glossed Text

mpə̤ʔ father

kə̤p 1sg

mhɛː recently

kɛ̤h exist

tṳ one

mləm clf

rəwɛh elephant

amɔːh rəwɛh naj ncroːt. name elephant medl pn ‘My father recently got an elephant, a big male named Ncrot.’

nkwaːŋ male

tɔç, big

744 laː tusk

Butler paŋ 3sg

pa̤ ːr ɗaːh tɔç, two side big

cɔ̤ŋ, ce̤h long clf

ri dist

ŋlaːŋ, white

coŋ tip

la paŋ tusk 3sg

mplɔːɲ ŋan. sharp true ‘It has two tusks, which are long and white, and their pointy tips are very sharp.’ ap every

oj morning

ap every

mhɛːw, morning

kə̤p 1sg

waj hab

han go

ca̤ t follow

mpə̤ʔ father

kə̤p 1sg

han tɔ̤ːh rəwɛh. go untie elephant ‘Every day, every morning, I go with my father to untie the elephant.’ mpə̤ʔ father

kə̤p 1sg

nca̤ ʔot peːh kut, carry knife big

tɔːŋ krɛw ce̤h ri pə̤ʔ stick type clf dist father

an kə̤p give 1sg

nca̤ ʔot məːŋ plaːj mat. carry stick clf round ‘My father carries a big knife and a big stick with a hook at the end and gives me a stick with a ball at the end to carry.’ tət arrive

ta loc

ntuk rəwɛh place elephant

kṳʔ stay

ri, dist

mpə̤ʔ father

kə̤p 1sg

coːŋ kə̤ːp pick.up 1sg

hɛw lɔr ta kaləː rəwɛh ce̤h ri paŋ tɔ̤ːh ŋleːŋ. climb first loc back elephant clf dist 3sg untie rope ‘When we get to where the elephant is, my father picks me up to climb onto the elephant and then unties the rope from around its neck.’ ce̤h clf

tɔ̤ːh untie

paŋ 3sg

wan put.on

ŋleːŋ rope

ta loc

trokɔː neck

rəwɛh, elephant

ce̤h clf

ri dist

paŋ 3sg

lɔʔ tɔ̤ːh rseː rwak maʔak ta ntṳːr kəj rəwɛh. complete untie rope chain hang loc back back elephant ‘After untying it, he puts a thick rope around its neck, and then he unties a chain and tosses it over the back of the elephant.’

745

Bunong ce̤h clf

ri dist

kə̤p 1sg

ɗaʔ caus

rəwɛh elephant

tro̤ ːm bend

ka̤ j back

ma so

nəij easy

mpə̤ʔ father

kə̤p hɛw. 1sg climb ‘Then I have the elephant bend down so it’s easy for my father to climb on.’

7 Bibliography Bequette, Rebecca. 2008. Participant Reference, Deixis, and Anaphora in Bunong Narrative Discourse. M.A. thesis, Graduate Institute of Applied Linguistics, Duncanville, TX. Lewis, M. Paul, Gary Simons & Charles D. Fennig (eds.). 2013. Ethnologue: Languages of the World (17th ed.). Dallas, TX: SIL International. Retrieved from http://www .ethnologue.com/. Phaen, T., M. Sok & T. Bequette, T. 2012. Bunong-Khmer Bilingual Dictionary. Mondulkiri Province, Cambodia: International Cooperation Cambodia, READ Project. Phillips, Richard. 1973a. Mnong vowel variations with initial stops. Mon-Khmer Studies 4: 119–127. ―――. 1973b. A Mnong pegagogical grammar: The verb phrase and constructions with two or more verbs. Mon-Khmer Studies 4: 129–138. Vogel, Sylvain. 2006. Introduction a la langue et aux dits traditionnels des Phnong de Mondulkiri. Phnom Penh: Editions Funan. Vogel, Sylvain & Filippi, Jean Michel. 2006. Elements de langue Phnong. Phnom Penh: Editions Funan.

chapter 9

Kơho-Sre Neil H. Olsen 1 Background Kơho (kpm) is one of the South Bahnaric languages of Vietnam (along with Chrau, Mnong, and Stieng). According to Sidwell (2002), the South Bahnaric group falls within the Central sub-branch of the Bahnaric branch of Austroasiatic. The speakers call themselves kon cau [kɔn.caw],1 and are often referred to as montagnards or highlanders. The Kơho number over 207,500 people (2009 Vietnam Census); additionally, several thousand live overseas in France and the U.S.A. The Kơho homeland is located in the southernmost part of the Tây Nguyên (Western Highlands) region of Việt Nam. Their habitat includes most of Lâm Đồng province and includes settlements in neighboring provinces. Within the territory inhabited by the Kơho, the topography includes the Lang Biang mountains (2,200 meters) in the northeast that overlook the Đà Lạt Plateau. The Đà Lạt Plateau, made up of rolling hills, is separated from the Djiring (Di Linh) Plateau by the wide alluvial valley of the Đa Đưng (Đồng Nai) River. South of the Di Linh Plateau, a chain of mountain peaks, averaging 1,000 meters, separate the plateau from the coastal plain. The region is known for its terres rouges—very fertile red volcanic soils. In the southwest, the elevation drops to 200 meters at Madagui on the coastal plain. The climate is characterized by a tropical savanna consisting of two monsoon seasons: the southwest summer monsoon; and the northeast winter monsoon. The average annual temperature is 19° C.; average annual precipitation is 2,070 mm. There are at least twelve Kơho dialect groups for the area: Chil (Cil, Til); Kalop (Tulop); Kơyon (Kodu, Co-Don); Làc (Làt, Lach); Mà; Nồp (Nop, Xre Nop, Noup); Pru; Ryông Tô (Riồng, Rion); Sop, Sre (Chau Sơre, Xrê); Talà (To La); and Tring (Trinh). Although, Mà is linguistically a Kơho dialect group, it is

1 The ethnonym Kơho [kə’hɔ] is derived from a Cham word that refers collectively to a group of several peoples speaking mutually intelligible dialects in the southern part of the highlands (Olsen 1968, 1976). © koninklijke brill nv, leiden, ��5 | doi ��.��63/9789004283572_015

Kơho-sre

747

considered by the Mà community and ethnologists to be a separate ethnicity.2 Mà dialects include: Cau Mà (Chau Mà); Mà Huang; Ngan; Preng; Cop (Xop); Krung; and (Cau) Tô. Ethnographic and linguistic sources present considerable variation on the number or designation of the various dialect groups. Within some dialect groups, several distinct dialects have been documented (e.g. Sre dialects A, B, & C). Among the Kơho-speaking peoples, Sre is considered the prestige dialect group. This is due to the fact that the land inhabited by Sre experienced early contact during the French colonial period. The French organized the area through government, churches, and schools. An important highway, National Route 20, that connects Hồ Chí Minh City (Sàigòn) and Đà Lạt, was built through the heart of their territory (1898–1899). After the establishment of Đà Lạt as a hill station and resort (1907), the French further colonized and settled the area. Because of their location, the Sre were one of the first Kơho peoples to be converted to Christianity by French and, later, American missionaries. During this period, the French restricted Vietnamese settlement and contacts in the four-province Pays Montagnards du Sud, but after the Geneva Agreements (1954), administration was transferred to the Sàigòn government and the highlands were opened up for trade and subsequent settlement by the Vietnamese. Several congregations of North Vietnamese Catholics chose to join the population exchanges and were relocated to Lâm Đồng province. During the Second Indochina War, most of the heavier fighting was located outside of the Kơho-speaking area, but that war had an effect on these people in several significant ways. The Kơho underwent abrupt social and cultural upheaval when entire village communities were forcibly relocated into strategic hamlets strung along National Route 20 and clustered around district and provincial centers between Bảo Lộc and Dà Lạt. Many innocent people who remained in their native villages (‘free fire zones’) were decimated along with their crops and livestock as a result of these actions. After reunification (1976), the Vietnamese government initiated a massive internal migration program where many lowlander Kinh (people of Vietnamese ethnicity) were relocated to the highland provinces. In addition to new economic zones, there were other incentives to attract Vietnamese to

2 The distinction between the Kơho and the Mà is more cultural than linguistic. Most Kơho groups traditionally followed a matrilineal descent pattern and cultivated a wet rice regime (sre), while the Mà people practiced patrilineal descent and employed upland dry rice techniques (mir). The Kơho people in the eastern section of their homeland were heavily influenced by the Austronesian-speaking Chamic kingdoms.

748

Olsen

settle in Lâm Đồng.3 Even in 1968, the Kinh population comprised 60% of the province’s inhabitants.4 In 1976, 67% were Kinh, and by 1989, the proportion was 76% of the population. Principal sources on the Kơho people are the works of Jacques Dournes, a French missionary turned ethnographer, who extensively documented traditional Kơho society and language. His annotated bibliography is a useful guide to his publications (Dournes 1974). See also Schrock et al. 1966. Principal sources on the Kơho language include Bochet & Dournes 1953, Dournes 1950, Evans & Bowen 1962, Olsen 1968, and Manley 1972. Contemporary Vietnamese materials include Hoàng Văn Hành et al. 1983, Lý Toàn Thắng et al. 1985, and Tạ Văn Thông 2004. Resources for Kơho and Mà are available on the Ethnologue web site at: http://www.ethnologue.com. Sre is one of the main languages of commerce used in the area and is employed in radio and television broadcasts in Kơho-speaking areas. In his fieldwork, Manley (1972: 11–12) noted the existence of three distinct dialects within the Sre dialect group which he termed Dialects A, B, and C. Both Dialects A and B are spoken in Di Linh, apparently in contiguous territories within the town. Speakers of Dialect A have inhabited the area for a long time. Because of this, its speakers are wealthier and better educated, so socially Dialect A is considered the prestige dialect. Dialect B is spoken by a group that migrated from north and west of Di Linh into the area in the mid 1900s. Although speakers of Dialect B have intermingled with Dialect A speakers, Dialect B speakers still maintain the characteristics of their dialect. These two dialects are mutually intelligible and exhibit only minor phonetic and lexical differences. Smalley documented dialect C, spoken to the south of Di Linh, in his important article “Sre Phonemes and Syllables” (1955). Manley notes that Dialect C “is apparently different still from Dialects A and B—more different, in fact, than A is from B” (1972: 11–12, 15, 18–19). This sketch includes data from Dialects A and B; dialectal differences are not discussed unless significant. 2 Phonetics/Phonology 2.1 Word/Syllable Structure As in many Austroasiatic languages, Kơho has two syllable types: a presyllable and a main syllable. Thus, a word in Sre may be defined as consisting of a main syllable, optionally preceded by a presyllable.

3 Former Tuyên Đức province merged with Lâm Đồng; Đà Lạt became the province capital. 4 This figure does not include Tuyên Đức province.

749

Kơho-sre

2.1.1 Presyllable The fundamental shape of a presyllable is C1V1(C2) where C1 may be any consonant, V1 is a central vowel /ə/, and C2 is a nasal or liquid.5 Presyllables tend to weaken or disappear in many environments. There is a strong tendency within South Bahnaric for presyllables to be shortened, replaced, or dropped entirely. One pattern of presyllable weakening emerges if c2 is a nasal, another if c2 is a liquid. When c2 is a nasal, there is a tendency for v1 (/ə/) to disappear and the nasal becomes syllabic: /sənduj/ > [sn̩ ˈduˑj] ‘lips’; a voiced c1 then devoices: /ləmpjat/ > [l̥m̩ˈpjat] ‘tongue’. If c2 is a liquid, /l/ devoices or is deleted preceding a main syllable initial plosive: /gəltaŋ/ > [gəl̥ˈtaŋ] ‘knee’, and /gəltaw/ > [gəˈtaw] ‘skin’. Occasionally, nasals in word-initial position preceding plosives are interpreted as presyllables (of the C2 (index 2) category) rather than as part of a consonant cluster belonging to the main syllable. For example, in some dialects and idiolects, the devoiced [l̥] in [l̥m̩ˈpjat] ‘tongue’ is deleted, leaving only the syllabic [m̩ ] as a remnant of the presyllable: [l̥m̩ˈpjat] > [m̩ ˈpjat]. 2.1.2 Main Syllable The main syllable has the canonical shape C1(C2[C3])V2(C4[C5]), where C1 represents any consonant, C2 may be a glide or liquid (with constraints associated with C1, C3 can only be a glide (with constraints associated with C1 and C2), V2 may be any vowel with attendant pitch length attributes, C4 can only be a glide, and C5 can only be /Ɂ/ (with C4 being either /j/ or /w/) or /h/ (with C4 being /j/). The permitted main syllable patterns are exemplified in Table 9.1. Table 9.1

CV CVC CVCC CCV CCVC CCVCC CCCV CCCVC

Examples of permitted Sre main syllable patterns

sa kis ləwɁ sre mpaŋ glaːjʔ ŋgwi krjaŋ

‘eat’ ‘live (v.)’ ‘fold’ ‘irrigated rice paddy’ ‘foot’ ‘satisfy, expiate’ ‘sit down’ ‘type of hard wood’

5 Lower case letters (c v) indicate presyllable symbols; upper case letters (C V) indicate main syllable symbols.

750

Olsen

2.1.3 Selected Phonological Rules Sre invokes several phonological rules to maintain the shape of the canonical syllable pattern. The following rules are illustrative of the system; they are not the only rules. Geminate consonant clusters are not permitted; these clusters reduce to a single consonant. No nasal or liquid consonant clusters are permitted. If there are two co-occurring nasals, then the first is deleted (1a). If two alveolar consonants occur in succession, then the second cannot be a lateral (i.e. the first segment is deleted) as in (1b). If the initial consonant of the root is not a nasal or lateral, then the /n/ of /tən-/ assimilates homorganically to the point of articulation of that consonant (1c). (1)

a. /tən + muːʔ/ > [tə-muːʔ] caus-go.down caus-go.down ‘cause or make someone or something (to) go down’ b. /tən + lik/ > [tə-lik] caus-come.out caus-come.out ‘cause or make someone or something (to) come out’ c. /tən + gəs/ > [təŋ-gəs] caus-(to) have caus-to beget/to have more ‘cause or make someone or something to beget or have more’

Phonemically, a palatal glide may not follow another palatal consonant (2). (2) *cj- *ɟj- *ɲj- (*ɲhj-)

Syllable-final glottal stop following a long vowel vːʔ is deleted between two syllables (3a–c). There is no deletion following a short vowel, as illustrated by (3d). (3)

a. b. c. d.

/dàʔ mɛ/ /bòʔ daʔ/ /dàʔ daʔ/ /luʔ da/

> > > >

[daː mɛ] [boː daʔ] [daː daʔ] [luʔ daʔ]

‘river (lit. water mother) ‘that head’ ‘that water’ ‘that rock’

751

Kơho-sre

2.2 Phoneme Inventory 2.2.1 Consonants Kơho phonology patterns similarly with the sound systems seen in other Austroasiatic languages. Kơho-Sre has consonant segments at five places of articulation with voiceless unaspirated and aspirated stops, voiced stops, implosives, fricatives, unaspirated and aspirated nasals, trills, laterals, and glides (Table 9.2). Table 9.2 Consonant phonemes Initials

p pʰ b ɓ m w

Finals

t tʰ d ɗ s n r l, r

c cʰ ɟ

ɲ

k kʰ g

ŋ

ʔ

h

p m w

t n s l, r

c ɲ j

k ŋ

ʔ h

j

Before the palatal finals /c/ and /ɲ/, there is an audible palatal offglide after the vowel [Vʲ], so that /pwac/ ‘flesh’ is pronounced as [pwaʲc] and /ʔaɲ/ ‘I (1st person singular)’ as [ʔaʲɲ]. The phoneme /r/ is commonly a voiced alveolar trill [r] but also often reduces to a flap [ɾ] when it occurs as the second segment in a consonant cluster. 2.2.2 Vowels Sre vowels are phonemically categorized by four primary features: height, backness, length, and, tentatively, by tongue root position for allophonic overlap. The vowels are delineated in Table 9.3. The front vowels are unrounded, while back vowels are rounded, except for [a] which is unrounded in this dialect. The central vowels vary widely as to lip rounding and height, but mostly occur as allophones of /ǝ/, especially as presyllable vowels. The high, central vowel allophones [ɨ ~ ɯ] occur very rarely in the data. Several vowels [e, o, a] almost always occur long, except in some personal names.

752

Olsen

Table 9.3 Vowels

i e ɛ

ɨ (~ɯ) ǝ a

u o ɔ ɑ

Manley (1972: 15–18) discusses allophonic overlap in the high and mid-high front vowels, where the high front vowel /i/ has [i] and [ɪ] as allophones, while the mid-high front vowel /e/ allophones are [i], [ɪ], and [e̝]. In a minimal pair where /i/ and /e/ contrast, the allophones can be indistinguishable as far as tongue height alone is concerned. For example, in the words /ntiːŋ/ ‘bone’ and /nteːŋ/ ‘where’, both vowels are long, high, tense, and front [ɪː]. He notes that speakers will invoke a slight tongue-root advancement [ɪ̘ː] to distinguish between the two overlapping allophones. Manley notes that in Dialect B, allophones of /o/ overlap with allophones of /u/, so an advanced tongue root [u̘ ] is triggered to disambiguate that overlap. It appears that some dialects of Sre are gradually evolving a nascent system to disambiguate overlapping allophones using advanced tongue root. The /ɑ/ phoneme, also written â in some SIL sources, varies phonetically [ɒ ~ ɑ ~ ɐ] according to speaker/dialect. The symbol chosen here is for convenience to distinguish from the low central vowel in the italic font. 2.3 Suprasegmentals Smalley observed that “Srê speech is characterized by a marked tenseness and preciseness of articulation. Phrasal groupings may be identified by final stress, and are bounded by space. A sharp syllable-timing, plus the allophones of tone-length, make syllable division fairly easy medially on the phonetic level” (1954: 218–219). 3

Word Formation

3.1 Compounding Compounding is a generic term for a linguistic unit composed of two or more elements, each of which could function independently in other circumstances. Examples are illustrated in (4–6).

753

Kơho-sre (4) mɛʔ-baːp ‘parents’ < mɛʔ ‘mother’ + baːp ‘father’ (5) ɟəluʔ-məŋan ‘dishes’ < ɟəluʔ ‘large bowl’ + məŋan ‘small bowl’ (6) muh-mat ‘face’ < muh ‘nose’ + mat ‘eye’

3.2 Derivational Affixes Sre prefixes generally attach themselves to verbs, but a few do act on other syntactic categories (e.g. nouns). It is often difficult to distinguish between a prefix and a presyllable because of phonological similarity (homophones) and the unstable nature of any segment that precedes a main syllable. Sre affixation is synchronically complex; this section only touches on the more prominent and recurring forms. 3.2.1 Causative Prefix tənOne of the more productive prefixes in Sre is tən-. This prefix converts intransitive verbs to causative verbs. The derived verb has the meaning of ‘causing someone or something to do something unintentionally,’ or ‘to make to happen to someone or something else.’ Allomorphs are created by phonological rules triggered by the initial consonant of the verb (or noun) root. The pattern in (7) serves as a template to illustrate the sentential changes that occur after the prefix is added to the root word. Examples (8–11) show regular derivational morphology. (7)

a. kʰaj git 3sg know ‘S/he knows it.’

gə 3ndef

b. ʔaɲ tən-git gə 1sg caus-know 3ndef ‘I cause (or make) him to know it.’ (8)

cʰət ‘to die’

tən-cʰət ‘to make to die; kill’

(9)

duh ‘to be hot’

tən-duh ‘to make hot’

(10) re: ‘to go home’

tən-re: ‘to make someone go home’

kʰaj 3sg

ʔin dat

754 (11)

Olsen sɔh ‘to wear, dress’

tən-sɔh ‘to dress someone else’

The nasal cluster avoidance rule operates in (12–15). (12) mu:ʔ ‘to go down’

tə-mu:ʔ (*tən-mu:ʔ) ‘to lower; to make go down’

(13) mut ‘to go in, enter’

tə-mut (*tən-mut) ‘to make go in’

(14) ndaw ‘to wear on the head’

tə-ndaw (*tən-ndaw) ‘to put on someone’s head’

(15) ŋac ‘to be well (goodbye)’

tə-ŋac (*tən-ŋac) ‘to wish one well; to farewell someone’

The alveolar cluster avoidance rule (§2.1.3.1(1b)) operates on (16). (16) lik ‘to go out’

tə-lik (*tən-lik) ‘to make to go out’

The nasal assimilation rule operates on (17–18). (17) gəs ‘to have’

təŋ-gəs ‘to beget; to have more’

(18) kah ‘to remember’

təŋ-kah ‘to remind; to make to remember’

Additionally, Nguyễn (1973: 26; §4.3.1.4) posits a related form tər- that generally denotes a certain movement associated with a causative meaning (19). (19) a. hiw dɔ caw sɔr house prox person demolish ‘They took down this house entirely.’ b. hiw dɔ tərlah.tərləm house prox collapse ‘This house collapsed completely.’

tərlah completely

ɟəh completely

755

Kơho-sre c. tərlah.tərliŋ to collapse.demolish ‘to collapse completely’

3.2.2 Causative Prefix bəThe causative prefix bə- is added to stative verbs to indicate ‘cause to become/ do’ (20–21). This is similar to the English suffix -en which performs the same operation ‘to make, render (of a given character or quality)’: hard > hard-en. (20) sɔŋ ‘straight’ (21) saːr ‘hard’

bə-sɔŋ ‘to straighten (to cause to be straight)’ bə-saːr ‘to harden (to cause to be hard)’

When prefixed onto a transitive verb root, it has a causative meaning (22–23). (22) cah ‘to break (intr.)’ (23) kap ‘to bite’

bə-cah ‘to cause to separate, sort out’

bə-kap ‘to cause to bite (press)’

3.2.3 Transitivizing Prefix pənThe prefix pən- is added to stative verbs: [to] cause (someone) to have the quality indicated by the verb (24–25) (Manley 1972: 44–45). (24) haːŋ ‘to be beautiful (woman)’ (25) hɑːp ‘to be envious’

pən-haːŋ ‘beautify’

pən-hɑːp ‘to make (s.o.) envious’

When the stative verb root begins with a glottal (and a vowel), the alveolar feature of the /n/ in the prefixed pən- is progressively assimilated into the initial glottal segment of the main syllable (26–28). (26) ʔum ‘to bathe’

pən-ɗum ‘to bathe (s.o.)’

(27) ʔaːŋ ‘to be bright’

pən-ɗaːŋ ‘to illuminate (s.t.)’

756

Olsen

(28) ʔja:ŋ ‘to be comfortable, at ease’ pən-ɗja:ŋ ‘to put (s.o.) at ease’

3.2.4 Passivizing Prefix gəThe prefix gə- is added to monosyllabic verb roots to express a passive notion. This prefix is possibly related to the 3rd person indefinite pronoun gə. In Sre, animate Subjects are deleted in passive expressions, while inanimate Subjects are retained and demoted to an oblique role, marked by the instrumental preposition mə (the same form mə is used as clause coordinating conjunction ‘and’). (29) a. kʰaj paːʔ mpoːŋ 3sg open door ‘S/he opens the door.’ b. mpoːŋ gə-paːʔ Ø door pass-open ‘The door (was) opened.’ (30) a. caːl paːʔ mpoːŋ wind open door ‘The wind opened the door.’ b. mpoːŋ gə-paːʔ mə caːl door pass-open ins wind ‘The door was opened by the wind.’

3.2.5 Homorganic Nasal prefix /N-/ The homorganic nasal prefix /N-/ may be prefixed to at least two sets of bases to form interrogatives or pronouns of place. When /N-/ is prefixed to a nominal or adpositional base, the resultant form is an interrogative (Nguyễn 1973: 46–47). Examples: Base form bɛ ‘that; as’ bəh ‘from’ (origin, source) teːŋ ‘place, direction’ caw ‘person’ cʰi ‘(any-, some-) thing’ [no apparent base form]

Nasal prefix + base form mbɛ ‘how’ mbəh ‘(from) where; what source’ nteːŋ ‘where’ ɲcaw ‘who, which’ ɲcʰi ‘what’ nɗɑː ‘how much/many’

757

Kơho-sre

When /N-/ is prefixed to certain determiners, the resultant form is a pronoun of place (Nguyễn 1973: 48–49). Examples: Base form dɔ ‘this’ (proximal) dɛn ~ gɛn ‘that’ (medial) daʔ ‘that’ (2nd of two objects being compared)

Nasal prefix + base form ndɔ ‘here’ ndɛn ~ngɛn6 ‘there’ ndaʔ ‘there’

3.2.6 Adverb of location prefix hə-7 The adverb of location prefix hə- occurs with nominal and determiner bases to form adverbs of location with a semantic function that indicates relative spatial position (Nguyễn 1973: 47–48): Base form gah ‘side’ nɛ ‘that’ (equidistant from the object) ɗap ‘the portion of land in front of’ ɗaŋ ‘up, above, top’ ɗəm ‘below, under(neath)’

Prefix + Base form hə-gah ‘on the side/edge’ hə-nɛ ‘over there’ hə-ɗap ‘in front of’ hə-ɗaŋ ‘on top of’ hə-ɗəm ‘undereath’

3.2.7 Nominalizing Infix -ənThe infix -ən- is inserted into transitive verbs (or nouns to produce classifiers), it has several allomorphs: -ərn- and -əmp-. The derived noun has the meaning of ‘that which is the goal of the action indicated by the verb’. The derived cognate forms often have instrumental (31), result (32), locational (33), or classifier (36–37) meanings. After infixation, (31), (34), and (35) undergo the alveolar cluster avoidance rule. (31) plɛh ‘to make way’

p-ən-ɛh8 (*p-ən-lɛh) ‘an obstruction to divert water’

(32) pat ‘to knead, squeeze’ (33) sɛ ‘to turn, detour’

p-ən-at ‘something kneaded (clay, dough, etc.)’ s-ən-ɛ ‘place where detour begins or ends’

6 /ngɛn/ is pronounced [n̩ .gɛn]. 7 The prefix hə- is not to be confused with the determiner həʔ ‘that; out of sight, temporally or spatially’. 8 Derived forms in the pattern of p-ən-ɛh are pronounced pə-nɛh.

758

Olsen

(34) blɔ ‘to wear in the ear’ (35) klɔ ‘to hear’

b-ən-ɔ (*b-ən-lɔ) ‘earring’

k-ən-ɔ (*k-ən-lɔ) ‘to hear’

(36) gar ‘seed (n.)’ (37) tap ‘egg (n.)’

g-ən-ar ‘classifier for seeds, kernels’ t-ən-ap ‘classifier for eggs’

The following nouns derived with -ərn- have instrumental meanings (38–39). (38) kal ‘to bolt, bar’

k-ərn-al ‘a wooden door bolt’

(39) ndəp ‘to cover up’

d-ərn-əp ‘to cover up, hide’

-əmp- is inserted into monosyllabic transitive verbs. The derived noun has the meaning of ‘that which is used in the action indicated by the verb’. In the data observed, this infix only occurs in verb roots with initial /s-/. No verb root forms have high vowels: [i, ɨ, u]. (40) seːt ‘to plug up’

s-əmp-et ‘a plug’

(41) saːc ‘to fish by draining’ (42) sɔːl ‘to illuminate’ (43) sroːm ‘to sheathe’

s-əmp-aːc ‘the place one stands to fish by draining’

s-əmp-ɔːl ‘a torch’ s-əmp-roːm ‘a scabbard’

(44) sraŋ ‘to sting (of a fish)’

s-əmp-raŋ ‘the stinger (of a fish)’

759

Kơho-sre

3.3 Reduplication Reduplication is usually considered a component of affixation, since many of the phonological processes are similar or identical (Olsen 2010). However, lexical and semantic issues often come into play as phonological explanations alone cannot completely account for the data. A relevant descriptive prosodic framework is that explicated by Svantesson (1983: 84), based on classic syllable structure, and his work with Kammu/Khmu. He identifies five types of reduplication (with examples in Kơho from Evans & Bowen 1962), delineated below: Type of reduplication

Description

Example (in Kơho)

R(Ø) R(O) R(N) R(C) R(R)

no change onset-change nucleus-change coda-change rhyme-change

sum-sum ‘always, forever’ bih-kʰih ‘to be poisonous’ cɔk-cɛk ‘to gossip’ ɁalaɁ-Ɂalaj ‘carelessly’ crih-cra:j ‘strange, wonderful’

The following examples were elicited by Evans & Bowen (1962: 149–155). These forms are classified following Svantesson’s typology. Forms are cited phonemically, followed by glossing as per Evans & Bowen, an illustrative sentence, and translation.9 R(Ø): No change (45) kʰaj la kənɔm njam dɔŋ mɛʔ sum.sum 3sg is child good help mother always ‘She’s a good child, always helping her mother.’ E&B046 (42)

R(O): Change in onset (46) bəs.tur həʔ gəs bih.kʰih ŋan snake dist.nvis be poisonous very ‘That viper is very poisonous.’ E&B149 (3) 9 E&Bnnn indicates the page number of the original citation, and the number in parentheses cites the sentence number on that page (e.g., E&B046 (42) refers to sentence 42 on p. 46).

760

Olsen

(47) kʰaj pədah ʔaɲ kəɲaw pʰan ʔaɲ Ɂwan.ɟan 3sg suspect 1sg steal things 1sg endured.wrongfully ‘He suspected me of stealing things; I endured a lot.’ E&B152 (49)

ŋ an very

R(N): Change in nucleus (48) mɔː nɛ ɟak ɗəs cɔk.cɛk woman equd clever speak gossip ‘That lady is very gossipy.’ E&B150 (12)

ŋan very

(49) baɲ ɗəs cəluk.cəlak bɛhəʔ proh speak wrongly that.way ‘Don’t talk like that.’ E&B150 (13)

R(C): Change in coda (50) kʰaj ləh.brwaʔ dɔ ʔalaʔ.ʔalaj ŋan 3sg work prox carelessly very ‘He did this work very carelessly.’ E&B149 (1) (51) kʰaj ŋac.ŋar ŋan tɑːm jəh ʔalaʔ.brwaʔ 3sg zealous very loc all work ‘He is zealous in all his work.’ E&B152 (42)

R(R): Change in rhyme (52) kʰaj ʔəm ɓəʔ wam muh-mat gen ɓas.sil.ɓas.sɔ 3sg stay dirty soiled nose-eye then ashamed ‘Her face was dirty so she was very ashamed.’ E&B149 (2)

ŋ an very

(53) kʰaj ɗəs lɔh.laŋ ŋan 3sg speak clearly very ‘He speaks very clearly.’ E&B151 (31)

As can be seen from the examples cited, some adverbial-like forms often function as other syntactic categories. It appears from these examples that the lexical or semantic component has little or no effect on the phonological or prosodic shape of the reduplicant. Presyllables are exempted from most phonological or prosodic processes, a common phenomenon in Austroasiatic languages. Some complex onsets may

761

Kơho-sre

not actually be reduplicated binomials, but base forms with fossilized second elements, where the meaning has been lost. Lexicographic research produced no matches for any of the second elements. 3.4 Clitics 3.4.1 The Reciprocal Proclitic tam= The reciprocal proclitic tam= indicates that the plural actors involved in the verbal action do something to each other. The derived verbal form requires two subjects or plural actors. (54) a. kʰaj ləh caw 3sg hit person ‘He hits someone.’ b. caw dɔ mə caw nɛ person prox ins person equd ‘This man and that man fight each other.’

tam=ləh RECP=fight

3.4.2 The Possessive Enclitic =tam The enclitic =tam denotes personal possession. Personal possession is indicated by Noun + =tam (55) dɔ la: sraʔ=tam prox be book=poss ‘This is my own book.’ (56) dɔ la: sraʔ=tam prox be book=poss ‘This is his own book.’

kʰaj 3sg

ʔaɲ ɟe 1sg poss (possessive/genitive de > ɟe after ʔaɲ ‘1ps’) de poss

3.4.3 The Reflexive Enclitic =tam Probably cognate with =tam above, when attached to saʔ ‘body’, =tam becomes an emphatic (57–58). (57) saʔ=tam body=refl ‘myself, yourself, him/herself’

762

Olsen

(58) saʔ=tam ʔaɲ kɔɲ lɔt body=refl 1sg want go ‘I myself want to go to market.’

4

draːʔ market

Phrase and Clause Structure

4.1 Simple Sentences Kơho-Sre basic word order is subject + verb + object (SV/AVP). The word order is fairly rigid, but there can be some phrase or clause movement to indicate certain syntactic functions. Order of Subject, Object and Verb: AVP (59) kʰaj ləh hiw pa 3sg built house new ‘S/he built a new house.’

Order of Adposition and Noun Phrase: Prepositions (60) kʰaj lɔːt tɔm daːlaːc 3sg go to Dalat. ‘He/she went to Đà Lạt.’

Order of Possessive (Genitive) and Noun: Noun-Possessive (Genitive) (61) sraʔ kʰaj book 3sg ‘his/her book’

Order of Adjective and Noun: Noun-Adjective (62) kʰaj ləh hiw pa 3sg built house new ‘S/he built a new house.’

Order of Demonstrative and Noun: Noun-Demonstrative

763

Kơho-sre (63) sɔ nɛ dog equd ‘that dog (over there)’

Order of Numeral and Noun: Numeral-Noun (64) baːr kənhaj two month ‘two months’

Order of Degree Word and Adjective: Adjective-Degree Word (65) dan ɗəs mham ʔet please speak fast little ‘Please speak a little faster.’

taj more

4.1.1 Interrogative Sentences There are two types of interrogative sentences: polar (yes/no) questions, and content questions. The position of polar question elements is phrase final. səl reflects dialect B (66–67). ʔə reflects dialect A (68). (66) me ɟəglɔh 2sg.m hungry ‘Are you hungry?’

səl q

(67) me lɔːt draːʔ səl 2sg.m go market q ‘Are you going to market?’ (68) kʰaj lɔːt hə səgɔːŋ 3sg go loc Saigon ‘Did s/he to to Sài-Gòn?’

ʔə q

In answering ‘yes’, the verb of the question is repeated. For a negative answer the verb is preceded by ʔaːʔ (‘not’). Less commonly, one can also give a simple affirmative answer with di or ʔəj (‘yes’). Content questions are used when one desires information. These questions require the interrogatives listed below.

764

Olsen

Interrogative

Gloss

mbəh ‘from where, what source’ mbɛ ‘how’ ɲcʰi ‘what’ ɲcʰi bəh tɑːj ‘why’

ɲcaw ‘who’ nɗaː ‘how much/many’ tus de ‘when’ nteːŋ (ɗah) ‘where (place)’

Content questions generally require the sentence-final word tajh. In ordinary conversation where there is an interrogative element in the sentence, tajh is optional (70a–b). For focus, an interrogative may be fronted to sentence-initial position (70b). If a formal answer is strongly desired, tajh is obligatory (71). (70) a. me lɔːt nteːŋ (tajh) 2sg.m go where q ‘Where are you going?’ b. nteːŋ me lɔːt (tajh) where 2sg.m go q ‘Where are you going?’ (71) ɲcʰi me ɗəs what 2sg.m say ‘What did you say?’

tajh q

4.1.2 Imperative Sentences There are three types of imperative sentences: polite inclusive (Ɂih or laʔ), polite exclusive (jə), and non-polite exclusive (tɛʔ). Ɂih and jə occur sentence final. laʔ and tɛʔ occur after the relevant verb. These sentences are illustrated below (72–75). Polite inclusive particle ʔih (72) (bɔl.hɛ) lɔːt ɲo 1pl.incl go drink ‘(Let’s) go drink wine.’

tərnɑːm rice.wine

Non-polite inclusive particle laʔ

ʔih hort

765

Kơho-sre (73) lɔːt laʔ, ʔaɲ ʔaːʔ lɔːt go imp 1sg neg go ‘(You) go, I’m not going!’ (a strong form of ʔih; almost scolding)

Polite exclusive particle jə (74) pəs hiw gɔh jə sweep house clean imp ‘(You) sweep the floor clean.’

Non-polite exclusive particle tɛʔ (75) lɔːt tɛʔ bal.mə bɔl.hɛ go imp com 1pl.excl ‘(You) come along with us.’

The negative imperative, or prohibitive (proh), is indicated by baɲ ‘don’t’, which is preverbal and phrase initial (76). (76) baɲ ɲim taj ɲɔʔ proh cry more laugh ‘Don’t cry any more, laugh!

tɛʔ imp

4.2 Complex Sentences Complex sentences can involve various dependent clauses, generally employing coordinating or subordinating conjunctions. Often these conjunctions overlap syntactically, but semantically the sense of the sentence is made clear by the context. Coordination involves the conjoining of constituents if they are of the same type and share the same syntactic function. These coordinate structures are constituents linked by a conjunction like mə ‘instrumental and’ (77) or hala ‘or’ (78). Note that identical constituents may undergo conjunction reduction (i.e. all but the initial co-indexed conjuncts are deleted). (77) kʰaj ŋgum kɔj mə ŋkʰjaŋ pʰɛ 3sg winnow paddy ins separate uncooked.rice ‘She winnows the paddy and [she] separates the rice.’ (78) kʰaj ʔaːʔ git kʰaj lɔːt hala ʔəm 3sg neg know 3sg go or stay ‘S/he doesn’t know [if] s/he will go or [s/he will] stay’.

766

Olsen

4.2.1 Conditional Sentences Conditional sentences are introduced by dilah ‘if’, with the secondary clause optionally beginning with gɛn ‘as a result or consequence’ (79). The same syntax is employed in resultative sentences, where gɛn is generally obligatory and the resultant clause content signals the type of clause (80). (79) dilah ʔaɲ bɛp gə ʔaɲ if 1sg sip it 1sg ‘If I sip it, (then) I can drink it.

rəgəj able

huːc drink

(80) dilah gəs jət naʔ caw tɑːm rəndɛh, gɛn if exist ten clf person loc vehicle then ‘If there are ten people in the car, then it is very crowded.’

hat crowded

ŋan very

4.2.2 Causal Sentences Causal sentences indicate a cause or consequence of an action. The conjunction gɛn introduces the causal clause (81). (81) kʰaj ʔat do paŋ sraʔ, gɛn sraʔ ruŋ 3sg hold one clf paper then paper fall ‘He held a sheet of paper, (and) it fell to the ground.’

de to

ʔuː ground

4.2.3 Contrastive Sentences Contrastive sentences often present contradictory information in the subordinate clause, which is introduced with mə.ja ‘but’. Often this clause will have a negative element, usually ʔaːʔ ‘not’ (82–83). (82) ʔaɲ tonlah kʰaj ʔəm tɑːm hiw, mə.ja 1sg suppose 3sg stay loc house but ‘I supposed he was (at) home, but he wasn’t (there).’ (83) ʔoːŋ kra nɛ sənɛŋ.tor mə.ja man old equd be.hard.of.hearing but ‘That old man is hard of hearing, but not yet deaf.’

kʰaj 3sg

ʔaːʔ.heːt not.yet

ʔaːʔ neg

gəs have

tiʔ.tor deaf

4.2.4 Concessive Sentences Concessive sentences are introduced by bulah, meaning ‘even though, although’; the following main clause is usually introduced by mə.ja ‘but’ (84).

767

Kơho-sre (84) bulah ʔaɲ ləbən, mə.ja ʔaɲ pənɗəl even.though 1sg weak but 1sg force ‘Even though, I’m weak, [but] I force myself to work.’

ləh.brwaʔ work

4.2.5 Purposive Sentences Purposive sentences have either naŋ (85) or to (86) both meaning ‘in order to’ (purp) to indicate the purpose or reason implied in the introductory clause. (85) ʔaj gaj kərnal naŋ kal take clf rod purp bolt ‘Take the rod (in order) to bolt the door.’

mpoŋ door

(86) ʔaɲ sa to hɑːm 1sg eat purp satiate ‘I eat so that I’ll be satiated/satisfied.’

4.3 Noun Phrases Noun phrases are composed of the following components: (quantifier) noun (adjective) (noun phrase) (sentence) (determiner) Examples of noun phrase possibilities are illustrated in (87–97). Noun: (87) sɔ dog

Noun + determiner: (88) sɔ nɛ dog equd

Quantifier + noun + determiner: (89) jəh all

ʔalaʔ pl

Noun + adjective:

caw person

dɔ prox

768 (90) caw person

Olsen ʔuːr woman

haːŋ beautiful

Quantifier + noun + adjective + determiner: (91) pwan four

naʔ clf

ca w person

ʔuːr women

haːŋ beautiful

nɛ equd

Quantifier + noun + adjective + noun phrase: (92) pram five

nəm clf

sraʔ books

pa new

kʰaj 3sg

Quantifier + noun + adjective + determiner: (93) pram five

nəm clf

sraʔ books

pa new

ləj any

Noun + sentence: (94) sraʔ book

kʰaj 3sg

bləj buy

ŋaj day

ʔɔːr previous

Quantifier + noun + sentence + determiner: (95) pram five

nəm cl

sraʔ books

kʰaj 3sg

bləj bought

ŋaj day

ʔɔːr previous

nɛ equd

Quantifier + noun + adjective + sentence + determiner: (96) pram ‘five

nəm clf

sraʔ books

jwan Vietnamese

kʰaj 3sg

bləj bought

ŋaj day

ʔɔːr previous

nɛ equd

Manley (1972: 158) cites the following example: (97) baːr nəm sraʔ təj ʔaɲ kʰaj bləj ŋaj ʔɔr dɔ two clf book French 1sg 3sg buy day previous prox ‘these two French books of mine he bought yesterday’ (M158: p45)

769

Kơho-sre

5

Word Classes

5.1 Nouns 5.1.1 Common Nouns Nouns are a distinct open word class in Sre, distinguished from other open classes by their syntactic and morphological characteristics. Common nouns do not have special phonological characteristics; they can be mono- or sesquisyllablic, mono-morphemic or compounded; some examples of monosyllabic common nouns: caw luʔ kɔːj pwac jwas

‘man, person’ ‘stone, rock’ ‘rice’ ‘meat’ ‘shoulder ax’

5.1.2 Pronouns Table 9.4 lists Sre personal pronouns. First person plural has an exclusive/ inclusive dichotomy; second person singular and plural distinguish masculine, feminine, and affinal forms. There is also an ‘indefinite’ pronoun that can stand for an unspecified argument (see (7, 8). Table 9.4 Personal pronouns Person

Singular

Plural

1

exclusive inclusive

ʔaɲ

bɔl ʔaɲ (hi) bɔl hɛ

2

masculine feminine affinal

me ʔaj ɗi ~ ʔi

bɔl me bɔl ʔaj bɔl ɗi ~ ʔi

indefinite

kʰaj gə

bɔl kʰaj

3

Dournes (1950: 58) notes that ɗi (masculine/feminine) replaces me and ʔaj as a term of respect in addressing in-laws (affines). Evans & Bowen (1962: 50)

770

Olsen

comment that the second person singular familiar (= affinal) ʔi is a generally accepted term for ‘you’ used among equals of either sex. More often than not, the second person pronouns are replaced in discourse by the relevant kin term that indicates a degree of respect from the speaker. First person singular as a subject pronoun is illustrated in (98); dative pronouns examples are (99–101); and possessive usages are shown in (102–104). (98) ʔaɲ kɑːɲ 1sg want ‘I want that book.’

sraʔ book

(99) ʔaj ʔaɲ ʔin take 1sg dat ‘Give me that book.’

həʔ dist.nvis sraʔ book

(100) dan cʰaŋ baːr-pɛ please fry two-three ‘Please fry some eggs for us.’

həʔ dist.nvis

tənap egg

bɔl.hɛ 1pl.incl

ʔin dat

(101) ʔaj kʰaj ʔin huːc daːʔ take 3sg dat drink water ‘Give (to) him/her water to drink.’

Possession is indicated by adding de ( ɟe for 1st person) after the pronoun. (102) a. sraʔ ʔaɲ book 1sg ‘my book’

ɟe poss

b. sraʔ kʰaj de book 3sg poss ‘his (her, its) book’ c. sraʔ kɔn de book child poss ‘the child’s book’ (103) ʔaj ʔaɲ ʔin take 1sg dat ‘Give me my book’

sraʔ book

ʔaɲ 1sg

ɟe poss

771

Kơho-sre (104) kʰaj la bəjo 3sg is friend ‘S/he is our friend.’

bɔl.hɛ 1pl.incl

de poss

5.1.3 Measure Words and Quantity Words 5.1.3.1 Numerals Sre employs a decimal numeral system; form are listed here: duːl ‘one’ baːr ‘two’ pɛ ‘three’ pwan ‘four’ pram ‘five’

praw ‘six’ pɔh ‘seven’ pʰaːm eight’ sin ‘nine’ ɟət ‘ten’

ɟət baːr ‘twelve’ baːr ɟət ‘twenty’ pɛ ɟət ‘thirty’ pwan ɟət pram ‘forty-five’ r əhjaŋ ‘hundred’ r əbo ‘thousand’ pɔh r əhjaŋ sin ɟət praw ‘seven hundred ninety-six (796)’

Numerals preceding a noun are interpreted as cardinals (105); numerals following a noun are interpreted as ordinals (106). (105) praw kənhaj six month ‘six months’ (106) kənhaj praw month six ‘the sixth month (June)’

5.1.3.2 Classifiers Sre classifiers may be divided into two groups—those that are termed concrete and those that termed abstract. All Sre concrete nouns can be counted and must be preceded by a classifier when being enumerated, with certain exceptions. The concrete classifiers fall into three major subcategories (107). (107) a. naɁ humans b. naj roundish, solid objects (fruit, rocks, footballs, grains of rice or wheat, etc.)

772

Olsen c. nəm non-human animate creatures, other inanimate objects not classified by naj

The three basic classifiers listed above are replacing the more specialized classifier functions.10 Examples of these classifier constructions are exemplified in (108). (108) a. baːr naɁ caw two clf person ‘two women’ b. pɛ naj three clf ‘three rocks’

luɁ rock

c. praw nəm six clf ‘six books’

sraɁ paper

Ɂuːr woman

In addition to these three classifiers, there are other, more specialized classifiers that are not used that much and appear to be dying out. They include measure classifiers borrowed from French (often via Vietnamese)(109). (109) a. lit liter < Vietnamese lít < French litre b. kiɁ kilogram < Vietnamese kí < French kilogramme c. tʰək meter < Vietnamese thước There are also traditional indigenous measure classifiers (110). (110) a. boːŋ b. laːs c. nɗɑːm d. tal

one length (standing with arm upraised) one length (distance, when arms outstretched and fingers also extended, from fingertips of one to fingertip of the other) one span (distance from thumb to middle finger of spread hand) one cubit (length from elbow to fingertips of the same arm)

Examples of contemporary classifier constructions include (111a–c).

10 An explanation for this is that many of the objects that required these more restrictive categories are not found in the speakers’ increasingly urbanized environment, both in Vietnam and overseas.

773

Kơho-sre (111) a. baːr lit tərnɑːm two clf rice.wine ‘two liters of wine’ b. pwan kiɁ pwac four clf meat ‘four kilograms of meat’ c. ɟət tʰək baːj ten clf cloth ‘ten meters of cloth’

All abstract nouns must be preceded by the classifier ɟənaw when counted. ɟənaw classifies words, languages, songs, poetry, and legends, etc. (112). (112) baːr ɟənaw two clf ‘two poems’

pənɗik poem

Evans & Bowen (1962: 14a), in their discussion of Kơho classifiers, demonstrate the anaphoric function of classifiers in discourse (113). (113) a. bol.kʰaj bləj pɛ nəm Ɂjari, Ɂaɲ kɔɲ bləj do nəmi 3pl buy three clf chicken 1sg want buy one clf ‘They are buying three chickens, I want to buy one.’ b. ɟət naj krwaci nɗɑː ɗɔŋ tajh Ɂaɲ kɔɲ ɟət naji ten clf orange how.much coin q 1sg want ten clf ‘How much are ten oranges? I want ten.’ c. teːŋ dɔ gəs pram naɁ cawi, pɛ naɁi nɛh re raw place prox have five clf person three clf pfv return already ‘There are five people here; three have gone home already.’

They also show the quantifiable specificity associated with the use of a particular classifier (114–115). (114) a. do one

blah clf

sraɁ paper

(blah = clf for layers of s.t.)

774

Olsen b. do on

(115) a. do one b. do one

paŋ clf naɁ clf

sraɁ paper (paŋ = clf for flat, sheet-like things: cloth, mats, paper) ca w person

mpol/ɲcəm group

(naɁ = clf for counting human beings) caw person

(mpol or ɲcəm = clf for a group of people)

c. do sɛŋ caw one clf person (sɛŋ = clf for objects occurring in sequence or linear order)

Kơho numeral classifiers are semantically determined by the specificity of the noun categorized. The classifiers are not rigid in their application and can apply in several related or overlapping contexts with respect to the noun. Table 9.5 lists all the classifiers cited in this section, illustrating their scope with examples. Table 9.5 Inventory of Kơho classifiers cited in this section. Classifier

Scope

Example nouns

blah

layers of something (clothing, paper):

ɟənau

abstract nouns for words, languages, songs, poetry, and legends: measure for weight: kilogram: measure for liquids (liter): group(s) of people (clan): counting human beings: small, round objects (fruit, rocks, balls, grains of rice): non-human animate creatures and all other inanimate objects not classified by naj: a group of people: flat, sheet-like things (cloth, mats, paper):

ʔaw ‘shirt, blouse’; sraɁ ‘ a sheet of paper’; ʔoj ‘blanket’ caw ‘Kơho language’

kiɁ11 lit mpol naɁ naj12 nəm ɲcəm paŋ

caw ‘clan (or people)’ bar naɁ caw ‘two people’ kroac ‘orange (fruit)’ glɛ ‘type of bamboo’, sɔ ‘dog’, səgən ‘hat’ caw ‘a group’ sraʔ ‘paper’ , mpan ‘board, plank’

11 This is the current term (< French); an older term is kər. 12 kroac ‘orange’ can also be used with plaj, the classifier for fruit, etc.

775

Kơho-sre Classifier

Scope

sɛŋ

objects occurring in a sequence or linear caw ‘people lined up, hiw order: ‘row of houses’ measure of volume based on capacity (~20 kg. paddy):

səɁ

Example nouns

5.1.4 Names and Terms of Address Sre personal names, of either sex, are usually prefixed with kə-, as in kə.sɛm. Some names are preceded by ha-, as in ha.sɔl. They are usually monosyllabic. Certain naming taboos and a preference for a unique name sometimes violate basic phonological rules. In dialect B, the (interrogative) vocative form hah (pol) is used to address family members (116). (116) a. mbəh me tus, hah bɛːp from.where 2sg.m come pol father ‘Where are you coming from, father?’ b. ɲcʰi ʔaj ləh, hah mɛː what 2sg.f do pol mother ‘What are you doing, mother?’

Another practice is teknonymy, where the parents’ name derives from their children (usually the oldest) (e.g. [mɛː sɛm], ‘the mother of Sem’). 5.1.5 Relator Nouns In Sre, there is a set of locative nouns that act as relator nouns; that is, they generally indicate relative spatial or temporal location, analogous to English prepositions (e.g., on the table, under the house, etc.) and share some characteristics of both nouns and prepositions.13 The four example relator nouns focused on are: ɗaŋ, ‘above, up(per)’; ɗəm, ‘under(neath)’, ‘low(er), below’; dəlam, ‘in(side), interiorness’; and bədiːh, ‘out(side), exteriorness, away’(117) to (120). (117) a. hə ɗaŋ cənɑːŋ loc above table ‘on top of the table’ (M127/p21) 13

Relator nouns are also termed relative location nouns.

776

Olsen b. Ɂaɲ ɗuːh plaj bəh ɗaŋ cʰi 1sg drop fruit from above tree ‘I dropped the fruit from the tree.’ (E&B119/094)

(118) a. hə ɗəm cənɑːŋ loc below table ‘underneath the table’ (M127/p22) (119) a. tam dəlam ɓɔːn loc interior village ‘inside the village’ (M127/p23) b. tam dəlam hiw ɡəs dra mə rɑːc loc interior house exist pillar ins beam ‘Inside (of) the house, there are pillars and beams. (E&B142/006) (120) a. tam bədiːh ləɡar jwan loc outside place Vietnam ‘outside Việt Nam’ (M127/p24) b. baɲ lɔːt bədiːh proh go outside ‘Don’t go outside.’ (E&B125/WS1)

The examples for the relator nouns pattern similarly—(locative particle + relator noun + noun)—with no intervening elements. However, relator nouns can serve other syntactic functions when used in other patterns, other than the ones delineated here. Something parallel also occurs in English, where prepositions, such as inside and outside, may function as nouns, adjectives, or adverbs. 5.1.6 Adpositions Adpositions in Sre are grouped into three categories: abstract, concrete nonspatial, and spatial case markers. There is a question whether what Manley terms (spatial) case markers actually function as prepositions (1972: 69–72). This is an area for further study. 5.1.6.1 Genitive postposition de The genitive postposition de (ɟe after ʔaɲ) indicates possession; it occurs after nouns and personal pronouns (121) repeating (102).

777

Kơho-sre

5.1.6.2 Dative and Benefactive Postposition ʔin The dative postposition ʔin marks an animate indirect object or rercipient (121–122). The dative sense often is ambiguous in that it includes or overlaps a benefactive interpretation in many of these sentences (105). Alternatively, the locative preposition te can also appear in the same environment, but seems to be obligatory only with sensory verb constructions (124). (121) kʰaj ʔaj tərnɑːm rəpu ʔin 3sg take rice.wine buffalo dat ‘S/he gave rice wine to the water buffalo.’ (122) koɲ ʔaɲ sen.gar ʔəsɛh caw pɑː uncle 1sg take.care.of horse person chief ‘My uncle takes care of the village chief’s horse.’-

ɓɔn village

ʔin dat

The dative case marker can also indicate a benefactive (BEN) sense (00). (123) bi klaw ʔaɲ taɲ do nəm səʔ caw ʔur kʰaj ʔin older.sibling male 1sg weave one clf basket person female 3sg dat ‘My (older) brother is weaving a basket for his wife.’ (124) ʔaɲ tən-ŋɔːt gə 1sg caus-fear 3 ‘I cause him to fear it.’

te loc

kʰaj 3sg

5.1.6.3 Comitative and Instrumental Preposition (bal) mə The instrumental/comitative preposition (bal) mə indicates an accompaniment relationship (‘together with x’). When only the comitative marker appears in a sentence, the first element bal is optional (125). (125) ʔaɲ lɔːt hə daːlaːc (bal.)mə gəp 1sg go loc Dalat com friend ‘I go to Đà Lạt with my friend.’ (M68/19–20)

ʔa ɲ 1sg

However, if both comitative and instrumental markers occur in the same sentence, then the instrumental phrase is indicated by mə alone and the comitative phrase is obligatorily indicated by bal mə (126). (126) ʔaɲ lɔːt hə daːlaːc mə rəndɛh bal.mə 1sg go loc Dalat ins vehicle COM ‘I go to Đà Lạt with my friend by car.’ (M68/21)

gəp friend

ʔaɲ 1sg

778

Olsen

As mentioned, there is disagreement about whether these spatial cases are indeed case markers (as per Manley) or are prepositions with a rather wide semantic scope. 5.1.6.4 Locative preposition tam (Dialect A), tɑːm (Dialect B) The locative preposition tam indicates a general location in time and/or space. There appears to be no difference in the sense of this preposition in either dialect A or B (127). (127) a. kʰaj ʔəm tam ɓɔːn 3sg stay loc village ‘He lives in the village.’ b. ʔaɲ rɛ tam pwan kəso/ɟə 1sg swim loc four kilometer/hour ‘I swam for four kilometers/hours.’

5.1.6.5 Goal Preposition tus The preposition tus from the verb tus ‘come’ has different interpretations depending on the dialect. In dialect A, tus indicates the goal or destination where someone or something is going (110). The English preposition ‘until’ best translates the sense (perhaps it could be characterised as Allative). In dialect B, the semantic scope of the word is extended to indicate recent arrival in addition to referring to future events in time and space. In dialect B, tus often co-occurs with bəh when it functions as a full verb (128). (128) a. kʰaj bəsram tus drim 3sg study come morning ‘He studied until morning.’ b. kʰaj lɔːt tus daːlaːc 3sg go come Dalat ‘He went to Đà Lạt.’ (129) a. mbəh me tus where 2sg.m come ‘Where are you (m.) from?’

779

Kơho-sre b. ʔaɲ tus bəh daːdəŋ 1sg come from pn ‘I come from Dà Dơng.’

5.1.6.6 Source Preposition bəh The source preposition bəh indicates the source or place where someone or something is from. It occurs initially in the prepositional phrase (130–131). (130) kʰaj loːt bəh daːlaːc 3sg go from Dalat ‘He came from Đà Lạt.’ (131) caw ləh mpal bəh tɑːm person make mortars from clf ‘People make mortars from kriang wood.’

krjaŋ k.o.tree

5.1.6.7 Direction Preposition te (Dialect A), de (Dialect B) The preposition te ‘to, at’ indicates a more specific location than tam/tɑːm, the locative preposition. There appears to be no difference in the sense of this preposition in either dialect A or B (132). (132) a. kʰaj cəm luʔ te 3sg throw rock loc ‘He throws rock at the dog.’ b. ʔaɲ ʔəm te bər 1sg stay loc mouth ‘I am in the doorway.’

sɔ dog

mpoːŋ door

5.2 Verbs This section details intransitive verbs, transitive verbs, and ditransitive verbs in Sre. Stative verbs, copular verbs, modals, and aspect auxiliaries are also discussed. 5.2.1 Intransitive Verbs The basic word order in intransitive clauses is SV. (133) kʰaj loːt 3sg go ‘S/he goes.’

780

Olsen

(134) ʔaɲ bic 1sg sleep ‘I sleep.’

5.2.2 Transitive Verbs Transitive clauses have the basic word order AVP, though it is not common to have both A and P overtly expressed in the same clause. (135) dan sen seŋ please read sentence ‘Please read this sentence.’

dɔ prox

(136) ʔaɲ nɛh saw hɑːm raw 1sg pfv eat full already ‘I have eaten to my fill.’ (= ‘I’m satiated.’)

5.2.3 Ditransitive Verbs In transitive constructions, the recipient (G) is generally marked by the passive postposition ʔin. The notion of ‘give’ is expressed by the verb ʔaj ‘take’ with the dative marker. The basic order is AVGT. (137) kʰaj ʔaj ʔaɲ ʔin do 3sg take 1sg dat one ‘S/he gave me a bowl of water.’

məŋan small.bowl

(138) kʰaj bətɔ ʔaɲ ʔin ɗəs 3sg teach 1sg dat speak ‘S/he teaches me to speak Kơho.’

daːʔ water

caw person

5.2.4 Stative Verbs There are two types of stative verbs: quantifiers and true statives, which can include some verbs, adjectives, and adverbs. An example of a quantifier is ʔwaʔ ‘to be much, many’ (139); an example of a true stative (derived from an adjective) is mwat ‘to be sad’ (140). (139) gɛh ʔwaʔ ca w ʔuːr existential many person women ‘There are many women at the market.’

tam loc

draːʔ market

781

Kơho-sre (140) ʔoːŋ kra nɛ mwat man old equd sad ‘That old man is always sad.’

suːm always

5.2.5 Comparative and Superlative Degree of Stative Verbs The comparative degree of statives is rəlaw ‘to exceed, be greater than’ + mə which is a comitative/instrumental particle meaning ‘than’ in this construction (141). (141) cʰi dɔ kraʔ rəlaw.mə tree prox hard comp ‘This tree is harder than that tree.’

cʰi tree

nɛ equd

The superlative degree of statives is rəlaw + ɟəh ‘to be finished, complete’ (124). (142) ʔuːr nɛ haːŋ rəlaw.ɟəh woman equd pretty super ‘That woman is the prettiest.’

5.2.6 Copula Verbs There are five copula verbs in Sre: la(h) ‘to be’, ɟeːŋ ‘to be/become’, lah ɟeːŋ ‘to be’, gəs ‘to exist’ (and consequently ‘have’ or ‘become’), and gələh ‘to become’. The usage and meaning of these verbs varies depending on the dialect. Sentences (143–145) illustrate some of these dialectal differences. (143) a. kʰaj lah (ɟeːŋ) caw ywan 3sg be person Vietnamese ‘S/he is a Vietnamese.’ (Dialect A) b. kʰaj la bəjo ʔaɲ ɟe 3sg be friend 1sg poss ‘S/he is my friend.’ (Dialect B)

In dialect B, lah means ‘to speak, say’; ɟeːŋ does not appear in the data observed. (144) a. kʰaj gəs kwaŋ 3sg exist official ‘S/he became an official.’ (Dialect A)

In dialect A, gəs serves a mutative function.

782

Olsen b. kʰaj gəs jət lɔʔ sre 3sg exist ten clf paddy.field ‘S/he has ten (rice) fields.’ (Dialect B)

In dialect B, gəs is an existential verb; sentence initially; it often introduces a content question. (145) a. kʰaj gələh kɔːp/cʰaːʔ.haːp 3sg become sick/happy ‘S/he got sick/became happy.’ (Dialect A) b. ɲcʰi gələh kʰaj ɲim what become 3sg cry ‘What made her/him cry?’ (Dialect B)

In dialect B, gələh has the general meaning of ‘to make something happen to s.o.’. It occurs mostly in interrogative sentences. 5.2.7 Modals (Preverbs) Modals modify the sense of the main verb without changing its basic meaning; they precede the main verb. The same forms mostly occur also as regular main verbs. The principal Modals are listed below. The first three are true modals. In dialect A, up to four Modals may precede the main verb (138), but Manley (1972: 203–204) notes that intelligibility breaks down if there are more than four and many combinations of multiple Modals produce ungrammatical sentences. The last two Modals do not occur in the data examined for dialect B. Modal di pal rəgəj loːc mɑːŋ təlɔːŋ

[modality] [permission] [necessity] [ability] [completion] [accustomed] [attempt]

Gloss ‘be permitted to; be (all) right, correct’ ‘have to; must; be worthy of’ ‘be able to; can’ ‘finish’ ‘be accustomed to; be used to’ ‘try to’

(146) gəs ʔwaʔ caw kənɔm cih sraʔ ɓaː di ɓaː ʔaːʔ existential many person child write paper some right some not ‘Many children write, some right, some not.’ (Dialect B)

783

Kơho-sre (147) ʔaɲ pal ʔəm tam ɓɔːn 1sg must stay in village ‘I have to/must stay in the village.’ (Dialect A) (148) ʔoːŋ pal ɗəs lɔh.laŋ man must speak clearly ‘You must speak clearly.’ (Dialect B) (149) kɔːn nɛ rəgəj rɛ kid equd can swim ‘That kid can swim.’ (Dialect A) (150) kun saʔ gɛn.soŋ rəgəj mut tɑːm hiw stoop body consequently can enter loc house ‘Stoop over so you can come into my house, . . .’ (Dialect B)

ʔaɲ, . . . 1sg

(151) kʰaj loːc ləh hiw pa mɛʔ-baːp kʰaj ʔin 3sg finish make house new mother-father 3sg poss ‘S/he is finishing building the new house for her/his parents.’ (Dialect A)

In Dialect B, loːc only occurs with bɛhəʔ with the meaning ‘after that’. It generally appears clause initially: (152) loːc bɛhəʔ kʰaj re hiw finish after.that 3sg return house ‘After that, s/he went home.’ (Dialect B) (153) kʰaj mɑːŋ ləh brwaʔ dɔ 3sg hab do work prox ‘S/he is used to doing this work.’ (154) caw nɛ təlɔːŋ bəsram person equd try study ‘That man is trying to study French.’

daʔ language

təj French

Manley cites a ‘monster’ sentence that contains four Modals, which borders on unintelligibility, but was acceptable to a native speaker, though not something he would say in daily conversation.

784

Olsen

(155) kʰaj di təlɔːŋ mɑːŋ rəgəj ɗal-sraʔ 3sg right try hab can read-paper ‘It is all right for him to try to get used to being able to read.’ M1972: 206 (185)

5.2.4 Aspect Auxiliaries Sre has two optional aspect auxiliaries that precede the main verb: nɛh and rəp. Both of these auxiliaries function not so much as tense markers as indicators of whether an action has been completed or not. Nɛh indicates that the action has been completed (pfv), while rəp signals that an action has not been completed and is either ongoing or will be completed in the future (fut). Generally, Sre speakers will often add a temporal unit to the sentence to narrow down the context or time frame of the utterance (156–158). (156) kʰaj lɔːt tam daːlaːc. 3sg go loc Dalat ‘S/he goes to Đà Lạt (daily/every month/next year).’ ‘S/he went to Đà Lạt (last year/yesterday).’ ‘S/he will go to Đà Lạt (tomorrow/next month).’ (157) kʰaj nɛh lɔːt tam daːlaːc 3sg pfv go loc Dalat ‘S/he went to Đà Lạt yesterday.’ (158) nam d aʔ kʰaj rəp year next 3sg fut ‘Next year, I will go to Đà Lạt.’

lɔːt go

ŋaj day t am loc

ʔɔr previous daːlaːc Dalat

5.3 Locational and Directional Words The following lists locational and directional (deictic) words in Sre from Manley (1972: 150–151). dɔ nɛ dɛn/gɛn daʔ həʔ

‘this’ (closer to speaker; the first of a pair of objects being compared or contrasted: prox) ‘that’ (object not proximate but equidistant from speaker and addressee: equd) ‘that’ (closer to addressee, not speaker: medl) ‘that’ (the second of a pair of objects being compared or contrasted: dist) ‘that’ (not visible, spatially or temporally; anaphoric, old information: dist.nvis)

785

Kơho-sre

5.4 Conjunctions/Connective Words The following are some of the connecting or coordinating words: aj, aj . . . gɛn ‘as for’, bu(lah) ‘if’, digəlan ‘maybe’, di(lah) ‘if’, gɛn ‘as a result; consequence’, hala ‘or’, mə taj. . . kuŋ ‘moreover’ and mə ja ‘but’. See §4.2 above for illustrative sentences involving these words. 5.5 Particles There are four imperative particles: polite inclusive (Ɂih or laʔ), polite exclusive ( jə), and non-polite exclusive (tɛʔ). See §4.1.2 above for illustrative sentences involving these particles. There are several sentence final particles: (di) ləm ‘to be all right, turn out all right’ (159); mɑː ‘immediately, right away’ (143); and mɑːn ‘indicates that the result of the sentence is uncertain’ (160). (159) tuɁ ləj di.ləm time any all.right ‘Any time will be all right.’ (M155/060) (160) ʔaɲ soːr kʰaj gɛn ɲɟɯːt kʰaj ntaw mɑː. 1sg tell 3sg then stand 3sg stand immediately ‘I told him and he stood up right away.’ E&B140/2BA (161) baɲ lɔːt ŋaːj ʔir rəŋɔːt bɔl.hɛ bɔːl.glar mɑːn. proh go far too afraid 1pl.incl tired uncertainty ‘Don’t go too far away, for fear that (lest) we’ll be tired.’ (E&B127/013)

6

Semantics and Pragmatics

Specialized Vocabulary (Kinship) Selected Sre kinship terms are listed below. The Sre reckon kinship and descent from the mother’s side of the family (i.e. matrilineal). mɔː ʔoŋ mɛː bɛːp kɔn saw kɔn ʔur kɔn klaw

‘grandmother’ ‘grandfather’ ‘mother’ ‘father’ ‘children & grandchildren’ ‘daughter’ ‘son’

786 bi ʔɔh lɔʔ

7

Olsen ‘older sibling (ʔur = ‘female’; klaw = ‘male’) ‘younger sibling (ʔur = ‘female’; klaw = ‘male’) ‘sibling of the opposite sex of the speaker’

Glossed Text

Traditional Village Work (Evans & Bowen 1962: 134–135 [§§50–60]) (1)

kɑːp drim mɛː ʔaɲ pəs hiw, bəʔ daː mə pjah kɔːj every morning mother 1sg sweep house draw water ins husk paddy ‘Every morning my mother sweeps the house, draws water, and husks the rice.’

(2)

kʰaj duŋ ŋgum kɔj mə 3sg also winnow paddy ins ‘She also winnows it and separates it.’

(3)

bi ʔur ʔaɲ dɔŋ mɛː truʔ pjaŋ mə gəm older sister 1sg help mother cook rice ins cook ‘My older sister helps my mother cook rice and vegetables.’

bjap vegetables

(4)

sir saw gɛn ʔalaʔ caw ʔur raw ɟəluʔ already eat then pl person women wash large.bowl ‘After eating, the women wash the dishes.’

məŋan small.bowl

(5)

kɑːp ŋay ʔɔh ʔur ʔaɲ lɔːt ɟɔj lɔŋ gen kʰaj blah lɔŋ every day younger sister 1sg go find firewood then 3sg split firewood ‘Every day, my younger sister goes to find firewood and then she splits it.’

(6)

kɑːp mhɔ mɛː ʔaɲ lɔːt bəʔ d aː every evening mother 1sg go draw water ‘Every evening my mother goes to draw water again.’

(7)

kʰaj piːh ʔoːj ʔaw gɔh 3sg wash blanket shirt clean ‘She washes the clothes very clean.’

(8)

tɑːm kənhaj loc season

praŋ caw dry person

ŋkʰjaŋ separate

pʰɛ uncooked.rice

taj more

ŋa n very

ʔur t aɲ ʔoːj bel women weave blanket mat

rənɗɔː large.bag

787

Kơho-sre mə pərlɑː ins small-bag ‘In the dry season, the women weave blankets (=skirts), mats, and bags.’ (9)

tɑːm troː maŋ bɔl.kʰaj ʔaj ɗeːs roːj loc sky night 3pl take cotton spin ‘At night, they take cotton and spin it into thread.

(10) caw ʔur ɓa ː sen gaːr person woman some see guard ‘Some women take care of the children.’

ʔalaʔ pl

gəs have

braːj thread

kɔn child

8 Bibliography Bochet, Gilbert & Jacques Dournes. 1953. Lexique polyglotte. Saigon: Éditions France-Asie. Dournes, Jacques. 1950. Dictionnaire Srê (Kơho—Français. Saigon: Imprimerie d’Extrême-Orient. ―――. 1974. Une documentation sur les parlers köho. Asie du Sud-East et Monde Insulindien 5.1: 161–170. Evans, Helen E, and Peggy Bowen. 1962. Kơho language course. 2 vol. Dalat: Christian and Missionary Alliance. Hoàng Văn Hành, Vũ Bá Hùng, Ðoàn Vă Phùc, Tạ Văn Thông, K’Gim, K’Broh, K’Mé, K’Brờk, Kon Yai Treng Tài. 1983. Từ Ðiển Việt-Kơho [Vietnamese-Kơho dictionary]. T.P. Hố Chí Minh: Sở Văn hóa và Tông Tin Xuất Bản. Lý Toàn Thắng, Tạ Văn Thông, K’Brêu, K’Bròh. 1985. Ngữ pháp tiếng Kơho [Kơho grammar]. Sở Văn hóa và Thông Tin Lâm Đồng. Lâm Đồng. Manley, Timothy M. 1972. Outline of Sre structure. Oceanic Linguistics Special Publication, 12. Honolulu: University of Hawaii Press. Nguyễn văn Hoan. 1973. Kơho-Sre affixation. Saigon: University of Saigon MA thesis. Olsen, Neil H. 1968. Basic Kơho. Bảo-Lộc, Lâm-Đồng: MACV Advisory Team 38. ―――. 1976. Matrilineal societies in Southeast Asia: examples from Highland Vietnam. In David J. Banks (ed.), Changing Identities in Modern Southeast Asia. The Hague & Paris: Mouton Publishers, 249–255. ―――. 2010. Aspects of reduplication in Kơho, a Mon-Khmer language. Journal of the Southeast Asian Linguistics Society 3.1: 77–85. Schrock, Joann L., William Stockton, Jr., Elaine M. Murphy, & Marilou Fromme. 1966. Minority groups in the Republic of Vietnam. Ethnographic study series. Department

788

Olsen

of the Army pamphlet 550–105. Washington, D.C.: U. S. Government Printing Office. [Koho, pp. 389–431; Ma, pp. 436–472]. Sidwell, Paul. 2002. Genetic classification of the Bahnaric languages: a comprehensive review. Mon-Khmer Studies 32: 1–24. Smalley, William. 1955. Sre phonemes and syllables. Journal of the American Oriental Society 74.4: 217–222. Svantesson, Jan-Olof. 1983. Kammu phonology and morphology. Lund: Travaux de l’Institut de Linguistique de Lund. Tạ Văn Thông. 2004. Ngữ Âm Tiếng Kơho [Kơho phonetics]. Hà Nội: Nhà Xuất Bản Khoa Học Xã Hội.

chapter 10

Sedang1 Kenneth Smith and Paul Sidwell 1 Background The Sedang (sed) speak an Austroasiatic language of the Northern sub- division of the Bahnaric branch. They have lived traditionally in central Kon Tum Province of Vietnam, where the 1999 national census counted 101,000 Sedang. The Sedang mainly refer to themselves as rəteə̰ŋ, which also has the more general connotation of “montagnard”; the presyllable rə- occurs as s- in some dialects, hense the romanised form sedang. The Bahnar equivalent by normal sound change correspondences is hədaŋ. The Sedang area is generally very mountainous; although the Sedang generally live below 1000 meters or so of elevation (Kon Tum City has an elevation of about 600 meters). The highest peak of the central highlands, Ngoc Linh (2,598 m) known by the Sedang as Ngo Eang, is in the north of the Sedang range. It is in an area of tremendous rainfall and is the scene of much Sedang folklore. The Sedang have been characterised by the French colonialists as warriors and makers of arms, utilising iron deposits in their area, and trading metal goods historically over a large area. This reputation as warriors was keenly valued, along with a desire to maintain as far as possible independence from external powers. During the 1930s the Sedang opposed French administration and attacked their outposts and burned bridges along the highway. During the brief Japanese occupation of 1945, it is said that the Japanese feared only the “moi” (savage) regiments in French uniform, while they were rather contemptuous of the other native regiments (Devereux 1947: 394). During the 1960s–70s infiltration of Nationalist Vietnamese units into the remote areas caused the Sedang people to be divided, forcibly or otherwise. Sedang fought on both sides of the conflict, and are especially well known for fighting in units especially created and armed by the Americans, who occasionally 1 This text is largely derived from Kenneth Smith’s (1979) Sedang Grammar. The grammar was written in terms of classical phonemics and the tagmemic framework that was the basis of much SIL descriptive work of the time. The present sketch is substantially rewritten according to more current norms of linguistic description, including use of IPA in place of Smith’s romanised orthography.

© koninklijke brill nv, leiden, ��5 | doi ��.��63/9789004283572_016

790

Smith and Sidwell

complained that these units were often more keen to fight the Vietnamese generally, not just those pushing in from the north. With the unification of Vietnam, and nearly four decades of consolidation and development of the new regime in the highlands, social conditions have changed considerably. Both authors have visited the area in recent years and observed that the Sedang, along with other Montagnard groups, are now competing with large numbers of Vietnamese transmigrants for opportunities in the now booming inland cities such as Kontum. Similarly in the countryside, which is now opened up with modern roads, communications and services, much the land is being reformed into a version of Vietnamese civilisation that does not sit entirely comfortable with the historical ethnic diversity of the highlands. The Sedang are largely farmers, hunters, trappers, and fishers. The basic crop was rice obtained by slash and burn agriculture, practiced in swiddens scattered around the villages, using a given field for two or three years before abandoning it. In suitable areas paddy rice is also cultivated, and other starch crops such as maize and manioc are important. Within a village there are gardens for tobacco, squash, pepper, potatoes, bananas, and other fruits and vegetables. In the surrounding jungles they hunt for deer, boar, wild chickens, etc., although their crossbows and arrows have been largely replaced with guns. The men weave and make a variety of bamboo traps for catching small animals, birds, and fish. The women go out and cut firewood which they carry back to the village in high-piled back baskets. The women have the responsibility of carrying water, caring for the children, weaving cloth, pounding and winnowing rice, and meal preparation. Rice wine, as well as wine prepared from other foods, is almost always present in large jars tied to the center post of their houses. Chickens and pigs, cows and water buffalo are kept for sacrifices and food. As late as the 1960s Hickey (1967: 757) was reporting that instances of slavery among the Sedang continued. The largest Sedang social unit is the village, inter-village political ties are not predominant among the Sedang (Devereux, 1937). A Sedang village can have as few as three houses or as many as several hundred. The extended family lives together in a house though each immediate family has its own apartment. Kinship is bilateral and after marriage they practice bilocal residence, i.e. after marriage they will spend a few weeks with one set of parents—either set— and thereafter live permanently with the other. Marriage is permitted only if the couple do not have the same great-grandparents. Marriage outside the tribe is permitted; historically warring parties have gone to Bahnar and Jarai areas to secure women for wives, while today intertribal marriages are com-

Sedang

791

mon and one can meet many ‘mixed’ families in Kontum. Respect is given to the elders, who make decisions meeting in the village communal house which also often serves as the dormitory for bachelor boys (the often spectacular nha rong that are characteristic of the highlands). Harking back to the warrior tradition, Soulié (1927: 69) claimed that a Sedang boy would not be respected as a man until his first murder. The Sedang have been animists, sacrificing offerings to the spirits which surround their lives. Sacrifices are demanded by the shamans in time of death, illness, planting, harvest, etc. Chickens, pigs, and water buffalo are slain at the base of the spirit pole after a night of drinking and dancing. Expensive Laotian garments are brought out for the dancing, accompanied by the playing of gongs, cymbals, and drums. At death, dancing proceeds from the house to the burying ground outside the village where the casket with the body is buried and a small shelter is erected in which are placed various foods and implements for use by the spirit of the deceased. Human sacrifice was reported initially by Cupet (1893: 218) and as recently as 1930 by Devereux (Hickey 1964: 149). These days many Sedang, perhaps a majority, are at least nominally Roman Catholic and have forsaken many of the former practices. In Kontum on any Sunday one can observe hundreds of Sedang Christians, mixing with Bahnars, Rengaos and others, attending services at the great wooden cathedral, and other churches about the district. The traditional dress of Sedang men has been a loincloth, and of Sedang women a skirt. Beads around the neck, bracelets on the wrists, or rings through the ear are adornments often seen on women. The women have exquisite skirts and the men robes from Laos which are reserved for special times of sacrifice, dancing, or festivity, although these days one is as likely to see jeans and t-shirts, even on significant occasions. Other descriptions of or references to Sedang culture—none of any significant depth—include the following: Baudesson (1932), Cupet (1893), Devereux (1937, 1938), Guilleminet (1952), Hickey (1964, 1967), Hoffet (1933), Mansuy (1929); Maspero (1929), Maitre (1912). Principal sources on Sedang language are the works of Kenneth Smith (e.g.: 1969, affixation; 1979, grammar; 2000, dictionary). 2 Phonetics/Phonology The details presented in this sketch apply to the dialect of Sedang known as rəteə̰ŋ kəklai ‘Sedang that uses kəklai “what”, which is a principal and central (ethno-)dialect. More particularly, this subdialect is most readily intelligible

792

Smith and Sidwell

both orally and in written form to the broadest scope of Sedang speakers; it is spoken in villages such as Tea Kəlap, Dak Rəwang and Va Məna. Sedang phonology was initially described by Smith (1968), and later rewritten for the (1979) grammar. The practical phonemic orthography developed at that time by Smith has been re-rendered in IPA transcription for the present sketch. 2.1 Word/Syllable Structure Phonological words in Sedang are either mono- or disyllabic iambs. Monosyllabic words consist of one stressed main syllable. Disyllabic words have an additional unstressed presyllable. The main syllable consists of an onset consonant position filled by an initial consonant (Ci) with or without a following medial consonant (Cm), a nucleus filled by a simple or diphthonged vowel (V) (with optional nasalisation (N)) and optional final consonant (Cf). Main syllables are also marked for register (±creaky). Presyllables consist of a presyllable consonant (Cp) and presyllable vowel (Vp). Consonant clusters occur more frequently in monosyllabic than disyllabic words, although only one word in seven in texts has a consonant cluster. A phonological word may be summarised: (CpVp).Ci(Cm)V(N)(Cf)R 2.2 Phoneme Inventory and Phonotactics The Ci consonants are presented in the table below. Table 10.1 Sedang Ci consonants

stop -voice stop +voice implosive nasal nasal glottalised nasal -voice fricative approximant approximant glottalised approximant -voice

p b [mb] ɓ m ˀm m̥ w ˀw w̥

t d [ⁿd] ɗ n ˀn n̥ s, ʂ l, r ˀl, ˀr l ̥, r̥

c ɟ [ᶮɟ] (ʄ) ɲ ˀɲ ɲ̊ j ˀj j̊

k g [ŋg] ŋ ˀŋ ŋ̊

ʔ

h

The four voiced stops—generally pronounced as prenasalized—, occur relatively infrequently in that they are not historically part of the language. The least

793

Sedang

frequent of all consonants is initial /j/ which only re-entered the language through loans from Bahnar, Vietnamese and French. The implosive palatal only occurs in personal names. Consonant clusters are formed with medials (Cm) h, l, r; the following are attested: -h: ph, th, kh (a very small proportion of Sedang speakers claim to distinguish c and ch). -l: pl, kl, bl, ɓl, gl, ml (and tl perhaps as a dialectal variant of kl). -r: pr, tr, kr, br, ɓr, dr, ɗr, gr, mr, ˀmr, ŋr, ˀŋr, ŋ̊ r.

The main syllable coda consonants (Cf) are shown below. Table 10.2 Sedang Cf consonants

stop nasal approximant fricatives

p m w

t n (l ~ r)

( jʔ) j ç

k ŋ

(ʔ)

h

All consonants except nasals occur with nasalised vowels. Only final nasals and approximants occur in tense register rhymes whereas all occur in lax register. Coda -jʔ represents an articulation that is similar to a final palatal stop [-c] but instead of the tongue making a complete closure against the palate there is only an approximation during which phonation is interrupted by glottal closure. In Smith (1979) both -ç and -jʔ were treated complex phonemes /jh, jʔ/ and classed together, but these codas have different histories and there is no strong case for positing the complication of final clusters within the Sedang phonological word. Codas -ʔ, -jʔ, -l, and -r are very infrequent, entering the language through loans from neighboring languages. Only codas -ŋ, -k and -h have been observed with all seven monophthongs. Codas -m, -n, -p and -t have been observed following five or six vowels, although it seems as though these gaps are incidental. Apparently -j is restricted to just five rhymes ej, aj, uj, oj and ɔj, and -w is restricted to just rhymes iw, aw and ow. Presyllables are the unstressed-consonant-plus-vowel syllables which precede the stressed main syllables described above. A small proportion of presyllables arise due to reduplication. Major presyllables are restricted to a limited set of consonants followed by a schwa vowel, or when Cp is a glottal stop the vowel may be either schwa or i

794

Smith and Sidwell

(suggesting an underlying palatal consonant plus schwa). Presyllables bə-, sə-, ɟə- occur very infrequently. The major presyllables are shown in the following table: Table 10.3 Sedang major presyllables

pəbəmə-

tə-

lə-, rəsə-

ɟə-

kə-

ʔə-, ʔi-

hə-

Notes:

· pə-, tə-, kə-, and mə- are sometimes, but not usually, affixes; lə- is usually an affix. hə· and rə- are followed by a non-phonemic [s] before voiceless stops; thus: rəta [rəsta]. mə· only rarely precedes a voiceless consonant except a glottal, and never precedes homorganic b or w. At the same time pə- only rarely precedes a voiced stop (sometimes pə- and mə- alternate in words with voiced stops) and never precedes homorganic m or w. kə- does not precede homorganic g. No basic presyllable precedes a consonant identical to the presyllable consonant. Clusters rarely occur following a presyllable. Initials j and j̊ are not observed in disyllabic words (except Bahnar loan bəjaŋ ‘God’).

· · · ·

The presyllable vowel discriminates between disyllabic words and monosyllabic words with an initial consonant cluster in the following pairs: ph- and pəh-; pl- and pəl-; pr- and pər-; th- and təh-; tl- and təl-; tr- and tər-; kh- and kəh-; kl- and kəl-; kr- and kər-; mr- and mər-; also hə- presyllables before nasals and approximants contrast with voiceless nasals and approximants. There is no long-short contrast among vowels as commonly found in related languages, rather the basic distinction is between monophthonged versus diphthonged vowels; all monophthongs tend to be pronounced long in

795

Sedang

main syllables, and short in presyllables. The schwa vowel /ə/ is restricted to presyllables. Table 10.4 Sedang vowels

i e ɛ

(ə)

u o ɔ

iə eə

uə oə

io eo

uo

iɪ

oɛ

ɑ

The following asymmetries in the vocalism are noted:

· Central glides occur in open syllables and before all final consonants except -j, -w, and -ç. · Back glides occur in open syllables and before velars -ŋ and -k, and before -h. · Front glides occur only in open syllables. 2.3 Suprasegmentals Each vowel rhyme is identified as belonging to either one of two registers (R) characterized as tense/creaky and lax/modal. We treat the tense register as the marked register, which is essentially the reverse of the principle applied to most other register languages which—like Mon—contrast a tense/modal phonation with a lax/breathy phonation. Tense register vowels are articulated with a pronounced creak (referred to as laryngealised or glottalised by Smith 1979, 2000, etc.). The term “tense” refers to the tensing of the vocal folds required to produce this marked, slow trillisation of the vocal folds that produces the creaky sound. Some examples of register contrastive pairs: ka ‘to eat’ / ka̰ ‘fish’ kan ‘big’ / ka̰ n ‘chief’ bau ‘to wash face’ / ba̰ w ‘field rice’ pəla ‘chaff’ / pəla̰ ‘elephant tusk; between’ ma ‘eye’ / ma̰ ‘we two (excl.)’ soj ‘to err’ / so̰ j ‘to sarifice’

796

Smith and Sidwell

Spectrograms of the Sedang creaky syllables appear the same as the corresponding clear vowels during the syllable peak with normal pulsing. After the peak the time duration between successive glottal pulses increases and glottal vibrations become irregular. Some pulses are separated by as much as 25 to 32 milliseconds (which, if periodic or continuous, would correspond to as low as 40 to 30 hertz). One series of laryngealised glottal pulses in the syllable [pa̰] has been measured as having successive pulses separated by 11, 13, 15, 18, 18, 24, 32, 25, 27 milliseconds. Spectrographic analysis shows that energy is concentrated about the first and second formants with no lower harmonics visible on the typical spectrogram. The formants may appear quite fuzzy, in that there are many very closely spaced harmonics. In words with diphthongs, approximants, or final nasals, there is generally a point toward the end of the vowel duration at which glottal pulse timing starts to spread out, marking the beginning of the creaky phase—approximating a weak glottal stop—with the creak then carried through even the final approximant or nasal. Vowel nasalisation is contrastive in the main syllable in certain restrictive preceding and following environments. The preceding environment for nasalisation of the vowel must be either ʔ or h, and/or r or w. Most presyllables seem to permit nasalisation of the main syllable vowel if the above criterion is present. The environment following a nasal vowel must be an open syllable (-ø), -h, voiceless stop (although p has not been so observed), and -j and -w. As noted above, the final stops and -h do not occur in tense register rhymes, so do not permit nasolaryngealisation either. These restrictions are as follows. Table 10.5 Sedang environments permitting nasal vowels Cf

-ø

-h

-t

-k

-j

-w

nasalized vowel nasal-creaky vowel

ɛ̃ hɛ̰̃

ɛ̃h x

məhɔ̃ t x

hãk x

həʔũj hã̰

məhãw kəʔõ̰w

2.4 Phonological Reduplication There are two kinds of phonological reduplication, both of which seem to be related to expressive and onomatopoeic vocabulary; reduplicative presyllables and word reduplication. With reduplicative presyllables, Cp is a copy of Ci, and the presyllable vowel is either schwa, -i-, or -u- (e.g. cicow, dudat, ɲiɲɔn). There are also reduplicative presyllables that copy both Ci and vowel (e.g. kiki, hihiə,

797

Sedang

iʔiw), and some that copy initial clusters although without copying the main syllable vowel (e.g. krikro̰w, tritrow). Examples of complete and partial word reduplication follow: Complete word reduplication: ʔblut ʔblut ‘little by little’

Consonant-vowel reduplication: baŋ bal ‘kind of catfish’ daŋ dak ‘to climb up steep mountain’

Initial and final consonant reduplication: ʔbut ʔbat ‘scattered all over the place’ kuŋ kiŋ kɔ̰ŋ ‘elbow’

Initial consonant reduplication: re rɔ̰ŋ ‘a fearful cry’ blu blɛp ‘to rush in’ bliŋ bleə ‘an “x” mark’ cek caŋ ‘to lie on one’s side’ kuŋ keo̰ ‘knee tendon’

Presyllable and initial consonant reduplication: kəʔblou kəʔbla ‘very crowded’ kəʔnok kəʔnak ‘(of wet flesh) to be white and puffy’

3

Word Formation

Sedang word formational processes discussed here are affixation (prefixing and infixing), compounding and reduplication. As a largely isolating language, word formation is mostly derivational. See also Smith (1969a) for a discussion of Sedang affixation using a generative transformational model.

798

Smith and Sidwell

3.1 Prefixing/Infixing 3.1.1 Causative pə-, məThe causative prefixes pə- and mə- attached to monosyllabic verbs (except quotative, equative, or existential verbs) and stative verbs transform these to a transitive verb in which the subject as actor causes the stated predication upon the object as undergoer. These prefixes are semi-productive, in that they may not be indiscriminately affixed to semantically viable verbs though occasionally a native speaker may use it in a new or novel situation. However, not all presyllables pə- or mə- are the causative affix. pə- occurs with words having initial voiceless consonants, and mə- with those having initial voiced consonants or glottal stop. Examples: mə- with morphophonemic change: ta̰ ‘to go around’ taŋ ‘to be erect’ caj ‘to be sick’ mə- without morphophonemic change: ʔat ‘to fast’ ha̰ ‘to open mouth’ r̥iəm ‘to study’ Affixation with pə-: loj ‘to abandon’ kṵm ‘to pile’ rɛh ‘to be alive’

məda̰ ‘cause to go around’ mədaŋ ‘cause to be erect’ məɟaj ‘cause to be sick’ məʔat ‘cause to fast’ məha̰ ‘cause to open mouth’ mər̥iəm ‘to teach’ pəloj ‘cause to abandon’ pəkṵm ‘cause to be piled’ pərɛh/mərɛh ‘to nourish’

Some affixed forms have (vestigial?) nasalization and/or preglottalization not present in the unaffixed forms. ʔow ‘to drink’ riw ‘to awake’ rɛh ‘to be alive’

məʔõw ‘cause to drink’ məˀrĩw ‘cause to be awake’ məˀrɛh/pəˀrɛ̃h ‘to resurrect’

In rapid speech the presyllable mə- may be taken to be a contraction of mɔt ‘to enter’ in some instances. For example: (1)

ga̰ hiə̰ŋ ʔo̰ j ʔa kḭŋ teə ga̰ məhuəm 3sg already remain loc edge water 3sg bathe ‘He was at the edge of the water (and) he went in to bathe.’

799

Sedang (2)

kiə məkɔj tuŋ kuət ghost go.to.sleep inside communal.house ‘The ghost went in to sleep in the communal house.’

3.1.2 Reciprocal təThe reciprocal prefix tə- (apparently quite productive) attached to a transitive verb usually indicates that the predication is common to the plurality if the (necessarily plural) subject actors (i.e. not strictly reflexive). tə- is prefixed only to monosyllabic verb. Sometimes it is realised as ti-. The VP is usually followed by the reciprocal phrase dej pɔ̰ ‘with each other’. Examples: (3)

ma̰ tə-cuə̰ dej 1du recp-obey with ‘We two obey each other.’

pɔ̰ each.other

(4)

prḛj tə-weə̰ŋ dej pɔ̰ 3du recp-help with each.other ‘Those two help each other.’

(5)

mɛ prḛj tə-l ̥ɔ dej medl 3du recp-see with ‘Then those two saw each other.’

pɔ̰ each.other

3.1.3 Adversative ləThe adversative prefix lə- prefixed to a verb (except equative or existential verbs) indicates that the (potential) (object as) undergoer is liable to the adversity of the predication. lə- seems to be an active affix, being used with more verbs and with greater frequency than the causative or reciprocal affixes. ləfrequently cooccurs with the verb saw ‘to be afraid’. Examples: (6)

poə ʔɛh lə-how grandfather 2sg advrs-hate ‘Grandfather, you hate us two.’

ma̰ 1du.excl

(7)

pin saw rəka̰ j lə-lɔ 1pl.inc fear boar advrs-go.through ‘We’re afraid boar will go through (into our gardens).’

800 (8)

Smith and Sidwell ga̰ lə-mɔt tuŋ ko̰ 3sg advsrs-enter inside prox ‘He entered into here (to bother us).’

Prefixing of lə- to a disyllabic verb has been noted: (9)

ʔa̰ saw ʔɛh lə-kəde 1sg fear 2sg advrs-kill ‘I’m afraid you will kill me.’

ʔa̰ 1sg

Such a trisyllabic word gives rise to the possible analysis of lə- as a verbal particle: (1) no other affix is prefixed to a disyllabic word and trisyllabic words are not otherwise within the established syllable pattern; (2) the vowel of lə- is sometimes spoken as a phonetic [ɔ] with slight stress, unlike the more typical schwa. 3.1.4 Nominalizing -ənThe nominalizing infix -ən- is inserted into monosyllabic verbs (except equative and existential verbs) creating semantically related nouns with disyllabic structure. However, this semantic relationship is not strictly predictable; instrumental, resultant and locative have been observed. If the onset of the stem is a cluster, the infix goes after the initial consonant. When the cluster is of the type Cr, the underlying sequence -ənr- is realized as -ədr-. This -ən- infix is productive and usable in new situations. Examples: ciə ‘to dig’ soə̰ŋ ‘to divide’ pa̰ n ‘to raise’ pra̰ n ‘to bait’ kro̰w ‘to roll’

həniə ‘shove-hoe’ hənoə̰ŋ ‘problem’ məna̰ n ‘domestic animals’ mədra̰ n ‘bait’ kədro̰w ‘ball’

3.1.5 Nominalizing kəThe prefix kə- is attached to loans, mostly Vietnamese nouns, as it originates from the Vietnamese classifier cái. Examples: Sedang Vietnamese kəɓaŋ ‘a table’ cái bàn kəhop ‘a can’ cái hộp kəkeo̰ ‘a bottle’ cái kéo kəli ‘a glass’ cái ly kəmajʔ ‘a machine’ cái máy

801

Sedang

3.1.6 Digital təThe digital prefix tə- is used in three instances with the numbers two to four to indicate fingers’ width; in one instance there is an inconsistent voicing of the initial consonant. Examples: peə̰ ‘two’ pa̰ j ‘three’ pṵn ‘four’

təbeə̰ ‘two fingers’ width’ təpa̰ j ‘three fingers’ width’ təpṵn ‘four fingers’ width’

3.1.7 Ordinal məThe ordinal number prefix mə- is apparently borrowed from Bahnar; it is prefixed to the monosyllabic numbers one through four and, less commonly, other numbers. More typical in Sedang is the ordinal number phrase ki +numeral. Examples: məmoj, ki moj ‘first’ məpeə̰, ki peə̰ ‘second’

məpa̰ j, ki pa̰ j ‘third’ məpṵn, ki pṵn ‘fourth’

3.2 Morpheme Reduplication 3.2.1 Complete Morpheme Reduplication Complete morphemes reduplication refers to open class words (usually verbs, stative verbs, and adverbial elements) being repeated in their entirety for emphasis or intensification. Additionally reduplicative concatenation of main verbs indicates a repeated or prolonged action, especially as a device in storytelling. Examples: (10) ga̰ khḛn ga̰ tej tej 3sg say 3sg strong strong ‘She said that he is much stronger.’ (11)

ta more

ga̰ tuə̰n tuə̰n ta ta 3sg smart smart more more ‘He is much much smarter (than they).’

(12) kiə kəcow kəhḛ̃ə kəcow kəhḛ̃ə kəcow ghost spit spittle spit spittle spit ‘The ghost spit spittle, spit spittle, spit spittle.’

kəhḛ̃ə spittle

802

Smith and Sidwell

3.2.2 Partial Morpheme Reduplication Reduplication of a part of a morpheme is (generally) a storytelling style which heightens the climactic points and affords establishment of a rhythm between stressed and unstressed syllables, used with seemingly few constraints. Apart from interrogative reduplication (see below) partial morpheme reduplication is the reduplication of the initial consonant or consonant cluster of monosyllabic verbs, creating new presyllables with an i or u vowel. Examples: (13) mɛ ˀni~ˀnɔ ku~ka mi~mot tuŋ mɛ medl red~pull.out red~bite red~enter inside medl ‘(The snake) came out (of the water) killed (the girl) and entered into that (water).’ (14) bi~brok lu~leə kleə pah red~go red~peel.open intestine snake ‘(He) went and slit open the snake’s intestines.’

Interrogative reduplication is an emphatic response to the unreduplicated interrogative. Examples: ki klaj ‘what?’ ki klaj ki klaj ‘who knows what?’ la laj ‘when?’ la laj la laj ‘who knows when?’

ʔu laj ‘where?’ ʔu laj ʔu la ‘who knows where?’ tɔ laj ‘how many?’ tɔ laj tɔ laj ‘who knows how many?’

3.3 Compounding Most nouns are mono- or disyllabic words, i.e. one phonological word. However, there are also compound nouns and formula nouns which function syntactically like all other nouns. 3.3.1 Compound Nouns Compound nouns consist of two phonological words. Some compound nouns have a meaning derivable from that of their parts. Other compound nouns have a meaning which cannot be derived from their composite parts, and some are composed of parts with no known meaning apart from the compound. Examples: now-pa ‘mother-father: parents’ ɟa-poə ‘grandmother-grandfather: grandparents’ l ̥a̰ -məʔeə̰ ‘leaf-?: paper’

803

Sedang ɲɔ̰ŋ-ʔɔ ‘elder-younger sibling: relatives’ kuə̰n-kiə ‘child-ghost: animals’ ʔɔn-ŋ̊ ej ‘fire-house: family’ bliŋ-bleə ‘an “x” mark’

3.3.2 Formula Nouns Formula nouns group two or four specific items together to represent a generic grouping. pah-po̰w ‘snake-lizard: formula for all forest creatures’ ka̰ -kɛt ‘fish-frog: formula for all water creatures’ cu-ʔḭ-kəpow-rɔ ‘pig-chicken-buffalo-cow: formula for all domestic animals’

4

Phrase and Clause Structure

Clauses are the principal, and frequently only, component of Sedang sentences. The smallest viable Clause is a single Verb or a Verb Complex; maximally Clauses can be concatenated, with subordination and chaining noted without any special constraints beyond those imposed pragmatically. Additional peripheral elements, specifically Adverbials and Final Particles, are also commonly added. 4.1 Basic Clauses Basic clause structure is very simple, with constituents usually ordered according to the following structure: (Subject) Verb Complex (Direct Object) (Indirect Object) (Prepositional Phrase)

or SV, AVP, AVTG All of the bracketed constituents are optional or may be pragmatically elided. We can characterize Basic Clauses by argument structure according to the presence/combination of the post verbal constituents, e.g.: Intransitive: V (PP) Transitive: V (T) (G) Ditransitive: V (T) (G) (PP)

804

Smith and Sidwell

Examples: 4.1.1

Intransitive Clauses

(15) kiə kətɔŋ spirit stumble ‘The spirit stumbled.’ (16) waj co̰ j neo̰ 3pl plant newly ‘They keep planting.’

4.1.2

Transitive Clauses

(17) seə̰ŋ weə̰ŋ ga̰ mɛ spirit help 3sg medl ‘The spirit helped him.’ (18) ga̰ w̥ aŋ ga̰ tuŋ 3sg throw 3sg inside ‘He threw him in the mud.’ (19) ga̰ co̰w Ø tuŋ 3sg put inside ‘He put (it) in the house.’

trap mud

ŋ̊ ej house

(20) co̰ prḛj ko̰w moj tɔ dog 3du bark one clf ‘Their dog barked at an anteater.’

4.1.3

məɲo̰w anteater

Ditransitive Clauses

(21) ŋin ʔa̰ m ʔɛh pəkeə̰ŋ 1pl.excl give 2sg powder ‘We give you the powder.’ (22) pin ɟa ʔɛh troə̰ŋ həda 1pl.incl show 2sg path escape ‘We will show you the escape route.’

805

Sedang (23) waj ʔoh ta ʔa̰ m ko ga̰ m̥ ɛ ka 3pl show not give ben 3sg cooked.rice eat ‘They don’t give him rice to eat.’ Note that a Benefactive (Dative) Indirect Object may be introduced with ko.

4.2 Clause Coordination, Chaining and Subordination Complex Clauses involve variously dependent clauses, typically although not always without a coordinating or subordinating complementizer. Both Basic and Complex Clauses can also be chained, without conjunctions, and more or less without limit. 4.2.1 Clause Coordination 4.2.1.1 Conditional Coordination Conditional sentences are optionally introduced with one of the “if” conjunctions: taŋ ‘if’, co̰w/cu ‘if’, ciŋ ‘if (with improbable condition)’. Examples: (24) ʔɛh ti lɔj ʔu loj 2sg proh believe purp stop ‘(If) you don’t believe (then) that’s all.’ (25) taŋ cuo tɔ̰ poj ta wɛh if in.law laugh imp not turn ‘If you (my in-law) laugh, don’t turn away.’

ʔoh neg

4.2.1.2 Causal Coordination In Causal coordination the dependent clause is optionally introduced with one of the “because” conjunctions: suə̰, la̰ . Examples: (26) ga̰ ko̰ pa̰ ŋ rədej klaj suə̰ ga̰ pa seə̰ŋ weə̰ŋ ga̰ mɛ 3sg prox able strong what because 3sg father spirit help 3sg medl ‘This one was quite strong because he—the father spirit—helped him.’ (27) pa sow wa la̰ m tuŋ kɔŋ hã̰ suə̰ pa wa sɔ ka̰ father pn want go inside jungle also because father want take fish ‘Sow’s father went into the jungle also because father wanted to get fish.’

4.2.1.3 Contrastive Coordination For Contrastive Coordination two independent clauses are coordinated with one of the following contrastive conjunctions glossed as ‘but, except’: suə̰, mɛ, təma. Examples:

806

Smith and Sidwell

(28) ga̰ thḛ kom kuə̰n mɛ kiə ki mɛ wa ka ga̰ mɛ.ʔaj 3sg command wait child but ghost attr medl want eat 3sg soon ‘He said to wait for the child but that ghost wanted to eat him soon.’ Note the use the Assertive Marker ki (attr) above, this is optional. (29) nah kla̰ hiə̰ŋ ka rɔ waj təma ʔoh ta previously tiger already eat cow 3pl but neg not ‘Previously a tiger had killed their cow but (they) didn’t catch (it).’

ca get

4.2.1.4 Correlative Comparative A Correlative Comparative sentence sees each clause introduced with rḛ ‘though, even if’. Examples: (30) rḛ tɔn rḛ ʔiə though longtime though less ‘The longer (he did it) the less (he had).’ (31) rḛ hɛn rḛ caj though more though sick ‘The more (doctors she saw) the sicker (she got).’

4.2.1.5 Resultative Construction Consequential Constructions see the dependent clause (which describes action that transpires in response to the action described in the first clause) introduced with the medial demonstrative mɛ ‘that, so, thus’. Example: (32) waj wa kəde ga̰ mɛ ga̰ n̥ o̰j preo̰ l ̥o̰ j 3pl want kill 3sg medl 3sg immediate return immediately ‘They were about to kill him so he returned immediately.’

4.2.2 Clause Chaining Clauses are readily concatenated if chained in discourse, such independent clauses strung together within a common intonation contour but without any further structural interrelationship. Semantically such clauses are closely related, frequently citing successive rapid action. These concatenated clauses are each analysed as simple sentences. Embedded Clauses The Embedded Clause may be of any clause type, including additional embedded clauses. There is no juncture preceding the embedded clause. Occasionally

Sedang

807

the attributive marker ki (attr) introduces the Embedded Clause, although it can be as easily elided, how its use is governed is not understood. The structure of constructions with Embedded Clauses is: (S) Pred:V-O:CL

Examples: (33) ʔɛh khḛn ʔoh tu wa ʔa̰ 2sg say not not want 1sg ‘You said (that you) don’t want me.’ (34) ʔa̰ taŋ waj to̰ j kla̰ hiə̰ŋ ka co̰ j 1sg hear 3sg tell tiger already eat deer ‘I hear (that) they say (that) a tiger has killed a deer.’ (35) ga̰ ˀnaj maw kiə ka məŋḛ 3sg know anim.pl ghost eat person ‘He knows (that) those ghosts eat people.’

When the embedded clause is a quotative, there is usually a juncture or brief pause preceding the object quotation, and the quotation has an intonation independent of that of the main clause. Examples: (36) kiə khḛn kəbo̰ thḛ ghost say who command ‘The ghost said, “Who commanded?”’ (37) mɛ rəta̰ m mɛ kreo̰ ʔa.piə̰ medl fellow so call pn ‘Then that fellow called A-Pia.’

4.3 Noun Phrases Noun phrases (NPs) are composed of one or more words with, potentially (i.e. barring only ellipsis), a noun as head, and typically denote arguments with roles such as actor, recipient, beneficiary, goal, means, location, etc. NPs are used for the subject, (direct) object, indirect object, locative, and complement elements of clauses and for the peripheral temporal, locative, and adverbial clause elements.

808

Smith and Sidwell

We distinguish five types of Sedang NPs:

· Basic Noun Phrase · Pronoun Reference Phrase · Count Noun Phrase · Pluralised Noun Phrase · Prepositional Phrase The structure of NPs is summarised as follows: Table 10.6 Sedang NP structure

N

-N

-N

– Clause Personal Name – Noun Phrase Pronoun -ki- + – Demon. PronounRef.Phrase – Number – Geograph.Name – DescriptiveName – Prepositional Phrase – Descriptive

– Demonstrative – Locative – Temporal Demonstrative – Nominal Interrogative

Sedang NP structure consists of any one or more items in the order given, but not more than one item per box; hyphenated items are non-initial and optional. 4.3.1 Basic Noun Phrase The basic NP may consist of a single noun phrase head, being a noun, a personal name, or a pronoun. In the basic noun phrase the head is always phrase initial. Examples: kəta ‘egg’ ŋ̊ ej ‘house’ now ‘mother’ ʔa-piə̰ ‘A-Pia (a story princess’s name)’ ga̰ ‘he, she, it’

4.3.2 Complex Noun Phrases A noun phrase head noun may be modified by a second and, infrequently, a third (descriptive or modifying) noun. Examples:

Sedang

809

kəta ḭ ‘egg chicken: chicken egg’ mənat ŋ̊ ej ‘wall house: house wall’ ʔiŋ bɔ ceə̰ŋ ‘side opening gate: the gate side (of village)’

A noun phrase head, whether or not followed by a second or third descriptive noun, may be modified by any of the following: (1) a personal name:

rətam preə̰ŋ ‘youth Preang (a story prince)’ kia neak ‘ghost Neak (a story ghost)’ now ʔa-ɟɔk ‘mother (of) A-Jok’ (parents are named by any of their children’s names)

(2) a personal name or pronoun:

ba̰ w pəʔoŋ ‘Po-ong’s rice’ pəle ga̰ ‘his village’ ŋ̊ ej waj təmo̰ j ‘the strangers’ house’

(3) a geographical name:

kɔŋ suə̰n ‘Vietnamese land’ wa̰ ŋ poə ‘Poa Pass’ teə pəsaj ‘Poxai River’

(4) a descriptive name:

cem l ̥um ‘hlum bird: Kingfisher’ loə̰ŋ plai traj ‘banyan fruit tree’ cem loə̰ŋ plaj traj ‘banyan fruit tree bird’

(5) a verbal descriptive:

məŋḛ ˀmej ‘evil person’ kiə kok ‘crazy ghost’ ɓɔk so̰ i ‘sacrificing westerner: priest’

Any of the preceding varieties of the basic noun phrase may be followed (1) by the assertive particle ki plus an embedded clause, noun phrase, demonstrative, or number, or (2) by a prepositional phrase. The nominaliser tədroə̰ŋ ‘thing, matter, problem’ frequently does not have ki before a clause or noun phrase. ki may be elided before clauses. Neither relative clauses nor prepositional phrases have been observed iterated in this construction. Examples:

810

Smith and Sidwell

(38) peang ki l ̥a side attr leaf ‘the side with leaves’ (39) kiə ki ghost attr ‘that ghost’

mɛ medl

(40) hã̰j ki moj day attr one ‘the first day’ (41) tədroə̰ŋ ʔa̰ ˀnaj thing 1sg know ‘the things I know’ (42) tədroə̰ŋ kɔŋ kliə̰ŋ thing land pn ‘the matter of Kliang land’ (43) tritrow ki ʔa̰ j mɔ̰ŋ mosquito attr exist beak ‘mosquito which has a beak’ (44) pəkeə̰ŋ ki ciəŋ siam kəla powder attr become clump bamboo ‘powder which becomes a bamboo clump’

Further, any of the above varieties of the basic noun phrase may be followed by a demonstrative, locative, temporal demonstrative, or, if the noun phrase is non-specific, by a nominal interrogative. Examples: (45) teə mih mɛ water pn medl ‘that Mih River’ (46) hã̰j ta̰ day dist ‘that day’

811

Sedang (47) ciək cam field down.there ‘the field down there’ (48) kɔŋ suən taj land Vietnamese up.there ‘the Vietnamese country up there’ (49) kɔŋ pin nah land 1pl.incl previous ‘our country formerly’

5

Word Classes

The assignment of word classes in Sedang is problematic, and for the purposes of this sketch somewhat arbitrary, to some extent utilizing the categories employed by Smith (1979). There are no particular phonological traits that correlate with word class, and many lexical items are used in a variety of constructions, suggesting nominal, verbal, descriptive or even various grammatical functions. It is also significant that in the history of Sedang there was phonological restructuring which created a large number of homophones, and this in turn fed some restructuring of semantics and grammar. Consequently it may be more appropriate to recognise that word classes are arising out of grammatical constructions rather than dwelling principally in the lexicon. 5.1 Nouns & Other NP Constituents Nouns form a large open word class of words; while most are mono- or disyllabic phonological words, there are also compound nouns and formula nouns which function syntactically like other nouns. Besides Common Nouns, Smith (1979) also describes several other minor classes; Personal Names, Geographical Names and Descriptive Names, as well as several other kinds of nominals. 5.1.2 Names 5.1.2.1 Personal Names Sedang personal names form a large open class that is almost always (99%) monosyllabic. In determining personal names parents avoid both meaningful words and other known names. With the search for unique personal names there is consequently a skewing of the phonological system, and various

812

Smith and Sidwell

c ommon phonological features of the general vocabulary are rare among personal names. 5.1.2.2 Geographical Names Geographical names identify specific geographic features and do not occur apart from the preceding designator: rivers (tea . . . or dak . . .), countries (kɔŋ . . .), mountains (ŋɔ . . .), mountain passes (wa̰ ŋ . . .), and villages. Village names usually begin with one of the following designators (to show relative use of these designators the number in parentheses indicates the number of specific villages known to use the term as included in the Sedang dictionary): teə (or dak. . ., a Bahnarism) ‘water’ (62) kuə̰n (or kɔn. . ., a Bahnarism) ‘child’ (56) tu ‘an anthill’ (10) wa̰ ŋ ‘mountain pass’ (3) loŋ ‘swamp’ (2)

There are also some village names not utilising one of the above designators. 5.1.2.3 Descriptive Names Descriptive names are assigned to things in nature, as a folk taxonomy that narrows reference from the generic to the specific. Consequently descriptive names do not occur apart from the noun being described. Descriptive names frequently indicate colour, size, etc., although many have no (recoverable) meaning apart from the specific thing so named. For example, the group for kla̰ ‘tiger’ includes: kla̰ məŋḛ ‘man-eating tiger’ (məŋḛ ‘people’) kla̰ cem ‘a small tiger’ (cem ‘bird’) kla̰ həniən ‘a tiger’ (həniən ‘contentment’) kla̰ treə̰ŋ ‘leopard’ (treə̰ŋ ‘?’)

5.1.2.4 Verbal Descriptives The template for NPs includes a slot for Verbal Descriptives (VDs); these are words that might be characterised as Adjectives, but we hesitate to use the label. Essentially any MainVerb that may be used statively can also be used descriptively within NPs. Examples: cḭw ‘to burn’: m̥ ɛ cḭw ‘the burned rice’ kan ‘to be big’: m̥ o̰w kan ‘the big rock’

813

Sedang kom ‘to wait’: tḭw kom ‘the waiting place’ loj ‘to abandon’: plaj loj ‘abandoned fruit’ məŋuə ‘to starve’: həna̰ m məŋuə ‘famine (“year-starve”)’ tiə ‘to answer’: ŋḛ tiə ‘the answering person’ tuən ‘to ripen’: ba̰ w tuən ‘ripe rice’ ʔəcɛ ‘to be near’: pəle ʔəcɛ ‘the nearby village’

5.1.3 Pronouns and Question Words 5.1.3.1 Personal Pronouns Personal pronouns form a small class of words characterised by singular, dual and plural number (there is no dual/plural contrast for the second person), and inclusive and exclusive first person forms, as shown in the table below. Interestingly Sedang pɔ̰ is not cognate with any pronoun of the neighbouring languages (Smith 1974). Table 10.7 Sedang personal pronouns SG

DU

PL

1st Person

ʔa̰

2nd Person 3rd Person

ʔɛh ga̰

ma̰ pa̰

ŋin pin

prḛj

pɔ̰

EXCL INCL

waj

There are additional pronouns with other semantic features:

· kəbo̰ ‘who?, anyone’; personal interrogative and indefinite pronoun · kiklaj ‘what?’; impersonal interrogative · ˀna ‘some(one)’; indefinite pronoun · cuo and kədra̰; in-law respect pronouns cuo is a second person singular pronoun ‘you’ used with all in-laws except brother- and sister-in-laws who use the ʔɔ/maj reciprocal terms. kədra̰ is a third person singular and dual pronoun ‘he, she, those two’ of all married couples if one of them is addressed cuo. 5.1.3.2 Demonstratives The Demonstratives are straightforward; they show three degrees of proximity:

814

Smith and Sidwell

ko̰ ‘this, here’ (prox) mɛ ‘that’ (medl) ta̰ ‘that’ (dist)

There is also an indefinite demonstrative lai translated ‘any’ here. 5.1.3.3 Interrogatives There are three interrogatives (question words) which are used in noun phrases. Examples: laj ‘which?, what?’ e.g.: troə̰ŋ laj ‘which path?, what path?’ klaj ‘which?, what?’ e.g.: kəno̰w klaj ‘which man?, what man?’ kiklaj ‘which?, what?’ e.g.: kɔŋ kiklaj ‘which country?, what country?’ ʔu laj ‘where?’

5.1.4 Measure and Quantity Words 5.1.4.1 Counting Counted amounts are specified with a Count Noun Phrase. A number is phrase initial and, except in special cases of ellipsis must be followed by either a Classifier or Countable Noun which does not require a classifier. The head noun categorised by the classifier follows the classifier but, in a simple count noun phrase, is usually elided if the context is already known or implied. Examples: (50) moj tɔ one clf ‘one frog’

kɛt frog

(51) pṵn ŋḛ rəta̰ m four clf young.man ‘four young men’ (52) peə̰ tɔ two clf ‘two (things/of something)’ (53) pəta̰ m liə̰n five money/piastre (countable noun) ‘five piastres’

The head noun of the count noun phrase is effectively the head noun of a basic NP and may be modified as in basic NPs. Examples:

815

Sedang (54) pa̰ j rənɔ̰ kəsa̰ j ra̰ j three clf vine rattan ‘three pieces of rattan vine’ (55) moj ŋḛ ʔa.piə̰ mɛ one clf pl medl ‘that one person A-Pia’ (56) moj pəleə̰ŋ ra̰ ŋ one clf arrow ‘his one arrow’ (57) moj su one penny ‘one red cent’

ga̰ 3sg

mɛ medl

khej red

5.1.4.2 Numerals Sedang employs a conventional decimal system, typical for the area. The lower numerals and forms for 10, 100, 1,000 and 1,000,000 are as follows: moj ‘one’ peə̰ ‘two’ pa̰ i ‘three’ pṵn ‘four’ pəta̰ m ‘five’

tədro̰w ‘six’ təpah ‘seven’ təheə̰m ‘eight’ təcen ‘nine’

cat ‘ten(s)’ rəpaw ‘thousand(s)’

r̥ ḭŋ ‘hundred(s)’ rətuh ‘million(s)’

Values for ten and above are formed as follows: moj cat ‘10’ peə̰ cat pa̰ i ‘23’ pṵn r̥ ḭŋ ‘400’ pəta̰ m r̥ ḭŋ tədro̰w cat təpah ‘567’ təheə̰m rəpaw ‘8000’

The phrasal tɔ laj ‘how many, how much’ (literally tɔ ‘thing’ laj ‘any/what’) functions syntactically as a numeral, e.g.:

816 (58) tɔ laj ɓa̰ ŋ thing any clf ‘how many fields’

Smith and Sidwell ciək field

(59) tɔ laj r̥ ḭŋ tɔ hənow thing any hundred clf rice.house ‘how many hundred rice houses’

Ordinal numbers are formed by placing ki before the numeral. 5.1.4.3 Classifiers Classifiers form a modest-sized closed set of words and are required for counting most items. Each classifier is used with a specific (though open) set of words which typically have qualities such as shape, size or function in common. The general classifier tɔ ‘thing’ can be used with most items, thus overlapping with many other classifiers. Used without a following number tɔ has the meaning ‘only’ or ‘only one’ and in this usage it usually includes the final particle sɔ ‘only’. Some classifiers are also nouns which require some other classifier when used as a noun; there is also some overlapping within the classifiers; e.g. ʔnoə̰ŋ and pṵm which both classify kəta ‘egg’. A list of classifiers follows: buəŋ: blankets and garments ɓa̰ ŋ /ˀma̰ ŋ: plots/fields; also peə̰m and fish traps ɗɔ: bamboo hənoŋ: ears of corn and stalks of bananas kəmeə: clusters of bamboo kəpaw: garments and nets kəta: flat things; also ‘gongs’, ‘kettles’. kətow: crabs kəto̰w: pliable things held in the hand like clay, cooked rice kəwa̰ : hands of bananas ˀnoə̰ŋ/noə̰ŋ: round objects ŋḛ: humans and human-like beings paŋ: sections of bamboo pṵm: round objects pla: rolled up things rənɔ: long, narrow objects siəm: loə̰ŋ ‘trees’, ‘jugs’, ‘rice wine’

817

Sedang

5.1.4.4 Measure Words Measure words (or Countable Nouns) do not require a classifier when preceded by a number; they belong to several semantic groups of quantity: Time: həna̰ m ‘year’, cow ‘hour’, hã̰j ‘day’, khḛ ‘month’, ma̰ ŋ ‘night’, sej ‘afternoon’, etc. Money: liə̰n ‘a piastre’, kak ‘a tenth piastre’, su ‘a hundredth of a piastre’ Parts of wholes: hədroh ‘verse of song, short time span’, na ‘time, turn’, pa ‘side, half’, poe ‘piece’, soh ‘time, turn’, cuə̰n ‘row in field’ Baskets: caŋ, ceə, rɔ̰, etc., various sized baskets Measurements: həka ‘elbow to finger’, həta̰ ‘thumb to any finger of same hand’, kəsoʔ ‘kilometer’, kra ‘arc length of arms (in measuring tree trunk)’, ˀɲuəŋ /ˀɲoŋ ‘man’s height’, plej ‘finger to finger of outstretched arms’, sɔh ‘finger to chest with outstretched arm’

5.1.4.5 Pluralised Nouns Plurals indicate quantities without use of a specific number. Plural markers precede the basic NP with or without an intervening classifier, or precede a Pronoun Reference Phrase with waj. There are general and animate plural markers: General plural markers: taj taŋ ‘all (of a given set)’, hɛn ‘many’, tṵm ‘every’, rɛm ‘each’, and two compounded emphatic g eneral plural markers tṵm rɛm ‘each and every’ and tṵm taj taŋ ‘every last one’. Animate plural marker: maw ‘they/them’ (anim.pl) is followed by a basic noun phrase whose noun phrase head must be an animate noun (without a classifier), by ki (attr), or by the pronoun waj (3pl).

Examples: (60) taj.taŋ kiə all ghost ‘all ghosts’ (61) hɛn məŋḛ many clf ‘many girls’

droh girl

818

Smith and Sidwell

(62) tṵm ka̰ n pəle every chief village ‘every village chief’ (63) tṵm rɛm ŋḛ each every clf ‘each and every person’ (64) tṵm taj.taŋ pəle each all village ‘every last village’ (65) maw ki ciə tənap anim.pl attr dig grave ‘those who dig graves’ (66) maw waj kra anim.pl 3pl old ‘the male elders’

kəno̰w man

Two of these general plural markers, tai taŋ and hɛn, may occur after (as well as before) a noun (though not after a longer noun phrase). In the post-noun position a reduplicative emphatic form of hɛn also occurs: hɛn hḭn/ hɛn hḭŋ. Examples: (67) kiə taj.taŋ ghost all ‘all the ghosts’ (68) tədroə̰ŋ hɛn problem many ‘many problems’ (69) tədroə̰ŋ hɛn hḭn problem many very.many ‘very many problems’

5.2 Verbs Verbs are described in terms of the places and functions they take within the Verb Phrase (VP). VPs maximally consists of one or more Main Verbs, one or

Sedang

819

two preceding PreVerbs, optionally preceded successively by a Verbal Particle and/or a Preverbal Adverb. All four elements may be present while a Main Verb or PreVerb must be present. The VP may thus be summarised as follows: VP: Preverbal Adverb- Verbal Particle- PreVerb(-PreVerb) ((Main Verb-)Main Verb-)Main Verb

As indicated, both Main Verbs and PreVerbs can be concatenated within the VP, creating Multi-Verb Predications. Main Verbs can be classified according to the argument structures they occur in as follows: 5.2.1 Intransitive Verbs Intransitive Verbs, unsurprisingly, take no object NP. Examples include: ʔeək ‘to defecate’ hiəm ‘to breathe’ heə̰ ‘to vomit’ huəm ‘to bathe’ kɔj ‘to sleep’ sah ‘to play’

5.2.2 Transitive Verbs Transitive Verbs are followed maximally by an object NP. Examples: (70) ka m̥ ɛ eat cooked.rice ‘Eat the rice.’ (71) ha̰ rəkɔŋ open.wide mouth ‘Open (your) mouth wide.’ (72) pɔ ɲa hoe weed ‘Hoe the weeds.’

5.2.3 Ditransitive Verbs Ditransitive Verbs are followed maximally by an object NP and an indirect object PP. Examples: (73) cow ba̰ w tuŋ ŋ̊ ej place rice inside house ‘Place the rice into the house.’

820

Smith and Sidwell

(74) w̥ a̰ ŋ ga̰ tuŋ trap throw 3sg inside mud ‘Throw him into the mud.’ (75) saŋ klɔ̰ŋ ʔa kɔ̰ŋ slide bamboo.cylinder onto arm ‘Slide the bamboo cylinder onto the arm.’

5.2.4 Semitransitive Verbs Semitransitive Verbs are followed maximally by Prepositional Phrases. Examples: (76) caj ʔa return loc ‘Return home.’

ŋ̊ ej house

(77) klɛh tuŋ l ̥aw fall inside bamboo ‘Fall into the bamboo.’ (78) kleŋ Ø ŋɔ ʔeəŋ go.upstream (at) mountain bright ‘Go up Mount Eang.’ (‘Mount Bright’)

5.2.5 Other Main Verb Types Smith (1979) offered several other subclasses of Main Verbs, although the criteria for distinguishing them are somewhat mixed in type: The Existential Verb ʔa̰ j ‘exist’ is characterised as the only member of this subclass by Smith (1979). Equative Verbs occur in equative constructions of the ‘to be/become’ type: se ‘to be’ ʔo̰ j ‘to be’ (also ‘to live’) ciəŋ ‘to become/able to’ (‘to be able to’ as a PreVerb)

Smith also includes a zero copula (Ø) in the above, but one could as well take the approach that there are no zero markers, but that in both Existential and Equative constructions the verb can be pragmatically elided. Quotative Verbs form a small subclass which includes the following and may be followed by a direct quote. Examples:

821

Sedang ʔɛŋ ‘to ask’ tiə ‘to answer’ khḛn ‘to tell, say’ to̰ j ‘to tell, say’ kreo̰ ‘to call’ təmiət ‘to think, consider’ pa ‘to request’ thḛ ‘to command’ (otherwise ‘must’) pəcan ‘to advise, warn’

The distinction between khḛn and to̰ j is sometimes given as a matter of respect. Younger people to̰ j with older people whereas older people khḛn with younger people. 5.2.6 PreVerbs and Auxiliary Verbs PreVerbs follow Verbal Particles and precede the Main Verb. Preverbs may occur alone as a clause or be the only element of the VP present in a clause with the absent Main Verb implied from the preceding context. PreVerbs convey Aspect and/or Mood (there is no Tense as such) and thus may be reasonably characterized as Modal Verbs. Examples include the following. 5.2.6.1

Aspectual PreVerbs

(79) ga̰ ʔu hiət 3sg continue forget ‘She continued to forget.’

dej self

(80) kɔŋ wa mej forest want rain ‘It is about to rain.’

Note that wa means ‘to want’ as a Modal Verb. In temporal/aspectual function, it indicates immediate future. (81) ʔɛh da̰ deə loj you continue water release ‘You continue to pour out (the liquid).’ (82) ʔa.hom ʔo̰j ʔa̰ j q remain exist ‘Is there still more wine?’

drow wine

neo̰ newly

Note that ʔo̰ j means ‘to sit, remain, dwell’ as a Main Verb.

822

Smith and Sidwell

(83) ga̰ pəsiəm mɔt neo̰ she begin enter newly ‘He began to go in some more.’ (84) ʔuən pḛ do.quickly.first.time cook ‘(he) quickly cooked rice.’

pḛ cook

m̥ ɛ cooked.rice

ˀnoj immediately

(85) klej pɔ ɲa neo̰ waj pəte finish hoe weed newly 3pl rest ‘After hoeing weeds for some time they rest.’ (86) ʔa̰ 1sg

pa neg

lḭŋ habitually

ʔu continue

rəŋej sing

suə̰ because

ʔɔ younger.sibling

ʔa̰ ʔu krow 1sg continue cry ‘I cannot continue signing because my child is crying.’

5.2.6.2

Modality PreVerbs

(87) pin ca ka 1pl.incl get eat ‘We can eat boar.’

rəka̰ j boar

Note that ca means ‘to get’ as a Main Verb and is used in modal function in preverbal position. (88) ʔa̰ hɛŋ ka 1sg crave eat ‘I crave to eat fish.’

ka̰ fish

(89) ga̰ ʔoh ta ciə̰ŋ tɔ 3sg neg able wade.through ‘It wasn’t able to cross by itself.’

sɛh self

Note that ciə̰ŋ means ‘to become’ as a Main Verb. (90) pin thḛ deə loj 1pl.incl must water release ‘We must pour (it) out first.’

hədro̰ j in.front

ʔoh neg

823

Sedang (91) ga̰ khen ʔoh ʔo̰ j tuŋ 3sg dare neg remain inside ‘He did not dare to stay in this basket.’

rɔ̰ basket

ko̰ prox

(92) sej ko̰ ʔah məʔeə̰m ka dej kɛt drɔ afternoon prox fut strive eat self frog k.o.frog ‘(In the) afternoon (she) will try hard (to remember) to eat the dro frog.’ (93) pin r̥ɔ ɓlej 1pl.incl able win ‘We are unable to win.’

ʔoh neg

(94) ʔa̰ kaj ɓlej ʔoh 1sg able win neg ‘I’m unable to win.’ (95) ʔoh ta khɔh neg neg permit ‘(It is) taboo.’ (96) ga̰ ta ʔa̰j l ̥a he neg exist die ‘He (definitely) did not die!’

5.2.7 Verbal Particles Verbal Particles precede PreVerbs or a Main Verb (except for the responses hiə̰ŋ ‘already’ and ta hã̰j ‘not yet’). Functionally the Verbal Particles deal with Negation/Affirmation (including negative imperative, although some final particles are also imperatives), Aspect and Interrogation. Verbal Particles may not be concatenated or compounded. Examples include: ʔa hã̰j / hã̰j ‘yet?’, ʔa hã̰j cɔ̰j ‘ever yet?’; with stress on hã̰j the ʔa is frequently elided; the answer requires either dḛ ‘presently’, hiə̰ŋ ‘already’, ta hã̰j ‘not yet’, cɔ̰j (see below) or the response (hã̰j) ˀɲiəh ‘not yet’ ʔa hom / hom yes/no question marker; with stress on the hom the ʔa is frequently elided; the answer requires either a response hom or, ʔoʔoh, ʔo tah or an affirmative statement. cɔ̰j, hã̰j cɔ̰j, ta ʔa̰ j cɔ̰j ‘never’, ta ʔa̰ j cɔ̰j la laj ‘never ever’ used as a response to an ʔa hã̰j question

824

Smith and Sidwell

dḛ ‘currently (progressive action)’ hiə̰ŋ ‘already’ (completed action)’ used as a response to an ʔa hã̰j question ma ta ‘don’t!’ (negative imperative)’ ʔoh pa, pa ‘not (negative); ʔoh is stressed (but may be elided), pa is unstressed ʔoh ta, ʔoh ti, ʔoh tu ‘not (negative); ʔoh is stressed (but may be elided), ta/ti/tu are phonological varients dependent on following segments/syllables poj ta ‘don’t!’ (negative imperative)’ ta hã̰j ‘not yet’ used as a negative answer to an ʔa hã̰j question

5.2.8 PreVerbal Adverbs PreVerbal Adverbs are always initial in the VP, they cannot occur alone but must be followed by at least a PreVerb or Main Verb. Some PreVerbal Adverbs may precede the Subject. Examples include: (97) ŋɛ̰ mɛ ga̰ hənun ʔoh clf medl 3sg usually neg ‘That person usually has no money.’ (98) ga̰ hlik 3sg necessary ‘He had to eat.’

wa about.to

ta neg

ʔa̰ j exist

liə̰n money

ka to eat

Note that hlik was observed only before wa. (99) m̥ eo.pa tɔ̰ strangely laugh ‘(He) laughs unnaturally.’ (100) ʔa̰ m̥ ow.pa ciəŋ 1sg with.great.desire become ‘I really want to be like this.’

sɛh self

ti as

ko̰ prox

(101) ga̰ neo̰ lɔj 3sg newly believe ‘He just now (came to) believe.’

Note that neo̰ means ‘again, more, further, now’ as a Final Particle.

825

Sedang (102) waj wa kəde ga̰ mɛ ga̰ n̥o̰j preo̰ 3pl want kill 3sg medl 3sg immediately return ‘They were about to kill him so he returned immediately.’

l ̥o̰ j immediately

Note that n̥ o̰j frequently co-occurs with Final Particle l ̥o̰ j ‘immediately’ (103) kɔŋ2 rḛ sej forest slowly.but.surely afternoon ‘Afternoon came on slowly.’ (104) kəno̰w suən huəm hoŋ peŋ man also bathe vicinity.of upstream ‘The husband also bathed further upstream.’

Note that suən frequently occurs before the subject. 5.3 Locational and Directional Words Locations and Directions are encoded with Prepositions and Diectics (Adverbials). 5.3.1 Prepositions Prepositional Phrases (PPs) are straightforwardly constructed with Prepositions preceding NPs. They typically follow the VP in intransitive/bitransitive constructions. Examples: ʔa ‘to, toward’: ʔa ɲ̊ ej ‘to the house’ ɓa̰ ŋ ‘with’: ɓa̰ ŋ ʔa̰ ‘with me’ drɛŋ ‘with’: drɛŋ ʔɛh ‘with you’ dro ‘in the middle, in the midst’: dro troə̰ŋ ‘in the path’ hiŋ/hoŋ ‘in the vicinity of’: hiŋ peŋ ‘in the area up above’ kədam ‘beneath, below’: kədam kɔn r̥ ɛŋ ‘below Kon Hreng village’ ˀnaŋ ‘up to’: mɔt ˀnaŋ kraŋ ‘Enter up to the knees’ pəla̰ ‘between’: pəla̰ hãj tḭŋ ‘between sacrifice days’ pəpeŋ ‘above’: pəpeŋ tənej ‘above the ground’ ȿap ‘from’: la̰ m ȿap kɔn tum ‘go from Kontum’ ti ‘up high in, up high on’: ga̰ ʔo̰ j ti loə̰ŋ ‘It lives up in the tree’ tuŋ ‘in, within, into’: tuŋ mɛ ‘in there’ ko ‘to, in relation to’: ko waj kəno̰w ‘vis-à vis the men’ 2 kɔŋ as a noun refers to the physical world (forest, lands etc.) and conceptually to the source of weather and the general conditions of existence.

826

Smith and Sidwell

The preposition ko also has a relator use identifying and marking (sometimes) indirect objects. 5.3.2 Deictic Adverbials Deictic Adverbials of Location/Direction take peripheral positions typically after VPs (but before Final Particles). Examples: (105) lɔ hoŋ cam go.out vicinity.of down.there ‘Go out—on the down side.’ (106) tak to̰ j taj go.up tell up.there ‘Go up there and tell (it).’

Note that cam and taj form an antonymic pair, cam also means ‘downstream’ or ‘southward’ and taj also mean ‘upstream, northward’. (107) ʔɛ͂h, taŋ ti mɛ ʔɛh la̰ m hey if as medl 2sg go ‘Hey, if (it is) like that you go in front.’ (108) rəta̰ m mɛ ʔo̰ j young.man medl continue ‘That fellow was down low.’

hədro̰j in.front

ˀnaj down.low

(109) kəno̰w suən huəm hoŋ peŋ man also bathe vicinity.of above ‘The husband also bathed further upstream.’ (110) la̰ m kəˀnaj go behind ‘to go behind/go last/go after’ (111) loj ʔa ˀŋej release loc up.high ‘leave (it) up high’ (112) ʔa̰ j pəle waj hoŋ suəp exist village they vicinity.of below ‘There is a village of theirs in the area down there.’

neo̰ newly

ʔɛ͂h hey

827

Sedang (113) həni kəˀnɛh tuŋ kədam comb fall inside underneath ‘The comb fell in the place underneath.’

Also, two locatives may occur together, as in the following: (114) ga̰ heo̰ loə̰ŋ ʔa ˀŋej 3sg cimb tree loc up.high ‘He climbed the tree way up high.’

taj up.there

5.4 Conjunctions/Connective Words Generally Sedang only employ conjunctions in clause coordination, see the relevant section. 5.5 Final Particles Final Particles are non-obligatory clause final elements. Sequences of two or (occasionally) three Final Particles may co-occur. The particle ko frequently precedes Final Particles, but its significance is not clear. 5.5.1

Interrogative Particles

ʔɛʔ ‘(question marker)’: suə̰n mɛ ʔɛʔ ‘The Vietnamese, huh?’ ʔɛ͂h ‘(weak question marker)’: saw ʔɛ͂h ‘Afraid, huh?’

5.5.2 Tense/Aspect Particles Tense and Aspect in Sedang is expressed by grammatical particles or adverbs. The existential verb ʔaj can be used to mark an utterance as realis. (115) ga̰ tio pin 3sg follow 1pl.incl ‘He will follow us.’

ʔah fut

(116) ʔɛh ʔa.hom l ̥ɔ rəta̰ m la̰ m troə̰ŋ 2sg q see young.man go path ‘Have you just seen a fellow going on this path?’ (117) ga̰ pak pin 3sg sting 1pl.incl ‘It stung us right away.’

l ̥o̰j immediately

ko rel

ʔaj exist

828

Smith and Sidwell

(118) ɓɔk thẽ ʔa̰ rəŋa̰ j foreigner command 1sg sing ‘The foreigner told me to sing.’

nah previous

(119) ʔɔk neo̰, ka neo̰ pour newly eat newly ‘(He) poured more, ate more.’ (120) prɔ ʔa̰ lɛ̰m ˀnoj make 1sg beautiful immediately ‘Make me beautiful right away.’ (121) ga̰ kuə dej rəpiɪ ko̰ ʔu kəte neo̰ ʔu l ̥a ˀɲɛh 3sg scrape self tongue prox continue break newly continue die finally ‘He scraped this tongue until it broke more and (he) finally died.’ (122) tɔ mɛ poç, thing medl finally ‘Like that, that’s all, I tell.’

5.5.3

ʔa̰ , 1sg

toj tell

Intensifier Particles

(123) məŋḛ tro̰ j pa̰ person same.as 1du.incl ‘A person like us two!’ (124) ga̰ ʔu rəke 3sg continue clever ‘He is very clever.’

ko rel

hɛh exclam

l ̥a̰j very

(125) ga̰ tɔn ˀnaŋ 3sg long.time very ‘He (was away) a real long time.’ (126) cu to̰ j pɛʔ go.down tell imp ‘Go down and tell (them).’ (127) l ̥ik wa ka prḛj mɛ necessary want eat 3du medl ‘(I) certainly want to eat those two.’

ra certainly

829

Sedang

5.5.4

Negation/Affirmation Particles

(128) ʔo̰ j ti loə̰ŋ hoh live up.high tree of.course ‘(It) lives up in a tree, of course!’ (129) klej mɛ prḛj mɛ suən ʔoh ta ca kɔj ɓa̰ ŋ hã̰ joh after medl 3du medl also neg neg get sleep com also of.course ‘After that those two also weren’t able to sleep together, of course.’ (130) ma̰ wa la̰ m 1du.excl want go ‘of course we want to go!’

koh of.course

(131) hiə̰ŋ klej hã̰ mɛ already finish also medl ‘(I) have also finished. . . .’

Note that mɛʔ communicates a weak suggestion of resignation. (132) wa mɛʔ want certainly ‘(I) certainly want to.’

Note that mɛʔ is usually employed in brief responses. (133) pəˀlaŋ ʔõʔ goodbye okay? ‘bye now, okay?’ (134) poj ta ʔa̰ m ma̰ imp neg give 1du ‘No, don’t give us any.’

5.5.5

ʔoh neg.emph

Other Final Particles

(135) ga̰ hiə̰ŋ pok kɔ 3sg already grey head ‘He is also grey-haired.’

ko rel

hã̰ also

830

Smith and Sidwell

(136) mɛ pin ca ka rəka̰ j, ci, kḛ tuŋ kɔŋ, hiə̃.̰ hã̰ medl 1pl.incl get eat boar barking.deer thing inside forest etcetera ‘Then we can eat boar, barking deer, jungle animals, deer, etc.’ (137) ʔa̰ təpuj ɓa̰ ŋ cu ɓa̰ ŋ cɔ̰ 1sg talk com pig with dog ‘I only talk with the pigs and the dogs.’

mɛ medl

te only

(138) waj saŋ dej ʔa kɔ̰ŋ ko̰ sɛʔ 3pl slide with loc arm prox as.you.know ‘They slide (them) on their arms, as you know.’ (139) tuŋ khḛ ʔa̰ j tɔ inside moon exist for.sure ‘On the moon there is only sand.’

prej sand

sɔ only

5.6 Temporals Sedang Temporals are divided into Temporal Adverbs and Temporal Nouns/ NPs. Temporals precede Main Clauses, establishing the temporal context of the clause. 5.6.1 Temporal Adverbs The Temporal Adverbs form a small class of words/phrases that include the following: ʔa hədro̰ j ‘at first’ la lai ‘when?’ no ko̰ ‘now’ pəʔiə ‘in a little while’ tɔn ‘for a long while’

The Final Particle neo̰ ‘more, again, further’ can be used after a Temporal as an intensifier or a marker of subsequent action: no ko̰ neo̰ ‘and now’ pəʔiə neo̰ ‘and then again, in a little while’

831

Sedang

5.6.2 Temporal Nouns/NPs Temporal Nouns/NPs are constructed like basic NPs, but are used in the archetypally adverbial pre-clause position much like the Temporal Adverbs. Examples: hãj ‘day’ khḛ nah ‘last month’ hãj ko̰ ‘today’ khḛ moj ‘month one: January’, etc. hãj nah ‘yesterday’ kəsej ‘in the afternoon’ hãj m̥ ej ʔah ‘day after tomorrow’ sej ko̰ ‘this afternoon’ hãj m̥ ej nah ‘day before yesterday’ la ‘time’ (hãj) ʂɔ ʔah ‘tomorrow’ la laj ‘when?’ hãj tḭŋ, hãj pəte ‘Sunday’ la mɛ nah ‘that former time’ hãj moj ‘day one: Monday’, etc. roh ‘period’ hãj kəsoʔ moj ‘day number one: first day’ roh nah ‘ancient times’ hãj dḛ ‘noontime’ roh ko̰ ‘these times’ pəlã mãŋ tḭŋ ‘week’ rəno̰ ‘season’ pəlã mãŋ tḭŋ ko̰ ‘this week’ khe ‘month’ rəno̰ kɔŋ mej ‘rainy season’ ʂɔ ʔah m̥ oj ‘eventually, at some time in future’

6

Sedang Text: The Rat Trap

This is a story told by a soldier in Nguyễn Hụê hospital 23/1/1973. ko̰ ʔa̰ wa təpuj tədroə̰ŋ kəna̰ j prox 1sg want tell story rat ‘Now I want to tell the story of a rat.’ ʔa̰ ʔo̰ j ʔa ŋ̊ ej nah ʔa̰ ɲɔ̰ŋ ti.ta̰ ŋ pej sɛh kətrɔ joh 1sg be loc home previous 1sg elder look.for set self bowtrap of.course ‘Some time ago I was at home I looked to set us a rat trap.’ ti.ta̰ ŋ pej kətrɔ wa ta̰ ŋ sɛh kəcaj ka look.for set bowtrap want hunt self food eat ‘To look to set a rat trap in order to hunt for something to eat.’ caŋ kətrɔ kəna̰ j mɛ ʔa̰ ˀnaj hãk if bowtrap rat medl 1sg know happy ‘If the trap got a rat I would know a little happiness.’

ʔiə little

832

Smith and Sidwell

ʔa̰ j kəcaj ka exist food eat ‘Then I would have something to eat.’ ʔa̰ məŋḛ ti ʔa̰ j liə̰n ʔoh 1sg person not exist money exclam ‘I am a person who does not have any money.’ wa roe kəcaj ʔa̰ suə̰n ko̰ want buy food 1sg Vietnamese prox ‘I have to buy my rice from the Vietnamese.’ ʔa̰ pej kətrɔ ʔa̰ j 1sg set bowtrap exist ‘I set the trap and got a rat.’ ʔa̰ hãk sɛh ʔɔ 1sg happy self exclam ‘Boy was I ever happy!’

kəna̰ j rat

ˀnaŋ for.sure

7 Bibliography Baudesson, Henry. 1932. Indo-China and its Primitive People. Translation by E. Appleby Holt. London: Hutchinson. Cupet, Pierre Paul. 1893. Chez les populations sauvages du sud de l’Annam. Devereux, George. 1937. Functioning units in Ha(rh)ndea(ng) Society. Primitive Man 10.1: 1–7. ―――. 1938. Principles of Ha(rhn)de:a(ng) Divination. Man 38.143: 125–7. ―――. 1947. The potential contributions of the Moi to the cultural landscape of Indochina. Far Eastern Quarterly 6/4: 390–5. Guilleminet, Paul. 1952. Coutumier de la tribu Bahnar, des Sedang, et des Jarai de la province de Kontum: selon le coutume appliqué dans les tribunaux de cette province de 1908 a 1938. Hanoi & Paris, Publications de l’Ecole Française d’Extrême Orient 32. Hickey, Gerald C. 1964. Sedang. In Frank M. LeBar, Gerald C. Hickey and John K. Musgrave (eds.), Ethnic Groups of Mainland Southeast Asia. New Haven: H.R.A.F.P., 146–149. ―――. 1967. Some Aspects of Hill Tribe Life in South Vietnam. In Peter Kunstadter (ed.), Southeast Asian Tribes, Minorities, and Nations, vol. 2. Princeton, N.J.: Princeton University Press, 745–70.

Sedang

833

Hoffet, J.-H. 1933. Les mois de la chaine annamitique entre Tourane et Les Boloven. Terre Air Mer 59.1: 1–43. Maitre, Henri. 1912. Les populations de l’Indo-Chine. Bulletin et Memoires de la Soeieté d’Anthropologie de Paris (6e sér.) 3: 107–15. Mansuy, H. 1929. Prehistoire et protohistoire. In Georges Maspéro (ed.), 63–80. Maspéro, Henri. 1929. Moeurs et coutumes des populations sauvages. In Georges Maspéro (ed.), 233–55. Maspéro, Georges (ed.). 1929. Un empire colonial français: l’lndochine. Paris & Brussels: Les Éditions G. van Oest. Smith, Kenneth D. & Marilyn Smith 1967–1977. Sedang dictionary. Dallas: Summer Institute of Linguistics microfiche publications. Smith, Kenneth D. 1967b. Sedang vocabulary. Saigon: Summer Institute of Linguistics (also republished by Summer Institute of Linguistics Manila 1977). ―――. 1967c. A phonological reconstruction of Proto Central North Bahnaric. University of North Dakota Summer Institute of Linguistics Workpapers. ―――. 1967d. Sedang dialects. MA Thesis, University of North Dakota. ―――. 1969a. Sedang affixation. Mon-Khmer Studies 3: 108–129. ―――. 1969b. Sedang ethnodialects. Anthropological Linguistics 11.5: 143–147. ―――. 1972. A phonological reconstruction of Proto-North-Bahnaric. Dallas, Language Data Series, Summer Institute of Linguistics. ―――. 1973. More on Sedang ethnodialects. Mon-Khmer Studies 4: 43–51. ―――. 1974. Sociolinguistics and the Bahnar Pronoun System. Typescript. ―――. 1975a. Phonology and syntax of Sedang: a Vietnam Mon-Khmer language. PhD. dissertation, University of Pennsylvania. ―――. 1975b. The velar animal prefix relic in Vietnam languages. Linguistics of the Tibeto-Burman area 2.1: 1–18. ―――. 1976. North Bahnaric numeral systems. In Gérard Diffloth & Norman H. Zide (eds.), Austroasiatic number systems. The Hague: Mouton, 61–64. ―――. 1979. Sedang grammar. Pacific Linguistics. Series B–50. Canberra: Australian National University. ―――. 2000. Sedang dictionary. Mon-Khmer Studies Special Volume No. 1. Dallas: Summer Institute of Linguistics. Soulié, Maurice. 1927. Marie 1er, roi des Sedang, 1888–1890. Paris: Marpon.

section 6 Katuic

․․

chapter 11

Kui Ntua Kees Jan Bos and Paul Sidwell 1 Background The Kui language (kdt, also Kuy, Kuoy) is spoken in Cambodia, Thailand and Laos (where it is also known as Souei), with an estimated speaker population of around 400,000. Sidwell (2005: 7) classifies Kui as a member of the Western sub-branch of the Katuic branch of Austroasiatic. Throughout this sketch the term Kui refers to the Ntua dialect spoken in Cambodia, which is the variety described in this sketch. The word Kui /kuəj/ simply means ‘human/person’ in the language, and has cognates in various Austroasiatic branches (e.g. Car Nicobar /kuj/ ‘head’, Nyah Kur /kúəj/ ‘Thai (people)’, Kensiu /kuj/ ‘head’). In Cambodia, the Kui are an ethnic minority of at least 20,000, located primarily in the two central northern provinces of Kampong Thom and Preah Vihear. Four distinct but closely related dialects have recently been identified: Kui Ntra, Kui Ntua, Kui M’ai and Kui Mla. The first two dialects have the most speakers and show the highest language vitality (Markowski, 2005). The data for this sketch was collected by Kees Jan Bos (SIL International) working with Kui Ntua speakers in Promey village (Promey commune, Tbeng Meanchey district, Preah Vihear province), with Paul Sidwell (Australian National University) contributing significantly to the analysis and write-up. The current analysis is based on a corpus of 3,000 words plus a collection of texts. Due to political and economic factors, low social status and schooling exclusively in Khmer, the Kui are being assimilated rapidly into the dominant Khmer society, along with a shift to monolingualism in Khmer. There are only a few pockets of (the more remote) villages left where Kui is still being used as first language. Recently language preservation and language-based development efforts have been started in a number of Kui communities (Bos et al., 2008). Members of both major dialects were involved in developing a Kui script and writing system, which is based on that of Khmer. This work was officially recognized by the Institute of National Language of the Royal Academy of Cambodia in July 2008.

© koninklijke brill nv, leiden, ��5 | doi ��.��63/9789004283572_017

838

bos and sidwell

2 Phonetics/Phonology 2.1 Word/Syllable Structure As typical for Austroasiatic languages of Indo-China, Kui has both monosyllabic and sesqui syllabic words. In the latter, the stressed main syllable is preceded by an unstressed minor syllable or pre-syllable. Additionally there is a modest number of disyllabic loans (such as kaːkwe ‘coffee’). The indigenous phonological word template can be schematised as follows: minor-σ main-σ (C(C)V) C(C)V(ː)(C) ((CV)N)

2.1.1 Minor Syllable Phonotactics Approximately one-third of the words in the corpus studied have a minor syllable of one of the following structures: CV, CCV, CVN or N̩ (syllabic nasal). The vowel in the minor syllable is a short and unstressed and shows no productive contrast. Depending on the immediate phonetics context it can be pronounced as [ɜ], [ɔ], [ɪ] or [ə]; by default it will be treated as a schwa [ə] throughout. CV minor syllable onsets are restricted to /p, b, m, t, r, l, s, c, k/. Examples: /pəbaːt/ ‘to chew’ /ləwəːj/‘weak’ /bəraːɲ/ ‘side’ (location) /sədaːw/ ‘night’ /məsiəw/ ‘kettle’ /cəkɒːŋ/‘carry behind the shoulder’

In CCV minor syllable the onset can only be /p, t, c, k, s/ while the second consonant is restricted to /l, r/. Examples: /kləbɒk/ ‘brains’ /prəɟəːŋ/ ‘north’ /trəbɒh/ ‘beak’

/crəlia/ ‘thorn’ /krəwiən/ ‘sickle’ /srəŋo̤ ːk/ ‘to snore’

With minor syllables CVN and N̩ the nasal is assimilated to the place of articulation of the following C. Examples: /mbɜm/ ‘pounded rice’ /mpɨːt/ ‘very big’ /pəmboːl/ ‘intoxify’ /məmbaj/ ‘last night’

/ɲciːŋ/ ‘measuring scales’ /ŋkit/ ‘little, small’ /məntɒːr/ ‘star’ /təmlah/ ‘ploughed soil’

839

kui ntua

2.1.2 Main Syllable Phonotactics Consonant clusters in mainsyllable onset are limited to stop plus approximants of fricative /w, r, l, s/ or aspirated stop plus nasal or lateral. Some examples of consonant cluster combinations: /kwial/ ‘to herd’ /stia/ ‘handspan’ /tʰnoːr/ ‘grave’ /prɛh/ ‘to germinate, sprout’ /brɒw/ ‘mountain’

/kruɒp/ ‘to cover’ /cʰlɔh/ ‘to illuminate’ /klɒː/ ‘snail’ (generic) /ksaːl/ ‘wind’ /psaj/ ‘full, sated’

Consonant combinations involving /s/ are often pronounced ambiguously; in casual speech the Kui tend to break up the consonant clusters by including a short [ə] vowel. Examplesː /srɒŋ/ → [sərɒŋ] ‘bamboo’ /ksaɲ/ → [kəsaɲ] ‘snake’

2.2 Phoneme Inventories The full set of main syllable onsets is tabled as follows: Table 11.1 Kui mainsyllable onset consonants

stop aspirated stop voiceless stop voiced nasal fricative trill approximant

pʰ p b [ɓ] m

w [β]

tʰ t d [ɗ] n s r l

cʰ c ɟ ɲ

kʰ k ŋ

ʔ

h

j

Voiced stops are sometimes [+implosive]; by default they will be treated as [-implosive] throughout. The labial-velar approximant /w/ has the allophones [β] in onsets and [w] in codas.

840

bos and sidwell

Main syllable codas are tabled as follows: Table 11.2 Kui mainsyllable coda consonants

stop nasal trill approximant

p m

t n r l

w

c ɲ j

k ŋ

ʔ

h

It is also observed that the final /r/ is occasionally produced as [l], although in the corpus there are clear examples of minimal pairs that contrast on the final consonants /l/ and /r/. The main syllable vowel phonemes are shown below. Table 11.3 Kui main syllable vowels

close close-mid open-mid open diphthong

i iː e eː e̤ː ɛ ɛː ɛ̤ ɛ̤ː

iə ia

ɨː ə əː ə̤ː ɜ ɜː ɜ̤ a aː a̤ ɨə

u uː o oː o̤ o̤ ː ɔ ɔː ɔ̤ ɒ ɒː ɒ̤ ua uə

Allophonic variation:

• • • •

The close-front vowel short /i/ becomes [ɪ] before final palatals, otherwise it only occurs before /h/. The close-mid central vowel /ə/ is raised to [ɨ] before a final /h/. When close/close-mid vowels are followed by a final /h/ or /ʔ/ they do not contrast in length. In the presence of velar or glottal consonants the more open back vowels sometimes undergo phonetic nasalization.

841

kui ntua

2.3 Suprasegmentals Like various other Katuic languages (see Miller & Miller 1995), Kui is a register language. This means that a vowel can have either a ‘clear’ or ‘breathy’ voice quality (first and second register, respectively). The Kui refer to this phenomenon as ‘light’ or ‘heavy’ voice. However, in the Kui dialect researched here the close vowels (and some close-mid vowels) have no clear register pattern. Among the open-mid and open vowels a clear pattern exists, with the second register marked with a central vowel onglide, as has been observed for other Katuic languages (Miller & Miller, 1995). The following onglided ‘breathy’ vowels occur (from front to back position): /ɛ̤/ > [əɛ̤] /a̤/ > [əa̤] /ɜ̤/ > [əɜ̤] /ɒ̤ / > [əɒ̤ ]

/_ h, ʔ /_ k, ŋ, h, ʔ /_ p, m, t, n, l, r, ʔ /_ k, ŋ, h, ʔ

Long open-mid and open vowels with second register have not been observed (except for /ɛ̤ː/ which only occurs in words borrowed from Khmer). Long vowel /ɔː/ [ɔ̝ ː] is pronounced somewhere between the open-mid and close-mid back vowels [ɔː] and [oː]. 3 Morphology Kui is a predominantly isolating language with no inflectional morphology. Grammatical relations are generally realized without morphological marking, affixation appears to be mainly derivational. The following observations are made in relation to morphological processes. 3.1 Compounding 3.1.1 Compound Verbs Various verbal compounds are created with a head followed by another verb, a noun or adverb to form a single lexical unit that refers to a single event. These include coordinated compounds as a conjunction of opposites, compositions creating new meaning, and rhyming compounds formed with nonse words. Examplesː luaj daːʔ caː dɒːj ʔiː plɒː

swim+water (v+n) ‘to swim’ eat+rice (v+n) ‘to eat’ sore+head (v+n) ‘to have a headache’

842

bos and sidwell

bɛc ləŋeːt

lie+asleep (v+adv)

‘to sleep’

bɛc caː ɲceː suər dɒːʔ cəpɔ̤h ciə cɒw sɒh sɛːŋ

lie+eat (v+v) speak+tell (v+v) keep+discard (v+v) come+go (v+) ascend+descend (v+v)

‘to be idle’ ‘to narrate’ ‘to desert’ ‘to and fro’ ‘up and down’

pət pɔŋ

gone+ nonse word

‘to be lost’

3.1.2 Compound Nouns Compunding of nouns is apparently productive in Kui. The listing below provides examples divided into three groups reflecting the taxonomy of Bisetto & Scalise (2005). Such forms as listed below are accepted as compounds because their usage is highly lexicalised. Attributive compounds exocentric blɒŋ kəpɔŋ (alc.beverage+can) = ‘canned beer’ kaː kəpɔŋ (fish+can) = ‘tinned fish’ dɔŋ təmaːw (dwelling house+stone) = ‘brick house’ cɛːk daːʔ (medicine+liquid) = ‘liquid medicine’ Subordinate compounds exocentric prɒ̤ ŋ təkɒːŋ (hole+neck) = ‘throat’ klɒːŋ mɜ̤t (round object+eyes) = ‘eye balls’ hlaː luɒŋ (leaf-like object+wood) = ‘leaf’ pələj luɒŋ (fruit+wood) = ‘fruit’

endocentric (all generic/specific type) kaː səmoːɲ (fish+name) = ‘k.o. fish’ taːk kɛh (metal+to snap) = ‘lighter’ klɒː pieh (snail+name) = ‘k.o. snail’ tənɒːm priət (pole-like object+banana) = ‘banana tree’

Coordinate compounds exocentric exocentric kɒːn cɛːm (child+to keep) = ‘animal’ mɛː keːʔ (mother+small) = ‘younger aunt’ tənɒːm luɒŋ (pole.like object+wood) = ‘tree’ kɒːn ŋaː (child+little) = ‘newborn’ muh mɜ̤t (nose+eyes) = ‘face’ kɒːn ntrɒ̤ k (child+cow) = ‘calf’ kətɛːk loː (ground+plot) = ‘plot of land’ kɒːn kuəj (child+person) = ‘doll’

843

kui ntua

3.2 Derivational Affixes Morphological causation, which increases valence of the verb by one degree, is marked on intransitive verbs by prefixation or infixation. The causative affix has allomorphs pəN- on monosyllabic roots and -m- as an infix on sesquisyllables, creating a coda for the minor syllable. The nasal of pəN- assimilates to the onset of the following segment, while -m- only shift to -ŋ- before glottals but otherwise does not change. Stem + pVN- boːl ‘drunk’ pəmboːl ‘intoxicate’ bɔːl ‘run’ pəmbɔːl ‘cause to run’ bɛc ‘sleep’ cəmbɛc ‘cause to sleep’ ntruh ‘fall’ pəntruh ‘drop’ lɒp ‘stupid’ pənlɒp ‘cheat’

Stem + -mtəjɜŋ ‘stand’ təmjɜŋ ‘cause to stand’ kəcɛːt ‘dead’ kəmcɛːt ‘kill’ pəʔɜːl ‘scared’ pəŋʔɜːl ‘scare’ təkɒw ‘sit’ təmkɒw ‘cause to sit’ təliəj ‘broken’ təmliəj ‘break’

3.3 Reduplication Adjectives and Adverbs are often reduplicated for augmentation/plural or to indicate repeated or distributed action. In the following examples the redupication of the attributive indicates that there are multiple children and buffalos. (1)

pɛ̤ːt cɒw cat ʔɒːn kɒːn ŋ̩kiːʔ~ŋ̩kiːʔ doctor come inject give child small~red ‘The doctor is coming to give injections to small children.’

(2) triəʔ plɜm~plɜm to̤ ʔ buffalo fat~red dist ‘Those fat buffalos.’

Interestingly, animal names, in particular of smaller creatures, are often composed of a duplicated base word, either the entire syllable is redupilcated, or the just the onset with a linking vowel. The individual morphemes appear to have no independent meanings. The following examples were encountered: Redupilcated initials cəcrɛ̤h ‘flying termites’ tətak ‘woodpecker’ məmɒ̤ ŋ ‘k.o. little fly’

Reduplicated syllables kuɒ kuɒ ‘k.o. millipede’ cɒk cɒk ‘small lizzard’ leːp laːp ~ laːp laːp ‘butterfly’

844 4

bos and sidwell

Phrase and Clause Structure

4.1 Simple Sentences 4.1.1 Declarative Statements Statements exhibit the basic SVO word order and have neutral intonation. Examples: (3) nɔːɲ bɛc boy sleep ‘The boy is sleeping.’ (4) cuəh to̤ ʔ cɒh cətɒːŋ man dist plant bean ‘The man over there is planting beans.’ (5) kruː.pɛ̤ːt ʔɒːn beːʔ ʔiː krən.caɲ muŋ doctor give person ill malaria mosquito.net ‘The doctor is giving the malaria patients mosquito nets.’ (6) kəː pə̤ːʔ cuəh.tʰaːw bɛc kuə lɜŋ pə̤ːŋ brɒw 1sg see tiger sleep stay in cave mountain ‘I saw a tiger sleeping in a mountain cave.’ (7)

haj mɜʔ pə̤ːʔ cuəh.tʰaːw bɛc kuə lɜŋ pə̤ːŋ brɒw ʔəː 1incl neg see tiger sleep stay in cave mountain emph ‘We didn’t see a tiger sleeping in a cave.’

A third person subject is often followed by the generic third person pronoun naːw, in what we might call a Resumptive construction: (8) kuəj naːw wuɒ truŋ liːk jaŋ.nia person 3 make cage.fold pig how ‘How do people make a pig pen?’

Frequently in normal speech arguments may be omitted (zero-anaphora). Since there is no agreement marking on the verb, the referents are interpreted based on the immediate context. Examples: (9) maj ciə pʰsaːr dɛːl te̤ː mɜʔ bəːn ciə ʔəː 2 go market also neg neg get go emph ‘Are you going to the market too? No. [I] cannot go [to the market].’

kui ntua

845

4.1.2 Questions Polar questions A yes-no question can be asked with a raising intonation clause finally, or with a falling final pitch if the topic is fronted. Examples: (10) niː ʔaːw maj ↑ prox shirt you ‘Is this your shirt?’ (11) ʔaːw maj niː ↓ shirt you prox ‘Is this your shirt?’

Also, a number of clause-final particles can give rise to closed questions, each with particular intonation. waːʔ, with final rising pitch, or a simple yes-no question: (12) maj pə̤ːʔ cuəh.tʰaːw bɛc kuə lɜŋ pə̤ːŋ brɒw waːʔ ↑ 2 see tiger sleep stay in cave mountain q ‘Did you see a tiger sleeping in a cave?’

hɜːj to ask about the completion of an event, which can be answered by saying ‘not yet’ or ‘already’, with final falling pitch. (13) maj pə̤ːʔ cuəh.tʰaːw bɛc kuə lɜŋ pə̤ːŋ brɒw hɜːj ↓ 2 see tiger sleep stay in cave mountain nsit ‘Have you already seen the tiger sleeping in the cave?’

rɜː mɜʔ meaning ‘or not’ for an alternative question, with neutral intonation: (14) maj pə̤ːʔ cuəh.tʰaːw bɛc kuə lɜŋ pə̤ːŋ brɒw rɜː mɜʔ 2 see tiger sleep stay in cave mountain or neg ‘Did you see the tiger sleeping in a cave or [did you] not?’

bɜː for a tag question, with rising-falling intonation: (15) cuəh.tʰaːw ʔɛŋ kəcɛːt me̤ːn bɜː ↗↘ tiger ana.dist die true tag ‘Is that tiger truly dead, or not?’

846

bos and sidwell

Alternative questions: Either/or questions are straightforwardly formed by contrasting two clauses with a conjunction: (16) dɜh maj ŋuɒc daːʔ rɜː ŋuɒc kaːkweː ago 2 drink water or drink coffee ‘A minute ago, did you drink water or coffee?’

Wh questions In the case of Wh-questions the question words, ntuɒ and cəntuɒ ‘what?’ are usually accompanied by rising intonation. (17) maj səʔiə caː ntuɒ ↑ 2 want eat what ‘What do you want to eat?’ (18) niː cəntuɒ ↑ prox what ‘What’s this?’

The question word kɜːt ntuɒ ‘why’, literally ‘what is’, is used clause-initially: (19) kɜːt.ntuɒ naːw wuɒ truŋ liːk mɜʔ ro̤ ːc why 3 make cage pig neg achieve ‘Why did they not manage to make the pig pen?’

The question particle nia ‘which’ occurs in situ: (20) maj səʔiə tac ʔaː nia 2 want sell ref which

ʔaː kiːʔ rɜː ʔaː pɨːt ʔaː ref small or ref large ref

kan rɜː ʔaː trɔːh female or ref male ‘Which one do you want to sell? The small one or the big one, the female one or the male one?’

The word nia is also used in conjunction with jaŋ (‘as’ or ‘like’) to create howquestions. It occurs in situ too.

kui ntua

847

(21) kuəj naːw wuɒ truŋ liːk jaŋ.nia person 3 make cage pig how ‘How do people make a pig pen?’

When? questions are distinguished as to whether the event is in the past (tɜh naː) or in the future (pɒ̤ h.nia). The former occurs clause-initially, the latter clause finally. (22) ciə pʰsaːr pɒ̤ h.nia go market when ‘When do [you] go to town?’ (23) təh.naː kɒːn maj n̩ druh sɛːŋ rɜː dɔŋ when child 2 fall descend from house ‘When did your child fall from the house?’

Where? naː can be clause-initial or clause-final. (24) ciə naː go where ‘Where [do you] go?’

The Who? question word ŋkuəj occurs in situ (and can occur in the subject, object and indirect object positions). (25) maj ciə rɜː ŋkuəj 2 go with who ‘With whom do you go?’

The question word for quantity is mənia (how much/many) which is used for both mass and count. The quantity word follows the object it refers to, and can be followed by the noun classifier. The NP that it modifies occurs in situ. (26) maj kɜːt kaʔ mənia 2 exist money how.much ‘How much money do you have?’

4.1.3 Commands (imperative) Kui imperatives are formed using the command word ciə, which is also the verb ‘to go’. It is used clause-finally, and in addition can be clause initial for greater effect. Of course, non-verbal guestures also play an important role here.

848

bos and sidwell

(27) ciə maj sɒh dɔŋ ciə go 2 ascend house imp ‘Go into the house!’

A softer form to mobilize someone to do something is to use a first person inclusive word, cɛ̤h (‘let’s’; hortative mood), such as when one wants an official guest to enter the house. With the word cɛ̤h the speaker is always included in the action. (28) cɛ̤h sɒh dɔŋ ŋuɒc daːʔ sɛn hort ascend house drink water first ‘Let’s go into the house and drink water first.’

Imperatives use a special form of negation ɲcɒ̤ ʔ (‘don’t’ proh). This negation is found just before the verb, like the negation word mɜʔ. The particle dəː can be added clause finally for softening effect. (29) maj ɲcɒ̤ ʔ sɒh dɔŋ dəː 2 proh ascend house ptcl ‘Don’t go into the house.’

There is another use of the negative imperative ɲcɒ̤ ʔ in the sense of ‘if it hadn’t been for . . .’ which is a conditional adverbial clause type (see Section 4.2.4). 4.2

Complex Sentences

4.2.1 Coordination, Chaining, and Subordination Coordination takes place between clauses of equal ranking. Two clauses are either juxtaposed or connected using a conjunction. Kui uses the following conjunctions: rɜː mɒːŋ tɜŋ / lɒŋ

‘and’ or ‘or’ (only at NP level and VP level) ‘but’ (only on sentence level) ‘as well as’

Example of juxtaposition: (30) beːʔ.re̤ːn ʔɛːl kɒŋ cih ciə dɔŋ ciə krɔːj student take bicycle drive go house go back ‘The students took their bikes and rode back home.’

kui ntua

849

Example with mɒːŋ: (31) mɛː kəː səʔiə ʔɒːn kəː ciə mɒːŋ kəː kəcəl ciə həːj mother 1sg want give 1sg go but 1sg reluctant go nsit ‘My mother wants me to go, but I cant’ be bothered to.’

In a clause that contains sequential verb phrases the VPs are juxtaposed. (32) cəlah dəh haj ciə toːr ruɒc kɒŋ morning just 1incl go buy tube bicycle

toːr ləpiː toːr səkɒːr buy fermented.fish buy sugar ‘This morning I went to buy inner tubes, fermented fish and sugar.’

For a sequence of events that is to take place, the inclusive conjunction lɒŋ (‘as well (as)’) is used. (33) prənɒː haj ciə srɛː croːt srɒː lɒŋ wa̤ h daːʔ tomorrow 1incl go fields harvest rice incl make.incision water srɛː lɒŋ rɜː taʔ pələj cətɒːŋ lɒŋ fields incl with pick.pluck fruit bean incl ‘Tomorrow I go to the fields to harvest rice, to regulate the water and to pick beans.’

The construction ‘both . . . and’ is realized by the word tɜŋ (‘including’) or by lɒŋ or by a combination of the two: (34) pɛ̤ːt niː ciaŋ tɜŋ cat lɒŋ ciaŋ tɜŋ deːr lɒŋ doctor prox able including inject incl able including sew incl ‘This doctor can both do injections and stitches.’

The negative construction is realized by juxtaposition or with tɜŋ (lɒŋ would be ungrammatical). (35) pɛ̤ːt niː mɜʔ ciaŋ tɜŋ cat mɜʔ ciaŋ tɜŋ deːr doctor prox neg able including inject neg able including sew ‘This doctor can neither do injections nor do stitches.’

850

bos and sidwell

4.2.2 Relative Clauses Kui has both marked and unmarked relative constructions. There follow two examples of unmarked relative clauses; note that the anaylsis of the first is ambiguous as one could treat kəntruːh baːr naʔ as a single NP object of kɜːt: (36) lɜŋ dɔŋ kɜːt kəntruːh baːr naʔ in house exist male two person ‘There are two men (who are) in the house.’ (37) kɜːt cuəh tiː baːr naʔ kuə ləŋ dɔŋ exist man tall.high two person stay in house ‘There are two tall men (who are) in the house.’ (38) kəː cə̤ː [rə̤ːŋ kəjaːk kəntɛːl pətah.kʰnia]rel 1sg believe story husband wife divorce ‘I believe the story of the man and woman who got divorced.’

The marker ləm introduces relative clauses that relativize past events. By employing a gap strategy both subjects and direct objects can be relativized, as exemplified below. (39) maj ʔaː ləm Ø kəmcɛːt ja̤ k bəːn 2 ref rel kill giant get ‘You who Ø managed to kill the giant.’ (40) ʔaːw niː ləm naːw hɛːk bəːn Ø shirt prox rel 3 tear get ‘This shirt that he managed to tear up Ø.’ (41) ʔaː ləm kuəj bɒn.srɒn mənia mənia ref rel person beg how.many how.many ‘That which the people had begged [during the past year].’

However, the marker ləm, which always refers to some activity in the past, has a wider use as a general linker, as seen in the following example sentences: (42) cuəh.tʰaːw ʔɛŋ cɛh.kɜʔ mɒ̤ h ləm ləm tiger ana.dist persistence ask rel rel ‘The tiger kept on asking what-he-asked-before.’

kui ntua

851

(43) maj ŋuɒc ləm kəː maj mɜʔ rəʔiək waːʔ 2 drink rel 1sg 2 neg disgust q ‘You drink [the water/beaker] of mine. You don’t mind?’ (44) kəː cɔ̤k pəkaːw ləm kuəj ʔəː 1sg smoke tobacco rel person emph ‘I smoke tobacco that people [had discarded].’

4.2.3 Complement Clauses A sentential complement clause functions as an argument of another clause. Sentential complements can be introduced with the complementizers paːj ‘say’ and ʔɒːn ‘give’. Different verbs use different complementizers. Examplesː (45) teːhiən ʔɒːn [beːʔ srɔk ʔar luɒŋ lɜŋ kɒh pɨːt]COMP soldier give person village saw wood in forest large ‘The soldiers allowed the villagers to cut wood in the jungle.’ (46) kəː kət [paːj daːʔ.kʰiəl haj sɔt me̤ːn.te̤ːn]COMP 1sg think say honey 1incl pure intens ‘I think that my honey is pure indeed.’ (47) trɔ̤ŋ [paːj kɜːt kaː trəpa̤ ŋ niː] COMP doubt say exist fish natural.pond prox ‘There are possibly/probably fish in this pond.’ (48) təhiən pəŋkʰɒm [ʔɒːn cuəh kreːŋ rɜ̤m] COMP soldier force give man old dance ‘The soldiers forced the old man to dance.’ (49) ja̤ ʔ to̤ ʔ piɲcia [ʔɒːn caw naːw ʔɜt daːʔ ʔɒːn] COMP woman dist command give grandchild 3 draw water ben ‘That woman over there commanded her grandchild to draw water for [her].’

4.2.4 Adverbial Clauses Adverbial clauses function as adjuncts to a full proposition, adding extra information to it. Adverbial clauses can express information such as time, location, manner and reason. Examples of several kinds are shown below.

852

bos and sidwell

Time (before, when) (50) [ɲo̤ ːŋ ciə prənaːk]ADV haj re̤ːp kaʔ sɛn before go town 1incl prepare money first ‘Before going to the town, I’ll first count my money.’ (51) [pɒ̤ h cɛːn]ADV bəːn kuəj naːw taʔ caː swaːj to̤ ʔ when ripe get person 3 pick eat mango dist ‘When [they are] ripe, people will pick and eat those mango’s over there.’

Location (where, right at) (52) naːw wuɒ rəntɒːl [ntrəm kuəj cəlɒk.daːʔ rɜː.ɲo̤ ːŋ]ADV 3 make bridge up.to person drown before ‘They make the bridge where that person had drowned.’ (53) naːw paɲ liːk [cɒm bɛh.doːŋ ʔaj]ADV 3 shoot pig exactly heart 3nhum ‘They shoot the pig right in its heart.’

Manner (like, according) (54) sɜh naːw səseːr lɜŋ luɒŋ [toːn krṳː suər]ADV disciple 3 write in wood follow teacher tell ‘The students wrote on the wood, as the teacher had instructed.’ (55) kɒːn haj ciaŋ cih triəʔ [jaŋ haj rɜː.dɜːm]ADV child 1incl able drive buffalo as 1incl in.the.past ‘Our children can ride a buffalo, just like us when we were young.’

Reason (because, because of) (56) [cɒw.rɜː naːw səʔiə bəːn]ADV bəːn naːw kʰɒm wuɒ.kaːr cənap because 3 want get get 3 make.effort labour strong ‘It’s because he wants to get [it] that he’s working so hard.’ (57) [cəbɒh maj mɜʔ wuɒ]ADV ʔɛŋ bəːn maj mɜʔ bəːn because 2 neg make ana.dist get 2 not get ‘It’s because you won’t do[it] that you now don’t have it.’

kui ntua

853

(58) maj bəːn [ʔaːsaːr.kɜʔ kəː ʔəː]ADV 2 get because.of 1sg emph ‘You got [it] because of me.’

Subtractive (except) (59) naːw ciaŋ cəmuŋ kuəj tɜŋ.mat kʰnia mɜʔ.kɜʔ beːʔ kan kaʔ 3 able speak Kui all.every together except person hold money ‘They can all speak Kui, except the person who handles the money.’

Conditional (‘if’, ‘if it were not for’) (60) prənɒː [bɜː mia cənap]ADV lic tuɒn həːj krənaː tomorrow if rain strong flood again nsit road ‘If it rains hard tomorrow the road will be flooded again.’ (61) sətrɜ̤p kəː [ɲcɒ̤ ʔ bəːn maj ʔɛːl]ADV ʔɒːn cəpɔ̤h ʔaː ʔɛŋ həːj shoes 1sg if.not get 2 take give discard ref ana.dist nsit ‘My shoes—if you hadn’t brought them—I would have counted them lostǃ’

4.2.5 Direct Quotations Direct and indirect speech are treated differently. Between the two types, as the reference point shifts, there are changes in pronouns, demonstratives and question words. Examples of various direct and indirect speech are compared below. Direct quotation (statement) (62) pɛːn suər paːj kəː mɜʔ haːn mṳːt baːŋ niː ʔəː Pen tell say 1sg neg dare enter stream prox emph ‘Pen said: “I don’t dare to enter this stream”.’

Indirect statement (63) pɛːn suər paːj naːw mɜʔ haːn mṳːt baːŋ ʔɛŋ ʔəː pn tell say 3 neg dare enter stream ana.dist emph ‘Pen says he doesn’t dare to enter that stream.’

854

bos and sidwell

Direct statement (content question) (64) map mɒ̤ h paːj ŋkuəj ciə kwial ntrɒ̤ k kəː pn ask say who go tend cow 1sg ‘Map asked: “Who will tend my cows?” ’

Indirect content question (65) map mɒ̤ h paːj ŋkuəj ciə kwial ntrɒ̤ k naːw pn ask say who go tend cow 3 ‘Map asked who will tend his cows.’

Direct quotation (polar question) (66) cuəh cʰɔn mɒ̤ h paːj kʰoːc blɒw ʔaj ciaŋ wuɒ dɛːl waːʔ addr pn ask say broken axle 3nhum able make also q ‘Mr. Chon asked: “The axle is broken. Can you fix that?” ’

Indirect polar question (67) cuəh cʰɔn naːw cɒw mɒ̤ h pɛːn tɜh.təbaj kʰoːc blɒw ʔaj man Chon 3 come ask Pen yesterday broken axle 3nhum naːw ciaŋ wuɒ dɛːl bɜː 3 able make also TAG ‘Mr. Chon came to ask Pen yesterday and wondered if Pen was able to fix it or not.’

Direct quotation (command) (68) pɛːn paːj ɲcɒ̤ ʔ kɛːl pɛ̤ʔ dəː Pen say proh play mud ptcl ‘Pen said: “Don’t play in the mud”.’

Indirect command (69) pɛːn suər kɒːn naːw ɲcɒ̤ ʔ ʔɒːn kɛːl pɛ̤ʔ Pen tell child 3 proh allow play mud ‘Pen tells his child not to play in the mud.’

855

kui ntua

4.3 Syntax and Pragmatics 4.3.1 Grammatically marked valence alterations 4.3.1.1 Causatives Both Morphological and Periphrastic causatives are possible. Verb

Causative affix + verb

Phrasal causative + verb

kəcɛːt (to.die) (intr) boːl (to.be.drunk) (intr) ŋuɒc (to.drink) (trans)

kəmcɛːt (to.kill) pəmboːl (to.intoxicate someone) *ŋuɒc

wuɒ ʔɒːn kəcɛːt (cause.to.die) wuɒ ʔɒːn boːl (cause.to.be.drunk) wuɒ ʔɒːn ŋuɒc (cause.to.drink)

Periphrastic causation of both transitive and intransitive verbs can be realized by involving a separate causative verbal phrase. Constructions with wuɒ ‘make’ follow the Khmer pattern, while constuctions based on resultatives are also possible. (70) haj pɒːʔ kəmpuŋ mia wuɒ ʔɒːn kɜːt hoːr 1incl walk prog rain make give exist cold.flu ‘I got a cold from walking in the rain.’ (71) cɒw.rɜː pɒːʔ kəmpuŋ mia bəːn haj kɜːt hoːr because walk prog rain get 1incl exist cold.flu ‘It’s from walking in the rain that I now have a cold.’’

4.3.1.2 Reciprocals and Reflexives The reciprocal kʰnia decreases the valence of the clause by one. The expression nɜ̤m.kʰnia is still somewhat reciprocal in that people start out to do a certain concerted effort. The reciprocal uses follow the verb (replacing NPobj) but the comitative uses follow the NPsubject. (72) kɜːt cɒː kap kʰnia exist dog bite recp ‘Dogs are biting each other.’

856

bos and sidwell

(73) kuəj naːw pə̤ːʔ kʰnia person 3 see recp ‘People are seeing each other.’ (74) naːw nɜ̤m.kʰnia tɜm cɒː 3 together strike dog ‘They [together] hit the dog.’

Analytical reflexives are realized by the words cak ʔɛːŋ (the first word meaning ‘body’). In these cases the reflexive replaces an NP argument. (75) mədɛːŋ ʔɛŋ tɜm cak ʔɛːŋ woman ana.dist strike body self ‘That woman is beating herself.’

An emphatic form is formed by the words hɒn ʔɛːŋ see the example below. In these cases the action of the verb is emphasized as undertaken solely by the argument. (76) naːw ciə pɜːj hɒn naːw ʔɛːŋ 3 go bathe by 3 self ‘They went to bathe solely by themselves (i.e. in their own strength).’

4.3.1.3 Non-verbal Clauses In this section predications that lack a semantically full lexical verb are described. Equative clauses Predicate nominals are formed by NP-NP juxtaposition: (77) naːw beːʔ tac 3 person sell ‘They are sellers.’

A verb is obligatorily used in the presence of tense, aspect and modality (TAM) markers. The default choice is the semantically weak verb wuɒ ‘make, do’. Compareː (78) cuəh ʔɛŋ krṳː kəː man ana.dist teacher 1sg ‘That man is my teacher.’

kui ntua

857

(79) cuəh ʔɛŋ təlɜ̤p wuɒ krṳː kəː man ana.dist exper make teacher 1sg ‘That man used to be my teacher.’

Attributive clauses Like equative clauses, predicate adjectives are also juxtaposed. The adjectival phrase, however, always occurs with modifiers, such as intensifiers or aspect markers. (80) daːʔ niː cəkɒːk me̤ːnte̤ːn water prox cold intens ‘This water is very cold.’ (81) swaːj to̤ ʔ cɛːn həːj mango dist ripe nsit ‘That mango is already ripe.’

The experiencial modifier haːw is used to intensify or emphasize an attribute. Here the word order is different: (82) daːʔ niː haːw cəkɒːk water prox intens cold ‘This water is so coldǃ’

Locative clauses Predicate locatives are formed using the locative word kuə ‘stay’ followed by a prepositional phrase. In the prepositional phrase the location word precedes the nominal. (83) liːk ʔɒw haj kuə lɜŋ truŋ pig father 1incl stay in cage ‘My father’s pigs are in the pen.’

Existential clauses Existential constructions are formed using the verb kɜːt ‘to.exist’. The following phrase orders are possible: (PP-kɜːt-NP) and (kɜːt-NP-PP). (84) kʰaːŋ tɨh brɒw mɜʔ kɜːt kɒh side nvis mountain neg exist forest ‘There is no forest on the other side of the mountain.’

858

bos and sidwell

(85) kɜːt kuəj baːr naʔ kuə lɜŋ dɔŋ exist person two CLF stay in house ‘There are two persons in the house.’

Possessive clauses Like the existential clauses, possessive constructions also use the verb kɜːt as a copular. In the case of inalienable possession the two noun phrases are juxtaposed. (86) haj kɜːt səpʰow sɒːŋ 1incl exist book five ‘I have five books.’ (87) mɛː haj kɒːn sɒːŋ mother 1incl child five ‘My mother has five children.’

4.4 Noun Phrases The basic order of possible NP constituents is captured in the formula below: NP: [ N (Adj) (Num/Quant (CLF)) (NPposs) (Dem) ] Between N and (Dem) only on of the [Adj, Num/Quant (CLF)), NPposs] can occur; no two of them can co-occur. Example NPs: Pronouns and proper names occur only as head of minimal NPs, that is, they are not normally modified by demonstratives, possessives or relative clauses. Pronouns, less commonly proper names, may occur with numeral expressions. (88) haj bəːn sac caː 1incl get meat eat ‘We obtained meat to eat.’ (89) pɛːn po̤ ːj tɛh kraːh kuə lɜŋ sraj pn chase beat.kill squirrel stay in plantation ‘Pen is chasing the squirrel in the plantation.’

859

kui ntua

Common noun as head (90) dɔŋ baːr to̤ ʔ house two dist ‘The two houses over there.’ (91) naːw tuəŋ cuəh.tʰaːw pɨːt ʔɛŋ 3 fear tiger large ana.dist ‘They are afraid of that big tiger.’

Compound noun as head (92) sɛːm.saːj haj kuə ʔaː srɔk relatives 1incl stay ref village ‘Our relatives are in the village.’

With the pronominal determiner as head (93) ʔaː blaːj ʔɛŋ kuə ləŋ tənɒːm luɒŋ ref white ana.dist stay in tree ‘The white one is in the tree.’

Sequential nouns in an NP are simply juxtaposed (94) tɜh.təbaj haj pə̤ːʔ liːk cɒː ŋiaw naːw pɒːʔ pəntəːr kʰnia yesterday 1incl see pig dog cat 3 walk in.row together ‘Yesterday I saw a pig, a dog and a cat walking in a row.’

Possession In a possessive NP the possessor follows the possessed item. (95a) dɔŋ kəː house 1sg ‘my house’

(b) ŋiaw maj (c) m̩ bɒh naːw cat 2 well 3 ‘your cat’ ‘their well’

NP possessors can be recursive: (96) ntrɒ̤ k pʰoːk.maːk krṳː sɛːm.krəpaj haj cow friend teacher sister.younger 1incl ‘My younger sister’s teacher’s friend’s cow.’

860

bos and sidwell

In specific situation when it involves sharing of things between people, the possessive particle hɒn is optionable. However, this particle is ungrammatical when it concerns inalienable objects. Compare the examples below. (97) a. klɒːŋ.kʰliː kəː b. klɒːŋ.kʰliː niː hɒn maj marbles 1sg marbles prox of 2 ‘My marbles’ ‘These are your marbles’ c. plɒː kəː d. plɒː hɒn kəː head 1sg head of 1sg ‘My head’ ‘My head’

There is another particle relating to possession, ʔaj, that refers to a possessor retrievable from the context. It can be translated with ‘it’ or ‘its’, glossed here as 3nhum (3rd person non-human). The generic third person Tpronoun naːw can replace ʔaj, but then it weakens the relationship between the two objects. The use of ʔaj for human being is impolite. (98) ntrɒ̤ k to̤ ʔ kɒːn ʔaj ruɒk nɔː cow dist child 3nhum beautiful tag ‘That cow—its calf is beautiful, isn’t it?’

5

Word Classes

5.1 Nouns The broad class of Nominals includes the most important components of NPs: the open class of Nouns and closed classes of Pronouns, Interrogatives, Quantifiers, Classifiers and Demonstatives and Determiners. One could go further and debate the status of Adjectives and Locatives as possibly more appropriately classed as nominals, but the question is not taken further here. Within nouns we can arbitrarily distinguish Common nouns and Proper nouns, with the latter being names of people, places, etc. but syntactically and morphologically there there is no basis to separate them. 5.1.1 Common Nouns Common nouns have a diversity of morphological forms; they can be simple mono- or disyllabic roots, prefixed and/or infixed forms, and counpounded forms are also common (see §3.1.2). There is no obligatory morphological

861

kui ntua

marking of nouns, and in some cases the same form may surface as noun, verb or attributive; e.g. mia ‘rain’ is morphologically unmarked as both ‘rain’ (the noun) and ‘to rain’ (the verb), with only the syntactic context signalling word class where this is recoverable. There are traces of historical processes of nouns derived from verbs by infixation, but this appears to have ceased to be productive long ago, and now compounding and borrowing appear to be the most important ways of forming new nouns, apparently under the heavy influence of Thai/Lao. 5.1.2 Pronouns and Question Words The free pronouns are tabled as follows: Person

Singular

1

kəː

2 3

sɔːl maj

Plural

haj

Comment

informal, used when speaking with younger audience or people having a similar age polite form, inclusive archaic; still used with in-laws and respected persons

muɒŋ naːw generic third person form ʔaj generic non-human possessed

For 1st and 2nd person there is no distinction for animate/inanimate. The generic 3rd person pronoun naːw is often inserted after the subject resumptively. An adverb like ‘yesterday’ can be inserted between the NP and naːw, e.g.: (99) pɛ̤ːt tɜh.təbaj naːw miː krədɒːj təj nɔːɲ həːj doctor yesterday 3 look finger hand boy nsit ‘Yesterday, the doctor already inspected the boy’s fingers.’

Additionally, there is a pair of referentials that can act as dummy nominal heads replacing nouns in discourse: ʔaː (masculine and generic subjects, ref) and miː (feminine subjects, ref.f). (100) maj səʔiə tac ʔaː nia ʔaː kiːʔ rɜː ʔaː pɨːt 2 want sell ref which ref small or ref large

862

bos and sidwell

ʔaː kan rɜː ʔaː trɔːh ref female or ref male ‘Which one do you want to sell? The small one or the big one, the female one or the male one?’ (101) miː blaːj ref.f white ‘That white one (fem.)’

These determiners are also used with names of people in the direct known context of hearer and speaker, in a informal way. Men talking together informally can even address each other by calling ʔaː. (102) ʔaː pɛːn ref pn ‘(Mr.) Pen’

The table below summarizes Kui question words and particles. q Word

Gloss

Position in the clause

waːʔ hɜːj rɜː mɜʔ bɜː rɜː

q nsit (new situation) or not tag question either/or

mənia ntuɒ cəntuɒ kɜːt ntuɒ nia jaŋ.nia pɒ̤ h.nia tɜh naː naː ŋkuəj

how much what what (broad sense) why which how when (future) when (past) where who

clause final clause final part of clause-final constituent clause final conjunction (between conjoined constituents e.g. [XP rəːXP]xp) in situ part of clause-final constituent part of clause-final constituent clause initial in situ in situ in situ in situ clause initial or final in situ

kui ntua

863

5.1.3 Measure Words and Quantity Words 5.1.3.1 Numerals Kui has a decimal numbering system. Cardinal numbers 1–10 are uniquely Kui. ‘Zero’ and numbers 11 and up follow the Khmer counting system. Kui cardinal numbers: mṳːj ‘one’ təpɜ̤t ‘six’ baːr ‘two’ təpo̤ ːl ‘seven’ paj ‘three’ təkuɒl ‘eight’ pɒːn ‘four’ təke̤ːh ‘nine’ sɒːŋ ‘five’ cɛt ‘ten’

When counting real objects (i.e. non-abstract counting), numbers two, three, four, and ten are often prefixed (with a phonologically reduced form of ‘one’, pə- or mə- depending on context), as follows: pəbaːr ‘two’; pəpaj ‘three’; pəpɒːn ‘four’; məcɛt ‘ten’. These forms cannot occur with a classifier. Here follow some examples of counting. Lower numbers precede the bases they modify, and the sequence descends from the largest to the smallest base. (Underscore where Khmer numbering is followed): 5 sɒːŋ (five) 15 pramtəndɒp 205 baːr-ruɒj sɒːŋ 3,132 paj-pɜ̤n mṳːj-ruɒj saːmsɜp-piː

Kui has one numeral interrogative mənia (how many/much). Some of its uses are exemplified below. The overall syntactic form isː [NP mənia {CLF/ countword}] (103) ʔajuʔ mənia kraj ʔɛːl kəjaːk age how.much should take husband ‘What’s the right age to take a husband?’ (104) maj re̤ːk kəndap mə-kʰaːŋ mənia 2 carry bunch one-side how.much ‘How many bunches do you carry on each side?’

864

bos and sidwell

5.1.3.2 Non-numeral Quantifiers Members of this subclass specify the general quantity of an entity without indicating the exact number or measure. The quantifier phrase has the formula QP: [N Quant (CLF)]. Depending on which classifier, mass nouns and/or count nouns are modified. In casual speech the boundaries for count/mass are not very strict. kəlɜŋ tɜŋ.mat mṳj mṳj rɜ̤l nɨːc nia nɨːc ŋkit tədiː kəmpɛ̤h səmboːr

‘many/much’ (count/mass) ‘all’ (count) ‘each’ (count) such as inː ‘naʔ muːj naʔ muːj’ (each person). ‘every’ (count) for things related to time, such as year, times etc. ‘some’ (count/mass) ‘some’ (count), ‘nia’ may be glossed ‘which’. ‘a little’ (mass) ‘half’ ‘abundant’ (count/mass) when actually seeing ‘abundant, plentiful’ (count/mass) when talking about it

5.2.3.3 The Quantifier Phrase The quantifier phrase generally follows the formulaː QPː [N Quant (CLF)]: (105) srɒː tədiː kəpɔŋ rice half tin ‘Half a can of rice.’ (106) toːŋ baːr taːj coconut two bunch ‘Two bunches of coconuts.’ (107) kɜːt səpʰo̤w kəlɜŋ tʰɨː exist book much kind ‘There are many kinds of books.’

5.1.3.4 Classifiers This class has the general schema of [NP Num (CLF)]. The classifier for ‘person’ is naʔ (a generic classifier for people of any status). For people the classifier is obligatory also when the NP is omitted. (108) cuəh baːr naʔ ʔɛŋ man two clf ana.dist ‘Those two men.’

865

kui ntua (109) baːr naʔ ʔɛŋ two clf ana.dist ‘Those two persons.’

Otherwise the classifiers are apparently optional. The subclasses below all fill the classifier slot in the general schema. Natural groups This subclass counts the number of natural groups that are formed by particular entities. pʰoːŋ ksaj taːj cəŋkɔːm tro̤ ːm

e.g. a flock of cattle, swarm of parrots, school of fish and even ants e.g. an extended ‘stream’ of marching ants; or for roads bunch of coconuts bunch, generic, for other fruits e.g. a large growing bunch of bamboo

Proportion measures These are based on body-part dimensions. Examples areː stia kɜ̤t lṳːk cɒŋ

from tip of the thumb to the tip of the index finger from finger tips to elbow from finger tips to armpit both arms stretched out wide

Capacity measures kəpɔŋ tənɒːm srəlaː klam

tin, as in a tin of rice, used as a standard measure a classifier for trees (primarily) used to count for pages, sheets and the like a generic classifier for objects and animals, also young children

Portion measures cəmriək

part (a part of anything that’s been e.g. cut up)

Temporal measures This subgroup is somewhat different, since these temporal measures do not modify a NP but rather an event.

866

bos and sidwell

mɔːŋ hour təŋaj day (of 24 hours) trəbɨː night (e.g. staying so many nights) crəlah morning (as a part of the day) ʔaːtət week ksaj month kəmɒː year

Abstract classifiers bɒːn tʰɨː lə̤ːk dɒːŋ cə̤ːŋ

for location model, kind event, issue event (focus on counting) event (focus on journey)

5.1.4 Demonstratives Demonstratives encode for three degrees of distance from the speaker: niː to̤ ʔ tɨh

‘this/these, (close, prox)’ ‘that/those, (distant within sight, dist)’ ‘that/those, (distant out of sight, nvis)’

Within discourse (where there is shared knowledge between the discourse participants) a set of three anaphoric demonstratives is used: nɛŋ ‘this/these’ (ana.prox) ʔɛŋ ‘that/those’ (ana.dist) ʔɛɲ ‘that/those ’ (with emphasis) (ana.emph)

In the NP demonstratives are phrase-final: (110) dɔŋ niː house prox ‘This house’ (111) kɒːn liːk to̤ ʔ child pig dist ‘Those piglets overthere’

kui ntua

867

(112) kɒːn liːk tɜŋ paj ʔɛŋ child pig including three ana.dist ‘All those three piglets’

When the time referred to is unspecified (in particular for future) the demonstrative kəj (‘then, that’) is used. It occurs with adverbs of time (‘that day’ etc.) but not with nouns. (113) re̤ːn cɒp pɒ̤ h kəj maj wuɒ ntuɒ learn finalized time then 2 do what ‘When you finish your studies, what are you going to do?’

5.2 Verbals Verbals in Kui are principally identified syntactically as those entities that fall within the Verbal Complex; it has the characteristics that it can be preceded by the negator mɜʔ, and followed by the post-verbal aspect marker completive/ nsit həːj: (neg) (Asp)/(preV) V* (postVAsp) neg: mɜʔ ‘not’ can negate the entire verbal complex Asp.: Aspect markers preV: Pre-Verb V: Verb postAsp: Post-verbal Aspect marker

5.2.1 Verbs An intransitive, transitive, ditransitive, or stative verb is the minimum constituent of the verb complex. Where one verb surfaces in a given clause, it is straightforwardly recognised as the main verb (even though the same form may do duty as pre-verb elsewhere, see further below). Examples: (114) haj kraj.kaːr taːk.ko̤ ːl təpɜ̤t sɔŋ 1incl need nails six pack ‘I need six packs of nails.’ (115)

caː dɒːj kəː ʔɒh təj we̤ːr mɜʔ kəwɛːn ʔəː eat rice 1sg scoop hand left neg able emph ‘When I eat, I cannot serve rice with my left hand.’ (Lit. ‘Eat rice, I scoop (with) left hand, (I) can not!’)

868

bos and sidwell

Main verbs can also be concatenated to form multi-verb predicates which have potentially ambiguous clausal structure, consider the following: (116) cam miː wuɒ saːk.miː sɛn wait look make try first ‘Wait and see, I’ll try it first.’

In the above example there is a sequence of four verbs, apparently forming at least two clauses, the first being a command to the hearer to “wait and look”. Is this two imperative clauses joined without a conjunction or a single unit? Either way it is clear that each of the verbs has similar status and there are no particular morphosyntactic clues to force an analysis. In the next part in which the speaker explains that he will try first, the sequence ‘make’ plus ‘try’ can similarly be treated as a sequence of main verbs along the lines of “I do it, I try it”. In the next example we see the sequence wuɒ bəːn ‘make get’ translated here as “(I) can fix (it)” refering to a punctured inner tube: (117) krɜ̤n.kɜʔ kʰoːc ruɒc kɒŋ haj wuɒ bəːn only broken tube bicycle 1incl make get ‘Only the inner tube is punctured. I can fix it.’

This shows very typical SEAsian areal semantics. The ‘make’ here has the broad sense of doing which in this case indexes the specific task in hand of fixing the tube. On the other hand ‘get’ indexes a resultative meaning, somewhat like the colloquial English “I’ll get it . . .” which indicates intention to do or finish a task. Again it is clear that both verbs have relatively equal status within the construction, there being no morphosyntactic clues that would suggest otherwise. So while below we adopt an analysis that recognises modal Pre-Verbs in addition to Main Verbs, this is really a provisional analysis, as it may it may be sufficient often to simply recognise multi-verb predication of ambiguous structure. 5.2.2 Pre-verbs In Kui there is a rich inventory of at least 20 pre-verbs that index modality in predication. The forms are all also potentially full verbs that communicate important information about intention, attitude, capacity, etc. to perform/ undergo the action/experience indexed by the immediately following main verb. Those forms recorded are presented below, classified according to agent versus speaker orientation:

869

kui ntua

Agent-oriented modality Abilitative Conative Permissive Desiderative Directive Obligative

kəwɛːn (able to – a kind of unlearned skill) ciaŋ (able to—acquired by learning) bəːn (‘to get’ → able to—circumstances allow it) saːk miː (to try) ʔɒːn (‘to give’ → to allow) səʔiə (to want) pəɲcia (to command) pəŋkʰɒm (to force) prɜː (to use) suər (to tell) tiar (to ask, beg) kraj (should) kraj kɜʔ (must) kraj kaːr (need to)

Speaker-oriented modality Dubitative Hearsay Non-factive Pretense

trɔ̤ŋ (possibility/probability) smaːn (to guess) sɒŋ paːj (hear-say) cə̤ː (to believe) kət (to think) wua taːŋ (to pretend)

Some examples: (118) haj səʔiə ʔɒːn sɛːm saːj pələj kroːc mɒːŋ mɜʔ kɜːt kəlɜŋ 1incl want give relatives fruit orange but neg exist much ‘We want to give oranges to my relatives, but we don’t have many.’ (119) haj kraj bəːn mɒːŋ pɒ̤ h naːw cɛːk me̤ːn.te̤ːn haj mɜʔ bəːn 1incl should get but when 3 divide indeed 1incl neg get ‘I should get one, but at the time when they will distribute [them] I won’t get [one].’ (120) haj mɜʔ ciaŋ cih kɒŋ ʔəː bɜː haj mɜʔ dɛːl re̤ːn 1incl neg able drive bicycle emph so 1incl neg ever learn ‘I cannot ride a bike, since I’ve never learned to.’

870

bos and sidwell

(121) prənɒː haj kraj.kɜʔ ciə pʰsaːr haj mat ntuɒ ntiː tac tomorrow 1incl must go market 1incl empty goods sell ‘Tomorrow we have to go to the market. We have no more goods to sell.’ (122) wuɒ.taːŋ ləŋe̤ːt sɛn ɲcɒ̤ ʔ ʔɒːn kɒːn naːw kəjaː pretend go.to.sleep first proh give child 3 call ‘[I] pretend to sleep, so that the children won’t call [me].’

5.2.3 Pre-verbal Aspect Markers In Kui there is a rich inventory of aspect markers that precede the pre-verb/ main-verb. Some of these are bi-syllabic, and in usage the second syllable may be dropped or in the case of mɜʔ tɜ̤n ‘not yet’ there is the option of a phonologically reduced monosyllablic form: ndə̤ːp kɜʔ təlɜ̤p

‘just’ completive; a recent change of state ‘ever’ perfect, habitual; a current relevant state brought about by the situation (normally an event) expressed by the verb mɜʔ dɛːl ‘never’ (irrealis) kəmpuŋ kɜʔ ‘continue’ progressive; an ongoing, dynamic process cɛh.kɜʔ ‘keep on’ progressive for durative verbs, iterative for inherently punctual verbs; the action requires force or energy. cʰɔp ‘stop, quit’ perfective; the end of a state or ongoing event kɒːp pədɜːm ‘start’ inceptive; the starting point of an event mɜʔ tɜ̤n (ntɜ̤n) ‘not yet’ (irrealis, in anticipation of happening though) nɒŋ (kɜʔ) ‘still’ progressive; in anticipation of ending bih (tɜŋ) ‘almost’ (irrealis) on the verge of doing etc. and in retrospect kəlɜŋ (kɜʔ) ‘always’ often/always tɜŋ ‘about to’ inchoative or progressive

Some examples: (123) tɜh.təbaj naːw tɜŋ ciə sraj mɒːŋ cɒw.rɜː naːw yesterday 3 about.to go plantation but because 3 ʔiː krən caɲ naːw mɜʔ bəːn ciə be.ill malaria 3 neg get go ‘Yesterday he was about to go to the plantation, but since he got malaria he wasn’t able to go.’

kui ntua

871

(124) maj dak trṳː hɜːj ndə̤ːp.kɜʔ re̤ːp 2 put k.o.fish.trap nsit just prepare ‘Have you set the fish trap already? [I have] just got it ready.’ (125) maj təlɜ̤p pɒːʔ ciə kɒh rəːj.rəːj poːc 2 exper walk direction forest frequent uproot

rej ʔɒːn kəː mṳːj tənɒːm rattan give 1sg one clf ‘You frequently go to the forest, (would you) take one stem of rattan for me?’

(126) haj ciə cʰlɔh mɜʔ.dɛːl paɲ bəːn jaŋ naːw 1incl go illuminate never shoot get as 3 ‘I have been hunting, but never got like they did.’ (127) naːw kəmpuŋ.kɜʔ ŋuɒc ntɜ̤n mat blɒŋ naːw ʔəː 3 prog drink not.yet empty wine 3 emph ‘They are still drinking. Their wine isn’t finished yet.’ (128) liːk to̤ ʔ mɜʔ sɒŋ bɜː tre̤ːŋ kəjaː naːw cɛh.kɜʔ ciə pig dist neg hear if owner call 3 cont go ‘That pig over there doesn’t hear. When its owner calls it always goes away.’

Kui makes use of a wide range of pre-verbal modals. Semantically a twofold distinction can be drawn between agent-oriented and speaker-oriented modality. Of special note, various of the modals are grammaticalised full verbs, in particular bəːn ‘to get’ which as a modal means ‘to be able to/to achieve’ and ʔɒːn ‘to give’ which comes to mean ‘allow’, very much as their equivalents in neighbouring language of the region. These are glossed consistently to reflect their principal verbal meanings and their model senses emerge in the modal constructions. 5.5.3 Post-verbal Aspect Marker (nsit) In addition to the pre-verbal aspect markers, the morpheme həːj can be placed at the end of the verbal complex. The meaning is then is completive, such that a new situation prevails, hense the glossing of həːj as nsit (New Situation, much like Khmer haǝj or Thai lɛ́ːw which are typically glossed as ‘already’ or ‘finished’). Həːj can be proceeded by bəːn ‘to get’ or ro̤ ːc ‘achieve’ (c.f. Khmer ruəc ‘to complete, achieve’) to indicate active achievement. Where the action is instantaneous bəːn is not used. Examples:

872

bos and sidwell

(129) haj wuɒ bəːn həːj 1incl make get nsit ‘I have done/completed it.’ (130) swaːj to̤ ʔ cɛːn həːj mango dist ripe nsit ‘That mango is already ripe.’ (131) haj pac pəŋko̤ ːl rəmbɒːŋ 1incl chop pole fence ‘I have logged fence poles already.’

bəːn həːj get nsit

The use of ro̤ ːc alone signifies an active achievement: (132) ksaj nɛŋ kuəj stṳːŋ ro̤ːc həːj month ana.prox person plant.rice achieve nsit ‘This month the people have completed the rice planting.’ (133) kaː kəː dɒːʔ lɜŋ truŋ ro̤ːc mat fish 1sg put in cage achieve empty ‘The fish that I kept in the cage have all escaped.’

5.2.8 Adjectival Verbs (~Adjectives) Kui has a class of verb-like Adjectival Verbs, referred to here simply as ‘Adjectives’ for convenience, which can fill the intransitive predicate slot. They can be distinguished from other verbs by the criteria that only they be intensified by adverbs like na̤ h ‘too (much etc.)’ and me̤ːn.te̤ːn ‘very’. (134) cəmṳŋ cənap na̤ h speak strong too ‘speaking too strongly’

In comparative constructions, adjectives, but not other verbs, are compared using lə̤ːh. (135) təwɛːŋ brɔː lə̤ːh blaːj black bad compar white ‘Black is worse than white.’

873

kui ntua

Adjectives in the corpus are given below sub-catergorised according to semantic fields, many in antonymic pair relation: Age təmaːj ‘new’ krənɛːn ‘young’

vs. vs.

tiə ‘old’ (inanimate) kreːŋ ‘old’ (animate)

vs. vs. vs. vs. vs.

kiːʔ ‘small’ tiap ‘small, low’ ŋ̩kɛh ‘short’ ntɒːr ‘shallow’ mat ‘empty’

Dimension pɨːt ‘large’ tiː ‘tall, high’ ntrɨːŋ ‘long’ ntruː ‘deep’ təbal ‘full’

Value cəʔɒːp ‘easy, well’ vs. ruɒk ‘good, beautiful’ vs. kraj ‘correct, right’ vs.

pəbaːk ‘hard, difficult’ brɔː ‘bad, ugly’ lo̤ ːh ‘false, wrong’

Physical property ŋ̩haːl ‘light weight’ kətaːw ‘hot’ cɛːn ‘ripe, cooked’ kɒŋ ‘hard’ trəweːk ‘dark’

vs. vs. vs. vs. vs.

Human propensity

ntɒŋ ‘heavy’ cəŋɛːt ‘cold’ hɒw ‘uncooked’ ləme̤ːn ‘soft’ bəraːŋ ‘light, clear’

Colour

cəciəh ‘stubborn’ blaːj ‘white’ cʰlaːt ‘clever’ təwɛːŋ ‘black’ mo̤ ːŋ ‘stupid/crazy’ kʰiəw ‘blue, green’ ʔɜːk ‘happy/grateful’ hlɨəŋ ‘yellow’ tuəŋ ‘scared’ krəhɒːm ‘red’ krɒː ‘poor’

874

bos and sidwell

A subset of adjectives can be reduplicated and/or pre-nasalized. In that way the characteristic becomes a generic one. (136) pɛ̤ːt cɒw cat ʔɒːn kɒːn ŋ̩kiːʔ ʔɛŋ doctor come stab give child small ana.dist *‘The doctor will come to give an injection to that small child.’

Similarly, relating to specific/non-specific: (137) haj cɒw toːr ʔaː pɨːt~pɨːt 1incl come buy ref large~red ‘I’ve come to buy the big one/those big ones.’ (138) haj cɒw toːr ʔaː m̩ bɨːt~m̩ bɨːt 1incl come buy ref large~red ‘I’ve come to buy big ones.’

5.3 Locational and Directional Words 5.3.1 Locative Prepositions Kui has a set of Locative Prepositions that fall withing the prepositional phrase according to the following the schema: [(Dem) P NP]. The Kui Locative Prepositions in the data are as follows: pɒːŋ tam pəne̤ːk təndiː kraːw

‘on’ ‘right’ ‘in front of’ ‘middle’ ‘outside’

lɜŋ we̤ːr krɔːj kədɒːp

‘in, on’ ‘left’ ‘behind’ ‘under’

5.3.2 Directional Verb Constructions The spatial grounding of an event is encoded in multi-verb constructions that include a verb of motion, giving information about the direction of the action. There may be two motion verbs in sequence, or if only one is a motion verb it will come first in the verbal sequence. Verbs of motion and illustrative examples follow: ciə ‘go’ sɒh ‘ascend’ mṳːt ‘enter’ kat ‘through’

cɒw ‘come’ sɛːŋ ‘descend’ lɒ̤ h ‘exit’

kui ntua

875

(139) cəlah sɒh lo̤ ːk sɛːŋ cɒw rɜː brɒw morning up monk descend go from mountain ‘In the morning the monk came down from the mountain.’ (140) krṳː pɒːʔ lɒ̤h cəlɒʔ rɜː kɒ:n.səh teacher walk exit angry with students ‘The teacher walked away, angry with the students.’ (141) tʰoː to̤ ʔ lo̤ːt mṳːt lɜŋ prɒ̤ ŋ spider dist jump enter in hole ‘That spider jumped into a hole.’

5.4 Conjunctions/Connective Words Conjunctions are discussed in section 4.2. 5.5 Particles 5.5.1 Negators Negation in Kui employs the pre-verbial particle mɜʔ, both for clausal and constituent negation. There is no derivational negation. Often negative statements (in response to a question, or contrary belief) are emphasized by the clausefinal particle ʔəː, whereas in telling others about one’s own actions/ideas the particle ʔəː is not used e.g. in a command sentence the weakening particle is dəː. A negative imperative is indicated with ɲcɒ̤ ʔ ‘don’t’ before the verb. (142) prənɒː kəː mɜʔ ciə pac ʔuːh tomorrow 1sg neg go chop firewood ‘Tomorrow I won’t go and log firewood [telling others about onesself].’ (143) prənɒː naːw mɜʔ ciə pac ʔuːh ʔəː tomorrow 3 neg go chop firewood emph ‘Tomorrow they won’t go and log firewood.’ (144) ɲcɒ̤ʔ pac ʔuːh dəː proh chop firewood ptcl ‘Don’t log firewood.’

To assert to the contrary one can respond with the single word te̤: ‘noǃ’, though this can be a single response, often it is followed by a full negative clause explaining why the initial clause was refuted. See dialogue below.

876

bos and sidwell

(145) A. maj ciə pʰsaːr waːʔ 2 go market q ‘Are you going to the market?’ B. te̤ː təŋaj niː kəː mɜʔ bəːn ciə ʔəː No! day prox 1sg neg get go emph ‘Noǃ Today I cannot go.’

5.5.2. The particle kɜʔ The particle kɜʔ (glossed as restr ‘restrictive’) has several uses and functions similar to the word tae ‘only’ in Khmer. 1)

It can mean ‘only’, as in the example below.

(146) naːw kɜːt kɒːn kɜʔ mṳːj 3 exist child restr one ‘They got only one child.’

2)

It functions as discourse marker in narrative, see short story below:

(147) kɜːt kuəj kəntruːh mənaʔ ʔɛːl kəntɛːl exist person male one.person take wife kɜʔ mɜʔ bəːn kaːr ʔəː krɜ̤n.kɜʔ sɛːn restr neg get marry emph only offer kɜʔ bəːn kɒːn mṳːj kɒːn krəpaj restr get child one child female ‘There was a man who took a wife, but they didn’t marry [properly], [they] only offered [to ancestors] and they got one child—a girl.’

3)

kɜʔ also occurs as as element in compound such as various Pre-verbal Aspect particles.

5.5.3 Politeness Particle na: Unlike Khmer, Kui society is not especially hierarchical, so it does not lexically encode social status. However, Kui has some particles to express politeness when addressing somebody, in particlar the clause-final particle na: softens the speech act, and dəː (discussed above) softens a command/request.

kui ntua

877

5.6 Adverbs Adverbs are an open class that occur in the sentence periphery, most commonly clause initially, to provide information about the manner of the time or manner of the action/experience. Manner adverbs can be formed by reduplication of the core adjectives. In a complement (non-verbal) construction the adverb cannot be reduplicated: (148) laːn kəː bəːk jɨːt~jɨːt car 1sg drive slow~red ‘My car is driving slowly.’ (149) naːw səʔiə sɛːn sŋat~sŋat kʰnak naːw 3 want offer silent~red alone 3 ‘He wants to offer (to the spirits) quietly by himself.’ (150) prənɒː naːw ciə pac ʔuːh tomorrow 3 go chop firewood ‘Tomorrow they go and log firewood.’

The temporal adverbs in the corpus include the following: rɜː dəːm ‘in ancient times’ kəmuŋ ‘(a) long (while) ago (non-specific)’ kəmɒː tɨh ‘last year’ tɜh ndro̤ ːn ‘five days ago’ tɜh ndrət ‘four days ago’ tɜh ndrəj ‘three days ago’ tɜh ndria ‘day before yesterday’ məmbaj ‘last night’ tɜh təbaj ‘yesterday’ tɜhdɜh ‘a minute ago’ rənɛŋ ‘at this moment’ rəŋaj ‘presently, today’ prənɒː ‘tomorrow’ ntria ‘day after tomorrow’ ntrəj ‘two days after tomorrow’ ntrət ‘three days after tomorrow’ ntro̤ ːn ‘four days after tomorrow’ kəmɒː ŋiar ‘next year

878

bos and sidwell

5.7 Expressives Kui is rich in expressives. They are adverbial in nature and add extra information (such as visual, auditory and sensory impressions) to the clause. They often come in pairs, in which the two words are phonetically similar. A few expressives are described below: 1) The verb ‘to walk’ plusː kəkədaʔ kəkəduŋ or pəpruŋ pəpraːw Both expressions mean ‘walk stomping loudly’. The first word in each pair cannot be used on its own. Independently the words kəduŋ and praːw mean ‘hard sound’. 2) The verb ‘to be tired’ plusː ləhət ləhaj The expression means that a person is very tired and without strength. The pair of words cannot be used on its own, and the individual words in the pair have no meaning. 3) The physical nature of a surfaceː rəkɔp rəkəːl or rəkaʔ rəkɔp The expression describes an uneven surface, with high and low spots. Individuals words in these pairs are never used on their own. 4) The verb ‘to shiver’ plusː pəpraʔ pəprəːk The expression means to shiver or shake intensely/uncontrollably. 5) The verb ‘to shout/cry’ plusː kuɒŋ rəmpuɒŋ When shouting is heard from afar. The word rəmpuɒŋ means ‘from a distance’. 6

Glossed Text

Part of traditional story ‘The lazy cicadae’, recorded in Cambodia 2012. kɜːt teːr mṳːj pʰoːŋ naːw kʰɒm kɜʔ criəŋ səbaːj rɜ̤l təŋaj exist cicada one swarm 3 make.effort restr sing happy every day ‘There was a swarm of cicadae, all they did was singing and being happy every day.’ mɜʔ.paːj sədaːw mɜʔ.paːj təŋaj naːw mɜʔ bəːn pəːr mɜ̤r caː ʔəː though night though day 3 neg get look.for thing eat emph ‘Whether day or night, they never went to look for food.’

kui ntua

879

kɜʔ naːw pja̤ h cənap me̤ːnte̤ːn restr 3 hungry strong intens ‘and they were very hungry.’ pe̤ːl cəlah sɒh me̤ː teːr naːw kəjaː kɒːn.caw naːw when morning up chief cicadae 3 call subject 3 tɜŋ.mat ʔɒːn cɒw kʰnia kɜʔ naːw suər paːj cṳm all give come together restr 3 tell say gather ‘The next morning the cicada chief called all his subjects to come together, and he saidː’ təŋaj niː haj nɜ̤m.kʰnia ciə tiar sədac səmoːc mɜ̤r caː day prox 1incl do.together go beg king ant thing eat tuəŋ.krɛːŋ naːw cədaːl might 3 pity ‘Today we will go and beg the ant king for food in the hope that he will pity us.’ kɜʔ naːw nɜ̤m.kʰnia ciə restr 3 together go ‘and they all went.’ pe̤ːl naːw pɒːʔ toːn krənaː kɜʔ naːw prəta̤ h rɜː ciəŋ pɨːt mṳːj when 3 walk along road restr 3 come.upon with elephant big one ‘When they were walking along the road, they met a big elephant.’ naːw mɒ̤ h ciəŋ ʔɛŋ paːj 3 ask elephant ana.dist say ‘They asked the elephant:’ naː jaːj maj səkɜ̤l dɔŋ sədac səmoːc dɛːl waːʔ where older.sibling 2sg know house king ant also q ‘Brother, would you know where the ant king lives?’ ciəŋ ʔɛŋ cʰlɜːj paːj ʔɛŋ kuə ʔaː pəne̤ːk pɒːʔ elephant ana.dist answer say ana.dist stay ref in.front walk

880

bos and sidwell

ŋkit tuɒn tɒk həːj little more arrive nsit ‘The elephant answered: “There, in front of you, walk on a little more and you’ll be there.” ’

7 Bibliography Bos, K.J., G.M. Bos, and C.J. Page. 2008 Community based orthography development: Experiences from the Kui in Cambodia. www.seameo.org. Johnston, Richard. 1969. Kuy Basic Word List. Austroasiatic Studies 3:1–4. Mann, N., & Markowski, L. 2004. A rapid appraisal survey of Kuy dialects spoken in Cambodia. Chiang Mai: Payap University. Markowski, Linda M. 2005 A comparative study of Kui varieties in Cambodia. MA thesis. Chiang Mai, Thailand: Payap University. Miller, J. and Miller, C. 1995, Notes on phonology and orthography in several Katuic Austroasiatic groups in northeast Thailand. Austroasiatic Studies 24:27–51. Preecha Sukgasame. 1993. Correlates of the Register Complex in Kuay. Austroasiatic Studies Journal, 22:245–251. ———. 2005, Phonological variations and changes due to language contact: A case study of consonants in four Kuai-Kui (Suai) dialects. Austroasiatic Studies 35.37–54. Sidwell, Paul. 2005. The Katuic Languages. Munich, Lincom Europa. Somsonge Burusphat. 1989–1990. The Functions of kʌʔ in Oral Kui Narrative. Austroasiatic Studies Journal, 18–19:223–231. ———. 1993. Kui Narrative Repetition. Austroasiatic Studies Journal, 22:149–162. Theraphan L. Thongkum. 1986. An Acoustic Study of the Register Complex in Kui (Suai). Austroasiatic Studies, 15:1–20. Van der haak, Feikje and Brigitte Woykos. 1987–1988. Kui Dialect Survey in Surin and Sisaket. Austroasiatic Studies 16–17: 109–142.

chapter 12

Pacoh Mark J. Alves 1 Background The Pacoh people are a hill tribe that lives primarily in the mountainous regions of Central Vietnam in Binh Tri Thien and Quang Tri provinces and bordering areas in Laos. The name ‘Pacoh’ appears related to the Pacoh word ko̰ h ‘mountain’. The Pacoh language belongs to the Katuic sub-branch of the Austroasiatic language family, making it related to Bru, Katu, Taoih, and Ngeq, among others. Sidwell (2005) places Pacoh in its own sub-group within Katuic, thus making Pacoh not particularly close to any other Katuic language. Pacoh is not officially recognized in Vietnam as a language and is essentially geographically grouped with the Taoih language on official state maps of ethnic minorities despite the fact that there is little mutual intelligibility due to significant lexical and phonological differences (cf. the 1986 VietnamesePacoh-Taoih dictionary). The one existing Vietnamese-Pacoh-Taoih dictionary makes it very clear that while Pacoh and Taoih are related, in the same way that they are both to other Katuic languages, these are indeed two distinct varieties. Further complicating these numbers is a lowland variety of Pacoh, Pahi, of which there are no existing linguistic descriptions. This blurring of groups makes it difficult to determine the population with certainty. Published numbers range from 10,000 to 30,000 (the Ethnologue shows 29,200), but there is not always clear indication of which specific ethnic or linguistic groups these belong to. Despite the relatively small number of speakers of Pacoh and the somewhat diverse linguistic situation, Pacoh remains the first language of the majority of Pacoh. Pacoh interact with Vietnamese as well as other Katuic speakers, such as Bru in Quang Tri and Taoih in Thua Thien province. There is some lexical variation according to the languages of contact. For instance, speakers in Quang Tri may also use the word teː ‘that’, likely a Bru word. Like other hill tribes in the area, the Pacoh do not have a single main leader but rather are grouped by villages and clans. Pacoh society is largely egalitarian, without classes, though some Pacoh are recognized as more capable or are able to accumulate somewhat more wealth than others. They generally hunt and trap and practice swidden agriculture and horticulture. The Pacoh either

© koninklijke brill nv, leiden, ��5 | doi ��.��63/9789004283572_018

882

alves

share their products within their smaller groups or trade at large markets. The Pacoh historically have a less aggressive reputation than some of their Katuic neighbors. They marry primarily within the Pacoh tribe, perhaps a factor in the Pacoh language sustainability. Their traditional homes are longhouses and raised houses on stilts. Traditionally, they practice weaving and make wooden goods, such as crossbows, to barter. Today, while some Pacoh still wear traditional clothing, many have at least some modern clothing, such as tee-shirts among children or buttoned shirts with collars among adult males. Pacoh women in rural areas tend to wear traditional wrapped clothing. Literacy rates among the Pacoh are low, though some young Pacoh do attend schools, where they receive instruction in Vietnamese. Orthographies were developed for the Pacoh in the second half of the 20th century. Richard and Saundra Watson developed one system in the early 1960s which is used in their numerous publications. There is another similar system attributed to a Vietnamese creator, as described and used in Nguyễn et al. (1986: 11–12). The systems have not been adopted by the Pacoh, and Pacoh essentially remains an unwritten language. The primary linguistic descriptions are those of Saundra Watson (1964, 1966, 1976, and co-authored work with Richard Watson), Richard Watson (1964, 1966a, etc.), and Mark Alves (2000, 2001, etc.). The primary description of Pacoh culture is that of Mole (1970: 101–139). 2 Phonetics/Phonology Pacoh is typologically representative of Southeast Asian Austroasiatic languages in that it has sesquisyllabism, five places of articulation of consonants, and nine basic vowel phonemes which are then distinguished by length and phonation. It is distinctive in having post-glottalized off-glides and in having some distinctive vowels in presyllables. 2.1 Word/Syllable Structure Phonological words (in contrast with compounds) in Pacoh are either monosyllabic or disyllabic, in which case the presyllables are unstressed. The minimum form of a word is open CV: or closed CVC, though open single syllable words are much less common. The maximal shape is C1V1C2.C3C4V2ːC5. The classes of phonemes that can occur in each segment are summarized below. Overall, the largest number of phonemes occurs in main syllables, while presyllables have some restrictions. Unlike many Austroasiatic languages, which tend to have a single neutral vowel in presyllables as part of the sesquisyllabic phenomenon, open presyllables (i.e., those without final consonants) may

883

pacoh

have distinct vowels /a/, /i/, and /u/, though those with closed syllables generally have only /ə/. Voiced and aspirated stops are restricted to syllable-initial position. Segment

Constraints

C1 (presyllable initial)

any consonant (except post-glottalized glides or aspirated stops) C2 (presyllable coda) only nasals or liquids C3 (main syllable initial) any consonant (except post-glottalized glides) C4 (second in main syllable initial /r/ and /l/ cluster) all consonants except voiced or aspirated stops C5 (main syllable coda) V1 (presyllable) /i/, /a/, or /u/ in open presyllables, schwa in closed syllables, short, and no phonation V2 (main syllable) any vowel

Presyllables may also be sonorants (nasals and liquids) alone without vowels. The number of phonemes that occur in any of the four consonant slots varies. The largest number of consonants can occur in onsets of stressed syllables. Syllable Shape

Sample

CVː CVC CCVC CC.CVC CV.CVːC CVC.CCVC

pɛː ‘three’ ŋɛʔ ‘all’ klɨŋ ‘many’ ʔŋkoh ‘that’ kidiːt ‘poor’ kəntro̰ ʔ ‘a downpour of rain’

2.2 Phoneme Inventory As mentioned, the Pacoh phoneme inventory is representative of an Austroasiatic language. It has five places of articulation, including palatals and a glottal stop which is distinctive between syllables. Stops have the full range of place of articulation, while other manners of articulation have smaller numbers.

884

alves

The table below uses hyphens to show which consonants occur only as initials, only as finals, or in either position. -ppʰb-m-

-w-wʔ

-ttʰd-n-r-l-

-cɟ-ɲʃ-/-ç

-kkʰ-ŋ-

-ʔ-

-h-

-j-jʔ

The Pacoh vowels show length distinction. In addition, mid vowels and diphthongs also have a contrast in register. The marked phonation type is [+RTR] (retracted position of the tongue root), which is associated with tensed phonation. The full range of these vowels can occur in main syllables, while presyllables are restricted to /i/, /a/, and /u/. Table 12.1 Pacoh vowels in main syllables Front

Central

Back

High Mid [-RTR] Mid [+RTR] Low

i / iː e / eː ḛ / ḛː ɛ / ɛː

ɨ / ɨː ə / əː ə̰ / ə̰ː a / aː

u / uː o / oː o̰ / o̰ ː ɔ / ɔː

-RTR +RTR

iə ḭə

ɨə ɨ̰ə

uə ṵə

Clusters in Pacoh main syllables include those in the following table. These are only voiceless, unaspirated stops that co-occur with the liquids /l/ or /r/. Aspirated consonants are treated as single consonants and not clusters.

885

pacoh Table 12.2 Pacoh consonant clusters Cluster

Example

Gloss

[kl] [kr] [pl] [pr] [tr]

klɔː kruːm ploh prɛːŋ truː

‘oyster’ ‘thunder’ ‘to ask’ ‘dry’ ‘deep’

3

Word Formation

Pacoh word-formation involves compounding, affixation (prefixes and infixes), and reduplication. These are summarized below. Affixation and reduplication, while not as phonologically systematic, are actually much more common in word-formation in Pacoh than compounding and show much larger numbers of lexical entries in the Pacoh lexicon. 3.1 Compounding Compounding consists of either left-branching compounds or dual head compounds. While this can be syntactic with predictable semantics, compounding also results in words of arbitrary meanings. In left-branching compounds, the dependent modifies the head, as for example daːʔ-mat (water-eye) ‘tears’. Dualhead compounds are often used to express generalized semantic classes and have meanings beyond the individual lexical components. Examples include duŋ-vḛːl (house-village) ‘neighborhood/society’ and ʔaʔi-ʔaʔam (motherfather) ‘parents’. 3.2 Derivational Affixes Affixes in Pacoh include both prefixes and infixes, but not suffixes. Affixes in Pacoh derive nouns, active verbs, and stative verbs. These are largely fossilized in words, but some used to derive nouns and verbs are still recognizable enough to speakers to the extent that new forms can be created. 3.2.1 Deriving Nouns A common means of deriving nouns from verbs is an infix with the general shape or [-an-], though it has numerous allomorphs which often involve assimilation

886

alves

to the following consonant. The following samples show the verb base and derived noun forms. Table 12.3 Nominalized verbs Verb

Noun

katɨp ‘to cork’ tapaʔ ‘to make fish sauce’ taŋɨh ‘to breathe’ kar ‘to drill a hole’

kəntɨp ‘cork’ təmpaʔ ‘fish sauce’ tərŋɨh ‘breath’ kanaːr ‘hole-driller’

While not having an obvious derivational relationship, most Pacoh kinship terms and a noticeable number of words for animals appear to have an ʔa- prefix. This prefix is attached to indigenous kinship terms as well as to Vietnamese loans such as ʔa-ʔɛːm ‘younger sibling’. Other examples include ʔa-miəŋ ‘brother, older’, ʔa-ʔam ‘father’, ʔa-kaːʔ ‘grandmother’, ʔa-ʔiː ‘mother’, ʔa-ʔɛːm ‘sibling, younger’, and ʔa-mo̰ ːʔ ‘sister, older’, among others. Animal names with the ʔa- prefix are, among others, ʔa-ʔət ‘animal’, ʔa-cḛːʔ ‘bird’, ʔa-mḛːʔ ‘cat’, ʔa-cɔː ‘dog’, ʔa-taː ‘duck’, ʔa-ciəŋ ‘elephant’, ʔa-bil ‘mouse’. 3.2.2 Deriving Verbs Pacoh has several affixes that derive a variety of verbs from other verbs. They derive variously intransitive verbs, transitive verbs, and stative verbs all of which have a variety of semantic functions, including causation, completion, pretense, involuntary action, generalizing, and reciprocity. There are various allomorphs of these base morphemes. All samples in Table 12.4 are taken from S. Watson (1966). 3.3 Reduplication Reduplication in Pacoh includes both full, unaltered reduplication and alternating reduplication in which segments of the base are copied while others alternate. Reduplication has some degree of productivity in Pacoh. In Watson’s 1979 dictionary of 10,000 entries, approximately 1,200 are reduplicated forms. However, it is still primarily seen in lexicalized forms, and the process is not productive. Full reduplication occurs but is relatively rarely used compared to base alternating reduplication.

887

pacoh Table 12.4 Pacoh verbal affixes Affix

Source Form

[pa-] verb sər ‘go up’ [ta-] verb pḛɲ ‘shoot’ [pi-] stative verb cɛːn ‘clean’ [par-] transitive verb cəw ‘argue’ [ti-]

transitive verb hɛʔ ‘tear’ [Ca-] transitive verb kʰɔ ‘roast’ [tar-] transitive verb ʔacuən ‘hit’ [CV-] intrans. Verb poːk ‘walk’

Derived Form

Derived Form Meaning

causative verb ‘to cause to X’ pasər ‘raise’ involuntary verb ‘to X involuntarily’ tapḛɲ ‘shoot accidentally’ causative verb ‘to cause to be X’ picɛːn ‘clean’ ‘to cause each other causative reciprocal verb to X’ parcəw ‘cause each other to argue’ stative verb ‘to be X-ed’ tihɛʔ ‘torn’ intransitive verb ‘to do X completely’ kakʰɔ ‘roast completely’ reciprocal verb ‘to X each other’ ʔrcuən ‘hit’ intransitive verb ‘to X in general’ papoːk ‘walk around generally’

Phonological Constraints

both 1+2 syll. 1-syll. only 1-syll. only 1-syll. only

1-syll. only 1-syll. only both 1+2-syll. 1-syll. only

The term ‘partial reduplication’ is here reserved for copying of only one syllable of disyllabic words, such as kəntɨʔtɨʔ ‘sometimes’ and tərkitkit ‘be close to’. Instead, the more common alternating reduplication may involve both copying and alternating parts of syllables. Category

Template

Examples

Repetitive (no alternation) Rhyming (onset alternation) Ablauted (vowel alternation) Alliterative (rhyme alternation)

C1V1C2-C1V1C2 C1V1C2-C3V1C2 C1V1C2-C1V2C2 C1V1C2-C1V2C3

Onset-Vowel (coda alternation)

C1V1C2-C1V1C3

ʃə̰ː-ʃə̰ː ‘pleasant’ to̰ k-vo̰ k ‘endless amount’ puːc-paːc ‘flutter’ kʰaŋ-kʰɛr ‘push rice into mouth continuously’ krəːŋ-krəːw ‘property’

888

alves

In addition to reduplication of single syllables, Pacoh reduplication also involves copying combined with additional elements, such as an initial consonant or a prefix, the template-plus-affix category. Moreover, parts of sentences may be reduplicated, essentially phrasal reduplication. Category

Form

Gloss

Template-plus-Affix

papɨːt vḭəl-ʔivḭəl taʔ-ʔmbiʔ-ʔibiʔ ŋaːj-taʔ-prṵəʔ-taʔ-təmpaʔ they-do-work-do-work

‘big (of a group of things)’ ‘full of twists and bends’ ‘to pretend to sleep’ ‘They’re working.’

Clause-Incorporation

Alternating reduplicants do not always have identifiable roots. They fall into a number of semantic categories. Some refer to vivid adjectives or verbs (e.g., tɨŋpɨŋ ‘bushy’, seɲ-sel ‘to cackle’, etc.), some refer to general categories (e.g., ʔasəʔʔasḛːɲ (rice-red) ‘rice (in general)’, sec-seː (meat-red) ‘meat (in general)’, etc.), and others. Such reduplicants can also be syntactically separated with the sentence material copied. 4

Phrase and Clause Structure

Pacoh basic sentence structure is AVP. However, it is also a topic-comment language, allowing for fronting of sentence elements. Passivity can only be expressed indirectly through semantic means, such as topic fronting; there is no morpho-syntactic way of marking the passive voice. Moreover, subjects can be omitted entirely or replaced by subject prefixes on verbs. While the translation of the following two sentences suggests that the first one has an instrumental interpretation and the second a locative one, this is a more semantic role than grammatical one. There is no morphology or preposition to indicate specific semantico-syntactic roles. Rather, these are topics, and the relationships between them and the clauses they precede, such as instrument and location as in these examples, are contextually interpreted. (1) ʔalɔːŋ kiː ʔi-taʔ tino̰ ːl. wood medl nref-do pole ‘As for that wood, one makes poles.’ OR ‘With that wood, we make the poles.’

pacoh

889

(2) kərnaː ʔnnɛh ʔi-poːk ʔɔː lɨː. road prox nref-go good very ‘As for this road, one goes well.’ OR ‘This road is very good to go on.’

Aspect, interrogative, and mood can be expressed by a variety of sentence final particles. In addition to word choice and word order, prepositions, relator nouns, and conjunctive words express inter- and intra-clausal and phrasal relationships. Juxtaposition can also be used for some types of coordination and comparison. Noun phrase structure is largely left-branching, though numeral and measure words precede nouns, but modifiers, possessives, and demonstratives follow nouns. It is largely an isolating language with some derivational morphology. However, while word order encodes grammatical relations, Pacoh also has a system of pronouns which are marked for dative and genitive case. 4.1 Simple Sentences (Word Order, Questions, Commands) In basic clauses, subjects always precede verbs, and while objects generally follow verbs, they may be fronted. See §5.4 for more description of topiccomment constructions. (3) kɨː pacɔːm ʔamaj kaŋ ʔaɲ. 1sg teach dat.2sg language English ‘I teach you English.’

Body parts can occur as direct objects of verbs of sensations: (4) dɔː ʔarlɔːh ʔatiː ʔaviə̰r. 3sg hurt hand left ‘His left hand was hurt.’

Yes-no questions are most often constructed by the use of the sentence-final particle. See §4.6 for more discussion of sentence-final interrogative particles. (5) tikuəj pako̰ h viː klɨŋ jaːʔ ləʔj people Pacoh exist many family.name q ‘Do the Pacoh people have many family names?’

Prefix Subjects: Pacoh also has two non-referential or impersonal subject prefixes ʔu- and ʔi-. In the following sentence, the general subject is indicated with the ʔi- prefix, and the conditional clause has no verb. They can be used for general statements, though they also occur after certain verbs, such as after

890

alves

the abilitative hoːj below. The full range of contexts in which these prefixes are used has not yet been fully explored. (6) nam pṵən naʔ tikuəj ʔi-taʔ pɛː kərlɔh. if four clf person nref-do three floor.layer ‘If there are four people, they make three karloh floor layer sections.’ (7) ʔn.kar ʔa.pən hoːj ʔi-sɨp. skin bear able nref-wear ‘As for bearskin, one can wear it.’

Copula: In copula constructions, the use of the copula verb laː, a likely Vietnamese loanword, is optional. Indeed, it is more like a topic-marker. (8) hɛː (laː) tikuəj pako̰ h. 3pl be person Pacoh ‘They are Pacoh people.’ (9) ʔnnɛh ʔalimɔː tiən. prox how.much money ‘How much does this cost?’ (10) məh pərlaːç ʔən peɲ (laː) kraj lɨː. one shot 3sg shoot be accurate very ‘One shot, he shoots accurately.’

Impersonal/Existential Sentences: Existential sentences can be expressed with the verb viː ‘exist’. Usually, the subject follows the verb: (11) viː ʔeː tikuəj viət. exist many people Vietnamese ‘There are many Vietnamese people.’

The noun negator ih ‘not be (the case that)’ can also have scope over the whole clause or sentence. (12) ʔih viː ʔamo̰ ːʔ ʔŋkoh poːk.boː bar ʔiŋaj neg.be exist aunt medl walk two day ‘It’s not the case that she walked for two days.’

891

pacoh

Negation: While verbs are negated with the preverbal word ləjʔ, noun predicates and complete clauses (as in the preceding sentence) are negated with ʔih. (13) ʔrviət ʔnnɛh ʔih ʔmmaj pen prox neg.be 3pl.poss ‘This is not your pen.’

Interrogative Words: Wh-words may be fronted, though speakers sometimes keep the in-situ position. The factors that can cause this involve the original position as determined by the grammar of the language, the tendency towards topic-comment constructions, contact with Vietnamese, which maintains a consistently in-situ manner (for instance, object interrogative pronouns only occur after verbs and cannot be fronted in questions), as well as universal tendencies in human language. See §5.1 for more discussion on types of pronouns and their other functions. (14) ʔa.məh maj pa.piː what 2sg say ‘What are you saying?’ (15) ʔalɨŋ ʔnnaw maj with who 2sg ‘With whom did you go?’

poːk go

(16) duŋ ʔnnaw tarnəh lɨː house who old very ‘Whose house is old/older?’

4.2 Complex Sentences Clauses in Pacoh can be juxtaposed without marking, linked in serial verb constructions, or connected with conjunctive words. (17) hɛː tartaʔ viː ʔaʔɛːm kiː ʔɔː lɨː bəh 3pl recp.do exist younger.sibling medl good very asrt ‘They fought with each other because of that fine looking one.’ (18) ʔi-biʔ kiː biʔ pat dɛːn nref-sleep then sleep put.out lamp ‘When you sleep, put out the lamp.’

892

alves

(19) ʔrbaŋ bɔː mat lɨː sky rain cool very ‘When it rains, it’s very cool.’ (20) tikuəj ʔən hoːj lɨː taʔ tumiəŋ person rel able very do crossbow ‘A person who is very skilled at making crossbows.’

Serial Verb Constructions: Serial verb constructions involve two matrix verbs without conjunctions or other intervening lexical elements. They tend to express sequences of actions or purpose. (21) kinoː kɨː poːk mamɔjʔ boːn pḛɲ kunɛː yesterday 1sg go hunt get shoot deer ‘Yesterday, I went hunting and was able to shoot a deer.’ (22) pḛːh kənnoh koh sḛːc buəjʔ take cutter cut meat fish ‘Take the cutter and cut the meat of the fish.’

Purposive Constructions: Purposive actions can be expressed lexically, such as with the word /ɟo̰ ːn/ ‘give’, which also expresses the benefactive as in the first example, or through serial verb constructions, as described above. (23) pḛːh cɛːn ɟo̰ ːn kɨː ŋɔjʔ daːʔ take glass give 1sg drink water ‘Get a glass for me to drink water.’ (24) kɨː taʔ ɟo̰ ːn boːn caː 1sg do give get eat ‘I work in order to be able to eat.’

Conditional Sentences: Conditional sentences can be explicitly marked with the clause-initial word nam ‘if’, but condition can also be expressed via juxtaposition without overt use of words, in which case, context is key. (25) nam ʔarbaŋ bɔːh kɨː ləjʔ boːn cho̰ ː tuː duŋ cond weather rain 1sg neg get return to home ‘If it’s raining, I can’t go home.’

pacoh

893

In the following sentence, both the subject and conditional meaning are recoverable from the context of the utterance. (26) ʔaʔaj palluŋ lɨː kaʔḛh pɨːt lɨː sick stomach really defecate much very ‘If one is really sick to one’s stomach, one really gets the runs.’

Comparison: Comparison can be explicitly expressed with tilət ‘more than’, but it is also possible to express comparison simply by juxtaposition and context. (27) lam ʔnnɛh ʔɔː lɨː tilət dɔː ʔntih clf prox good very compar that dist ‘This one is much better than that one over there.’ (28) hammɔːŋ kidit lɨː juə̰n Hmong poor very Vietnamese ‘The Hmong are poorer than the Vietnamese.’

4.3 Syntax and Pragmatics (Topic-Comment, etc.) While Pacoh is an AVP language, it has a number of contexts in which subjects can be dropped and in which non-subject elements can be fronted. Zerosubjects are common in direct commands and in various situations in which the subject is recoverable from the context. (29) pḛːh daːʔ ɟo̰ ːn kɨː ŋɔjʔ take water give 1sg drink ‘Get water for me to drink.’

This is also often true for lower clauses in which the subject of the first, upper clause is the same as in the lower clause. While this could be construed as a serial verb construction, the lower clause in these subordinate constructions can take subjects in the lower, dependent clause. (30) maj kucet vaːç ʔarəʔ pinah 2sg die become like ghost ‘When you die, you become a ghost.’

Topicalization is common in Pacoh. New, highlighted information may be upfront in sentences. In the following two-clause sentence, the topic is ‘traps’,

894

alves

which is the first elements in both sentences. However, the trap is the semantic agent in the first sentence, but the patient in the second one (note the subjectmarking [ʔi-] prefix/preclitic on the verb). Thus, while the grammatical roles differ, the central pragmatic role as topics is maintained by the word order. (31) kɨp cuŋʔ ʔacuət ʔabil maː siŋ nɨːm ʔi-taʔ palloː ʔalɔːŋ trap also catch mouse but trap only nref-do tube wood ‘The kup trap catches mice, but as for the sing trap, they only make it of a wooden tube.’

4.4 Noun Phrases Pacoh is a classifier language. In a noun phrase, a classifier or measure word is the syntactic head, being the only essential word, while the measure word’s following noun complement is the semantic head but syntactically optional in contexts in which the noun is understood. The following is a diagram of the noun phrase in Pacoh. QUANTITY + MEASURE + NOUN + MODIFIER + POSSESSIVE + DETERMINER Such a long phrase is not natural, and so a few samples are required to demonstrate this ordering. Modifiers are usually indicated with the dependent clause initial marker ʔən. In both quantified noun phrases, the semantic head can be omitted in context. (32) baːr bɛː ʔariːw ʔən ɟiə̰m two bottle wine rel tasty ‘two bottles of wine that is tasty.’ (33) ʔalimɔː ʔntraç buːt.chiː ʔnnɛh several clf pen prox ‘these several pens.’ (34) duŋ ʔaʔɛːm kɨː house younger.sibling 1sg ‘the house of my younger sibling.’

4.5 Aspect Pacoh indicates time frames lexically with time words or aspectual markers. The progressive ʔat (homophonous with the word meaning ‘to be located at’)

pacoh

895

and negative completion marker joːh are preverbal elements, while completion is indicated by a sentence-final particle ɟəː or ɟeː. (təmbəːj ‘about to’, kʰɔjʔ ‘already’, ʔat ‘prog’, təmmɛː ‘just’, joːh ‘not yet’, ʔiɲ ‘(future)’, etc.), (35) ʔacaːj ʔat toːŋ ʔaməh older.sibling stay say what ‘What are you saying?’ (36) ʔaʔiː kɨː prɛːŋ ʔaːw ɉeː mother 1sg dry clothes compar ‘My mother has already dried the clothes.’

Completion interrogative is expressed by sentence final joːh ‘yet’. (37) ʔalɛʔ ʔijoːh tired nsit.q ‘(Are you) tired yet?’

The word ʔiɲ ‘want’ has additional semantic functions, including obligation/ suggestion and prediction. (38) ʔaməh ʔiɲ pləj What want buy ‘What will you buy?’

4.6 Modality Modality can be expressed lexically through preverbal elements and sentencefinal particles (see §5.6). Auxiliary verbs express categories such as ability (boːn ‘get’ and hoːj ‘able’), aspect (təmbəːj ‘about to’, kʰɔjʔ ‘already’, etc.), and the negative imperative (ʔakəp ‘don’t’). See §5.2 for examples of words and discussion of their functions. (39) ʔato̰ ʔ ləjʔ ʔiɲ ŋɔjʔ siːw hot neg want drink liquor ‘When’s it’s hot, one shouldn’t drink alcohol.’

Sentence-final particles express the imperative, the interrogative, assertion, and light tone. In other cases, modality is contextually determined. Here, ʔən and həh indicate assertion of the statement.

896

alves

(40) ʔndɔː laː ʔaroːh ʔən 3sg.poss be aroh.trap asrt ‘This is the “aroh” trap.’ (41) kiː maː ʔabil ʔn mo̰ ːt toq noh həh thus top mouse it enter to there asrt ‘So the mouse enters here.’

5

Word Classes

The basic set of Pacoh word classes includes nouns, verbs, prepositions, conjunctions, adverbs, and sentence particles. Each of these categories has subcategories which can be determined by their distributional properties and semantic-syntactic functions, as summarized in following subsections. 5.1 Nominals Pacoh nominals include common nouns and proper names as well as pronouns. Additionally, there are such nominal categories as numerals and quantifiers. Common nouns are not countable and must co-occur with measure words or classifiers in quantified noun phrases. (42) a. bar lam ʔakaj nɛh b. *bar ʔakaj nɛh three clf child prox three child prox ‘These three children.’ Intended: ‘These three children.’ (43) ʔrviət ʔnnɛh ʔih ʔmmaj pen prox neg.be 2sg.poss ‘This pen is not yours.’

Countable nouns consist of classifiers, measure words, and time words, all of which can be directly preceded by numerals, while time words tend to be noun phrase head without complements, measure words and classifiers are directly countable words that take other nouns as complements. The nouns they take as complements are selected according to general semantic categories. Measure words are semantically concrete words that are derived from concrete nouns. Examples of these include baːw ‘bag of’, ʔatɛh ‘basket of’, and kənto̰ h ‘group of’. Single instances can also be expressed by terms such as noh ‘type/kind’ and kəntɨʔ ‘time/episode’. Units of weight (e.g., kən ‘kilogram’) and time (e.g., kisaj ‘month’) are also directly preceded by numerals. Classifiers, on

897

pacoh

the other hand, are semantically categorical, taking broad semantic categories. These include plaːh for flat objects such as leaves, naʔ and tikuəj for humans, toːm for trees, and the default classifier lam for all things. In addition to having a classifier for humans naʔ, Pacoh also has a quantity noun specifically for humans ʔapɛː, which is derived from the third-person pronoun. The semantic feature ‘human’ is also relevant in certain conjunctions (see §5.4). (44) puə̰n naʔ tikuəj four clf people ‘Four people’ (45) ʔapɛː ʔaʔɛːm cɔːm ləjʔ 3pl younger.sibling know q ‘Do you (younger ones) know (it)?’

Personal, Demonstrative, and Interrogative Pronouns are all contextually referential words but have various distinct functions as summarized below. Personal pronouns in Pacoh have a variety of case-marking prefixes (see §3.4). Demonstrative pronouns in Pacoh have three levels of distance: near the speaker, away from the speaker, and farther away from the speaker. Note that the difference between distal and far distal is iconically indicated by vowel length, with short vowels for distals and long vowels for far distals. Table 12.5 Pacoh personal pronouns Number

Person

General

Dative

Possessive

Singular

1st 2nd 3rd 1st 2nd 3rd 1st 2nd 3rd

kɨː maj dɔː ɲaŋ ʔiɲaː ʔaɲaː hɛː ʔipɛː ʔapɛː / ŋaːj

ʔakɨː ʔamaj ʔadɔː ʔaɲaŋ ʔadɔː-ʔiɲaː ʔadɔː-ʔaɲaː ʔahɛː ʔadɔː-ʔipɛː ʔadɔː-ʔapɛː / ŋaːj

ʔŋkɨː ʔmmaj ʔndɔː ʔɲɲaŋ ʔndɔː-ʔiɲaː ʔndɔː-ʔaɲaː ʔŋhɛː ʔndɔː-ʔipɛː ʔndɔː-ʔapɛː / ŋaːj

Non-Singular

Plural

898

alves

Table 12.6 Pacoh demonstrative pronominals Distance

Proximal Medial Distal Remote Distal

Fore/Higher

Aft/Lower

Beside

ʔntih ʔntiːh

ʔnnɛh / nɛh ʔŋkoh / koh ʔnto̰ h ʔnto̰ ːh

ʔntrah ʔntraːh

Interrogative pronouns (ʔnnaw ‘who’, ʔaməh ‘what’, tumɔː ‘where’, mɔː ‘which’) can also be used as indefinite nouns. Word order and context determines the meaning. Note that these then occur in their case-marked questions and are not fronted, as in the following example. (46) tikuəj kardoːk ləjʔ cɔːm taʔ ʔaməh person lazy neg know do whatever ‘Lazy people don’t know how to do anything.’

These can also co-occur with nouns to generalize them. (47) mɨəŋ kɨː duŋ ʔnnaw cuŋʔ viː tiriək.ʔaraːk village 1sg house whichever also have buffalo ‘In my village, every house has buffalo.’

5.2 Verbs Verbs in Pacoh share the properties of being negatable by ləjʔ ‘no/not’ (the exception being the preverb ləjʔ itself and the nominal negator ʔih) and of being marked by the completive sentence-final particle ɟəː. Verbs in Pacoh can be further distinguished by their specific sentence frames and complements. In terms of distribution, differing verb types may take (a) only noun complements, (b) verb complements, or (c) take grammatical functors, including phrases headed by prepositions or relator nouns. The major classes are summarized below. Basic verbs include intransitive, transitive, ditransitive, and locative verbs, which can be progressive, and impersonal and stative verbs, which generally cannot. All noun complements follow verbs unless topicalization occurs (see §5.4 and 4.3).

pacoh

899

(48) dɔː ʔat caː do̰ ːj 3sg stay eat rice ‘He’s eating rice.’ (49) viː klɨŋ peʔ ʔən do̰ ːŋ exist many bananas rel ripe ‘There are many ripe bananas.’ (50) ʔaɉɨːŋ kɨː ʔaç lɨː leg 1sg swollen very ‘My leg is very swollen.’

Morphologically derived verbs have additional special properties. Two common subcategories of morphologically derived verbs include causative verbs, which can take a variety of complements, and reciprocal verbs, which take plural subjects. (51) kɨː paŋɔjʔ ʔadɔː jəw daːʔ 1sg caus.drink 3.dat friend water ‘I made my friend drink some water.’ (52) hɛː pərcuŋ toʔ ʔŋkɨːm trɨəŋ 1pl caus.gather loc middle school ‘We gathered in the middle of the school.’

Verbs taking prepositional phrases or relator nouns/relator noun phrases include locative, ditransitive, and a few other categories. (53) ʔaɉɨːŋ kɨː ʔaç lɨː leg 1sg swollen very ‘My leg is very swollen.’ (54) juə̰n patḛːc ʔacɔː ʔadɔː pako̰ h Vietnamese sell dog 3.dat Pacoh ‘The Vietnamese sell dogs to the Pacoh.’

Preverbs requiring following verb complements have more specific semantic and grammatical functions: aspect, mood, causation, motion, negation, emotions/thought, among others. They include both active and stative verbs, the latter of which have adjectival semantic functions but which syntactically

900

alves

cannot be easily distinguished. Indeed, both categories of verbs can be modified by lɨː ‘really/very’, though the progressive preverb ʔat tends to occur with active verb. The fact that both semantic types can occur with the intensifying lɨː and the fact that they otherwise have no true syntactic distinctions suggests their singular syntactic status. (55) naːm lɨː ʔiɲ tərchɔːj bɔːŋ pn really want play ball ‘Nam really wants to play ball.’ (56) kɨː ʔat ko̰ ːn peːɲ ʔaceʔ 1sg stay wait shoot bird ‘I am waiting to shoot a bird.’

Ability can be indicated with preverbal hoːj or boːn ‘get’. (57) viː klɨŋ tikuəj ʔən ləjʔ hoːi toːŋ kaːŋ pako̰ h exist many people rel neg able speak language Pacoh ‘There are many people that can’t speak the Pacoh language.’

Some of the verb complements are marked by the ʔi- prefix (see §3.4). (58) baːj dɔː lɨː ʔiən ʔi-cɔːm lesson 3sg really easy nref-know ‘That lesson is easy to understand.’

5.3 Prepositions and Relator Nouns Semantic and some grammatical relations are expressed by either nouns or verbs. Verbs used to mark semantic and grammatical relations include ɟo̰ ːn ‘to, for’ (lit. ‘give’), toʔ ‘to, at, until’ (lit. ‘arrive’), ʔat ‘at’ (lit. ‘be at’), tilət ‘over’ (lit. ‘surpass’): (59) kɨː vit peːʔ toʔ dɔː 1sg toss banana arrive 3sg ‘I threw the banana at him.’ (60) ʔat kruːŋ.kutiə̰k hɛː viː tikuəj pako̰ h katuː stay country 3pl exist people Pacoh Katu ‘There are Pacoh and Katu people in the country.’

pacoh

901

Apart from these, there is a handful of prepositions, such as ʔalɨŋ ‘to, with’ and tɛ ‘from’. Relator nouns express a variety of grammatical categories, including location (tatuːn ‘back’, məmmat ‘front’, kəlluŋ ‘inside’, etc.), the dative (ʔadɔː ‘to (for humans)), attributives time (hoːj and ʔndo̰ ŋ ‘when/at the time of’), and instrument/means (daŋ, tək, and baŋ ‘with/by’). These can serve as predicate centers. (61) kalluŋ ro̰ ː lɨː ʔeː noh ʔalɔŋ inside field very many type tree ‘There are many types of trees in the field.’ (62) tumiəŋ ʔiɲɛʔ daŋ ʔalɔŋ crossbow make place wood ‘Crossbows are made of wood.’

Possessive relations are expressed by the linker ʔən ‘of, which’. When preceding a pronoun, it is shortened to ʔn-. (63) ʔrviət ʔndɔh ʔn-dɔː ʔacaːj pen medl poss-3sg older.sibling ‘That pen belongs to that fellow.’

5.4 Conjunctions and Clause-Connecting Words Clause-connecting word distribution includes (a) clause-initial in moveable phrases, (b) inter-clausal words, and (b) topic-markers connecting topics and comments. Clause-initial words include nam and lah ‘if’, viː and co̰ ː ‘because’, ʔalɨŋ ‘and’, ‘but’, koh and the compound koh-maː ‘so’. (64) nam ʔacaːj taʔ viək ʔarəʔ ʔŋkoh laː ʔalɛʔ lɨː cond older.sibling do work like medl then tired very ‘If you work like that, you’ll be very tired.’ (65) dɔː mo̰ ːt toʔ ʔŋkoh koh boːn caː 3sg enter loc medl medl get eat ‘It enters here and so it can eat and shakes the bait.’

They are sometimes optional, in which case contextual clues provide meaning, such as cause-effect or condition.

902

alves

(66) vit kulaj ʔntih kɨː ʔiɲ caː throw fruit dist 1sg want eat ‘Throw that piece of fruit to me (since) I’d like to eat it.’

Topic-comment linking connective words kiː ‘cons’ koh ‘medl’, maː ‘top’ and laː ‘be’ appear after sentence-initial noun phrases. (67) kɨː kituh ʔacaːj kiː prɨp 1sg push older.sibling cons fall ‘I pushed, so you fell.’ (68) ʔŋkoh laː taluəŋ kunɛː poːk medl be road pig go ‘That, then, is the road where the pig goes.’

5.5 Adverbials Whether there is a separate word class to describe adverbial functions in Pacoh is not certain. Verbs and nouns can have adverbial functions (e.g., serve as adjuncts of verbs, expressing manner, etc.), but there is no morphological marking to make such distinctions clear. Generally, adverbials occur post-verbally. (69) dɔː laluh ɲaʔ lɨː 3sg run fast very ‘He ran very fast.

Other more specialized adverbial functions include intensifying, iterative, resultative and simultaneous notions, among others. Their distribution is more complex, with some occurring variously before or after head verbs, and moreover, some of these can be considered more grammaticalized in nature (i.e. co-occur with words without semantic restrictions, semantically opaque, specialized syntactic functions, etc.). (70) a. toːm lɔːj mo̰ ːj kəntɨːʔ b. caː do̰ ːj lɔːj say again one time eat rice more ‘Say that again one time.’ ‘Eat more rice/eat rice more.’

5.6 Sentence-final Particles Sentence final particles are a word class that cannot be negated or quantified and express abstract grammatical and pragmatic notions. In terms of syntactic distribution, Pacoh has (a) topic-marking sentence particles which occur

903

pacoh

between sentence-initial topics and main sentence matrix and (b) sentencefinal particles. Assertion of statements can be expressed with həː, hɨː, bəː, and ʔən. Commands can be emphasized with the particle ʔaw. Interrogative particles include ləjʔ for general questions, joːh for questions about completion of an action or state, and ʔalɨŋ ləjʔ (literally, ‘or not’) for confirmation of a situation. It is worth noting that, while interrogative particles are transparently related to homophonous non-particle forms, the assertive and urging particles are not. (71) ʔaʔɛːm ŋɔːj hɨːt hɨk kʰɔː lɨː bəːh younger.sibling consume tobacco much difficult very emph ‘If you (said to person younger than speaker) smoke a lot, it is really hard to bear.’ (72) kɨː ʔiɲ caː həː 1sg want eat emph ‘Goodness, do I want to eat!’

6

Semantics and Pragmatics

6.1 Specialized Vocabulary Kinship terms are also used as general terms of address among family members. Some Pacoh kinship terms can be used with non-family members, but these appear to be very formal situations and possibly even primarily a result of interaction with Vietnamese, who use kinship-based referential terms (e.g., Vietnamese anh ‘brother’ to refer to ‘I (male)’, ‘you (male)’, or ‘he’) almost to the exclusion of native pronouns, which are considered extremely informal and even rude. For this interaction with others, the Pacoh may use ʔacaːj (meaning ‘older sibling’) with males and ʔamɔːʔ (meaning ‘older sister’) with females. 6.2 Language Use The Pacoh people most often speak Pacoh as their first language. They may have varying degrees of ability to listen to or speak other minority languages, such as the Bru in Quang Tri. Vietnamese has become increasingly common as more Vietnamese move into the highlands. Moreover, the Vietnamese government has established schools for young adult Pacoh where they learn to read and write in Vietnamese.

904

alves

6.3 Names in Pacoh Watson (1969) describes the name system of Pacoh, including both the structure and selection of Pacoh names, how they change during an individual’s lifetime, and their socio-linguistic role in Pacoh society. Kinship terms are often added to personal names to indicate social status or relations. (73) ʔacaːj kɨəŋ plɨm lɨː older.sibling pn plump very ‘You, mister Cường, are rather plump.’

6.4 Grammaticalization The grammatical lexical elements in Pacoh show clear patterns of grammaticalization in which words with semantically abstract, more grammatical functions are derived from content vocabulary. Many of these instances are similar to languages in the region and thus represent both the result of areal contact, but also universal processes. This is clearly observable among aspectual markers (e.g., ʔat ‘to be located in’ > progressive, təmmɛ ‘new’ > ‘just’, and ʔiɲ ‘want’ > future). The comparative tilət ‘more than’ is clearly derived from ‘surpass’, a common Southeast Asian areal grammaticalization cline (Ansaldo 2010). Moreover, modal sentence-final particles are related to preverbs, classifiers and measure words are often related to nouns, and some relator nouns and plural nouns are derived from pronouns (Alves 2007). 7

Glossed Text

The sample of a Pacoh story below is from Watson (1980). paːŋ ŋaːj ʔntiəʔ ʔinoː-ʔntraːh lɨː ləjʔ cɔːm ʔuraʔ cɔːm ʔurar life 3pl old.days yesterday-rem.dist very neg know writing know writing.euph ‘In the old days, they really didn’t know books and letters.’ koh-maː paːŋ ŋaːj nɨːm ʔllam taʔ ʔllam caː ʔaboːn tiən-praʔ medl-top life 3pl only sg.clf do sg.clf eat get money-silver taʔ pidaː.klaːj taʔ haːŋ.hɔːʔ hɨːt kido̰ ːl trɔːj.turuːjʔ do field do goods tobacco hemp betel.nuts ‘But in their days they just worked just ate, earned money, worked fields and lands, traded goods and products, tobacco and hemp, betel nuts.’

pacoh

905

koh-maː toʔ paːŋ hɛː ʔakiː ʔnnɛh pɨːt dɔː ʔən cɔːm thus to life 3pl time prox many 3sg rel know ʔuraʔ cɔːm ʔurar mḛʔ tərliː jṵən writing know writing.euph nearly equal Vietnamese ‘But at the present time, there are many who know books and letters, nearly equal to the Vietnamese.’ koh-maː vaːç ʔitah ʔaʔiː ʔitah ʔaʔɛːm-ʔacaːj medl-top become abandon father abandon younger.sibling-older.sibling ʔitah kruəŋ daːʔ məh mɨəːŋ məh naʔ məh daːʔ abandon valleys water each region each person each water ‘Thus it comes that people are abandoning their mothers, abandoning their fathers, abandoning brothers and sisters, abandoning valleys and rivers; each one to his own region, each one to his own river.’ lɨː diəʔ lɨː tuh pɨːt ŋəh cḛːt pɨːt ŋəh pit very hard very desperate many 3pl die many 3pl lose ‘It’s very difficult, very desperate; many are dead, many are lost.’ ləjʔ ʔi-boːn hoːm ʔaʔiː ʔaʔam duŋ vḛːl ʔakaj kampaːj neg nref-get see father mother home village children wife ‘It’s not possible to see mothers and fathers, homes and villages, children and wives.’

8 Bibliography Alves, Mark J. 2000. A Pacoh Analytic Grammar. Unpublished doctoral dissertation at the University of Hawai’i. ———. 2001. Distributional properties of Mon-Khmer causative verbs. Mon-Khmer Studies 31: 107–120. ———. 2006. A Grammar of Pacohː A Mon-Khmer Language of the Central Highlands of Vietnam. Pacific Linguistics. The Australian National University. ———. 2007. Pacoh Pronouns and Grammaticalization Clines. In Iwasaki Shoichi et al. (eds.), SEALS XIII Papers from the 13th Annual Meeting of the Southeast Asian Linguistics Society 2003. Canberra: Pacific Linguistics, 1–12. Ansaldo, Umberto. 2010. Surpass comparatives in Sinitic and beyond: typology and grammaticalization. Linguistics 48.4: 919–950. Cubuat and Richard Watson. 1976. Bai ihoc cang Pacoh (Pacoh language lessons). Huntington Beach, CA: Summer Institute of Linguistics.

906

alves

Nguyễn Văn Lợi, Ðoàn Văn Phúc, and Phan Xuân Thành. 1986. Sách học tiếng PakôhTaôih (Text to study the Pacoh and Taoih languages). Hà Nội: Ủy Ban Nhân Dân. Mole, Robert L. 1970. The Montagnards of South Vietnamː a study of nine tribes. Rutland, Vermont: Charles E. Tuttle Company Publishers. Sidwell, Paul. 2005. The Katuic Languages: Classification, Reconstruction and Comparative Lexicon LINCOM studies in Asian linguistics, 58. München: Lincom Europa. Watson, Richard. 1964. Pacoh phonemes. Mon-Khmer Studies 1: 135–48. ———. 1966a. Clause to sentence gradations in Pacoh. Lingua 16: 166–188. ———. 1966b. Reduplication in Pacoh. Hartford CT: Hartford Seminary Foundation thesis. ———. 1969. Pacoh names. Mon-Khmer Studies 3: 77–89. ———. 1970. The Pacoh. The Montagnards of South Vietnam, ed. by Robert Mole, 101– 136. Rutland, Vermont: Charles E. Tuttle Company Publishers. ———. 1976. Pacoh numerals. Linguistics 174: 81–88. ———. 1977. Discourse elements in a Pacoh narrative. Mon-Khmer Studies 6: 279–322. ———. 1980. A grammar of two Pacoh texts. Arlington: University of Texas at Arlington PHD dissertation. ———. 1984. Scheme and point in Pacoh expository discourse. In Robert E. Longacre (ed.), Theory and application in processing texts in non-Indo-European languages. Hamburg: Buske, 113–51. ———. 1996. Why three phonologies for Pacoh? Mon-Khmer Studies 26: 197–205. ———. 2011. The case for clitics in Pacoh. Austroasiatic Studies: Papers from ICAAL 4. Mon-Khmer Studies Journal, Special Issue No. 3, 222–232. Watson, Richard and Saundra Watson. 1973. Pacoh wordlist and texts (includes cassette tape recordings of Pacoh speakers reading those materials). ms. Watson, Richard, Saundra Watson, and Cubuat. 1979. Pacoh dictionary. Huntington Beach, FL: Summer Institute of Linguistics. Watson, Saundra. 1964. Personal pronouns in Pacoh. Mon-Khmer Studies 1: 81–97. ———. 1966. Verbal affixation in Pacoh. Mon-Khmer Studies 2: 15–30. ———. 1976. The Pacoh noun phrase. Mon-Khmer Studies 5: 85–95.

section 7 Vietic

․․

chapter 13

Vietnamese (Tiếng Việt) Marc Brunelle 1

Background

Vietnamese, the official language of Vietnam, belongs to the Vietic branch of the Austroasiatic family. It is by far the largest Austroasiatic language, with about 80 million native speakers and 5–10 million second language speakers (based on the ethnic minority figures in the 2009 Vietnamese census). Among these, 2–3 million speakers live abroad, including about 1 million in the US (2000 US census) and probably close to half a million in Cambodia. Paradoxically, the largest Austroasiatic language is typologically very divergent from its Austroasiatic neighbors because intensive contact with Chinese dramatically restructured its lexicon and affected its phonology. Vietnamese was also written in a Chinese derived script, chữ nôm, from the 14th to the early 20th century, but is now exclusively written in quốc ngữ, a Latin script developed by Portuguese Catholic missionaries from the 16th century and first fully codified in Alexandre De Rhodes (1651)’s Vietnamese-Latin-Portuguese dictionary. Vietnamese exhibits strong dialectal variation (Hoàng 1989). Although it is often described as having three main dialects (northern, central and southern), the linguistic reality is far more complex. The northern dialect (from Thanh Hoá province to the Chinese border) and southern dialect (from Khánh Hoà province to the southern tip) are relatively homogeneous, but the area in between is a patchwork of often mutually unintelligible dialects that cannot be lumped together as a unified ‘Central Vietnamese’. In practice, mutual intelligibility is insured by the existence of interlocked national and regional standards. The national standard promoted by the national media is based on the Hanoi variety, but it is competing with a southern standard, based on the Hồ Chí Minh City variety, that extends its influence all the way to central Vietnam and is used in southern-based media broadcast nationwide. To these two major standards, one must add a number of regional standards, often based on the variety of the largest city in a given area. In this article, examples will be given in quốc ngữ, alongside narrow IPA transcriptions representing the surface form of the standard northern dialect (peculiarities of the southern phonological system will be briefly discussed in

© koninklijke brill nv, leiden, ��5 | doi ��.��63/9789004283572_019

910

brunelle

the phonology section). Phonetic transcriptions are given in IPA, except tone notation, because IPA tone marks do not allow a precise notation of the complex Vietnamese tone contours and of their voice qualities. Instead, I follow an alphanumerical system developed in Tai and Chinese historical linguistics.1 Glossing follows the Leipzig conventions, but I use a dot to link the elements of polysyllabic words (including opaque compounds), which are separated by a space in the native orthography. Vietnamese is a well-described language, with several dictionaries and comprehensive grammars. The first systematic grammars date back to the 19th century (Aubaret 1867; Trương 1883) and influential grammars in English were published in the second half of the 20th century (Emeneau 1951; Nguyễn 1997; Thompson 1965). Since this is a reference book, the citation strategy I adopt is to privilege recent work published in English, the language of this volume, sometimes at the expenses of seminal but less up-to-date work or materials published in other languages. I would nonetheless emphasize that exhaustive research on Vietnamese also requires a good knowledge of the considerable literature published in Vietnamese, French and even Russian. 2 Phonetics/Phonology 2.1 Word/Syllable Structure Vietnamese is often described as the stereotypical monosyllabic language. This is a gross oversimplification. Besides monosyllables (1), it also comprises a large number of transparent native compounds (2) and of semi-opaque SinoVietnamese compounds (3). There is also a small, but significant number of polysyllabic loanwords (4). (1) Monosyllabic words a. sân [sə̆ nA1] ‘yard’ b. bay [ɓăjA1] ‘to fly’

c. máy [măjB1] ‘machine’ d. bàn [ɓanA2] ‘table’

1 In this system each tone receives a combination of a letter and a number. ‘A’ tones derive from originally open syllables, ‘B’ tones derive from creaky syllables and syllables originally closed by a glottal stop, and ‘C’ tones stem from syllables originally closed by an –h. ‘D’ tones are found in checked syllables. ‘1’ is used for tones found on syllables that originally had a voiceless onset, while ‘2’ is used for syllables that originally had a voiced onset.

vietnamese ( tiếng việt )

911

(2) Native disyllabic compounds (semantically transparent) a. sân bay [sə̆ nA1 ɓăjA1] yard + to fly ‘airport’ b. quán ăn [kwanB1 ɁănA1] shop + eat ‘restaurant’ c. bàn ghế [ɓanA2 ɣeB1] table + chair ‘furniture’ d. cây cỏ [kɛ̆jA1 kɔC1] tree + grass ‘vegetation’ (3) Semi-opaque Sino-Vietnamese disyllabic “compounds” a. tiểu hổ [tiewC1.hoC1] to be small + tiger ‘cat (esp. cat meat)’ b. Hà Nội [haA2.nojB2] river + interior ‘Hanoi’ c. tham gia [tʰamA1.zaA1] to attend + to add ‘to participate’ d. khu biệt [xuA1.ɓietD2] area + to part ‘to distinguish, distinctive’ (4) Polysyllabic loanwords a. Sài Gòn [sajA2.ɣɔnA2] < Khm. prej nokor (forest city) ‘Saigon’ b. ban công [ɓanA1.kowŋ͡mA1] < Fr. balcon ‘balcony’ c. phô tô cóp pi [foA1.toA1.kɔpD1.piA1] < Fr. photocopie ‘photocopy’ d. Niu Di Lân [niwA1.ziA1.lɤ̆nA1] < Engl. ‘New Zealand’

Native disyllabic compounds (2) can be either subordinative (2ab) or coordinative (2cd). These categories will be further defined in §3.1, but what matters here is that both of these types are semantically transparent. Polysyllabic loanwords (4), on the other hand, are opaque, e.g. cannot be decomposed into morphemes or roots. Sino-Vietnamese compounds (3) fall somewhere in between and thus have an interesting status in the language. While they obey Chinese constituent structure (modifier + head rather than native head + modifier) and are mostly composed of units that cannot be normally used as free standing words (exceptions here: hổ in 3a and khu in 3d), they are not necessarily semantically opaque (or at least not for educated speakers). Their status is very similar to Greek and Latin roots in Western languages, which are not productive morphemes, but whose meaning can be uncovered or at least guessed from their occurrence in sets of semantically related words. There is little evidence of word stress in subordinative compounds. In fact, the prominence structure of such compounds is phonetically indistinguishable from that of phrases in natural conditions (Nguyễn & Ingram 2007a). Evidence for a greater prominence of the final syllable has been found in coordinative compounds and reduplicated forms (Nguyễn & Ingram 2007a; Nguyễn & Ingram 2007b), but at this point it is difficult to know for certain if this relatively small effect is due to word stress or word-final lengthening.

912

brunelle

The syllable (5) is composed of an obligatory simple onset (which can be a glottal stop), followed by an optional medial glide -w-. The rhyme consists of a monophthong or diphthong that can be closed by a coda. A minimal weight requirement forces rhymes to have at least two moras, which prevents the occurrence of short vowels in open syllables. All syllables must bear tone. (5) T C(w)V(V)(C)

2.2 Phoneme Inventory and Phonotactics The inventory of onsets is given in (6). (6) Vietnamese onsets Northern standard Southern standard t ʧ k Ɂ t ʧ/ʈ c k Ɂ tʰ tʰ ɓ ɗ ɓ ɗ g f s x h f s (ʂ) x h v z ɣ v m n ɲ ŋ m n ɲ ŋ l r, l j

There are significant differences between the main two dialects. Southern Vietnamese maintains a contrast between a slightly affricated retroflex stop (spelled tr-) and a palatal stop (spelled ch-) that are merged in the Northern dialect. This contrast between palatals and postalveolars is also maintained for voiceless fricatives in hyper-formal speech: a contrast between /s-/ (spelled x-) and /ʂ-/ (spelled s-) is taught in schools and relatively frequent in southern media. A second important difference is that the Northern Vietnamese voiced velar fricative /ɣ-/ is realized as [g-] in Southern Vietnamese. The last important difference is that Southern Vietnamese /r-/ and /j-/ are merged into /z-/ in Northern Vietnamese. There is also a strong tendency to realize /v-/ as [j-] in Southern dialects, but this is substandard. The medial glide /-w-/ has two allophonic variants, [-ɥ-] before front vowels and [-w-] before other vowels. It is phonotactically banned after labial onsets in both dialects (except in a handful of French loanwords). There is a tendency to reduce Cw- sequences in Southern dialects by deleting the glide (in /sw-/ and /tw-/), deleting the onset (in /kw-/, /gw-/ and /hw-/) or merging the two elements of the cluster (/xw/ → [f]).

vietnamese ( tiếng việt )

913

Codas are a subset of onsets, as shown in (7). The only significant departure to this generalization is the presence of a voiceless bilabial stop /p/, absent in onsets. (7) Vietnamese codas Northern dialect Southern dialect p t k p t~k m n ŋ m n~ŋ w j w j

The two velar codas undergo significant allophonic variation. In Northern Vietnamese, they become palatalized as [- jk, - jŋ] after front monophthongs and labio-velarized as [-k͡p, -ŋ͡m] after back monophthongs. The only exception to this generalization are the rhymes [-ăjk, -ăjŋ], in which palatalized consonants surface after a central vowel. This has led Haudricourt (1952) to reanalyze these rhymes as underlying /ɛ̆k, ɛ̆ŋ/, thus acounting for the otherwise unexplained absence of palatal and velar codas after the vowel /ɛ/ and shedding light on the behavior of long /ɛː/ and /ɔː/ in loanwords. Southern Vietnamese has an even more complex allophonic distribution of codas. It does not have palatals and has lost the contrast between alveodentals and velars in favor of an allophonic distribution predictable from vowels. In a nutshell, front monophthongs can only co-occur with bilabial and alveodental codas while central and back vowels can only be followed bilabials or velars. This complicated phonotactic problem is a central issue in Vietnamese phonology (reanalyzed in English in Phạm 2006). The Northern Vietnamese vowel system (8) has three contrastive heights and three contrastive degrees of frontness. There are also three underlying offgliding diphthongs and two pairs of vowels contrasting only in length: /ă ~ aː/ and /ə̆ ~ əː/ (in further IPA transcriptions, a brevis is used for the contrastive short vowels [ă, ə̆ ] and their allophones; length is otherwise unmarked). Two additional long vowels, /ɛː/ and /ɔː/, are rare and overwhelmingly found in loanwords. A slightly controversial point is the exact place of articulation of the central vowels, which are often claimed to be back. Although recent acoustic evidence suggest a fairly backed place of articulation, which could support the choice of the symbol /ɯ, ɤ/ rather than /ɨ, ə/ (Kirby 2011), articulatory evidence is still needed. Overall, vowels display relatively little allophonic variation: /o/ and /ɔ/ have diphthongized variants, [ow] and [ɔw~aw], before velar codas and the diphthongs /ie, ɨə, uo/ are realized as [iə, ɨə, uə] in open syllables.

914

brunelle

(8) Northern Vietnamese vowels ie ɨə uo i ɨ u e ə̆/əː o ɛ̆/(ɛː) ă/aː ɔ̆ /(ɔː)

The Southern Vietnamese vowel system is slightly different from that of Northern Vietnamese. The Northern diphthongs are realized as long vowels /iː, ɨː, uː/ in the South and the short vowels /ă/ and /ə̆ / merge to /ă/ except before glides. There are also tendencies to high vowel laxing and diphthongization of long vowels. The last point that needs to be addressed is the phonotactic distribution and allophonic realization of rhymes composed of a vowel and a coda glide. First of all, /-j/ is never found after front vowels, while /-w/ is never attested after back vowels, while central vowels can be followed by either. Moreover, the contrasts between /ăj ~ ə̆ j/ and /ăw ~ ə̆w/ tend to be neutralized in Northern Vietnamese: this results in [ɛ̆j, ɔ̆w]. In contrast, Southern Vietnamese /ăj/ and /ăw/ merge with /aːj/ and /aːw/, yielding [aːj, aːw]. 2.3 Suprasegmentals Northern Vietnamese has six phonemic tones in open syllables and syllables closed by a sonorant. Only two tones are found in syllables closed by stops (or checked syllables): they are often treated as allophonic variants of tones B1 and B2—which is reflected in the orthography—and have tone shapes that are very similar to B1 and B2, though slightly shorter (Vũ 1981; Vũ 1982). A peculiarity of Northern Vietnamese is the importance of phonation in the realization of some tones (Brunelle et al. 2010; Michaud 2004; Nguyễn & Edmondson 1997). Besides the pitch contours given in (9), three tones have specific phonation types: tone B2 ends in a dramatic glottal stop, C1 has a tense/slightly creaky phonation towards its end and tone C2 has a strong glottal constriction at its lowest point. Note that tone D2 shows no evidence of glottalisation, contrary to B2, despite a similar contour (Michaud 2004). Two of the tones represented in (9) have other, more conservative, variants. Tone B1 is often realized as a high-rising tone (especially by older speakers and/ or outside Hanoi), while tone C1 is still often produced as a falling-rising tone by more conservative speakers.

915

vietnamese ( tiếng việt )

(9) Northern Vietnamese tone system in unchecked syllables, female speaker (from Brunelle & Jannedy 2013)

Mean f0

350

Tone

C2

300

A1 (ngang) ˦ A2 (huyȇn) ˨˩ B1 (sắc) ˨˧/˦˥ B2 (nặng) ˦˧ C1 (hōi) ˦˧ C2 (ngã) ˦˥

A1 B1

B2 C1 250

A2 200

0

50

100

150

200

250

300

Time (msec)

The importance of voice quality in Northern Vietnamese tones has led some researchers to question the importance of pitch in this system (Phạm 2001). However, experimental studies conducted since have revealed that both properties are important for perception, even if pitch is less important than previously assumed (Brunelle 2009b; Kirby 2010). The Southern Vietnamese tone system, by contrast, only has five tones in unchecked syllables due to a merger of C1 and C2. It also has two checked tones, D1 and D2, which are relatively similar to B1 and B2 (Vũ 1981; Vũ 1982). Contrary to Northern Vietnamese, it makes no use of phonation contrasts. The Southern Vietnamese tone system is given in (10). Note that the idiosyncratic realization of tones B2 and C1/C2 shown in (10) are not typical in that their final portions are flat. Most speakers have final rises at the end of these tones. Despite significant tonal coarticulation (Brunelle 2009a), no phonological tone sandhis have been reported in Vietnamese dialects. However, some types of reduplication provide evidence that tones are organized into phonological classes (see §3.3).

916

brunelle

(10) Southern Vietnamese tone system in unchecked syllables, one male speaker (from Brunelle & Jannedy 2013)

Tone

180 B1

Mean f0

160 A1

140 A2

120

C1-C2

100 80

A1 (ngang) ˦ A2 (huyȇn) ˧˨ B1 (sắc) ˦˥ B2 (nặng) ˧˩˩/˧˩˨ C1-C2 (hōi-ngã) ˧˨˨/˧˨˧

B2 0

50

100

150

200

250

300

Time (msec)

3

Word Formation

3.1 Compounding As already briefly mentioned in §2.1, there are two major types of native compounds, coordinative and subordinative (Nguyễn 1997). Coordinative compounds (11) are composed of two juxtaposed lexical words of the same class that have no syntactic relationship. Semantically, coordinative compounds usually designate a class of objects of which the two members of the compound are a subset. Examples of coordinate compounds made up of the three lexical parts of speech are given in (11). (11) Coordinative compounds (N = noun, SV = stative verb, V = action verb) N+N a. cha mẹ b. quần áo SV+SV c. lười biếng d. nghèo khổ

[ʧaA1 mɛB2] [kwə̆ nA2 ɁawB1]

[lɨəjA2 ɓieŋB1] [ŋɛwA2 xoC1]

father + mother pant + shirt

‘parents’ ‘clothes’

to be lazy + to be lazy ‘lazy’ to be poor + to be miserable ‘extremely poor’

917

vietnamese ( tiếng việt )

V+V e. mua bán [muəA1 ɓanB2] to buy + to sell ‘to trade’ f. đi qua đi lại [ɗiA1 kwaA1 ɗiA1 lajB2] go+cross+go+come ‘to keep coming and going’

It has been proposed (Nguyễn 1997) to distinguish generalizing compounds, e.g. compounds composed of two members of a set but which denote the entire set (like 11b), from other coordinative compounds. However, there is no special reason to establish a clear theoretical distinction between these two types: the semantic scope of given compounds is largely lexicalized and even sometimes open to interpretation. While some speakers perceive bàn ghế (2c) ‘table + chairs’ as meaning ‘furniture in general’, other speakers insist that it only means ‘tables, chairs and stools’. Interestingly, there is a small number of quadrisyllabic compounds, like (11f), which are composed of two juxtaposed short phrases. There is also a rich array of subordinative compounds (12). These follow the usual constituent order of native Vietnamese (§5.1), in which heads precede modifiers. (12) Subordinative compounds (N = noun, SV = stative verb, V = action verb) N+N a. cá heo b. mì gói

[kaB1 hɛwA1] [miA2 ɣojB1]

fish + pig wheat noodle + pack

‘dolphin’ ‘instant noodle’

N+V c. người làm [ŋɨəjA2 lamA2] person + work d. bài hát [ɓajA2 hatD1] written piece + to sing

‘maid’ ‘song’

N+SV e. thuốc tây [tʰuokD1 tɛ̆jA1] drug + to be western f. canh chua [kăjŋA1 ʧuəA1] soup + to be sour

‘Western medicine’ ‘k.o. soup’

V+V g. làm thuê h. kiếm ăn

‘to work as hired hand’ ‘to earn a living’

[lamA2 tʰɥeA1] to work + to rent [kiemB1 ɁănA1] to look for + to eat

V+SV i. ăn chay [ɁănA1 tʃăjA1] to eat + to be vegetarian ‘to be vegetarian’ j. coi thường [kɔjA1 tʰɨəŋA2] to watch + to be ordinary ‘to underestimate’

918 SV+N k. chán đời l. mù chữ SV+SV m. tối mù n. chết đói

brunelle

[ʧanB1 ɗəjA2] [muA2 ʧɨC2]

to be bored + life to be blind + letter

[tojB1 muA2] to be dark + to be blind [ʧetD1 ɗɔjB1] to be dead + to be hungry

‘extremely bored’ ‘illiterate’

‘pitch dark’ ‘starving’

The same headedness structure is also found in longer compounds, as shown in (13). In these compounds, not only does the main head precede the modifier, but the embedded compound also follows that structure. (13) Complex native compounds a. máy hút bụi [măjB1 hutD1 ɓujB2] machine + (to suck + dust) ‘vacuum cleaner’ b. xe đạp điện [sɛA1 ɗapD2 ɗienB2] (vehicle + to trample) + electricity ‘electric bicycle’

Based on this, it has been argued that Vietnamese compounds are nothing more than lexicalized phrases (Noyer 1998). In fact, in the absence of clear word-level stress or of regular cliticization phenomena, there is little evidence for the existence of the prosodic word in Vietnamese (but see Phạm 2008). Besides semantically transparent native compounds, there is a large category of Sino-Vietnamese compounds that, as discussed in §2.1, are semi-opaque. Since a large majority of the Vietnamese lexicon is composed of Chinese loanwords, it is important to define what a Sino-Vietnamese compound is. However, this is not a trivial issue (Cao 1985). Stereotypical Sino-Vietnamese compounds contain at least one Sino-Vietnamese bound morpheme (where a bound morpheme is a syllable that cannot be a free standing word), like giải phóng [zajC1. fɔŋ͡mB1] ‘liberation’, in which neither giải nor phóng can be free-standing. By that definition, Sino-Vietnamese compounds would not be real compounds, but would be polysyllabic words. The problem is that some compounds are semantically transparent, while obeying the Sino-Vietnamese order of constituents, in which the modifier precedes the head rather than the opposite native order. An example is bệnh viện [ɓejŋC2-vienC2] ‘hospital’, which is composed of free-standing words (bệnh = ‘disease’, viện = ‘institute’), but has a modifierhead order. On the one hand, such words do not follow regular compound formation rules and should thus be treated as polysyllables. On the other, they are composed of roots that can be free-standing words, which suggests they are compounds. If we add to this conundrum individual variation in the richness and structure of the lexicon, we are left with a problem that is unlikely to be solved by a categorical classification. In the end, even if the meaning of

vietnamese ( tiếng việt )

919

their constituting morphemes is not obvious to (most) speakers, the fact that morphemes and syllables coincide makes Sino-Vietnamese morphemes more easily parsable than Latin or Greek roots in Western languages: few native English speakers are aware of the internal morphemic structure of a word like ‘con+cep+tion’, but all Vietnamese speakers are aware that giải phóng ‘liberation’, is composed of two morphemes, even if their meanings are especially opaque. Whenever complex compounds (14) are composed of one native monosyllabic and one Sino-Vietnamese disyllabic word, the Sino-Vietnamese element is treated as a single word and the compound follows native order. (14) Hybrid compounds (Sino-Vietnamese compounds are underlined) a. thứ điện tử [tʰɨB1 ɗienB2-tɨC1] letter + (electricity + seed) ‘email (electronic letter)’ b. hoa đơn đỏ [hwaA1-ɗənA1 ɗɔC1] (to return + bill) + red ‘official (red) receipt’

3.2 Derivational Morphology A few morphemes are occasionally treated as affixes in Vietnamese linguistics (Nguyễn 1997: 60–66). However, as they are phonologically and morphological indistinguishable from other Sino-Vietnamese monosyllabic morphemes, the reasons to label them as ‘affixes’ seem to be 1) their fixed position as the initial or final edge of the word, 2) their transparent semantics and 3) their productivity. Examples of productive ‘prefixes’ include tiền [tienA2] ‘pre-’, bán [ɓanB1] ‘semi-’ and siêu [siewA1] ‘super-’, while examples of productive suffixes include học [hɔwk͡pD2] ‘-logy’ and hoá [hwaB1] ‘-ize’. Thus ‘presyllable’ is tiền âm tiết [tienA2 Ɂə̆ mA1-tietD1] and ‘nominalization’ is danh từ hóa [zăjŋA1-tɨA2 hwaB1]. Overall, the main derivational processes in Vietnamese are, once again, akin to compounding. 3.2.1 Deriving Nouns A few morphemes can be used to derive nouns from active and stative verbs. The first one is the noun việc [viekD2] ‘business, matter’, which can be appended to any verb to nominalize it. Thus, sữa chửa [sɨəC2 ʧɨəC1] ‘to repair’, can be nominalized into việc sửa chữa [viekD2 sɨəC1 ʧɨəC2] ‘repair’. Another nominalizer is the bound prefix sự, [sɨB2] which derives abstract nouns from stative verbs. For instance, kiên nhẫn [kienA1-ɲə̆ nC2] ‘to be patient’ can be nominalized into sự kiên nhẫn [sɨB2 kienA1-ɲə̆ nC2] ‘patience’. Besides việc and sự, the normal strategy to derive nouns from verbs is to use a classifier that defines them as a countable unit. More details are given in the section dealing with classifiers (§4.2.3), but classifiers used to nominalize verbs include cuộc [kuokD2] ‘period of time’ and suất [swətD1] ‘unit of occurrence’, while those used to nominalize

920

brunelle

stative verbs include cái [kajB1] ‘abstract noun’, nỗi [nojC2] ‘unit of negative emotion’, niềm [niemA2] ‘unit of positive emotion’, and tính [tijŋB1] ‘character’. 3.2.2 Deriving Verbs Aside from the suffix hoá (§3.2), that can be used to derive verbs in ‘-ize’ from any noun or stative verb, there are no strategies for deriving verbs in Vietnamese. 3.3 Reduplication Vietnamese has a rich array of reduplication strategies that have been extensively described and classified (for instance, Emeneau 1951). They consist in reduplicating a syllable, while changing one or more of its constituents (onset, whole rhyme, tone or vowel). A few examples are given in (15). (15) Examples of reduplication (from Emeneau 1951) vội [vojB2] ‘to be hurried’ vội vàng [vojB2-vaŋA2] chồm [ʧomA2] ‘to spring up’ chồm chỗm [ʧomA2-ʧomC2] vấp [və̆ pD1] ‘to stumble’ vấp váp [və̆ pD1-văpD1] cộp [kopD2] ‘to knock’ lộp cộp [lopD2-kopD2]

‘to be hurried’ ‘to crouch down’ ‘to meet difficulties’ ‘to make noise with shoes walking’

A crucial observation here is that reduplication strategies are largely lexicalized. They are usually limited to a handful of words (sometimes a single one), rarely have clear derivational semantics and are not normally applied to new words. Only a few strategies seem to have real productivity. A quasi-exhaustive list (there might be individual and dialectal variation) is given in (16). (16) Productive reduplication strategies a. Full reduplication: Attenuation of stative verbs đỏ [ɗɔC1] ‘red’ đỏ đỏ [ɗɔC1-ɗɔC1] ‘reddish’ nhẹ [ɲɛB2] ‘light’ nhẹ nhẹ [ɲɛB2-ɲɛB2] ‘rather light’ b. Tone changes to pair A: Attenuation of stative verbs (same meaning as 16a) đỏ [ɗɔC1] ‘red’ đo đỏ [ɗɔA1-ɗɔC1] ‘reddish’ nhẹ [ɲɛB2] ‘light’ nhè nhẹ [ɲɛA2-ɲɛB2] ‘rather light’ c. Tone changes to pair A, coda stop nasalizes (checked syllable variant of 16b): Attenuation of stative verbs mát [matD1] ‘fresh’ man mát [manA1-matD1] ‘rather fresh’ sạch [săjkD2] ‘clean’ sành sạch [săjŋA2-săjkD2] ‘rather clean’

vietnamese ( tiếng việt )

921

d. Rhyme changes to -ẻ/ẽ [-ɛCX]: Intensification of stative verbs vui [vujA1] ‘happy’ vui vẻ [vujA1-vɛC1] ‘very happy’ mạnh [măjŋB2] ‘strong’ mạnh mẽ [măjŋB2-mɛC2] ‘very strong’ e. Rhyme changes to -iếc/iệc [-iekDX]: Sarcastic generalizing noun reduplication sách [sajkA1] ‘book’ sách siếc [sajkA1-siekD1] ‘damned books and stuff’ kẹo [kɛwB2] ‘candy’ kẹo kiệc [kɛwB2-kiekD2] ‘damned candies and sweets’

Most of the tone alternations involved in reduplication (especially the productive reduplication strategies in (16) follow well-defined tone classes. Vietnamese tones are organized along several phonological classes that correspond to their diachronic sources but no longer match their phonetic shapes. There are two registers, grouping together tones A1, B1, C1, D1 vs. A2, B2, C2, D2, and four tone pairs, A1-A2, B1-B2, C1-C2 and D1-D2. In reduplication strategies (16b) and (16c), the tone of the base changes to pair A, without changing register. In strategy (16d), the tone is changed to pair C, again without changing register. A similar process happens in (16e), where the tone of the reduplicant must belong to pair D, but keeps the same register as the base (although here many speakers seem to always prefer tone D1). Reduplicated forms can occasionally alternate with other words in complex expressives. An example formed along strategy (16e) is mỳ Ý [miA2 ɁiB1] ‘spaghetti (noodle + to be Italian)’, which can be turned into mỳ Ý mỳ iếc [miA2 ɁiB1 miA2 ɁiekD1] ‘damned pastas’. 3.4 Sound Symbolism Sound symbolic expressions are very prevalent in Vietnamese (for a summary, see Brunelle & Lê 2013). Besides reduplication, the language has a number of phonesthemes, e.g. arbitrary sound sequences that occur repeatedly in words with a similar connotation (17), and a large quantity of ideophones. Ideophones are onomatopoeic expressions that can be used as action verbs or stative verbs to express physical or psychological states. For instance, the ideophone oàm oạp [wamA2 wapD2] conveys the sound that waves make as they break on the shore. There are even a few ideophonic expressions where minute sound changes correlate with changes in the intensity of an action, as in (18). (17) ép dẹp kẹp

Rhyme –ep [ɛp]: to compress, to squeeze, to flatten [ɁɛpD1] ‘to press, to crush’ bẹp [ɓɛpD2] ‘flattened’ [zɛpD2] ‘flat’ lép [lɛpD1] ‘flat’ [kɛpD2] ‘to pinch’ khép [xɛpD1] ‘to close gently’

922

brunelle

nẹp [nɛpD2] ‘splint’ nép [nɛpD1] ‘to crouch’ ẹp [ɁɛpD2] ‘to deflate, to flatten’ xẹp [sɛpD2] ‘to flatten, to deflate’ hẹp [hɛpD2] ‘narrowly delimited, confined’ (18) Varying degrees of intensity in [k- ɲ-] sequence denoting ‘to complain’ càu nhàu [kăwA2-ɲăwA2] ‘to complain (mildly irritating)’ cằn nhằn [kănA2-ɲănA2] ‘to complain (more irritating)’ cảu nhảu [kăwC1-ɲăwC1] ‘to complain (very irritating)’

4

Phrase and Clause Structure

To follow the same structure as other chapters, we will discuss phrase structure before addressing the question of word classes (§5). More details on parts of speech can be found in that section. Note that from this point on, ‘to’ (and ‘to be’ for stative verbs) will be omitted from word-to-word glosses to facilitate parsing and alignement. 4.1 Noun Phrases Based on Nguyễn (1997) and Nguyễn (2013), the noun phrase can be described as having up to ten slots, as in (19). (19) TOT QUANT FOC CLF/MEAS NOUN MAT COL/SIZE ATTRIB Cả bảy cái con cò gỗ cao bị mất [kaC1 ɓajC1 kajB1 kɔnA1 kɔA2 ɣoC2 kawA1 ɓiB2 mə̆ tD1 all seven foc clf crane wood tall pass lost DEM POSS này của bác năjA2 kuəC1 ɓakD1] prox poss uncle2 ‘All these seven lost tall wooden cranes of yours.’

The ten slots are the following: 1. TOT is a totality marker, which in practice, can only be tất cả [tə̆ tD1 kaC1] or cả [kaC1] ‘all’.

2 See §5.2.1 for the use of kinship terms as pronouns.

vietnamese ( tiếng việt )

923

2.

QUANT is a quantifier, which can be either a numeral or the pluralizers những [ɲɨŋC2] or các [kakD1] (§5.2.3). 3. FOC can only be cái [kajB1], which although homophonous with a classifier, is here used as a focus marker (Nguyễn 2013). 4. CLF/MEAS is used for sortal or mensural noun (nouns behaving syntactically like classifiers). 5. NOUN is the noun itself. 6. MAT is the material from which the noun is made. 7. COL/SIZE is a slot reserved for stative verbs denoting color or size. 8. ATTRIB is used for other stative verbs or for relative clauses modifying the noun. Relative sentences can be introduced by the complementizer mà, but this is optional, as exemplified by the relative bị mất in (36). 9. DEM is used for demonstratives (§5.2.1). 10. POSS is the slot containing possessor phrases. Possessors are headed by the preposition của [kuəC1] ‘of’, but it can occasionally be omitted when the possessed noun is a kinship term. A noun phrase must contain at least a noun (20a), a classifier or a mensural noun followed by any post nominal modifier (20b). Nounless NPs are also possible: they must contain a numeral sequence composed of cả and a numeral (20c) or of a quantifier followed by a classifier or mensural noun (20d).

(20) a. Cò sản xuất ở Trung Quốc. [kɔA2 sanC1.swə̆ tD1 ɁəC1 ʧuŋ͡mA1.kwokD1] crane produce reside China ‘Cranes are produced in China.’ b. Con gỗ sản xuất ở Trung Quốc. [kɔnA1 ɣoC2 sanC1.swə̆ tD1 ɁəC1 ʧuŋ͡mA1.kwokD1] clf wood produce reside China ‘The wooden ones are produced in China.’ c. Cả bảy sản xuất ở Trung Quốc. [kaC1 ɓăjC1 sanC1.swə̆ tD1 ɁəC1 ʧuŋ͡mA1.kwokD1] all seven produce reside China ‘All seven are produced in China.’ d. Bảy con sản xuất ở Trung Quốc. [ɓăjC1 kɔnA1 sanC1.swə̆ tD1 ɁəC1 ʧuŋ͡mA1.kwokD1] seven clf produce reside China ‘Seven ones are produced in China.’

924

brunelle

4.2 Verb Phrases 4.2.1 Preverbal Markers The function words that are directly part of the verb phrase (as opposed to phrase-final ones) are all preverbal. Nguyễn (1997) grouped them into seven slots, shown in (21). (21) cũng đều vẫn/còn TENSE/ASP/MD rất NEGATION hay/năng VERB

Most of these slots can only be occupied by a single (or two semantically identical) function word(s). This is the case with cũng [kuŋ͡mC2] ‘also’, đều [ɗewA2] ‘equally’, vẫn/còn [və̆ nC2/kɔnA2] ‘still’, rất [zə̆ tD1] ‘very’, hay/năng [hăjA1/năŋA1] ‘often’. TENSE/ASPECT/MOOD can be filled in by a variety of tense-aspectmood markers like thường [tʰɨəŋA2] ‘habitual’, sẽ [sɛC2] ‘future’, sắp [săpD1] ‘immediate future’, đang [ɗaŋA1] ‘progressive’, đã [ɗaC2] ‘perfective’, vừa/mới [vɨəA2/məjB1] ‘recent perfective’ (more details in §5.2.4). Occasionally, two tense-aspect markers can co-occur in that slot, like đã đang ‘to be in the process of V already’ and sẽ đang ‘will be in the process of V’. NEGATION can be filled in by không [xowŋ͡mA1] ‘negative’, chẳng/chả ‘emphatic negative’ [ʧăŋC1/ ʧaC1], chưa [ʧɨəA1] ‘negative perfective’ and VERB can contain any stative or action verb. Although this template does capture the order of preverbal markers, it is important to note that many of them cannot co-occur for semantic reasons. Further, there seems to be a pragmatic limit of four or five markers after which the interpretation of the verbal phrase becomes impossible. 4.2.2 Complements Besides preverbal markers, action verbs can be followed by several types of complements, as in (22). (22) VERB (NP) (PREP/V2 NP)

Depending on their transitivity, verbs can take zero, one or two complements. Although most bivalent verbs require a direct and an indirect object, there is a handful a ditransitive verbs, like cho [ʧɔA1] ‘to give’, tặng [tăŋB2] ‘to offer’, kể [keC1] ‘to narrate’, viết [vietD1] ‘to write’. The basic order of arguments can be reversed (23), in which case a preposition (§5.2.5), or more frequently a co-verb with a prepositional use, must introduce the second NP complement. Note that the argument closest to the verb seems to be mildly focalized.

vietnamese ( tiếng việt )

925

(23) a. Mỗi khi về Việt Nam, Hương tặng họ hàng [moiC2 xiA1 veA2 vietD2.namA1 hɨəŋA1 tăŋB2 hɔB2.haŋA2 each moment return Vietnam Hương offer relatives

thuốc kháng sinh. tʰuokD1 xaŋB1.sijŋA1] drug antibiotics ‘Every time she goes back to Vietnam, Hương offers relatives antibiotics.’

b. Mỗi [moiC2 each

khi về Việt Nam, Hương tặng thuốc xiA1 veA2 vietD2.namA1 hɨəŋA1 tăŋB2 tʰuokD1 moment return Vietnam Hương offer drug

kháng sinh cho họ hàng. xaŋB1.sijŋA1 ʧɔA1 hɔB2.haŋA2] antibiotics give relatives ‘Every time she goes back to Vietnam, Hương offers antibiotics to relatives.’

Interestingly, inversion is ruled out with cho [ʧɔA1] ‘to give’ in (24), because it is impossible to use it both as a main verb and as a co-verb with a prepositional meaning. (24) a. Hương cho họ hàng thuốc. [hɨəŋA1 ʧɔA1 hɔB2.haŋA2 tʰuokD1] Hương give relatives drug ‘Hương gives relatives drugs.’ b. *Hương cho thuốc họ hàng. [hɨəŋA1 ʧɔA1 tʰuokD1 hɔB2.haŋA2] Hương give drug relatives ‘Hương gives drugs to relatives.’

4.2.3 Adverbial Modifiers The verb can also take adverbial modifiers. If the adverbial modifier is a stative verb, it must be positioned immediately after the verb (25a). Placement after the first NP is questionable (25b–c), perhaps because a stative verb can be syntactically interpreted as modifying the preceding noun. The final position is probably ruled out for the same reason (25d–e), although it gets stronger ungrammaticality judgments. Interestingly, if the adverbial modifier is a

926

brunelle

phrasal adverb, like một cách miễn cưỡng [motD2 kajkD1 mienC2-kɨəŋC2] (one + manner + reluctant) ‘in a reluctant way’, (25d–e) become grammatical. This is probably because phrasal adverbs cannot be confused with noun modifiers, contrary to stative verbs. (25) a. Hương tặng miễn cưỡng một hộp thuốc kháng sinh [hɨəŋA1 tăŋB2 mienC2.kɨəŋC2 motD2 hopD2 tʰuokD1 xaŋB1.sijŋA1 Hương offer reluctant one box drug antibiotics

cho họ hàng. ʧɔA1 hɔB2.haŋA2] give relatives ‘Hương reluctantly offers a box of antibiotics to relatives.’

b. ?Hương tặng một hộp thuốc kháng sinh miễn cưỡng [hɨəŋA1 tăŋB2 motD2 hopD2 tʰuokD1 xaŋB1.sijŋA1 mienC2.kɨəŋC2 Hương offer one box drug antibiotics reluctant

cho họ hàng. ʧɔA1 hɔB2.haŋA2] give relatives ‘Hương offers a box of antibiotics reluctantly to relatives.’

c. ?Hương tặng cho họ hàng miễn cưỡng một hộp thuốc [hɨəŋA1 tăŋB2 ʧɔA1 hɔB2.haŋA2 mienC2.kɨəŋC2 motD2 hopD2 tʰuokD1 Hương offer give relatives reluctant one box drug kháng sinh. xaŋB1.sijŋA1] antibiotics ‘Hương offers relatives reluctantly a box of antibiotics.’ d. *Hương tặng một hộp thuốc kháng sinh cho họ hàng [hɨəŋA1 tăŋB2 motD2 hopD2 tʰuokD1 xaŋB1.sijŋA1 ʧɔA1 hɔB2.haŋA2 Hương offer one box drug antibiotics give relatives miễn cưỡng. mienC2.kɨəŋC2] reluctant ‘Hương offers a box of antibiotics to relatives reluctantly.’

vietnamese ( tiếng việt )

927

e. *Hương tặng cho họ hàng một hộp thuốc kháng sinh [hɨəŋA1 tăŋB2 ʧɔA1 hɔB2.haŋA2 motD2 hopD2 tʰuokD1 xaŋB1.sijŋA1 Hương offer give relatives one box drug antibiotics miễn cưỡng. mienC2.kɨəŋC2] reluctant ‘Hương offers relatives a box of antibiotics reluctantly.’

Unlike action verbs, stative verbs can only be modified by a complement that defines their scope (Nguyễn 1997). This is exemplified in (26). (26) a. Lâm thông minh nhưng rất dốt lịch sử Đảng. [lə̆ mA1 tʰowŋ͡mA1.mijŋA1 ɲɨŋA1 zə̆ tD1 zotD1 lijkD2.sɨC1 ɗaŋC1] Lâm intelligent but very ignorant history party ‘Lâm is intelligent, but really bad at Party history.’ b. Măng cụt mắc mà dễ bán. [măŋA1.kutB2 măkD1 maA2 zeC2 ɓanB1] mangosteen expensive but easy sell ‘Mangosteen are expensive, but easy to sell.’

Adverbial modifiers can also be adverbial clauses, which are introduced by a preposition. Although these are technically sentential adjuncts rather than a part of the verb phrase, they are discussed here to be coherent with other chapters. Important types of adverbial clauses are introduced by function words like nếu [newB1] for conditional clauses, để [ɗeC1] for purposive clauses, tại vì [tajC2 viA2] for causal clauses, etc. A number of markers can be used for time clauses, like khi [xiA1] and lúc [luk͡pD1], and hồi [hojA2], which is used for past time clauses only. 4.2.4 Serial Verb Constructions A crucial aspect of Vietnamese verb phrases is the omnipresence of serial verb constructions. A first type of such constructions is illustrated in (23b), where the verb tặng selects the coverb cho to introduce its indirect object. Other relatively lexicalized co-verbs are also used to express aspects, as mentioned in (§5.2.4). Examples of such verbs include thử [tʰɨC1] ‘to try’, xem [sɛmA1] ‘to watch’, bù [ɓuA2] ‘to compensate’. This is illustrated in (27).

928

brunelle

(27) a. Huyền phải học bù vì làm biếng [hwienA2 fajC1 hɔwk͡pD2 ɓuA2 viA2 lamA2 ɓieŋB1 Huyền ought study compensate because make lazy

cả tuần. kaC1 twə̆ nA2] all week ‘Huyền must catch up on her studying because she’s been lazy all week.’

b. Tối hôm qua cả lớp mời giáo viên Mỹ đi [tojB1 homA1-kwaA1 kaC1 ləpD1 məjA2 zawB1.vienA1 miC2 ɗiA1 night day-past all class invite teacher USA go

ăn thử thịt chó. ɁănA1 tʰɨC1 tʰitD2 ʧɔB1] eat try meat dog ‘Last night, the whole class invited the American teacher to go try dog meat.’

Another frequent type of serial construction consists of a V1, the main verb, and a V2, a movement verb used as a co-verb to indicate the direction, physical or figurative, of the action (Hanske 2013), as in (28). The directional co-verbs are ra [zaA1] ‘go out’, vào [vawA2] ‘enter’, lên [lenA1] ‘go up’, xuống [suoŋB1] ‘go down’, đi [ɗiA1] ‘go’, lại [lajB2] ‘come back’, về [veA2] ‘return’. (28) a. Trẻ em leo lên đồi cát đỏ. [ʧɛC2 ʔɛmA1 lɛw A1 len A1 ɗojA2 katD1 ɗɔC1] young younger.sibling climb go.up hill sand red ‘Kids climb up the red sand dunes.’ b. Cả cơ quan đi vào Sài Gòn chơi. [kaC1 kəA1.kwanA1 ɗiA1 vawA2 sajA2.ɣɔnA2 ʧəjA1] all office go enter Saigon play ‘The whole office goes on a trip to Saigon.’

However, serial verb constructions can also be composed of several main verbs with a temporal sequence (lexicalized compound verbs like nấu ăn [nɔ̆wB1 ɁănA1] (cook + eat) ‘to cook’ are not analyzed as serial verb constructions here). These structures are used to express a sequence of tightly related events, but also causality and purpose. When verbs are action verbs, they can either share complements or each have their own complements. In any case, objects must

vietnamese ( tiếng việt )

929

follow the first verb. In (29a), for instance, mua and đem share a single object, which must follow mua. In (29b), each object follows the verb that heads it. There are also complex sentences in which nouns phrases can be the object of one verb and the subject of another. In (29c), not only is tiền the object of đốt and the subject of xuống, but tổ tiên is one of the objects of cho (the other, tiền, is not repeated) and the subject of mua. (29) a. Dũng đi mua báo đem về cho ông. [zuŋ͡mC2 ɗiA1 muəA1 ɓawB1 ɗɛmA1 veA2 ʧɔA1 owŋ͡mA1] Dũng go buy newspaper bring return give grandfather ‘Dũng goes to buy the newspaper and bring it back for her grandfather.’ b. Dũng lấy báo An Ninh Thế Giới gói cá. [zuŋ͡mC2 lɛ̆jB1 ɓawB1 ɁanA1.nijŋA1 tʰeB1.zəjB1 ɣɔjB1 kaB1] Dũng take newspaper security world wrap fish ‘Dũng takes the newspaper “The World’s Security” to wrap fish.’ c. Mẹ Dũng đốt tiền xuống cho tổ tiên mua đồ. [mɛB2 zuŋ͡mC2 ɗotD1 tienA2 suoŋB1 ʧɔA1 toC1.tienA1 muəA1 ɗoA2] Mother Dũng burn money go.down give ancestor buy thing ‘Dũng’s mother burns money to the ancestors so that they can buy things.’

Finally, serial verb constructions can also be composed of an action verb and a stative verb. In such cases, the stative verb specifies the result of the action verb or the manner in which it is realized. This is illustrated by the sequences đánh chết and ngủ không sâu in (30a–b). (30) a. Nhà bác khổ vì sét đánh chết cả [ɲaA2 ɓakD1 xoC1 viA2 sɛtD1 ɗăjŋB1 ʧetD1 kaC1 House aunt miserable because lightning hit dead all

đàn trâu. ɗanA2 ʧɔ̆w A1] herd buffalo ‘My family is miserable because lightning killed the whole buffalo herd.’

b. Duy ngủ không sâu vì hàng xóm đang xây nhà. [zɥiA1 ŋuC1 xowŋ͡mA1 sɔ̆w A1 viA2 haŋA2.sɔmB1 ɗaŋA1 sɛ̆jA1 ɲaA2] Duy sleep neg deep because neighbor prog build house ‘Duy does not sleep well because the neighbors are building a house.’

930

brunelle

4.3 Sentence Structure Vietnamese root sentences can be composed of a subject noun phrase and a verb phrase (along with optional adjuncts). This is the case of sentences (23–30) above. However, contrary to what is depicted in most examples above, Vietnamese is heavily pro-drop. Subjects and objects can both be dropped if they can be inferred from the context, which is a way of encoding givenness. Another aspect in which previous examples are not representative is that most Vietnamese natural utterances follow a theme-rheme scheme (also topiccomment), where a sentence-initial theme, which can be either a noun phrase or a verb phrase, is modified by a rheme, which can be either a noun phrase, a verb phrase or a root sentence (Cao 1992). As shown in (31), the rheme is generally, but not obligatorily, introduced by the equative copula là [laA2] or by the copula thì [tʰiA2], which puts special focus on the theme. (31) a. Lái xe an toàn là [lajB1 sɛA1 anA1.twanA2 laA2 drive vehicle safe cop

hạnh phúc hăjŋB2.fuk͡pD1 happiness

của mọi kuəC1 mɔjB2 poss each

gia đình. zaA1.ɗijŋA2] family ‘Driving safe is the happiness of all families.’ b. Bún riêu (thì) ai cũng thích. [ɓunB1-ziewA1 tʰiA2 ɁajA1 kuŋ͡mC2 tʰijkB1] vermicelli-crab cop who also like ‘Everybody likes crab vermicelli soup (as opposed to other dishes)’.

Some sentences only consist of a rheme (Cao 1992), as in (32). (32) Mệt quá! [metD2 kwaB1] tired too.much ‘Too tired!’

While other sentences have multiple embedded themes and rhemes (Cao 1992), as shown in (33).

vietnamese ( tiếng việt )

931

(33) a. Bún riêu ai cũng thích thì em ăn [ɓunB1-ziewA1 ɁajA1 kuŋ͡mC2 tʰijkD1 tʰiA2 ɁɛmA1 ɁănA1 vermicelli-crab who also like cop younger.sibling eat

một bát nũa. motD2 ɓatD1 nɨəC2] one bowl again ‘Everybody likes crab vermicelli soup, so you have another bowl.’

While the importance of the theme-rheme configuration in Vietnamese is not controversial, trying to reduce all sentences to such a template is problematic. Such a solution would involve reinterpreting subject-verb sentences as themerheme sequences, even if they do not contain clause linkers. This has been the subject of considerable debate in Vietnamese linguistics. 4.4 Complex Sentences Structures 4.4.1 Sentence Types As shown by the non-exhaustive examples in (34), yes-no questions are derived by adding a final particle to the end of the sentence. Open questions are formed by replacing a constituent with an in situ wh-word. (34) Base sentence: Lan tập thể dục. [lanA1 tə̆ pD2 theB1.zuk͡pD2] Lan practice exercise ‘Lan works out.’

a. Không [xoŋ͡mA1] ‘not’: yes-no question Lan tập thể dục không? ‘Is Lan working out?’

b. Chưa [ʧɨəA1] ‘not yet’: yet-question Lan tập thể dục chưa? ‘Has Lan worked out yet?’

c. À [ʔaA2] ‘isn’t it’: confirmation question Lan tập thể dục à? ‘Lan is working out?’

Variation in intonation is also used for marking interrogatives, as well as imperatives and the emotional content of a sentence (summary in Brunelle et al. 2012). However, intonational strategies seem to be highly idiosyncratic and optional. Final particles are a much more reliable cue (§5.2.7).

932

brunelle

4.4.2 Superlatives and Comparatives Superlatives are formed by adding nhất [ɲə̆ tD1] ‘first’ to a stative verb. Inferiority superlatives can be formed by adding ít . . . nhất [ɁitD1 . . . ɲə̆ tD1] to the same stative verbs. Action verb phrases can also be turned into superlatives by adding nhiều nhất [ɲiewA2 ɲə̆ tD1] ‘the most’ / ít nhất [itD1 ɲə̆ tD1] ‘the least’. Examples are given in (35). (35) a. Thành phố lớn nhất Việt Nam là Sài Gòn. [tʰăjŋA2.foB1 lənB1 ɲə̆ tD1 vietD2.namA1 laA2 sajA2.ɣɔnA2] city large first Vietnam cop Saigon ‘The largest city in Vietnam is Saigon.’ b. Thành phố ít nóng nhất Việt Nam là Lào Cai. [tʰăjŋA2.foB1 ɁitD1 nɔwŋ͡mB1 ɲə̆ tD1 vietD2.namA1 laA2 lawA2.kajA1] city little hot first Vietnam cop Lào Cai ‘The coolest city in Vietnam is Lao Cai.’ c. Thành phố có hồ nhiều / ít nhất [tʰăjŋA2.foB1 kɔB1 hoA2 ɲiewA2 / ɁitD1 ɲə̆ tD1 city exist lake lot / little first ‘The city that has the most / fewest lakes is Hanoi.’

là Hà Nội. laA2 haA2.nojB2] cop Hanoi

d. Thành phố có nhiều / ít hồ nhất là Hà Nội. [tʰăjŋA2.foB1 kɔB1 ɲiewA2 / ɁitD1 hoA2 ɲə̆ tD1 laA2 haA2.nojB2] city exist lot / little lake first cop Hanoi ‘The city that has the most / fewest lakes is Hanoi.’

Comparatives are formed by adding hơn [hənA1] / ít hơn [ɁitB1 hənA1] after a stative verb. As with superlatives, the only way to form a comparative with an action verb phrase is to first modify it with the stative verbs nhiều or ít. This is shown in (36). (36) a. Sài Gòn đông hơn Hà Nội / ít đông hơn [sajA2.ɣɔnA2 ɗoŋ͡mA1 hənA1 haA2.nojB2 / ɁitD1 ɗoŋ͡mA1 hənA1 Saigon crowded more Hanoi few crowded more Hà Nội. haA2.nojB2] Hanoi ‘Saigon is more/less crowded than Hanoi.’

vietnamese ( tiếng việt )

933

b. Hà Nội có nhiều / ít hồ hơn Sài Gòn. [haA2.nojB2 kɔB1 ɲiewA2 / ɁitD1 hob2 hənA1 sajA2.ɣɔnA2] Hanoi exist lot / few lake more Saigon ‘Hanoi has more/fewer lakes than Saigon.’ c. Hà Nội có hồ nhiều / ít hơn Sài Gòn. [haA2.nojB2 kɔB1 hoB2 ɲiewA2 / ɁitD1 hənA1 sajA2.ɣɔnA2] Hanoi exist lake lot / few more Saigon ‘Hanoi has more/fewer lakes than Saigon.’

4.4.3 Relatives and Subordinate Clauses As mentioned in §4.1, relative clauses can be headed by the complementizer mà [maA2] or just be juxtaposed to the noun phrase they modify, as in (37). The modified noun phrase is gapped in the relative, as in most Western languages. (37) a. Nhà (mà) Ø bán giòchả ở đầu [ɲaA2 maA2 ɓanB1 zɔA2-ʧaC1 ɁəC1 ɗɔ̆wA2 house comp sell boiled.sausage-fried.sausage reside head đường giàu. ɗɨəŋA2 zăwA2] street rich ‘The family that sells sausage at the beginning of the street is rich.’ b. Giò chả (mà) họ bán Ø hơi béo. [zɔA2-ʧaC1 maA2 hɔD2 ɓanB1 həjA1 ɓɛwB1] boiled.sausage-fried.sausage comp they sell slightly fat ‘The sausage that they sell is a bit fat.’

Subordinate clauses introduced by verbs are normally headed by the equative là [laA2] which acts as a complementizer (38a). The only exceptions to this are verbs that can also take the quotative complementizer rằng [zăŋA2], like nói [nɔjB1] ‘to say’, bảo [ɓawC1] ‘to tell’, kể [keC1] ‘to narrate’, biết [ɓietD1] ‘to know’, nghĩ [ŋiC2] ‘to think’ and tưởng [tɨəŋC1] ‘to think wrongly’. This is shown in (38b). (38) a. Nhiều người Hà Nội gốc thấy là mình tinh tế [ɲiewA2 ŋɨəjA2 haA2.nojB2 ɣowk͡pD1 tʰɛ̆jB1 laA2 mijŋA2 tijŋA2.teB1 many clf Hanoi root see cop oneself sophisticated

934

brunelle

hơn người khác. hənA1 ŋɨəjA2 xakD1] more clf different ‘Many old-time Hanoians think that they are more sophisticated than other people.’ b. Người Hà Tây lại nghĩ rằng người Hà Nội gốc [ŋɨəjA2 haA2.tɛ̆jA1 lajB2 ŋiC2 zăŋA2 ŋɨəjA2 haA2.nojB2 ɣowk͡pD1 clf Hà.Tây again think comp clf Hanoi root

hơi kiêu ngạo. həjA1 kiewA1.ŋawB2] slightly arrogant ‘As for Hà Tây people, they think that old-time Hanoians are a bit arrogant.’

4.4.4 Passives There are a few passive-like constructions in Vietnamese. The first two, là and do, have an obligatory agent (39a–b). Once again, là is behaving as a default copula that links a theme and a rheme. Do is similar but puts emphasis on the agent (là do is also possible, with the same meaning). Được and bị, on the other hand, are used for describing actions in which the patient undergoes negative and positive experiences, respectively (39c–d). Contrary to do and là, they do not require an agent. (39) a. Quyển sách này là *(Vũ Trọng Phụng) viết ở [kɥienC1 săjkD1 năjA2 laA2 vuC2 ʧɔwŋ͡mB2 fuŋ͡mB2 vietD1 ɁəC1 clf book prox cop Vũ Trọng Phụng write reside Hà Nội. haA2.nojB2] Hanoi ‘This book was written by Vũ Trọng Phụng in Hanoi.’ b. Quyển sách này do *(Vũ Trọng Phụng) viết ở [kɥienC1 săjkD1 năjA2 zɔA1 vuC2 ʧɔwŋ͡mB2 fuŋ͡mB2 vietD1 ɁəC1 clf book prox pass Vũ Trọng Phụng write reside Hà Nội. haA2.nojB2] Hanoi ‘This book was written by Vũ Trọng Phụng in Hanoi.’

vietnamese ( tiếng việt )

935

c. Quyển sách này được (Vũ Trọng Phụng) viết [kɥienC1 săjkD1 năjA2 ɗɨəkD2 vuC2 ʧɔwŋ͡mB2 fuŋ͡mB2 vietD1 clf book prox pass Vũ Trọng Phụng write ở Hà Nội. ɁəC1 haA2.nojB2] reside Hanoi ‘This book was written by Vũ Trọng Phụng in Hanoi.’ d. Quyển [kɥienC1 clf

sách này bị (Vũ Trọng Phụng) đốt ở săjkD1 năjA2 ɓiB2 vuC2 ʧɔwŋ͡mB2 fuŋ͡mB2 ɗotD1 ɁəC1 book prox pass Vũ Trọng Phụng write reside

Hà Nội. haA2.nojB2] Hanoi ‘This book was burned by Vũ Trọng Phụng in Hanoi.’

Another difference between là/do and được/bị is that the latter can head clauses with a subject gap (40) or even precede nouns or stative verbs (41). This wide range of complements makes được and bị somewhat untypical for passive constructions (Simpson & Hồ 2013). (40) a. Lan được chọn vào Đảng. [lanA1 ɗɨəkD2 ʧɔnB2 vawA2 ɗaŋC1] Lan pass choose enter party ‘Lan was chosen to enter the Party.’ b. Lan bị bắt vào Đảng [lanA1 ɓiB2 ɓătD1 vawA2 ɗaŋC1] Lan pass catch enter party ‘Lan got forced to enter the Party.’ (41) a. Tuấn được bằng khen / vui. [twənB1 ɗɨəkD2 ɓăŋA2 xɛnA1 / vujA1] Tuấn pass certificate praise happy ‘Tuấn receives a certificate of praise / Tuấn is happy.’ b. Tuấn bị tai nạn / ướt. [twənB1 ɓiB2 tajA1.nanB2 / ɁɨətD1] Tuấn pass accident wet] ‘Tuấn has an accident / Tuấn gets wet.’

936 5

brunelle

Word Classes

5.1 Lexical Words Vietnamese distinguishes three types of lexical words: nouns, action verbs and stative verbs (adjectives). 5.1.1 Nouns Nouns refer to physical entities or abstract concepts. They can be modified by demonstratives. Vietnamese nouns can be divided into count nouns and mass nouns; mass nouns are far more common than count nouns (Cao 2003 [1980]).3 The class of count nouns includes time expressions (ngày [ŋăjA2] ‘day’, năm [nămA1] ‘year’, etc.) and a number of nouns that are difficult to attribute to well-defined semantic categories (tỉnh [tijŋC1] ‘province’, màu [măwA2] ‘color’ , bài [ɓajA2] ‘text’, etc.). All other nouns need to be individualized by a classifier before being countable (§5.2.2). Interestingly, the use of a classifier is optional with kinship terms (bác [ɓakD1] ‘uncle’, cô [koA1] ‘aunt’, cháu [ʧăwB1] ‘nephew, grandchild’, etc.), making them difficult to classify as count or mass nouns. 5.1.2 Verbs Verbs can be divided into two categories: action verbs and stative verbs. Action verbs denote actions while stative verbs denote states, like adjectives in Western languages. The decision to lump the latter category with verbs derives from their verb-like properties. Stative verbs can appear as predicates in the same way as action verbs, as shown in (42). (42) a. Phương ăn. [fɨəŋA1 ɁănA1] Phương eat ‘Phương eats’. b. Phương cao. [fɨəŋA1 kawA1] Phương tall ‘Phương is tall’.

3 Alternatively, Nguyễn (2013) considers that most Vietnamese nouns are ‘mass-like’ or ‘non-individuated’.

vietnamese ( tiếng việt )

937

However, the two categories are syntactically distinct in that some adverbs can only modify one of the two categories and in that imperative markers can only modify action verbs, as illustrated in (43). (43) a. Phương ngủ nhiều. b. *Phương rất ngủ. c. Phương ngủ đi! [fɨəŋA1 ŋuC1 ɲiewA2] [fɨəŋA1 zə̆ tD1 ŋuC1] [fɨəŋA1 ŋuC1 ɗiA1] Phương sleep much Phương very sleep Phương sleep imp ‘Phương sleeps a lot.’ ‘Phương very sleeps.’ ‘Sleep, Phương!’ d. *Phương cao nhiều e. Phương rất cao f. *Phương cao đi! [fɨəŋA1 kawA1 ɲiewA2] [fɨəŋA1 zə̆ tD1 kawA1] [fɨəŋA1 kawA1 ɗiA1] Phương tall much Phương very tall Phương tall imp ‘Phương is tall a lot.’ ‘Phương is very tall’ ‘Be tall, Phương!’

Stative verbs can also modify verbs, thus having adverb-like behavior, as in (44b). In fact, Vietnamese does not have an independent class of lexical adverbs. (44) a. Phương vui- vẻ. b. Phương đi chơi vui- vẻ. [fɨəŋA1 vujA- vɛC1] [fɨəŋA1 ɗiA1 ʧəjA1 vujA1- vɛC1] Phương happy- red Phương go play happy- red ‘Phương is very happy.’ ‘Phương goes out happily.’ Vietnamese also has two verb-like copulas, có [kɔB1], an existential or possessive copula, and là [laA2], an equative. Có can either mean ‘there is’ or ‘to have’, as shown in (45). It behaves just like a verb in every respect. Là has a more complex behavior. Although it must be used as an equative copula between a subject and a nominal predicate (46a), it can also be used as a non-verbal connector linking a theme to a rheme (see §4.3 and §5.2.6) and does not behave as a regular verb with respect to verbal modifiers (contrast 46b with 46c). (45) a. Có một con chuột qua đường. [kɔB1 motD2 kɔnA1 ʧuotD2 kwaA1 ɗɨəŋA2] cop one clf rat cross street ‘There is a rat crossing the street.’ b. Thảo có một cái xe máy mới. [tʰawC1 kɔB1 motD2 kajB1 sɛA1 măjB1 məjB1] Thảo cop one clf vehicle machine new ‘Thảo has a new moped.’

938

brunelle

(46) a. Thảo là cánbộ của khoa. [tʰawC1 laA2 kanB1.ɓoB2 kuəC1 xwaA1] Thảo cop cadre poss faculty ‘Thảo is an employee of the faculty.’ b. Thảo không phải là cán bộ của khoa. [tʰawC1 xoŋ͡mA1 fajC1 laA2 kanB1.ɓoB2 kuəC1 xwaA1] Thảo neg ought cop cadre poss faculty ‘Thảo is not an employee of the faculty.’ c. Thảo không làm ở khoa. [tʰawC1 xoŋ͡mA1 lamA2 ɁəC1 xwaA1] Thảo neg work reside faculty ‘Thảo does not work at the faculty.’

5.2 Function Words 5.2.1 Pronouns There is a relatively limited set of personal pronouns. The most common are given in (47). (47) Common pronouns tôi [tojA1] ‘neutral 1SG’ ta [taA1] ‘1SG or 1PL inclusive, familiar/3SG impersonal’ mình [mijŋA2] ‘1SG or 1PL inclusive, familiar’ tao [tawA1] ‘1SG used with social equals or inferiors (or as an insult)’ mày [măjA2] ‘2SG used with social equals or inferiors (or as an insult)’ nó [nɔB1] ‘3SG or 3PL for inanimate objects, animals or social inferiors’ chúng [ʧuŋ͡mB1] ‘3PL for inanimate objects, animals or social inferiors’ chúng tôi [ʧuŋ͡mB1 tojA1] ‘1PL exclusive’ chúng ta [ʧuŋ͡mB1 taA1] ‘1PL inclusive’ họ [hɔB2] ‘3PL used for people’

However, Vietnamese normally uses non-shifting kinship terms instead of pronouns. For example, the normal way of addressing an elderly woman for a young speaker would be to use bà [ɓaA2] ‘grandmother’, while referring to oneself as cháu [ʧăwB1] ‘grandchild’. As such, the normal way of expressing the third person is not pronominal but consists in using a kinship term with a demonstrative: ông ấy [oŋ͡mA1 Ɂɛ̆jB1] means ‘that grandfather (he)’, chị này [ʧiB2 năjA2] means ‘this older sister (she)’. Kinship terms can be combined with the

vietnamese ( tiếng việt )

939

demonstratives used for any of three degrees of distance. Proximal demonstratives are này/ đây [năjA2/ɗăjA1], medial are đó/đấy/ấy [ɗɔB1/ɗə̆ jB1/Ɂə̆ jB1] and distal are kia [kiəA1] and nọ [nɔB2]. There are also three pronouns that can be used both as WH-words and as indefinite pronouns: ai [ɁajA1] ‘who?; anybody, everybody’, đâu [ɗɔ̆wA1] ‘where?; anywhere, everywhere’, gì [ziA2] ‘what?; anything, everything’ (Trần & Bruening 2013). 5.2.2 Classifiers Vietnamese has a small set of sortal classifiers, like cái [kajB1] (inanimate objects), con [kɔnA1] (animal, some objects), chiếc [ʧiekD1] (vehicles and furniture), niềm [niemA2] (positive emotions), but regular nouns are also frequently used as classifiers, like cuốn [kuonB1] ‘roll’, cây [kɛ̆jA1] ‘tree’, quả [kwaC1] ‘sphere, fruit’. The function of classifiers is to individualize the noun they precede. In (48a), an unspecified mass of chicken is referred to. In (48b), specific chickens are being killed. (48) a. Uyên mổ gà trong lavabô. [ɥienA1 moC1 ɣaA2 ʧɔŋ͡mA1 laA1.vaA1.boA1] Uyên cut.open chicken inside sink ‘Uyên kills (a) chicken(s) in the sink.’ b. Uyên mổ con gà trong lavabô. [ɥienA1 moC1 kɔnA1 ɣaA2 ʧɔŋ͡mA1 laA1.vaA1.boA1] Uyên cut.open clf chicken inside sink ‘Uyên kills the chicken(s) in the sink.’

Classifiers are normally used between nouns and numerals or pluralizers (although they can optionally be dropped in lists or repetitive tasks). This is exemplified in (49). (49) a. Uyên mổ ba con gà trong lavabô. [ɥienA1 moC1 ɓaA1 kɔnA1 ɣaA2 ʧɔŋ͡mA1 laA1.vaA1.boA1] Uyên cut.open three clf chicken inside sink ‘Uyên kills three chickens in the sink.’ b. Uyên mổ các con gà trong lavabô. [ɥienA1 mo C1 kakD1 kɔnA1 ɣaA2 ʧɔŋ͡mA1 laA1.vaA1.boA1] Uyên cut.open PL clf chicken inside sink ‘Uyên kills the chickens in the sink.’

940

brunelle

Whenever the context is clear, classifiers can substitute with the nouns they refer to, as in (50). (50) Nhà Uyên có hai con trâu: môt con đực [ɲaA2 ɥienA1 kɔB1 hajA1 kɔnA1 ʧɔ̆w A1 motD2 kɔnA1 ɗɨkD2 house Uyên cop two clf buffalo one clf male và một con cái. vaA2 motD2 kɔnA1 kajB1] and one clf female ‘Uyên’s family has two buffaloes: one male and one female.’ Con đực ngu như bò. [kɔnA1 ɗɨkD2 ŋuA1 ɲɨA1 ɓɔA2] clf male stupid like cow ‘The male one is as stupid as a cow’.

5.2.3 Numerals, Quantifiers and Intensifiers The Vietnamese numeral system shows no traces of non-decimal bases. Basic numbers are given in table (51). Numbers above ten are compositional. Thus 13 is mười ba [mɨəjA2 ɓaA1] ‘ten three’ and 286 is hai trăm tám mươi sáu [hajA1 ʧămA1 tamB1 mɨəjA1 săwB1] ‘two hundred eight ten six’. Complex numbers are subject to minor tone and onset changes. For instance, mười ‘ten’ changes to mươi in tens and the number 5 goes from năm [nămA1] to lăm [lămA1] after tens. Besides regular Mon-Khmer numbers, there is a Sino-Vietnamese parallel set that is used in Sino-Vietnamese compounds (and in some games). (51) Basic numerals one một motD2 two hai hajA1 three ba ɓaA1 four bốn ɓonB1 five năm nămA1 six sáu săwB1 seven bảy ɓăjC1 eight tám tamB1 nine chín ʧinB1 ten mười mɨəjA2 twenty hai mươi hajA2 mɨəjA1

vietnamese ( tiếng việt ) hundred thousand million billion

941

trăm ʧămA1 nghìn (N), ngàn (S) ŋinA2, ŋanA2 triệu ʧiewB2 tỷ tiC1

Plural is only optionally marked in Vietnamese. They are two nominal pluralizers: các [kakD1] and những [ɲɨŋC2]. Các refers to an entire set of elements, while những [ɲɨŋC2] refers to a subset of a larger set. While count nouns can be immediately preceded by these two pluralizers, pluralized mass nouns need to be individualized by classifiers. Pronouns and kinship terms can also be pluralized by adding các [kakD1], chúng [ʧuŋ͡mA1], bọn [ɓɔnB2] or a numeral before them. Thus, các anh [kakD1 ɁăjŋA1] or bọn anh [ɓɔnB2 ɁăjŋA1], mean ‘you (older brothers)’. Since an exhaustive description of quantifiers is impossible in this short sketch, universal quantifiers will be used as an illustration. When used between a verb and a subject, indefinite pronoun or indefinite expression (marked with nào [nawA2] ‘which’), cũng [kuŋ͡mC2] ‘also’ takes on the meaning ‘all’ (52a–b). Another option is to use the universal quantifier tất cả [tə̆ tD1 kaC1] before the subject (52c). To express ‘all’ in other positions, one needs to use tất cả (52d). (52) a. Ai cũng muốn đi đón Tuấn ở sân bay. [ɁajA1 kuŋ͡mC2 muonB1 ɗiA1 ɗɔnB1 twə̆ nB1 ɁəC1 sə̆ nA1 ɓăjA1] who also want go pick.up Tuấn reside yard fly ‘Everybody wants to go pick up Tuấn at the airport.’ b. Bác nào cũng muốn đi đón Tuấn ở [ɓakD1 nawA2 kuŋ͡mC2 muonB1 ɗiA1 ɗɔnB1 twə̆ nB1 ɁəC1 older.uncle.aunt which also want go pick.up Tuấn reside sân bay. sə̆ nA1 ɓăjA1] yard fly ‘All the uncles and aunts want to go pick up Tuấn at the airport.’ c. Tất cả họ hàng muốn đi đón Tuấn ở sân bay. [tə̆ tB1.kaC1 hɔB2.haŋA2 muonB1 ɗiA1 ɗɔnB1 twə̆ nB1 ɁəC1 sə̆ nA1 ɓăjA1] every relatives want go pick.up Tuấn reside yard fly ‘All the relatives want to go pick up Tuấn at the airport.’

942

brunelle

d. Tuấn phải đi đón tất cả họ hàng ở sân bay. [twə̆ nB1 fajC1 ɗiA1 ɗɔnB1 tə̆ tB1.kaC1 hɔB2.haŋA2 ɁəC1 sə̆ nA1 ɓăjA1] Tuấn ought go pick.up every relatives reside yard fly ‘Tuấn must go pick up all the relatives at the airport.’

‘Every’, with a focus on a whole group, is mọi [mɔjB2], whereas ‘each’, with a focus on individual elements of the group is mỗi [mojC2]. Từng [tɨŋA2] is used to express the idea of ‘each, in turn’. These are illustrated with an example modified from Thompson (1965) in (53). (53) a. Công an đi đến mọi nhà để điều tra. [kowŋ͡mA1.ɁanA1 ɗiA1 ɗenB1 mɔjB2 ɲaA2 ɗeC1 ɗiewA2.ʧaA1] police go arrive every house in.order.to investigate ‘The police goes to every house to investigate.’ b. Công an đi đến mỗi nhà để điều tra. [kowŋ͡mA1.ɁanA1 ɗiA1 ɗenB1 mojC2 ɲaA2 ɗeC1 ɗiewA2.ʧaA1] police go arrive each house in.order.to investigate ‘The police goes to each house to investigate.’ c. Công an [kowŋ͡mA1.ɁanA1 police

đi đến từng nhà để ɗiA1 ɗenB1 tɨŋA2 ɲaA2 ɗeC1 go arrive each.in.turn house in.order.to

điều tra. ɗiewA2.ʧaA1] investigate ‘The police goes to each house in turn to investigate.’

As shown in (54), intensifiers include nhiều [ɲiewA2] ‘much, a lot’ and ít [ɁitD1] ‘few, little’, which mostly behave like stative verbs in that they can modify both nouns and action verbs (ít also means ‘rarely’, in which case it is preverbal). However, contrary to other stative verbs, they precede the noun they modify when they are not used as the main predicate (54c) (54) a. Tuấn ăn nhiều / ít. [twə̆ nB1 ɁănA1 ɲiewA2 / ɁitD1] Tuấn eat much / little ‘Tuấn eats a lot / little.’

943

vietnamese ( tiếng việt ) b. Quần áo của Tuấn [kwə̆ nA2 ɁawB1 kuəC1 twə̆ nB1 pant shirt poss Tuấn ‘Tuấn has many / few clothes.’

nhiều/ ít. ɲiewA2 / ɁitD1] much / little

c. Tuấn có nhiều / ít bạn đi du học [twə̆ nB1 kɔB1 ɲiewA2 / ɁitD1 ɓanB2 ɗiA1 zuA1.hɔwk͡pD2] Tuấn exist much / little friend go study.abroad ‘Tuấn has many / few friends who went to study abroad.’

The intensifier rất [zə̆ tD1] ‘very’ modifies (and precedes) either stative or action verbs, as in (55). Hơi [həjA1] ‘little’, follows the same syntactic placement as rất. The fact that rất and hơi can modify ít and nhiều, as in (55c) is additional evidence that the latter are actually stative verbs. (55) a. Tuấn rất thích đi nhậu. [twə̆ nB1 zə̆ tD1 tʰijkD1 ɗiA1 ɲɔ̆wB2] Tuấn very like go drink.alcohol.and.eat.light.food ‘Tuấn likes going out for drinks very much.’ b. Đứa bạn rất vui tính của Tuấn cũng [ɗɨəB1 ɓanB2 zə̆ tD1 vujA1 tijŋB1 kuəC1 twə̆ nB1 kuŋ͡mC2 clf friend very happy character poss Tuấn also

đi nhậu. ɗiA1 ɲɔ̆w B2] go drink.alcohol.and.eat ‘Tuấn’s very fun friend also goes out for drinks.’

c. Tuấn nhậu rất nhiều / ít! [twə̆ nB1 ɲɔ̆wB2 zə̆ tD1 ɲiewA2 / ɁitD1] Tuấn drink.alcohol very much / little ‘Tuấn drinks very little / much.’

On the other hand, lắm [lămB1] ‘very’ and quá [kwaB1] ‘too much, extremely’ are always phrase-final and modify the entire predicate, as in (56a). As shown is (56b), they cannot have scope over a stative verb only, contrary to rất.

944

brunelle

(56) a. Tuấn thích đi chơi [twə̆ nB1 tʰijkD1 ɗiA1 ʧəjA1 Tuấn like go play ‘Tuấn likes to go out a lot!’

lắm / quá! lămB1 / kwaB1] very / too.much

b. *Đứa bạn cao lắm / quá của Tuấn [ɗɨəB1 ɓanB2 kawA1 lămB1 / kwaB1 kuəC1 twə̆ nB1 clf friend tall very / too.much poss Tuấn

là người laA2 ŋɨəjA2 cop clf

Hơ Mông. həA1.mowŋ͡ɱA1] Hmong ‘Tuấn’s very tall friend is Hmong.’

5.2.4 Tense, Aspect and Mood Markers Tense, aspect and mood marking is optional in Vietnamese and is not marked through affixation. However, there are a number of verbal particles that encode tense, aspect and mood (Đỗ-Hurinville 2009b). A subset is given in (57). TAM markers can either immediately precede the verb (§4.2.1) or stand at the end of the sentence (§5.2.7). Note that there is significant debate about the exact semantics of many of these TAM markers, and that proposed translations are indicative only. Overall, (57a–e) seem to convey aspectual distinctions, while (57f–h) also convey some notion of tense. Examples (57i–l) mark moods. As mentioned in §4.2.1, combinations of TAM markers are possible, although subject to semantic restrictions. (57)

Base sentence: Lan về quê. [lanA1 veA2 kweA1] Lan go.back hometown ‘Lan goes back to her hometown’.

a. Đang [ɗaŋA1] (preverbal): progressive Lan đang về quê ‘Lan is going back to her hometown.’ b. Rồi [zojA2] (clause-final): accomplished, with currently relevant state Lan về quê rồi ‘Lan is back in her hometown.’ c. Đã [ɗaC2] (preverbal): perfective, accomplished Lan đã về quê (rồi) ‘Lan went back to her hometown.’

vietnamese ( tiếng việt )

945

d. Đã [ɗaC2] (clause-final): the action must be completed before the realization of another action Lan về quê đã ‘Lan goes back to her hometown first.’ e. Vẫn [və̆ nC2] (preverbal): continuative Lan vẫn về quê ‘Lan still goes back to her hometown.’ f. Vừa [vɨəA2], mới [məjB1] (preverbal): recent past Lan vừa/mới về quê ‘Lan just went back to her hometown.’ g. Sắp [săpD1] (preverbal): immediate future Lan sắp về quê ‘Lan is on the verge going back to her hometown.’ h. Sẽ [sɛC2] (preverbal): incertain future/conditional Lan sẽ về quê ‘Lan will/would go back to her hometown.’ i. Hãy [hăjC2] (preverbal): mild imperative Lan hãy về quê ‘Lan should go back to her hometown.’ j. Đi [ɗiA1] (clause-final): imperative Lan về quê đi! ‘Lan, go back to your hometown!’ k. Đừng [ɗɨŋA2] (preverbal): prohibitive Lan đừng về quê! ‘Lan, don’t go back to your hometown!’ l. Được [ɗɨəkD2] (clause-final): ability/permission/success (Enfield 2001) Lan về quê được. ‘Lan can go back to her hometown. / Lan is allowed to go back to her hometown. / Lan is able to go back to her hometown.’

Many of these final particles can also occupy other syntactic positions, a multifunctionality that has been analyzed as a consequence of syntactic movement (Duffield 2013). Besides these verbal particles, a number of verbs frequently cooccur as co-verbs in lexicalized serial verb constructions. Many of these verbs have taken on a lexicalized meaning, like còn [kɔnA2] ‘to continue to V’, thử [tʰɨC1] ‘to try to V’, xem [sɛmA1] ‘to V and see’. They are treated in more detail in §4.2.

946

brunelle

5.2.5 Prepositions Vietnamese has prepositions, but no postpositions. Prepositions listed in Nguyễn (1997) include của [kuəC1] ‘of (possessive)’, bằng [ɓăŋA2] ‘by means of, made of’, với [vəjB1] ‘with’, tại [tajB2] ‘at’, vì [viA2] / tại vì [tajB2 viA2] / bởi [ɓəjC1] / bởi vì [ɓəjC1 viA2] ‘because, in favor of’, từ [tɨA2] ‘since, from’, do [zɔA1] ‘by, because of’. However, a number of prepositions also listed by Nguyễn (1997) are actually verbs, and as such, can be treated as co-verbs in serial verbs constructions (§4.2). The most frequent such verbs are ở [ɁəC1] ‘to stay, to be at’, đến [ɗenB1] / tới [təjB1] ‘to arrive’, cho [ʧɔA1] ‘to give’. Examples are given in (58a–c). Some other verbs, like về [veA2] ‘to go back’ and để [ɗeC1] ‘to put’, seem to be more grammaticalized in that their prepositional meaning is fairly different from their main verbal meaning (58d–e). A discussion of the grammaticalization processes involved in formation of prepositions can be found in Đỗ-Hurinville (2010). (58) a. Lan bị tai nạn xe ở đâu? [lanA1 biB2 tajA1.nanB2 sɛA1 ɁəC1 ɗɔ̆wA1] Lan neg.exp accident vehicle reside where ‘Where did Lan have an accident?’ b. Lan ngủ trưa từ một giờ đến ba giờ. [lanA1 ŋuC1 ʧɨəA1 tɨA2 motD2 zəA2 ɗenB1 ɓaA1 zəA2] Lan sleep noon from one hour arrive three hour ‘Lan naps from one to three.’ c. Lan nấu rau muống cho chồng nó. [lanA1 nɔ̆w B1 zăwA1.muoŋB1 ʧɔA1 ʧoŋ͡mA2 nɔB1] Lan cook morning.glories give husband 3 ‘Lan cooks morning glories for her husband.’ d. Lan kể chuyện về hàng xóm. [lanA1 keC1 ʧwienB2 veA2 haŋA2.somB1] Lan tell story go.back neighbor ‘Lan is telling stories about the neighbors.’ e. Chúng nó ăn tiền để nuôi con. [tʃuŋ͡mB1 nɔB1 ɁănA1 tienA2 ɗeC1 nuojA1 kɔnA1] pl 3p eat money put feed child ‘They take bribes to feed their children.’

vietnamese ( tiếng việt )

947

There is also a class of preposition-like spatial elements (trong [ʧɔwŋ͡mA1] ‘inside’, ngoài [ŋwajA2] ‘outside’, dưới [zɨəjB1] ‘bottom’, trên [ʧenA1] ‘top’, giữa [zɨəC2] ‘middle’, etc.) that are sometimes analyzed as a special sub-class of nouns (Đỗ-Hurinville 2010; Nguyễn 1997; Thompson 1965). They are optional in the main predicate and emphasize the physical location of the action, as in (59a). Otherwise, they function as prepositions and are obligatory, as in (59b–c). In the latter examples, the presence of the verb lên ‘to go up’ before trên ‘top’ would entail a second event. (59) a. Kiên lên (trên) gác. [kienA1 lenA1 ʧenA1 ɣakD1] Kiên go.up top rooftop ‘Kiên goes up to the rooftop.’ b. Kiên phơi áo trên gác. [kienA1 fəjA1 ɁawB1 ʧenA1 ɣakD1] Kiên dry shirt top rooftop ‘Kiên dries shirts on the rooftop.’ c. Trên gác Kiên phơi áo. [ʧenA1 ɣakD1 kienA1 fəjA1 ɁawB1] top rooftop Kiên dry shirt ‘On the rooftop, Kiên dries shirts.’

5.2.6 Conjunctions, Linkers and Discourse Markers Vietnamese has conjunctions (Paris & Lê 2013) like và [vaA2] ‘and’, hay [hăjA1] ‘or’, hoặc [hwăkD2] ‘either’, nhưng [ɲɨŋA1] / mà [maA2] ‘but’ and markers of clausal subordination like nếu [newB1], ‘if’, dù [zuA2] / tuy [tɥiA1] ‘although’, cho nên [ʧɔA1 nenA1] / nên [nenA1] ‘for this reason’. It also has two linkers which are used to concatenate themes and rhemes (§4.3): là [laA2], the default equative copula and thì [tʰiA2], an attributive copula, which have been claimed to respectively background and focus the theme (Clark 1996; Đỗ-Hurinville 2009a). Besides these, there are also discourse markers such as backchannel utterances and repair initiation markers (Hà 2012). 5.2.7 Final Particles There is a rich array of clause final particles that are used to express emotional nuances. A small set of examples is given in (60). Note that many of the sentence type markers (§4.4.1) and of the tense-aspect-mood markers (§5.2.4)

948

brunelle

could be analyzed as final particles, but they are not repeated here. Some of the intensifiers in §5.2.3 (quá, lắm) could also arguably be treated as final particles. (60) Base sentence: Anh ăn cơm [ɁăjŋA1 ɁănA1 kəmA1] older.brother eat cooked.rice ‘You eat a meal.’ a. Nhé [ɲɛB1]: invitation Anh ăn cơm nhé! ‘Go ahead and eat your meal.’ b. Thế [theB1]: ‘thus’, interrogative reinforcement (Southern Vietnamese vậy) Sao anh ăn cơm thế? ‘Why do you eat your meal like that?’ c. Ạ [ɁaB2]: politeness, deference (Northern Vietnamese) Anh ăn cơm ạ. ‘You eat your meal.’ d. Chứ [tʃɨB1]: of course Anh ăn cơm chứ! ‘You eat your meal, of course!’

6

Semantics and Pragmatics

6.1 Specialized Vocabulary An exhaustive description of Vietnamese specialized lexicon is obviously out of the question here, but two areas of the vocabulary have been chosen as illustrations. The first one is the lexicon related to rice, in (61). (61) ruộng [zuoŋB2] mạ [maB2] rơm [zəmA1] trấu [ʧɔ̆w B1] cám [kamB1] tẻ [tɛC1] cốm [komB1] cơm [kəmA1] bún [ɓunB1]

‘irrigated rice field’ ‘rice seedling’ ‘rice straw’ ‘rice husk’ ‘rice bran’ ‘normal rice’ ‘young sticky rice’ ‘cooked rice’ ‘rice vermicelli’

rẫy lúa thóc lứt gạo nếp xôi cháo

[zɛ̆jC2] [luəB1] [thɔwk͡pD1] [lɨtD1] [ɣawB2] [nepD1] [sojA1] [ʧawB1]

‘dry (rice) field’ ‘rice plant’ ‘unhusked rice’ ‘whole rice’ ‘husked rice’ ‘sticky rice’ ‘steamed sticky rice’ ‘rice porridge’

vietnamese ( tiếng việt )

949

Another representative part of the specialized lexicon is communist formulaic terms that have been bleached of their political connotations and are now used in everyday life (61). (62) Original meaning cơ quan [kəA1-kwanA1] ‘administrative organ’ cán bộ [kanB1-ɓoB2] ‘cadre’ báo cáo [ɓawB1-kawB1] ‘to report’ nhất trí [ɲə̆ tD1-ʧiB1] ‘to be unanimous’ thành tích [tʰăjŋA2-tijkD1] ‘achievement’ phong trào [fɔwŋ͡mA1-ʧawA2] ‘movement’

Bleached meaning ‘office’ ‘employee’ ‘to state, to declare’ ‘to agree’ ‘fake achievements’ ‘fashion, fad’

6.2 Language Use As the national language of Vietnam, Vietnamese is used in all spheres of life, both in speech and writing, by the overwhelming majority of the population (UNESCO estimates literacy at 92.8%). A large majority of Vietnamese speakers are monolingual, but a number of foreign languages are also spoken. English is the second language taught in schools in most provinces; there does not seem to be a serious assessment of the actual number of its speakers, but their level of fluency is usually extremely low. French has lost a lot of ground since the colonial period, but is still spoken by 623,000 people, generally fairly fluently (OIF 2010). Languages of the Soviet bloc (Russian, German, Czech, Hungarian . . .) are still spoken to various degrees by former guest workers and exchange students. Interestingly, Korean and Japanese have become increasingly popular foreign languages in recent years, because of growing Northeast Asian economic and cultural influence. Chinese is probably spoken by close to one million people, but it is more than a foreign language: while there is an important number of second language speakers, due to longstanding Chinese cultural and political influence, a large proportion of the 823,000 ethnic Chinese (2009 Vietnamese census), who mostly live in Hồ Chí Minh City, still speak southern Chinese varieties (Cantonese, Hokkien, Hainanese) as their first language. There is a significant number of ethnic minority people for which Vietnamese is a second language (10–15% of the population). For them, Vietnamese is the language of education, the media and bureaucracy. Some ethnic minority languages are losing ground to Vietnamese, like the Tai-speaking Tày and Nùng of Northern Vietnam. There is a strong Vietnamese influence on many minority languages, especially on their lexicon.

950

brunelle

6.3 Focus Marking A final issue that has not yet been satisfactorily explored, but deserves mentioning, is information structure in Vietnamese. The order of presentation of focused and backgrounded information not only underlies the syntactic organization of theme-rheme alternations (§4.3) and the ellipsis of arguments (§4.4.2), but is also crucial in determining the use of classifiers (§5.2.2), quantifiers (§5.2.3) and focus particles (Hole 2013). 7

Sample Glossed Text

Excerpt from the novel Số Đỏ (‘Dumb Luck’), chapter 5, by Vũ Trọng Phụng (1936) Câm đi, [kə̆ mA1 ɗiA1 mute imp

đồ ngu! Khi người ta cổ động đàn ɗoA2 ŋuA1 xiA1 ŋɨəjA2.taA1 koC1.ɗowŋ͡mB2 ɗanA2 thing stupid moment one propagandize flock

bà ɓaA2] grandmother ‘Shut up, stupid thing! When one campaigns for women . . .’ thì phải biết là cũng có năm bảy thứ đàn bà! [tʰiA2 fajC1 bietD1 laA2 kuŋ͡mC2 kɔA1 nămA1 ɓăjC1 tʰɨB1 ɗanA2 ɓaA2] cop ought know cop also exist five seven kind flock grandmother ‘One has to know that there are several kinds of women!’ Khi người ta nói phụ nữ . . . [xiA1 ŋɨəjA2.taA1 nɔjB1 fuB2.nɨC2] moment one speak women ‘When one says women . . .’ là nói vợ con chị em người khác, [laA2 nɔjB1 vəB2 kɔnA1 ʧiB2 ɁɛmA1 ŋɨəjA2 xakD1] cop speak wife child older.sister younger.sibling person different ‘One is talking about other people’s wives, sisters and children, . . .’ chứ không phải vợ con chị em của ta! [ʧɨB1 xowŋ͡mA1 fajC1 vəB2 kɔnA1 ʧiB2 ɁɛmA1 kuəC1 taA1] but neg ought wife child older.sister younger.sibling poss one ‘but not their own wives, sisters and children!’

vietnamese ( tiếng việt )

951

Mợ đã hiểu chưa? Người khác thì được, mà [məB2 ɗaC2 hiewC1 ʧɨəA1 ŋɨəjA2 xakD1 tʰiA2 ɗɨəkD2 maA2 mother ant understand not.yet person different cop allowed but mợ, məB2] mother ‘Do you understand? It’s fine for other people, but you . . .’ mợ là vợ tôi, [məB2 laA2 vəB2 tojA1] mother cop wife I ‘you are my wife’ thì mợ không thể tân thời như người khác được! [tʰiA2 məA1 xowŋ͡mA1 tʰeC1 tə̆ nA1.tʰəjA2 ɲɨA1 ŋɨəjA2 xakD1 ɗɨəkD2] cop mother neg possible modern like person different allowed ‘so you can’t be modern like other people!’

8 Bibliography Aubaret, Gabriel. 1867. Grammaire annamite suivie d’un vocabulaire: Imprimerie impériale. Brunelle, Marc. 2009a. Northern and Southern Vietnamese Tone Coarticulation: A Comparative Case Study. Journal of Southeast Asian Linguistics 1: 49–62. ———. 2009b. Tone perception in Northern and Southern Vietnamese. Journal of Phonetics 37: 79–96. Brunelle, Marc, Kiều Phương Hạ & Martine Grice. 2012. Intonation in Northern Vietnamese. The Linguistic Review 29: 3–36. Brunelle, Marc & Stefanie Jannedy. 2013. The Cross-dialectal Perception of Vietnamese Tones: Indexicality and Convergence. In D. Hole & E. Löbel (eds.), Linguistics of Vietnamese. John Benjamins, 9–34. Brunelle, Marc & Thị Xuyến Lê. 2013. Why is sound symbolism so common in Vietnamese? In J.P. Williams (ed.), The Aesthetics of Grammar: Sound and Meaning in the Languages of Mainland Southeast Asia. Cambridge: Cambridge University Press, 83–98. Brunelle, Marc, Duy Dương Nguyễn & Khắc Hùng Nguyễn. 2010. A Laryngographic and Laryngoscopic Study of Northern Vietnamese Tones. Phonetica 67: 147–69. Cao, Xuân Hạo. 1985. Về cương vị ngôn ngữ học của “tiếng”. Ngôn Ngữ: 25–53.

952

brunelle

———. 1992. Some Preliminaries to the Syntactic Analysis of the Vietnamese Sentence. Mon-Khmer Studies 20: 137–51. ———. 2003 [1980]. Hai loại danh từ của tiếng Việt. In D.K. Nguyễn (ed.), Tiếng Việt: mấy vấn đề ngữ âm, ngữ pháp, ngữ nghĩa. Đà Nẵng: Nhà xuất bản Giáo dục, 265–304. Clark, Marybeth. 1996. Conjunction as copula in Vietnamese. Mon-Khmer Studies 26: 319–31. De Rhodes, Alexandre. 1651. Dictionarium annamiticum lusitanicum, et latinum Rome: Sacrae congregationis de propaganda fidae. Đỗ-Hurinville, Danh Thành. 2009a. Étude du topicalisateur thì en vietnamien. Bulletin de la Société de Linguistique de Paris CIII: 411–43. ———. 2009b. Temps, aspect et modalité en vietnamien. Étude comparative avec le français Paris: L’Harmattan. ———. 2010. Les parties du discours en vietnamien: grammaticalization et transcatégorialité. Bulletin de la Société de Linguistique de Paris CV: 327–70. Duffield, Nigel. 2013. Head-First: On the head-initiality of Vietnamese clauses. In D. Hole & E. Löbel (eds.), Linguistics of Vietnamese. Amsterdam: John Benjamins, 127–54. Emeneau, Murray. 1951. Studies in Vietnamese (Annamese) grammar. Berkeley: University of California Press. Enfield, Nick J. 2001. On genetic and areal linguistics in Mainland South-East Asian: Parallel Polyfunctionality of ‘acquire’. Areal Diffusion and Genetic Inheritance. In A. Aikhenvald & R.M.W. Dixon (eds.), Problems in Comparative Linguistics. Oxford: Oxford University Press, 255–90. Hà, Kiều Phương. 2012. Prosody in Vietnamese: Intonational Form and Function of Short Utterances in Conversation. Cologne: University of Cologne Ph.D. Hanske, Theresa. 2013. Serial Verbs and Change of Location Constructions in Vietnamese. In D. Hole & E. Löbel (eds.), Linguistics of Vietnamese. Amsterdam: John Benjamins, 185–215. Haudricourt, André-Georges. 1952. Les voyelles brèves du vietnamien. Bulletin de la Société de Linguistique de Paris 48: 90–93. Hoàng, Thị Châu. 1989. Tiếng Việt trên các Miền Đất Nước. Hà Nội: Nhà Xuất Bản Khoa Học Xã Hội. Hole, Daniel. 2013. Focus particles and related entities in Vietnamese. In D. Hole & E. Löbel (eds.), Linguistics of Vietnamese. Amsterdam: John Benjamins, 265–304. Kirby, James. 2010. Dialect experience in Vietnamese tone perception. Journal of the Acoustical Society of America 127. ———. 2011. Vietnamese (Hanoi Vietnamese). Journal of the International Phonetic Association 41, 381–92.

vietnamese ( tiếng việt )

953

Michaud, Alexis. 2004. Final Consonants and Glottalization: New Perspectives from Hanoi Vietnamese. Phonetica 61: 119–46. Nguyễn, Đình-Hoà. 1997. Vietnamese. Amsterdam and Philadelphia: John Benjamins. Nguyễn, Thị Anh Thư & John Ingram. 2007a. Acoustic and perceptual cues for compound-phrasal contrasts in Vietnamese. Journal of the Acoustical Society of America 122: 1746–57. ———. 2007b. Stress and tone sandhi in Vietnamese reduplications. Mon-Khmer Studies 37: 15–39. Nguyễn, Tưởng Hùng. 2013. The Vietnamese Noun Phrase. In D. Hole & E. Löbel (eds.), Linguistics of Vietnamese. Amsterdam: John Benjamins, 57–86. Nguyễn, Văn Lợi & Jerold Edmondson. 1997. Tones and voice quality in modern northern Vietnamese: Instrumental case studies. Mon-Khmer Studies 28: 1–18. Noyer, Rolph. 1998. Vietnamese ‘Morphology’ and the Definition of Word. U. Penn Working Papers in Linguistics 5: 65–89. OIF, Observatoire de la langue française de l’Organisation internationale de la Francophonie. 2010. La langue française dans le monde 2010. Paris: Nathan. Paris, Marie-Claude & Thị Xuyến Lê. 2013. On conjunction and comitativity in Vietnamese. In D. Hole & E. Löbel (eds.), Linguistics of Vietnamese. Amsterdam: John Benjamins, 243–64. Phạm, Andrea Hoà. 2001. Vietnamese Tone: Tone is not pitch. Toronto: University of Toronto. ———. 2006. Vietnamese Rhyme. Southwest Journal of Linguistic 25: 107–42. ———. 2008. Is there a prosodic word in Vietnamese? Toronto Working Papers in Linguistics 29: 1–23. Simpson, Andrew & Hảo Tâm Hồ. 2013. Vietnamese and the typology of passive constructions. In D. Hole & E. Löbel (eds.), Linguistics of Vietnamese. Amsterdam: John Benjamins, 155–84. Thompson, Laurence. 1965. Vietnamese Reference Grammar Seattle: University of Washington Press. Trần, Thuần & Benjamin Bruening. 2013. Wh-Phrases as Indefinites: A Vietnamese Perspective. In D. Hole & E. Löbel (eds.), Linguistics of Vietnamese. Amsterdam: John Benjamins, 217–42. Trương, Vĩnh Ký. 1883. Grammaire de la Langue Annamite Saigon: C. Guillaud et Martinon. Vũ, Thanh Phương. 1981. The Acoustic and Perceptual Nature of Tone in Vietnamese. Canberra: Australian National University. ———. 1982. Phonetic Properties of Vietnamese Tones across dialects. In D. Bradley (ed.), Papers in Southeast Asian Linguistics. Australian National University, 55–75.

section 8 Khmuic

․․

chapter 14

Kammu Jan-Olof Svantesson and Arthur Holmer 1

Background

Kammu (/km̀ .múʔ/ or /km.m̥ uʔ/; also written Kmhmu’, Khmuʔ, Khmu, etc.) belongs to the Khmuic branch of the Austroasiatic family. It is the largest minority language in Laos, spoken by about 600,000 people mainly in the northern parts of Laos; there are also speakers in adjacent areas of Thailand, Vietnam and China. There are three main dialects: Northern, Western and Eastern Kammu. Eastern Kammu lacks lexical tones, while the other dialects have developed a tone system consisting of one high and one low tone (see 2.2.1 and 2.4 below). The Kammu divide themselves into subgroups (t.mɔ́ j), which are associated with different (sub-)dialects. Northern Kammu consists of the three sub-dialects Yùan /jùan/, Kwɛ̀ɛn and Cwàa (or Lɨ̀ɨ) and Western Kammu consists of the Rɔ̀ɔk and Krɔ́ɔŋ sub-dialects. The Eastern dialect is known as Ùu to the speakers of Northern Kammu, although they themselves use other t.mɔ́ j names as well. The differences between the dialects are small, and all of them are mutually comprehensible. The only major phonological difference is the development of tones in the Northern and Western dialects. Otherwise the difference is mainly in the vocabulary; the negation is used as a shibboleth: it is pə́ in Yùan, ʔàl in Kwɛ̀ɛn, kʰát in Rɔ̀ ɔk, pé in Krɔ́ ɔŋ and ʔam in the Eastern dialect. Almost all Kammu in Laos are at least bilingual with Lao as their second language. Lao is the only language of instruction in the schools and Kammu is primarily a spoken language. All modern scientific and political terms are taken over directly from Lao, as are everyday terms not belonging to traditional Kammu life. In the Northern Kammu area, all inhabitants of many villages have moved to live closer to rivers and roads and are now losing their traditional way of life based on swidden farming of glutinous rice and are becoming integrated in the Lao majority society. Here the tonal Northern dialect is described. The syntactic examples in Sections 4 and 5 are taken from folk tales recorded by Kristina Lindell and Kàm Ràw from the 1970s onwards and thus reflect a rather conservative language. A few syntactic examples were elicited from Kàm Ràw; they are marked K.R.

© koninklijke brill nv, leiden, ��5 | doi ��.��63/9789004283572_020

958

svantesson and holmer

The first publication on the Kammu language is a ‘Khmous’ vocabulary recorded by Doudart de Lagrée in Luang Prabang (de Lagrée & Garnier 1873). Some wordlists were also published in other early works but the first scholarly publication is Henri Maspero’s ‘T’èng’ vocabulary, based on his investigations in 1912, but published posthumously in 1955. A short grammar was published by William Smalley in 1961. Michel Ferlus has written several articles about Kammu language and folk literature, and also articles which deal with the place of Kammu in Austroasiatic (Ferlus 1974). Suwilai Premsrirat at Mahidol University and her group have published a Kammu grammar (Premsrirat 1982, 1987) and several dictionaries of different dialects, including Premsrirat (2002). Another dictionary was published by Suksavang Simana, Somseng Sayavong and Elisabeth Preisig (1994). Kammu in China has been studied by Lǐ Dàoyǒng and more recently by Dài Qìngxià and others, who also investigated Kammu in Laos (2012). The Kammu Language and Folklore project at Lund University was initiated by Kristina Lindell in 1972 and Kàm Ràw (Damrong Tayanin), a native speaker of the Yùan sub-dialect, joined the project in 1974. The first publication on the Yùan dialect (Lindell 1974) was followed by a description of the phonology and morphology of this dialect (Svantesson 1983) and a comprehensive encyclopedic dictionary (Svantesson et al. 2014). The phoneticians David House (KTH, Stockholm) and Anastasia Karlsson (Lund University) made detailed studies of Kammu prosody (Svantesson & House 2006; Karlsson, House & Svantesson 2012) within the project. In addition to the language studies, Kammu oral literature was investigated extensively in a series of books written in cooperation with the folklorist Jan-Öjvind Swahn (e.g. Lindell, Swahn & Tayanin 1984). Kammu music has also been studied by the ethnomusicologist Håkan Lundström (e.g. Lundström 2010) within this project. A comprehensive bibliography of works on Kammu language and culture is given in Svantesson et al. (2014). 2

Phonetics and Phonology

2.1 Word and Syllable Structure There are two kinds of syllables in Kammu, traditionally called major and minor syllables. Major syllables have an onset, a vowel kernel and a non-obligatory coda; they also have a (high or low) tone. The major syllable onset is any of the consonants given in (1) below, or one of the clusters pl, pr, ɓl, tr, cr, kl, kr, kw, kʰw. The kernel is any of the vowels in (2), i.e. either a short or a long monophthong or a diphthong. The coda consists of at most one consonant. Stop consonants are pronounced without a release when they are codas, and voiced stops, aspirated stops and glottalized glides cannot be codas.

kammu

959

Minor syllables are unstressed pre-syllables which have no phonemic vowel. Together with a major syllable they form ‘sesquisyllabic’ words, e.g. km̀ .múʔ ‘human being’. In careful pronunciation, minor syllables are often pronounced with a schwa-like vowel as kernel ([kə̀ m.múʔ] in this example). There are two kinds of minor syllables: tonal and non-tonal. Tonal minor syllables (such as km̀ in km̀ .múʔ) have a consonant onset and coda and also a tone (exceptions are the minor syllables l̀ and r̀, which are analysed as lacking an onset). There are some restrictions on the codas of tonal minor syllables; in particular the laryngeal consonants h and ʔ cannot be minor syllable codas, and several minor syllable codas occur only in words where the major syllable has the same coda, see 3.2 below. Non-tonal minor syllables consist only of a consonant onset and have no tone, e.g. c in c.mə̀ʔ ‘rope’. In careful speech, these minor syllables are pronounced with the same tone as the major syllable of the word: [cə̀.mə̀ ʔ]. Although these minor syllables lack a tone, we analyse words with them as sesquisyllabic. One reason for this is that they behave differently from monosyllabic words in the morphology. For example, causatives of monosyllabic verbs (including those with an initial cluster) are formed with the prefix pn- (3.2.2): kléʔ ‘visible’ > pǹ.kléʔ. Applying this prefix to a sesquisyllabic word (e.g. k.tám ‘heavy’) would produce a phonotactically ill-formed word (*pn.k.tám) with two minor syllables; instead the infix -m- is used, giving the causative form km̀ .tám. The great majority of indigenous monomorphemic words are either monoor sesquisyllabic. About one third of them are sesquisyllabic and two thirds are monosyllabic, according to a dictionary count. There are also longer words, most of them derived. Except for a few onomatopoeic words and Tai loans they consist of two or four mono- or sesquisyllables. Kammu has a rich morphology, partly based on reduplication, which builds words of this kind, in particular expressives (3.2.3) and intensifying reduplicatives (3.3). There are also some longer words which at least synchronically are monomorphemic. Many of them are formally built up by reduplication, but no regular morphological operations can be identified. Examples are sɛ́ŋ-sɛ́ŋ ‘pygmy shrew’, ʔàːm-làːm ‘mildew’, cŕ.lɔ̀ ŋ-tɔ̀ ŋ ‘garfish’, tŕ.lèːs-téːs ‘wagtail’. In some words of this kind, reduplication is not involved, e.g. pùːl-wàh ‘buckwheat’, cĺ.ʔùŋ-mùːs ‘to fall prone’. Compounds are common in Kammu, in particular compound nouns, which behave syntactically as other nouns; see also 3.1. 2.2 Phoneme Inventory The Kammu consonant phonemes are given in (1). The three series of stops contrast only in syllable onset position; in codas, supraglottal stops are unreleased, here taken as realizations of the voiceless unaspirated series /p, t, c, k/. The contrast between plain and glottalized glides is upheld only in onset

960

svantesson and holmer

position. Aspirated consonants are sometimes formed by a morphological operation; for example, applying causative p- (3.2.2) to the verb háːn ‘to die’ yields the causative pháːn ‘to kill’; in such cases we will regard them as clusters, although the pronunciation is not different from other aspirated stops ([pʰáːn] in this example). (1) Kammu consonant phonemes voiceless unaspirated stops p t c k ʔ voiceless aspirated stops pʰ tʰ cʰ kʰ voiced implosive stops ɓ ɗ voiceless fricatives s h nasals m n ɲ ŋ lateral l trill r glides w j glottalized glides ˀw ˀj

The vowel phonemes are given in (2). There is a length contrast for the vowels, and in addition to long and short monophthongs there are three diphthongs. Long vowels are almost twice as long as short vowels in positions where they contrast. There is no length contrast in open syllables or before the laryngeal consonants ʔ and h. Before ʔ and h, the duration of non-contrastive vowels is intermediate between the durations of long and short contrasting vowels, and in open syllables the vowel duration is rather long; in spite of this, vowels will be written without a length mark in all non-contrasting positions. The open mid vowels ɛ, ʌ, ɔ do not contrast in length and for this reason they will be written without length mark; when followed by a non-laryngeal consonant, their duration is similar to that of the long vowel of a contrasting pair, however. (2) Kammu vowel phonemes i ɨ u iː ɨː uː e ə o eː əː oː ɛ ʌ ɔ a aː

ia ɨa ua

2.2.1 Tones Northern Kammu has a simple tone system with two level tones, high and low. All syllables (except non-tonal minor syllables) have tones. The phonetical difference between the tones is rather small, typically 15–30 Hz (Svantesson & House 2006). Although there may be small voice quality differences

961

kammu

between the tone categories, perception tests (Gandour, Gårding & Lindell 1978; Svantesson & House 2006) have shown that fundamental frequency is the most important cue for differentiating the categories both in Northern and Western Kammu. The Eastern Kammu dialect does not have lexical tones, and the tones in tonal dialects have developed in connection with the merger of voiced and voiceless initial consonants; see 2.4. As a consequence of the development of the tone system from consonant contrasts, there are strong restrictions on how tones combine with onset consonants: Monosyllabic words whose onset is a voiceless fricative (s, h), an aspirated or implosive stop (pʰ, tʰ, cʰ, kʰ, ɓ, ɗ ) or a glottalized glide (ˀw, ˀj) always have high tone, and those with the glottal stop as onset always have low tone. Several facts, in particular from tonal morphophonology, show that the tones should be analysed as belonging phonologically to the syllable onset, although they are phonetically realized on the rhyme. For example, when the nominalizing infix -rn- (see 3.2.1) is applied, the tone of the monosyllabic base is transferred to the minor syllable of the verbal noun, and the major syllable, which gets the onset n, always has low tone (3a). Another example is verbs derived with the causative prefix p- (3.2.2), which always get high tone (b). See Svantesson (1983) for more details on Kammu tonal morphophonology. (3) Tonal morphophonology a. pɔ́ ʔ ‘to sweep’ pɔ̀ʔ ‘to carry in a sling’

pŕ.nɔ̀ ʔ ‘broom’ pr̀.nɔ̀ ʔ ‘carrying-sling’

b. kɨ́an ‘to climb’ cùːr ‘to go down’

p.kɨ́an p.cúːr

‘to lift up’ ‘to lower’

2.3 Intonation The description of Kammu intonation given here is based on Karlsson, House & Svantesson (2012). The main function of sentence intonation in Kammu is to mark prosodic boundaries. The right edge of each prosodic phrase is marked with a phrase boundary realized as high (or high falling) pitch. Phrase boundaries are multifunctional, conveying phrasing, focus, information structure and engagement. The phrase boundary tone is the normal tonal carrier of focus. Focus is by default placed at the end of an utterance, coinciding with the place of the boundary tone, and the pitch of the phrase boundary tone is raised. There is usually no additional tonal gesture for focal accent, and Kammu can thus be characterized as an edge-marking language in the typology proposed by Jun (2005).

962

svantesson and holmer

It is also possible to focus constituents that are not phrase-final. In this case as well, focus is marked by increased pitch, but this increase is small and focus on a pre-final word does not override the boundary tone, which still gives the final word higher pitch than the other words in the prosodic phrase (provided that the influence of lexical tones is compensated for, see below). This shows that focal accent is subordinate to the boundary tone, a conclusion that is confirmed by differences in lengthening: focused words are lengthened considerably in phrase-final position, but a non-final focused word is lengthened much less. The lexical tones interact with the boundary tone in interesting ways. Here, comparison with the Eastern dialect, which lacks lexical tones, is illuminating. Basically, the intonation pattern is the same in the Northern (tonal) and Eastern (non-tonal) dialects. In Eastern Kammu, a boundary tone normally creates a final rising gesture, the final word of the prosodic phrase getting higher pitch than the pre-final one. In Northern Kammu there is a strong influence of lexical tone, however. When the combination of lexical tones on the last syllables is low–high, completely compatible with this rising gesture, the final rise has a size similar to that in Eastern Kammu. The lexical tone combinations low–low and high–high do not conflict with the final rise, which occurs in these cases as well, even if the rise is rather small for the combination high–high. The high– low lexical tone configuration is in direct conflict with the rising gesture, which is levelled out when the final word is focused, and may even become a fall. In this way, the identities of the lexical tones are preserved in all environments, showing that the marking of lexical tones by pitch is the strongest prosodic feature in Northern Kammu. This leads to the following prosodic hierarchy: (4)

Kammu prosodic hierarchy 1. Lexical tone 2. Boundary tone 3. Focal accent

Boundary tones also serve to mark information structure. The flow of information in a narrative defines informational phrases starting with old information and ending with new information. These informational phrases consist of one or more prosodic phrases, and there is usually upstepping of the boundary tones of the prosodic phrases inside an informational phrase (Karlsson, Svantesson & House 2013). The pitch range is generally larger in the Eastern dialect, which lacks lexical tones, than in the Northern dialect, and there is also greater variation in pitch

kammu

963

range between speakers of Eastern Kammu. It seems that the presence of lexical tones restricts the Northern Kammu speakers’ use of pitch for other purposes. Northern Kammu speakers rely on rather small pitch differences both in the production and perception of lexical tones, which may necessitate greater control of pitch than Eastern speakers need. The intonation systems of Northern and Eastern Kammu are basically identical, and the main differences between the dialects are adaptations just large enough to enable the Northern Kammu speakers to uphold the identities of the lexical tones in different contexts. The intonation patterns adapt to the lexical tones, but do so only when the identities of the tones are jeopardized. 2.4 Tonogenesis Both Northern Kammu described here and Western Kammu have a simple two-tone system, but Eastern Kammu lacks tones. This is the only major phonological difference between the dialects. Eastern Kammu represents the original state, and the other dialects have developed tones rather recently from a historical linguistic perspective. In Eastern Kammu, there is a contrast between voiceless and voiced stops (p, t, c, k vs. b, d, ɟ, g) and also between voiceless and voiced sonorants (m̥ , n̥ , ɲ̊ , ŋ̊ , l ̥, r̥, w̥ , j̊ vs. m, n, ɲ, ŋ, l, r, w, j). In Northern Kammu, the voiceless/voiced pairs have merged into the unmarked member of the pair (voiceless for stops and voiced for sonorants), giving rise to high/low tone (5a). Western Kammu had a similar development, except that the voiced stops became aspirated. The other consonants in Eastern Kammu, aspirated stops (pʰ, tʰ, cʰ, kʰ), voiceless fricatives (s, h), glottalized glides (ˀw, ˀj, ˀm, ˀn) and the glottal stop ʔ do not form voiceless/voiced pairs. They were not changed in the other dialects except for the glottalized nasals (ˀm, ˀn) which became voiced, slightly implosive stops ɓ, ɗ. Words with these onsets got high tone in Northern and Western Kammu (5b), except that words with a glottal stop onset got low tone in Northern Kammu (but high tone in Western Kammu, as expected). (5) Kammu tonogenesis Eastern Northern Western a. klaːŋ kláːŋ kláːŋ ‘eagle’ glaːŋ klàːŋ kʰlàːŋ ‘stone’ m̥ aːn máːn máːn ‘to bury’ maːn màːn màːn ‘pregnant’

964

svantesson and holmer

b. kʰuʔ kʰúʔ kʰúʔ ‘weaver ant’ sɨaŋ sɨ́aŋ sɨ́aŋ ‘pig’ ˀjiak ˀjíak ˀjíak ‘shit’ ˀmec ɓéc ɓéc ‘question particle’ ʔɔŋ ʔɔ̀ ŋ ʔɔ́ ŋ ‘hornet’

Kammu tonogenesis must have taken place rather late. An exact date is difficult to establish, but the phonology of Tai (Lao and Lü) loanwords in Kammu shows that Kammu tonogenesis took place after the creation of Tai writing systems in the 13th century and after the subsequent Tai tone split (Svantesson 2011), although it may have occurred considerably later. The preservation of a dialect without tones indicates recent tonogenesis, as does the fact that the tones in the tonal dialects have retained their original shape as high and low, consistent with the tonogenetic mechanism that created them. 2.5 Tai Loanwords There are several layers of loanwords from Lao and related languages in Kammu. Lao is genetically unrelated to Kammu and belongs to the Southwestern branch of the Tai language group, which also includes Lü, Thai (the main language of Thailand), Shan and several other languages. See Svantesson (2011) for a detailed description of the phonology of Tai loanwords in Kammu. The older loans are fully integrated into Kammu phonology, and are regarded by native speakers as normal Kammu words. Kammu derivational morphology may be applied to them, e.g. the nominalizing infix -rn-: tɔ́ p ‘to answer’ > tŕ.nɔ̀ p ‘answer (n.)’. The phonological forms of the old loans show that they started to be borrowed into Kammu at the Proto-Southwestern Tai stage (Li 1977), before the Southwestern Tai languages had differentiated from each other. Kammu has retained many features of Proto-Southwestern Tai in these loans, for example the contrast between voiceless and voiced stops and sonorants, which is preserved unchanged in Eastern Kammu and as a tone contrast in Northern and Western Kammu. There is now a tendency to use the Lao initial consonant instead of the traditional Kammu one in some of these loanwords, e.g. tʰòːt instead of tòːt ‘guilt’ (Lao tʰoːt), hók instead of rók ‘six’ (Lao hok) and sù instead of cù ‘lover’ (Lao suː). This has created words that disobey the restrictions on how tones and onsets can be combined in traditional Northern Kammu (see 2.2.1).

965

kammu

3

Word Formation

Kammu is an isolating language lacking inflectional morphology, but there is a fair amount of derivational morphology using prefixes, infixes and reduplication. The three main open word classes are nouns, verbs (including adjectives) and expressives (words which describe how something sounds, looks or feels). 3.1 Compounding Compounding is a common process for forming new words in Kammu. In some cases their meaning is more or less clearly derived from the constituents (c.ʔáːŋ káːp [bone chin] ‘jawbone’; hḿ.púːr t.nɔ́ h [skin mouth] ‘lip’; háʔ lɔ̀ h [burn body] ‘to have fever’; cúʔ hɨ́al [want vomit] ‘to feel sick’), but in other cases not (hḿ.púːr tráːk [skin buffalo] ‘Engelhardtia spicata’, a tree). In many cases it is difficult to differentiate compound words from syntactically formed noun phrases. Class nouns such as túːt ‘plant’, síːm ‘bird’, tís ‘mushroom’, etc., are often used as heads of compounds. 3.2 Derivational Affixes In addition to compounding and reduplication, prefixes and infixes are commonly used for deriving words. A special feature of Kammu is the use of coda-reduplicating prefixes, where the minor syllable coda of a prefix is assimilated to the coda of the major syllable forming the base word. This process is involved in the derivation of verbal nouns (e.g. rc̀.húːc ‘stinging’ from húːc ‘to sting’; see 3.2.1) and causative verbs (e.g. pɲ́ .pàːɲ ‘to make somebody drunk’ < pàːɲ ‘drunk’; 3.2.2); it is also involved in the formation of static expressives (3.2.3). The assimilating coda will here be written ‘C’, so that the verbal noun prefix is written rC- and the causative prefix pC-. In general, only non-tonal (coda-less) minor syllables and tonal minor syllables with the codas m, n, ŋ, l, r can combine freely with major syllables. Minor syllable codas consisting of an obstruent (p, t, c, k, s), a glide (w, j) or the palatal nasal (ɲ) are bound in the sense that they must be identical to the major syllable coda. Words with a bound minor syllable coda are often formed by assimilation, but some of them are at least synchronically monomorphemic, e.g. pc̀.ràːc ‘mesentery’. Derivational affixes are not entirely productive in Kammu, although some of them, like the nominalizers rn/rC- and -rn- and causative pn/pC-, can be applied to rather many words.

966

svantesson and holmer

3.2.1 Deriving Nouns Verbal nouns are derived from verbs with the prefix rn/rC- (rǹ.pɔ́ ʔ ‘sweeping’ < pɔ́ ʔ ‘to sweep’; rt.̀ kìat ‘anger’ < kìat ‘angry’) or the infix -rn- (pŕ.nɔ̀ ʔ ‘broom’ < pɔ́ ʔ ‘to sweep’; kŕ.nèɲ ‘stabbing’ < kéɲ ‘to stab’). Derived nouns are often action nominals denoting the action of the verb or the way of performing the verb action. However, in many cases a more concrete noun is derived, often denoting the instrument of the action (hŕ.nùːc ‘sting (of an insect)’ < húːc ‘to sting’), but also other functions (rǹ.tèn ‘chair’ < tèn ‘to sit’; tŕ.nə̀ːm ‘song’ < tə́ːm ‘to sing’; rǹ.cét ‘sourness’ < cét ‘sour’). If two nouns are derived from the same verb, it is often (but far from always) the case that the prefix derives an action nominal, while the infix derives a more concrete noun (as in the examples rǹ.pɔ́ ʔ ‘(action of) sweeping’ vs. pŕ.nɔ̀ ʔ ‘broom’). Action nominals can also be derived with the marker sə́ŋ (see 5.1). The choice between the prefixes rn- and rC- is partly phonological and partly lexical. Since *rʔ and *rh are not possible minor syllables, verbs with the major syllable codas -ʔ or -h (or no coda) must take the prefix rn-; if rC- is possible, it is often preferred, but not always, as the example rǹ.cét ‘sourness’ shows. There is some variation between dialects (and even between speakers) in this respect. Some dialects prefer rn- more often than the Northern dialect described here. 3.2.2 Deriving Verbs There are several morphological processes that derive verbs from other verbs, including: Causative verbs are formed with the prefixes pn- (pń.màh ‘to feed’ < màh ‘to eat’) or pC- (pś.ràːs ‘to dry’ < ràːs ‘dry’) and the infix -m- (sḿ.kàr ‘to straighten’ < s.kár ‘straight’). The choice between pn/pC- and -m- is phonologically conditioned: they are used with monosyllabic and sesquisyllabic verbs, respectively. As is the case with the nominalizer rn/rC-, the choice between pn- and pC- is partly phonological and partly lexical. There is also a prefix p-, which can be added to monosyllabic words (p.cúːr ‘to take downː < cùːr ‘to go down’; pháːn ‘to kill’ < háːn ‘to die’). For some verbs, there is a contrast between p- and pn/pC-. For example, háːn ‘to die’ has two causatives: pháːn and pǹ.háːn. In such cases, the p- causative emphasizes the intention and the pn/pC- causative emphasizes the result. Thus, although pháːn usually corresponds to ‘to kill’, a more exact definition is ‘to do something with the intention to kill, to try to kill’. It is possible to say kə̀ pháːn tráːk, tráːk pə́ háːn [3SG.M pCAUS-die buffalo, buffalo NEG die] ‘He tried to kill the buffalo, but the buffalo did not die’. By contrast, the sentence kə̀ pǹ.háːn tráːk

kammu

967

[3SG.M pnCAUS-die buffalo] implies that the buffalo actually died, although this may not have been the intention of the causer. Thus it can be paraphrased as ‘He did something that resulted in the death of the buffalo’. Another example is the two causatives p.túʔ and pń.tùʔ formed from the verb tùʔ ‘to run away’. The causative p.túʔ can mean ‘to (try to) scare away’ e.g. wild animals from the village (without necessarily succeeding), and pń.tùʔ can mean ‘to scare away (game) by mistake’ when hunting. However, in most cases there is only one derived causative—usually of the pn/pC- (or -m-) type; see also 5.4.2. Anticausative (or passive) verbs are rather few in number. They are usually derived with the prefix hn-: hń.cáːk ‘torn’ < cáːk ‘to tear’. Reciprocal verbs are formed with the prefix tr-: pók ‘to bite’ > tr̀.pók ‘to bite each other’. If the base verb has the onset r, the prefix is tŋ-: ràːc ‘to scratch’ > tŋ́ .ràːc ‘to scratch each other’. Quotative verbs meaning ‘to say X’ are formed with the prefix kn-. Most of them are derived from interjections or onomatopoeic words: ʔàr-ʔàr ‘call used for calling pigs’ > kń.ʔàr ‘to call pigs’; ʔə̀ ‘yes’ > kń.ʔə̀ ‘to agree’. Intensive verbs have the same basic meaning and syntactic function as the verb they are derived from, but express a more forceful action, or have a more intensive meaning in some sense. They are derived by various prefixes and infixes, e.g.: kǹ.cís < cís ‘to recoil’; pr̀.cɨ́ < cɨ́ ‘to remember’. 3.2.3 Deriving Expressives As in many other Mon–Khmer languages, there is a separate word class of expressives: words that describe how the speaker perceives a situation with the senses—how it looks, sounds, smells, tastes or feels—or that describe the speaker’s feelings. Syntactically they are usually adverbs giving a more precise or intense meaning to the verb. They include the class of onomatopoeic words. Most Kammu expressives are formed from a monosyllabic root that carries the basic meaning, but usually does not occur as a free word (exceptions include some onomatopoeics where the root describes a sound heard once). A set of up to eleven expressives can be formed from each root by regular morphological operations which include reduplication, prefixation, infixation and combinations of them. This process is not entirely productive, and in all cases the words actually attested represent only a subset of all theoretically possible forms. A few expressives are derived from other word classes, usually verbs. Each expressive-forming operation has its own meaning which is related to the structure of the situation it describes. The forms are shown in (6), exemplified with the roots ŋàk ‘bend down’ and króp ‘clanking sound’. Here, R is the root, C is a lexically determined consonant, Cf is the coda consonant of the root, and R1 is a variant of the root with a different vowel (often uː or u); if the punctual

968

svantesson and holmer

form 7 is monosyllabic, R in form 11 is this syllable rather than the root, and R1 is based on this syllable. The consonant C is always the same in forms 5, 7 and 9, but in form 11 it may be another consonant, often s. (6) Expressive morphology 1 R momentaneous 2 R-R dynamic durative 3 R-knR dynamic durative successive 4 R-rŋR dynamic durative plural 5 CCfR static 6 rCfR static plural 7 CR punctual 8 rR punctual plural 9 CR-CR iterated 10 rR-rR iterated plural 11 ClR1-ClR irregular Examples: root: *ŋàk 3 ŋàk-kń.ŋàk 4 ŋàk-rŋ̀ .ŋàk 5 cḱ.ŋàk 6 rk̀.ŋàk 7 c.ŋák 8 r̀.ŋàk 9 c.ŋák-c.ŋák 10 r̀.ŋàk-r̀.ŋàk 11 cĺ.ŋùk-cĺ.ŋàk

‘one being keeps nodding downwards’ ‘many beings keep nodding downwards’ ‘one being whose head is bent down’ ‘many beings whose heads are bent down’ ‘one being nods once’ ‘many beings nod once at the same time’ ‘one being nods at intervals’ ‘many beings nod simultaneously at intervals’ ‘many beings keep nodding here and there’

root: króp 1 króp 2 króp-króp 4 króp-rŋ̀ .króp 9 c.króp-c.króp 11 sĺ.krúp-sĺ.króp

‘clanking sound heard once’ ‘clanking sounds heard many times (in one place)’ ‘clanking sounds heard many times (in many places)’ ‘clanking sounds heard at intervals’ ‘clanking sounds heard here and there’

The expressive form 1 describes an event (in almost all cases a sound) that happens once. Forms 2, 3 and 9 describe an ongoing event; the event in form 3 consists of part-events that are more or less equally spaced in time, while the event in form 9 consists of repetitions of a punctual event. Form 7 describes events that are punctual (or brief), and it is also used with inchoative meaning: entering a state.

kammu

969

Form 5 describes states. The ‘plural’ forms 4, 6, 8 and 10 describe events which involve several entities. In the corresponding ‘singular’ forms 2–3, 5, 7 and 9, by contrast, only one entity is involved. The ‘irregular’ expressive form 11 describes events that take place at irregular intervals and/or in many different places. Expressive morphology is to a large extent based on processes which are iconic, showing some parallelism between the linguistic form and the meaning. For example, durativity, i.e. extension in time, is expressed by reduplication which extends the duration of the word (forms 2–4 and 9–11), and the use of the vibrant [r] for plurality is clearly iconic, since [r] is the only Kammu phoneme which audibly consists of several repetitions of one sound. The vowel change in the irregular expressives (form 11) is also iconic. Another kind of iconicity is also involved. For many expressives, the relative size of the entities involved is expressed by the use of different vowels in the root. For example, the five words tɨ̀ːɲ-tɨ̀ːɲ, tɔ̀ ɲ-tɔ̀ ɲ, tʌ̀ɲ-tʌ̀ɲ, tèːɲ-tèːɲ and tɛ̀ɲ-tɛ̀ɲ describe fires of different sizes, from large to small. Similarly, ŋùkkń.ŋùk, ŋòk-kń.ŋòk, ŋàk-kń.ŋàk and ŋèk-kń.ŋèk describe nodding done by beings of different (decreasing) sizes. Properly speaking, the root consists only of the consonants (and the tone). For onomatopoeic expressives, the scale is from low-pitched to high-pitched sound, as if produced by beings of decreasing size. Although the vowels in the size series are different for different expressives, they are with very few exceptions consistent with each other, each forming a subset of the following size scale: u, uː, ua > ɨ, ɨː > ɨa > o, oː > ə, əː > ɔ > a, aː > ia> i, iː, ʌ > e, eː > ɛ (Svantesson & Tayanin 2003). Thus the vowel [u] describes the largest and [ɛ] the smallest entities. The scale goes diagonally through the vowel space starting with [u], a vowel for which both the first and the second formant are small and ending up with [ɛ], which has relatively high values for both F1 and F2. 3.3 Reduplication The meaning of many words, mainly verbs and expressives, but also some nouns, can be intensified by suffixing a changed copy of the word. For example, péŋ-póːŋ is an intensified form of péŋ ‘to cut’ and pʰá-pʰáŋ is an intensified form of pʰá ‘to cleave’. As these examples show, either the vowel or the whole rhyme is changed in the suffixed form. The changed vowel is often oː if the vowel of the base is short, and i if it is long. For the rhyme-changed suffix, the rhyme aŋ is by far the most frequent one, and eːŋ is also common. It can be noted that the tone never changes. Total reduplication is often used for intensification or for showing that something takes place for some time, e.g. jɔ̀ h jɔ̀ h ‘[he] went and went’, càʔ càʔ ‘very far away’ (see the text in section 7).

970

svantesson and holmer

A special kind of reduplication is used for adjectives, e.g. nà plìa-plìa [3SG.F pretty-pretty] ‘She is very pretty.’ In this case, the first repetition of the word is usually said at a much higher fundamental frequency (sometimes even using falsetto) and has longer duration than the second one. 4

Phrase and Clause Structure

4.1 Simple Sentences Kammu is an isolating language without inflectional morphology, so the marking of grammatical functions relies mostly on word order and a few markers. The basic word order is SV in sentences without direct object (7a), including sentences (b) with an adjectival predicate (arguably a sub-class of verbs, but see 5.4 for syntactic verb–adjective distinctions). In sentences with a direct object, the basic word order is AVP. ‘Direct object’ (P) is here taken to mean a noun phrase that follows the verb and has no special marking; a verb can be followed by at most one unmarked noun phrase argument. Ditransitive constructions are expressed by verb serializations, where the indirect object precedes the direct object, see 4.2.1 for details. There is no clear difference between intransitive and transitive constructions; most verbs (including adjectives) can be used with a direct object of this kind, which in addition to a prototypical direct object (c) can function e.g. as a goal (d) or location (e, f). Noun phrase predicates take the copula mə̀h (g). Prototypical direct objects can also be expressed with a serial verb construction with the verb mʌ̀t ‘take’, see (13). Clauses without an overt subject are common; see e.g. (7f) and several examples below. The direct object can be fronted for emphasis or contrast, see (24c). (7) a. ròk tə́ːm toad sing ‘The toad sang/is singing/sings.’ b. túːt k.cɔ́ k kì plìa plant banana PROX beautiful ‘This banana plant is beautiful.’ c. róːj l̀.wàːŋ kóh cŕ.kùːl tíʔ kə̀ spirit sky cut finger 3SG.M ‘The sky spirit cut off his fingers.’

kammu

971

d. s.ná wèc rɔ̀ t kúŋ 3DU return come village ‘They returned to the village.’ e. cḿ.rɔ̀ ʔ nìʔ kàːj jèt kàːŋ cm̀ .kɨ́n man MEDL still stay house woman ‘The men were still living in the women’s house.’ f. cúʔ plùʔ pə́ pɨ̀an jɔ̀ h sore leg NEG can walk ‘[He] got a bad leg and could not walk.’ g. kĺ.ʔàːk mə̀h róːj réʔ crow COP spirit field ‘The crow is a field spirit.’

4.1.1 Questions Polar questions can be marked by intonation only (8a) or with various question particles, including ɓéc (b), which can be used both clause-initially and clause-finally with no apparent difference in meaning, and clause-final há (c), as well as a V–NEG–V alternation construction (d), where the negated second instance of the verb may follow the entire verb phrase. (8) a. mè mʌ̀t mɨ́ar ʔòʔ 2SG.M take loincloth 1SG ‘Did you take my loincloth?’ b. ɓéc hŋ́ .kúːr k.màʔ rɔ̀ t Q storm rain come ‘Has there been a rainstorm?’ c. pɔ̀ kùːɲ plàːŋ rɔ̀ t kì há 2PL see PN come PROX Q ‘Have you seen Plàaŋ coming here?’ d. mè cə̀ pə̀ʔ màh pə́ pə̀ʔ 2SG.M IRR eat food NEG eat ‘Are you going to eat?’ (K.R.)1 1 Examples marked (K.R.) are elicited from Kàm Ràw (see 1. Background).

972

svantesson and holmer

In question-word questions, an interrogative pronoun (mə̀ʔ ‘whoʔ’, mə́h ‘whatʔ’, see 5.2.3) remains in the same place as a corresponding argument in a declarative clause (9a–c). Questions with an interrogative as subject are often constructed as cleft sentences (d); cf. 4.2.2. (9) a. mə̀ʔ cə̀ klám pàːr who IRR carry two ‘Who will carry two [loads]?’ b. mè mə̀h kɔ́ n mə̀ʔ 2SG.M COP child who ‘Whose child are you?’ c. ʔàʔ cə̀ pə̀ʔ mə́h 1DU IRR eat what ‘What shall we eat?’ d. mə̀h mə́h jèt tà pléʔ s.kóʔ k.nàːj COP what stay LOC fruit gourd DIST ‘What is it that is in that gourd?’

4.1.2 Imperative Sentences Commands often have an overt second person pronoun subject (10a), which is, however, not obligatory (b). Negated commands are formed with the prohibitive negation tá (c, d). The verb ʔɔ̀ r ‘to lead’ forms imperatives with first person subjects (e). When the imperative is intended as a suggestion, a hortative particle ʔə́ːm is often used (f). (10) a. mè pòk ʔàʔ tèʔ kàːn 2SG.M cut 1DU get carrying.pole ‘Cut a carrying-pole for us!’ b. mʌ̀t ˀwɨ́ar jɔ̀ h kɨ̀t kə̀ take knife go cut 3SG.M ‘Go cut it with a knife!’ c. mè tá pók ʔòʔ 2SG.M PROH bite 1SG ‘Don’t bite me!’

973

kammu d. tá ʔòːr PROH shout ‘Don’t shout!’ e. ʔɔ̀ r ʔàʔ jɔ̀ h ʔə̀h crɔ́ ʔ lead 1DU go make weir ‘Let’s go and make a weir!’ f. pà jɔ̀ h sɔ́ k pà tèʔ klèʔ ʔə́ːm 2SG.F go seek 2SG.F get husband HORT ‘Go find yourself a husband!’

4.2

Complex Sentences

4.2.1 Serial Verb Constructions Serial verb constructions portray a single event with two or more verbs, each verb representing part of that event. The verbs also share one or more arguments. Serial verb constructions are common in Kammu, for example a chain of motion verbs often with the resultative rɔ̀ t ‘to arrive’ as the final verb (11a) or verbs referring to speech actions (b). Verb chains where the object of the first verb is the subject of the second one can have resultative meaning, illustrated in (c). See Holmer (2005) for more details on serial verb constructions in Kammu. (11) a. ɲàːr tàr kàːj rɔ̀ t tà kàːŋ tè PN run return arrive LOC house REFL ’Ñàar ran back to his house.’ b. jàʔ nà nìʔ tɔ́ p lʌ̀ sáh, tá kà, old.woman 3SG.F MEDL answer speak say PROH climb kə̀ ʔàh k.ʔáːɲ 3SG.M exist hornet ‘The old woman answered, “Don’t climb, there are hornets in it”.’ c. jàʔ tʰáw nìʔ ɓák sɔ́ ʔ tè háːn old.woman old MEDL beat dog REFL die ‘The old woman beat her dog to death.’

974

svantesson and holmer

Indirect objects are expressed with a serial verb construction with the verb tèʔ ‘to get’, often combined with the verb ʔùːn ‘to give, let’ (12a); cf. (b) for evidence that ʔùːn in itself expresses the concept of giving. The direct object is placed after the serial verb construction. This construction is also used with a more general benefactive meaning (c). The same kind of construction can be used with other verbs than tèʔ (d, e). Some verbs like lə̀ŋ ‘scold’ and ròːs ‘angry’ are not constructed with a direct object but with a tèʔ construction (f). (12) a. ʔòʔ cə̀ ʔùːn mè tèʔ ŋɔ́ ʔ sḿ.làh 1SG IRR give 2SG.M get rice seed ‘I will give you some rice seeds.’ b. ʔòʔ ʔùːn k.múːl jʌ̀ kə̀ 1SG give money INS 3SG.M ‘I handed money to him.’ (K.R.) c. kḿ.pɨ̀ar kʰór kə̀ tèʔ hń.túʔ pangolin dig 3SG.M get hole ‘The pangolin burrowed a hole for him.’ d. r̀.wàːj cúʔ pók tè pə̀ʔ tráːk tiger want bite REFL eat buffalo ‘The tiger wanted to kill and eat the buffalo.’ e. jèm jə̀ʔ, táːm kòn tʰáw kɛ́ kùʔ tŕ.tɔ̀ h ʔìʔ kḿ.ɲèŋ when long.ago, follow person old old like tell 1PL listen ‘[This was] long ago, according to what old people used to tell for us to listen to.’ f. nà lə̀ŋ klèʔ tè tèʔ 3SG.F scold husband REFL get ‘She scolded her husband.’

In addition to being constructed as an unmarked noun phrase following the verb (7c), a prototypical direct object can also be expressed with a serial verb construction with the verb mʌ̀t ‘to take’ (13). This is common when there is an indirect object (b).

kammu

975

(13) a. mʌ̀t t.púʔ nìʔ wák tà cń.trɨ̀ŋ t.jáːk nìʔ take bugle MEDL hang LOC antler deer MEDL ‘[He] hung up the bugle on the deer’s antlers.’ b. nɔ̀ kàːj mʌ̀t máːr ʔùːn kə̀ tèʔ 3PL then take salt give 3SG.M get ‘Then they gave him some salt.’

4.2.2 Cleft Sentences Cleft sentences, which serve to focus on the identity of a single referent, can be formed with the copula mə̀h (14); the expletive subject kə̀ ‘he’ is common in them (b–c). This construction is particularly commonly used to focus on the identity of the subject, especially when the subject is an interrogative pronoun (9d). (14) a. mə̀h ʔòʔ kə̀m píɲ kə̀ nìʔ COP 1SG REL shoot 3SG.M MEDL ‘It was I who shot it.’ b. kə̀ tɔ́ ŋ mə̀h tɛ́k lɛ̀ʔ kə̀m pə̀ʔ kə̀ 3SG.M must COP PN ASRT REL eat 3SG.M ‘It must have been Tɛ́ɛk who ate it.’ c. kə̀ mə̀h jʌ̀ pɔ̀ jɔ̀ h klám kə̀ tə̀ŋ kúŋ tə̀ŋ kàːŋ 3SG.M COP INS 2PL go carry 3SG.M all village all house ‘It was because you carried it to all the houses in the whole village.’

4.2.3 Coordination Coordination of clauses is often marked by juxtapostion only (15a, b). For events that follow each other in time, the sentences are often linked by words as lə̀ ‘then’ (c) or wàːj nìʔ ‘after that’ (d). (15) a. r̀.wàːc k.máŋ tè, cùːr tà kàːŋ, rù hḿ.ràŋ, pàk hḿ.ràŋ grab sword REFL go.down LOC house lead horse ride horse tè jɔ̀ h REFL go ‘[He] grabbed his sword, went down from his house, fetched his horse and rode away.’

976

svantesson and holmer

b. ʔòʔ jɔ̀ h téːj kɔ́ n téːj kḿ.ràʔ tè 1SG go fetch child fetch wife REFL ‘I went to fetch my child and wife’. c. kə̀ kwáːt tà pɨ̀ːn kì hóːc, lə̀ rə̀h kwáːt tà còŋ 3SG.M scrape LOC ground PROX finish then rise scrape LOC high ‘When he had scraped the ground here, he climbed up to scrape above.’ wàːj nìʔ d. kə̀1 pǹ.trùʔ trɨ́al hóːc 3SG.M decorate pheasant finish after MEDL

kə̀1 3SG.M

cə̀ ʔùːn trɨ́al pǹ.trùʔ tè1 IRR give pheasant decorate REFL ‘It [the crow] had finished adorning the silver pheasant. After that it was going to let the pheasant adorn it [the crow].’

4.2.4 Relative Clauses The relativizer kə̀m in Yùan Kammu is probably a contraction of kə̀ mə̀h [3SG.M COP], since some dialects use the relativizer nàm (presumably from nà mə̀h [3SG.F COP]; Premsrirat 1982: 133). Relativization of subject (16a, b), direct object (c, d), oblique noun phrase (e) and possessor (f, g) are exemplified. The relativized position can be represented with a gap (a, c, e, f) or a resumptive pronoun (b, d, g). Even if resumptive pronouns seem to be possible for each of these positions, it is used more seldom for subject and direct object and is more common for possessor. (16) a. táʔ kə̀m jɔ̀ h téc pléʔ kók nìʔ old.man REL go sell fruit hogplum MEDL ‘The old man who went to sell hogplums.’ b. jàʔ kə̀m nà ʔàh róːj k.nìʔ old.woman REL 3SG.F exist spirit MEDL ‘The old woman who was possessed by the spirit.’ c. míaŋ kə̀m kə̀ ʔɨ̀m k.nàːj tea REL 3SG.M hold.in.mouth DIST ‘The fermented tea leaves that he held in his mouth.’ d. pɨ̀ːŋ kə̀m r̀.wàːj pók nɔ̀ nàːj group REL tiger bite 3PL DIST ‘Those people whom the tiger bit.’

kammu

977

e. c.mə̀ʔ kə̀m kə̀ túk púas k.nìʔ rope REL 3SG.M tie deer MEDL ‘The string with which he had tied the deer.’ f. kòn kə̀m róːj ʔèːs túk hŕ.màːl k.nàːj person REL area.spirit tie soul DIST ‘The person whose soul the area spirits have tied.’ g. ʔàːj sí.nà.wàn.nàʔ kə̀m jòŋ kə̀ jɔ̀ t hḿ.ràŋ nìʔ HON PN REL father 3SG.M fuck horse MEDL ‘Síi-nàa Wàn-nà whose father fucked a horse.’

It is also possible, in some restricted contexts, to form relative clauses without kə̀m (17). This forces a non-restrictive meaning and its distribution seems to be restricted to noun phrases occurring clause-finally (existentials, predicatives and objects; but crucially not subjects). These constructions may even be appositions of two main clauses. An alternative construction which does not involve kə̀m is available when the direct object of a transitive clause is relativized: this construction is verb-initial, with the agent following the verb (c). (17) a. ʔàh jòːr nìʔ mòːj kòn jèt tà pɨ̀ːn exist orphan MEDL one CLF stay LOC ground ‘There was an orphan who stayed on the ground.’ b. kì mə̀h lɨ̀aŋ màʔ ʔòʔ kùʔ tŕ.tɔ̀ h ʔìʔ tèʔ PROX COP story mother 1SG like tell 1PL get ‘This is a story that my mother liked/used to tell us.’ c. kì mə̀h tráːk pháːn ʔòʔ PROX COP buffalo kill 1SG ‘This is the buffalo I killed.’ (K.R.)

The relativizer kə̀m can also be used with adjectival attributes, and then conveys a restrictive reading of the adjective (18). (18) kón kə̀m hḿ.mèʔ nìʔ pŕ.lòːj tà ʔòm píc lòːc, skirt REL new MEDL set.afloat LOC water abandon exhaust nɔ́ ŋ nə̀ŋ kə̀m tr̀.téc nìʔ remain only REL worn.out MEDL ‘All the new skirts he set afloat in the river and abandoned them, and only the worn-out ones remained.’

978

svantesson and holmer

4.2.5 Complement Clauses The quotative marker sáh, which can also function as a verb ‘to say’ (19a) often introduces complement clauses (b, c) to verbs of saying or cognition; it is, however, not obligatory (d). Examples (a) and (b) also exemplify indirect and direct speech, differing only in the use of subject pronoun in the complement clause. Complement clauses can also be nominalized with the nominalizer sə́ŋ (e); see also 5.1. A subordinate clause subject which is coreferent with the matrix subject is normally realized by the long-distance reflexive tè (e); see 5.2.2 for a further discussion of the properties of tè. (19) a. cḿ.rɔ̀ ʔ1 kɔ̀ sáh, nɔ̀ tə́ːm cɔ́ l kàːŋ tè1 man then say 3PL sing riddle house REFL ‘Then the men said that they [the women] were singing a riddle about their [the men’s] house.’ b. plàːŋ rə̀h màːɲ sáh, mè tàr ʔə̀h.mə́h, rwàːj PN rise ask say, 2SG.M run why tiger ‘Plàaŋ got up and asked, “Why are you running, tiger?” ’ c. km̀ .múʔ tŕ.kə̀t sáh lòːk kì ʔàh sáːm cèn Kammu believe say world PROX exist three level ‘The Kammu believe that this world has three levels.’ d. kə̀ màːɲ, ʔə̀ pɔ̀ cə̀ jɔ̀ h mə́h 3SG.M ask oh 2PL IRR go what ‘He asked, “Oh, where are you going?” ’ e. nà tŕ.kə̀t sə́ŋ tè cáːk p.nɨ́ːr st.́ mìat pr̀.jɔ̀ ŋ kǹ.cə̀ʔ 3SG.F believe NMLZ REFL tear wing cricket yesterday ‘She believed that she had torn the wings of the cricket yesterday.’

A complement clause as subject usually precedes its predicate (20). (20) kʰ.ɗíʔ mè tòːn cùːr pɨ̀an ʔìʔ tèʔ lə̀ʔ lɛ̀ʔ now 2SG.M jump go.down can 3PL get good ASRT ‘It was good that you could jump down for us now.’

4.2.6 Adverbial Clauses Conditional clauses can just be juxtaposed to the main clause (21a), but a subordinator, ʔùːn, ʔàn or tʰá can also be used (b). The words ʔùːn and ʔàn also

kammu

979

function as verbs meaning ‘to give, let’ in the Yùan and Kwɛ̀ɛn sub-dialects, respectively; tʰá is a Tai loan. The main clause normally takes the irrealis marker cə̀ (5.4.1). (21) a. mè ʔùːn ʔòʔ tèʔ sɨ́p màn, ʔòʔ cə̀ ʔə̀h 2SG.M give 1SG get ten piastre, 1SG IRR do ‘If you give me ten piastres, I will do it.’ b. ʔùːn klàːŋ mè k.nìʔ tə́ːm sáh, ʔìʔ cə̀ mʌ̀t give stone 2SG.M MEDL sing say, 1PL IRR take kʰɔ́ ŋ.krùa kì ʔùːn mè tèʔ lòːc merchandise PROX give 2SG.M get exhaust ‘If your stone sings, we will give you all this merchandise.’

Causal clauses are formed with the instrumental preposition jʌ̀ (22a) or the subordinator pʰɔ́ ʔ wà, borrowed from Tai (b). (22) a. jàːm jʌ̀ tè cə̀ pə́ pɨ̀an pə̀ʔ màh mɨ̀ s.kí cry INS REFL IRR NEG can eat rice day today ‘[I] am crying because I will not be able to eat anything today.’ b. kĺ.ʔàːk nɔ́ ŋ jíaŋ cén kʰ.ɗíʔ, pʰɔ́ ʔ.wà trɨ́al tà crow still black until now because pheasant paint kə̀ jʌ̀ kǹ.sáh 3SG.M INS charcoal ‘The crow is still black until now because the pheasant painted it with charcoal.’

Temporal clauses can also be formed by juxtaposition (23a), but can be marked with the word jèm ‘when’ (b). (23) a. kʰɨ́aŋ hn.túʔ nìʔ hóːc, mʌ̀t kòn háːn nìʔ máːn dig hole MEDL finish take person dead MEDL bury ‘When [they] had dug the hole, they buried the dead man.’ b. jèm kə̀ tàr tùʔ, nɔ̀ pɨ̀ːŋ kə̀m jèt tà hĺ.ŋə̀ːj when 3SG.M run flee 3PL group REL stay LOC above

980

svantesson and holmer

k.nìʔ k.rúk lòːc MEDL fall exhaust ‘When he ran away, all those higher up fell down.’

4.3 Syntax and Pragmatics Information structure is usually not marked in any special way, except by intonation; see 2.3. In some cases a topic–comment construction is used, where the topic is first stated and a sentence is then said as a comment (24a, b). As mentioned in 4.1, a direct object can be fronted for contrast or emphasis (c); see also (18). (24) a. kòn ʔàh kɨ̀n, jèm nɔ̀ ʔə̀h káp róːj tè person exist kɨ̀n.spirit when 3PL do COM spirit REFL nìʔ, nɔ̀ pɨ̀an cɔ́ t ʔòh, cɔ́ t pʰɛ́ MEDL 3PL can heal wound heal cut ‘As for people who are possessed by the kɨ̀n spirit, they can heal their wounds and cuts when they cooperate with their spirit.’ b. láʔ kə̀m ʔàh ˀjíak síːm, tá tèʔ ʔə́ːm leaf REL exist shit bird PROH get HORT ‘Don’t take those leaves that have bird shit on them!’ c. kʰ.ɗíʔ prìaŋ pháːn sɨ́aŋ, prìaŋ pə̀ʔ ʔàh sɨ́aŋ, sɔ́ ʔ now people kill pig people eat meat pig dog prìaŋ pə́ pháːn people NEG kill ‘Nowadays people slaughter pigs and eat pork, but dogs they don’t slaughter.’

4.4 Noun Phrases All modifiers of a noun phrase follow the head. Except for relative clauses marked with the relative marker kə̀m (see 4.2.4), their only syntactical or morphological marking is word order. Examples of modifiers are demonstratives (25a), often in combination with a pronoun (see 5.2.5). The medial demonstrative nìʔ (or k.nìʔ) often marks only definiteness (b); it is usually the last modifier of a definite noun phrase, serving as a kind of boundary marker, as e.g. in (d) and (i). Sometimes nàːj (or k.nàːj) serves the same purpose. Other modifiers include nouns (c), possessors (d) that consist of a noun or a noun phrase (or

kammu

981

a pronoun), adjectives (e) which may be followed by an adverb, other verbs (f), preposition phrases (g), numeral+classifier phrases (h) and relative clauses with or without the relativizer kə̀m (i, j). (25) a. kúŋ k.nàːj village DIST ‘that village’ b. tráːk nìʔ buffalo MEDL ‘the buffalo’ c. kɔ́ n cḿ.rɔ̀ ʔ child man ‘boy’ d. tráːk kàːŋ ʔà ʔòʔ nìʔ buffalo house uncle 1SG MEDL ‘my uncle’s family’s buffalo’ e. klàːŋ còŋ nàn cliff high very ‘a very high cliff’ f. ʔàh tóm meat boil ‘boiled meat’ g. kòn km̀ .múʔ tà kúŋ príːm person Kammu LOC village old ‘the Kammu people in the old village’ h. hń.táʔ cét sén tail seven CLF ‘seven tails’ i. h.jíar kə̀m nɔ̀ pháːn jèm ʔə̀h kàːŋ nìʔ chicken REL 3PL kill when make house MEDL ‘the chicken that they killed when they built the house’

982

svantesson and holmer

j. kɔ́ n ʔàh jòŋ ʔàh màʔ child exist father exist mother ‘a child who has a father and a mother’

A noun that is not a possessor follows the head immediately and precedes other modifiers (26a). An adjective usually precedes a possessor (b). A numeral+classifier phrase usually follows an adjective (a) and a possessor (c), but it can be placed rather freely and may precede an adjective (d); it can be placed after the determiner (c) and even outside the noun phrase (e). (26) a. sɨ́aŋ tlóh klɔ́ k mòːj tó pig male.animal white one CLF ‘one white boar’ b. sɨ́aŋ klɔ́ k tè pig white REFL ‘my white pig’

k.nìʔ MEDL

c. klə̀ʔ nà nìʔ mòːj sén hair 3SG.F MEDL one CLF ‘one of her hairs’ d. túːt crìʔ mòːj túːt nám tree banyan one CLF big ‘one big banyan tree’ e. kə̀ píɲ k.néʔ háːn tà kàːŋ ʔòʔ mòːj tó 3SG.M shoot rat die LOC house 1SG one CLF ‘He shot one rat to death in my house.’

5

Word Classes

There are three open word classes, nouns (5.1), verbs (5.4) and expressives (see 3.2.3). There is some overlapping between these word classes, and several words can function both as nouns and as verbs, e.g. màh ‘rice, food; to eat’ and rɛ̀ŋ ‘strong; strength’. In a few cases, verbs function as expressive roots.

kammu

983

5.1 Nouns Since there is no inflectional morphology in Kammu, grammatical categories such as number, case or definiteness are not expressed morphologically on the nouns. Verbal nouns can be derived morphologically from verbs, see 3.2.1. Compounding is a common way of forming nouns, e.g. c.ʔáːŋ plùʔ [bone thigh] ‘femur’. Class nouns such as síːm ‘bird’, káʔ ‘fish’, túːt ‘plant, tree’, c.mə̀ʔ ‘vine’ and kòn ‘person’ are commonly used as heads of compounds, e.g. síːm l̀.wàːŋ [bird heaven] ‘bee-eater’, túːt crìʔ [tree banyan] ‘banyan tree’. With class nouns, the compound (e.g. túːt crìʔ) and the modifier alone (crìʔ) can often be used with the same meaning. There are some general nominalizers, including sə́ŋ, kwàːm and káːn; all these are Tai loans, but sə́ŋ is an older loan than the others and is more integrated in Kammu grammar. Examples are sə́ŋ màh [NMLZ eat] ‘food’, kwàːm háːn [NMLZ die] ‘death’ and káːn tèʔ kḿ.ràʔ [NMLZ get wife] ‘marriage’; see also (19e) for an example with sə́ŋ. 5.1.1 Names and Terms of Address Most Kammu personal names have a meaning, and almost all of them can be used for both men and women. There are rather few personal names. A person carries the name of his or her father (or, if the parents are not married, mother) added after the personal name as a possessive modifier. For example, the name Kàm Ràw means ‘Ràw’s Kàm’ and consists of the bearer’s personal name, Kàm, and his father’s name, Ràw. It is considered impolite to use the personal name for addressing elderly people, and the appropriate kinship term should be used instead (see (52)). 5.2 Pronouns and Determiners 5.2.1 Personal Pronouns The personal pronouns have three number categories: singular, dual and plural. The second and third person singular pronouns differentiate genders. The feminine forms refer only to human females. It can be noted that the dual and plural forms are related to the feminine singular forms rather than to the masculine ones (27).

984

svantesson and holmer

(27) Kammu personal pronouns

1st person 2nd person 3rd person

feminine masculine feminine masculine

singular

dual

plural

ʔòʔ pà mè nà kə̀

ʔàʔ s.pá

ʔìʔ pɔ̀

s.ná

nɔ̀

There are no special possessive pronouns, but personal pronouns in the possessive modifier position are used; see e.g. (25d). 5.2.2 Reflexive Pronoun There is a reflexive pronoun tè, unmarked for person (28a–c), which usually refers to the subject of the sentence, although this subject need not be overt (d). In some cases, the referent is not the subject but is taken from the context (e). The reflexive pronoun tè can also be used instead of ʔòʔ as a first person singular pronoun referring to the speaker (f, g). In this respect, tè is actually more aptly described as a logophor than as an anaphor, although by far the most common use is still anaphoric (if there is any possible antecedent available, tè will be interpreted as coreferent with it). (28) a. nà lə̀ŋ klèʔ tè tèʔ 3SG.F scold husband REFL get ‘She scolded her husband.’ b. ʔòʔ cə̀ jɔ̀ h sɔ́ k tè tèʔ kḿ.ràʔ 1SG IRR go seek REFL get wife ‘I will go and seek a wife for myself.’ c. kʰ.ɗíʔ mè làːk ʔìʔ sáh klàːŋ tè tə́ːm now 2SG.M lie 3PL say stone REFL sing ‘Now you told us a lie saying that your stone sang.’ d. pán sí pàŋ mʌ̀t k.tóŋ nìʔ kʰúa pń.màh kɔ́ n tè when day next take egg MEDL fry CAUS-eat child REFL ‘The next day [he] fried the eggs and fed his children.’

kammu

985

e. klàːŋ jèt tà km̀ .póŋ sís màʔ tè k.nìʔ stone stay LOC head sleep mother REFL MEDL ‘The stone was at the head end [of the grave] where his [dead] mother slept.’ f. tè kɔ̀ cə̀ ʔə̀h jʌ̀ tɛ́k ɗɛ́ REFL then IRR do INS PN also ‘I will do the same to Tɛ́ɛk then’ g. màʔ tè cúʔ pə̀ʔ k.táːm mother REFL want eat crab ‘My mother wants to eat crabs.’

In (28b) above we see reflexive tè used as an indirect object. This tè can also be used as a direct object, but only in an embedded clause, and then refers to the subject of the main clause (29a). However, when used as a possessor, it may also refer to the subject of its own clause (b, c). There is instead a local reflexive tè sɛ́t [REFL self] which can serve in object position and refer to the subject of its own clause (d). A personal pronoun may be used instead of the reflexive to make the reference more clear (e). (29) a. t.kɔ́ k1 kùːɲ sár rə̀h cə̀ pók tè. malkoha see civet rise IRR bite REFL ‘The malkoha saw the civet come up to bite it.’ b. kə̀1 mèc jòŋ tè1 cə̀ jɔ̀ h tɨ̀aŋ màʔ tè1 2SG.M hear father REFL IRR go fetch mother REFL ‘He heard that his father was going to fetch his mother.’ c. ɲàːr1 prá.tàn kùːɲ kə̀2 pháːn sɨ́aŋ tè2 k.nìʔ, kə̀1 PN not.yet see 3SG.M kill pig REFL MEDL 3SG.M tàr jɔ̀ h káːl run go before ‘Before Ñàar had seen him [Tɛ́ɛk] slaughter his [Tɛ́ɛk’s] pig, he [Ñàar] ran in advance.’ d. pr̀.tɨ́ŋ kə̀ kóh kɨ̀t tè sɛ́t owner 3SG.M cut cut REFL self ‘Its owner cut himself.’

986

svantesson and holmer

t.lóh pàːr tó péʔ tó rɔ̀ t e. nà1 kùːɲ lm̀ .pòʔ 3SG.F see cattle male two CLF three CLF come cɔ́.rɔ̀ nà1 nìʔ towards 3SG.F MEDL ‘She saw some two or three bulls coming towards her.’

5.2.3 Interrogative Pronouns The major interrogative pronouns are animate mə̀ʔ ‘who?’ and inanimate mə́h ‘what?’; see (9) for examples. Other question words formed with them include tà mə̀ʔ [LOC who] ‘where’ (30a), ʔə̀n mə̀ʔ [as who] ‘how’ (b) and ʔə̀h mə́h [do what] ‘why’ (c). (30) a. cə̀ máːn cèʔ tè tà.mə̀ʔ IRR bury grandchild REFL where ‘Where shall [I] bury my grandchild?’ b. mè kʰór ʔə̀n.mə̀ʔ 2SG.M dig how ‘How did you dig?’ c. ʔò, mè jàːm ʔə̀h.mə́h, jòːr oh 2SG.M cry why orphan ‘Oh, why are you crying, orphan?’

5.2.4 Indefinite Pronouns Indefinite pronouns are expressed by ɓéc tè in affirmative environments (31a, b), and in negated evironments they are expressed using the corresponding interrogative pronoun, mə̀ʔ or mə̀h (c, d). (31) a. ʔə̀, ɓéc.tè tə́ːm tà ʔòm súʔ mɨ̀an-mɨ̀an oh INDF sing LOC water down nice-nice ‘Oh, someone is singing very nicely down there at the river.’ b. kɔ́ n tè tùʔ tà ɓéc.tè child REFL run.away LOC INDF ‘My child ran away somewhere.’ c. kráːŋ pə́ ʔùːn mə̀ʔ tèʔ prevent NEG give who get ‘[He] prevented everyone from marrying [her].’

987

kammu d. màt ʔòʔ ʔès, plɔ́ kùːɲ mə́h eye 1SG swell NEG.EMPH see what ‘My eyes were swollen and I could not see anything at all.’

5.2.5 Demonstratives The demonstratives differentiate four degrees of closeness in relation to the speaker and listener, as well as a height dimension: (32)

kì nìʔ nàːj hóʔ ɗɨ́ŋ súʔ

proximal, close to speaker medial, close to listener distal, away from speaker and listener remote distal, far away up down

Demonstratives are usually combined with a third person pronoun, most often singular masculine kə̀, with which it forms the reduced forms k.kì (or, for phonetic reasons, just kì), k.nìʔ, k.nàːj, etc. The feminine pronoun nà and other third person pronouns occur in this position as well, see e.g. (11b), but k.kì, k.nìʔ, etc., seem to be default forms that can be used also with female referents. The demonstrative nìʔ (or k.nìʔ) is often used as a marker of definiteness without any deictic interpretation; see 4.4). 5.3 Quantity Words and Measure Words The Kammu basic numerals are given in (33). Except for the first three ones, all are Tai loans. (33) Kammu Numerals 1 mòːj 8 pɛ́t 2 pàːr 9 káw 3 péʔ 10 sɨ́p 4 sí 20 sáːw 5 há 100 rɔ̀ j 6 rók 1,000 pàn 7 cét 10,000 mɨ́ːn

Other numerals are formed using the decimal system (34); in combination with other numerals, the Tai loans ʔèt, sɔ́ ŋ, sáːm are used for ‘one’, ‘two’, ‘three’ (b).

988

svantesson and holmer

(34) a. há pàn rók rɔ̀ j káw sɨ́p sí five thousand six hundred nine ten four ‘5,694’ (K.R.) b. sɔ́ ŋ rɔ̀ j sáːm sɨ́p ʔèt two hundred three ten one ‘231’ (K.R.)

A numeral is usually not combined directly with a noun but is constructed with a classifier (35a, b). Each noun normally takes one specific classifier. Some nouns, e.g. kúŋ ‘village’, and t.lɔ̀ ŋ ‘log’ are their own classifiers (c). A numeral is sometimes used alone, without a classifier or a noun (9a). The classifier itself can sometimes be omitted if it is identical to a class noun: in (d), a classifier túːt is not necessary but it may be used. (35) a. cḿ.rɔ̀ ʔ pàːr kòn man two CLF ‘two men’ b. tíʔ mòːj plàh hand one CLF ‘one hand’ c. t.lɔ̀ ŋ nám mòːj t.lɔ̀ ŋ log big one log ‘one big log’ d. túːt crìʔ mòːj (túːt) nám tree banyan one CLF big ‘one big banyan tree’

Most classifiers are Tai loans, but some are indigenous Kammu words, e.g. cɨ̀aŋ ‘leg’ which is the classifier for tís ‘mushroom’. The following classifiers are common in ordinary language: (36) Common Kammu classifiers ʔàn small short things; things for which other classifiers cannot be used ɓɔ́ n places, areas cɨ̀aŋ mushrooms kòn human beings

kammu

989

láːŋ things made of bamboo, traps, mouth harps làm boats, aeroplanes láŋ houses, mosquito nets, weirs lém long and slender things, e.g. pencils, arrows, poles, firewood, bamboo stems, books lùːk (same as núaj) màʔ female animals núaj round things that can contain something, e.g. fruits, drums, houses, eggs, baskets, mortars, termite hills pʰɨ́ːn clothes, mats, trays plàh one of a pair, e.g. hand plɔ́ k necklaces, rings sén strings, hairs, threads, chains, roads tə̀ːr sheets, leaves, tiles tʰían knives, swords, axes tì places, field areas t.lóh male animals tó animals; spirits; letters of the alphabet; tattoos túːt growing plants

Measures are combined directly with numerals (37a, b). Nouns denoting containers of different kinds can be used as measures (c). (37) a. sɨ́p.sɔ́ ŋ kám twelve handbreadth ‘twelve handbreadths’ b. há pí five year ‘five years’ c. màh cét réːj rice seven rice.steamer ‘seven rice-steamer-fuls of steamed rice’

5.4 Verbs Kammu verbs can be defined as negatable words, i.e. words that can be preceded by the negation pə́ ‘not’ (5.4.4). This includes adjectives, which differ from other verbs in some respects. For example, they can modify nouns without an intervening relativizer kə̀m normally required by verbs, and they can also

990

svantesson and holmer

take part in comparative constructions with the words kwá or lɨ́a ‘more than’ (38), both Tai loans. If the standard of comparison is a word like prìaŋ ‘others’ or indefinite mə̀ʔ ‘anyone’, a kind of superlative is formed (c, d). (38) a. púas ʔàn.tàʔ.ràːj kwá ràːŋ.cɔ́ k barking.deer dangerous COMPAR laughing.thrush ‘The [spirit of the] barking-deer is more dangerous than that of the laughing-thrush.’ b. kàːŋ ʔòʔ lə̀ʔ lɨ́a kàːŋ mè house 1SG good COMPAR house 2SG.M ‘My house is better than yours.’ c. r̀.mèːt jíaŋ kwá prìaŋ Lamet black COMPAR others ‘The Lamet are blackest.’ d. nà plìa tó lɨ́a mə̀ʔ 3SG.F pretty EMPH COMPAR who ‘She was prettiest of all.’

Causative, reciprocal, anticausative and some other verbs can be derived morphologically from other verbs, see 3.2.2. As mentioned in 4.1, there is no clear difference between transitive and intransitive verbs, and there are no ditransitive verbs; indirect object is expressed with a serial verb construction (see 4.2.1). 5.4.1 Tense, Mood and Aspect Since Kammu is an isolating language, tense, mood and aspect are not expressed by inflections. Tense markers with a purely temporal meaning are lacking, but there are markers for irrealis mood, and for perfect and habitual aspect. The Tai progressive marker kḿ.làŋ has recently been borrowed into Kammu, but it occurs very seldom in the folk tales and will not be treated here. Irrealis mood denotes non-actual events. The Kammu irrealis marker cə̀ is probably an old Tai loan but it is fully integrated in Kammu. It immediately precedes the verb and only the negation and some other sentence abverbs can intervene between cə̀ and the verb. Irrealis is used when the speaker does not know whether an event actually takes place or not (39a) or disbelieves someone else (b). It can also show an intention that may or may not be realized (c). It is more or less obligatory for future events (d) and in the main clause of a sentence with a conditional clause (e). It is not used in imperatives, see (10).

kammu

991

(39) a. kə̀m tə́ːm mɨ́an cə̀ mə̀h klàːŋ kì lɛ̀ʔ REL sing seem IRR COP stone PROX ASRT ‘The one who sang seems to be this stone.’ b. ʔò, cə̀ pə́ mə̀h klèʔ pà ʔə̀n.mə̀ʔ oh, IRR NEG COP husband 2SG.F how ‘Oh, how can it be that I am not your husband?’ (said by a man whose wife had forgotten him) c. mə̀ʔ rə̀h s.ŋə́ːj cə̀ tòːn kɔ̀ ŋɔ̀ ʔ INDF go.up look IRR jump then afraid ‘Whoever climbed up to have a look if they would jump became afraid.’ d. ʔòʔ cə̀ jɔ̀ h sɔ́ k tè tèʔ kḿ.ràʔ 1SG IRR go seek REFL get wife ‘I will go and seek a wife for myself.’ e. tʰá pə́ ʔàh túːt t.wáʔ k.nàːj, mè kɔ̀ cə̀ háːn if NEG exist plant fern DIST 2SG.M then IRR die ‘If it had not been for that fern tree, you would have died.’

Perfect aspect typically denotes a situation that occurred before present time (or before some other reference time) and has ‘present relevance’, so that its result remains and is relevant for the discourse situation at the present time (or at the reference time). Perfect aspect is marked by the word hóːc, which may also function as a verb meaning ‘to finish’ (40a). The Tai loan lɛ̀w is sometimes used instead of (or together with) hóːc. As an aspect marker (b, c), hóːc is either preverbal or clause-final. If the situation is static, hóːc marks inchoative aspect (d). Negated perfect is expressed with the negation prá (e). (40) a. mə̀ʔ pə́ hóːc hń.túʔ cń.tràŋ tè pə́ pɨ̀an pə̀ʔ màh who NEG finish hole pole REFL NEG can eat rice ‘Those who have not finished their hole for the pole will not get anything to eat.’ b. túːt crì nìʔ tíːŋ hóːc, wáːŋ nìʔ kɔ̀ kr̀.lɨ̀aŋ háːn plant banyan MEDL fall finish, PN MEDL then fall die ‘When the banyan tree had fallen, Wáaŋ fell down dead.’

992

svantesson and holmer

c. tráːk kə̀m mè téc k.nàːj, nɔ̀ pháːn kə̀ hóːc buffalo rel 2SG.M sell DIST 3PL kill 3SG.M finish ‘That buffalo which you sold, they have already slaughtered it.’ (K.R.) d. kɔ́ n sís hóːc child sleep finish ‘The child had fallen asleep.’ e. nɔ̀ lʌ̀ nà prá wèc 3PL say 3SG.F NEG.PRF return ‘They said that she had not come home.’

Habitual aspect is expressed with the marker kùʔ, which also functions as a verb meaning ‘to like’ (41a). Sometimes the two meanings are difficult to distinguish (b), but in other cases the habitual meaning is the only possible one (c). (41) a. ʔàh cḿ.rɔ̀ ʔ pàːr kòn, s.ná kùʔ cm̀ .kɨ́n mòːj kón exist man two CLF 3DU like woman one CLF ‘There were two men who both liked one woman.’ b. kì mə̀h lɨ̀aŋ màʔ ʔòʔ kùʔ tŕ.tɔ̀ h ʔìʔ tèʔ PROX COP story mother 1SG like tell 1PL get ‘This is a story that my mother liked/used to tell us.’ c. pə́, tàːŋ kúŋ ʔìʔ pʰɔ́ n tók, mə̀h nə̀ŋ màr kùʔ tók NEG lizard village 1PL never bite COP only snake like bite ‘No, the lizards in our village never bite, it is only the snakes that bite.’

5.4.2 Causative Verbs As seen in 3.2.2, causatives can be formed morphologically. For monosyllabic verbs, the prefix pn/pC- is used and for sesquisyllabic words, the infix -m-; both are semi-productive. For some verbs, there is a contrast between pn/pC- and another prefix p-, in which case the former emphasizes the result and the latter the intention. The infix -m-, which never contrasts with the prefix p-, seems to be neutral between these interpretations. There is also a periphrastic causative formed with the marker tòk, which emphasizes the result (42a); in contrast, the infixed causative (in this case km̀ .sés < k.sés ‘to fall’) can have an interpretation emphasizing the intention (b).

kammu

993

(42) a. kə̀ lə̀ tòk k.sés róːŋ tè 3SG.M then CAUS fall mouth.harp REFL ‘He then dropped his mouth harp (by mistake).’ b. k.tóŋ h.jíar k.nìʔ tó plɔ́ tń.làh, nà k.sés egg hen MEDL EMPH NEG.EMPH break, 3SG.F fall kə̀, kə̀ tó plɔ́ tń.làh lɛ̀ʔ 3SG.M, 3SG.M EMPH NEG.EMPH break ASRT ‘The egg did not break at all, she threw it down, but it still did not break.’

Causative meaning can also be expressed with serial verb constructions with the verbs ʔùːn ‘to give, to let’ (43a) or ʔɔ̀ r ‘to lead’ (b). (43) a. ʔòʔ cə̀ ʔùːn klàːŋ nìʔ tə́ːm 1SG IRR give stone MEDL sing ‘I shall make that stone sing.’ b. kə̀ ʔɔ̀ r màʔ tè nìʔ tùʔ tʰɛ́m 3SG.F lead mother REFL MEDL run.away again ‘He made his mother run away again.’

5.4.3 Existential Verbs The verb ʔàh ‘exist, there is; have’ (44a) functions as an existential verb in Kammu (b). The copula is mə̀h, see (7g). (44) a. jòŋ k.nìʔ kə̀ ʔàh kɔ́ n kɨ̀ʔ nàn father MEDL 3SG.M exist child many very ‘The father, he had very many children.’ b. tà tr̀.sáːp ʔòm ʔù nìʔ ʔàh klàːŋ còŋ nàn LOC mouth river PN MEDL exist cliff high very ‘At the mouth of the U river there is a very high cliff.’

5.4.4 Negation There are four negations in Kammu, neutral pə́ (45a–c), perfective prá (40e), emphatic plɔ́ (31d) and pʰɔ́ n ‘never’ (41c). There is also a prohibitive negation tá ‘don’t!’ (see 4.1.2). A negation immediately precedes the verb that it negates, and has scope over this verb but not over verbs preceding it.

994

svantesson and holmer

(45) a. sɨ́aŋ mè pə́ lə̀ʔ pig 2SG.M NEG good ‘Your pork is not good.’ b. kə̀ pə́ nə̀ːŋ ŋɔ̀ r 3SG.M NEG know road ‘He did not know the way.’ c. mè pə́ mə̀h klèʔ ʔòʔ 2SG.M NEG COP husband 1SG ‘You are not my husband.’

5.5 Prepositions There are two Kammu words which function only as prepositions: locative tà and instrumental jʌ̀. Locative tà can denote place (46a), or direction to (b) or from (c). As seen in (7d, e), the locative preposition is not always obligatory. Spatial relations can also be expressed with locative nouns (46d, e). (46) a. nɔ̀ jèt tà réʔ 3PL stay LOC field ‘They were staying in the field.’ b. táʔ cɛ̀ʔ rɔ̀ t tà kúŋ km̀ .múʔ old.man Lao come LOC village Kammu ‘A Lao man came to a Kammu village.’ c. nà cùːr tà kàːŋ tè 3SG.F go.down LOC house REFL ‘She went down [=out] from her house.’ d. ʔòʔ cə̀ jèt tà kń.trùːm kàːŋ 1SG IRR stay LOC underside house ‘I will stay under the house.’ e. kə̀ kùːɲ k.néʔ tàr tà hĺ.ŋə̀ːj tm̀ .rìaŋ kàːŋ s.ná k.nìʔ 3SG.M see rat run LOC overside floor house 3DU MEDL ‘He saw a rat running on the floor of their house.’

The preposition jʌ̀ denotes the instrument of an action (47a); as seen in (22a), it can also be used as a causal subordinator, and it also marks oblique objects

995

kammu

of different kinds (12b). Instrument can also be constructed with a serial verb construction with the verb mʌ̀t ‘to take’ (b) or tèʔ ‘to get’ (c). (47) a. kə̀ cŋ́ .kùːl jʌ̀ hḿ.púːr.t.nɔ́ h tè 3SG.M point INS lip REFL ‘He pointed with his lips.’ b. cɨ́aŋ mʌ̀t k.máŋ tè cráh klàːŋ nìʔ PN take sword REFL split cliff MEDL ‘Cɨáŋ split the cliff with his sword.’ c. tèʔ tíʔ tè kʰór get hand REFL dig

‘[He] dug with his hands.’

5.6 Conjunctions Coordination of nouns (or noun phrases) is often shown by simple juxtaposition (48a) or is marked with the conjunction káp ‘with, and’ (b) which is a Tai loan. The third person dual pronoun s.ná is also used to coordinate two nouns forming a natural pair (c). (48) a. príʔ máːr chilli salt ‘chilli and salt’ b. húal káp kɔ̀ j bear COM treeshrew ‘the bear and the treeshrew’ c. pré s.ná mɔ̀ k brother 3DU sister ‘the brother and the sister’

6

Semantic Fields

Here we will present two structured semantic fields. The first one is the terminology of the agricultural year which is regulated by the Kammu calendar. The second one is kinship terminology, which serves to regulate social relations, in particular marriage rules. Together they form a basis for the material and spiritual culture of traditional Kammu society.

996

svantesson and holmer

6.1 Agriculture and Calendar Glutinous rice (ŋɔ́ ʔ) cultivated in dry swidden (slash-and-burn) fields (réʔ) is the staple food of the Kammu, and the agricultural year is centred around the different types of work done in the rice fields. The Kammu use a lunisolar calendar, where each month begins on a new moon. A lunar month (ɗɨ́an) is 29 or 30 days, and the year usually begins on a new moon near the beginning of December in the Western calendar. Because a period of 12 lunar months is shorter than one solar year, an extra month must be intercalated approximately every three years. Four distinct seasons are recognized. The approximate correspondences between the Kammu months, the Western months, the seasons and the agricultural work periods are given in (49). Like other Southeast Asian peoples, the Kammu use two series of cyclic words for counting days and years, one consisting of ten and the other of twelve terms (50). (49) Kammu seasons, months and agricultural work periods

season

Kammu month

hŕ.nɨ̀m ‘cold’

1. ɗɨ́an cíaŋ (December) 2. ɗɨ́an ɲì (January) 3. ɗɨ́an sáːm (February) 4. ɗɨ́an sí (March)

p.nɔ́ ŋ ‘hot’

5. ɗɨ́an há (April) 6. ɗɨ́an rók (May) 7. ɗɨ́an cét (June)

k.màʔ ‘rainy’

8. ɗɨ́an pέt (July) 9. ɗɨ́an káw (August) 10. ɗɨ́an sɨ́p (September)

agricultural work

— rɨ́am ‘clearing’ kùːc ‘1st burning’ púːr ‘2nd burning’ c.mɔ̀ l ‘sowing’ kòh ‘1st weeding’ pɔ̀ j ‘2nd weeding’ cràʔ ‘3rd weeding’

11. ɗɨ́an sɨ́p ʔèt (October) rm̀ .ɲàŋ ‘chilly’

12. ɗɨ́an sɨ́p sɔ́ ŋ (November)

hɔ́ t ‘harvest’

kammu

997

(50) Kammu cyclic words 10-term cycle 12-term cycle 1 káːp cə́ 2 ɗáp plɔ́ 3 r̀.wàːj ɲì 4 mə̀ŋ mɔ́ 5 plə́k sí 6 kát sə́ 7 kót s.ŋá 8 rùaŋ mòt 9 tɔ́ sén 10 ká ràw 11 sét 12 kə̀

The days of the ten-day cycle (called sí káːp, sí ɗáp, etc.) form a kind of ten-day week important for many work taboos and for other purposes. Most people know their ‘birthday’ (sí nɛ̀ʔ)—the day in the ten-day week when they were born. The two series are combined to form a 60-term cycle, which is used for counting years and days. The corresponding years of the Western calendar are shown in (51). For example, the year 2013 (and also 1953 and 2073) is ká-sə́. See further Lindell et al. (1982) for more information on the Kammu calendar and agricultural year. (51) 1984 1985 1986 1987 1988 1989 1990 1991 1992 1993 1994 1995

Kammu 60-year cycle káːp-cə́ 1996 r̀.wàːj-cə́ 2008 plə́k-cə́ 2020 kót-cə́ 2032 tɔ́ -cə́ ɗáp-plɔ́ 1997 mə̀ŋ-plɔ́ 2009 kát-plɔ́ 2021 rùaŋ-plɔ́ 2033 ká-plɔ́ r̀.wàːj-ɲì 1998 plə́k-ɲì 2010 kót-ɲì 2022 tɔ́ -ɲì 2034 káːp-ɲì mə̀ŋ-mɔ́ 1999 kát-mɔ́ 2011 rùaŋ-mɔ́ 2023 ká-mɔ́ 2035 ɗáp-mɔ́ plə́k-sí 2000 kót-sí 2012 tɔ́ -sí 2024 káːp-sí 2036 r̀.wàːj-sí kát-sə́ 2001 rùaŋ-sə́ 2013 ká-sə́ 2025 ɗáp-sə́ 2037 mə̀ŋ-sə́ kót-s.ŋá 2002 tɔ́ -s.ŋá 2014 káːp-s.ŋá 2026 r̀.wàːj-s.ŋá 2038 plə́k-s.ŋá rùaŋ-mòt 2003 ká-mòt 2015 ɗáp-mòt 2027 mə̀ŋ-mòt 2039 kát-mòt tɔ́ -sén 2004 káːp-sén 2016 r̀.wàːj-sén 2028 plə́k-sén 2040 kót-sén ká-ràw 2005 ɗáp-ràw 2017 mə̀ŋ-ràw 2029 kát-ràw 2041 rùaŋ-ràw káːp-sét 2006 r̀.wàːj-sét 2018 plə́k-sét 2030 kót-sét 2042 tɔ́ -sét ɗáp-kə̀ 2007 mə̀ŋ-kə̀ 2019 kát-kə̀ 2031 rùaŋ-kə̀ 2043 ká-kə̀

998

svantesson and holmer

6.2 Kinship Terms The Kammu kinship system depends crucially on marriage relations, which are as important as consanguinity. The people of a Kammu village are divided into three marriage groups (pɨ̀ːŋ), each consisting of one or more patrilineal totem groups (sə́ŋ táʔ) associated with a totem, which is a plant, a bird or a land animal. A person normally belongs to the same group and sub-group as his or her father. A woman from a plant group must marry a man from the corresponding ‘wife-taking group’ (pɨ̀ːŋ cèʔ), which is the land animal group, and a man from a plant group must marry a woman from the corresponding ‘wife-giving group’ (pɨ̀ːŋ ʔèːm), which is the bird group. Finally, a land animal woman must marry a bird man. Each group is thus wife-taking in relation to one of the other groups, and wife-giving in relation to the other one. Usually, the same term can refer both to consanguineous relatives and to relatives by marriage. One example is ʔèːm which can mean both ‘maternal uncle’ (mother’s brother) and ‘father-in-law’ (wife’s father). Since a man’s mother has married his father, she must belong to his (and his father’s) wifegiving group. Thus this man’s uncle (his mother’s brother) also belongs to this group, like the uncle’s daughter, who is then eligible to marry him. Her father (the uncle) then becomes his father-in-law. A man’s maternal uncle is thus regarded as his prototypical father-in-law. In practice a man is not allowed to marry his uncle’s daughter, who is regarded as too close kin, but he should marry a woman who has the same kin relation to him as she has, i.e. a woman of his own generation belonging to the wife-giving group (and usually younger than he). By extending the kin relations in this way, everybody in a Kammu village has a kin relation to everybody else. The kinship terms are defined by four factors: generation, marriage group, sex and relative age, but all these factors are not used in all generations. For example, táʔ basically means ‘grandfather’, but by extension each man in the grandparent generation is a táʔ. The Kammu kinship terms are listed in (52). For kin of one’s own generation and the child generation, men and women use different terms, as shown in the table. Terms separated by a slash ‘/’ refer to older and younger relatives. Thus, táːj is a man’s older brother (or a woman’s older sister) and hɛ́m is a man’s younger brother (or a woman’s younger sister). For more information on Kammu kinship terminology, see the introduction to Svantesson et al. (2014) and Lindell, Samuelsson & Tayanin (1979). (52) Kammu kinship terms Grandparent generation: grandfather táʔ grandmother jàʔ

999

kammu Parent generation: father mother father’s brother father’s sister mother’s brother mother’s sister wife’s father wife’s mother husband’s father husband’s mother

jòŋ màʔ tèːŋ/ʔà kɨ́n ʔèːm tèːŋ/màʔ ʔèːm ʔèːm kúːɲ kɨ́n

Own generation: brother, male parallel cousin sister, female parallel cousin father’s sister’s son mother’s brother’s son father’s sister’s daughter mother’s brother’s daughter wife/husband wife’s/husband’s brother wife’s/husband’s sister brother’s wife sister’s husband

in relation to a man in relation to a woman táːj/hɛ́m pré mɔ̀ k táːj/hɛ́m jɛ̀ʔ táːj/hɛ́m jɛ̀ʔ kɔ̀ ʔ kɔ̀ ʔ ʔàŋ pə́ːj/hɛ́m nɨ̀ŋ kḿ.ràʔ klèʔ jɛ̀ʔ táːj/hɛ́m pə́ːj/hɛ́m ʔàŋ pə́ːj/hɛ́m nɨ̀ŋ jɛ̀ʔ táːj/hɛ́m

Child generation: child brother’s son brother’s daughter sister’s child son’s wife daughter’s husband

in relation to a man in relation to a woman kɔ́ n kɔ́ n kɔ́ n, cèʔ k.mɔ́ n kɔ́ n, cèʔ ʔàm cèʔ kɔ́ n, cèʔ ʔàm ʔàm cèʔ, kʰə́ːj cèʔ, kʰə́ːj

Grandchild generation: grandchild

cèʔ

7

Sample Text

kə̀ k.nìʔ ʔàh kɔ́ n mòːj kòn ʔà.ɲùʔ sí há rók pí ʔə̀n.nàːj. 3SG.M MEDL exist child one CLF age four five six year like.that

1000

svantesson and holmer

nɔ̀ jèt tà réʔ jèm s.ŋíʔ, kə̀ cúʔ sís jʌ̀ kḿ.ràʔ 3PL stay LOC field when day 3SG.M want sleep INS wife tè. REFL

kə̀ jɔ̀ h ɲɨ́p k.táːm kàːj mʌ̀t c.mə̀ʔ túk cɨ̀aŋ k.táːm 3SG.M go catch crab then take string tie foot crab

nìʔ, lə̀ ʔùːn kɔ́ n tè nìʔ jɔ̀ h pǹ-tɨ́ːr k.táːm nìʔ. MEDL then give child REFL MEDL go CAUS-fly crab MEDL kɔ́ n jɔ̀ h pǹ-tɨ́ːr k.táːm nìʔ tà nɔ̀ k c.ʔòʔ. jòŋ cə̀ child go CAUS-fly crab MEDL LOC outside field.house father IRR sís jʌ̀ kḿ.ràʔ, kɔ́ n kàːj tàr kàːj nìʔ tʰɛ́m. kɔ́ n lʌ̀, sleep INS wife child then run return MEDL again child say kə̀ pə́ tɨ́ːr, jòŋ. jòŋ kàːj lʌ̀, jɔ̀ h càʔ-càʔ, kə̀ tɨ́ːr 3SG.M NEG fly father father then say go far-far 3SG.M fly lɛ̀ʔ. kɔ́ n kàːj jɔ̀ h-jɔ̀ h càʔ ʔèt, k.táːm kɔ̀ pə́ tɨ́ːr. kɔ́ n ASRT child then go-go far a.bit, crab then NEG fly child kàːj tàr kàːj hóp jòŋ káp màʔ tʰɛ́m. kə̀ pə́ tɨ́ːr, then run return meet father COM mother again 3SG.M NEG fly jòŋ. jòŋ kàːj lʌ̀, jɔ̀ h tà sŕ.nàːŋ kàːŋ ʔèːm ɗɨ́ŋ, kə̀ father father then say go LOC boundary house uncle up 3SG.M tɨ́ːr lɛ̀ʔ. kɔ́ n kàːj tàr jɔ̀ h tà sŕ.nàːŋ kàːŋ ʔèːm. kɔ́ n fly ASRT child then run go LOC boundary family uncle child nɛ̀ʔ . . . nɔ̀ kùʔ túk síːm lɛ́ʔ ʔàːm.pláːm ʔùːn kɔ́ n tè hɔ́ j. small 3PL like tie bird and butterfly give child REFL play tʰá kə̀ túk síːm, síːm cə̀ tɨ́ːr kə̀ hɔ́ j, lə̀ jèt káp if 3SG.M tie bird bird IRR fly 3SG.M play then stay COM jòŋ màʔ. jòŋ túk k.táːm ʔùːn kɔ́ n tè jɔ̀ h càʔ-càʔ father mother father tie crab give child REFL go far-far (Told by Kàm Ràw)

kammu

1001

There was a man who had a child who was around four, five or six years old. They were staying in the rice field in the day, and he wanted to sleep with his wife. He went to catch a crab, tied a string to its legs and told his child to go and make the crab fly. The child went out from the field-house and tried to make the crab fly. When the father was going to sleep with his wife, the child came running back and said, “It does not fly, father!” The father said, “Go far, far away, and it will fly!” The child then went a bit further away, but the crab still did not fly. The child came running back to his parents again, “It does not fly, father!” The father then said, “Go to the boundary of your uncle’s family’s field up there, then it will fly!” The child ran away to the boundary of his uncle’s field. Children . . . People tie birds and butterflies to let their children play with. If he had tied a bird, the bird would have flown and the child would have stayed with the parents to play with it, but the father tied a crab to make the child go far, far away. 8 Bibliography Dài Qìngxià et al. 2012. Lǎowō Lángnántǎ shěng Kèmùzú jíqí yǔyán [The Kammu people of Luang Namtha Province in Laos and their language]. Běijīng: Zhōngguó shèhuì kēxué chūbǎnshè. Ferlus, Michel. 1974. Les langues du groupe austroasiatique-nord. Asie du sud-est et monde insulindien (ASEMI) 5.1: 39–68. Gandour, Jackson, Eva Gårding & Kristina Lindell. 1978. Tones in Northern Kammu: a perceptual investigation. Acta Orientalia 39: 181–89. Holmer, Arthur. 2005. Verb serialization in Kammu. Working papers (Dept. of Linguistics, Lund University) 51: 65–84. Jun, Sun-Ah. 2005. Prosodic typology. In Sun-Ah Jun (ed.), Prosodic typology: the phonology of intonation and phrasing. Oxford: Oxford University Press, 430–58. Karlsson, Anastasia, David House & Jan-Olof Svantesson. 2012. Intonation adapts to lexical tone: the case of Kammu. Phonetica 69: 28–47. Karlsson, Anastasia, Jan-Olof Svantesson & David House. 2013. Multifunctionality of prosodic boundaries in spontaneous narratives in Kammu. Proceedings ProsodyDiscourse Interface 2013. Leuven. Lagrée, Doudart de & Francis Garnier. 1873. Vocabulaires Indo-Chinois. In Francis Garnier, Voyage d’exploration en Indo-Chine effectué pendant les années 1866, 1867 et 1868 . . . vol 2. Paris: Hachette, 495–517. Li, Fang-kuei. 1977. A handbook of comparative Tai. Honolulu: University Press of Hawaii. Lindell, Kristina. 1974. A vocabulary of the Yuan dialect of the Kammu language. Acta Orientalia 36: 191–207.

1002

svantesson and holmer

Lindell, Kristina, Håkan Lundström, Jan-Olof Svantesson & Damrong Tayanin. 1982. The Kammu year: its lore and music. London: Curzon Press. Lindell, Kristina, Rolf Samuelsson & Damrong Tayanin. 1979. Kinship and marriage in Northern Kammu villages: the kinship model. Sociologus 29: 60–84. Lindell, Kristina, Jan-Öjvind Swahn & Damrong Tayanin. 1984. Folk tales from Kammu III: pearls of Kammu literature. London: Curzon Press. Lundström, Håkan. 2010. I will send my song: Kammu vocal genres in the singing of Kam Raw. Copenhagen: NIAS Press. Maspero, Henri. 1955. Matériaux pour l’étude de la langue t’èng. Bulletin de l’École français d’Extrême Orient 47: 457–507. Premsrirat, Suwilai. 1982. A description of Khmu, including comparisons with Thai. Melbourne: Monash University, PhD. diss. ———. 1987. Khmu, a minority language of Thailand (Papers in South-East Asian linguistics No. 10, Pacific Linguistics A-75). Canberra: Australian National University. ———. 2002. Thesaurus of Khmu dialects in Southeast Asia. Salaya: Mahidol University. Simana, Suksavang, Somseng Sayavong & Elisabeth Preisig. 1994. Kmhmu’—Lao— French—English dictionary. Vientiane: Institute for Research on Lao Culture, Ministry of Information and Culture. Smalley, William A. 1961. Outline of Khmuʔ structure. New Haven: American Oriental Society. Svantesson, Jan-Olof. 1983. Kammu phonology and morphology. Lund: Gleerup. ———. 2011. The representation of Tai onsets in Kammu loanwords. Linguistics of the Tibeto-Burman area 34: 27–84. Svantesson, Jan-Olof & David House. 2006. Tone production, tone perception and Kammu tonogenesis. Phonology 23: 309–33. Svantesson, Jan-Olof, Kàm Ràw (Damrong Tayanin), Kristina Lindell & Håkan Lundström. 2014. Dictionary of Kammu Yùan language and culture. Copenhagen: NIAS Press. Svantesson, Jan-Olof & Damrong Tayanin. 2003. Sound symbolism in Kammu expressives. In Maria-Josep Solé, Daniel Recasens & Joaquin Romero (eds.), Proceedings of the 15th International congress of phonetic sciences. Barcelona: Universitat Autònoma de Barcelona, 2689–92.

chapter 15

Mlabri Kevin Bätscher 1 Background The language of the Mlabri (mra), mlaʔ briʔ ‘forest people’ (known in Thailand under the derogatory term pʰǐː tɔːŋ lɯ̌ əŋ “Spirits of the Yellow Leaves”) belongs to the Khmuic branch of Austroasiatic. The roughly 400 ethnic Mlabri live in five villages in Nan and Phrae provinces of Thailand. Oral history and a considerable amount of loanwords from Lao suggest that the Mlabri entered Thailand from Laos in recent history (possibly no earlier than the second half of the 20th century; Chanan 1992). Apparently, there are still Mlabri living in the Lao province of Sayabouli, although information on them is scarce, and contact even with government officials is sporadic. Almost certainly, their number is much lower than the population in Thailand. The Mlabri used to live as nomadic hunter-gatherers in the densely vegetated hills of the area, but have been led to sedentary life in very recent times (starting about 30 years ago), partly due to ongoing deforestation of the area and also government development programs (Rossi 2012). The Mlabri have long worked for local farmers (mostly Hmong, Khmu, and T’in people), but some of them are tending their own fields now with the support of the government. However, they still obtain a large part of their livelihood by hunting and gathering. Mlabri marry other Mlabri and speak the language to their children. Hence, despite their modest numbers, language use is vigorous. However, their society is changing rapidly, and their low population and social status renders their traditions very vulnerable. Every Mlabri speaks at least Northern Thai besides their ancestral language. Many also speak some White Hmong or even Mien dialects. In the last two decades, children have been learning Standard Thai in school and some of them code-switch frequently. Hence, borrowings invade the native vocabulary. Furthermore, young children show signs of applying prescriptive Thai phonology in their speech, although it is unclear whether this represents a looming change or just a phase in language acquisition. Rischel (2000) has identified three major dialects (or ethnolects), which he labels α (a.k.a. Mrabri), β (a.k.a. Minor Mlabri), γ (a.k.a. Yumbri). Phonological

© koninklijke brill nv, leiden, ��5 | doi ��.��63/9789004283572_021

1004

bätscher

differences are minimal, but they differ in their vocabulary to such a degree that they are substantially mutually unintelligible. This chapter is primarily a description of the α-dialect, although I will mention dialectal discrepancies where relevant (most notably in the word list). It is unclear whether speakers of the β-dialect remain. The Mlabri in Laos are assumed to speak the γ-dialect. Previous research on the Mlabri language has been fairly limited. The main resource is Rischel’s (1995) description of the β-dialect. His work represents rather a compilation of fieldnotes with hypotheses and anecdotes than a fullfledged grammar. Vernacular examples are scarce, especially of connected speech. Besides this, Egerod and Rischel published a rather extensive lexicon for the α-dialect (Egerod et al. 1987). Other publications have been either sketchy (L-Thongkum 1992, Nimmanahaeminda 1963) or directed towards one specific aspect of Mlabri, mostly historical in the case of Rischel’s publications (for example Rischel 2007), otherwise primarily phonetic (some student papers), but also semantic (Ratyanawong 2012). In short, the significant publications on the Mlabri language can be counted on one hand and a comprehensive understanding is far from attained. The present sketch will contribute little to this end, but offers a concise description of the past analyses, some new proposals, as well as novel data gathered in my own fieldwork (June 2012, April–May 2013).1 Some of the analyses have not been published previously, and they cannot be justified in due detail in this chapter. More detailed analyses with the necessary data will be forthcoming. 2 Phonetics/Phonology With a large phonemic inventory and a typical Austroasiatic syllable pattern, the conservative Mlabri phonology is valuable for historical reconstruction. 2.1 Word/Syllable Structure Mlabri word structure follows the typical Austroasiatic sesquisyllabic pattern with all disyllabic roots featuring an unstressed first syllable that may peak in

1 Most of the examples are drawn from my own fieldwork. Other examples are tagged with the source, and marked if they are in another dialect (for example [β]).

mlabri

1005

a vowel (with restricted variation in its quality) or a resonant.2, 3 This variation is exemplified below:4 (1)

Variation in the minor syllable təguk ~ tuguk chəmɔn ~ chɔmɔn chr̩kɛŋ ~ chərkɛŋ bm̩ bwaj ~ bumbwaj

I consider the vowel in the minor syllable to be epenthetic for the following reasons: first, there are no minimal pairs of words that only differ in the presence of a vowel in the minor syllable or in the vowel quality thereof;5 and second, the vowel quality of the minor syllable seems to be predictable, but this – as well as the conditions for the (optional) occurrence of such vowels— needs further investigation.6 Besides sesquisyllabic roots, there are invariably monosyllabic roots, but no roots longer than two syllables, except for very recent borrowings from Thai. The structure of a phonological word can be illustrated with the following template:

2 It has yet to be confirmed under systematic phonetic analysis whether words can fluctuate between one and two syllable peaks, but my impression is that they can, as for example in ɟram vs. ɟəram ‘exhausted’. 3 In one scenario, a resonant can be subject to variation, too: if the second consonant is the lateral /l/ a transitional tap is often produced before or instead of the lateral, for example klmuj ~ krlmuj ~ krmuj. 4 I also represent some of this phonetic variation in the Mlabri word list given in . . ., although my account is based restrictively on forms attested in natural speech, and does not provide an exhaustive list. 5 Rischel (1987) gives one dubious minimal pair: tarɯŋ/tərɯŋ ‘bamboo section used as a large cup or pot’ vs. trɯŋ ‘kind of leaves used for wrapping salt’, respectively. Of course, one example does not suffice to establish a phonological distinction, especially if the two words are semantically so close that they could in fact refer to the same thing. 6 The quality of minor vowels seems to be influenced by the quality of the vowel in the major syllable (often, the vowel in the minor syllable is a copy thereof), as well as the immediate phonetic environment (for example /u/ usually occurs in front of an /m/). Cf. Rischel (1995:64). However, Rischel (1995) argues for phonemic distinctions in minor vowel quality.

1006

bätscher

(2) Mlabri word structure (C)(V)(C)ˈ(C)(R)V(C)7

Codas /h/ or /ʔ/ are sometimes pronounced very weakly or are completely dropped, especially in connected and rapid speech. At this stage, it is unclear whether true open syllables exist. (3) Examples of word structure brʔiʔ ‘name’ bnn̥ ɛʔ ‘man’ grwɛc ‘finger’ cakɗar ‘squirrel’ n̥ daʔ ‘thin’ diŋmriŋ ‘louse’ cnbrin ‘cricket’ kokdroj ‘dove’

2.2 Phoneme Inventory and Phonotactics The phonemic inventory of Mlabri is very rich, with four different manners of articulation in stops, as well as a voicing distinction in resonants and a glottalization distinction in glides. (4) Mlabri consonant inventory voiceless asp. stops: ph th ch [~s] kh voiceless plain stops: p t c k ʔ voiced stops: b d ɟ g implosive stops ɓ ɗ voiced nasals: m n ɲ ŋ devoiced nasals m̥ n̥ ɲ̊ ŋ̊ voiced liquids: r, l (devoiced liquids) (r̥, l ̥) spirants: s ( j̊) h semivowels: w j (preglottalized semivowels) (ʔw) (ʔj)

Some exemplary minimal pairs are provided below. (5) Minimal pairs for dental stops ɗiŋ ‘big’ vs. diŋ ‘older sibling’ druɲ ‘tussock moth’ vs. truɲ ‘hard, crusty’ tal ‘day’ vs. thal ‘long’ 7 R = resonant consonant.

mlabri

1007

(6) Minimal pairs for devoiced vs. voiced sonorants m̥ ɛʔ ‘new’ vs. mɛʔ ‘rain’ l ̥ɛŋ ‘bracelet’ vs. lɛŋ ‘red’

The phonemicity of voicing in resonants, as well as glottalization in glides and voiced stops has yet to be examined.8 The phoneme /ch/ is in free variation with /ɕ/ or /s/. This seems to be an areal feature and is also apparent in Northern Thai dialects (Lanna and Lao).9 Most phonotactic restrictions occur in the syllable coda. All manners of articulation for stops collapse into a single unreleased voiceless series. Furthermore, only voiced nasals are found in codas. However, voicing of liquids is distinctive in coda position, but only in elders’ speech. Middle-aged and younger speakers do not maintain that distinction and have only voiced liquids in coda position. The same holds for the phoneme /j̊/, which only occurs in coda position, and the voiceless trill /r̥/ when it occurs in coda position. Yet, the occurrence of both is very rare.10 As the examples in (3) show, Mlabri allows for complex clusters. However, one remarkable restriction is that clusters consisting of a stop followed by a nasal are not found at all. (Rischel 1995:74) The Mlabri vowel system is complex in cross-linguistic comparison, yet quite typical for AA:

8

9 10

See Rischel (1995: 72ff.) for a discussion. He considers these segments to be clusters, and justifies this by the fact that “such entities as /hm, ʔb/ do not form clusters together with other consonants: they are themselves clusters.” However, words like n̥ taʔ ‘tail’ show that at least the nasal series contradicts this prediction. Furthermore, the occurrence of the epenthetic vowel in minor syllables provides evidence for the opposite analysis: (2) shows that these always occur after the first consonant in the onset. Yet, supposed clusters like / hm, ʔb/ are never split. For instance, for a sort of bamboo, m̥ rkal an epenthetic vowel can occur as in m̥ ɯrkal, but not *hɯmrkal. (I only have relevant evidence for nasals, but not for liquids.) As Rischel mentioned, glottalized consonants do not seem to occur in clusters, at all. Yet, this might be a surface phenomenon.) Besides, in any other onset (or pre-stress groups) of four consonants, the last one is never a stop. Morphological processes, especially infixation, can provide more evidence. I discuss this issue in detail in a forthcoming paper. For the purpose of consistency, I chose the notation /ch/ to represent this phoneme. I have not been able to identify it, and Rischel (1995) admits that he found it only in the β-dialect.

1008

bätscher

(7) Mlabri vowel inventory i ɯ u e (ə) ɤ o ɛ ʌ ɔ a

Among these, /ə/ only occurs as an epenthetic vowel in minor syllables and is thus not distinctive. Besides sequences of vowels and the semivowels /w/ and /j/, true diphthongs are very rare and possibly only occur in loanwords.11 There is no evidence for the phonemicity of vowel length in the α-dialect. Rischel speaks for such a distinction in the β-dialect, and proposes that it was lost in the α-dialect. (Rischel 2000:306) In the α-dialect, final syllables are commonly lengthened for emphasis, especially at the end of a clause. (cf. 2.4.) 2.3 Suprasegmentals There are no phonemic tones in Mlabri. Likewise, there is no register distinction as can be found in many other AA languages. This is due to the fact that Mlabri did not go through the loss of phonemic distinctions such as the devoicing of initial consonants, which is associated with suprasegmental features in other languages in the area. 2.4 Sentence Prosody Mlabri features a very distinct sentence intonation. This prosodic pattern is characterized by crescendo of pitch during the utterance with a clearly audible sharp and prolongued fall in pitch on the sentence-final syllable.12 3

Word Formation

Considering the areal context, it is not surprising that Mlabri is a predominantly isolating language lacking inflectional morphology. However, Mlabri features a comparatively rich and fairly transparent regular morphology (Rischel 1995: 83).

11 12

Rischel (1995:68) lists /ia, iʌ, iɤ, ua, uʌ, uɤ/. See L-Tongkum (1992: 46) for a description of this pattern. Despite the high frequency of this pattern, it is likely tied to some communicative function, which has not been identified so far. See Rischel (1995: 80–1) for a description of different prosodic contour types. Nota bene: the α- and β-dialects vary considerably in this regard.

1009

mlabri

3.1 Compounding Nouns can be attributed by simple apposition, just like other attributes, and like stative verbs (cf. 4.4). However, there is no prosodic evidence that the two components would form any such unit as a single phonological word. 3.2 Derivational Affixes The only derivational process that results in a change of word class is the infixation of verb roots to form nouns, which is described below.13 There are historical traces of different infixes—,

, —but the only one that shows a synchronically consistent pattern is .14 Primarily, it seems to derive nomina instrumentis from verb roots, as illustrated in the following examples:15 (8) The infix tɛk ‘hit’ tɯj ‘pound’ chrɛt ‘comb (v.)’ kwac ‘sweep’ gwɛc ‘poke’

> > > > >

tɛk ‘hammer’ tɯj ‘mortar’ chrɛt ‘comb (n.)’ kwac ‘broom’ gwɛc ‘finger’

Evidently, the infix has different surface forms that depend on the structure of the basic root: note that all the derivations start with a three-consonant cluster. This environment favours the occurrence of a minor vowel (cf. 2.1.). If the onset already contains a /r/, a simple /n/ is inserted not after, but before the trill. If the root contains any other sonorant in the onset, a simple /r/ is infixed before that. 3.3 Reduplication Reduplication plays a small role in Mlabri syntax. That is, reduplication is not employed to mark intensification or form adverbs, as for example in Thai.16 However, many nouns exhibit full reduplication. In such cases, the base is usually not attested in isolation, and when it is, it has the same semantic content. Hence, the process of reduplication is lexicalized and not productive synchronically. 13 14 15 16

But see 5.3. for verbal affixation that does not involve a change of lexical category. Rischel (1995: 87–8) hypothesizes that some of these infixes might in fact be allomorphs that arose as the result of dissimilation. Rischel (1995: 85) gives examples where the infixed form is a nomen actionis, but those might be roots specific to the β-dialect. L-Tongkum (1992: 53) argues that reduplication of (animate?) nouns can be used to mark plurality. It is not clear how productive this is.

1010

bätscher

(9) Reduplicated nouns micmɛc ‘ant’ ɲ̊ imɲ̊ ɛm ‘eyelashes’ muŋmoŋ ‘lymph gland’

Reduplication of verbs is less common than in other languages in the area (for example Thai or Hmong). Its function has not been clearly defined yet. It may convey an iterative or diminutive meaning. (10) Transparent reduplication in verbs pumpam ‘play around’ (pam ‘play’) boboŋ ‘eat’ (boŋ ‘eat’)

Unlike noun reduplication, these forms are synchronically derivable. (10) shows examples of both partial and full reduplication of verb roots. It is not clear if they have different functions. 4

Phrase and Clause Structure

4.1 Simple Sentences (Word Order, Questions, Commands) Mlabri follows the word order AVP, like most AA languages. (11) ʔoh kɔbɔ boŋ cin 1sg neg eat meat ‘I don’t eat meat.’

Yet, unlike most other languages in the area, Mlabri is not consistently headinitial. Deviation is found in noun phrase structure, as possessors and adjectives precede their heads. (see 4.4. and) Fronting of topics (see 4.3.), as well as deletion of given arguments (zero-anaphora) is very common. Polar (yes-no) questions are formed with the clause-final particle lɛh. (12) mɛh kɔbɔ ɟak lɛh 2sg neg go q ‘Aren’t you going?’

Open (or content) questions lack this particle, but feature an in-situ question word instead.

mlabri

1011

(13) naŋsɯ ʔoh dɔk gəlɛŋ book 1sg put where ‘Where [shall] I put the book?’

Imperatives are optionally marked by di:17 (14) (mɛh) (di) hrlɛʔ 2sg link smile ‘Smile!’

When the verb is modified by an adverb, the linker di separates the two: (15) rɛʔ ɟak di ʔjɛʔ escape go link far ‘Run far away!’

Negative imperatives (prohibitives) are introduced by a preverbal gʌm, as in the following example: (16) mɛh gʌm ɟak 2sg proh go ‘Don’t go [there]!’

4.2 Complex Sentences 4.2.1 Serial Verb Constructions Mlabri clauses are rich in verb serialization, as are other Southeast Asian languages. The following example shows how serial verb constructions are employed to introduce adjuncts such as instruments: (17) ʔek lam tɛk take wood hit ‘hit with a stick’

Besides this construction, it is not uncommon for adjuncts to follow a verb in what seems to be the direct object slot:

17

di shows up in different syntactic environments and with various functions. It is unclear whether these represent the same marker, or whether they homonyms.

1012

bätscher

(18) wɤk chmbɛp, kɔbɔ wɤk kɛw drink lips neg drink glass ‘Drink out of the bottle [that is, with your lips], don’t drink out of a glass [because there are none].’ (19) glaʔ mɤʔ mɤm speak mom dad ‘We’ll speak like my parents [that is, in Mlabri].’ [β] (Rischel 1995: 55)

The verb maʔ ‘give’ is used to introduce not only recipients, but also beneficiaries—a common phenomenon in Southeast Asia. (20) ʔoh maʔ mɛh ɗɤŋ ʔok ʔɛw 1sg give 2sg watch 1sg.poss child ‘I show you my child.’

Unlike Thai or Chinese, the verbs in a serialization are not just stacked in their basic forms. Instead, non-initial verbs are frequently marked by perfective (nsit) and causative prefixes. This topic needs further investigation, but it seems that some types of semantic relations within a serial verb construction require that the components be symmetrical in transitivity. Possibly, verbs that denote an effect need to agree in transitivity, but verbs of direction clearly do not. (21) tɛk mut pa-bɯl hit mosquito caus-die ‘hit the mosquito dead’ (22) dor ɟak ni gɛŋ throw go loc house ‘throw inside the house’

4.2.2 Clause Combining Clauses are frequently adjoined without conjunctions; the logical relation between the clauses is left to pragmatic interpretation. (23) ʔot glaŋ chuːŋ, ʔoh chibɛʔ 1sg.poss husband tall 1sg short ‘My husband is taller than me.’ [β] (Rischel 1995: 185)

1013

mlabri (24) kibi biːʔ, boŋ di chɛʔ neg satisfied eat link much ‘If you aren’t full yet, eat some more!’ [β] (Rischel 1995: 193)

This type of clause linkage is the norm for comparatives (23) and conditionals (24), temporal sequences and adverbial clauses. 4.2.3 Relative Clauses No unambiguous cases of relativization are found in the recorded data. The following example shows what appears to be a post-head relative clause: (25) khep mɤm maʔ ʔoh ʔa ɲon, ʔa tac slippers father give 1sg nsit neg.exist nsit break ‘The slippers father gave me are used up, they’re broken.’ [β] (Rischel 1995: 190)

However, since subordination is not formally marked in any way, it would be possible to analyze such sentences as asyndetically joined clauses with a fronted topic. Further investigation will show if (subordinated) relative clauses exist in Mlabri. Constructions such as (25) are not very frequent. 4.2.4 Complement Clauses Complement clauses are usually unmarked: (26) kraw braɲ krʌp fear dog bite ‘afraid that the dog might bite’ (27) mɔj sop leh ɗaj lɛh 3sg ask come can q ‘He asks if he can come.’

However, tana seems to serve as an optional complementizer. (28) mɔj tʌɲ (tana) 3sg say comp? ‘He said he’d gone.’

mɔj ʔa ɟak 3sg nsit go

4.3 Syntax and Pragmatics (Topic-Comment, etc.) The fronting of topical and focal elements is very common in Mlabri. This is exemplified in the following sentence, which was uttered in response to the question whether Mlabri eat ducks:

1014

bätscher

(29) kap boŋ duck eat ‘(we do) eat ducks’

The above example also demonstrates the high occurrence of zero anaphora in Mlabri. 4.4 Noun Phrases As mentioned in 4.1, Mlabri shows deviations from a general head-initial syntax. Namely, possessors, the partitive determiner and adjectives precede their heads. The latter will be discussed in 5.2. Besides these two classes, other modifiers, such as adjoined nouns, demonstratives (gʌh ‘this’, ɲʌʔ ‘that’) and adjoined stative verbs (relative clauses?), follow their heads, as in the following example: (30) mlaʔ briʔ person forest ‘people of the forest’

A direct succession of two nouns either represents the structure of a head with a following modifier (as in 30) or a coordinate construction where each noun denotes a part of a pair that together form a whole, as in the following: (31) diŋ roj older.sibling younger.sibling ‘siblings’

The structures of the two types of noun phrases differ in that the first one refers to one entity, whereas the second one has two referents. Thus, a modified noun can be preceded by only one determiner, whereas each noun in a coordinate construction needs to have its own determiner. If two nouns are coordinated in a phrase, but they do not form a natural whole, they are conjoined by the conjuction nɯŋ. (cf. 5.5.) A remarkable feature of Mlabri considering the areal context is its possessive construction: unlike other modifiers, possessors precede their head. Furthermore, the possessor and its head are linked by the particle di. (32) taʔ chi di gɛŋ uncle pn link house ‘Uncle Chi’s house.’

mlabri

1015

This possessive construction seems to require that the possessor be animate. Part-whole relationships are not usually expressed with this construction.18 (33) gwɛc tiʔ poke hand ‘fingers’

The connective di may precede a head without the overt occurrence of a possessor: (34) ŋ̊ uh di gɛŋ sit link house ‘be in one’s own house’

In such cases, di has a reflexive reading. 4.4.1 The Partitive Determiner The prenominal determiner ʔak identifies the referent of the noun phrase as a “part of a complex structure or organism.” (Rischel 1995: 152) More specifically, it precedes a noun that is inherently part of a whole. This is the case when it refers to a given possessor, specifically a third person, and in fact Mlabri often paraphrase ʔak with moj di ‘3sg link’. Inalienable objects, such as body parts, are usually preceded by the determiner even in isolation. Yet, ʔak is not only used to refer to a possessor; it also occurs when the referent naturally occurs in combination with other parts, as in the following: (35) ʔak ʔuj ʔak ɟioŋ part mother part father ‘mother and father’

Rischel (1995) refers to ʔak as a definite article. However, unlike a Europeanstyle definite article it does not mark anaphoric or deictic reference and is thus much less frequent in texts (Rischel 2006). In order to make that difference and because ʔak encodes a part-whole relationship, I refer to it as a partitive

18

However, when the possessum is part of an animate possessor, consultants do not reject a construction with di (for example braɲ di grwɛc ‘the dog’s paw’), although direct apposition seems to be less marked in this situation (for example grwɛc braɲ ‘dog paw’).

1016

bätscher

determiner.19 Although this seems to be the default usage of the determiner, it has other functions—for example to refer a trajectory as in the following example—which are difficult to describe under one unifying description. See Rischel (2006) for an extensive description of its functions. (36) ɟak khɯn ʔak wɤk go rise part water ‘go upstream’ (Rischel 2006: 87)

This determiner has an alternative form ʔat which seems to have the same distribution as the possessive forms ʔot and mɛt (as opposed to ʔok vs. mɛk). (See 5.1.2.) Although the etymology of ʔak/ʔat is unclear, at least synchronically it seems to form a paradigm with the above-mentioned possessive pronouns. 4.5 Adverbs Adverbs with scope over the whole sentence (mostly adverbs of time, but also location) usually occur in clause-initial position. (37) tal gʌh ciʔŋʌ kəbɔ ɟak rmap day prox what neg go field ‘Why didn’t you work on the field today?’

When an adverb directly modifies a verb, the linker di separates the verb from the following adverb. (38) kalaj pit ʔak glɤʔ di ɗi neg.pst close part head link good ‘[I] didn’t close the bottle properly.’ (39) rmɯt puŋ di wind blow link ‘The wind blows cool.’

19

lɛk cool

Rischel (2006: 101) asserts that the communicative goal of this determiner is quite distinct from “western definite articles”: “Generally speaking, [the Mlabri determiner] takes a more specific, relational type of reference (subsuming the notion of associative reference) to licence the use of the DetP.”

1017

mlabri

5

Word Classes

5.1 Nouns 5.1.1 Common Nouns Except for derived nouns (see 3.2.1.), nouns are not marked in any way, but are defined functionally: they may stand in argument position to form the head of a noun phrase, they can occur as arguments of adpositions (see 5.4.), or they can modify another noun by juxtaposition or with the linker di. Furthermore, they may be preceded by the partitive determiner (see 5.1.3.). 5.1.2 Pronouns Mlabri exhibits the following basic pronoun system: Singular Dual Plural

1st Person 2nd Person 3rd Person ʔoh mɛh mɔj ʔah bah — ʔah thɤŋ bah thɤŋ —

The dual category is based on the two forms ʔah and bah. Yet, a complete understanding of these forms is far from reached at this point. Nota bene, the same forms are used in the plural pronouns, where they are extended by the numeral thɤŋ ‘5’. (cf. 6.2.) Although the plural forms are firmly grammaticalized, the basic forms may be extended by other numerical values, so in fact their usage is not restricted to the dual number. It does seem to be the case, however, that they denote dual number if they occur independently. Notably, there are two gaps in the system, namely for 3rd person dual and plural. Descriptive noun phrases (often cʰɔŋ mlaʔ ‘2 people’ and m̥ u ‘group’) are used to fill these gaps. Also note that there is no inclusive-exclusive distinction as in other Khmuic languages (for example T’in). A very intriguing and areally highly marked feature of Mlabri is the presence of the possessive pronouns ʔot ‘1sg.poss’ and mɛt ‘2sg.poss’. At first sight, these seem to be simple phonological reductions of the sequence pronoun and linker di (cf. 4.4.). Indeed, these possessive pronouns are in free variation with a completely analytic construction. However, they exhibit an interesting phonological alternation: both forms often end in a velar instead of in a dental stop. This alternation seems to be subject to assimilation, but also to sociological factors (mainly age).20 Importantly, this alternation is not grounded in the general phonology of Mlabri. Hence, they clearly form a paradigm, which 20

Younger speakers end these forms in a velar more often than older speakers.

1018

bätscher

is possibly complemented by the partitive determiner ʔak (see below), which exhibits the same phonological alternation. 5.1.3 Question Words The following question words have been identified in Mlabri: gəlɛŋ ‘where / which’, ciʔ ɲʌ/piʔ ɲʌ ‘what’, tɯm mlaʔ ‘who’, w̥ aŋ ɗɤ ‘when’, ch(i)nɗɛh ‘how’, pan ɗɤ ‘how many’. (40) ʔek lam gəlɛŋ take wood which ‘Which [piece of] wood [shall I] take?’

It is striking that many of them seem to be composed, although the separate elements are not always clear. For instance, tal ɗɤ can be used to ask for a time. The first part is the word for ‘day’. One could expect that the second part means ‘which’—but gəlɛŋ is normally used for that (cf. 40). Furthermore, the morpheme w̥ aŋ is unknown otherwise. tɯm mlaʔ ‘who’ is clearly composed of mlaʔ ‘person’ and perhaps Hmong twg ‘which’, White Hmong twb twb ‘who’. 5.1.4 Measure Words and Quantity Words Mlabri does not have an extensive classifier system.21 The only readily occurring classifiers are mlaʔ for people, and klɔʔ for animals (and possibly fruits). Although this description is a mere first approach, the scarcity of classifiers would not be surprising for an AA language.22 The following example demonstrates the word order of head noun, numeral and classifier: (41) chɯ cəbut bɛr klɔʔ buy pig two clf ‘buy two pigs’

Not only is the number of classifier types very low, but also their frequency. Notably, classifiers are not required with demonstratives, like in other AA languages (for example Mon), but unlike Thai or Mandarin. 21 22

Rischel (2007: 143) argues that the β-dialect has a much more elaborate system, listing 13 items. In fact, even human referents can be modified by a bare numeral: mɤm toc bɛ:r m̥ aj ‘father took two wives’ [β] (Rischel 1995:143).

mlabri

1019

(42) kwʌr gʌh foreigner prox ‘this foreigner’

In fact, numerals are often used in isolation without classifiers: (43) ʔa-leh cʰiː nsit-come four ‘There are four [people] left.’

As for quantifiers, they are rather limited as well. The crucial terms are chɛʔ ‘many’ and patok ‘all’. Other measures of quantification are mostly derived from these, for example ki chɛʔ ‘not many, little, few’. patok does not get used in attribution to a noun (like English every or Thai thúk), but as a sort of resultative in a serial verb construction (much like Thai mòt). (44) leh patok come all ‘Everybody came.’ (45) pɤj ɗaj patok eat can all ‘You can eat all [types of bamboo].’

5.1.5 Names and Terms of Address “Both the title and the personal name itself may change with major events during a lifetime. A person may have at least three different names through his or her life. These changes occur with marriage and childbirth.” (Rischel 1995: 131) Mlabri names are commonly monosyllabic—examples are Bliah, Kuŋ, Kaʔ, Mon, Tik—but sesquisyllabic names do occur. Senior males are adressed with the title taʔ (see below) preceding their name. Their wives commonly share the same name, preceded by jaʔ. Most often, though, people are addressed as diŋ ‘older sibling’ or roj ‘younger sibling’, whether the people are actually related or not, as is very common in other languages in the area. Yet, the two terms refer to more than just relative age of people, and diŋ can be used to address younger people to humble oneself, as when asking for a favour.

1020

bätscher

Used as kinship terms, taʔ and jaʔ can refer both to one’s grandfather and grandmother, and uncle and aunt.23 The corresponding opposite nɔʔ ‘grandchild/nephew/niece’ makes no gender distinction. Referring to one’s parents, mɤʔ ‘mother’ and mɤm ‘father’ are used to address them, but ʔuj and joŋ are used in third person. This latter pair also generally refers to female and males, respectively. 5.2 Adjectives Mlabri has a pair of small words that pattern very interestingly: ʔaj ‘small’,24 and baj ‘big’ (β: blaj). These words precede their head, which can either be a noun or verb. This is exemplified with baj below: (46) a) baj wɤk (ɗiŋ) big water big ‘river’ b) *wɤk baj water big Intended: ‘river’

This shows that baj may not follow wɤk if it modifies it directly. This is not to say that baj may not follow a noun, in general. This is shown below: (47) chɔmɔɲ baj tɯrnap star big big ‘The stars are big.’

Prosodic cues tell us that baj and tɯrnap form a constituent. This is corroborated by the close bondage between baj and other stative verbs, such as thal ‘long’. That is to say, baj may not serve as a head. For instance, it could not alone serve as a predicate. This is why *wɤk baj is ungrammatical. However, it can modify both a noun or a predicate. The following example shows that ʔaj patterns likewise:

23 24

The terms taʔ and jaʔ basically refer to an older family member who is separated by two nodes upwards in a family tree. It seems that the form ʔaj is a phonological of ʔɛw ‘child; small’.

mlabri

1021

(48) ʔaj kəpit ʔaj tak25 small bottle small small ‘the bottle is small’

A possible third member of this class of adjectives might be chi ‘hurt’, although it does not pattern exactly analogously to baj and ʔaj. Still, consider the following distribution: (49) a) chi thɔŋ (gret) hurt stomach hurt ‘[My] stomach hurts.’

Because of their special distribution, which distinct from stative verbs, I treat the items baj and ʔaj (and possibly others) as a separate part of speech, which I here call adjectives. b) *thɔŋ chi stomach hurt Indended: ‘[My] stomach hurts.’

5.3 Verbs The majority of Mlabri verbs occur in their basic citation form. There is no morphological marking of passives or reciprocals.26 Transitivity is largely a lexico-semantic parameter with no formal distinction, aside from causative forms (cf. 5.3.5.). As ellipsis of arguments is very frequent, transitive verbs often occur without overt arguments (that is, both subjects and objects). Furthermore, transitive verbs may be used in an unaccusative frame (which looks like an unmarked passive): (50) ʔac gʌh boŋ ɟʌj̊ bird prox eat tasty ‘This bird is tasty (to eat).’ (L-Tongkum 1992: 54)

25

26

In fact, the prosodic unity of ʔaj and tak is so strong that it seems like one word. ʔaj is often phonetically reduced to /e/, and tak is not attested elsewhere. This form may be fully lexicalized. Instead, reciprocals are expressed by an expression tom mlaʔ ‘each other’ following the verb.

1022

bätscher

However, verbs are productively inflected for tense, viz. by the marker a denoting a new situation (nsit, or perfective aspect). (51) ʔa bɯl patok nsit die all ‘They all died.’ (52) ki ʔa pluŋ neg nsit hot ‘not hot anymore.’

5.3.1 Ditransitive Verbs The only truly ditransitive verb that has been identified in Mlabri is maʔ ‘give’. Although referential density is generally very low in Mlabri (cf. 4.1.), the following two (elicited) utterances show the possible word orders in a ditransitive clause without topicalization: (53) Recipient-Theme word order chɯ maʔ ʔetak khanom buy give child cookie ‘buy cookies for the child’ (54) Theme-Recipient word order maʔ khanom mɛh give cookie 2sg ‘give the cookie to you’

Apparently, both word orders are acceptable in Mlabri, leaving the distinction between theme and recipient to semantic resolution. However, multiple overt arguments in one clause are avoided in Mlabri. The recipient seems to be expressed in post-verbal position always, but the theme may either be contextgiven and dropped, or otherwise topicalized as in the following example: (55) wɤk maʔ di ɟioŋ water give link father ‘give water to one’s father’

5.3.2 Directional Verbs The most common motion verbs are ɟak ‘go’, leh ‘come’, hɔt ‘arrive’, wʌl ‘go back’ khɯn ‘go up’, and ɟur ‘go down’. Out of these, especially the first two are used very frequently in serial verb constructions. Furthermore, the first four are

1023

mlabri

transitive, that is, they can be followed by a location without an interceding preposition. (56) Directional verbs with DO ɟak rəmap wʌl ‘come back from the field’ mɛh leh gəlɛŋ ‘Where are you from?’ a-hɔt gɛŋ ‘We’re back home’

In contrast, kʰɯn and ɟur require either a locative marker or a transitive motion verb (ɟak or leh) after them: (57) kʰɯn *(ni) khɔn go.up loc log ‘[They] climbed on a log.’

5.3.3 Causative Verbs Mlabri applies the typical AA way of forming causatives, namely with the prefix pa-/pə-. Its usage is fairly productive and very transparent. However, in some cases the prefix does not seem to add any causative meaning to the stem, and sometimes it seems that the corresponding (underived) verb stems have been lost.27 This prefix shows a phonological peculiarity, namely (possibly optional) voicing dissimilation: if it is attached to a verb stem that begins with a voiceless consonant, the initial labial of the prefix becomes voiced. This is exemplified in the following list: (58) Causative verbs ɟur ‘descend’ leh ‘come, appear’ chɛŋ ‘flame up, be on fire’ chɔn ‘teach’ (Thai)

> > > >

pa-ɟur ‘make something fall’ pa-leh ‘produce, give birth’ ba-chɛŋ ‘set afire’ bə-chɔn ‘teach’

The following example demonstrates how causative verbs subcategorize for a direct object: (59) pa-leh ʔak rtlat caus-come part tongue ‘stick the tongue out’ (Rischel 2006: 72)

27

My evidence for analyzing the initial /p/ as the causative prefix is the presence of a schwa, which—if analyzed as an epenthetic vowel—would not occur in a root.

1024

bätscher

5.3.4 Existential Verbs The basic existential verb in Mlabri is pɤʔ (ɲaŋ in the β-dialect).28 The argument may either precede or follow pɤʔ. (60) ni briʔ pɤʔ thak loc forest exist leech ‘There are leeches in the forest’

This verb can also be used to express possessive relations like English ‘have’:29 (61) a) ʔoh juk pɤʔ 1sg rice exist ‘I have rice.’ b) ʔoh pɤʔ juk 1sg exist rice ‘I have rice.’

To express the absence of something, the verb l ̥ak (noɲ in the β-dialect) ‘not exist, lack’ is employed. This verb is much more frequent than its counterpart pɤʔ, and is also used to respond negatively to question whether an action has occurred or not, as in the following conversation: (62) Not yet A: ʔa ʔɤʔ juk lɛh nsit eat rice q ‘Have you eaten yet?’ B: l ̥ak neg.exist ‘not yet’

28 29

pɤʔ is unlike Thai jùː in that it is not used in locative constructions (in which case ŋ̊ uh is used). Thus, it resembles the function of Thai mi: more closely. The construction with the two NPs in juxtaposition suggests that the predicate is still intransitive, but the possessor is added as a topic.

mlabri

1025

5.3.5 Preverbal Adverbial Mlabri features a list of preverbal adverbs and grammatical particles that do not comply with the general head-initial structure. Among these are the following: (63) Preverbal adverbs ar ‘first’ chʌm ‘again’ ʔa ‘nsit’30 gʌm ‘PROH’ ʔɛl ‘yet’

The usage of such adverbs is exemplified below: (64) ar chuaj tiʔ first wash hand ‘Wash [your] hands first!’ (65) ki ʔɛl ʔɤʔ juk neg yet eat rice ‘[I] haven’t eaten yet.’

5.4 Locational and Directional Words Spatial relations in Mlabri are expressed by a variety of different strategies, which are only summarily presented here. To express static location, the verb ŋ̊ uh ‘sit’ gets used to express location at a specific point. It can be accompanied by a locative marker, but does not need to be: (66) ŋ̊ uh (ni/ti) gɛŋ sit loc/loc house ‘be at home’

This example also shows the locative marker ti/ni, which is neutral with regard to spatial specifications. The former is an apparent loan from northern 30

The exact status of the New-situation (NSIT) marker as a prefix or rather a clitic remains to be investigated. It’s high phonological dependence and exclusive (?) appearance in front of verbs suggest that it is a prefix, but its status is unclear. In this text, it is represented as a free morpheme, awaiting further evidence.

1026

bätscher

Thai (Kammueang) tî ‘place; at’. In contrast, the following spatial terms are more specific: lətɤŋ ‘up, on top’, cʰibɛʔ ‘down, low’, naj ‘inside’ (another loan from Northern or Standard Thai, naj ‘in’)), luah ‘outside’, kal ‘in front’, and gɔt ‘behind’. Some of these are used as adverbs, whereas others pattern more like adpositions.31 Some of this variation is demonstrated in the following examples: (67) ʔac pʌr ɟak lətɤŋ bird fly go up ‘The birds fly up.’ (68) ʔac ŋ̊ uh ni toʔ lətɤŋ bird sit loc table top ‘The bird is on the table.’

The last three items in the list above—luah ‘outside’, kal ‘in front’, and gɔt ‘behind’—are relatively frequent in combination with another locative marker luŋ/nuŋ. Thus, they are used as adverbials with either locative or dynamic meaning. (69) mɛh chi ɗaj ŋ̊ uh luŋ-luah lɛh 2sg want get sit loc-outside q ‘Do you want to go sit outside?’ (70) leh lɯŋ-kal32 come loc-front ‘[He] came before [the others]’ or ‘[He will] come later.’

Notice how lɯŋkal is used to refer to the future, that is, metaphorically, what is lying ahead. With this meaning, it can also modify a noun: (71) n̥ am lɯŋ-kal year loc-front ‘next year’

31 32

Theoretically, some of these “postpositions” (specifially lətɤŋ ‘on top’) could also be spatial nouns that attribute the preceding noun. This will have to be tested. The reason why I analyze this compound as one word is because the locative marker luŋ seems to undergo the type of vowel harmony apparent in minor syllables. (cf. 2.1.)

mlabri

1027

Besides this, plain kal can be used as a preposition, similar to naj: (72) leh kal taŋ mlaʔ come front other person ‘[He] came before the other people.’

The locative marker luŋ/nuŋ is also used in combination with the demonstratives to form luŋgʌh ‘here’ and luŋɲʌʔ ‘there’. In short, spatial relations in Mlabri are expressed through a combination of motion verbs (see 5.2.4.), adpositions (both pre- and postposition), and adverbs. The boundaries between the last two categories are open. Locative markers are unnecessary if the verb itself has a locative meaning. 5.5 Conjunctions / Connective Words The coordinating conjunction nɯŋ most often links arguments, as in (73), but can coordinate predicates, as well: (73) braɲ nɯŋ kwʌr hot ɟur ni wɤk dog and foreigner fall go.down loc water ‘The dog and the boy fall down into the water.’ (74) pluŋ nɯŋ gem hot and spicy ‘It’s both hot and spicy.’

a-lac (a) is a connective that is used to link clauses, very similar to Thai lɛ́:w kɔ̂ . (75) kwʌr duʔlon hot ɟur foreigner surprised fall go.down ‘The boy is so surprised that he falls down.’ ʔa lac ʔa ʔac pʌr leh nsit finish ? bird fly come ‘And then the bird comes flying out.’

kibɛ and kətʌɲ are used to contrast two assertions, like English but. They can be placed either at the beginning of a clause or after the subject, depending on the focus. Apart from these few conjunctions, Mlabri features very few connectives. Clauses are often adjoined without intervening linkers; the nature of logical cohesion is left to pragmatic interpretation. (cf. 4.2.2.)

1028

bätscher

5.6 Particles There are two preverbal negators, ki and kɔbɔ, the latter of which is an obvious borrowing from Northern Thai kɔ̀ bɔ̀ , a lexified combination of topic linker kɔ̀ and negator bɔ̀ , whereas the former could represent an indigenous form. The two seem to be used interchangeably, except for certain quasi-lexified expressions. As described above, Mlabri makes frequent use of a particle di. At this stage, it is not clear whether this represents several homophones or a multifunctional marker, but it is used for marking the marking of possession, adverbs and imperatives. Besides the question particle lɛh (see 4.1), the Mlabri employ a number of different sentence-final particles, including no, which seems to be used as a tag.33 Although this aspect of the grammar is completely unstudied, it seems that the system is not as rich as in Thai. Instead, the Mlabri convey stance in prosodic cues. 6 Semantics 6.1 Specialized Vocabulary As would be expected for a people like the Mlabri who live in close relation with their natural surrounding, even young Mlabri know a multitude of plant names, few of which have been identified at this point. They have independent names for different kinds of bamboo, lacking a generic term. Similarly, the Mlabri lack generic terms for certain groups of animals, for example snakes (constrictor snakes are different from poisonous ones), monkeys, or any generic term for animals as a class. Mlabri has three verbs for eating: boŋ for any sort of meat, ʔɤʔ for harder vegetarian food (that needs to be chewed)—for example rice or cookies—and pɤj for softer vegetarian food—like cooked bamboo, fruits, or ice-cream. All of these verbs are equally marked, and the distinction is very consistent. One word, mʌc, describes various ways of perception and covers English ‘see’, ‘hear’, and ‘know’. 6.2 Language Use The Mlabri do not use their numerals very consistently. Although many of them are able to count from one to ten in succession, especially younger speakers have trouble accessing any given numeral, especially younger speakers. Most of the time, they use Thai numerals, especially as nominal attributives. 33

A proper understanding of this seems to require a sound knowledge of Northern Thai, because I have the impression that the two are similar.

1029

mlabri

The numeral ‘3’, pɛʔ, is not used in attribution, at all. Instead, the following construction is employed: the numeral ‘2’, bɛr or more the Thai loan cʰɔŋ, precedes the head noun, and the word l ̥oj ‘extra’ follows (see below). This latter word is not attested in other usages beyond this construction. Other numerals have adopted distinct functions beyond their origin as numerals: mɔj ‘1’ is used as the third person singular pronoun. Also, it is shortened to mɔ-/ma-, in expressions like ma-n̥ ʌm ‘one year’ (like in Khmer). Furthermore, occurs in lexicalized forms like dəmɔj ‘alone’.34 The numeral pon ‘4’ is used quite frequently with the meaning ‘many’. Similarly, thɤŋ ‘5’ is used to mark plural number in the pronominal forms, without specifying the exact number of people involved.35 Days are counted in dusks: For instance, the way to say ‘in three days’ is cʰɔŋ lɛk l ̥oj (‘two dusk extra’), and the sun and moon are referred to as joŋ ‘father’ and ʔuj ‘mother’ respectively. 7

Sample Glossed Text

Excerpt from a story based on the picture book Frog where are you? by Mercer Mayer (2003), New York: Dial.

lac ʔa kwʌr duʔlon hot ɟur finish ? foreigner surprised fall go.down ‘The boy is so surprised that he falls down.’

ʔa lac ʔa ʔac pʌr leh nsit finish ? bird fly come ‘And then the bird flies out.’

ʔetak kwʌr gʌh kraw child foreigner prox fear ‘The boy is afraid.’

ʔac ʔa pʌr ɟak bird nsit fly go ‘But the bird flies away.’

34 35

A similar form may be used in attribution, as well: dəma lɛk ‘one day’. There are more examples of numerals being used with bleached semantics in specific constructions. For example bɛr ‘2’ is used to mark plurality in ʔɛw bɛr ‘children’, without indicating an exact number. (cf. Rischel 1995: 126)

1030

bätscher gəhit braɲ bah kwʌr kamlaŋ na a kʰit tana pen lam this dog no foreigner prog ? nsit think quot cop tree ‘Here the dog . . . no, the boy . . . thinks that that’s a tree.’

kibɛh ki mɛn lam, pen pol ̥ but neg true tree cop deer ‘But it’s not a tree, it’s a deer.’

mɔj ŋ̊ uh pol ̥ di glɤʔ 3sg sit deer poss head ‘The boy’s sitting on the deer’s head.’

8 Bibliography Chanan, Vongvipak. 1992. Economic and Social Change among the Mlabri. In Surin Pookajorn (ed.) The Phi Tong Luang (Mlabri): A Hunter-gatherer group in Thailand. Bangkok: Prasith Santiwattana, 92–103. Egerod, Søren and Jørgen Rischel. 1987. A Mlabri-English dictionary. Acta Orientalia 48: 35–88. L-Thongkum, Theraphan. 1992. The Language of the Mlabri (Phi Tong Luang). In Surin Pookajorn (ed.) The Phi Tong Luang (Mlabri): A Hunter-gatherer group in Thailand. Bangkok: Prasith Santiwattana, 43–65. Nimmanahaeminda, Kraisri. 1963. The Mrabri Language. Journal of the Siam Society 51.2: 179–184. Ratyanawong, Amonrat. 2012. A Semantic Analysis of Extended Meaning in Mlabri Body-Part Terms. MA thesis, Mahidol University. Rischel, Jørgen. 1995. Minor Mlabri. A Hunter-Gatherer Language of Northern Indochina. Copenhagen: Museum Tusculanum Press. ———. 2000. The Enigmatic Ethnolects of the Mlabri (Yellow-Leaf) Tribe. http://seal ang.net/sala/archives/pdf8/rischel2000enigmatic.pdf (accessed on June 8, 2014) ———. 2006. The “definite article” in Mlabri. Mon-Khmer Studies 36: 61–102. ———. 2007. Mlabri and Mon-Khmer: Tracing the history of a hunter-gatherer language. Copenhagen: Det Kongelige Danske Videnskabernes Selskab. Rossi A. 2012. Turning Red Rural Landscapes Yellow? Sufficiency Economy and Royal Projects in the Hills of Nan Province, Northern Thailand. ASEAS—Austrian Journal of South-East Asian Studies 5.2: 275–291.

section 9 Mangic

․․

chapter 16

Bugan1 Jinfang Li and Yongxian Luo 1 Background Bugan is the language and the name of an ethnic group who call themselves pə⁵⁵ka̱ n³³. Bugan speakers number approximately 3,000. They reside in the mountainous areas in the southeastern part of Yunnan Province on the Guangxi-Yunnan border, in southern Guangnan 广南 and northern Xichou 西畴 counties. They are found in the following villages: pə⁵⁵luŋ¹³ (Chinese: Laowalong 老挖龙), pə⁵⁵ʦuŋ¹³ (Ch. Xinwalong 新挖龙), pə⁵⁵ʦiaŋ³¹ (Ch. Jiuping 九坪), pə⁵⁵ɕe¹³ (Ch. Shibeipo 石碑坡), li³¹laŋ¹³ (Ch. Xinzhai 新寨), pə⁵⁵ɣu³¹ (Ch. Malong 马龙) and Nala 那腊. They co-inhabit with the Han Chinese in Jiuping, Xinzhai, and Nala; the other four villages are exclusively inhabited by Bugan. They are officially classified as Yi by the Chinese government. Linguistically their language belongs to the Austroasiatic stock. The local Han Chinese refer to the Bugan as Hualuo 花倮 ‘colourful Lolo’ or Huazu 花族 ‘colorful people’, Bugan women wear colorful traditional dresses made from triangular swathes of printed cloth sewed together. The Bugan do not consider themselves to be natives of Guangnan and Xichou but believed they had come before the Han, the Miao/Hmong and the Yao/Mien. Some Bugan people claim that their ancestors migrated from Jiangxi, Sichuan and Guizhou provinces, and that they have lived in Guangnan and Xichou for ten generations or more. The Bugan celebrate their New Year in April of the lunar calendar. During the festival days they sing native or Han songs, and dance to music played by a kind of Bugan instrument made of bamboo and calabashes. Nowadays only a handful of elderly people are still able to sing their native songs. Bugan women are accustomed to wearing long black skirts, colorful dresses and head wraps. Marriage used to be arranged primarily within their own ethnic group, although in recent years an increasing number of Bugan girls married young Han men.

1 The authors would like to record their thanks to Professor Jerold A. Edmondson for his help in writing this sketch. Any errors and misinterpretations are our own responsibility. This chapter is a revised and expanded version of an earlier study (Li 1996). © koninklijke brill nv, leiden, ��5 | doi ��.��63/9789004283572_022

1034

li and luo

The most common family names for the Bugan include Li 李, Wang 王, Guo 郭 and Lou 娄; other common family names include Yan 岩, Lu 陆, Pu 普 and Yi 益. All Bugan people speak their native language and adults also can speak Chinese. No dialect variation is found in Bugan. Data for the present study comes from the variety spoken in Nala Village 那腊寨 of Nasa Township 那洒镇of Guangnan County. 2 Phonetics/Phonology 2.1 Word/Syllable Structure Words can be monosyllabic or disyllabic, the majority being monosyllabic; examples: Monosyllabic words pi³⁵ ‘sun’ tɑ⁵⁵ ‘moon’ tɔ⁵⁵ ‘grandfather’ ʑu³³ ‘grandmother’ buŋ³1 ‘skin’ nɑ̱ u̱⁵⁵ ‘shoe’ li̱³⁵ ‘to change’

Disyllabic words bɔ⁵⁵mɔ³³ ‘star’ ŋgɔ1³nɑu⁵⁵⁽³³⁾ ‘thing’ fu⁵⁵dɑ1³ ‘money’ ʦi³1lo̱ ³⁵ ‘rabbit’ tɑ̱ ³⁵ti⁵⁵ ‘to give’ ȵou1³ʦɑu³1 ‘to frighten (sb.)’ e⁵⁵e⁵⁵ ‘sand’

Lexical roots are monosyllabic, disyllabic words are built up by affixation and compounding. Syllables are basically CV(V)(C)T. The minimal onset is a glottal stop [Ɂ], which is not marked in the description offered in Li (1996); maximal onsets include affricates and prenasalised stops, both homorganic and heterorganic (the latter being very infrequent in the data). Both diphthongs and triphthongs are recognized; the latter restricted to /iau/ and /iaɯ/ and could just as reasonably be treated as diphthongs following a medial /j/ but that approach is not taken here. Examples illustrating the diverse realizations of the monosyllabic template follow:

Open

Simple onset

Prenasalised onset

ʔa³³ ‘return’ ta³¹ ‘be close’ ʔa̱ i³̱ ³ ‘excrement’ na̱ i³̱ ¹ ‘flat, level’

ndɛ³⁵ ‘beat’ ŋgaɯ³⁵ ‘slippery (road)’ ŋqu³³ ‘village’

1035

bugan

Closed

Simple onset

Prenasalised onset

bi̱ou̱ ⁵⁵ ‘leech’ ʔa̱ m³¹ ‘warm’ na̱ ŋ³³ ‘sleep’ bi̱a̱ŋ³⁵ ‘claw’

ndzuŋ³³ ‘thin (human)’ ŋguaŋ¹³ɕɛ¹³ ‘intestines’

The Bugan sound system is characterized by a double series of initials, manifested in plain and prenasalized stops and affricates. Most vowels exhibit a tense (v) and lax (v) contrast. Final consonants, stops and tones are prone to change. A number of vocabulary items can be pronounced with either final stops or nasals with the same place of articulation. Quite a number of words (in citation form) can be pronounced in two different tones. Tone sandhi is common, particularly in fused syllables, e.g. sɑu³³ ‘bird’, bi³¹ ‘two’, bi³³ ‘clf’ → sɑu³³bi̱ə¹³ ‘two birds’; wi³¹ ‘we’, bi³¹ ‘two’, pɑu³¹ ‘clf’ → wi³¹bi̱ɔ³¹ ‘we two’; lɛ³¹ ‘possessive marker, belonging to’, ʔɔ³¹ ‘I’ → lio⁵⁵ ‘mine, belonging to me’. 2.2

Phoneme Inventory

2.2.1 Initials Bugan has 36 initials, listed below. pʰ tʰ ʦʰ p t ʦ mʦʰ mʦ b d mb nd ndz md mdz m n ȵ f θ, s ɕ w l ʑ

kʰ k q ʔ g ŋg ŋq ŋ x h ɣ

Voicing and aspiration are contrastive features in certain stops and in initial clusters of the type ‘nasal + affricate’: /p b pʰ , k g kʰ, t d th, mʦ mʦʰ mdz/. Durations of the prenasalized segments are longer than those of corresponding non-nasalized stops/affricates, but their nasal resonance is not very strong, sometimes it is only slightly perceptible.

1036

li and luo

With the tone 31, nasal /m/ in /mʦ mʦʰ mdz/ is also pronounced as stop /p-/: mʦe³¹~pʦe³¹ ‘three’, mʦʰɑ³¹~pʦʰɑ³¹ ‘to rub with the hands, make a cord’. Before /-i-/, /ʦ-/ is palatalized to [tɕ-]: e.g. ʦʰiou³¹ ~ tɕhiou³¹ ‘to sneeze’, ʦi̱u̱³⁵ ~ tɕi̱u̱³⁵ ‘to have money on sb.’ Prenasalized /ŋg/ sometimes is pronounced as a plain nasal [ŋ], e.g. ŋgɑ³¹~ ŋɑ³¹ ‘yellow’. Words taking initials g-, ŋq-, md-, and mdz- are very few in number; only one example can be found for each of /gŋq-/ in our data: gɑm³⁵ ‘to stab’, ŋqu³³ ‘village’. 2.2.2 Rhymes Bugan has a system of 10 simple vowels, a e ɛ i o ɔ u ə ɯ y, all with tense-lax contrast, except for y. Length is not contrastive. In addition, there are nasal vowels ã õ ũ ũ̠ ə̃. There are 13 diphthongs and 2 triphthongs. Rhymes can be open or closed with codas -m, -n, -ŋ, -p, -t, -k. Altogether there are 90 attested rhymes (including phonation contrasts), as listed below. Bugan rhymes: i/i̠ in iɛ̃ iɔ ia/ia̠ iã iap i̠a̠m iak iaŋ/i̠a̠ŋ iu/i̠u̠ i̠a̠ɯ̠ ie/i̠e ̠ io/i̠o̠ ioŋ/i̠o̠ŋ iou/i̠o̠u̠ y ɯ/ɯ̠ e/e̠ et en/e̠n ɛ/ɛ̠ ɛm/ɛ̠m ɛn/ɛ̠n ɛi/ɛ̱i ̱ a/a̠ ã/ã̠ ap am/a̠ m at/a̠ t an/a̠ n ak/a̠ k aŋ/a̠ ŋ au/a̠ u̠ aɯ/a̠ ɯ̠ ai/a̱ i ̱ ə ə̃ ən ɔ op ɔ̠k ɔ̠ŋ o õ oŋ/o̠ ŋ ou/o̠ u̠ u/u̠ ũ̠ uk uŋ/u̠ ŋ ui/u̠ i ̠ ua uã uan uaŋ uɛ

The three final stops, -p, -t, -k, are not released. Words with final stops are few in number; these words are found to occur only with tones 55, 33, 31 (only two examples are found in our data with tone 33). In a number of cases, words can be pronounced either with final nasal or stop (same place of articulation) or the coda can be lost altogether. E.g. ȵam⁵⁵ ~ ȵap⁵⁵ ‘to close one’s mouth’; ɕet⁵⁵ ~ ɕen⁵⁵ ‘goat’; biak⁵⁵ ~ biaŋ⁵⁵ ‘grape’; pə⁵⁵bop³¹ ~ pə⁵⁵bou³¹ ‘head’; nuk³¹ ~ no̠ ³¹ ‘to come’; wa̠ k³¹ ~ wa̠ ³¹ ‘iron pot’.

1037

bugan

A few words with nasalized vowels lose their nasalization or they may lose the final nasal, e.g. saŋ³³ ~ sã³³ ‘eight’, laŋ³¹ ~ lã³¹ ‘horse’. /iã iap uã ɛm ɔ̃ ə̃ / only occur in Chinese or Zhuang loans. For rhymes /i̱a̱ɯ̱ i̱a̱m iã iap iak ɛm ɛ̱m ɔk ən ə̱ /, only one example for each is found in our data. Most of those words take final nasal or stop. /uak/ appears only with one suffixed syllable: ho³¹ŋuak³¹ ‘slow + suffix → very slow’. 2.3 Suprasegmentals Vowels are either tense or lax to show vowel harmony; tense vowels and lax vowels do not co-occur in the same syllable. Compared with lax vowels, the tongue position for tense vowels is lower and further back, as /a/ in ta³¹ ‘near’, with the vowel realized as [a], but the tense /a̠/ in ta̠ ³¹ ‘to bet’ is close to [ɑ]. With syllables taking the rising tone, the tense vowels are easy to hear; the laryngeal setting is very tense; but with those taking the falling tone (31) the laryngeal setting is weak. Examples of tense-lax contrast include kui³³ ‘fire smoke’ cf. ku̠ i³̱ ³ ‘to dry by smoke’, luŋ³³ ‘well’ cf. lu̠ ŋ³³ ‘abyss, deep cave’. There are six tones in open or unchecked syllables (those ending in vowel coda and/or nasals): two level (55, 33), two rising (35, 13), one falling (31) and the neutral tone (atonality) ‘0’. The neutral tone ‘0’ occurs only with a number of prefixes. Three of these tones, high level 55, mid-level 33 and falling 31, occur in checked syllables. These occur with either lax or tense vowel syllables taking final -p -t -k or nasals in the same place of articulation. Bugan tone distribution: Tone

open syllables

checked syllables

55 high level 33 mid level 35 high rising 13 low rising 31 low falling 0

x x x x x x

x x

x

Tones are apparently very unstable in Bugan. A significant number of words can be pronounced with two tones, e.g. ʑou³³/⁵⁵ ‘earth’, do¹³/³³ ‘to burn the grass on waste land for cultivation’, ʦa¹³/³³ ‘to bite’, χau⁵⁵/³¹ ‘valley’, mʦʰi⁵⁵/¹³ ‘to milk’, lɛi¹³/³⁵ ‘the sting of a bee or wasp’, ʦi³³/³⁵ ‘to stretch out (one’s hand)’.

1038

li and luo

Tone change is common in speech contexʦ. Many show progressive assimilation, e.g. (the tone values in parentheses are in citation values) mə⁵⁵χɛ⁵⁵⁽³¹⁾ ‘taste bad’, mə⁵⁵nam⁵⁵⁽³⁵⁾ ‘one year’, bi³¹nam³¹⁽³⁵⁾ ‘two years’, nam³³⁽³⁵⁾ ni³³ ‘this year’, ɣo¹³tɛn³³⁽³⁵⁾ ‘very crowded’, pə⁰ loŋ³⁵⁽³³⁾ ‘upper, above’. 3

Word Formation

3.1 Compounding A compound is formed in any of several ways: parallel, verb-object, headmodifier, subject-predicate, or verb-complement. Parallel: (1)

tu³³-ʑau⁵⁵⁽³¹⁾ (2) mau³³-na³³ exit-go younger.brother-younger.sister ‘go out’ ‘sibling’

Verb-object: (3) ʦɔ̱³¹-ɕɔ̱³⁵ (4) bi³³⁽³⁵⁾-ma̱ n⁵⁵ hunt-game buy-wife ‘go hunting’ ‘marry (a woman)’

Head-modifier: Modifiers typically follow the head; a few modifiers may precede the head. (5) biou³³-ɣɔ³³ (6) ʑuŋ⁵⁵⁽³¹⁾-ʦe³¹ hill-rock foot-mountain ‘rock hill, karst formation’ ‘foot of mountain’ (7) da¹³⁽³⁵⁾-na̱ i⁵̱ ⁵ (8) da³⁵-ta̱ ɯ̱³⁵ water-jump water-field ‘wave’ ‘rice field’

Subject-predicate: (9) mɛ̱⁵⁵⁽³³⁾lã⁵⁵ (10) pə0qou⁵⁵-lu̱ ŋ³³ eye-blind sky-make.noise ‘the blind’ ‘thunder’

bugan

1039

Verb-complement: (11) ȵu³³-so̱ ³¹ (12) ka̱ i³̱ ³-sɔ̱³¹ do-easy stay-good ‘easy’ ‘comfortable’

3.2 Affixation 3.2.1 Prefixes About a dozen or so prefixes are found in Bugan. Their characterization as prefixes is somewhat arbitrary, the main point being that they do not occur in isolation as fully lexical morphemes. Prefixes can carry lexical tones; a few take the neutral tone. pə⁵⁵ – Prefix for body parʦ, kinship, nations, people, made objects, or locative nouns: -bop³¹/bou³¹ ‘head’, -san³³ ‘navel’, -du³³ ‘brother’, -dɛ³¹ ‘relatives by marriage’, -mio³¹ ‘the Miao/Hmong’, -kʰui⁵⁵ ‘the Han’, -kʰa¹³ ‘guest’, -ʑuŋ⁵⁵⁽³¹⁾ ‘old people’, -si⁵⁵ ‘broom’, -poŋ⁵⁵ ‘hammer’, -ɕi̱⁵⁵ ‘left side’, -ɕɑ⁵⁵ ‘right side’. tə⁵⁵/⁰ – Prefix for body parʦ, localities, or animals: -nou⁵⁵ ‘the back of the body’, -qo̱ u̱³⁵ ‘the palm of the hand’, -ka̱ u̱³⁵ ‘middle’, -ɣo⁵⁵ ‘inner, inside’, -kɔ̱³³ ‘tiger’, -kɛ̱³³ ‘squirrel’ te⁵⁵ – Prefix for plants, birds, rivers, fauna, made objects, or weather: -tɑŋ⁵⁵ ‘moss’, -ta̱ n³³ ‘pumpkin’, -ʦo³¹ ‘sparrow’, -kɔ̱ŋ³⁵ ‘shrimp’, -map³¹ ‘knot’, -ɕɛ³³ ‘waistband’, -ʦi³³ ‘snow’, -kuk³¹ ‘frost’. lə⁰ – Prefix for a few body part names: -kə³³ ‘lower jaw’, -po⁵⁵ ‘shoulder’. ȵa³³ – Prefix for small person and things: -la̱ ³³ ‘baby’, -mu̱ ³³ ‘little girl’, -ta̱ u̱³³ ‘piglet’, -ʦa̱ ŋ⁵⁵ ‘mouse’, -tie³¹ ‘index finger’, -hau³³ ‘taro’. pu⁵⁵ – Prefix in some animals: -lɑŋ³¹ ‘stallion’, -ɕen⁵⁵ ‘nanny (goat)’. mu³³ – Prefix in some plants and female animals: -pɑŋ³¹ ‘peach tree’, -san⁵⁵ ‘pine’, -lou³³ ‘rice straw’, -tɑ̱ u̱³³ ‘sow’, -ʦɑu³³ ‘bitch’. Some female animals take the prefix mu³³, but the corresponding males do not have the prefix pu⁵⁵, e.g. lə⁰pɔ³¹ tɑ̱ u̱³³ ‘boar’, ʦɑu³³li̱o̱³¹ ‘dog’. ʦe⁰ – Prefix in some birds and made objects: -qa³⁵ ‘duck’, -ŋɑŋ³⁵ ‘goose’, -vɑ̱ n³¹ ‘(ringnecked) pheasant’, -lou³⁵⁽¹³⁾ ‘pliers’. mə³³ – Prefix for certain ready-made tools or objects: -ʦɑ³¹ ‘hand straw cutter’, -dou³³ ‘firewood knife’, -pe̱⁵⁵ ‘patch’. o⁵⁵ – Prefix for locality: -du³³ ‘upper reaches (of a river), upper place (of land)’, -ndɑ³³ ‘the lower’, -loŋ³³ ‘sky’, -lɑ̱ ³¹ ‘under the sky’. di³³ – Prefix in January–March, or December of the Chinese lunar calendar: -ʦɑŋ³⁵ ‘January’, -ȵi³¹ ‘February’, -sã¹³ ‘March’, -liã³¹ ‘December’. mə⁵⁵ – Prefix in April–November of the Chinese lunar calendar: -pɑu³³ ‘April’, -mi³³ ‘May’, -bɔ⁵⁵ ‘November’.

1040

li and luo

3.2.2 Descriptive Formations A significant number of descriptive or adverbial formations are created with the addition of one or two syllables (if reduplicated) following the lexical root (these were characterized as suffixed forms in Li 1996). The added morphemes are obscure but can be characterized as descriptive formatives (desc). Examples: (13) ho³¹ ŋuak³¹ (14) nda̱ ³³ slow desc light (weight) ‘very slow’ ‘very light’

pə⁵⁵ lou⁵⁵ desc

(15) ndzou³¹ mi̱³¹ (16) la̱ n¹³⁽³¹⁾ tə0 tan¹³ bright desc naked desc ‘diamond bright’ ‘be stark-naked’ (17) na̱ ¹³⁽³⁵⁾ ndo⁵⁵ (18) luã¹³ black desc disorder ‘shiny black’ ‘in a mess’

hõ⁵⁵~hõ⁵⁵ desc~red

(19) pou³¹ pe ³³ (20) ŋa̱ ³⁵ wa³⁵~wa³⁵ careless desc dark desc~red ‘very careless’ ‘very dark’

3.2.3 Infix The morpheme xi³¹ can be analysed as an infix. It is a diminutive that indicates small female animals. E.g. mu³³li⁵⁵ ‘cow’, mu³³xi³¹li⁵⁵ ‘small cow’; mu³³lɑŋ³¹ ‘mare’, mu³³xi³¹lɑŋ³¹ ‘small mare’; mu³³ɕen⁵⁵ ‘nanny (goat)’, mu³³xi³ɕen⁵⁵ ‘small nanny (goat)’. 3.3 Alternations Alternation of vowels, consonants and tones as a morphological process is not uncommon in Bugan, a feature that is quite widespread in this linguistic area. These are word groups or doublets whose meanings are related to each other phonologically and semantically, apparently derived from common morphemes, although it is possible that some matchings are coincidental. Examples:

1041

bugan

Vowel alternation: kui³³ ‘fire smoke’ mda³³ ‘be light(of a lamp)’ luŋ³³ ‘well’ pʰɔ³⁵ ‘(maternal) grandfather’ mɯ³¹ ‘you (singular)’ ɕi⁵⁵po³¹(⁵⁵⁾ ‘yesterday evening’ ŋɔ³³ ‘peppery’ ʦa̱ ¹³ ‘to bite’ la̱ ³⁵ ‘to return’

kui̱³³ ‘to dry with smoke’ mda̱ ³³ ‘be light(of the sky)’ lu̱ ŋ³³ ‘abyss, deep cave’ pʰɛ³⁵ ‘(maternal) grandmother’ mi³¹ ‘you (plural)’ ɕa⁵⁵po³¹(⁵⁵⁾ ‘tomorrow evening’ ŋa³³ ‘salted’ ʦo̱ u̱³¹ ‘to eat’ li̱³⁵ ‘to change’

Tonal alternation: ʦo̱ ⁵⁵ ‘rake’ ʦʰuŋ³¹ ‘throat’ ʦa̱ n²⁴ ‘to small’ kʰo⁴⁴ ‘ladle’

ʦo̱ ³⁵ ‘to harrow’ ʦʰuŋ³³ ‘thirsty’ ʦa̱ n³³ ‘smelly’ kʰo²⁴ ‘to scoop’

Consonant alternation: la̱ ŋ³³ ‘clear(water)’ mʦʰa¹³ ‘to kill’ nap⁵⁵ ‘be closed’ duŋ³³ ‘hole’

ma̱ ŋ⁵⁵ ‘muddy(water)’ mʦa³¹ ‘to die’ ȵap⁵⁵ ‘to close one’s eyes, to shut up’ luŋ³³ ‘well’

3.4 Loanwords As the result of contact the Bugan language has borrowed words from Han and Zhuang languages, generally modern words. Some Bugan words look similar to Yi/Lolo or other Tibeto-Burman languages, but it is uncertain whether they are cognates or borrowings. Chinese loans: Bugan Chinese (pinyin) miou³¹ miào ‘temple’ pɛi³⁵ bēi ‘tablet’ ʦʰaŋ³¹ cháng ‘to taste’ mə³¹ mài ‘pulse’

1042

li and luo

-kan¹³ jīn ‘axe’ pin³⁵ bīng ‘soldier’ min³¹ míng ‘understand’ lə̃⁵⁵ɕi³¹ liángshi ‘grain’ hu³¹ʦiu¹³ hújiāo ‘pepper’ ʦa̱ u̱³³ jiǔ ‘wine’ ʦau³⁵ zào ‘kitchen range’

Zhuang loans: Bugan Wuming Zhuang -man¹³ -man⁵⁵ ‘plum’ -ʦiap³¹ kep⁵⁵ ‘chives’ -kɯ⁵⁵ kɯ³¹ ‘eggplant’ ke¹³ kve²⁴ ‘towel gourd’ tɛ̱⁵⁵ tai³³ ‘bag’ mɔ̃ ⁵⁵ mau⁴² ‘roof’ laŋ³³ liːŋ⁵⁵ ‘umbrella’ bə³⁵ bɯ³⁵ ‘be bored with’

4

Phrase and Clause Structure

4.1 Sentences Bugan is basically an SV/AVP language. Modifiers generally follow the modified. However, a number of adverbials precede their heads. Complements always follow their heads. (21) pə⁵⁵ka̱ n³³ ȵu³³mʦaŋ³⁵ ka̱ i³̱ ³ ʔo⁵⁵ biou³³ Bugan celebrate festival mountain top ‘The Bugan celebrates their festivals on top of the mountain.’ (22) mi³¹ ʔa³³ ta³³⁽³⁵⁾pɯ⁵⁵ li̱o̱³¹ ni³³ ʑa̱ ³³ mə⁵⁵ ʑa̱ ³³ 2pl come look.at coat prox good neg good ‘Please come here to have a look at my coat, and tell me if it is good or not.’ (23) ʔɔ³¹ ndɔ³¹ se³¹ ta̱ u̱³³ mə⁵⁵ni³³ 1sg like sell pig prox ‘I’d like to sell this pig.’

1043

bugan (24) kou¹³ ta̱ i³̱ ¹⁽⁵⁵⁾ mə⁵⁵ni³³ kaŋ⁵⁵ mo̱ ³¹ ni⁵⁵ʔa⁰ sweet.corn basket prox very heavy ptcl ‘This basket of corn is very heavy indeed.’

A head may often be modified by several attributives (modifiers). In these cases, adjectives are closest to the head, followed by personal pronouns; demonstratives are always the farthest from the head. (25) laŋ³¹ pou³⁵ mə⁵⁵ni³¹ horse white prox ‘this white horse’ (26) ma̱ n⁵⁵ ʔɔ³¹ mi³³ pau³⁵⁽³¹⁾ lɛ¹³ ki³³ daughter.in.law 1sg five clf all medl ‘All of my five daughters-in-law.’

For existential sentences (see 3.4.1 for further discussion), elderly speakers (60 years old and over) tend to place the existential verb ka̱ i³³ in between the head noun and the numeral modifier phrase, forming a split numeral-modifier expression where the head noun is separated from the numeral classifier by the existential verb. (27) li⁵⁵ ka̱ i³³ mi³³ ox exist five ‘There are five oxen.’ (28) xa³³ ka̱ i³̱ ³ mə⁵⁵ stick exist one ‘There is one stick.’

biə³³ clf

li̱³³ clf

4.1.1 Existentials and Possessives The canonical existential and possessive verb in Bugan is ka̱ i³̱ ³, which designates that someone possesses something or something exists in some place. In this construction type, the entity in existence or the possessee is typically placed before the verb ka̱ i³̱ ³, forming what may be regarded an PAV word order. (29) pə⁰qou⁵⁵ pi³⁵ ka̱ i³̱ ³ mʦe³¹ bɔ̱³¹⁽⁵⁵⁾ sky sun exist three clf ‘There are three suns in the sky.’

1044

li and luo

(30) toŋ¹³ saɯ³¹ ʔɔ³¹ ka̱ i³̱ ³ bi̱ɔ̱³¹ bucket wood 1sg exist pair ‘I have a pair of water buckeʦ.’ (31) mʦe̱³³ʦo̱ ³³ mɯ³¹ ka̱ i³̱ ³ ma³³ mʦɛ̱⁵⁵ banana 2sg exist or neg ‘Do you have any bananas?’

4.1.2 Assertion, Negation and Copula Clause A number of assertive/declarative sentences are verbless. (32) ki³³ ma̱ n⁵⁵ ʔɔ³¹ medl daughter.in.law 1sg ‘That is my daughter-in-law.’ (33) ʔi³¹ kʰɔ³³ pə⁵⁵kʰui⁵⁵ 3sg maybe Han ‘He may be a Han.’ (34) bi³¹ wɛ³³ du³³ ka̱ ŋ³¹ two brother blood.related ‘The two are blood brothers.’

Others are formed through the copula verb ‘be’, which has two forms, ni³³ and ȵu³³. ȵu³³ also has the meaning ‘do’; it is not often used as a copula verb. ni³³ may be related to ni³⁵ ‘this’. ni³³ and ȵu³³ are not used in negative declaratives. The concept of ‘not be’ is expressed by mə⁵⁵sa̱ ŋ⁵⁵. (35) ʔɔ³¹ ni³³/ȵu³³ pə⁵⁵ka̱ n³³ 1sg be Bugan ‘I am a Bugan.’ (36) ʔɔ³¹ mə⁵⁵ sa̱ ŋ⁵⁵ piau¹³ pə⁵⁵se³³ 1sg neg be person Guangnan ‘I am not a native of Guangnan.’ (37) mo̱ u̱³³ mə⁵⁵ni³³ lio⁵⁵ mə⁵⁵ sa̱ ŋ⁵⁵ knife prox my neg be ‘This knife is not mine.’

bugan

1045

4.1.3 Comparative Construction The canonical form for comparative is ‘COMPAREE + INDEX (verb/adjective) + ʦai³¹/pi¹³ + STANDARD’, with ʦai³¹/pi¹³ as comparative markers. (38) li⁵⁵ tʰɔ³¹ ʦai³¹/pi¹³ ta̱ u̱³³ ox big compar pig ‘An ox is bigger than a pig.’ (39) mɛ̱⁵⁵ ʔi³¹ sɛ̱³¹ ʦai³¹/pi¹³ mɛ̱⁵⁵ mɯ³¹ child 3sg smart compar child 2sg ‘His children are smarter than your children.’

4.2 Phrases At the phrase level, phrases may be syntactically analyzed as units bearing the syntactic relation of subject-predicate, verb-object, parallel, head-modifier, verb-complement, and numerical-classifier. 4.2.1 Subject-Predicate Phrase In these construction types, the subjects are usually formed by nouns, pronouns, nominal head-modifier phrases or parallel phrases. Predicates are usually formed by verbs, adjectives, verbal and adjective parallel (or headmodifier) phrases, or verb-complement phrases. (40) ʦioŋ³³ ŋgɔ³⁵ deer step.on ‘the deer stepped on’ (41) hɔŋ³³ zuŋ⁵⁵⁽³¹⁾ ʦioŋ³³ ʦan³³ footprint foot deer smelly ‘Footprints of deer are smelly.’ (42) ʔi³¹ no³¹ mə⁵⁵ ʦi³⁵ 3sg come one time ‘He has once come here.’ (43) we³³ lɛ³³ pe̱³⁵ ȵe³¹bio³³ di³¹ elder.brother and sister.in.law quarrel terribly ‘Elder brother quarreled with sister-in-law bitterly.’

1046

li and luo

(44) mɛ⁵⁵ ʔɔ³³⁽³¹⁾ ʑou¹³ tʰɔ³¹ ʑou¹³ ɣo³³ child 1sg also big also tall ‘My child is big and tall.’

4.2.2 Verb-Object Phrase These appear to bear a verb-object relationship between the verb and the object, although various semantic roles can be observed in the object. (45) te³¹ tʰuŋ³¹go³¹ worship buckwheat ‘to offer buckwheat as a sacrifice to (ancestors)’ (instrument) (46) taŋ³⁵ sa̱ ɯ³¹ lɛ³³ laɯ³⁵(¹³⁾ put garlic and prickly.ash ‘to put some garlic and Chinese prickly ashes (in the bag)’ (transferred entity) (47) ta̱ ³⁵ti⁵⁵ mɯ³¹ give 2sg ‘to give you’ (indirect object) (48) pə⁵⁵ka̱ n³³ ɕɔ³³ nuŋ³¹ʦi¹³kɔ³⁵ tʰie¹³ʦɯ⁵⁵ Bugan afraid pn Emperor ‘The Bugan were afraid of Emperor Nong Zhigao.’ (stimuli)

4.2.3 Parallel Phrase Parallel phrases can be formed through coordination of two or more nouns, pronouns, verbs, or adjectives with or without the use of conjunctions. (49) kaŋ⁵⁵ tʰɔ⁵⁵⁽³¹⁾ kaŋ⁵⁵ ɣo³³ very big very high ‘big and high’ (50) ɕu³³ ta̱ u̱³³ ɕu³³ ʦa̱ ³⁵ meat pig meat chicken ‘pork and chicken’

bugan

1047

(51) ʔɔ³¹ lɛ³¹⁽³³⁾ ʔi³¹ 3sg and 1sg ‘he and I’ (52) mɯ³¹ lɛ³³ ȵa³³mu̱ ³³ ʑa̱ ⁵⁵ pɔ³¹ 2sg and girl dance reed.pipe ‘You and the girls dance following the music (played by a reed pipe wind instrument).’ (53) ʦaŋ³¹ ʔa³³ ʦou̱ ³¹ take return eat ‘take (it) back and eat’

4.2.4 Head-Modifier Phrase There are two types. When nouns act as the head they may be modified by adjectives, pronouns, nouns, or numerical classifier phrases (see § 3.1.1.). When verbs or adjectives act as the heads, they may be modified by adverbs and time nouns (see § 3.1.5. and § 3.1.6.) 4.2.5 Verb-Complement Phrase Verbs can be followed by adjectives, verbs, or numerical classifier phrases to form a verb-complement phrase. The complement is to describe the conditions or situations in which an event is being conducted. (54) ȵe³¹ bio³³di³¹ quarrel terribly ‘to quarrel terribly’ (55) ka̱ n³⁵ tʰuŋ³³ cut fall ‘to cut down’ (56) bi³⁵ ʦu̱ ³¹ bi³¹ ʦʰe¹³ hold cont two clf ‘to hold two’ (57) ʑau³¹ ɣau³¹sau⁵⁵ ʦu̱ ³¹ mə⁵⁵ nam⁵⁵ a0 go read cont one year ‘have gone to the school for one year’

1048

li and luo

(58) ʔi³¹ ɣo³³ mʦe³¹ ʦʰɯ³³ 3sg high three measure ‘He is three chi (a unit of length = 1 meter) high.’

Complements may occur as a clause. (59) ʔɔ³¹ ʑaɯ³¹ bɔ⁵⁵ mə³³ bɔ⁵⁵ mda³³ te³¹ qou³³ nan⁵⁵~nan⁵⁵ 1sg see star one clf shine very bright desc ~red ‘I saw a star (which was) shining bright.’

4.2.6 Serial Verb Constructions Verb serialization is a feature of Bugan. A string of verbs usually occur in a sentence without any grammatical linking device. Sometimes, a noun phrase may be inserted in between the verbs. Various semantic relations can be observed between the verbs and the noun phrase(s). (60) mɛ̱⁵⁵ mə³³ ka̱ i³³ ŋa⁵⁵ʑuŋ³¹ bi³³⁽¹³⁾ ʦu̱ ³¹ bi³¹ ʦʰe¹³ ʔɔŋ³⁵ child you stay door hold cont two clf sugarcane ‘Your child stayed at the door carrying two sugarcanes.’ (Verb 2 is accompanying action of Verb 1) (61) ʔɔ³¹ ʨaŋ³¹ ʦe³³ kɔ³¹ po³¹ huã¹³ ɕu³³ ʔa³³ ʦou̱ ⁵⁵ 1sg take rice little.bit go exchange meat come eat ‘I’ll take some rice to exchange for some meat to eat.’ (Verbs 2, 3 and 4 indicate purpose of Verb 1)

5

Word Classes

The following word classes can be recognized for Bugan: nouns, pronouns, numerals, classifiers, verbs, adjectives, adverbs, structural particles, prepositions, conjunctions, discourse particles. 5.1 Nouns 5.1.1 Common Nouns Common nouns can stand by themselves, or can be modified by adjectives, pronouns, or numeral classifiers. Modifiers generally follow the head (noun). (62) laŋ³¹ pou³⁵ horse white ‘white horse’

1049

bugan (63) wɛ³³ mɛ̱⁵⁵ younger.brother child ‘the child’s younger brother’ (64) mɛ̱⁵⁵ ʔɔ³³⁽³¹⁾ child 1sg ‘my children’ (65) mɛ̱⁵⁵ ki³³ child medl ‘that child’

Numeral-classifier constructions usually take the form of N-Numeral-Classifier. (66) ʔɔ³¹ bi³³⁽³⁵⁾ mo̱ u̱³³ mə⁵⁵ ʦʰe¹³⁽³⁵⁾ 1sg buy knife one clf ‘I bought a knife.’

When the quantity of the noun is one, it can be directly modified by bɔ̱⁵⁵ ‘one, clf’, which is a fusion of num + clf. (67) wɔ̱³³ bɔ̱⁵⁵ stone clf ‘one stone’ (68) puŋ⁵⁵ bɔ̱⁵⁵ flower clf ‘one flower’ (69) bɔ⁵⁵ kou¹³ bɔ̱⁵⁵ corn clf ‘one corn (cob)’ (70) pɯ̱ ⁵⁵ li̱o̱³¹ coat ‘one coat’

bɔ̱⁵⁵ clf

Nouns generally cannot be reduplicated, except those that are concurrently used as classifiers or measures: hɛ⁵⁵ hɛ⁵⁵ ‘every day’.

1050

li and luo

5.1.2 Pronouns and Question Words 5.1.2.1 Personal Pronouns Bugan personal pronouns distinguish between singular, dual, and plural. First person plural pronouns distinguish between inclusive and exclusive. They invariably share the mid-falling tone, 31. The dual is formed by adding bi̱ɔ̱³¹ ‘two’ to the plural pronouns. Bugan personal pronouns person singular plural 1 ʔɔ³¹ wi³¹ (inclusive) pɛ³¹ (exclusive) 2 mɯ³¹ mi³¹ 3 ʔi³¹ hɛ³¹

dual wi³¹ bi̱ɔ̱³¹ mi³¹ bi̱ɔ̱³¹ hɛ³¹ bi̱ɔ̱³¹

It is worth noting that personal pronouns undergo a tone change into high tone 55 when used as possessive pronouns. E.g.: pa²⁴ ʔɔ⁵⁵ ŋɡu⁴⁴ pɛ⁵⁵ tɕo³¹ ʔi⁵⁵

‘my father’ ‘our village’ ‘his uncle’

There is also a reflexive pronoun mbaŋ⁴⁴ mbi⁴⁴, ‘self, one’s own’. 5.1.2.2 Interrogative Pronouns Interrogative pronouns are formed with the interrogative element -pau³⁵: E.g: ʔo⁵⁵pau³⁵ ʦə⁵⁵pau³⁵ tə⁵⁵pau³⁵ ɣen⁵⁵pau³⁵ ɣɯ³¹pau³⁵ mɯ⁵⁵ pau³⁵ ta⁵⁵ pau³⁵

‘what place, where’ ‘what time, when’ ‘how many, how much’ ‘why, for what’ ‘how, in what way/manner’ ‘which one, anyone’ ‘which ones’

As can be seen from above, some interrogative pronouns also function as indefinite pronouns, such as mɯ⁵⁵ pau³⁵, which also means ‘anyone’. This is also true of the interrogative pronoun mə0dze⁵⁵ ‘what’, which may also be used as an indefinite pronoun ‘whatever’, as in

bugan

1051

(71) kai⁴⁴ mə0dze⁵⁵ ti̱⁴⁴ mə0dze⁵⁵ exist what give what ‘Give whatever is available.’

5.1.2.3 Demonstrative Pronouns and Deictic Words Demonstrative pronouns have two basic forms: proximal ni³³ ‘this’, medial ki³³ ‘that’, and distal kɛ̱³³ ‘that (far away)’. They can be used to form deictic expressions: ta⁵⁵ni³³ ‘here’ ta⁵⁵ki³³ ‘there’ ʦə⁵⁵ni³³ ‘now’ ʦə⁵⁵ki³³ ‘then’ mə⁵⁵ni³³ ‘this side’ mə⁵⁵ ki³³ ‘that side’ nɑ̱ ŋ¹³ ni³³ ‘like this’ nɑ̱ ŋ¹³ ki³³ ‘like that’ ɣɯ³¹ni³³ ‘this way’ ɣɯ³¹ ki³³ ‘that way’

There are three deictic words to indicate distal: ki³³/dɔ̱¹³ ‘that far’, kɛ̱³³ ‘that farther’, o⁵⁵kɛ̱³³ ‘that farthest’. dɔ̱¹³ is not often used. (72) mɛ̱⁵⁵ ki³³ xu⁵⁵⁽¹³⁾ laŋ³¹ child medl ride horse ‘That boy is riding a horse.’ (73) mɛ̱⁵⁵ kɛ̱³³ ʦo̱ u̱³¹ fɛi³³ child dist eat meal ‘That boy over there is having a meal.’ (74) mɛ̱⁵⁵ ki³³ mʦʰau³¹ li⁵⁵ child medl herd oxen ‘That boy yonder is herding cattle.’

Pronouns cannot be modified or reduplicated. 5.1.3 Numerals Bugan numerals from 1 to 10 are given below. bɔ⁵⁵ ‘one’ pi̱o̱³³ ‘six’ biɔ³¹ ‘two’ po̱ u̱³¹ ‘seven’ mʦe³¹ ‘three’ sɑ̃ ³³ ‘eight’

1052

li and luo

pɑu³³ ‘four’ ɕi³³ ‘nine’ mi³³ ‘five’ mɑ̃ ³¹ ‘ten’

‘Eleven’ to ‘nineteen’ are formed by mɑ̃ ³¹ ‘ten’ plus the cardinals: mã³¹bɔ⁵⁵ ‘eleven’ mã³¹bi³¹ ‘twelve’ mã³¹ɕi³³ ‘nineteen’

“Twenty’ to ‘ninety’ are formed by the relevant cardinals plus the numeral mã³¹, as: bi³³⁽³¹⁾ mã³¹ ‘twenty’ pɑu³³ mã³¹ ‘forty’ pi̱o̱³³ mã³¹ ‘sixty’

Uniʦ for ‘hundred’, ‘thousand’ and ‘ten thousand’ are: ʑu³¹ ‘hundred’, tʰiaŋ¹³ ‘thousand’, vã¹³ ‘ten thousand’. It is worth noting that the numerals ‘one’ and ‘two’ each have two forms: mə⁵⁵, bɔ⁵⁵ ‘one’ and bi³¹ , biɔ³¹ ‘two’. Each of the two forms has different uses. Typically, mə⁵⁵ ‘one’ and bi³¹ ‘two’ are used with classifiers to form numeral clf phrases. (75) piau¹³⁽³⁵⁾ mə⁵⁵ pau³¹ person one clf ‘one person’ (76) piau¹³⁽³⁵⁾ bi³¹ pau³¹ person two clf ‘two persons’

mə⁵⁵ ‘one’ and bi³¹ also occur with the uniʦ ‘hundred’, ‘thousand’ and ‘ten thousand’ and other compound expressions, as in bi³¹ʑu³¹ ‘two hundred’. The number ‘101’ is represented as mə⁵⁵ʑu³¹ lɛ³³bɔ⁵⁵, lit. ‘one hundred and one’, and ‘10,010’, mə⁵⁵ vã¹³ lɛ³³ mɑ̃ ³¹, lit. ‘one ten-thousand and ten.’ Numerals generally cannot be modified or reduplicated. 5.1.4 Classifiers Classifiers can be divided into two main types—classifiers for nouns (including sortal classifiers and measure words), and classifiers for verbs.

1053

bugan

5.1.4.1 Classifiers for Nouns These make up the majority in the classifier system of Bugan. Examples: pau³¹ for people biə³³ for animals li̱³³ for long-shaped objects ʦʰe¹³ for objects, rivers ts̱uŋ⁵⁵ for plants liu³³ for thin, sheet objects ʦam³³ pair pam³⁵ portion

piau¹³⁽³⁵⁾ bi³¹ pau³¹ li⁵⁵ mʦe³¹ biə³³ xa³³mə⁵⁵ li̱³³ ɣou⁵⁵ mə⁵⁵ ʦʰe¹³ saɯ³¹ pau³³ts̱uŋ⁵⁵ ɣai³⁵ mə⁵⁵ liu³³ sau³³ mə⁵⁵ ʦam³³ fɛi³³mə⁵⁵ pam³⁵

‘two persons’ ‘three oxen’ ‘one stick’ ‘one river’ ‘four trees’ ‘one sheet of paper’ ‘a pair of birds’ ‘a portion of food’

5.1.4.2 Classifiers for Verbs Verbal classifiers are fewer in number, but they are nonetheless of frequent use. ʦi³⁵ fɛ³¹ mʦa³⁵ tuŋ⁵⁵ ŋga³¹ ta̱ ŋ³³

time frequency for trips for eating for biting for sleeps, naps for hitting with fist

ʑau³¹ mə⁵⁵ ʦi³⁵ ken³¹ mbei³¹fɛ³¹ ʦo̱ u³¹ pi³¹ mʦa³⁵ ʦa³³ mə⁵⁵ tuŋ⁵⁵ ṉaŋ³³ mə⁵⁵ ŋga³¹ ndɛ³¹ mə⁵⁵ ta̱ ŋ³³

‘have been there once’ ‘return twice’ ‘have three meals’ ‘be bitten once; given a bite’ ‘have a nap/sleep’ ‘give sb. a punch with fist’

Classifiers can be modified by numerals and demonstratives. (77) ma°kua³¹ mi³³ ʦiu³³ song five clf ‘five folk songs’ (78) fɛi³³ pau³³ pam³⁵ food four clf ‘four portions of food’ (79) ʑiu⁵⁵ bɔ̱⁵⁵ ni³³ spoon clf prox ‘this spoon’ (80) a̱ u̱³¹ tʰa³³ ki³³ firewood bundle medl ‘that bundle of firewood’

1054

li and luo

A few classifiers/measures can be reduplicated to convey the meaning of ‘every, all’. (81) hɛ⁵⁵ hɛ⁵⁵ day day ‘every day, always’ (82) mai⁵⁵ mai⁵⁵ month month ‘every month’

5.2 Verbs A number of verbs have causative forms, marked by ‘ŋgɔ̱³⁵ + verb’; ŋgɔ̱³⁵ is derived from the lexical meaning ‘to drive out’. xo̱ u̱³⁵ ‘ride (a horse )’ ŋgɔ̱³⁵xo̱ u̱³⁵ pa̱ ɯ̱³¹ ‘get up’ ŋgɔ̱³⁵ pa̱ ɯ̱³¹ tu³³ ‘exit’ ŋgɔ̱³⁵tu³³

‘to make/force sb. to ride (a horse)’ ‘to make sb. get up’ ‘to order/force sb. or sth. to get out’

Activity verbs have durative (continuous) aspect, marked by ‘sai³³ + verb +naŋ³¹’ or ‘verb +naŋ¹³ ’. (83) ʔɔ³¹ sai³³ pioŋ¹³ 1sg dur write ‘I’m writing.’ (84) mɯ³¹ ȵu³³ naŋ¹³⁽³¹⁾ mə0dze⁵⁵ 2sg do dur what ‘What are you doing?’ (85) li⁵⁵ sai³³ ʦo̱ u̱³¹ ʦiu⁵⁵ naŋ³¹ ox dur eat grass dur ‘The ox is eating grass.’

ʦu̱ ³¹ follows verbs, which may be analyzed as marking a continuous action or situation. (86) mɛ̱⁵⁵ be³⁵ ʦu̱ ³¹ ʔɔŋ³⁵⁽¹³⁾ bi³¹ ʦʰe¹³ child hold cont sugar.cane two clf ‘This boy is holding two sugar cane.’

1055

bugan

Verbs can be modified by adverbs and auxiliary verbs. (87) pe̱¹³⁽³⁵⁾ mə⁵⁵ ɣen⁵⁵ me¹³ elder.sister neg resemble mother ‘Elder sister doesn’t take after her mother.’ (88) ʔɔ³¹ fa̱ ŋ⁵⁵ ʑau⁵⁵⁽³¹⁾ nai⁵⁵/nɛ̱³¹ 1sg want go market ‘I like to go to the market.’

Action verbs may be reduplicated to denote an act is carried out in a tentative or casual manner, as in tu⁴¹ ʑau⁴⁴ ȵa³¹ ȵa³¹ ‘exit-go-walk-walk -go out for a walk’. 5.3 Adjectives Adjectives can be modified by different adverbs to indicate degrees or intensity of the quality described by the adjective in question. Comparative constructions can be formed through the following: Equality: Comparative: Superlative:

‘adjective + te̱³¹ ‘adjective + te³¹lɛ³¹ kaŋ⁵⁵ + adjective + lɛ³¹

(89) ȵu³⁵ ʔɔ³¹ lɯ³¹ te̱³¹ house 1sg good indeed ‘My house is good (as it is).’ (90) ȵu³⁵ mɯ³¹ lɯ³¹ te̱³¹lɛ³¹ house 2sg good compar ‘Your house is better.’ (91) ȵu³⁵ ʔi³¹ kaŋ⁵⁵ lɯ³¹ lɛ³¹ house 3sg super good super ‘His house is the best.’

Adjectives can be modified by adverbs. (92) mo³¹ te̱³¹ heavy little.bit ‘a bit too heavy’

1056

li and luo

(93) kaŋ⁵⁵ mo ³¹ very heavy ‘very heavy’ (94) mə⁵⁵ lou³³ neg cold ‘not cold’ (95) tə0ka³³ ȵu¹³ immediately do ‘do immediately’

Most adjectives can be reduplicated to indicate intensity: (96) ŋga³¹ ŋga³¹ yellow yellow ‘deep yellow’ (97) ndzuŋ³³ ndzuŋ³³ thin thin ‘very thin (of person)’

5.4 Adverbs Adverbs cannot be modified; their main grammatical function is to modify verbs and adjectives. Most of them occur before their heads, only few adverbs follow their heads. Adverbials are formed by adverbs, adjectives, and time words. Some examples: (98) hɛ³³ ni³³ ʔɔ³¹ mʦʰa¹³ ʦe0qa³⁵ mi³³biə³³ today 1sg kill duck five clf ‘I have killed five ducks today.’ (99) pə⁵⁵ka̱ n³³ ka̱ i³̱ ³ qha³⁵ʦu³³ ȵu³³ mʦaŋ³⁵ Bugan exist way do festival ‘The Bugan celebrated their festivals on the way (when they moved).’ (100) ɣa³¹ tʰu⁵⁵ ni³³ kʰɔ̱ŋ³⁵ so̱ u̱⁵⁵ ʔo⁵⁵ pɛ¹³tou¹³ take vegetable prox all put inside basket ‘Put all vegetables in the basket.’

1057

bugan

(101) ʔi³¹ ba̱ n⁵⁵ ʔo⁵⁵ tɛ̱⁵⁵ ʔɔ³³ be³⁵ tʰam⁵⁵⁽³³⁾ ʦa̱ ¹³ bi̱ɔ̱³¹ no̱ ³¹ 3sg from inside pocket pull take egg bite two come ‘He took two eggs out from his pocket.’

5.5 Prepositions Prepositions form prepositional phrases with nouns and pronouns. (102) pə0 nam⁵⁵ tʰɛ³¹ ʦə⁵⁵ni³³ last year to now ‘(from) last year until today’ (103) pi¹³ tʰɔ⁵⁵⁽³¹⁾ ʦai³¹/pi³³ ta⁵⁵ sun big compar moon ‘The sun is bigger than the moon.’ (104) i³³⁽³¹⁾ la̱ i⁵̱ ⁵ ba̱ n⁵⁵ pa⁵⁵ saɯ³¹ He fall from top tree ‘He has fallen from the top of a tree.’

ʔa³³ return

5.6 Structural Particles A number of structural particles are found in Bugan. A few are discussed below. mə⁵⁵ precedes adjectives to indicate the quality or condition of something or somebody. (105) mə⁵⁵ la̱ ¹³ red ‘(It) is red; (sth.) that is red’ (106) mə⁵⁵ ŋga⁵⁵⁽³¹⁾ yellow ‘(It) is yellow’

la³³ is a possessive marker. It precedes nouns and pronouns to form a possessive phrase. (107) ȵu³³ mə⁵⁵ni³³ la³³ mɯ⁵⁵⁽³¹⁾ house prox belong.to 2sg ‘This house is yours.’

1058

li and luo

lɛ³¹ is also a possessive marker. It follows nouns and pronouns to mark possession. (108) mou̱ ³³ ʔɔ³¹ lɛ³¹ knife 1sg belong.to ‘The knife is mine.’

In some cases, la³³ and lɛ³¹ may co-occur to double mark a possessive. (109) la³³ ʔɔ⁵⁵ be 1sg ‘(This is) mine.’

lɛ³¹ belong.to

Three particles, la⁴⁴, lɛ⁵⁵ and ni⁴⁴, are used as nominalizers. (110) ʦo̱ u̱³¹ la⁴⁴/ni⁴⁴ eat nml ‘things to eat, food’ (111) ʦo³¹ lou⁴⁴ ȵu³³ lɛ⁵⁵ use iron make nml ‘(that which is) made from iron’ (112) tʰo³¹/ʦei⁵⁵ lɛ⁵⁵ big/small nml ‘big/small thing(s)’ (113) maŋ⁴⁴ ni⁴⁴ purple nml ‘purple thing(s)’

There is still a very commonly-used particle ti̱³¹, which functions as an enclitic to mark a complement describing a situation as a consequence of a prior event. (114) ʔi³¹ lau⁴⁴ ti³¹ ma²⁴ ndzen⁴⁴ ʔa⁴⁴ 3sg cold cons tremble disc ‘She was trembling from cold. (Lit. she was so cold (that) she was trembling).’

1059

bugan

5.7 Co-Verbs, Prepositions and Conjunctions Co-verbs or prepositions in Bugan include the object-fronting marker mbei²⁴, the benefactive marker ti⁵⁵ ‘give, for, to’, the passive marker tɕo³¹, the localizer kai⁴⁴ ‘exist, be at, at’ and the comparative marker bi⁴⁴. Many are grammaticalized from lexical verbs. (115) tʰaŋ⁴⁴ tʰo⁵⁵ mbei²⁴ sɯ̱ ³¹ pʰɔ²⁴ qa̱ ŋ³¹ ʔa⁴⁴ wind big obj tree blow fall disc ‘The strong wind has brought down trees.’ (116) ʔi³¹ tɕo³¹ pi³¹kua⁴⁴ naŋ²⁴ ʔa⁴⁴ 3sg pass wasp sting disc ‘He got stung by wasps.’ (117) ʔi³¹ nda⁴⁴ ti⁵⁵ ʔɔ³¹ fu³¹ da²⁴ 3sg lend give 1sg money ‘He lent me a lot of money.’

ka̱ ŋ³¹ very

nau³¹ many

(118) kai⁴⁴ ʦau⁴⁴ mbi⁴⁴ na̱ ŋ⁴⁴ kai⁴⁴ tɯ⁴⁴ qau⁴⁴ tɕou⁴⁴ exist dog one sleep exist middle road disc ‘There was a dog sleeping on the road.’

Conjunctions are used to conjoin content words, phrases, and/or clauses. (119) wɛ³³ lo³¹ du³³ elder.brother and younger.brother ‘elder brother and younger brother’ (120) mʦe̱³³ ʦo³³ ka̱i ̱³³ ma³³ mʦe⁵⁵⁽³¹⁾ banana have or neg.have ‘(Do you) have any bananas?’ (121) ʔi³¹ en³¹ bi⁴⁴ kau³¹ qa⁵⁵ tʰa³¹ nda²⁴ ken³¹ ȵou⁵⁵ 3sg wash finish cloth then carry water return home ‘Having finished her washing, she carried water back home.’

5.8 Discourse Particles Bugan has a number of discourse particles. These usually occur at the end of a sentence to indicate certain mood or modality on the part of the speaker. They express declarative/assertive, imperative/command, request, or interrogative meanings.

1060

li and luo

ma¹³, ʔa³³ occur with assertive. (122) ʔi³¹ mə⁵⁵ no³¹ ma¹³ 3sg neg come asrt ‘He is not coming here.’ (123) ʑau³¹ ɣau³¹sau⁵⁵ ʦu³¹ mə⁵⁵ nam⁵⁵ ʔa³³ go read cont one year return ‘He has gone to school for one year.’

ni⁵⁵ occurs with interrogatives. (124) ʔi³¹ no³¹ mə⁵⁵ no³¹ ni⁵⁵ 3sg come neg come q ‘Is he coming?’

la0 and no³¹ occur with imperatives and hortatives. (125) wi³¹ 1pl.inc ‘Let’s go.’

na⁵⁵ la0 go imp

(126) mɯ³¹ na⁵⁵ ha⁵ no³¹ 2sg go first imp ‘You go first.’

6

Glossed Text

The Sun and the Chicken Narrated by Wang Zhiyou, Manlong Village of Xichou County of Wenshan Zhuang and Miao Autonomous Prefecture (Yunnan). Recorded and transcribed by Li Yunbing. This dialect of Bugan is slightly different from the one described in the sketch above. pei²⁴ tɕaŋ³¹ tɕa̠ ²⁴ sun and chicken ‘The sun and the chicken.’

bugan

1061

tɕaɯ³¹.kei⁴⁴, pei²⁴ kai⁴⁴ ɕi̠⁴⁴ mou⁴⁴ na⁴⁴, before sun exist nine sisters ‘Long ago, there were nine suns in the sky.’ xa³¹ ɣɔ̠⁴⁴ tɕi̠⁴⁴ ɣɔ̠⁴⁴ ka̠ ³¹, xa³¹ lɯ̠ ³¹ tɕi̠⁴⁴ lɯ̠ ³¹ tɕo̠ u³¹, sundry stone then stone crack sundry grass then grass dry ‘The heat cracked the rocks and burnt all the grasses and plants.’ xo⁵⁵.mɯ⁴⁴.ɣi̠³¹ lɛ²⁴ mɯ⁵⁵ ɣoŋ²⁴ tu̠ ³¹ ma²⁴, fairy add neg bear able disc ‘Even the fairy couldn’t stand it’. ɕi⁴⁴ ʑa̠ u³¹ ptsou⁴⁴ qoŋ⁴⁴ ʑa̠ u³¹, then go to pond go ‘and had to get into the pond.’ qa³¹ la³¹ ʨʰaŋ⁴⁴ pjou³¹ po³¹ tɕa̠ ²⁴, then invite heavenly.dog come bite ‘(She had to) ask the heavenly dogs to eat (the suns).’ pei²⁴ ɕi⁵⁵ kou⁴⁴ loŋ⁴⁴ ʑa̠ u⁵⁵ ma²⁴, ɕi sun then hide hole go disc then ‘but the latter went hiding in their burrows.’ la⁴⁴ ȵo̠ u⁴⁴ ku³¹ ŋa̠ ²⁴ ma²⁴, again do sky dark disc ‘making the sky dark.’ pʰi³¹ mɯ⁵⁵.ȵdʑi⁴⁴ qa²⁴ lɛ⁵⁵ mɯ⁵⁵ no̠ ³¹ ma²⁴, call what call add neg come disc ‘The suns refused to come out despite numerous requests (from all kinds of animals).’‘ ʨʰaŋ⁴⁴ tɕau⁴⁴ qa²⁴ ma²⁴, qa²⁴ lɛ⁵⁵ invite bamboo.chicken call disc call add ‘The bamboo chicken was (the first to be) asked to call (the suns out).’ mɯ⁵⁵ no̠ ³¹ ma²⁴ neg come disc ‘but (they still) hid (themselves in their burrows).’

1062

li and luo

pu⁵⁵ dou²⁴ ma²⁴, tɕa̠ ²⁴ la⁴⁴ qa²⁴ sai⁴⁴, i³¹ tu⁴⁴ no̠ ³¹ ma²⁴ later.on ptcl chicken then call again 3sg exit come disc ‘Later on, after many attempts, the chicken made suns come out.’ tɕau⁴⁴ lɛ̠³¹ ma²⁴, bamboo.chicken shy disc ‘The bamboo chicken felt embarrassed.’ tɕau⁴⁴ qʰe̠i³¹ ma²⁴, ʑo̠ ŋ⁵⁵ ko³¹ la̠ ⁴⁴ ma²⁴ bamboo.chicken run disc foot a.bit red disc ‘and (the chicken) ran away, its feet turning red from running.’ ɣo²⁴ qo⁵⁵ mɯ⁵⁵ ɕaŋ⁵⁵ ma²⁴, i³¹ qo³¹ mbjau⁴⁴ ʑa̠ u⁵⁵ since then neg raise disc 3sg then hill go ‘From then on, the bamboo chicken turned into wild birds, living in the bush.’

7 References Li, Jinfang. 1996. Bugan—a new Mon-Khmer language of Yunnan Province, China. Mon-Khmer Studies 26: 135–159. Li, Yunbing. 2005. Bugengyu Yanjiu [A Study on Bugeng (Bugan)]. Beijing: Minzu Chubanshe. Wu, Zili. 1992. Guangnan Ben’ganyu Chutan [An initial investigation of Bengan]. Yunnan Minzu Yuwen 4.

section 10 Palaungic

․․

chapter 17

Dara’ang Palaung1 Sujaritlak Deepadung, Ampika Rattanapitak and Supakit Buakaw 1 Background Palaung (pce) belongs to the Palaungic branch of Austroasiatic (formerly known as Palaung-Wa). Generally, the Palaung languages are further divided into Shwe (Golden Palaung, from Burmese ɕwe ‘gold’) or Ta-ang, Rumai, and Ruching (Ngwe Palaung, from Burmese ŋwe ‘silver’), or Pale (Milne, 1924; Janzen, 1978; Howard, 2005; Ashley, 2011; Lewis, Simons & Fennig, 2013). Dara’ang Palaung (DP), the focus of this grammar sketch is a variety spoken at Nor Lae village, Fang district in Chiang Mai Province, northern Thailand. It belongs to the Silver Palaung group, which is spoken in the southern and eastern parts of Shan State (Janzen, H. and Janzen, M., 1972; Howard, 2005; Ostapirat, 2009). The word ‘Palaung’ is used as a cover term for both the people and their language. The Palaung live in mountainous areas of three countries: Shan State in Myanmar (Burma), southern Yunnan Province in China, and northern Thailand. The majority of Palaung speakers live in Myanmar. Most of the Palaung people in northwestern Shan State and the southern Kachin State are of the Ta’ang2 or Golden Palaung and Rumai groups (Milne, 1924). According to Deepadung (2011), the Palaung speakers in the southern part of the Shan State, especially in Kengtung areas, identify themselves as Dara’ang. They are also referred to as Pale or Silver Palaung in other studies (Janzen, H. & Janzen, M., 1972; Ashley, 2011). Howard (2005: 25) states that there are about 600,000 Palaung in Myanmar, i.e., 150,000 Golden Palaung, 150,000 Rumai and 300,000 Silver Palaung. However, according to Ethnologue (Lewis, Simons & 1 This study is part of the on-going research project entitled “Palaung Dialects in China, Myanmar, and Thailand: Phonology, Lexicons, and Sub-grouping”, which is supported by the Thailand Research Fund (TRF), Grant number BRG5580020 (2012–2014). The authors would like to express their gratitude to the TRF. And thanks also go to Mahidol University, Khon Kaen University and Chiang Mai University for all the supports. We are also very grateful to the Dara-ang speakers at Nor Lae village, Chiang Mai province, Thailand. 2 Ta’ang (De’ang in Chinese Pinyin script) is now used by the Palaung themselves in Myanmar and in China as a cover term for all groups.

© koninklijke brill nv, leiden, ��5 | doi ��.��63/9789004283572_023

1066

deepadung, rattanapitak and buakaw

Fennig, 2013), in Myanmar, there are 148,000 Shwe or Golden Palaung, 137,000 Rumai Palaung, and 258,000 Pale or Silver Palaung. Since there is no official census of the Myanmar government, there is no way to verify the numbers of the Palaung speakers in Myanmar. As mentioned above, the Palaung live in high mountain areas in Shan State. Tai (Shan) influences can be clearly seen everywhere, especially in Buddhism. Most of the Palaung males have to enter the monkhood as a novice when they are boys and as a monk when they are grown up. Those Palaung who enter a monkhood learn to read and write using Tai script since the Palaung language has no written script.3 In China, the Palaung used to be referred to by the name ‘Benglong’ which has a pejorative meaning. In 1985, this name has been replaced by the autonym ‘Ta-ang’ which is written in Pinyin script as ‘De’ang’ in Chinese.4 Most of the Palaung live in Santaishan area of Luxi county in the Dehong Dai-Jingpho Autonomous Prefecture, and in Junnong Township of Zhenkang County in Lincang Prefecture. The rest are in cities and counties of Lianghe, Longchuan, Yingjiang, Ruili, Wanding, Baoshan and Gengma. Xiu Dingben (2008) states that, according to the year 2000 census, there are 17,804 Palaung speakers in southeastern Yunnan. Here, Palaung people live among the Tai, Jingpho, Wa, and Han. The most recent ethnic group, which migrated from Myanmar to Thailand, is the Palaung. A first group of about 200 Palaung speakers came from Doi Lai (Loi Lai) in Kengtung area, Shan State, to Chiang Mai Province, where the first Palaung village, Nor Lae village, was formally founded in 1982 in Fang district. Some of the Palaung people moved further to other places in municipal Fang, Chiang Dao, and Mae Ai districts. At present, there are about 8 Palaung villages in Chiang Mai Province. The Palaung at Nor Lae village call themselves Dara’ang ren5 (‘Red Dara’ang’) or simply Dara’ang (Deepadung, 2009). In 2006 one more Palaung village was established in Mae Sai district of Chiang Rai 3 Interestingly, the Palaung living in the northern region of the Shan State and the Palaung living in Dehong of Yunnan Province are now working together to standardize the language. Together, they have created a Palaung writing system using Burmese script. (Mak, 2012: 4–7). In spite of its being used for some time, this writing system is still not widely known to the Palaung, but it is used by the Palaung Literature and Culture Central Committee. The writing system is also used in dictionaries, journals and karaoke videos. The committee is now trying to promote the one name Ta’ang as a name for all the Palaung groups, including Samlong, Ruching and Rumai. 4 However, in this sketch the term Palaung will be used throughout. 5 The word da is always used in front of all ethnonyms: ang means ‘cliff, cave, or high mountain tops’, which reminds the Palaung of the place where their ancestors lived as well as the places

dara ’ ang palaung

1067

Province by Palaung who did not first settle in Nor Lae village. In all, the number of Dara’ang Palaung in Thailand is about 4,500–5,600 (personal interview with village leaders in 2011), and in Nor Lae live 1,232 (personal communication with the village leader in 2013). The children from Nor Lae go to school in a nearby Khop Dong village, where they all learn central Thai. In addition, living in Chiang Mai, most of the Palaung can speak Kam Mueang or northern Thai to communicate with the locals. Some of the old generation also know Burmese and Shan, which they learned before moving to Thailand. 1.1 Cultural Sketch Traditionally, the Palaung lived in long bamboo houses with multiple, related or unrelated families under the same roof. The number of fireplaces tells the number of families living in the same house. Most of the modern houses are for only one family and mostly built with wood or concrete materials and with galvanized iron or tiled roofs. Generally, inside the house, there are two small separate rooms—one for sleeping, the other for storage, and a big entrancespace—a kind of the family living room. In this big entrance room, there is a central fireplace. The traditional dress of the Palaung women are red tube skirts and bright colorful blouses, with rattan or bamboo or silver-like waist hoops of red, black and silver colors. Today, many women wear their traditional costume only for important ceremonies. Palaung people are Theravada Buddhists and most villages have their own Buddhist monastery. Despite being devoted Buddhists, the Palaung also believe in animism. In the center of the village, there is a construction called sali raw (‘the heart of the village’),6 which the villagers use for special worship ceremonies once every 2–3 years or whenever a disaster threatens. In addition, there is also a spirit shrine for guardians of the village, a twin shrine at which the villagers make offerings to the spirits each year before the Buddhist lent day. They seek help from the spirits for the cultivation of the fields, or in the case of Nor Lae, for the cultivation of tea, strawberries and the various vegetables and roses they have planted for the Ang Khang Royal Project. The most festive and colorful celebration for the Palaung is the annual water festival in April. From the 12–15th of April, every family prepares a delicious meal to present to the elders of the village. The young visit seniors of their where Palaung villages are typically located; reɲ means ‘red’, which refers to the colour of the Dara’ang women’s tube skirts. 6 Sometimes it is called Ho Teu.

1068

deepadung, rattanapitak and buakaw

village to show their respect, and to pour water over their hands while offering good wishes. It is the traditional New Year’s Day, just like elsewhere in Southeast Asia. To welcome the coming of the New Year on the 13th of April they also sing and dance, and sometimes the traditional ding is played—a kind of plucked string instrument with three strings. During festivals, traditional Palaung costumes with colorful blouses and waist hoops are to be seen everywhere. For the DP in Nor Lae, the women’s waist hoops remind them of their ancestor Nang Loi Ngoen or Nang Doi Ngoen (‘Lady of the silver mountain’). She is an angel and can fly. According to myth, she was captured by a hunter and hoops were put around her waist7 to prevent her from flying away. More on Palaung dress can be found in Kanomi (1991), Howard and Wattanapun (2001), and Howard (2005); the most detailed studies of the Palaung in northern Shan State are still those of Milne (1910, 1924); Buddhism and the DP in Thailand can be found in Ashley (2011); and information about the DP at Nor Lae village is in Deepadung (2009), and Deepadung and Patpong (2010). 2 Phonetics/Phonology The DP variety presented here is the one spoken at Nor Lae village of Fang district in Chiang Mai Province, Thailand. Officially, there is no variety which is designated as a standard of the Palaung language, even though there seems to be a consensus among the Palaung themselves, which Milne (1924, reprinted 2004:3) summed up as follows: “They all seemed to think that the dialect of Namhsan, the capital of the former State of Taungpeng, was the best form of their speech.” The DP phonology in this sketch is based on Buakaw (2012). 2.1 Word Structure A word in DP is either monosyllabic or polysyllabic. Polysyllabic words consist of one main syllable preceded by a presyllable or of two main syllables. 2.1.1 Monosyllabic Word The basic syllable structure is CV—an initial consonant followed by a vowel. A monosyllabic word can be an open syllable or a closed syllable—CV(C). Consonant clusters of two consonants can occur in the syllable initial position. The 7 In northern Shan State and in Dehong the Palaung believe that their ancestors are a Naga princess and the Sun prince, so Palaung women have to wear waist hoops as a symbol of the Naga’s snake-like body.

1069

dara ’ ang palaung

first initial consonant in a cluster is always a labial or a velar stop consonant, i.e. /p, pʰ, b/ and /k, kʰ, ɡ/. The second consonants are restricted to sonorants, namely /r, l/ and /j/. The DP consonant initial clusters are shown in table 17.1: Table 17.1 DP consonant clusters

ppʰbkkʰɡ-

-r-

-l-

-j-

prpʰrbrkrkʰrɡr-

plpʰlblkl-

pj-

ɡl-

bj kʰj-

2.1.2 Polysyllabic Word Two main types of polysyllabic words are found in DP. Type one is a sesquisyllabic word and type two is a compound word. Type one is called sesquisyllabic because it consists of one word and a half syllable, that is to say it has a stressed major syllable which is preceded by a pre-syllable of an unstressed or phonological reduced minor syllable (Matisoff, 1973). The minor syllable can be either a presyllable of CV or a presyllabic nasal. It should be noted that in DP two presyllables (CVCV) can occur before the stressed major syllable. Type two of polysyllabic words is a compound word. Compounding in DP can occur between two monosyllabic words as in /ble/ + /j̊ouɁ/ (‘fruit’ + ‘ear’) ‘earring’, between a monosyllabic word and a sesquisyllabic word as in /ceɲ/ + /saŋɛ/ (‘foot’ + ‘sun’) ‘rainbow’, and between a sesquisyllabic word and a sesquisyllabic word as in /Ɂikăt/ + /Ɂibə̆ n/ (‘elder’ + ‘female’) ‘elder sister’. 2.2 Syllable Structure DP has two types of syllable structure—a major syllable and a minor syllable. 2.2.1 Major Syllable Major syllables or main syllables can be open syllables or closed syllables. The most basic syllable structure is CV. Table 17.2 is a syllable template of major syllables.

1070

deepadung, rattanapitak and buakaw

Table 17.2 DP major syllable structure

Open syllables

CV CV CCV CCV Closed syllables CVC CCVC CCVC

/dɔ/ /dua/ /pla/ /bria/ /bom/ /kʰrə̆ m/ /kʰrauh/

‘vegetable’ ‘give’ ‘scar’ ‘hail’ ‘rice (cooked rice)’ ‘rich’ ‘sprinkle (water)’

2.2.2 Minor Syllable The two types of minor syllables in DP are presyllables and presyllablic nasals. As for presyllables, only a limited set of consonants is found in initial position: p, b, d, c, k, kʰ, Ɂ, m, n, s, l. Vowels which are allowed in the presyllables are i, ε, ɨ, ə, a, ɔ, u. They are always pronounced short in presyllables. However, the common ones are the central vowels /ɨ, ə/ and /a/, and the vowel /a/ (which may be reduced to a central vowel schwa [ə]) has the highest frequency of occurrence. The presyllables /ka-, ma-/ and /sa-/ are the most common ones in DP. Also there is a series of syllabic nasals, homorganic to segments on their right edge. In some cases, these syllabic nasals function as a nominalizing prefix, as in bih ‘to sweep’ and m-bih ‘broom’, louɁ ‘to hack with a hoe’, and n-louɁ ‘hoe’. In others, a function is not apparent, as in n-teɲ ‘road, path’, and ŋ-him ‘nail’. 2.3 Phoneme Inventory 2.3.1 Initial Consonants The inventory of DP initial single consonants is presented in Table 17.3. The five places of articulation are labial, alveolar, palatal, velar, and glottal; and manners are stops, nasals, fricatives, liquids (trills, laterals) and glides. Table 17.3 DP initial consonant phonemes

Stops

Labial

Alveolar

Palatal

Velar

Glottal

p pʰ b

t tʰ d

c cʰ ɉ

k kʰ ɡ

Ɂ

1071

dara ’ ang palaung

Nasals Fricatives Liquids Glides

Labial

Alveolar

Palatal

Velar

m̥ m f v

n̥ n s

ɲ̊ ɲ

ŋ̊ ŋ

r̥, l ̥ r, l

Glottal

h

j̊ j

There are three series of stops: plain voiceless, voiceless aspirated, and voiced. All of the eight nasals occur as syllable initials. However, voicing is contrastive only in initial position, that is, only the voiced nasals can occur in final position. For fricatives, only the labio-dentals are contrastive in terms of voicing. Liquids occur only at the alveolar place of articulation and there are only two palatal glides in DP (there is no initial labial and/or velar glide, although there is a voiced fricative v). Both, the liquids and the glides are contrastive in terms of voicing. As mentioned earlier, only the liquids r and l, and the glide j occur as the second consonant in a cluster. It should be noted that according to Mitani’s Proto-Palaung reconstruction (1977: 194), in DP, apart from the devoicing of the original voiced stops, the initial fricatives and the initial sonorants (nasals, liquids, and glides) seem to be preserved.8 2.3.2 Final Consonants There are twelve final consonants in DP. In addition to the voiceless stop series: /-p, -t, -c, -k, -Ɂ/, there are the nasals /-m, -n, -ɲ, -ŋ/, the glottal fricative /-h/, and the glides /-w, -j/. The finals /-p, -t, -c, -k, -Ɂ/ are realized as unreleased stops. The nasals in the final position are realized as plain nasals [-m, -n, -ɲ], and denasalized nasals [-pm, -tn, -cɲ,] which are conditioned by the preceding vowels. That is, when preceded by close central vowel /ɨ/ and short mid central vowel /ə/, the /-m, -n, -ɲ/ are realized as stops with nasal release. However, the final nasal /-ŋ/ always surfaces as a denasalized nasal [-kŋ] (Buakaw 2012: 67–71; 135–140). 8 Ostapirat (2009:71) states that “The changes, especially the implosivization of early voiceless stops, seem to constitute a continuum of ongoing processes which could have occurred independently in each dialect and may be better taken as an areal or a parallel development.”

1072

deepadung, rattanapitak and buakaw

2.3.3 Vowels A significant characteristic is that there is no register distinction in DP, or apparently in any Palaung variety. DP has eleven phonemes vowels as illustrated in the following table: Table 17.4 DP vowel phonemes Front

High Mid Low

Central

i e ε

Back

Short

Long

ɨ ə̆ ă

ə a

u o ɔ

The vowel length is phonemically distinctive for the central vowels /ə̆, ə/ and /ă, a/. All of the vowels occur in both open and closed syllables, except for the short /ə̆ / and /ă/ that occur only in closed syllables. Consequently we can consider the short vowels are phonologically marked, and this is reflected in our use of the breve for these in transcription while leaving other vowels unmarked for length. The vowels: /ε, ɨ, a, u, ɔ/ are pronounced long in open major syllables, while /i, e, ə, o/ are diphthongized. However, it should be noted that the mid front vowel /e/, occurring in the plural personal pronouns remains [e], i.e. it is not diphthongized. Also, generally before finals /c, ɲ/, there is a high front off-glide after the vowels (e.g.: /reɲ/ ‘red’ [reːiɲ], /hɔc/ ‘already, finish’ [hɔic˺]). 2.3.4 Diphthongs In DP, there are six diphthongs as illustrated in the table below: Table 17.5 DP diphthong phonemes

ia ei

ai

au

ua ou

dara ’ ang palaung

1073

A restriction on the occurrence of diphthongs is that the diphthongs /ia/ and /ua/ are more widely distributed, while the rest /ei, au, ai/ and /ou/ occur only before a glottal stop /-Ɂ/ and a glottal fricative /-h/.9 3

Word Formation

3.1 Compounding To form a new word by compounding is quite general in DP. Some compounds have a meaning, which is the same or related to the meaning of each unit. Some may not have a related meaning to any of their parts or only to one of them. For example: hɨk-ɡiɲ ɡɔn-Ɂimaj Ɂikăt-Ɂibə̆n ble-j̊ouɁ ceɲ-saŋɛ naleih-saŋɛ nauh-dɨ-kʰəj

‘hair’ (lit. ‘feather + head’) ‘son’ (lit. ‘child + male’) ‘older sister’ (lit. ‘elder + female’) ‘earring’ (lit. ‘fruit + ear’) ‘rainbow’ (lit. ‘foot + sun’) ‘east’ (lit. ‘go out + sun’) ‘kind’ (lit. ‘chest + goal + well’)

3.2 Derivational Affixes One productive process of word formation is the derivational process of putting a presyllable together with another lexical word. Some of these presyllables have grammatical or semantic function. In addition, among those that have function, there are still some usages which have no specified function. 3.2.1 Reciprocal kaThe presyllable ka- can be a prefix of reciprocal as in: katuh ‘bump against’ katun ‘meet’ kamuan ‘talk together’ katʰăɁ ‘clap with hands’ katom ‘scoop with two hands’ kasɔŋ ‘joint’

Some have non-specified use as in: kafaŋ ‘fireplace’, kamaj ‘new’. 9 See Buakaw (2012: 65) for details.

1074

deepadung, rattanapitak and buakaw

3.2.2 Animal Prefix maThe animal prefix ma- normally occurs in front of a noun for animals. However, there are many nouns, which have a presyllable ma-, that are not animals, but material culture. It is not obvious that these forms are connected with the animal prefix. -animal maso ‘dog’ maʔian ‘chicken’

masim ‘bird’ mapʰian ‘bee’

maravaj ‘tiger’ mabləm ‘land leech’10

-material culture manăk ‘beam of roof’ mana ‘bamboo tray’ mapua ‘knife’ makəm ‘pillow’

3.2.3 Locative Prefix naLocative na- signifies a location. naɁac ‘before, in front of’ napăn ‘behind’ nakʰεŋ ‘inside, in the back’ narec ‘outside’

3.2.4 Ethnic Prefix daA presyllable da- functions as an ethnic prefix as in: dasiam ‘Tai people’ dakʰε ‘Chinese people’

daraɁaŋ ‘Palaung people’ dagula ‘foreigners’

3.2.5 Personal Prefix ɁiA presyllable Ɂi- functions as a person prefix as in: Ɂikăt ‘older brother or older sister’ Ɂimaj ‘male, husband’ Ɂiŋauh ‘killer’

Additionally the prefixed syllabic nasals m-, n-, ɲ-, ŋ- can nominalize verbs as mentioned in 2.2.2.

10

In other varieties of Palaung, such as those in China and in the northern Shan State, there is no main front of animal names. The prefix is also dropped in some contexts in Dara’ang, e.g. in vocatives.

1075

dara ’ ang palaung

3.3 Reduplication Reduplication is a repetition of a complete word. Such reduplicated forms usually add emphasis or intensity to the forms, as in: haw səj səj ‘go frequently’ kalaŋ lɔj lɔj ‘speak slowly’

4

Ɂew cʰi cʰi ‘smile very beautifully’ boŋ Ɂakja kɨn kɨn ‘make lots of merit’

Phrase and Clause Structure

Normally, phrases function as components of clauses. A phrase may contain one or more words. Minimal phrases consist of only a noun or a verb. Peripheral elements of a clause may include temporal adverbials and appellatives or vocatives. 4.1 Simple Sentences A simple sentence in DP consists of a verb phrase plus an optional noun phrase. The basic word order is SV/AVP/AVTG. Clause types can be categorized by the semantic characteristic of the main verb—whether a main verb requires any post verbal constituents. Transitive clauses (1)

Ɂo dɨ măh mɛ 1sg GOAL11 hit 2sg ‘I will hit you.’

(2) Ɂo hɛɁ hɔc hom bom 1sg nsit finish eat rice ‘I have already finished eating (rice).’ (3) Ɂo hom bom 1sg eat rice ‘I eat rice.’ (4) ɡɔndɨtε hεɁ kapraɲ bom baby nsit hunɡry rice ‘The baby is already hungry.’ 11

See section 5.2.5 for a description of the general GOAL marker and its different functions.

1076

deepadung, rattanapitak and buakaw

Intransitive clauses (5) Ɂăn hɛɁ ɁiɁ 3sg nsit sleep ‘He slept already.’12 (6) masim kɔj dɨ he bird stay GOAL tree ‘A bird is on a tree.’ (7) ma hɔc taɁɨm mother finish bathe ‘The mother finished taking a bath.’ (8) ɡlaj dɨ coh rain GOAL fall ‘Rain will fall.’

Ditransitive clauses (9) Ɂo taih kʰamun dɨ Ɂalɔt 1sg give sweet GOAL pn ‘I gave Alot some sweets.’

Stative clauses (10) sato boŋ ni ŋauh shirt clf this expensive ‘This shirt is expensive.’ (11) man hεɁ kʰa field nsit burn ‘The field was already burned down.’

12

As there is no gender distinction in DP pronouns, the gender in the translation is always given according to the context of the example (which is not always specifically given in the text). If no context is available, the male form is used by default.

dara ’ ang palaung

1077

Subjectless sentences Subjects are often omitted in appropriate discourse contexts, that is, when the subject is known or retrievable as in (12). Fronting of objects, especially in narratives, occurs quite regularly, as seen in example (13). In this case only the context determines the function of the preverbal NP as subject or object. (12) saŋaih Ɂo gun13 delicious 1sg father ‘(It’s) delicious, my father.’

Fronted object (13) sanaɁ mɨ tu mə̆h gun TCL neg have ‘Gun, (I) don’t have (one).’

4.1.1 Interrogative Sentences Generally, content question words in DP occur in situ. (14) Ɂase tăh who come ‘Who comes?’ (15) mε və̆h malouh dop mase 2sg open door with what ‘With what did you open the door?’

There are two ways of forming a polar question in DP: by using a rising intonation (16) and by adding the interrogative particle hεɁ at the end of a statement (17). (16) kaɲ̊ a mε dɨ tăh pɔlεɁ tomorrow 2sg GOAL come again ‘Tomorrow, will you come again?’ (17) mε dɨ loŋ dop kʰu hεɁ 2sg GOAL descend with teacher q ‘Will you go down (from Angkhang mountain) with me?’ 13

The basic order of noun phrases is HEAD-ATTR-NUM-DEM/DET (see 4.4.2).

1078

deepadung, rattanapitak and buakaw

4.1.2 Negative Sentences In DP, tu ‘not’, Ɂăw ‘not’, and ɲ̊ am ‘not yet’ occur as preverbal negators. The two general negation markers, namely tu, and Ɂăw, can be used interchangeably and sometimes occur in the same sentence, as in example (23), where the negation of both verbs has a combined reinforced positive value (‘cannot not V’ means “must V by all means’).14 However, the frequency of Ɂăw in narrative texts seems to be higher than tu. The third negator, ɲ̊ am, has perfectivenegative function and can be translated as ‘not yet’. This negator is also found in other Palaung varieties, and a functionally equivalent marker occurs in Shan (pɐ̀j ‘not yet’), but is not widely found in other languages of the area or family. (18) sanaɁ mɨ tu mə̆h, Ɂăw gun TCL neg have neg ‘Gun (I) do not have, (I) do not have gun.’

mə̆h sanaʔ have gun

(19) sira tu dəm diaŋ pauh pn neg exper drink alcohol ‘Sira never drinks alcohol.’ (20) Ɂatik Ɂăw n̥ ɛn ɡuɲ pn neg alike father ‘Atik does not look like (his) father.’ (21) kaŋ ɲ̊ am hɔc raɲ house not.yet finish do ‘The house has not finished (being built) yet.’

4.1.3 Imperative Sentences A bare verb can be used to form an imperative, as in examples (22) and (23). To form a request, a down toner, such as Ɂicak ‘a little bit’ is put at the end of an imperative (24). Otherwise, a verb m̥ aɲ ‘ask for’ precedes the main verb (25). (22) ɁiɁ tauh dɨ ɁiɁ sleep say GOAL sleep ‘Sleep, (I) say sleep!’ (23) ma-vɨŋ tauh Ɂăw pən tu haw addr-pn say neg get neg go ‘Mother Wung says “(you) cannot not go!” ’ 14

In any particular expression one term may be preferable to the other.

1079

dara ’ ang palaung (24) tauh taŋ Ɂicak say big little ‘Speak a little bit louder.’ (25) m̥ aɲ dɨ haw dop mɛ ask.for GOAL go with 2sg ‘May I go with you?’

The prohibitive particle kɔj ‘do not, don’t’ occurs in a negative imperative as in (26). (26) kɔj grouh kaprɔ ma mɛ proh speak soul mother 2sg ‘Don’t speak, your mother soul comes here.’

tăh come

ti prox

4.2 Complex Sentences Clauses can be joined in various ways, with or without explicit conjunctions. Typically, in DP clauses are juxtaposed with no connectors. In coordination, the two clauses that are joined have independent forms. A structural subordinate clause usually cannot stand by itself, it needs an independent nuclear or main clause. 4.2.1

Clause Linkage

Contrastive In contrastive coordination, the additive marker lε ‘also, even though, but still’ is used to introduce an independent clause (27). This marker is an apparent loan from Burmese lɛ́ ‘also, even, though’, which occurs between the connected clauses. However, in narratives lε, is usually left out, as in (28), where the indigenous topic-comment linker mɨ has a similar function. (27) ma tauh dɨ mana bə hom dop ɡuɲ Ɂăn mother say GOAL pn wait eat with father 3sg lε mana tu bǝ add pn neg wait ‘Mother told Mana to wait and eat with her father, but Mana did not wait.’ (28) Ɂăn dɨ baɲ ge mɨ Ɂăn tu jaɁ baɲ 3sg GOAL shoot 3pl TCL 3sg neg dare shoot ‘He intended to shoot them, (but) he then did not dare to.’

1080

deepadung, rattanapitak and buakaw

4.2.2 Relative Clauses A relative clause in DP does not contain a relativizer, but is put right after to the head noun it modifies. (29) Ɂo nap Ɂikăt tuc dɨ dɔj 1sg know elder sit GOAL dist ‘I know the elder (who) sits there.’ (30) Ɂo jə Ɂikăt Ɂimaj haw tăh 1sg see elder male go come ‘I see the elder brother (who) is walking toward us.’ (31) da ta sato reɲ dɔj, Ɂăn muh ɡɨk Ɂo person wear shirt red dist 3sg be friend 1sg ‘The person (who is) in the red shirt there, he is my friend.’

4.2.3 Complement Clauses Some verbs of communication require an embedded clause as a complement clause. (32) ma tauh dɨ mana bə hom dop ɡuɲ Ɂăn mother say GOAL pn wait eat with father 3sg ‘Mother told Mana to wait and eat with her father.’ (33) daraɁaŋ ɲɨm tauh boŋ Ɂakja kɨn~kɨn pn believe say make merit much~red dɨ pən dɨ houɁ məŋdεŋ GOAL get GOAL ascend heaven ‘Dara’ang people believe (that) making lots of merit will make (them) able to go to heaven.’

A direct quotative clause can be put right after the matrix main clause (34). Frequently, the end of a short embedded quotation is marked by repeating the subject (object) and verb (35). (34) ma-vɨŋ cə ɡuɲ-vɨŋ haw kouh pɔŋ addr-pn order addr-pn go cut bamboo.shoot ‘Mother Wung ordered Father Wung: “Go to cut bamboo shoots”.’

1081

dara ’ ang palaung (35) tu jǝ Ɂăn tauh tu jǝ paj neg see 3sg say neg see pn ‘ “(I) don’t see (her), she said (I) don’t see Pai” ’.

4.2.4

Adverbial Clauses

Conditional Conditional clauses are optionally introduced by se ‘if’, as in (36) and (37), or may be medially marked by mɨ ‘topic-comment linker/TCL’, as in (38). (36) se mε tu mə̆h ɡɔn mε dɨ haw man if 2sg neg have child 2sg GOAL go field ‘If you do not have a child, you will go to the garden.’ (37) se Ɂo mə̆h rən Ɂo dɨ raɲ kaŋ kamaj if 1sg have money 1sg GOAL do house new ‘If I have money, I will build a new house.’ (38) mε si haw mɨ 2sg want go TCL ‘(If) you want to go, then go.’

haw go

Causal Causal clauses can be introduced by kop ‘because’, apparently a loan from Shan kɔp ‘because’, as seen in (39) and (40), but frequently they are strung together without any conjunction (41). (39) mana tu houɁ hoŋhen kop kʰu tu kɔj pn neg ascend school because teacher neg stay ‘Mana does not go to school because the teacher is not in.’ (40) guɲ-vɨŋ lɨp kʰε prε (kop) Ɂăn dɨ haw ɲauh maɡreih ADDR-PN enter in forest (because) 3sg GOAL go kill bear ‘Father Wung entered the forest because he intended to go to kill the bear.’ (41) kaŋ ɲ̊ am hɔc raɲ ɲ̊ am mə̆h rən cə̆h kahac cə̆h pun house not.yet finish do not.yet have money buy sand buy cement ‘The house is not finished yet, (I) do not have money to buy sand (and) to buy cement, yet.’

1082

deepadung, rattanapitak and buakaw

Concessive For concessive constructions, clauses may be strung together with no formal conjunction, or the topic-comment linker (see section 4.3.4) is used to mark the connection. The interpretation of the relationship between the clauses in both constructions depends on the context. (42) Ɂăn raɲ ɡan kasɔ ʔăn pən rən ʔicak 3sg do work hard 3sg get money little ‘(Though) he works hard, he gets small amount of money.’ (43) Ɂăn pi bɔ mŭh duɁi sikʰja mɨ mɛ tu Ɂɨŋ Ɂăn 3sg imm prob be person good TCL 2sg neg love 3sg ‘(Although) he is a good person, you still do not love him.’

Resultative Resultative clauses are preceeded by a main clause and can be marked by kʰo ‘so, so that’, as seen in (44). However, kʰo is frequently left out, when the meaning of the whole construction is understandable from the context, as in (45). (44) guɲ-vɨŋ jo maravaj kʰo guɲ-vɨŋ bɔ houɁ lɨp kʰɛŋ ADDR-PN fear tiger so ADDR-PN prob ascend enter in dɛŋ kabəj on storage.place ‘Father Wung was afraid of the tiger, so Father Wung went up and entered the storage place under the roof.’ (45) ma ɡe tu dɨkʰəj mother 3pl neg feel.good ɡe lo jaiɁ ma ɡe r̥ec mɔ 3pl must bring mother 3pl seek doctor ‘Their mother is ill, so they have to take their mother to see a doctor.’

Temporal In DP, temporal clauses, which are considered periphery, are preceded by dɔ ‘when’.

dara ’ ang palaung

1083

(46) dɔ guɲ-vɨŋ jə hăm maravaj Ɂăn jo kɔɁ cəjə̆n when ADDR-PN hear sound tiger 3sg fear till shake ‘When Father Wung heard the roar of the tiger, he got so scared that he trembled.’ (47) dɔ năktʰoŋtʰiaw tăh ɉăm lɔj Ɂaŋkʰaŋ when tourist come arrive mountain pn ɡe dɨ houɁ lɛ bɔ nɔlε 3pl GOAL ascend visit prob pn ‘When tourists arrive at Angkhang mountain, they will go up to visit Nor Lae.’

4.3 Syntax and Pragmatics 4.3.1 Topical Construction The demonstrative tə̆n ‘that’ functions as a topic marker for subjects (48) and objects (49). (48) ma-paj tǝ̆n bɔ hom bom, Ɂăn tǝ̆n hom jam ADDR-PN dem part eat rice 3sg that eat cry ‘It’s Mother Pai who ate (rice), it’s she who ate while crying.’ (49) Ɂăn mɨ kouh kouh pɔŋ tə̆n 3sg TCL cut cut bamboo.shoot dem ‘It’s the bamboo shoots, which she then cut.’

Topical objects may also be fronted without being marked by the demonstrative. (50) guɲ-vɨŋ daiɁ mɨ Ɂăw hu jə ADDR-PN call TCL neg dcon hear ‘(It’s) Father Wung whom (I) called, then (he) didn’t hear any longer.’ (51) maɡreih haj Ɂăn hεɁ ŋauh bear ferocious 3sg nsit kill ‘The ferocious bear, he already killed.’

4.3.2 Resumptive Pronoun Construction Two elements, especially, proper names, NPs, and the third singular personal pronoun Ɂăn ‘he, she, it’, which have the same referent, can be put side by side with no connectors. The 3rd person personal pronoun Ɂăn is put next to the proper name vɨŋ in (52) and next to the noun jăŋ ‘flesh’ in (53).

1084

deepadung, rattanapitak and buakaw

(52) vɨŋ Ɂăn vi veɲ kʰε kaŋ pn 3sg rep return in house ‘Wung, he, returned (and went) into the house.’ (53) jăŋ Ɂăn tə̆n hi hauʔ flesh 3sg dem def stuck ‘The very flesh got stuck . . .’

This construction may be termed ‘resumptive subject pronoun construction’. The phenomenon is widespread in SEA languages and always involves a 3rd person pronoun which is used pleonastically with a full NP subject. The subject may be seen as topic in a pre-clause position and therefore not strictly speaking part of the clause. 4.3.3 Appositional Construction In DP, besides the 3rd person resumptive pronoun Ɂăn, other personal pronouns, proper names and NPs may occur next to each other in appositional constructions. All of these elements refer to the same referent which is in focus. (54) ma Ɂăn ma-vɨŋ tə̆n mɨ jə ɡrouh ɡɔn ʔăn tə̆n mother 3sg addr-pn dem TCL hear voice child 3sg that ‘Her mother, Mother Wung who then heard the voice of (the one who is) her child.’ (55) mɛ tauh ʔo ʔiŋauh magreih ni 2sg say 1sg killer bear prox ‘You said “I, the killer of this bear.” ’

4.3.4 Topic-Comment Linker So far, one topic-comment linker (TCL) is found in this study, i.e. mɨ ‘also, then’. It occurs very frequently in narratives. The TCL mɨ occurs between nouns or pronouns and verbs, or between clauses. It seems to function as a discourserelated marker which indicates the noun or pronoun as a topic in a topiccomment construction, or marks the clause (or other constituent) to its left as topic and the clause (or other constituent) to its right as comment. As clausal TCL, mɨ may occur in clause initial position or after the subject of the second clause. (56) Ɂăn hom bom Ɂăn mɨ hɔc hom bom Ɂăn 3sg eat rice 3sg TCL finish eat rice 3sg ‘She ate rice, then she finished eating her rice.’

dara ’ ang palaung

1085

(57) Ɂăn mɨ pi kakuh salaj hap Ɂăn 3sg TCL imm spill bowl curry 3sg ‘She then suddenly spilled her curry bowl.’ (58) prε mɨ hεɁ Ɂap forest TCL nsit dark ‘The forest was also getting dark.’ (59) Ɂo mɨ tu hap 1sg TCL neg eat.curry ‘I also don’t eat (the curry).’

4.4 Noun Phrases A noun phrase in DP usually contains a noun, a pronoun or a proper name. Additionally there are adjectival verbs (stative verbs), possessives, numerals, classifiers and demonstratives. The possessives are not a part of speech of themselves, but personal pronouns (62), proper names (63) or kinship terms are. 4.4.1 Basic Noun Phrase DP noun phrase structure has a single noun, a personal pronoun, or a proper name as its head. kaŋ ‘house’ malec ‘pig’ Ɂăn ‘he, she, it’ ma-suk ‘Mother Suk’

4.4.2 Complex Noun Phrases The modifying elements usually follow the head noun. The basic order can be summarized as follows: HEAD-ATTR-NUM-CLF-DEM/DET. (60) sato Ɂoj boŋ ni shirt three clf prox ‘These three shirts’ (61) ɡlaŋ reɲ tube.skirt red ‘A red tube skirt’

1086

deepadung, rattanapitak and buakaw

(62) sato Ɂo shirt 1sg ‘My shirt’ (63) ma Ɂatik mother pn ‘Atik’s mother’

4.5 Verb Phrases Verb Phrases (VPs) in DP consist of one main verb (64b) or a main verb with a preverb or a preverbal particle (64a, 69). A verbal predicate may consist of a simple verb or several verbs in series (67–69). In some contexts, such as answering a question, the verb may be left understood, as in (65b). Especially stative verbs may also be used to modify other verbs, functioning as adverbs, as in (66). (64) a: kaɲ̊ aʔ mε dɨ tăh pɔlεɁ tomorrow 2sg GOAL come again ‘Will you come again tomorrow?’ b: tăh come ‘(I) will.’ (65) a: Ɂase tăh who come ‘Who came?’ b: Ɂatik pn ‘Atik’ (66) tauh taŋ speak big ‘Speak loudly.’ (67) Ɂo si dauɁ 1sg want weave ‘I would like to weave.’

dara ’ ang palaung

1087

(68) də̆h dε Ɂun kʰrɨm ntεŋ take hand put under stairs ‘He took the hands to put them under the stairs.’ (69) Ɂo dɨ măh mε 1sg GOAL hit 2sg ‘I will hit you.’

4.5.1 Serial Verbs or Multi-Verb Constructions Two or more verbs or verb phrases can be put together without a marker, i.e. no markers of coordination or subordination, to form a serial verb constructions (SVC). This SVC functions as a single predicate describing only one event. An SVC in which each component stands next to each other is a contiguous SVC (70–74), (77) and (78) while those that allow an intervening constituent is a non-contiguous SVC (75), (76), (79) and (80).15 Typically, one construction of a serial verb composes of a sequence of two or more verbs or it may be of a sequence of constructions each consisting of a verb and its own direct object NP. Semantically, in DP, the relation between each constituent in an SVC may be: purpose, simultaneous action, sequential action, direction, or instrument, among others. It is not always clear whether a multi-verb clause describes a single event or a sequence of events. The following examples illustrate clauses containing strings of verbs with different functions. Purposive (70) gɔn mən haw kɔj ma mɛ child crawl go stay mother 2sg ‘The child crawled to stay with your mother.’ (71) Ɂaj leih kouh pɔŋ Ɂatə 2du.incl go.out cut bamboo.shoot today ‘We go out to cut bamboo shoots today.’ (72) ma-vɨŋ mɨ tăh dăŋ addr-pn TCL come cook ‘Mother Wung then came to cook (rice).’ 15

It is not clear at this stage of research, which functions are expressed by contiguous SVCs or non-contiguous SVCs.

1088

deepadung, rattanapitak and buakaw

(73) ɡɨk tăh lε dɨ Ɂo friend come go.about GOAL 1sg ‘My friend comes to visit me.’

Simultaneous action (74) dɛc hăp kʰɛ moc magreih stuff put.into in mouth bear ‘(She) stuffed (the bamboo shoots) into the bear’s mouth.’ (75) maso mε kʰlɨp mana houɁ he tə̆n dog 2sg carry.on.head bamboo.tray ascend tree dem ‘Dog, you carry the bamboo tray up that very tree.’ (76) də̆h dɛ Ɂun kʰrɨm ntɛŋ take hand put under stairs ‘(He) took ((Pai’s) hands to put (it) under the stairs.’

Sequential action (77) houɁ dɨ he houɁ kɔj kadaiɁ hedɨruŋ ascend GOAL tree ascend stay tree.top bodhi.tree ‘(He) went up the tree, went up and stayed on top of the bodhi tree.’

Directional (78) Ɂăn mɨ də̆h veɲ jăŋ magreih Ɂăn tə̆n 3sg TCL take return flesh bear 3sg that ‘She then took that bear flesh back home.’ (79) ge jaiɁ ma ge r̥ec mɔ 3pl bring mother 3pl visit doctor ‘They brought their mother to see a doctor.’

Instrument (80) Ɂăn mɨ də̆h mapuaɁ suaɁ mɛŋ ɡɔn Ɂăn 3sg TCL take knife stab neck child 3sg ‘He then took the knife to stab his daughter’s neck.’

1089

dara ’ ang palaung

5

Word Classes

5.1 Nouns Nouns are those elements, which can be the head of a noun phrase. A noun syntactically may be identified by its position before its modifiers, such as number, possessive, or demonstrative. It can stand alone in a basic noun phrase as a referring expression. 5.1.1 Common Nouns Common nouns in DP may be a simple word, such as he ‘tree’ pʰiaŋ ‘wing’ kaŋ ‘house’

dɔ ‘vegetable’ ŋaj ‘eye’ rɔ ‘village’

bom ‘rice’ moc ‘mouth’ kʰrɨŋ ‘clothes’

ble ‘fruit’ hɨk ‘feather’ glaŋ ‘tube skirt’

A noun may be a complex noun consisting of two words, such as ble moŋ ‘mango’, ble gluan ‘pumpkin’. In these cases, the first noun indicates the class, the second specifies the meaning. 5.1.2

Pronouns and Question Words

Personal pronouns Personal pronouns in DP are divided by person (first, second and third person) and by number (singular, dual and plural). The first person dual and plural forms distinguish between inclusive and exclusive pronouns. Table 17.6 DP Personal pronouns Number Person

1 2 3

Singular

Ɂo mε ʔăn

Dual

Plural

Inclusive

Exclusive

Ɂaj

jaj

baj ɡaj

Inclusive Exclusive

Ɂe

be ɡe

je

1090

deepadung, rattanapitak and buakaw

Question words Words used to form questions in DP may be divided into two groups, according to the interrogative element they are built with: the se ‘what’ group and the m̥ ɔ ‘which’ group. One question word, namely n̥ aŋ ‘how many’, belongs to neither of the two groups. Ɂa.se ‘who’ ma.se ‘what’ m̥ ɔ ‘which’ dɨ.m̥ ɔ ‘where’ dɔ.m̥ ɔ ‘when’ paj.m̥ ɔ ‘how’ kʰε.m̥ ɔ ‘how much’ n̥ aŋ ‘how many’

5.1.3

Measure Words and Quantity Words

Numbers Numbers in DP are of a typical decimal system, as seen in table 17.7: Table 17.7 DP numbers

Ɂu, l ̥eih, buŋ Ɂa Ɂoj mpʰuan mpʰan ndɔ mbu nda ndim gə

16

one two three four five six seven eight nine ten

ɡə dɨ buŋ ɡə dɨ pʰuan Ɂa ɡə Ɂa ɡə dɨ buŋ ʔoj gə ʔoj gə dɨ buŋ pʰan ɡə (Ɂu) majauh (Ɂu) majauh dɨ gə Ɂu r̥ εŋ

eleven16 fourteen twenty twenty one thirty thirty one fifty one hundred one hundred and ten one thousand

For number 2 and 3 in the unit positions, such as 12, 13, 22, 23, ha is added before the words for two or three. For example: gə dɨ ha ʔa ‘12’, ʔa gə dɨ ha ʔoj ‘23’. According to native speakers, ha, the meaning of which they could not identify, cannot be omitted.

dara ’ ang palaung

1091

There are a few special remarks to be noted: In counting or in isolation l ̥eih is used for ‘one’. 1 in the units position from 11-91, buŋ is usd in place of l ̥eih 1 in one hundred (100) or one thousand (1,000), Ɂu is used in place of l ̥eih. 4 to 9 in the units position of 14 to 99, the presyllable N- is deleted. 4 to 9 in the tens position of 40 to 90, the presyllable N- is deleted.

Classifiers Classifiers in DP are items, which are used to indicate the quantity of units, when a number is cited. They also follow the interrogative quantifier n̥ aŋ ‘how many’. These regular classifiers are a closed set of words. The classifier vi occurs with persons, do with animals, daŋ with trees, kutu with houses; boŋ occurs with eggs and fruits, and it is also a generic classifier. The word order in numeral expressions is N-NUM-CLF. Time expressions, such as saŋε ‘day’, maɡian ‘month’ and sanə̆m ‘year’ are common nouns that can be used as a classifier following a numeral. There is also an open class of nouns, that are used as classifiers, such as salaj ‘bowl’, puŋ ‘water container’, van mu ‘glass’. Traditional standard measures include a closed set of words: dăp ‘arm’s length’ kanih ‘cubit’ kasan ‘palm span’

5.1.4 Personal Names Personal names in DP are typically one-syllable names. For elders, the names are preceded by the address terms ma ‘mother’ as in ma-suk ‘Mother Suk’, ɡuɲ ‘father’ as in ɡuɲ-vɨŋ ‘Father Wung’. A boy’s name usually has a presyllable Ɂaas in Ɂalɔt, Ɂatik; Ɂi- precedes a girl’s name, for example: Ɂipaj, Ɂinon. However, nowadays most of the DP children in Nor Lae tend to have a name from Pali or Sanskrit, like the Thai. Traditionally, there is no family name for the Palaung, but for DP in Thailand, following the Thai people naming system they have both a first name and a family name.17

17

See Deepadung & Patpong (2010).

1092

deepadung, rattanapitak and buakaw

5.2 Verbs The verb class is composed of all elements that can stand alone in the main verb phrase. Based on their distribution, i.e. whether they take nouns as complements, the major sub-classes of verbs are intransitive, transitive and ditransitive. Both, intransitive and transitive verbs each have a sub-class, namely stative verbs and quotative verbs, respectively. 5.2.1 Intransitive Verbs An intransitive verb takes no object complement, such as tuc ‘sit’, ɁiɁ ‘sleep’, ɡɨp ‘bow (in reverence)’, katuan ‘crow, coo’, jam ‘weep’, taʔɨm ‘bathe’.18 Stative verbs A stative verb or an adjectival verb is a word of quality, for example: reɲ ‘red’, taŋ ‘big’, toŋ ‘long’, sɔ ‘sick’, pʰlan ‘poor’, ɡăt ‘cold’. These words may be quantified or emphasized by elements such as tʰuan ‘too much’. 5.2.2 Transitive Verbs A transitive verb usually has an object. Objects can be left understood if they are retrievable from the contexts. Samples are: hom ‘eat’, măh ‘hit’, ŋauh ‘kill’, ɡaɁ ‘shave’, baj ‘wash (clothes, vegetable)’, kʰoc ‘wash (dishes)’. Quotative Verbs Quotative verbs take quoted speech or nouns as complements. Included in this sub-class is a small group of quoting verbs, such as tauh ‘say’, kalaŋ ‘speak’, dɨm̥ ɔɁ ‘ask’, dăm ‘order’. 5.2.3 Ditransitive Verbs Ditransitive verbs take two objects, the direct object precedes the indirect object which is usually marked by a particle dɨ, that is having dɨ in the construction sounds better than not having one. The most frequently used ditransitive verb in DP is taih ‘give’. This verb is also used to introduce a beneficiary or recipient to an otherwise transitive construction. The constituent order in this case is AVT taih G, that is, the recipient or beneficiary is added after the transitive AVP. (81) sɛŋ də̆h paiɁ taih dɨ pn take book give GOAL ‘Saeng gave his father a book.’ 18

guɲ father

Ɂăn 3sg

For the transitive ‘to bathe a child’ the construction is taʔɨm taih (dɨ) gɔn (‘bathe give (GOAL) child’).

dara ’ ang palaung

1093

(82) ʔo jə̆h kʰamun taih dɨ va 1sg buy snack give GOAL younger.sibling ‘I bought some snack for (my) younger sibling.’

5.2.4 Preverbs In general, preverbs occur before a main verb. These preverbs indicate the state of existence or completion of the action. The most frequently used preverb is hɔc which signifies the definite realized perfective aspect of the main verb and also functions as a main verb ‘finish’ in other contexts. As a preverb: (83) Ɂăn hɔc jam 3sg finish die ‘He died already.’

As a main verb: (84) hεɁ hɔc nsit finish ‘It’s done.’

Another frequent preverb in DP is pən ‘to get’, which indicates capability or possibility to do something. As a main verb it retains its lexical semantics. As a preverb: (85) Ɂăw pən dɨ kauɁ Ɂăn neg get GOAL bite 3sg ‘(The bear) was not able to bite him.’

As a main verb: (86) ʔăn pən rən ʔicăk 3sg get money a.little ‘He gets a small amount of money.’

5.2.5 Preverbal Particles Two types of particles are described in this DP grammar, i.e., the dɨ particle and the other preverbal particles, which cover a wide range of functions.

1094

deepadung, rattanapitak and buakaw

dɨ particle The most frequently used particle is the polyfunctional particle dɨ. It can function as a preposition having a directional, locative, or dative (addressee, recipient, beneficiary) meaning, as seen in examples (87–89). (87) guɲ-vɨŋ hɛʔ vi kablɛp veɲ addr-pn nsit rep run return dɨ kaŋ kɔj dɨ kaŋ GOAL house stay GOAL house ‘Father Wung had run back home (and) stayed at home.’ (88) kalaŋ dɨ cʰawban speak GOAL villagers ‘(She) spoke to the villagers.’ (89) magreih mɨ ʔaŋ moc dɨ ʔăn bear TCL open.mouth mouth GOAL 3sg ‘The bear then opened its mouth toward him.’

As a preverbal particle, dɨ occurs before a main verb to indicate a pending action or an unrealized action, which can be subsumed as goal (of the subject’s intention). It is generally translated as ‘will’ in English, as seen in (90). (90) ʔo dɨ hap jaŋ magreih 1sg GOAL eat flesh bear ‘I’ll eat the bear flesh curry.’

The particle dɨ can also join two verbs together to signify purpose or goal of an action, as in (91). (91) ʔo tu muh dɨ pra 1sg neg be GOAL steal ‘I did not steal in order to sell (it).’

Other particles Other preverbal particles include: hɛʔ bɔ

‘new or realized situation (nsit)’ ‘probable action (prob)’

dɨ GOAL

jɔŋ sell

1095

dara ’ ang palaung pi vi ha hu hi

‘immediate action (imm)’ ‘contrastive action, repetitive (rep)’ ‘in action, in progress (prog)’ ‘end of action or discontinued action (dcon)’ ‘definite action (def)’

These preverbal particles can co-occur, but more data and detailed study need to be conducted in order to determine their exact functions and interplay. (92) ʔăn magreih tə̆n hɛʔ 3sg bear that nsit ‘The one bear died.’

jăm die

(93) vɨŋ tə̆n bɔ veɲ pn that prob return ‘It’s Wung who is going to return.’ (94) dε paj kɔj kʰrɨm ntεŋ Ɂu do pi tauh hand pn stay under stairs one clf imm say ‘Pai’s hands are under the stairs, one (chicken) immediately says.’ (95) ɡuɲ-vɨŋ vi leih də̆h tăh jaŋ ADDR-PN rep go.out take come meat ‘Father Wung again went out and took the meat back home.’ (96) Ɂatə nəj mε ha douɁ man today this 2sg prog burn field ‘This day, you are, on and on, burning the field.’ (97) Ɂăw hu jə vɨŋ ɡaj ɡuɲ Ɂăn neg dcon see pn and father 3sg ‘(I) no longer see Wung and his father.’ (98) Ɂo hi pən dɨ ŋauh mɨ lo kajoŋ veɲ 1sg defint get GOAL kill TCL have.to carry return ‘I definitely can kill the bear, then (we) have to carry it back.’

5.3 Locational and Directional Words Included in this section are locational words and directional constructions, which are formed with prepositions plus NPs.

1096

deepadung, rattanapitak and buakaw

The locational prepositions express the location of the event, whereas the directional prepositions relate to the directional verbs. Some of the prominent prepositions are the followings: dɨ kʰrɨm baŋ dɛŋ kʰɛŋ

‘at, to, toward’ (see 5.2.5) ‘under’ ‘beside’ ‘on’ ‘in, into’

(99) dɛ paj kɔj kʰrɨm ntɛŋ hand pn stay under stairs ‘Pai’s hands are under the stairs.’ (100) plɔ paj bɔ kɔj foot pn prob stay ‘Pai’s feet are beside the door.’

baŋ malouh beside door

(101) giɲ ʔipaj kɔj dɛŋ kabəj head pn stay on sieve ‘Pai’s head is on the sieve.’ (102) dɛc kʰɛŋ moc magreih stuff in mouth bear ‘Stuff (these bamboo shoots) in the bear’s mouth.’

5.4 Conjunctions Besides the clause conjunctions presented in section 4.2, there are two phrase connectors in DP. The first one is ɡaj ‘and, with’, which is homonymous with the 3rd person dual pronoun. The second one is the general directive marker dɨ, used in numbers, such as, ʔa gə dɨ buŋ ‘twenty-one’, ʔoj gə dɨ ha ʔa ‘thirtytwo’ (see 5.1.3). (103) guɲ-vɨŋ gaj ma-vɨŋ, gaj mə̆h gɔn ʔu vi addr-pn and addr-pn 3du have child one cl ‘Father Wung and Mother Wung, they (two) have one child.’ (104) ʔăn daʔ maso gaj mafa 3sg call dog and monkey ‘He called the dog and the monkey.’

1097

dara ’ ang palaung

5.5 Temporals Temporals may occur either before or after the main clause. They may be temporal prepositions, such as: naɁac ‘before’, napan ‘after’, dɔ ‘when’, or temporal nouns or noun phrases: dɔn saŋε ‘everyday’ Ɂatə ‘today’ kaɲ̊ a ‘tomorrow’ Ɂam̥ ə ‘yesterday’ tuɲ̊ a ‘day after tomorrow’ sivăh ‘morning’ katε saŋε ‘noon’ baraɁap ‘evening’ kʰεm̥ ə ‘night’

5.6 Demonstratives In DP, demonstratives usually appear in the final position of a noun phrase (105). They are ni ‘this’ and dɔj ‘that’. As a demonstrative pronoun, they can also occur alone as a bare noun phrase (106). Frequently, in narratives ni ‘this’ becomes ti (106) and (107). The emphatic demonstrative tə̆n usually occurs post-nominally (4.3.1) and also marks topics. (105) mɛ ŋauh magreih ni 2sg kill bear prox ‘You killed this bear.’ (106) kɔj grouh kaprɔ ma mɛ proh speak soul mother 2sg ‘Don’t speak, your mother soul came here.’

tăh come

ti prox

(107) leih vi də̆h tăh jăŋ ti go.out rep take come flesh prox ‘Go out and taking back the flesh.’

5.7 Appellatives Appellatives usually occur before a clause. In the following example, so ‘dog’ and fa ‘monkey’ are used as appellatives. The prefix ma- of animals is dropped in this context. (108) so də̆h mɛ ɡɔɁ mana dog take 2sg carry.on.shoulder bamboo.tray ‘Dog, you carry the bamboo tray.’

1098

deepadung, rattanapitak and buakaw

(109) fa də̆h mε ɡɔɁ ŋkre monkey take 2sg carry.on.shoulder pestle ‘Monkey, you carry the mortar.’

5.8 Loanwords Loanwords in DP come from Shan and Burmese. It is noticeable that loans from Tai (Shan) often show older stages of the language, as can be seen in the example l ̥an ‘grandchild’, which in modern Shan is lǎn ‘grandchild, nephew, niece’, but the tone indicates that the form goes back to an earlier*l ̥ān. Some examples of loanwords are as follows: cɨ ‘name’ (Tai) hoŋja ‘hospital’ (Tai) ɡlɔj ‘banana’ (Tai) heŋ ‘dry’ (Tai) loŋ ‘descend’ (Tai) l ̥an ‘grandchild’ (Tai) ɉɔ ‘king, prince, lord’ (Tai) (via Burmese?) ɉoŋ ‘monastery’ (Burmese) ɁiɁ ‘sleep’ (Burmese) ɡa ‘dance’ (Burmese) l ̥u ‘bring food to old people (at the temple)’ (Burmese)

6

Kinship Terms

At Nor Lae, most of the households usually consist of two or three generations, that is, a man and his wife, their unmarried children and their married sons, their wives and children. The married daughters move away to live with their husband’s parents. That is to say, DP people seem to have a patrilinear family, and they tend to marry within their own ethnic group, even though, recently some DP have married outsiders. They practice monogamy and divorce is quite common among the DP. DP kinship terms are used when they speak to or about a relative. Some kinship terms are apparently borrowings from the Tai (Shan). These terms generally reflect relative age and generation differences. Whether the relationship is on the father’s or mother’s side is not significant. The gender reference of the person spoken or referred to is made only for those who are in the generations higher than the speaker. It is common usage to call adults by the name of one

dara ’ ang palaung

1099

of their children, such as ‘Pai’s mother’ and ‘Pai’s father’, as can also be seen in the text below. The words for ‘male’ or ‘female’ are added when a gender distinction is required for persons within the same or lower generations than the speaker. The most common basic kinship terms are as follows: gɔn son, daughter, children guɲ father ma mother da grandfather (older man) ja grandmother (older woman) bə uncle (younger than parent) gə̆n aunt (younger than parent) gun ʔan uncle (older than parent) ma ʔan aunt (older than parent) ʔimaj husband (man, male) ʔibə̆n wife (woman, female) ʔikăt older sibling va younger sibling l ̥an niece, nephew l ̥en grandchild ʔia in-law (male) ʔɔ in-law (female)

7 Example DP Text This sample glossed text is an introductory part to the whole story of the folktale “I-pai”, narrated by the late Mae Suk Paansaeng, 54 years from Nor Lae Village, Monpin subdistrict, Fang District of Chiang Mai Province, Thailand, first recorded in 2008 and rechecked in 2012. dɨmən mə̆h ɡuɲ-paj, ɡuɲ-paj ɡaj ma-paj ɡɔn ɡaj cɨ long.time have addr-pn addr-pn and addr-pn child 3du name paj, ma-paj tauh hɔc dɨ haw do, Ɂo dɨ haw toŋ, pn addr-pn say finish GOAL go those 1sg GOAL go far Ɂo dɨ haw man, Ɂo dɨ haw katih.dɨ.krəh.katə.dɨ.ɡriah, 1sg GOAL go garden 1sg GOAL go far.away

1100

deepadung, rattanapitak and buakaw

haw bə̆t ble Ɂăn bɔ haw, daɁ hεɁ tʰɔŋ kadaiɁ he ɡrɔ go pick fruit 3sg prob go sunlight nsit shine treetop tree teak mε duh ŋ̊ aw, daɁ hεɁ tʰɔŋ dɨ rεŋ daŋ 2sg pound paddy sunlight nsit shine GOAL bamboo.wall cook bom (. . .) rice (. . .) ‘Long time ago, there were a father, called Pai’s father, a mother called Pai’s mother, and their child, named Pai.19 Mother Pai said she went over there, “I will go far, I will go to the farm, I will go very very far. I am going to pick fruits. Whenever the sun shines at the top of the teak tree, pound the rice. Whenever the sun shines at the house’s wall, cook the rice.” ’ Ɂăn daɁ tʰɔŋ kadaiɁ he ɡrɔ Ɂăn păh mbɔ, 3sg sunlight shine treetop tree teak 3sg carry.on.forehead mortar Ɂăn daɁ maso ɡaj mafa dop Ɂăn houɁ he, 3sg call dog and monkey with 3sg ascend tree ‘When the sun shone at the top of the teak, he carried the mortar. He called the dog and the monkey to go with him up the tree.’ so də̆h mε ɡɔɁ mana, fa də̆h mε dog take 2sg carry.on.shoulder bamboo.tray monkey take 2sg ɡɔɁ ŋkre, mafa mε ɡɔɁ ŋkre, maso carry.on.shoulder pestle monkey 2sg carry.on.shoulder pestle dog mε kʰlɨp mana houɁ he tə̆n, Ɂăn mε 2sg carry.on.head bamboo.tray ascend tree that 3sg 2sg păh mbɔ houɁ dɨ he, carry.on.forehead mortar ascend GOAL tree ‘ “Dog, you carry the bamboo tray, Monkey, you carry the pestle.” The monkey carried the pestle, the dog carried the bamboo tray, and went up that one tree. He carried the mortar up the tree.’

19

pai ‘lima bean’.

dara ’ ang palaung

1101

houɁ houɁ houɁ he tə̆n, mbɔ Ɂăn mə̆h kadə̆t coh ascend ascend ascend tree that mortar 3sg have torn fall (dɨ) maso ɡaj mafa Ɂăn, maso ɡaj mafa tə̆n GOAL dog and monkey 3sg dog and monkey dem bɔ jăm, prob die ‘While going up the one tree, his mortar fell down on to his dog and his monkey. The dog and the monkey, both of them died.’ Ɂăn mɨ tauh simu pi ɁiɁ ɡɨŋ baj dɨ ti, 3sg TCL say what imm sleep overturn 2du GOAL prox baj pi ɁiɁ dɨ ti, Ɂase tauh baj ɁiɁ dăŋ he ti, 2du imm sleep GOAL prox who say 2du sleep trunk tree prox dǝ̌h houɁ ŋkre baj ti də̆h houɁ mana hεɁ take ascend pestle 2du this take ascend bamboo.tray nsit loŋ ta, descend use ‘He then said “What are you two doing lying there? You two sleep here. Who told you two to sleep at this tree base?” He came down to order the two of them to take the pestle and the bamboo tray up (the tree).’ Ɂăn mɨ jɔɁ jɔɁ, Ɂăw jic maso, Ɂăn tǝ̌n leih măh 3sg TCL look look neg get.up dog 3sg dem go.out hit hεɁ baj, Ɂăn tauh Ɂo măh hεɁ, Ɂăn bɔ leih dɨ nsit 2du 3sg say 1sg hit nsit 3sg prob go.out GOAL he tə̆n leih jɔɁ, maso ɡaj mafa Ɂăn tə̆n hεɁ jăm, tree dem go.out look dog and monkey 3sg dem nsit die jăm bɔ Ɂa do, Ɂăn mɨ ɡlə jam. die prob two clf 3sg TCL sad cry ‘He then looked at (both of them). The dog did not get up. It’s he who went to beat you two. He said “I beat you.” He was going down from that very tree to look (at them). His dog and his monkey were already dead. Both of them died. He then was sad and cried.’

1102

deepadung, rattanapitak and buakaw

8 Bibliography Ashley, Sean M. 2011. Late for Buddha: The construction of Dara’ang (Silver Palaung) religious and ethnic identity. Unpublished doctoral dissertation, Simon Fraser University, Canada. Buakaw, Supakit. 2012. A phonological study of Palaung dialects spoken in Thailand and Myanmar, with focuses on vowels and final nasals. Unpublished doctoral dissertation, Mahidol University. Deepadung, Sujaritlak. 2009. Ethnicity and the Dara’ang (Palaung) in Thailand. Journal of Language and Culture 28.1: 7–29. ———. 2011. Name and sub-groups of the Palaung: A preliminary field survey in Yunnan and Kengtung. Manusya 14.2: 88–104. Deepadung, Sujaritlak & Pattama Patpong. 2010. Dara’ang: language, culture and ethnic identity maintenance at the Thai-Myanmar border. [Research report] Research Institute for Languages and Cultures of Asia, Mahidol University. [In Thai] Howard, Michael C. 2005. Textiles of the highland peoples of Burma, Volume II: The Northern Mon-Khmer, Rawang, Upland Burmish, Lolo, Karen, Tai, and Hmong-Mienspeaking groups. Bangkok: White Lotus. Howard, Michael C. & Wattana Wattanapun. 2001. The Palaung in northern Thailand. Chiang Mai: Silkworm Books. Janzen, Hermann. 1978. A phonological description of Pale in comparison with Goldand Rumai-Palaung. Paper presented at the 2nd International Conference on Austroasiatic Linguistics, December 19–21, 1978. Mysore, India. ———. 1976a. The system of verb-aspect words in Pale. In Philip N. Jenner, L.C. Thompson & Stanley Starosta (eds.), Austroasiatic Studies Part I. Honolulu: University of Hawaii Press, 659–668. Janzen, Hermann & Margarete Janzen. 1972. Grammar analysis of Pale clauses and phrases. Journal of the Burma Research Society LVidii: 47–99. Kanomi, Takako. 1991. People of myth: Textiles and crafts of the Golden Triangle. Kyoto: Shikosha. Lewis, M. Paul, Gary F. Simons & Charles D. Fennig (eds.). 2013. Ethnologue: Languages of the World, Seventeenth edition. Dallas, Texas: SIL International. Online version: http://www.ethnologue.com. Mak, Pandora. 2012. Golden Palaung: A grammatical description. Canberra: Asia-Pacific Linguistics. Matisoff, James. (1973). Tonogenesis in Southeast Asia. In L.M. Hyman (Ed.), Consonant types and tone (pp. 71–95). Los Angeles: University of Southern California. Milne, Leslie. (1910). Shans at home. London: John Murray, Albemarle Street; Reprinted 2001, White Lotus, Bangkok, Thailand. ———. (1924). The home of an eastern clan: A study of the Palaungs of the Shan States. Oxford: Clarendon Press; Reprinted 2004, White Lotus, Bangkok, Thailand.

dara ’ ang palaung

1103

Mitani, Yasuyuki. (1977). Palaung dialects: A preliminary comparison. South East Asian Studies, 15 (2), 193–212. Ostapirat, Weera. (2009). Some phonological criteria for Palaung subgrouping. Journal of Language and Culture, 28 (1), 63–76. Schmid, Pater Wilhelm. (1906). Die Mon-Khmer Völker, ein Bindeglied zwischen Völkern Zentralasiens and Austronesiens. Braunschweig: Archiv für Anthopologie, 5, 59–109. Thomas, David D. & Robert K. Headley. (1970). More on Mon-Khmer subgroupings. Lingua, 25, 398–418. Xiu Dingben. (2008). De’angzu Jianshi [A Brief History of De’ang Ethnic Group]. Beijing: Minorities Press. [In Chinese]

chapter 18

Danau Aung Si 1 Background Danau (also Danaw, tʰənɔʔ; dnu) is a small Palaungic language that has previously attracted little scholarly attention; until now the only significant linguistic publication in English is the rather short account by Luce (1965) of his collection of some Danau data while passing through the speaker area in 1963). The material presented here was collected during fieldwork carried out by the present author over a two-month period in late 2012,1 and reflects a first attempt at a sketch grammar of the language. The Danau live in three or four villages to the south of the town of Aungban in Shan State, Myanmar. Most of the villages—Taungbohla, Chaunggya and Thaethit—can only be reached via a poorly-maintained dirt road, which is navigable only by motorbike. Only the village of Naung In is accessible by a sealed road, being just off the major highway that links Aungban, the airport town of Heho, and the state capital Taunggyi. Ethnologue provides a reasonable estimate of 5,000 for the number of people identifying as Danau. The first three villages mentioned above can be said to be ‘purely’ Danau villages, although their nearest neighbours are dominated by Pa’O speakers. The final village, Naung In, has a mixture of Danau and Pa’O inhabitants. The Danau are a practically unknown ethnic group in most parts of Myanmar. Even in the town of Aungban, which is frequented by Danau farmers selling their produce on market days, people tend to confuse Danau with Danu, the dominant ethnic group of Aungban, whose members speak a dialect of Burmese. Pa’O people living in nearby villages, however, have frequent contact with Danau speakers, and are aware of the names and locations of the villages. 1 The author is indebted to U Wariya, a Danau monk at Aungchantha Monastery in Aungban, for facilitating meetings between the author and key community members, to the people of Taungbohla and Thaethit villages for their contribution as language consultants, to U Win Kyaw Aung and U Kyaw Moe of Aungban for field assistance, to U Khin Maung Aye, Daw Than Yi, U Than Oo and Daw Thant Zin Aye for their hospitality in Aungban, and to the Endangered Languages Documentation Programme for funding support.

© koninklijke brill nv, leiden, ��5 | doi ��.��63/9789004283572_024

danau

1105

The language situation in the villages was encouraging, in spite of predictions as far back as the 1960s that the language was ‘dying’ (Luce 1965). Even very young children were able to speak Danau fluently to their peers or their elders, and practically all the adults I encountered were able to also speak Pa’O and Burmese, albeit in a heavily accented manner. As an ethnic group, the Danau differ from neighbouring groups by not possessing the key signs of cultural distinctiveness that the latter value and cherish. These include an ethnic costume, distinctive musical instruments and unique musical styles. The Danau have instead adopted the costume, instruments and music of their Pa’O neighbours, and even elderly Danau people insist that this has always been the case. Indeed, it was not possible to identify, even with the assistance of enthusiastic senior community members, any Danau people who could sing any traditional Danau songs, or narrate any Danau epics or folk tales. 2 Phonetics/Phonology 2.1 Word/Syllable Structure The Danau word is typically one or two syllables. The typical maximal syllable structure for strong syllables in Danau is as follows, where C stands for a consonant, G for a glide /l, w, r, j/, V for a vowel, and T for a tone. C(G)V(C)T klɨʔ ‘good’

In addition, Danau has two types of weak syllables which are always followed by a strong tone-bearing syllable. One weak syllable type is of the shape Cə-, the other consists of a syllabic nasal N̩ -. The latter can carry a tone, though it is not clear whether tone is actually distinctive in this syllable type. The two types of weak syllables can combine in the order CəN-, which results in an unstressed closed presyllable. The complex word structures found in indigenous words are the following. CəCGVCT tǝphwɐ̄t ‘mat’ N̩ CGVCT ŋ̄ pwɐ̄t ‘woman’s bag’ (short form) CəNCGVT tǝŋ̄ pwɐ̄t ‘women’s bag’ (full form)

1106

si

Words always begin with a consonant and can end with either a consonant or vowel. Consonant clusters can only occur in syllable-initial position.2 In addition, syllabic nasals may also occur at syllable onsets, either word-initially or wordmedially. Clusters are formed with medial Glides (G) /l, w, r, j/, the most common G being /w/. Longer lexemes of three or more syllables (especially in the case of plant names) do exist. Pali or Burmese loanwords, especially in the semantic domains of religion and administration, also figure prominently in this latter category. Words are iambic, a stressed or strong syllable can be preceded by a weak unstressed syllable, at the nucleus of which lies a /ǝ/. The weak syllables are invariably open syllables. Strong syllables can be open, closed, or be nasalised; closed syllables never contain a nasalised vowel. 2.2 Phoneme Inventory and Phonotactics The following are the possible C1 consonants: Labial

Dental

Palatal

Velar

Glottal

ph p b ɓ m (f)v w

th t d ɗ n sh, s, z r, l

tʃh c dʒ

kh k g

ʔ

ɲ (ç) j

ŋ x

h

The possible set of syllable final consonants is much smaller. Note that stops that occur word-finally are never released:

2 A glottal stop predictably occurs before a vowel if no other initial consonant is present. This glottal stop is not represented in the phonemic transcription applied in this chapter.

1107

danau Labial

Dental

m

t n

Palatal

Velar

Glottal

ʔ

j

k ŋ x

The palatal stop /c/ is rare, apparently only occurring in words of Burmese origin, such as ǝcɨ̂ ‘benefit’ ( klɨʔ.klɨʔ ‘well’ sɛ̂ŋ ‘careful’ > sɛ̂ŋ.sɛ̂ŋ ‘carefully’ mɛt ‘all’ > mɛt.mɛt ‘completely’ (although see section 5.1.2 for the use of reduplicated mɛt in plural, inclusive pronoun formation)

A partial reduplication of the question word kətʰɐn ‘what’ allows the speaker to ask for a list of items (see section 4.1 below). kǝtʰɐn ‘what’ > kǝtʰɐn-tʰɐn ‘what.pl’

The resulting kǝtʰɐn-tʰɐn can also be translated as ‘something’, depending on the context. Similarly, ǝsɔsɔ ‘somewhere’ and ǝkʰɔkʰɔ ‘sometime’ seem to be reduplicated, but the corresponding basic lexemes have yet to be identified. (5) kɔŋ bouʔ nì kǝtʰɐn~tʰɐn hō hɔʔ inside box prox what~red exist cop ‘There is something in this box.’

Two-part adjectives have each of their constituents reduplicated separately to form adverbs, as in the case of tɐŋ plɛə̀ŋ ‘pure’, and the Burmese loan ro te ‘respectful’: tɐŋ plɛə̀ŋ > tɐŋ tɐŋ plɛə̀ŋ plɛə̀ŋ ‘in a pure manner’ ro te > ro ro te te ‘respectfully’

Reduplication can also serve a syntactic function—this is described in section 4.2.3 (‘Other subordinate clauses’). 4

Phrase and Clause Structure

4.1 Simple Sentences A basic Danau sentence has AVP verb order, but verbless sentences are also frequently used. (6) ō swe bɯ̄ 1sg eat rice ‘I am eating (rice).’

1112

si

(7) sɐʔouʔ nì=nə sɐʔouʔ ō book prox=top book 1sg ‘This book is my book.’

Declarative clauses that describe the state of an entity, or mention the existence of an entity, require a sentence final copula. In the latter case, the copula is preceded by the existential verb hō. (8) ō tin/tə̄ŋkɐj/pɐj hɔʔ 1sg sleep/tall/tired cop ‘I am sleeping/tall/tired.’ (9) ɲɐ̄ mə̀=nə kõ̂u hɔʔ house 3sg=top big cop ‘His house is big.’ (10) ŋkwɐt ɲɐ̄ mə̀=nə kʰræ̃ hō hɔʔ behind house 3sg=top garden exist cop ‘There is a garden behind his house.’

Other copula-like morphemes that occur sentence-finally include the stative copula mə̄nɔʔ, the collective copula cjɐbō or jɐbō and the negative copula kɔʔ. The latter is discussed in the following section. The exact denotational and collocational range of the stative copula is currently unclear, but it seems to occur frequently in concert with verbs of cognition (je mə̄nɔʔ ‘see’, o mə̄nɔʔ ‘hear’, etc.), when an object argument is absent or implied, with the modal sense of the verb bɪ̃ ‘get’, or with verbs that denote a fixed state, at least in the context of the utterance. Semantically, there appears to be a sense of slight reflexivity common to these uses—in all cases, the stative copula indicates that the particular action mentioned in the utterance only affects/will affect the performer of that action, and no other entity. This copula is also used with transitive verbs, when the object has been fronted. (11) kɐt souʔ-tʰē nɐ̄ ɔ lô ə-kwɐ̀ t tʰɯ̄ t mə̄nɔʔ under tree-wood dist person one-clf sit stat ‘There is someone sitting under that tree.’ (12) mə̀ swe nɐ̄ ɔ bɪ̃ mə̄nɔʔ 2sg eat dist get stat ‘You can eat that.’

danau

1113

(13) ō (kǝ)lɐ swe mə̄nɔʔ 1sg dare eat stat ‘I dare to eat it.’ (14) lɐɪk nī=nəʔ ō pɐ-pʰɐ̄ mə̄nɔʔ text prox=top 1sg abil-read stat ‘I can read this.’ (15) kɐt mə̄nɔʔ true stat ‘It’s true/Yes.’

The stative copula is only used in affirmative declarative sentences, i.e. in the absence of a question or a negation. Thus: (16) ō heiʔ mǝ̀ mǝ̀ on jɐ̂ [*mǝ̀ on mə̄nɔʔ jɐ̂ ] 1sg call 2sg 2sg hear q ‘Did you hear me call you?’

The collective copula denotes an action carried out by multiple entities. As such, it can be used in imperatives directed at multiple addressees. The obligatory sense of ‘plural actors’ indicated by this morpheme also allows it to be used in reciprocal and hortative constructions. (17) pɯ̄ t sē cjɐbō nề come imp.pol coll disc ‘(All of you) come!’ (18) kɐ̀m cjɐbō tɐ̄ t~tɐ̄ t, mɐ̀ n cjɐbō tɐ̄ t~tɐ̄ t listen coll quiet~red be coll quiet~red ‘(All of you) listen quietly, sit quietly!’ (19) ə-kwɐt nɐ̄ ə-kwɐt jɐt jɐbō one-clf top one-clf look coll ‘They are looking at each other.’ (20) ē mɛ̀tmɛ̀t nì ɪ-kɨ̂ cjɐbō ŋbæ̃ ̂ ɛ 1pl all prox hort=go coll pn ‘Let’s all go to Aungban.’

1114

si

Negation Simple AVP sentences are negated minimally by inserting the negative copula kɔʔ in a post-verbal position, before the object argument (if any) or other adjuncts belonging to the clause. This is possibly a colloquial construction, as a more elaborate, two-morpheme negation strategy also exists, which involves the additional use of a negative prefix lɐ- on the verb. (21) ō (lɐ-)kɨ̂ lɔn kɔʔ mǝ̀ 1sg (neg-)go accompany neg.cop 2sg ‘I won’t go with you.’

Verbless sentences are negated using a sentence-final kɐt kɔʔ ‘not true’. (22) sɐʔouʔ nī=nə sɐʔouʔ ō kɐt kɔʔ book prox=top book 1sg true neg.cop ‘This book is not my book.’

Questions Polar questions can be formed by the addition of the question particle ( j/r)ɐ to the end of a declarative statement. There seems to be no simple morphophonemic rule governing the choice of ɐ, rɐ or jɐ. However, there is a tendency for the allomorph with the palatal glide to be used when the preceding word ends in any vowel except /ɯ/, and for rɐ to follow /ɐ/. (23) mə̀ su kəɲwɐ̄ n ɐ̂ 2sg walk path q ‘Are you walking?’ (24) ō tʰɯ̄ t tə-nī bɪ̃ jɐ̂ 1sg sit loc-prox get q ‘Can I sit here?’ (25) dəmwɐ̀-rɐ̂ where-q ‘Where?’

Two frequently used interrogative constructions are short lexicalized phrases, both using ( j)ɐ̂. Kɐt jɐ̂ ? is a frequently used tag question in Danau discourse, along with Kɐt nɛ̀? ‘Isn’t it?’

danau

1115

(26) kəthɐn ɐ̂ what q ‘What (is it)?’ (27) kɐt jɐ̂ true q ‘Is it true?/Is that right?’

A second optional ( j)ɐ̂ is incorporated clause-medially for questions with transitive verbs: (28) mə̀ swe ( jɐ̂ ) ɲɯŋ jɪn ɐ̂ 2sg eat (q) meat chicken q ‘Are you eating chicken?’

Danau has a small set of interrogative pronouns, kəthɐn ‘what’ and ŋ̄ kwe ‘who’, and interrogative adverbs, dəmwɐ̀ ‘where’, kǝmɔ/ŋmɔ ‘when’, hodəmwɐ̀/səmwɐ̀/ thǝmwɐ̀ ‘how’. The use of the question particle in conjunction with these words is largely optional, and usually takes place when the interrogative pronoun is fronted. However, as mentioned above, the phrase kəthɐn ɐ̂ has been lexified, and kəthɐn ‘what’, as a question, is practically never used without a following ɐ̂. (29) ɐn lɯ kəthɐn ɐ̂ 3sg desire what q ‘What does he want?’ (30) (mə̀) kɨ̂ dəmwɐ̀ (2sg) go where ‘Where are you going?’ (Greeting) (31) dəmwɐ̀ rɐ̂ mə̀ sʰɔ̃ kɨ̂ where q 2sg want go ‘Where is it that you want to go?’

When the expected response is a list of items to be enumerated by the addressee, the question words kətʰɐn and dəmwɐ̀ can be pluralised by reduplicating the last syllable: (32) lô tudɔkɐũ̂ tərɐ̀ əlu tʰôn pɐ̂ nì=nə kətʰɐn~tʰɐn ɐ̂ person virtuous sermon charity five clf prox=top what~red q ‘What are the five charities (mentioned in) the sermon of the virtuous man?’

1116

si

(33) dəmwɐ̀~mwɐ̀ rɐ̂ frɐ̄ hō=nə where~red q holy exist=top ‘Where can one find temples?’ [lit. ‘Where (pl.) are there temples?’]

Commands There is no overt imperative marker in Danau, with commands merely requiring the bare verb stem uttered with a falling intonation. (34) swe bɯ̄ eat rice ‘Eat!’ (35) tʰɯ̄ t sit ‘Sit!’

Negative commands or prohibitions require the use of a special negative imperative suffix on the verb stem, in addition to the default negative prefix: (36) lɐ-kɨ̂-nɐʔ neg-go-proh ‘Don’t go!’ (37) lɐ-swe-nɐʔ nɐ̄ ɔ=nəʔ neg-eat-proh dist=top ‘Don’t eat that!’

Optative An optative construction is possible in Danau, with the reading, “May sth. happen!” Not surprisingly, this construction occurs frequently in religious contexts. The construction is composed using the conditional verbal prefix tǝalong with a pre-verbal optative particle ɐ̄ . (38) kɐjɐ̄kɐn əpjɪt ɐ̄ tə-bɐ, wɐsikɐn əpjɪt ɐ̄ body fault opt cond-disappear speech fault opt tə-kæ̃ ̂ , mənɔ̄ kɐn əpjɪt ɐ̄ tə-sʰæ̃ ̂ . cond-avoid mind fault opt cond-clear ‘May the faults of your actions disappear, may the faults of your words be avoided, may the faults of your thoughts be cleared away.’

danau

1117

In everyday speech, this construction can be used to indicate intention or causation (see also section 5.2.4 for a morphological causative). (39) ɐn wæx nì=nə ɐ̄ tə-pɪt mwî kɐj klɨʔ~klɨʔ 3sg cow prox=top opt cond-fat feed fodder good~red ‘He is feeding the cow well to fatten it.’

4.2 Complex Sentences 4.2.1 Coordination, Chaining and Subordination The coordination of two or more declarative clauses using the equivalent of ‘and’ (along the lines of e.g. ‘This is mine, and that is yours.’) is unidiomatic in Danau, with the preferred construction simply being two separate uncoordinated clauses. Co-ordination of verb phrases is carried out by simply chaining the two VPs in the correct temporal sequence. (40) ō jɪ̄n mɛ ɲɐ̄ jɪ̄n swe bɯ̄ 1sg return fut house return eat rice ‘I’ll go home and eat.’

The same is true of co-ordinating clauses that contrast with each other: (41) ɲɯŋ wæx=nə ō khrɯ̄ k mə̄nɔʔ, ō lɐ-swe kɔʔ meat cow=top 1sg like stat 1sg neg-eat neg.cop ‘Although I like beef, I don’t eat (it).’

Choice between two actions is elicited by chaining two phrases or clauses, both terminating in the Question particle ( j)ɐ̂ . (42) mǝ̀ æɛ̃ ŋwɐ̂ n un tʃhoje jɐ̂ , un kəphi jɐ̂ 2sg fut drink water tea q water coffee q ‘Will you drink tea or coffee?’

Clauses can be linked in other ways, however, including the use of the conditional and purposive constructions: (43) lo-sʰī-nì kəle tə-pəʔ=nə̀ ō kɨ̂ kɔʔ ŋbæ̃ ̂ ɛ now-day-prox rainfall cond-rain=top 1sg go neg.cop pn ‘If it rains today, I won’t go to Aungban.’

1118

si

(44) khæ̃ ̂ wæ̃ ̂ sɐʔouʔ hō hɔʔ (khɐũ̂) ō æ̃ ̂ kɨ̂ mɛʔ təsʰɐĩ purp buy book exist cop (because) 1sg fut go obl shop ‘I will go to the shop in order to buy a book.’ [‘. . . because there is a need to buy a book.’] (45) jwɐn.swe kæ̃ ɛ=nə̀ kʰɐ̃ ɔ̂ son.klɛ̀ hɔʔ lu hɔʔ believe karma=top because sacrifice cop donate cop ‘Because you believe in karma, you give away possessions, you perform charity.’

Other subordinating constructions make use of temporal adverbs such as pʰɯ ‘before’, tɐn ‘during’ and jɐ̄ ɔ thɐ ‘after’. Both constructions require an irrealis prefix on the verb of the subordinate clause. (46) pʰɯ ɲoən mə-pɯ̄ t=nə ō æ̱̃ ̀ ɛ dɯŋ bɯ̄ before electricity irr-come=top 1sg fut cook rice ‘I will cook before the power comes on.’ (47) tɐn ɲoən mə-pɯ̄ t=nə mə̀ jɔx kətʰɐn ɐ̂ during electricity irr-come=top 2sg do what q ‘What do you do when there is electricity?’

A combination of the irrealis prefix and the suffix -none (section 3.2) results in a subordinate clause with the meaning ‘without doing sth.’ (48) ɐn mǝ-ɓɐʔ kɔʔ ŋ̄ khɐ-mɐ 3sg irr-close neg.cop door-NONE ‘He went out without closing the door.’

ɐn kɨ̂ pon nthɯn 3sg go exit outside

Verbs can be chained together without any overt morphology, resulting in a compound verb that usually describes a sequence of events. (49) ō fro ple mə̀ 1sg say show 2sg ‘I will explain (sth.) to you.’ (50) ō kɨ̂ lɔn mǝ̀ 1sg go accompany 2sg ‘I’ll go with you.’ (51) ljɐk tō sʰɐ̃ æ æ̃ sʰī nì=nə ɗɐ̄ ɛx sɔ̀ m next month start two day prox=top climb clear.weeds

danau

1119

tjɐbō kəljɐ̀ŋ cjɐ̃ ɔ̂ coll compound monastery ‘At the start of the coming month, on the 2nd, let’s go up to the monastery compound to clear weeds.’

In some cases, the second verb adds an aspectual element to the first, main verb. The verbs pɯ̄ t ‘come’ and kɨ̂ ‘go’ are important in this respect, with the former indicating a past progressive, and the latter, a future progressive aspect. (52) frɐ̄ hɔ̀ pɯ̄ t sənə̀ holy preach come like.this ‘The Buddha has been preaching thus.’ ̂ nɐj laŋwɐ̄ n nì, sæ̃ ̂ ɛ pɐʔɐ̃ æ ̀ hɔ̄ (53) nəkʰɐɔ̃ kɨ̂ tərɐ̂ and.so like morning prox monk first preach go doctrine

kənɔ̀ ʔ pn ‘And so, this morning, I will first be delivering a sermon in Danau.’

Aspect is further indicated by a small number of particles which are described in section 5.4.3. 4.2.2 Modality The meaning ‘want’ is expressed by the pre-verbal shɔ̃ , which is also used in other constructions (see section 4.2.3). The meaning ‘should’ requires the preverbal (kə)lɯ̀ . (54) ō sʰɔ̃ kɨ̂ ŋbæ̃ ̂ ɛ 1sg want go pn ‘I want to go to Aungban.’ (55) ō sʰɔ̃ swe kɔʔ bɯ̄ 1sg want eat neg.cop rice ‘I don’t want to eat.’ (56) lùpē nì=nə (kə)lɯ̀ ɗɐ̄ ɛx thɛ̃ cjɐ̃ ɔ̂ child prox=top should climb teach school ‘This child should attend school.’

1120

si

(57) mə̀ nì=nə (kə)lɯ̀ swe lɐŋŋæ̃ ̂ ɛ=nì 2sg prox=top should eat medicine=top ‘You should take this medicine.’

The verb bɪ̃ ‘get’, which can be used independently, can also be chained to a following verb to produce the modal meaning ‘can’ or a request for permission to do something, in a way partly similar to the Burmese já ‘get’. (58) ē bɪ̃ əcɨ̂ sənə̀ 1pl get benefit like.this ‘We get this kind of benefit.’ (59) ō kɨ̂ bɪ̃ jɐ̂ ? 1sg go get q ‘May I go?’ (60) pɐnɐtìpɐtɐ̀ pə-de nì=nə jɔ̀x bɪ̃ kɔʔ kətʰɐn ɐ̂ , pali.quote nfin-say prox=top do get neg.cop what q pin bɪ̃ kɔʔ tʰæx bēj kill get neg.cop life other ‘ “Pɐnɐtìpɐtɐ̀” means (we) can’t do what? (We) can’t take the life of another.’

The ordering of bɪ̃ and the main verb seems to be important, as different word orders result in different modal meanings. Briefly, bɪ̃ preceding the main verb results in a construction that indicates compulsion or necessity (must/have to); the addition of the negative copula kɔʔ denotes a lack of compulsion or necessity. (61) bēj lɛ̄ bɪ̃ fro kɔʔ other also get say neg.cop ‘Others don’t need to tell you either (to do sth. good).’ (62) ɛ̀tʰɛ tə-pɯ̄ t ɲɐ̄ pɯ̄ =nə̀ pɯ̄ bɪ̃ mwì kɔʔ bɯ̄ guest cond-come house 2pl=top 2pl get nourish neg.cop rice ɗe? bɪ̃ mwì, kɐt jɐ̂ ? rhet get nourish right q ‘If a guest comes to your house, don’t you have to feed (him)? You have to feed (him)! Right?’

1121

danau

The ability to do something is indicated by a verbal prefix pɐ-, as in the following: (63) lɐik nì=nə mə̀ pɐ-pʰɐ jɐ̂ text prox=top 2sg abil-read q ‘Can you read this?’ (64) lɐik nì=nə ō pɐ-pʰɐ text prox=top 1sg abil-read ‘I can’t read this.’

kɔʔ neg.cop

An alternative strategy is to chain the verb nɯ̀ ŋ ‘be able’ following the main verb of a clause. Thus, the following two sentences are said to be identical in meaning (note that the stative copula mǝ̄nɔʔ is required in both constructions): (65) ō pɐ-fro mǝ̄nɔʔ mǝ̄nɐ̃ ɔ kǝnɔʔ 1sg abil-say stat speech pn ‘I can speak the Danau language.’ (66) ō fro nɯ̀ ŋ mǝ̄nɔʔ mǝ̄nɐ̃ ɔ kǝnɔʔ 1sg say able stat speech pn ‘I can speak the Danau language.’

The verb sɐ̀ŋ ‘try’ can be chained after the main verb in a clause to give the meaning ‘try to do sth./try out sth.’ (67) swe sɐ̀ŋ tū nì=nǝ eat try curry prox=top ‘Try this curry.’

4.2.3 Other Subordinate Clauses As in Burmese, subordinate clauses that have the meaning “no matter what/ where/etc.” are formed by reduplicating the verb of the subordinate clause. Unlike Burmese, however, the interrogative is inserted between the two reduplicated tokens: (68) ɐn shɔ̃ jɔx kǝthɐn jɔx, ɐn lǝ-klɐj nɯ̂ ŋ kɔʔ 3sg want do what do 3sg neg-become able neg.cop

1122

si lô kətʰê person rich ‘No matter what he does, he cannot become a rich person.’

A more straightforward case of verbal reduplication in the formation of subordinate clauses is as follows. Here, the meaning is “no matter how much V happens”. (69) sʰɔ̃ mɛt~mɛt sʰɔ̃ jɔ̀ ~jɔ̀ nɔət rɐ̀ hō kɔʔ want finish~red want reduce~red mind fear exist neg.cop ‘No matter how much (your possessions) get used up, no matter how much (they) diminish, [there should be] no fear in the mind.’

4.2.4 Relative Clauses Relative clauses are formed with the use of the particle pəʔ, which in the case of intransitive verbs appears at the start of the clause being relativized. In the case of transitive verbs, the object argument can be relativized by the introduction of pəʔ between the subject and the verb. The object arguments of ditransitive verbs can also be relativized—relativization of the direct object argument proceeds as described above, but the relativization of indirect object arguments seems to be unidiomatic. A preferred strategy would be a two-clause construction, such as ‘You gave someone a book yesterday; he is my brother.’ (70) ɐn tʰɯ̄ t mɛʔ kɐt souʔ kədɯ nɐ̄ 3sg sit obl under tree pine disc ‘He is sitting under a pine tree.’ (71) pəʔ tʰɯ̄ t mɛʔ kɐt souʔ-tʰē nì=nə mɔ rel sit obl under tree-wood prox=top brother ‘The person sitting under the tree is my brother.’

ō 1sg

(72) kɐntʰɯ̄ =nə mə̀ pəʔ swe tū nɐ̄ ɔ=nə tū ɲɯŋ jɪn yesterday=top 2sg rel eat curry dist=top curry meat chicken ‘The curry you ate yesterday was chicken curry.’ (73) kɐntʰɯ̄ =nəʔ ɐn pəʔ ɐ mə̀ sɐʔouʔ nɐ̄ ɔ=nəʔ sɐʔouʔ ō bɐ̀ yesterday=top 3sg rel give 2sg book dist=top book 1sg disc ‘The book he gave you yesterday is my book.’

danau

1123

Oblique arguments can also be relativised, although the inclusion of the argument and relativising particle pəʔ in the relative clause appear to be optional. The translation of the following two sentences is potentially problematic, because the Burmese sentences used to elicit these sentences are grammatically ambiguous. (74) mə̀ pəʔ tɛm (mɛʔ) nɐ̄ ɔ=nəʔ kʰɛ̄tɐn ō bɐ 2sg rel write (obl) dist=top pencil 1sg disc ‘What you are writing with is my pencil.’ (75) ɐn tə̄ŋ tʰɯ̄ t mɛʔ nɐ̄ ɔ=nəʔ kɐt souʔ kədɯ 3sg prog sit obl dist=top under tree pine ‘Where he is sitting is under a pine tree.’

4.2.5 Complement Clauses No specific particle is required to construct complement or quotative clauses in Danau. Typically, such clauses occur with the verbs fro ‘say’, on ‘hear’, thɯn ‘think’, sɪ̄nsɐ̂ ‘think about’, although other transitive verbs can also be used. Frequently, adverbial particles such as sənə̀ ‘like this’ are also placed after the verb in the framing clause to act as a kind of complementizer. Following is a general template for a sentence with a complement clause: [Complement clause] [Main clause with + sənə̀]

As the following examples illustrate, however, there are frequent exceptions to this canonical pattern. A second general pattern does not require the complementizer-like sənə̀, and the following alternative general template could be proposed to take this observation into account: [Main clause] [Complement clause] (76) ɐn=nə ɐn klɐj tɯ̀ lô kəthê ō on tʰənəʔ=nə 3sg=top 3sg become pfv person rich 1sg hear like.that=top ‘I heard that he has become a rich man.’ (77) mə̀ pə-fro nə̀=nə mɐn ō jɔx mɛt ɐ mə̄nɔʔ 2sg nfin-say that=top ? 1sg do all give stat ‘I will do everything you say.’

1124

si

(78) frɐ̄ hɔ̀ səmwɐ̀ , “jɐnì . . . petì.” frɐ̄ hɔ̀ pɯ̄ t sənə̀ holy preach how pali.quote Buddha preach come like.this pɐlì=nə̀. pn=top ‘How did the Buddha preach? [“Pali quote”] The Buddha has been preaching thus in Pali.’ (79) bēj tɛ̀n ple pɯ̄ t tʰon mɐ̄ pə̀sɛ (hō hɔʔ) other write show come five clf only exist cop ‘Others have written that (there are) only five types.’ (80) ɐn=nə ləŋwɐ̀ n=nə æ̃ ɛ pɯ̄ t pərɔ̀ ɐn fro sənə̀ 3sg=top tomorrow=top fut come 3sg say like.this ‘He told me that he will come tomorrow.’ (81) ɐn=nə ɐn thɯn tɯ̀ ō kɨ̂ ŋbæ̃ ̀ ɛ 3sg=top 3sg think pfv 1sg go pn ‘He thinks that I’ve gone to Aungban.’ (82) mə̀ lɐ-de-nɐʔ ē lô sʰɔ̄ sʰɛ̄ mǝ̄nɐ̃ ɔ kʰrɐ̀ ŋ 2sg neg-tell-proh 1pl person busybody speech coarse ‘Don’t say that we are busybodies with coarse speech.’

It may be appropriate to analyse the first pattern (using sənə̀) as consisting of a sequence of uncoordinated, single-clause sentences—a pattern that is very common in spontaneous, spoken Danau. However, there are in many cases no clear prosodic signals to confirm this suspicion. 4.2.6 Adverbial Clause Some adverbial clauses make use of the gerund prefix pə-, Other adverbial clauses, which make use of a combination of a temporal preposition and the irrealis marker mə- are presented in section 4.2.1. (83) shī ō pə-kɨ̂ ŋbæ̃ ̀ ɛ nɐ̄ ɔ=nə kəle pɯ̄ t ŋkô day 1sg nfin-go pn dist=top rain come lots ‘It rained a lot on the day I went to Aungban.’ (84) ō pə-ljak jɪ̄n ɲɐ̄ =nə ɐn mɐ̀ mɐ̃ ɔ tɯ̀ 1sg nfin-arrive return home=top 3sg already dark pfv ‘When I arrived home, it was already dark.’

danau

1125

(85) ɐn mə-swe kɔʔ bɯ̄ mɐ̄ ɐn kɨ̂ pon nthɯn 3sg irr-eat neg.cop rice without 3sg go exit out ‘He went out without eating.’

4.3 Topic and Comment The topic marking postclitic =nə is used frequently in Danau discourse, as well as in elicited utterances. While the topic of a sentence is marked overtly by =nǝ, the comment or predicative part is unmarked, often terminating in a copula or an epistemic/discourse marker. (86) kūsɐ̄ ɲɐ̄ mə̀=nə souʔ-tʰē ɓoʔ hɔʔ around house 3sg=top tree-wood many cop ‘There are many trees around his house.’

An alternate form =kɐ̄ was noted in the speech of elderly Danau people. While =kɐ̄ appears to be similar in many ways to =nǝ, the fact that both are used by the same speakers strongly suggests that there is a difference in function between the two ‘topic’ markers, at least in more archaic forms of Danau. This, however, is a matter for future research. (87) khɯ̀ =kɐ̄ shɔ̃ ǝ-jɐik ǝ-kwɐ̄ t sɛ̂ 3pl=top want nml-lift one-clf only ‘They wanted only one person to do the lifting.’ (88) mǝ̄nɐ̃ɔ khɯ̀ =kɐ̄ khɯ̀ .sû ē pɐj kɔʔ mɛ̀ speech 3pl=top pol 1pl can neg.cop asrt ‘We could not speak their language (as you probably know).’

A slightly more elaborate form of topic marking involves the embedding of the topicalised NP between the preposition kwe ‘on’ and the demonstrative-topic marker complex nì=nǝ or nɐ̄ ɔ=nǝ. (89) kwe ǝlu.ǝtæ̃ ̂ ɛ pɯ̄ nɐ̄ ɔ=nǝ, nɔǝt kǝdɐj nì hō kɔʔ on charity 2pl that=top mind stinginess prox exist neg.cop ‘When it comes to your being charitable, [there should be] no meanness of spirit.’

Even though a formal gerund prefix exists in Danau (section 3.3), this is frequently omitted from complex noun phrases containing a verbal element, with the topic marking clitic =nǝ being able to convert a clause with a finite verb into a NP.

1126

si

(90) mə̀ ɐ ɐn pəshɐ̃ j=nə ō lɐ-khrɯ̀ k kɔʔ 2sg give 2sg money=top 1sg neg-like neg.cop ‘I don’t like you giving him money.’

4.4 Noun Phrase The minimal Danau noun phrase is a pronoun or a bare noun. Noun phrases can be further elaborated as compounds, demonstrative or adjectival phrases, or as full relative clauses (see section 4.2.2). Minimal NP: ō ‘1sg’ sɐn ‘bird’

Compounds: rɐt.kûn [knife.betel] ‘knife for cutting betelnut’ lɐɛh.lō.tū [ladle.pot.curry] ‘ladle for cooking pot’

Adjectival phrases: pʰwɐt sòn [mat red] ‘red mat’ kon pɛ [child small] ‘small child’

NP with attributive adj: souʔthē lɐ̄ kõ̂u.kõ̂u [tree leaf big.red] ‘tree with big leaves’

Determiner phrase: sɐʔouʔ nì=nə [book prox=top] ‘this book’ souʔtʰē nɐ̄ɔ [tree dist] ‘that tree’ mɐ̀j ɐn [mother 3sg] ‘his/her mother’

Relativized clause as NP: (91) [pəʔ tʰə̀ mɛʔ kɐʔ souʔ-tʰē nì=nə]REL mɔ rel sit obl under tree-wood prox=top brother ‘The person sitting under the tree is my brother.’

ō 1sg

1127

danau

(92) [kɐntʰɯ̄ =nəʔ ɐn pəʔ ɐ mə̀ sɐʔouʔ nɐ̄ ɔ=nəʔ]REL sɐʔouʔ ō bɐ̀ yesterday=top 3sg rel give 2sg book dist=top book 1sg pol ‘The book he gave you yesterday is my book.’

5

Word Classes

The following word classes can be distinguished on the basis of syntactic and morphological criteria: nouns, verbs, adverbs, and particles. 5.1 Nouns Danau nouns can be followed by a demonstrative (+ topic marker) or by a possessive construction in the form of a pronoun or another noun. Formally, nouns have no predictive diagnostic features, although a significant number have the maximal syllable structure (C)əCV(C), while many derived nouns also possess a word-initial ə-. 5.1.1 Common Nouns Nouns are not obligatorily marked for grammatical number, although there are strategies to indicate plurality if required. These include numerals and other quantifiers such as ɓoʔ ‘many’ and ŋkô ‘lots’. Danau has borrowed numerous nouns from Burmese and Pali, particularly in the semantic domain of religion and administration. 5.1.2 Pronouns and Question Words Table 18.1 lists the basic pronoun inventory of Danau. Plural forms exist for all persons, and it is possible to generate more precise person combinations by co-ordinating the basic pronouns. At this stage, it is not clear whether this is a productive process, or if there are only limited numbers of lexicalised pronoun combinations. Table 18.1 Danau personal pronouns

1st person 2nd person 3rd person

Singular

Plural

Plural inclusive

ō mə̀ ɐn

ē pɯ̄ khɯ̀

ē pɯ̄ mə̀ ‘all of us, including you’ pɯ̄ mə̀ ‘all of you’

1128

si

It is also possible to emphasise the inclusiveness of the basic plural pronouns by adding the reduplicated adjective mɛt ‘all’ as follows: ē-mɛt~mɛt-nì pɯ̄ -mɛt~mɛt-nì khɯ̀ -mɛt~mɛt-nì 1pl-all~red-prox 2pl-all~red-prox 3pl-all~red-prox ‘all of us’ ‘all of you’ ‘all of them’

Some aspects of the pragmatics of pronoun use are as yet poorly understood. During grammatical elicitation sessions, and also while transcribing longer texts, it became apparent that the 2nd and 3rd person basic pronouns could sometimes be used interchangeably. The first two examples below are from a grammar elicitation session, where two consultants insisted that the pronoun mə̀ (glossed as 2sg) could be used to indicate singular second and third persons. (93) tī mə̀ hand 2sg ‘your hand’/‘his/her hand’ (94) mə̀ su kəɲwɐ̀ n 2sg walk path ‘You are walking.’/ ‘S/he is walking.’

Other examples from longer spontaneous texts reveal a similar issue with the second and third person plural pronouns. The first two sentences use the pronoun pɯ̄ (glossed as 2pl), which is translated by the consultant as ‘you-plural’, while the third sentence uses the pronoun kʰɯ̀ (glossed as 3pl), but with the same translation. (95) nì=nə pɯ̄ bɪn kʰɐn.zɐ̀ nɯ̄ ŋ əcɨ̂ prox=top 2pl get feel able benefit ‘This is a benefit that you all can experience.’ (96) ɛ̀tʰɛ tə-pɯ̄ t ɲɐ̄ pɯ̄ =nə̀ pɯ̄ bɪ̃ mwì kɔʔ guest cond-come house 2pl=top 2pl get nourish neg.cop bɯ̄ de? rice rhet If a guest comes to your house, don’t you have to feed (him)?

1129

danau

(97) kʰɯ̀ dəkɐ dəkɐmɐ̀ pə- ɗɐ̄ ɛx lu pə-ɗɐ̄ ɛx tæ̃ ̂ ɛ 3pl man woman nfin-climb donate nfin-climb give.charity ‘You men and women visit (the monastery) to donate and give charity . . .’

Finally, the following sentence shows both singular and plural second person pronouns used in quick succession, to indicate what is presumably the same referent (i.e. the addressee in the context of this utterance). (98) pɐ pɯ̄ fro ple mə̀, mə̀ lə-kɐ̀ m father 2pl tell show 2sg 2sg neg-listen ‘Your father advises you, (but) you don’t listen.’

A small set of interrogative pronouns, including kətʰɐn ‘what’ and (nǝ)ŋ̄ .kwe ‘who’, and interrogative adverbs, dəmwɐ̀ ‘where’, kǝmɔ/ŋmɔ ‘when’, hodəmwɐ̀/ səmwɐ̀ ‘how’ exists in Danau. The use of these pronouns is described in section 4.1. As mentioned in section 3.3, some of these form the basis for negative imperative pronouns, through the addition of the suffix -mɐ̀ ‘none’. dəmwɐ̀ > dəmwɐ̀-mɐ̀ ‘nowhere’ (nǝ)ŋ̄ .kwe > (nǝ)ŋ̄ .kwe-mɐ̀ ‘no one’ kɐtʰɐn > kɐtʰɐ-mɐ̀ ‘nothing’ kǝmɔ/ŋmɔ > lanmɔ-mɐ̀ ‘never’ Two demonstratives are evident: nì ‘this’ (prox) and nɐ̄ ɔ ‘that’ (dist).

5.1.3

Measure and Quantity Words

Classifiers Danau numerical classifiers encode animacy and the shape of the entity in question, among other features. Common classifiers include: kwat (people, animals, birds) klɔ̃ ̀ (stick-like objects) ouʔ (books) souʔ (plants) ɲɐ̀ (houses) tʰun (clothes)

twɐn (vehicles) ǝrɛ̃ (pairs of shoes) lun (spherical objects) pʰǣx (one of a pair, e.g. ears) twɐ̀n (rocks) kʰrɐ̃ æ (flat objects)

1130

si

Numerals Danau has two numeral series—one used to count most objects, and another used to count money. The latter series is a closed set, and only has a limited number of lexemes corresponding to commonly-used denominations. Standard

1 2 3 4 5 6 7 8 9 10 11 12 15 20 25 50 100 200 1,000 10,000 100,000

ǝæ̃ ŭwi pɯn thôn tūn pɐt sæ̃ ɛ sɐn mcjɪn mcjɪn tɪʔ mcjɪn æ̃ mcjɪn thôn æ̃ cjɪn æ̃ cjɪn thôn thôn cjɪn əphjɐ̂ æ̃ phjɐ̂ əthɐ̃ɔ ətɐ̃ ɔ̂ əthẽî

Money

əwæx thon tûŋ əsɐ̀ t hɐ̄ khɐ̄m ǝswɛ̂ əkwɐ̂ n

5.1.4 Names and Terms of Address Danau personal names appear to be modelled closely on Burmese naming practices. Older consultants invariably had monosyllabic names such as tʃhũ, tɐ, thũ̂, shɐiʔ, tʃhɪ̃, sǝ̃,̂ thi, jɪ̃, whereas two-syllable names seemed to be more popular among younger people: tʃhɪʔ.swe, thũ̂.mi, and so on. This trend is evident from an inter-generational comparison of Burmese names as well. As in the rest of Myanmar, the names of monks of all ranks are polysyllabic Pali loan words, such as wǝrìjɐ̀ , turìjɐ̀ , and so on. Another speech practice in common

danau

1131

with other language communities in Myanmar is the dispreferred use of personal pronouns (and the preferred use of title+name combinations such as srɐ plɐ̂ ‘Teacher Pla’) in connection with people deemed worthy of respect. Special terms of address exist in a register used only when talking to monks. The most important in this respect is the first person pronoun khɯ̀ sū ē, used by lay people, and the terms of address dəgɐ and dəgɐmɐ̂ used by monks to address male and female lay people respectively. The small number of villages inhabited by Danau speakers have Burmese names that are rendered into Danau phonology when speaking to a purely Danau audience. An exception is Htinyugon village, which has a very different name in Danau: mpɐ̃ ɛ. Nearby town names, which are in both Burmese and Shan, have also been incorporated into the Danau lexicon in the same way. Aungban (town) > ŋbɐ̄ n/ŋbæ̃ ̂ ɛ Thaethit (village) > tī.tīt Heho (town) > hɐhō Htinyugon (village) > mpɐ̃ ɛ Taungbohla (village) > mpōlɐ, pɔ̄ lɐ̀ Chaunggya (village) > khɔ̄ khrɐ̄

5.2 Verbs Verbs are a distinct word class in Danau, by virtue of the fact that they take subject arguments, and inflectional prefixes such as the gerundiviser pǝ- and the conditional tǝ-. Verbs are not obligatory in Danau sentences, as verbless clauses are frequently used in spontaneous and elicited speech (see section 4.1). 5.2.1 Intransitive Verbs While intransitive verbs do not typically take an object argument, it is sometimes difficult to categorise certain verbs as transitive or intransitive. Similar to Burmese and Mon (and probably many other languages of the region), certain Danau verbs obligatorily take a dummy object noun that is semantically relevant to the action being described. These include: tʃʰen-un tʰwɐn-un su-kəɲwɐ̄ n pʰɐ-lɐɪk swim-water bathe-water walk-road read-text ‘swim’ ‘bathe’ ‘walk’ ‘read’

To classify these all the above verbs as transitive would be problematic as the object noun is lexified with the verb, and cannot be substituted without, at the very least, changing the general meaning of the verb to a more specific one. In the case of ‘swim’, ‘bathe’ and ‘walk’, it is not possible for any other object noun to replace ‘water’, ‘water’ and ‘road’ respectively, and are probably best analysed as intransitive verbs with (historically) incorporated object nouns. The

1132

si

last verb, pʰɐ-lɐɪk (read-text) on the other hand, should be analysed as transitive, as the incorporated noun can be substituted productively by other nouns. Thus, ‘read a newspaper’ would be pʰɐ tɐdɐ̀ unzɐ (read newspaper), and not *pʰɐ-lɐɪk tɐdɐ̀unzɐ. Following are some examples of the more typical intransitive verbs, which do not take an object argument: kɨ̂ ‘go’ tin ‘sleep’ kǝɲɐʔ ‘laugh’ pin ‘die’ rɛ ‘fall’ ɲæ ‘cry’

5.2.2 Transitive Verbs Transitive verbs take both subject and object arguments, with the canonical word order being AVP. Objects can be fronted to create an PAV word order, with the fronted element taking the topic marker =nǝ. The same applies to fronted oblique arguments, which are otherwise placed after the object in a canonical sentence. (99) bɯ̄ =nǝ ō swe kɔʔ rice=top 1sg eat neg.cop ‘Rice, I won’t eat.’ (100) tī nɐ̄=nǝ ɐn swe bɯ̄ hand coord=top 3sg eat rice ‘He is eating with his hand.’

5.2.3 Ditransitive Verbs These verbs can take two object arguments, namely direct and indirect. The canonical words order is A V IO DO, although the DO is often fronted, as follows: (101) ō lɛ̄ fro ple mə̀ mǝ̄nɐ̃ ɔ sʰoũ̠mɐ̀ 1sg also tell show 2sg speech advice ‘I will also give you advice.’ (102) nì=nǝ ō ɐ kɔʔ mǝ̀ prox=top 1sg give neg.cop 2sg ‘I won’t give this to you.’ (lit. ‘This, I won’t give to you.’)

5.2.4 Valency Changes The valency of verbs can be changed by numerous strategies, through the use of causative, applicative, reflexive and reciprocal constructions.

1133

danau

Causative (103) mɐj ɐn=nǝ sho kɨ̂ lùpē ŋbæ̃ ̂ ɛ mother 3sg=top caus go child pn ‘The mother sends the child to Aungban.’ (lit: His mother makes the child go to Aungban.)

Applicative (104) ɐn=nə ɐn deɪ̃ pə̄ ɐ ō tù ɲɯŋ jɪ̄n 3sg=top 3sg cook appl give 1sg curry meat chicken ‘He cooked chicken for me.’ (105) ō dẽɪn pə̄ ɐ ɐn tù ɲɯŋ jɪ̄ŋ 1sg cook appl give 3sg curry meat chicken ‘I cooked chicken for him.’

Reflexive and Reciprocal (106) ɐ-kwɐt nɐ̄ ɐ-kwɐt put jɐboʔ one-clf coord one-clf hit recp ‘They are hitting each other.’ (107) ɐn nɐ̄ ɐn jɐt mɛʔ mɐj 3sg coord 3sg look obl mirror ‘He is looking at himself in the mirror.’

5.3 Locational and Directional Words Danau has a closed set of prepositions to describe locational relations; these are listed below: kɔŋ(teiʔ) ‘inside’ lɐŋwɐ̄n ‘in front of’ kwe ‘on’ kūsɐ̄ ‘around’

ntʰɯn ‘outside’ ŋkwɐt ‘behind’ kɐt ‘under’ ko ‘beside’

The prepositional phrase can be either post-verbal or fronted. In the former situation, the oblique particle is placed before the preposition:

1134

si

(108) mə̀ tʰɯ̄ t mɛʔ kɐt souʔ-tʰē 3sg sit obl under tree-wood ‘He is sitting under a tree.’ (109) kwe ɲɐ̄ mə̀=nə sɐm tʰɯ̄ t mə̄nɔʔ on house 3sg=top bird sit stat ‘There is a bird sitting on his house.’

5.4 Particles 5.4.1 Co-ordination The coordinating particle nɐ̄ acts as an instrumental and comitative postpositional marker, as well as being able to co-ordinate noun phrases. (110) kombupɛʔkʰjɐŋ nɐ̄ kɨ̂ ŋbæ̃ ̂ ɛ friend coord go pn ‘I’m going to Aungban with friends.’ (111) ɐn swe bɯ̄ tī nɐ̄ 3sg eat rice hand coord ‘He is eating with his hand.’ (112) ō nɐ̄ mǝ̀ 1sg coord 2sg ‘you and I’

When co-ordinating noun phrases, nɐ̄ can also be used to express relationships such as distance and similarity between two entities. Here, nɐ̄ follows each NP being compared. (113) hɛ̄hō nɐ̄ ŋbæ̃ ̂ ɛ nɐ̄ kəmɛʔzɛ̄/kɐŋɐjɐu pn coord pn coord near/far ‘Heho is near to/ far from Aungban.’

The meaning ‘or’, in the context of co-ordinating NPs is expressed by the phrase kɐt kɔʔ lɛ̄. (114) mə̀ kɐt kɔʔ lɛ̄ ɐn 2sg true neg.cop also 3sg ‘you or s/he’

danau

1135

5.4.2 Discourse Particles In spontaneous discourse, a significant proportion (perhaps even the majority) of Danau utterances contain sentence-final discourse or epistemic particles that serve a number of purposes, such as highlighting the certainty or newness associated with a piece of information, asking rhetorical questions, or indicating strong emotional content. (115) frɐ̄ hɔ̄ pɯ̄ t sənə̀ nɛ̀ holy preach come like.this rep ‘The Buddha has been preaching thus (I am told).’ (116) nì=nǝ thɪn mǝ̀ dē prox=top belong 2sg tag ‘Isn’t this yours?’ (117) bēj bɪn rɐ̀ khɯ̀ zɛ̄ others get afraid 3pl newinf ‘Everyone was afraid of them.’ [new information] (118) ʤǝpæ̃ ɛ cwɐn phū mɛ̀ Japanese meet exp asrt ‘(Of course) I have met the Japanese!’ (119) frɐttǝrɐ̂, khɯ̀ =kɐ̄ ɐ̄ cjɐ̀ ŋ̄ kò bjɯ̄ damn 3pl=top strength plenty emot ‘Damn, they were strong!’

Most of these particles, except for the tag question dē, are now quite rare in the speech of younger Danau speakers. A far more common epistemic particle is kô, which also occurs sentence-finally, and serves the function of introducing new information to the addressee(s). (120) ɐn sɐ̀ ŋbæ̃ ̂ ɛ pɯ̄ t kô 3sg from pn come newinf ‘He has come from Aungban.’ [new information]

The politeness particle sè occurs after the main verb to make imperatives more polite. This is often accompanied by the sentence-final request particle nẽ, which adds a sense of insistence.

1136

si

(121) jɪ̄n sè mɐt kō~kō return pol name white~red ‘Come back with a pure name (in your next life)’ (122) thūt phjǝŋ sè tū nẽ put more pol curry req ‘Please have some more curry.’

5.4.3 Aspectual Particles A handful of aspectual particles have been documented in Danau so far. Certain particles occur in a post-verbal position (namely pɯ̄ , mɛ and tɯ̀ ), while the others occupy a pre-verbal slot (kʰɐ, æ̃ ̂ ɛ, pɐj). Two particles indicate different types of perfect aspect: the resultative perfective particle pɯ̄ indicates states attained in the past that are still in effect, while the punctual perfective indicates the completion of discrete, non-durative actions. (123) frɐ̄ pǝ-fro pɯ̄ mǝ̄nɐ̃ ɔ holy nfin-say res speech ‘what the Buddha has said (and still applies)’ (124) kɐt ɲɐ̄ nɐ̄ ɔ fræx pɯ̄ pǝte under house dist pile res peanut ‘There are peanuts piled under the house.’ (125) nɔǝt lɔ̄ phɐ̀ nì=nǝ pɛ pɯ̄ nǝ̀ŋ nǝ̀ŋ mind greed prox=top reject res win red ‘(You) have successfully rejected the greed of (your) minds . . .’ (126) mǝ̀ swe jɐ̄ ɔ tɯ̀ bɯ̄ jɐ̂ 2sg eat finish pfv rice q ‘Have you finished eating?’ (127) shī lè tɯ̀ sun appear pfv ‘The sun has come out.’ (128) ɐn kɨ̂ tɯ̀ ŋbæ̃ ̂ ɛ 3sg go pfv pn ‘He has gone to Aungban.’

danau

1137

Sometimes, the two perfect particles are chained together, presumably in situations where the speaker wishes to indicate interest in the (currently active) consequences of a past, non-durative event. In responding to the following question (along the lines of ‘Yes, he has’), the particle pɯ̄ can be optionally omitted: jɪ̄n (pɯ̄ ) tɯ̀ . (129) pɐ pɯ̄ hò ŋbæ̃ ̂ ɛ jɪ̄n pɯ̄ tɯ̀ ɐ̂ father 2pl from pn return res pfv q ‘Has your father returned from Aungban?’

The remaining particles that indicate imperfective aspect (ipfv), future aspect, progressive aspect and habitual aspect occur pre-verbally. (130) ɐn khɐ kɨ̂ kɔʔ ŋbæ̃ ̂ ɛ 3sg ipfv go neg.cop pn ‘He hasn’t gone to Aungban as yet.’ (131) ō æ̃ ̂ ɛ kɨ̂ təsʰɐ̃ i 1sg fut go shop ‘I will go to the shop.’ (132) ɐn tɐ̄m swe bɯ̄ 3sg prog eat rice ‘He is still eating.’ (133) pɐ ē=nǝ̀ gǝdo lɛ̂ pɐj ŋɔ̂ n kɔʔ, father 1pl=top alcohol also hab drink neg.cop shǝleiʔ lɛ̂ pɐj ɓit kɔʔ cigarette also hab smoke neg.cop ‘Our father neither drinks nor smokes.’

The difference between the pre-verbal future aspect marker æ̃ ̂ ɛ and the postverbal mɛ (a loanword from Burmese, which appears to serve the same function) is as yet unclear. Both particles, in isolation, give a future reading to an utterance, but they can also be optionally used together. Such a construction might indicate an inceptive aspect.

1138

si

(134) jɪ̄n mɛ return fut ‘I’ll return.’ [a common salutation when about to return home] (135) ō æ̃ ̂ ɛ kɨ̂ mɛ 1sg fut go fut ? ‘I’m about to go.’ (136) ō æ̃ ̂ ɛ swe mɛ bɯ̄ 1sg fut eat fut rice ? ‘I’m about to eat.’

5.4.4 The ‘Return’ Particle The post-verbal particle bwɐt is glossed below as a return particle, because it indicates the physical return of entities to a source location, or situations where emphasis is placed on the fact that an action was carried out by the subject/agent him/herself. It is different from the reflexive construction, which is described in section 5.2.4 (which uses the co-ordinating particle nɐ̄), and from the reflexive-like sense encoded by the stative copula mə̄nɔʔ (see section 4.1). (137) ŋmɔ æ̃ ̀ ɛ jɪ̄n bwɐt ŋbæ̃ ̂ ɛ ɐ̂ when fut return ret pn q ‘When will you return to Aungban?’ (138) ē pɐ-bɪn bwɐt pjɪʔsî ouʔsɐ rətənɐ 1pl abil-get ret thing object gem ‘We can get back (our) belongings and valuables.’ (139) nì=nǝ to ō jɔx bwɐt lɛ̄ bɪn mǝ̄nɔʔ prox=top self 1sg do ret also get stat ‘I can do this myself.’ (140) tu nì=nǝ to ō ɗɯŋ bwɐt hɔʔ curry prox=top self 1sg cook ret cop ‘I cooked this curry myself.’

5.4.5 Comparative Particle The comparative particle thǣx (not to be confused with tʰæx ‘life/age’ in the following examples) is used to construct comparative and superlative constructions.

1139

danau (141) ō=nǝ thæx kõ̂u thǣx mɛʔ ɐn 1sg=top age big compar obl 3sg ‘I am older than he.’ (142) ō thæx kõ̂u thǣx bēj 1sg age big compar other ‘I am the oldest.’

5.4.5 Oblique The oblique particle denotes oblique arguments and adjuncts, such as instrument, location, goal (temporal and physical), comparison and direction. (143) ɐn tin ljak mɛʔ lonì kô 3sg sleep till obl now newinf ‘He’s still asleep!’ [lit. ‘He is sleeping till now!’] (144) mə̀ tʰɯ̄ t mɛʔ kɔŋ ɲɐ̄ 2sg sit obl inside house ‘He is sitting in the house.’

5.4.6 Accidental Particle The post-verbal accidental particle mɛ̃j indicates that something happened inadvertently or without volition. In combination with the negative, it indicates an unintentional omission. (145) ō swe mɛ̃j bɯ̄ frɐ̄ 1sg eat nvol rice holy ‘I accidentally ate the rice (meant as an offering) for Buddha. (146) mə̀ pə-heiʔ ō=nə ō lə-on mɛ̃j kɔʔ 2sg nfin-call 1sg=top 1sg neg-hear nvol neg.cop ‘I didn’t hear you calling me.’

6

Glossed Text

The following is an extract from a Buddhist sermon delivered by U Wariya, the abbot of Taungbohla Monastery, on the 15th of October, 2012.

1140

si

Transcription and gloss: ē pə-lu pjɪʔsī ouʔsɐ dɐnɐ̀ nɐj ləŋ.ŋwɐ̀ n nì=nə 1pl nfin-donate things goods charity like morning prox=top tɐŋ~tɐŋ plɛə̀ŋ~plɛə̀ŋ bēj lɛ̄ tʰì.kʰək kɔʔ bēj lɛ̀ bɪn clean~red pure~red other add harm neg.cop other add get de kɔʔ bɪn fro kɔʔ tɐŋ~tɐŋ plɛə̀ŋ~plɛə̀ŋ ē say neg.cop get say neg.cop clean~red pure~red 1pl tə-lu ē tə-tæ̃ ɛ̀ sənə̀=nə̀ bēj heɪʔ dɐnɐ̀ kətʰɐn cond-donate 1pl cond-give like.this=top others call charity what ɐ̀? “ənùpəhɪsɐ̀.dɐnɐ̀ ”. kɐt jɐ̀ ? nəkʰɑɔ̃,̀ lô tɯŋkʰrɯ̄ ŋ ō zɛ̄ q pali.quote correct q next person some 1sg only lu hɔʔ, kʰɯ̀ mɐ̀ lu kɔʔ. tə-kɨ̀ de bēj donate cop 3pl without donate neg.cop cond-go say other sənə̀, tə-tʰì kʰɯʔ bēj sənə̀=nə̀ ənùpəhɪsɐ̀ .dɐnɐ̀ like.this cond-compete against other like.this=top pali-quote ̀ tə-lu nì=nə̀ mjɔx kɔʔ. kɐt jɐ̀ ? nəkʰɑɔ̃ prox=top exalt neg.cop correct q next cond-donate tə-de lɛ̀, ŋ̄ kwe-mɐ̀ lɐ-de-nɐʔ, ŋ̄ kwe-mɐ̀ lɐ-fro-nɐʔ cond-say add who-without neg-say-proh who-without neg-say-proh ŋ̄ kwe-mɐ̀ lɐ-sɔ̄ sɛ̄-nɐʔ. lu lɛ̄ tɐŋ~tɐŋ plɛ̀əŋ~plɛ̀əŋ. who-without neg-busybody-proh donate add clean~red pure~red ō lu nə̀ŋ səlɛ̀ ō kwē lu lɯ̀ ŋ ō tæ̃ ɛ̀ ŋə̀ŋ səlɛ̀ 1sg donate can even 1sg just donate will 1sg give can even ō kwē tæ̃ ɛ̀ lɯ̀ ŋ. tə-lu tʰɐnə̀ tʰɐlɛ̀ tɐŋ~tɐŋ 1sg just give will cond-donate like.that in.that.case clean~red

danau

1141

plɛ̀əŋ~plɛ̀əŋ tʰɐnə̀ tʰɐlɛ̀ ənùpəhɪsɐ̀ .dɐnɐ̀ nì=nə̀ əmjɑɔx pure~red like.that in.that.case pali.quote prox=top exaltation hɔʔ. ənùpəhɪsɐ̀.dɐnɐ̀ tə-mjɑɔx tʰɐlɛ̀ kəmɯ̄ lô tutɔkɐɔ̃,̀ cop pali.quote cond-exalt in.that.case like person virtuous əlu nɐ̄ɔ mæ̃ ɛ hɔʔ. charity dist true cop

Translation:

If we donate alms like this morning, (if we do it) with a pure mind, without harming anyone, without anyone having to tell you (to give), engaging in charity with a pure mind—what is that called? ənùpəhɪsɐ̀ dɐnɐ̀. Right? Next, some people are like this: “Only I am charitable. Other people never are!” If you go and say such things to others, if you compete with others like this, your actions will not be exalted as ənùpəhɪsɐ̀ dɐnɐ̀. Will they? Also, even if you decide to donate alms, don’t tell anybody, don’t be a busybody. If you want to be charitable, do it with a pure mind. If I want to perform charity, that’s just what I will do. Donating alms with a pure mind will be exalted as ənùpəhɪsɐ̀ dɐnɐ̀. That is proper charity.

7 Bibliography Luce, Gordon H. 1965. Danaw, a dying Austroasiatic language. Lingua 14: 98–129.

section 11 Khasian

․․

chapter 19

Standard Khasi K.S. Nagaraja1 1 Introduction The Khasis form a majority of the population of the eastern part of Meghalaya State (the Khasi and Jaintia hill districts) and especially the state capital Shillong, some 1,128,575 according to the 2001 census. ‘Meghalaya’ (meaning ‘Abode of Cloud’), is a small north-eastern state of 22,720 square kilometres bordered by Assam in the north and east and Bangladesh in the south and west. The majority of the state is covered by forest land that is home to a rich biodiversity. Geographically Meghalaya is an island of Austroasiatic languages surrounded by Tibeto-Burman languages such as Garo, Bodo, Kachari, Mikir etc. in the West, North and East. The term ‘Khasi’ stands for both the tribe and the language. According to Bareh (1969), a native scholar, the term ‘Khasi’ has a particular significance: it is said that kha means ‘born of’ and si refers to an ancient mother i.e. Khasi means ‘born of the mother’. The people of the Khasi tribe call themselves by the name of Ki Hynñiew trep ‘the seven huts’. Khynriam or Standard Khasi, spoken around Cherrapunji in the southern part of Meghalaya, is the language described in this sketch, and the term Khasi is used subsequently throughout the text to refer to this speech variety. Historically Khasi had no indigenous script, and Bengali script was used by the Serampore Baptist Mission when they stayed in Khasi hills from 1813 to 1833. However in 1841 the Welsh Calvinistic Mission under Thomas Jones initiated the process of writing Khasi in Roman script. The first book in Khasi to be printed was in 1842, was Ka Kot Pule Ba Nyngkong (The First Khasi Reader), at the Calcutta Press, followed by a booklet by the Welsh Missionary, entitled Rhoddman (‘My Mother’s gift’), and a booklet on Christian catechism Ka Kot Layphew and the translation of St. Matthew, all the above works by Thomas Jones. All these were written in the Cherrapunji variety of Khasi. The complete edition of the Bible came in 1899. Generally Khasi literature of the 19th century

1 Dr. Paul Sidwell’s personal interest and editorial skills have greatly enriched and enhanced the quality of this article. I am highly thankful to him.

© koninklijke brill nv, leiden, ��5 | doi ��.��63/9789004283572_025

1146

nagaraja

was mostly Christian, although books on different subjects began to come, e.g.: Pryse, W., An Introduction to Khasi Language (1855); Roberts, Rev. Hugh, AngloKhasi Dictionary (1878); Roberts, Rev. Hugh, A Grammar of the Khasi language (1891); U. Jeb Solomon, a Khasi himself, wrote The Reader’s Companion, an easy guide how to speak and write Khasi (1895) under 3 sections—grammar, phrases and dictionary (English-Khasi). For details on the history of Khasi literature refer to Bareh (1969, 1977, 1979). Since independence the spheres of Khasi language use has extended to writing books on all subjects and is used for other creative activities. Khasi language is recognized as the medium of instruction in the primary schools, used widely at lower levels of the judiciary while at higher levels English is used. It is also used as a medium of expression in the legislative assembly of Meghalaya state. According to the 1971 census it has a literacy rate of 29.49% as compared to the national average of 29.4%. Most local newspapers are written in Khasi while only a few are in English. Even in electronic media Khasi is widely used. Recently in film making—short tele-films, big and small screen movies, videographies (music and films)—Khasi is used either exclusively or with English. The present writer’s grammar of Khasi was published in 1985, previously there are two other significant grammars, one by Roberts (1891) and Rabel (1961), the latter based on a modern structural framework. Papers on Khasi phonetics/phonology are not extensive, they include: Henderson (1967, 1976a, 1989–1990, 1991). Papers on morphology and word formation include: Henderson (1976b), Nagaraja (1979, 1984). Various syntactic issues are discussed by: Heymann (1977), Subbarao & Temsen (2003), Temsen (2000). Expressive vocabulary and other lexical issues feature in: Rabel (1968), Rabel-Heymann (1976), Shanta (n.d.). Khasi-English and/or English-Khasi dictionaries include: Singh (1906, 1920), Bars (1973), Blah (2007), Kharwanlang (2010). Additionally a few native scholars, such as H.W. Sten, M.B. Jyrwa, B. War have written on various aspects of Khasi in Khasi. 2 Phonetics/Phonology 2.1 Word/Syllable Structure Khasi morphemes are predominantly monosyllabic, and the monosyllables can have the following syllabic structure: (C)(C)V(C). This means that the smallest syllable is simply V, although such normally occur with an initial glottal stop. Being predictable, this glottal stop is normally considered phonetic only, and is not written in the phonemic representation. Arguments can be advanced to treat glottal as phonemic, but these are not considered here. It is

Standard Khasi

1147

also noted that open syllables are infrequent, generally arising from the historical loss of final glottal stops—present glottal codas reflect earlier velar stop. Vowel length is contrastive in stressed syllables, although not in isolation from the consonants; final consonants are phonetically long after short vowels and short after long vowels (-VːC, -VCː). Word final open syllable vowels tend to be long, and are unmarked for length in this sketch. Initial consonant clusters are very common in Khasi, while in final position clusters are not possible. The attested initial clusters are enormously rich (see Henderson 1976 for a discussion), with apparent disregard for sonority sequencing, although sequences with the same place of articulation are not favoured. Apparent exceptions tn-, tr-, tl-, tʰn-, tʰr-, tʰl- don’t violate this tendency as the first member is dental and second member is alveolar. Also note although geminate sequences (-tt-, -nn-, -ll-, -pp-, -dd-, -mm-, -ŋŋ- and -ll-) are attested, these are regarded as occurring across syllable boundaries in the present analysis. Syllable initial clusters of three consonants are limited in number; mostly the third member is a frictionless continuant, and among the other segments usually one will be a stop. In four member consonant clusters (which are very few and occur in medial position only) one of the members will be a stop and another will be either a nasal or a palatal frictionless continuant. The phonological word can be complex, containing more than one syllable, typically due to affixation and/or compounding. As compounding is very productive, coupled with a process of contraction, multi-syllable words are possible; as many as five syllables have been counted although the pattern CVC.CVC predominates. There is not much in the way of morphophonemic change to report. There are a few cases of vowel deletion; particularly when the negative or future marker, ɨm and ɨn respectively, join the preceding pronominal endings. E.g.: u/ka/ki + ɨm/ɨn > um/kam/kim; un/kan/kin.

Also there are assimilations involving =n- in affixation. E.g.: pɨn-‘causative prefix’ + leʲt ‘to go’ > pɨlleʲt ‘cause to go’. 2.2 Phoneme Inventory Phonemic inventory: The phonemes of Khasi are tabulated below. The IPA values given to the phonemes give an approximate indication of the phonetic value in most cases, the main discrepancy is that the vowels e, o are somewhat more open [e̞~ɛ, o̞ ~ɔ] than our phonemisization would indicate. A fuller description of the phonetics of Khasi vowels is found in Rabel (1961).

1148

nagaraja

Vowels i e

ɨ

a

u o

iː eː

aː

uː oː

The question of vowel length is somewhat problematic. Older dictionaries (e.g. Roberts 1875) mark long vowels, main with an acute accent, while contemporary practice is not to mark length directly. However, orthography still gives partial indications; final short stops are written with ‘voiced’ symbols b, d, j in long syllables. Orthographic ie, uo reflect [eː ~ ɪe], [oː ~ʊo] respectively. Sequences Vi#, Vu# indicate Vj#, Vw#, while ViC, VuC indicate monophthongs before final laminals (see discussion below). Orthographic ua, ia (uá, iá in older sources) indicate waː, jaː. Phonologically the one true diphthong is /ia/, e.g.: dŋiaŋ ‘lively’, kʰriak ‘sparkling’. The short /ɨ/ (orthographic y) marks the juncture within initial consonant clusters of which the second element is neither glottal stop nor a frictionless continuant. Consonants: initials

p pʰ b bʱ m w

t tʰ d dʱ n r, l s

ʤ ʤʱ ɲ j ʃ

k kʰ

ŋ h

Consonants: finals p m w

t n r

ʲt ɲ j

(ʔ)

k ŋ

ʔ

The final written here as /-ʲt/ is controversial. Orthographically it, id (e.g. dait ‘to bite’, sŋaid ‘fat, grease’, eit ‘excrement’ s’aid ‘hot, warm’) these are usually treated as diphthongal vowels followed by a dental, but articulatory studies are lacking. Historically these reflect final palatal stops following phonemic monophthongs (cf. Khasi eit with Khmer ʔac ‘excrement’, Khasi ʼnier-wait ‘entrails’ with Riang-Lang vac ‘bowel, entrails’ etc.). The synchronic analysis of /-ʲt/ here as a unitary phoneme permits a more economical account of the vocalism.

1149

Standard Khasi

The aspirated coronals /tʰ/ and /dʱ/ are phonetically dental; /ʤ/ and /ʤʱ/ are phonetically affricates. Interestingly Standard Khasi does not have voiceless affricates, although /ʃ/ is common and contrasts with /s/. The stops are usually unreleased in final position. Among nasals, /ɲ/ and /ŋ/ do not occur as frequently as /m/ and /n/ in initial position. Fricatives do not occur in final position, though in borrowed words alveolar fricative occurs in that position. The lateral seldom occurs finally (except in borrowed words). The Khasi orthographic symbols and their commonest phonetic equivalents are tabled as follows: Phon./Orth.

Phon./Orth.

Phon./Orth.

Phon./Orth.

Phon./Orth.

Phon./Orth.

i, iː / i e, eː / e ɨ/y a, aː / a o, oː / o u, uː / u

p pʰ / ph b bʱ / bh m/m w/w

p-/-p/b tʰ / th b dʱ / dh n/n rl/rl

t / t-,-t/d ʃ / sh d/d ʤʱ / jh ɲ/ñ j / y/i/ï

-ʲt / -it/-id kʰ / kh ʤ/j

k/k -ʔ / -h

ŋ / ng

h- / h-

There are many words where there is no standard spelling. Various letters are used for the single sound. For instance: (1) iong~ïong ‘black’ rïmpey~rumpei ‘hearth’ shuwa~shwa ‘before’ ioh~ïoh ‘get’ sïngkai~synkai ‘lip’ siep~sieb ‘engross’ it~ïit ‘glass’ shïlyah~shilliah ‘mat’

3

Word Formation

Word structure is quite complex in this language; while the simplest words monosyllabic and disyllabic free forms, affixation and compounding account for a huge number of complex forms, especially derivational and phrasal reduction forms. Such complex forms are readily identified as single words on morpho-syntactic grounds; nouns are readily defined by the fact that they take obligatory pronominal enclitics, and in both nominal and verbal compounds

1150

nagaraja

the components are frequently phonologically reduced variants of their free forms (by reduction in initials). 3.1 Compounding Khasi is very rich in compounding; for a more extensive discussion of compounding in the language see Nagaraja (1985, 2000). Compounds are typically left-headed with two or three constituents—they can be regarded as endocentric compounds. Khasi also has a few exocentric compounds whose derivation is not so transparent; they may also be called unique compounds, having one of the members which is unique or near unique. Generally it is straightforward to identify sequences of lexical roots as compound nouns on morpho-syntactic grounds; Khasi has a system of obligatory pronominal subject proclitic gender markers (usually referred to as ‘articles’ in Khasi grammars, see discussion under §5 Word Classes) which permits unambiguous identification of nouns. Verbal compounds also occur, and these are recognized as compounds on the basis that compounding is an otherwise strong feature of the language, rather than make a special argument for multiverb predication. At the level of compound, phonological contraction may take place. By and large this takes place when the nouns occur as heads in compounds, and then contracted forms are preferred. The phonological conditions for contraction are that the roots should be either (a) monosyllabic with an initial consonant cluster; or (b) disyllabic, with an intial syllable CCV or CVC. E.g.: briew ‘person’ > riew in riew-bʱa ‘good person’; masi ‘ox’ > si in si-kɨrtoŋ ‘bull’, sɨntiew ‘flower’ > tiew in tiew-kulaːp ‘rose flower’, kʰmat ‘eye’ > mat in ɲuoʔ-mat ‘eyelash’ etc. There are also various roots that fulfill the above-mentioned conditions but do not undergo contraction, such as thliew ‘hole’, kʰmut ‘nose’, and others (contrast the ‘nose’ example with ‘eye’ immediately above). Why some particular roots are not contracted it is not clear. 3.1.1 Nominal Compounds There are many endocentric nominal compounds, the first member is the head and the second one is the attribute, three types discussed here are noun=noun, noun-adjective, and noun-verb. In each case these can be treated as reduced forms of full phrasal constructions, with the output taking the proclitic gender marking it as a single nominal constituent. Various examples are given below with phrasal equivalents given for comparison/analysis.

1151

Standard Khasi

Noun=noun compounds From the following illustrations it will be observed that the pronominal marker of the first member is also the pronominal marker of the compound (this is generally true although there can be exceptions). At the phrasal level the constituent nouns may be primitive or themselves derived. The general pattern is that prepositions which control the attributes are deleted along with their pronominal proclitics. Examples: (2)

Compounds ka=um-mat f=water-eye ‘tear’

Phrasal equivalents ka=um ʤoŋ ka=kʰmat f=water poss f=eye ‘water of the eye’

(3) u=snep-sɨmbaːj u=snep ʤoŋ u=sɨmbaːj m=cover-seed m=cover poss m=seed ‘cover (seed)’ ‘cover of a seed’ (4) ka=buʲt-sɨnʃaːr f=cunning-govern ‘policy’

ka=buʲt ʤoŋ ka=ʤiŋ-sɨnʃaːr f=cunning poss f=nml-govern ‘tactics of the ruling’

(5) ka=miʔ-ŋi f=rise-sun ‘east’

ka=ʤiŋ-miʔ ʤoŋ ka=sŋi f= nml-rise poss f=sun ‘rising of the sun’

(6) ka=jeːʲt-ri ka=noŋ -jeːʲt ʤoŋ ka=ri f=love-country f=agt-love poss f=country ‘patriot’ ‘lover of the country’ (7) ka=baːm-kʰana f=eat=narrate ‘picnic’

ka=ʤiŋ-baːm baːt ka=ʤiŋ-kʰana f=nml-eat com f=nml=narrate ‘eating and narrating’

Noun-verb/adjective compounds Noun-verb and noun-adjective compounds are analysed as reduced relative clause constructions (adjectives are treated as a sub-class of verbs, see §5 Word Classes). Examples:

1152

nagaraja

(8) u=laŋ-jap u=phlaŋ (u) m=grass-die m=grass (3sg.m) ‘straw’ ‘grass (that) which dead’

ba jap comp die

(9) u=riew-bʱa m=man-good ‘good man’

ba bʱa comp good

u=briew (u) m=man (3sg.m) ‘man (he) who is good’

There are instances of reversed word order: (10) ka=lɨŋkot-kʰmut ka=kʰmut (ka) f=short=nose f=nose (3sg.f) ‘short nose’ ‘nose which (is) short’

ba comp

lɨŋkot short

In this type of compounds also, change of word order is possible: (11) ka=ot-lɨŋkʰot f=cut-piece ‘chip’

ka=lɨŋkʰot (ka) f=piece (3sg.f) ‘piece (that) which is cut’

ba ot comp cut

3.1.2 Verbal Compounds Verb compounds are those in which the verb is the head and other members are attributes. The following combinations are possible in compounds of this type: Verb-verb, verb-adjective, verb=noun, and verb-adverb. As with nominal compounds, prepositions and pronominal proclitic gender markers are dropped from the phrasal forms to create compounds, although in the case of verb-adverb compounds it is simply the absence of a pause. Verb-verb compounds The verb-verb compound consists of the head verb with an infinitive verb base. In the phrasal equivalents the verbs are linked by the ban infinitizer or baːt ‘and/with (com)’. Examples: (12) jaːʲt-waːt walk-search ‘to walk searching’

jaːʲt ban waːt walk inf search ‘to walk in order to search’

(13) ʃaʔ-leʲt allow-go ‘to let go’

ʃaʔ ban leʲt allow inf go ‘to allow to go’

1153

Standard Khasi (14) wan-raʔ come-carry ‘to bring’

wan baːt come com ‘come and carry’

raʔ carry

Verb-adjective compounds Verb-adjective compounds have the verb as the head and the adjective as the attribute. The adjectives that can occur with nouns can also occur with verbs. In periphrasis they are linked with the conjunction ba (comp), which is dropped in the formation of compound forms. Examples: (15) leʔ-bʱa do-good ‘to treat well’

leʔ ka ba bʱa do f comp good ‘to do which is good’

Verb=noun compounds (noun incorporation) In verb=noun compounds, or verbs with incorporated objects, are derived from transitive and oblique verb. Examples: (16) thiet-kot thiet ja ka=kot buy-book buy acc f=book ‘to purchase (a) book’ ‘to purchase a book’ (17) jaːʲt-lɨnti walk-path ‘to travel’

jaːʲt ha ka=lɨnti walk loc f=path ‘to walk on the path/way’

3.1.3 Multi-member Compounds More complicated compounded forms are created by the multiple concatenation of free forms, or even the combination of free forms with established compounds, exemplified below: (18) a) koːr-thaːɲ-ʤaːɲ b) laŋ-soʔ-moʔ machine-weave-cloth goat-hang-chin ‘loom’ ‘full grown goat’ c) sop-ti > sopti-kti d) baːm-briew > la-baːmbriew cover-hand dress-hand eat-man tiger-man.eater ‘dress’ ‘blouse’ ‘man-eater’ ‘tiger’

1154

nagaraja

e) diʔ-duma > loːr-diʔduma drink-smoke star-smoke ‘to smoke’ ‘comet’

3.1.4 Compounds with Unique or Near Unique Constituents Compounds of this type are those in which one of the constituents in two member compounds is free and identifiable, while the other constituent is unique or near unique with restricted distribution. The bound constituent may be the first member or the second. Some examples: (19) a) dew-bilat (khɨndew ‘soil’)2 b) buːt-luːm (luːm ‘hill’) c) khot-rieʔ (khot ‘to call’) soil-X X-hill call-X ‘cement’ ‘backbone’ ‘to beckon’ d) dieŋ-duʔ (kdieŋ ‘wood’) tree-X ‘walking stick’

e) ir-mat (khmat ‘eye’) f) ot-ʃeːr (ot ‘to cut’) X-eye cut-X ‘eye lid’ ‘circumcise’

Multiple member compounds exist, some examples: (20) a) um-phnjaŋ-ʃarak b) ɲjaŋ-lar-tham water-X-lamp insect-X-crab ‘wick’ ‘scorpion’ c) sma-ʔiew-tuŋ d) pɨn-hjar-mɨnsiem-set smell-smell-X caus-descend-breath-X ‘to stink’ ‘to snore’

3.2 Derivational Affixation Derivation primarily involves prefixation. Infixation is evident in vestigial elements only, indicating that it was historically a productive process. Suffixation does exist, but only marginally in respect of certain numeral forms (see below). 3.2.1 Deriving Nouns As noted above, there are many disyllabic nouns, and potentially these could be analysed as containing prefixes but synchronically there is no basis to 2 The second part of the compound bilat probably refers to England, as Bilat (Persian velāyat ‘district, dominion’, ultimately from Arabic wilāyat ‘district, dominion’) is widely used in British India for England or Great Britain. Identical compounds are also found, for example in Burmese bílaʔ-mye ‘cement’ and Mon tɑəʔ-pìlàt ‘cement’, both literally ‘British soil’.

1155

Standard Khasi

regard them as derived words (see: Nagaraja 2000–01). The important productive prefixes are noŋ- (with agentive meaning) and ʤiŋ- (with both concrete and abstract meaning), both glossed as nml, and which derive nouns from verbs. Note also that the nasals can assimilate the place of the root initial when it is labial. Examples: (21) trej ‘to work’ baːm ‘to eat’ trej ‘to work’ phoŋ ‘to wear’ taʔ ‘to season’

> > > > >

(u/ka) noŋ-trej ‘(male/female) worker’ (u/ka) noŋ-baːm ‘(male/female) eater’ ʤin-trej ‘the working’ ʤim-phoŋ ‘vest like garment’ ʤin-taʔ ‘curry’

3.2.2 Deriving Verbs Several verb prefixes are evident: sam- ‘inclinative marker’, ja- ‘reciprocal/ collective marker’, and pɨn- ‘causative marker’ (note assimilation of n > l in the final example). E.g.: (22) Inclinatives: thjaʔ ‘to sleep’: samthjaʔ ‘to feel sleepy, drowsy’ ʤuŋ ‘to urinate’: samʤuŋ ‘to have the urge to pass urine’ eʲt ‘to pass stool’: sameʲt ‘to have the urge to pass stool’ (23) Reciprocals/collectives: leʲt ‘to go’: jaleʲt ‘to go together’ lam ‘to carry’: jalam ‘to conduct, to lead’ kren ‘to speak’: jakren ‘to talk’ (together) (24) Causatives: jap ‘to die’: pɨnjap ‘to kill’ baːm ‘to eat’: pɨnbaːm ‘to feed’ laʲt ‘to escape’: pɨllaʲt. ‘to set free’

3.2.3 Deriving Adjectives Several processes are noted for Adjectives (see §5.2.7 for the verb-adjective distinction):

• •

the comparative degree is achieved with the prefix kʰam- (note: the superlative uses the addition of the adverb tam ‘more, excess’ after the adjective); with the colour and taste terms, considered adverbs in their unmarked forms, ‘-ish’ forms are created with prefixes bɨr-, lam- (it is not clear if there is a difference in meaning). E.g.:

1156

nagaraja

(25) bɨr-ʤew ‘sourish’ bɨr-lieʔ / lam-lieʔ ‘whitish’

bɨr-thjaŋ ‘sweetish’ bɨr-saːw / lam-saːw ‘reddish’

bɨr-kthaŋ ‘bitterish’

3.2.4 Deriving Adverbs Many adverbial forms are derived by prefixing, compounding and reduplication, in particular refer to the various discussions under §3.3, §5.3 and §5.4. 3.3 Reduplication In Khasi reduplication is a productive morphological process. It involves repetition of the base or stem, which can be characterized into three types: 1. 2. 3.

Simple reduplication, wherein if the base is X, the reduplicated form will be X-X, Reduplication with prefixation, wherein, if the base is X, the reduplicated form will be X-P-X, where P stands for a prefix. Reduplication with internal change, wherein if the base is X, the reduplicated form will be X-X’. Here X’ stands for X with some internal change, which may be consonantal or vocalic. In some forms the change may involve a whole syllable.

3.3.1 Simple Reduplication By and large most verbs can be reduplicated. The senses conveyed can be grouped into two types: duration and causation: (26) her ‘to fly’: her~her ‘continue to fly’ kʰlik ‘to flicker’: kʰlik~kʰlik ‘continue to flicker’ bɨnrap ‘to collect, gather’: bɨnrap~bɨnrap ‘ask someone to put’ (things one after another) haːp ‘to fall’: haːp~haːp ‘ask somebody to fall’

Adverbs are reduplicated extensively, especially manner adverbs, and the effect is intensification: (27) suki ‘slowly’: suki~suki ‘very slowly’

kdek ‘loudly’: kdek~kdek ‘very loudly’

Adjectives: These are generally not reduplicated, although two examples have been noted: (28) rim ‘old’: rim~rim ‘very old’

mlun ‘fat’: mlun~mlun ‘fat’.

Standard Khasi

1157

Intensifiers: Intensifiers can be reduplicated when used with colour adjectives; quite often, finer shades of meaning are also indicated by these intensifiers while the reduplicated forms carry the more general intensive meaning. Examples: (29) joŋ ‘black’: joŋ-plaɲ ‘jet black’: joŋ-plaɲ~plaɲ ‘very black’ saw ‘red’: saw-haɲ ‘crimson’: saw-haɲ~haɲ ‘very very red’ stem ‘yellow’: stem-(k)mer ‘very yellow’: stem-(k)mer~(k)mer ‘deep yellow’

3.3.2 Reduplication with Prefixation There are many examples of prefixed forms being reduplicated, and the general pattern is that the prefix is not repeated but is retained on the repeated part, thus it is medial in the construction. Examples involving some 10 different prefixes are discussed by Nagaraja (2000–01), just two are exemplified here: (30) ʃiː Although frequent, it does not provide a single sense: ter ‘in order’: ter~ter ‘consequently, in order’: ter-ʃi-ter ‘in a series one after another’. sor ‘slowly’: so~-sor ‘very slowly’: sor-ʃi-sor ‘slowly but surely’, ka=kʰep ‘occasion, time’ (n): ka=kʰep-ʃi-kʰep ‘different occasions’ (31) pa-: ‘continuity or degree of repetition’: pat ‘to fall’ (objects): pat-pa-pat ‘fall down intermittently’ kɨllon ‘to decline, fall’: kɨllon-pa-kɨllon ‘to fall repeatedly’

3.4 Expressive Vowel Alternation Expressive vowel alternation is remarkably pervasive. Generally oppositions are binary; there is a clear-cut distinction between [i/e/a/o/u]. [u] indicates ‘big size’, [a] and [o] indicate small size; [e] indicates largeness; sometimes smallness also. Others indicate states ‘in between sizes’. From the available forms it becomes clear that those which use [i] to mark a distinction is very rare. In binary oppositions, three combinations have been observed. Among them the second type is more common than others: small vs. big [a] [e] [o] [u] [o] [e]

1158

nagaraja

Some examples of these: (32) [a]–[e]: hap ‘open’ (mouth-small): hep ‘open’ (mouth-big) khuːʲt sɨŋlan ‘completely bald’ (small): khuːʲt sɨŋlen ‘completely bald’ (big) (33) [o]–[u]: hop ‘open’ (mouth-small one): hup ‘open’ (mouth-widely) ldoj ‘cling’ (dangling- small thing): lduj ‘cling’ (dangling- big thing) (34) [a]–[u]: phlak ‘hit’ (with a small thing): phluk ‘hit’ (with a big thing)

And three or more grades affecting the same base: (35) a) ap dŋaŋ ‘wait for- to get something’ b) hap ‘open’ (the mouth) (small) ap dŋeŋ ‘wait for- an opportunity’ hep ‘open’ (the mouth) (big) ap dŋoŋ ‘wait for- lest anything happens’ hop ‘open’ (the mouth) (small) hup ‘open’ (the mouth) (very large or big)

There is also expressive vowel alternation among reduplicated forms; instances of four or even five vowels in reduplicated forms exist, for example: (36) u jaːʲt thar~thar ‘he walks rapidly’ (of an average person) u jaːʲt tʰer~ther ‘he walks rapidly’ (of a bigger person- perhaps of one wearing loose garments) u jaːʲt tʰir~thir ‘he walks rapidly’ (of a slim, nimble person) u jaːʲt thor~thor ‘he walks rapidly’ (of one so light that he appears to be blown about by the wind) u jaːʲt thur~thur ‘he walks rapidly’ (of a heavy, clumsy person)

4

Phrase and Clause Structure

4.1 Simple Clauses The basic word order of this language is SV/AVP. This applies to statements, commands and questions—the latter are formed by placing in interrogative at first or last clause position (and may function as subjects or objects; the same forms are used as relative pronouns, see §5.1.2). Within a clause the VP may be preceded by an NP which functions as the subject of the clause. There is cross-reference between the head of the subject NP and the VP, as the pronominal proclitic gender marker—formally equiva-

1159

Standard Khasi

lent to the 3rd person free pronouns—is repeated and placed before the VP, unless the NP is a free pronoun in which case it is not repeated. A single object is optionally marked for case with a preposition, and at least one argument must be so marked if there are multiple objects. Tense/Aspect markers come before the verb, immediately after (or bound to) the agreement marking pronoun if it is present. A clause may consist at its simplest of just a VP; and at the most complex may consist of the following structures:

• • •

(NP) VP (manner-phrase) (obl-phrase) (Adv-phrase) (loc/Time-phrase) (NP) VP (manner-phrase) (obl-phrase) (Adv-phrase) (Adv-phrase) (NP) VP (acc-phrase) (obl-phrase) Some examples of simple sentences: (37) ki daŋ baːm (38) ka=khɨnnaʔ ka=n 3pl prog eat f=girl f=fut ‘they are eating’ ‘the girl will go’

leʲt go

(39) u=ksew u baːm doʔ (40) u=briew u la wan m=dog 3sg.m eat flesh m=man 3sg.m pst come ‘the dog eats flesh’ ‘the man came’

Objects can be fronted, and Prepositional Phrases, principally as obliques or adverbials, may follow the VP or be brought to the clause initial position pragmatically. There may be many PPs in a single clause and in that case temporal phrases typically occur clause initially. Examples: (41) u=kʰla ki la pɨnjap m=tiger 3pl pst kill ‘A/the tiger they killed’ (‘They killed a/the tiger’) (42) ki=briew ki daŋ jaʃoʔ da ki=waʲtlam pl=person 3pl prog fight with pl=swords ‘the people are (still) fighting with swords’ (43) na u=bneŋ u=klew u la hjar from m=sky m=peacock 3sg.m pst descend ‘from heaven the peacock descended’

1160

nagaraja

(44) ha ka=sŋiʃuti ha ka=pʰra-baʤe ŋa la aːj ja ka=kot loc f=Sunday loc f=eight-clock 1sg pst give acc f=book da ka=kti ʤoŋ ŋa ha u ha ka=ijeŋ ʤoŋ ŋa with f=hand poss 1sg loc he loc house poss 1sg ‘On Sunday at 8 o’clock I gave the book with my hand to him in my house’

Imperatives require no special marking and a bare verb will suffice for a command, additionally these can be negated or further modified adverbially or a request can be softened with, for example, sŋewbʱa ‘please’ (