Sophie's Gallery
Getting ready to take her first ballet exam at 5;3 |
Picking flowers in a garden in Provence, France at 5;5 |
Sophie's Language Background
The corpus of Sophie's bilingual development is the second installment of the Hong Kong Bilingual Child Language Corpus. Born on 28 February 1996, Sophie is the younger sister of Timmy, the first bilingual subject to be included in the Hong Kong bilingual corpus, born two years and nine months earlier (a younger sister was born when she was 4;03). Sophie's mother is a native speaker of Hong Kong Cantonese and her father of British English, and her exposure to Cantonese and English started from birth. The one parent-one language principle was adopted in principle, especially when addressing the child, but code-mixing occurred when the parents conversed with each other, which formed part of the child's input. Apart from parental input, interaction with her brother took place in both Cantonese and English. She was regularly video-taped and audio-taped by two research assistants in each recording session, one responsible for each language, from 28 August 1997 to 28 February 2000 (1;06 - 4;00) on a weekly basis. In each recording session one research assistant interacted with the child for approximately half an hour in English and the other for half an hour in Cantonese. The corpus as initially released covers transcriptions from age 1;06 up to age 3;00.09, on an approximately biweekly basis. Sophie lived in Hong Kong continuously throughout the period of recording. She did not take her first trip abroad (to Australia) until 4;04. She was cared for primarily by her maternal grandmother who spoke Cantonese and ChiuChow and a Filipino domestic helper, Belma, who spoke English and some Cantonese. She started attending a local Chinese kindergarten at 2;6 in the morning and in addition, attended an English-speaking kindergarten in the afternoon from 3;02. She continued to attend both schools until 5;01. The different kindergartens are each monolingual in the respective language. While the circumstances are similar overall to those prevailing in Timmy's case, Sophie's different personality and character lead to differences in the data. While her brother was reserved and passive, she was typically lively and talkative in recording sessions, even becoming argumentative as she grew older. In addition, being cared for primarily by her grandmother and remaining in Hong Kong exclusively during her preschool years means that the predominance of Cantonese input is even greater in her case than in Timmy's. This is reflected in the fact that while Cantonese recordings are available from age 1;06.00, she only began to use English productively at around 2. While in many respects her development recapitulates that described for Timmy (such as wh in situ, null objects and prenominal relative clauses: see Yip & Matthews 2000), her English also shows some forms of transfer which are not evident in Timmy, such as extension of the verb give to permissive and even passive usages. Since her grandmother speaks Chaoyang (Chaozhou) dialect as well as Cantonese, Sophie developed some passive knowledge of this dialect. She learnt that producing occasional phrases in Chaozhou was a source of amusement, but did not produce full sentences. There is also the possibility of syntactic influence from Chaozhou, for example in the ordering of double objects. The parents kept a diary of Sophie's utterances to supplement the audio and video-recording data. The diary continued beyond age 4, when regular recording ceased, in order to follow up some of the features. The format of the English and Cantonese data is as described for Timmy in the first installment of the Hong Kong corpus: the grammatical category labels for the English corpus are based on the MOR grammars for English in the CHILDES Windows Tools, while the Cantonese data were tagged using a program developed by Lawrence Cheung on the basis of the grammatical categories used in the Hong Kong Cantonese child language corpus (Cancorp) created by Lee et. al. (1996), which contains eight monolingual Cantonese speaking children's data from 1;05 - 3;08. There are three tiers with the main tier showing Cantonese in the JyutPing romanization, with Chinese characters and grammatical categories shown on separate tiers. A total of three sample audio-linked transcripts and two video-linked transcripts featuring Sophie's speech at 2;05 are available for access via the CHILDES archive. The three audio-linked transcripts feature Cantonese and English as well as some Cantonese-English code-switching. Two of the audio-linked transcripts have video-linked counterparts. The shorter of these transcripts has a video-linked counterpart, with a sound track that is less clear than in the audio-linked one. In the shorter video excerpt (3:00) sound quality may be improved by adjusting the balance to turn down the right channel. The video-linked files feature each language respectively as the base language as well as code-switching in the longer one (4:13).
|
Inventory of Sophie's Files
There is a total of 80 files, half in Cantonese and half in English and there are two files for the same date since they were recorded on the same day. Though there seems to be a perfect symmetry in terms of the files in each language, it should be noted that in the early English files before Sophie turned 2, she did not yet speak English fluently despite the investigators' elicitation in English. The file name is made up of Sophie's initial S, followed by the initial that stands for the language, either c for Cantonese or e for English, followed by the year, month and date of recording.e.g. Sc970828 refers to the Cantonese file containing the recording made in the year 1997, August 28 and Se970828 refers to the English file for the recording made on the same date. Thus each of the 80 files has a unique file name.
File no.
|
File name (Scyymmdd)
|
Fil no.
|
File name (Seyymmdd)
|
Age of CHI
|
1.
|
Sc970828
|
41.
|
Se970828
|
1;06.00
|
2.
|
Sc970911
|
42.
|
Se970911
|
1;06.14
|
3.
|
Sc970925
|
43.
|
Se970925
|
1;06.28
|
4.
|
Sc971016
|
44.
|
Se971016
|
1;07.18
|
5.
|
Sc971030
|
45.
|
Se971030
|
1;08.02
|
6.
|
Sc971113
|
46.
|
Se971113
|
1;08.16
|
7.
|
Sc971127
|
47.
|
Se971127
|
1;08.30
|
8.
|
Sc971218
|
48.
|
Se971218
|
1;09.20
|
9.
|
Sc971230
|
49.
|
Se971230
|
1;10.02
|
10.
|
Sc980114
|
50.
|
Se980114
|
1;10.17
|
11.
|
Sc980205
|
51.
|
Se980205
|
1;11.08
|
12.
|
Sc980219
|
52.
|
Se980219
|
1;11.22
|
13.
|
Sc980305
|
53.
|
Se980305
|
2;00.07
|
14.
|
Sc980318
|
54.
|
Se980318
|
2;00.20
|
15.
|
Sc980403
|
55.
|
Se980403
|
2;01.06
|
16.
|
Sc980417
|
56.
|
Se980417
|
2;01.20
|
17.
|
Sc980501
|
57.
|
Se980501
|
2;02.01
|
18.
|
Sc980514
|
58.
|
Se980514
|
2;02.14
|
19.
|
Sc980529
|
59.
|
Se980529
|
2;03.01
|
20.
|
Sc980611
|
60.
|
Se980611
|
2;03.13
|
21.
|
Sc980622
|
61.
|
Se980622
|
2;03.24
|
22.
|
Sc980716
|
62.
|
Se980716
|
2;04.18
|
23.
|
Sc980724
|
63.
|
Se980724
|
2;04.26
|
24.
|
Sc980730
|
64.
|
Se980730
|
2;05.02
|
25.
|
Sc980813
|
65.
|
Se980813
|
2;05.16
|
26.
|
Sc980827
|
66.
|
Se980827
|
2;05.30
|
27.
|
Sc980909
|
67.
|
Se980909
|
2;06.12
|
28.
|
Sc980929
|
68.
|
Se980929
|
2;07.01
|
29.
|
Sc981008
|
69.
|
Se981008
|
2;07.10
|
30.
|
Sc981022
|
70.
|
Se981022
|
2;07.24
|
31.
|
Sc981105
|
71.
|
Se981105
|
2;08.07
|
32.
|
Sc981119
|
72.
|
Se981119
|
2;08.21
|
33.
|
Sc981203
|
73.
|
Se981203
|
2;09.05
|
34.
|
Sc981222
|
74.
|
Se981222
|
2;09.24
|
35.
|
Sc990107
|
75.
|
Se990107
|
2;10.10
|
36.
|
Sc990121
|
76.
|
Se990121
|
2;10.24
|
37.
|
Sc990202
|
77.
|
Se990202
|
2;11.05
|
38.
|
Sc990215
|
78.
|
Se990215
|
2;11.18
|
39.
|
Sc990302
|
79.
|
Se990302
|
3;00.02
|
40.
|
Sc990309
|
80.
|
Se990309
|
3;00.09
|