-
Notifications
You must be signed in to change notification settings - Fork 55
Description
meeting element
- extend meeting elements (
#parla.term,#parla.sitting)
I haven't found any information about terms or sitting in the meeting elements. This is how other corpora implement it:
ParlaMint/Data/ParlaMint-UA/ParlaMint-UA_2014-12-02-m0.xml
Lines 11 to 13 in 197e5ec
| <meeting ana="#parla.term #parla.uni" n="8" corresp="#ВРУ">8</meeting> | |
| <meeting ana="#parla.session #parla.uni" n="1" corresp="#ВРУ">1</meeting> | |
| <meeting ana="#parla.sitting #parla.uni" n="2014-12-02" corresp="#ВРУ">2014-12-02</meeting> |
I was not able to find term info on Romanian parliament websites - I believe the information is there.
And if a single file contains one sitting, then add sitting identification.
Missing speech content
- speech content
In some files there is no speech content:
https://github.com/romanian-parlamint/ParlaMint/blob/8439dd75ca3c31b89f06bac23eff736a72a6ed6a/Data/ParlaMint-RO/ParlaMint-RO_2000-09-04-id4959.xml
<note type="time">Şedinţa a început la ora 15,55.</note>
<note type="chairman">Lucrările au fost conduse de domnul Ion Diaconescu, preşedintele Camerei Deputaţilor, asistat de domnii Andrei Ioan Chiliman şi Acsinte Gaspar, secretari.</note>
<note type="speaker">Domnul Ion Diaconescu:</note>
<u ana="#chair" who="#Ion-Diaconescu" xml:id="ParlaMint-RO_2000-09-04-id4959.u1"/>
<note type="speaker">Domnul Iuliu Ioan Furo:</note>but the source contains speech contents:
https://www.cdep.ro/pls/steno/steno2015.stenograma?ids=4959&idl=1#S0
Chairman note type
- use
narrativeorpresident
According to doc,narrativeorpresidentfits better in this case:
https://github.com/romanian-parlamint/ParlaMint/blob/8439dd75ca3c31b89f06bac23eff736a72a6ed6a/Data/ParlaMint-RO/ParlaMint-RO_2000-09-04-id4959.xml#L125
<note type="chairman">Lucrările au fost conduse de domnul Ion Diaconescu, preşedintele Camerei Deputaţilor, asistat de domnii Andrei Ioan Chiliman şi Acsinte Gaspar, secretari.</note>not recognized notes
- notes in text
Notes are in source italics so easy to recognize...
<seg xml:id="ParlaMint-RO_2000-04-14-id4927.u39.seg6">Cine este pentru?(Vociferără în partea dreaptă a sălii).Vă rog să număraţi... Vă rog să ridicaţi mâna, cei care sunteţi pentru acest amendament, să repetăm numărătoarea. Este o confuzie.</seg>should be: (https://clarin-eric.github.io/ParlaMint/#TEI.vocal)
<seg xml:id="ParlaMint-RO_2000-04-14-id4927.u39.seg6">Cine este pentru? <vocal type="shouting">
<desc>(Vociferără în partea dreaptă a sălii)</desc>
</vocal> Vă rog să număraţi... Vă rog să ridicaţi mâna, cei care sunteţi pentru acest amendament, să repetăm numărătoarea. Este o confuzie.</seg>presence list
- presence list is missing status
<u ana="#regular" who="#Andrei-Ioan-Chiliman" xml:id="ParlaMint-RO_2000-04-14-id4927.u46">
<seg xml:id="ParlaMint-RO_2000-04-14-id4927.u46.seg1">Achimescu Victor Ştefan</seg>
<seg xml:id="ParlaMint-RO_2000-04-14-id4927.u46.seg2">Aferăriţei Constantin</seg>
<seg xml:id="ParlaMint-RO_2000-04-14-id4927.u46.seg3">Afrăsinei Viorica</seg>corpus timespan
- corpus timespan
bibl - corpus timespan
setting - corpus timespan it would be nice to have it in text content of corpus title too
<bibl>
<title type="main" xml:lang="en">Meeting minutes of the Romanian Parliament</title>
<title type="main" xml:lang="ro">Stenograme ale şedinţelor din Parlamentul României</title>
<idno type="URI">http://www.parlament.ro/</idno>
<date from="2000-02-01" to="2020-11-24">2000-02-01 - 2020-11-24</date>
</bibl> <setting>
<name type="city">Bucharest</name>
<name type="place">Palace of the Parliament</name>
<date from="2000-02-01" to="2020-11-24"/>
</setting>setting element
- setting element in root file
root file setting element should correspond to component ones (missing country)
<setting>
<name type="city">Bucharest</name>
<name type="place">Palace of the Parliament</name>
<date from="2000-02-01" to="2020-11-24"/>
</setting> <setting>
<name type="city">Bucharest</name>
<name type="country" key="RO">Romania</name>
<date when="2000-04-14" ana="#parla.sitting">14.04.2000</date>
</setting>capitalize surname
- dont capitalize surname
<surname>GORGHIU</surname>should be
<surname>Gorghiu</surname>sort component files
- sort component files
The component files should be ordered according to the contents' date.
taxonomies
- translations
- wrong language context - English content in
xml:lang="ro" - missing descriptions

