RO Feedback #626

@matyaskopp

meeting element

  • extend meeting elements (#parla.term, #parla.sitting)

I haven't found any information about terms or sittings in the meeting elements. This is how other corpora implement it:

<meeting ana="#parla.term #parla.uni" n="8" corresp="#ВРУ">8</meeting>
<meeting ana="#parla.session #parla.uni" n="1" corresp="#ВРУ">1</meeting>
<meeting ana="#parla.sitting #parla.uni" n="2014-12-02" corresp="#ВРУ">2014-12-02</meeting>

I was not able to find term information on the Romanian parliament websites, but I believe it is available there.
If a single file contains exactly one sitting, please also add the sitting identification.
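
A minimal sketch of how such meeting elements could be generated, assuming the term, session and sitting values are already known for each file. The `meeting` helper, the `#ORG` reference and the sample values are hypothetical placeholders, and the chamber category (`#parla.uni` is simply copied from the example above) would have to match the Romanian taxonomy:

```python
import xml.etree.ElementTree as ET


def meeting(kind, value, corresp, chamber="#parla.uni"):
    """Build one <meeting> element; kind is 'term', 'session' or 'sitting'."""
    el = ET.Element(
        "meeting",
        {"ana": f"#parla.{kind} {chamber}", "n": value, "corresp": corresp},
    )
    el.text = value
    return el


# Placeholder values: the real term number and organisation id are not
# given in this issue and must come from the corpus metadata.
for el in (
    meeting("term", "TERM-NUMBER", "#ORG"),
    meeting("sitting", "2000-04-14", "#ORG"),
):
    print(ET.tostring(el, encoding="unicode"))
```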

Missing speech content

  • speech content

In some files there is no speech content:
https://github.com/romanian-parlamint/ParlaMint/blob/8439dd75ca3c31b89f06bac23eff736a72a6ed6a/Data/ParlaMint-RO/ParlaMint-RO_2000-09-04-id4959.xml

        <note type="time">Şedinţa a început la ora 15,55.</note>
        <note type="chairman">Lucrările au fost conduse de domnul Ion Diaconescu, preşedintele Camerei Deputaţilor, asistat de domnii Andrei Ioan Chiliman şi Acsinte Gaspar, secretari.</note>
        <note type="speaker">Domnul Ion Diaconescu:</note>
        <u ana="#chair" who="#Ion-Diaconescu" xml:id="ParlaMint-RO_2000-09-04-id4959.u1"/>
        <note type="speaker">Domnul Iuliu Ioan Furo:</note>

but the source does contain the speech content:
https://www.cdep.ro/pls/steno/steno2015.stenograma?ids=4959&idl=1#S0
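
To find every affected file, something like the following sketch could list utterances that have no text. It assumes the component files use the TEI namespace and wrap speech text in `seg` elements; the `empty_utterances` helper and the second sample utterance are made up for illustration:

```python
import xml.etree.ElementTree as ET

TEI = "{http://www.tei-c.org/ns/1.0}"
XML_ID = "{http://www.w3.org/XML/1998/namespace}id"


def empty_utterances(root):
    """Return the xml:id of every <u> that has no <seg> with text in it."""
    empty = []
    for u in root.iter(f"{TEI}u"):
        has_text = any(
            "".join(seg.itertext()).strip() for seg in u.iter(f"{TEI}seg")
        )
        if not has_text:
            empty.append(u.get(XML_ID))
    return empty


# The first <u> is the empty one quoted above; the second is invented.
sample = """<div xmlns="http://www.tei-c.org/ns/1.0">
  <u ana="#chair" who="#Ion-Diaconescu" xml:id="ParlaMint-RO_2000-09-04-id4959.u1"/>
  <u ana="#regular" who="#Someone" xml:id="demo.u2">
    <seg xml:id="demo.u2.seg1">Some speech text.</seg>
  </u>
</div>"""

print(empty_utterances(ET.fromstring(sample)))
```

For real files, `ET.parse(path).getroot()` would replace the inline sample.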

Chairman note type

        <note type="chairman">Lucrările au fost conduse de domnul Ion Diaconescu, preşedintele Camerei Deputaţilor, asistat de domnii Andrei Ioan Chiliman şi Acsinte Gaspar, secretari.</note>

not recognized notes

  • notes in text

Notes are set in italics in the source, so they should be easy to recognize.

https://github.com/romanian-parlamint/ParlaMint/blob/8439dd75ca3c31b89f06bac23eff736a72a6ed6a/Data/ParlaMint-RO/ParlaMint-RO_2000-04-14-id4927.xml#L474

<seg xml:id="ParlaMint-RO_2000-04-14-id4927.u39.seg6">Cine este pentru?(Vociferără în partea dreaptă a sălii).Vă rog să număraţi... Vă rog să ridicaţi mâna, cei care sunteţi pentru acest amendament, să repetăm numărătoarea. Este o confuzie.</seg>

should be: (https://clarin-eric.github.io/ParlaMint/#TEI.vocal)

<seg xml:id="ParlaMint-RO_2000-04-14-id4927.u39.seg6">Cine este pentru? <vocal type="shouting">
    <desc>(Vociferără în partea dreaptă a sălii)</desc>
  </vocal> Vă rog să număraţi... Vă rog să ridicaţi mâna, cei care sunteţi pentru acest amendament, să repetăm numărătoarea. Este o confuzie.</seg>
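
A rough sketch of the conversion, assuming that a parenthesised span in the plain text corresponds to an italic note in the source (matching on the italics in the original HTML would be more reliable). The `mark_notes` helper is hypothetical; it cannot infer the element or `type`, so both are passed in by the caller, and it leaves surrounding punctuation and spacing untouched:

```python
import re
import xml.etree.ElementTree as ET

NOTE = re.compile(r"\([^()]*\)")


def mark_notes(text, tag="vocal", note_type="shouting"):
    """Return a <seg> where each '(...)' span is wrapped in <tag><desc>."""
    seg = ET.Element("seg")
    last = None  # most recent note element; following text goes in its tail
    pos = 0
    for m in NOTE.finditer(text):
        before = text[pos:m.start()]
        if last is None:
            seg.text = before
        else:
            last.tail = before
        last = ET.SubElement(seg, tag, {"type": note_type})
        ET.SubElement(last, "desc").text = m.group(0)
        pos = m.end()
    rest = text[pos:]
    if last is None:
        seg.text = rest
    else:
        last.tail = rest
    return seg


seg = mark_notes(
    "Cine este pentru?(Vociferără în partea dreaptă a sălii).Vă rog să număraţi..."
)
print(ET.tostring(seg, encoding="unicode"))
```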

presence list

  • presence list is missing status

https://github.com/romanian-parlamint/ParlaMint/blob/8439dd75ca3c31b89f06bac23eff736a72a6ed6a/Data/ParlaMint-RO/ParlaMint-RO_2000-04-14-id4927.xml#L510-L513

        <u ana="#regular" who="#Andrei-Ioan-Chiliman" xml:id="ParlaMint-RO_2000-04-14-id4927.u46">
          <seg xml:id="ParlaMint-RO_2000-04-14-id4927.u46.seg1">Achimescu Victor Ştefan</seg>
          <seg xml:id="ParlaMint-RO_2000-04-14-id4927.u46.seg2">Aferăriţei Constantin</seg>
          <seg xml:id="ParlaMint-RO_2000-04-14-id4927.u46.seg3">Afrăsinei Viorica</seg>

corpus timespan

  • corpus timespan bibl
  • corpus timespan setting
  • corpus timespan: it would be nice to have it in the text content of the corpus title too

https://github.com/romanian-parlamint/ParlaMint/blob/8439dd75ca3c31b89f06bac23eff736a72a6ed6a/Data/ParlaMint-RO/ParlaMint-RO.xml#L72

        <bibl>
          <title type="main" xml:lang="en">Meeting minutes of the Romanian Parliament</title>
          <title type="main" xml:lang="ro">Stenograme ale şedinţelor din Parlamentul României</title>
          <idno type="URI">http://www.parlament.ro/</idno>
          <date from="2000-02-01" to="2020-11-24">2000-02-01 - 2020-11-24</date>
        </bibl>

https://github.com/romanian-parlamint/ParlaMint/blob/8439dd75ca3c31b89f06bac23eff736a72a6ed6a/Data/ParlaMint-RO/ParlaMint-RO.xml#L252

        <setting>
          <name type="city">Bucharest</name>
          <name type="place">Palace of the Parliament</name>
          <date from="2000-02-01" to="2020-11-24"/>
        </setting>
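
If the intent is for these dates to match the actual component files, the span could be derived from the file names rather than typed by hand. This is only a sketch: it assumes every component is named like `ParlaMint-RO_YYYY-MM-DD-idNNNN.xml` (the pattern seen in the links above), and `corpus_timespan` is a hypothetical helper:

```python
import re
from datetime import date

NAME = re.compile(r"ParlaMint-RO_(\d{4})-(\d{2})-(\d{2})-id\d+\.xml$")


def corpus_timespan(filenames):
    """Return (first, last) ISO dates found in the component file names."""
    dates = []
    for name in filenames:
        m = NAME.search(name)
        if m:
            dates.append(date(*map(int, m.groups())))
    if not dates:
        raise ValueError("no component file names with a date")
    return min(dates).isoformat(), max(dates).isoformat()


# Only the two component files mentioned in this issue, as a demo.
files = [
    "ParlaMint-RO_2000-09-04-id4959.xml",
    "ParlaMint-RO_2000-04-14-id4927.xml",
]
start, end = corpus_timespan(files)
print(f'<date from="{start}" to="{end}">{start} - {end}</date>')
```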

setting element

  • setting element in root file

The setting element in the root file should correspond to the ones in the component files (the country is missing in the root file).

https://github.com/romanian-parlamint/ParlaMint/blob/8439dd75ca3c31b89f06bac23eff736a72a6ed6a/Data/ParlaMint-RO/ParlaMint-RO.xml#L249-L253

        <setting>
          <name type="city">Bucharest</name>
          <name type="place">Palace of the Parliament</name>
          <date from="2000-02-01" to="2020-11-24"/>
        </setting>

vs:
https://github.com/romanian-parlamint/ParlaMint/blob/8439dd75ca3c31b89f06bac23eff736a72a6ed6a/Data/ParlaMint-RO/ParlaMint-RO_2000-04-14-id4927.xml#L97-L101

        <setting>
          <name type="city">Bucharest</name>
          <name type="country" key="RO">Romania</name>
          <date when="2000-04-14" ana="#parla.sitting">14.04.2000</date>
        </setting>
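
One way to spot this kind of mismatch automatically is to compare the `name` types in the two `setting` elements. The sketch below works on the two snippets quoted above without namespaces; `name_types` is a hypothetical helper, and real files would need the TEI namespace added to the lookup:

```python
import xml.etree.ElementTree as ET


def name_types(setting_xml):
    """Return the set of type values on <name> children of a <setting>."""
    root = ET.fromstring(setting_xml)
    return {name.get("type") for name in root.iter("name")}


root_setting = """<setting>
  <name type="city">Bucharest</name>
  <name type="place">Palace of the Parliament</name>
  <date from="2000-02-01" to="2020-11-24"/>
</setting>"""

component_setting = """<setting>
  <name type="city">Bucharest</name>
  <name type="country" key="RO">Romania</name>
  <date when="2000-04-14" ana="#parla.sitting">14.04.2000</date>
</setting>"""

# Types present in the component file but absent from the root file.
print(name_types(component_setting) - name_types(root_setting))  # {'country'}
```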

capitalize surname

  • don't capitalize surname

https://github.com/romanian-parlamint/ParlaMint/blob/8439dd75ca3c31b89f06bac23eff736a72a6ed6a/Data/ParlaMint-RO/ParlaMint-RO.xml#L384

              <surname>GORGHIU</surname>

should be

              <surname>Gorghiu</surname>
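
A small sketch of the normalization, assuming that capitalizing the first letter of each alphabetic run is acceptable for all surnames in the corpus (names with lowercase particles would need manual review). `normalize_surname` is a hypothetical helper:

```python
import re


def normalize_surname(surname):
    """Capitalize each run of letters, keeping hyphens and spaces as-is."""
    return re.sub(
        r"[^\W\d_]+", lambda m: m.group(0).capitalize(), surname
    )


print(normalize_surname("GORGHIU"))  # Gorghiu
```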

sort component files

  • sort component files

The component files should be ordered according to the contents' date.
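
A possible ordering, sketched under the assumption that the date in the file name reflects the date of the contents and that the numeric id is a reasonable tie-breaker for files sharing a date. `sort_key` is a hypothetical helper:

```python
import re

KEY = re.compile(r"_(\d{4}-\d{2}-\d{2})-id(\d+)\.xml$")


def sort_key(name):
    """Sort by ISO date first, then by the numeric id in the file name."""
    m = KEY.search(name)
    if m is None:
        raise ValueError(f"unexpected component file name: {name}")
    return m.group(1), int(m.group(2))


# Only the two component files mentioned in this issue, as a demo.
files = [
    "ParlaMint-RO_2000-09-04-id4959.xml",
    "ParlaMint-RO_2000-04-14-id4927.xml",
]
print(sorted(files, key=sort_key))
```

ISO dates sort correctly as plain strings, so no date parsing is needed for the ordering itself.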

taxonomies

  • translations
  • wrong language context - English content in xml:lang="ro"
  • missing descriptions