-
Notifications
You must be signed in to change notification settings - Fork 4
MatrixMrsTestSuiteTatoeba
This is a small experiment to put MRS test suite sentences into Tatoeba corpus. The primary purposes of this experiment are to (a) verify the naturalness of the sentences and (b) bootstrap a crosslingual MRS corpus through crowdsourcing.
no. | English | Tatoeba url |
11 | It rained. | http://tatoeba.org/eng/sentences/show/2481815 |
21 | Abram barked. | http://tatoeba.org/eng/sentences/show/2481838 |
31 | The window opened. | http://tatoeba.org/eng/sentences/show/2481909 |
351 | The dog is barking. | http://tatoeba.org/eng/sentences/show/2482132 |
421 | The dog couldn't bark. | http://tatoeba.org/eng/sentences/show/2489590 |
Issues that came up when putting MRS test suite sentences into Tatoeba (the bolded sentences/phrases are comments from Tatoeba's user on some of the MRS sentences)
-
@possible copyright violation: How are MRS test suites licensed?
-
This is not a good translation: Some supposedly grammatical sentences were deemed as unnatural by Tatoeba's user, should these sentences be corrected to ensure that our grammars are parsing the right stuff?
-
Sentences are machine-translated.: Are sentences from the MRS test suite humanly translated? Or are they generated using LOGON?
-
I cannot think of any situation where this would be used.: Sentences from the test suite should validate the grammars' correctness, so it's understandable that some sentences are not very useful to Tatoeba's user on the surface. Is there a page where it says which phenomenon/phenomena each sentence captures? Are these phenomena language dependent?
Home | Forum | Discussions | Events