|
|
Go traidisiúnta, is le lámh a thógtaí ointeolaíochtaí, WordNets agus agus réimsí focal, agus teangeolaithe ag brath go hiomlán ar inbhreithniú agus eolas faoin domhan. Tá an modh beagán suibiachtúil seo arna fheabhsú ó shin trí acmhainní a ghineann ríomhaire atá bunaithe ar bhailiúcháin téacs leictreonacha ollmhóra (cosúil leis an idirlíon), anailís gramadaí uathoibríoch agus staitisticí.
|
|
|
Traditionally, both ontologies, WordNets and word fields have been built by hand, with linguists relying solely on introspection and world knowledge. This somewhat subjective method has since been enhanced, and sometimes replaced, by computer-generated resources drawing on huge electronic text collections (like the internet), automatic grammatical analysis and statistics. Examples are automatically generated word fields in the form of co-occurence webs (Leipzig Wortschatz), or relational dictionaries (DeepDict, Sketch Engine). They syntactically extract what, for instance, a horse can (a) be [wild, dark, wooden, Trojan], (b) do [neigh, gallop, trot], or (c) have done to [tether, groom], simply by evaluating sentences where the word horse has the syntactic function of noun phrase head, subject or object, respectively.
|