vous savez – Traduction – Dictionnaire Keybot

  Málföng  
Skrárnar "The_Broadcast_News_RUV-1_Corpus/*.wav" eru bútaðar hljóðskrár.
The files "The_Broadcast_News_RUV-1_Corpus/*.wav" are segmented wave files.
  Málföng  
Efnið er í tólf hlutum og er hljóðritað á tímabilinu október 2004 til maí 2005. Upptökurnar eru mislangar, allt frá nokkrum mínútum upp í fáeinar klukkustundir. Í heild eru þær meira en 20 tímar. Hljóðskrárnar eru á mp3-sniði.
The corpus consists of twelve sequences recorded between October 2004 and May 2005. The recordings vary in length, ranging from a few minutes to a few hours. In total they are more than 20 hours in length. The audio files are in MP3 format.
  Málföng  
Í málheildinni eru 20 málhafar, 10 kvenmenn og 10 karlmenn. Hljóðskrárnar fyrir hvern málhafa eru í undirmöppum. Í möppu 'm7' eru skrár fyrir karlmann númer 7. Hver málhafi les um það bil 200 setningar úr upplýsingum um veðurfar.
The database includes 20 persons, 10 female and 10 male readers. The sound files for each person are in a subfolder. In folder 'm7' are the files for male reader number 7. Each reader reads roughly 200 sentences which correspond to weather information queries.
  Málföng  
2. Sækja textana. Textarnir eru aðgengilegir í sérstöku xml-sniði sem er skilgreint af TEI (Text Encoding Initiative). Bókfræðilegar upplýsingar fylgja öllum textum. Væntanlegir notendur þurfa að skrá sig og samþykkja notkunarskilmála.
2. Download. The texts are available in a special XML format defined by the TEI (Text Encoding Initiative). Bibliographical information is included with all the texts. Prospective users must register and accept the terms and conditions. The texts are accessible under a CC BY 3.0 licence.
  Málföng  
Notuð er sama markaskrá og fyrir Markaða íslenska málheild. Textaskrárnar eru í Linux-sniði og notuð er UTF-8 stafatafla. Snið skránna er þannig að í hverri línu er eitt orð ásamt marki. Orðið er fremst í línu síðan kemur dálkmerki (tab) og þá mark.
The tagset is the same as the one used for the Tagged Icelandic Corpus. The text files are in Linux format and encoded in UTF-8. The files contain one token per line: each line consists of the word, followed by a tab and then the tag. Sentences are separated by empty lines.
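The one-token-per-line layout described above can be read with a few lines of Python. This is a minimal sketch under stated assumptions, not the corpus's own tooling: the function name `read_tagged_sentences` and the sample tags are made up for illustration.

```python
from typing import Iterable, List, Tuple

Token = Tuple[str, str]  # (word, tag)


def read_tagged_sentences(lines: Iterable[str]) -> List[List[Token]]:
    """Group 'word<TAB>tag' lines into sentences split on empty lines."""
    sentences: List[List[Token]] = []
    current: List[Token] = []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if not line.strip():
            # An empty line closes the current sentence.
            if current:
                sentences.append(current)
                current = []
            continue
        # The word comes first, then a tab, then the tag.
        word, tag = line.split("\t", 1)
        current.append((word, tag))
    if current:
        sentences.append(current)
    return sentences


# Hypothetical sample; the tags are placeholders, not real tagset entries.
sample = "Hann\tTAG1\nles\tTAG2\n.\tTAG3\n\nHún\tTAG1\n"
sentences = read_tagged_sentences(sample.splitlines())
```

For a real file, open it with `encoding="utf-8"` and pass the file object to the function in place of the sample lines.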
  Málföng  
Í textasafni því sem hér er gert aðgengilegt eru textar úr 41 Íslendingasögu, Sturlungu, Heimskringlu og Landnámabók. Í töflunni hér fyrir neðan má sjá hvernig textarnir skiptast á milli sagnanna. Tölurnar miðast við orð, greinarmerki eru ekki talin með.
The corpus made available here contains texts from 41 Family Sagas, Sturlunga Saga, Heimskringla and the Book of Settlement. The division of the corpus is shown in the table below. Numbers refer to running words excluding punctuation.
  Málföng  
Loks voru gögnin flutt í umritunarforritið Transcriber og um leið var farið enn einu sinni yfir umritun textans og hún lagfærð eftir þörfum. Einnig var gengið frá samstillingu hljóð- og textaskráa. Skrárnar sem til urðu við þetta eru svokallaðar .trs-skrár en það eru textaskrár á xml-sniði sem auðveldlega má flytja yfir á annað snið.
The next step was to transfer the text to Transcriber, a tool for segmenting, labeling and transcribing speech. At the same time, the transcriptions were further revised and the sound and text files were synchronized. Transcriber produced so-called .trs files, which are text files in XML format that can easily be converted to another format. The corpus consists of these XML files along with the sound files.
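As a rough illustration of how such XML transcripts might be converted to another format, the sketch below pulls time-stamped segments out of a .trs-style document with Python's standard library. The element and attribute names (`Trans`, `Episode`, `Section`, `Turn`, `Sync`, `speaker`, `time`) are assumptions about the Transcriber format that the text above does not confirm, and the sample fragment is invented.

```python
import xml.etree.ElementTree as ET

# Hypothetical fragment in the style of a Transcriber .trs file.
SAMPLE_TRS = """<Trans>
  <Episode>
    <Section type="report" startTime="0" endTime="5.2">
      <Turn speaker="spk1" startTime="0" endTime="5.2">
        <Sync time="0"/>
        fyrsti bútur
        <Sync time="2.5"/>
        annar bútur
      </Turn>
    </Section>
  </Episode>
</Trans>"""


def extract_segments(trs_xml: str):
    """Return (speaker, start_time, text) for each Sync-delimited segment."""
    root = ET.fromstring(trs_xml)
    segments = []
    for turn in root.iter("Turn"):
        speaker = turn.get("speaker", "")
        for sync in turn.iter("Sync"):
            # Text that follows a <Sync/> element is stored in its tail.
            text = (sync.tail or "").strip()
            if text:
                segments.append((speaker, float(sync.get("time")), text))
    return segments


segments = extract_segments(SAMPLE_TRS)
```

Each tuple pairs a speaker id with the start time of a segment and its transcribed text, which is enough to write the data out as tab-separated lines or subtitles.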
  Málföng  
Einnig hafði komið í ljós að gallar voru í nokkrum textanna. Nú eru textarnir markaðir með bestu aðferð sem tiltæk er með því að nýta þjálfunarsafn gert úr fornritinum eins og lýst er hér að ofan og gallar í textunum hafa verið lagfærðir.
It should be pointed out that the texts available through this website until March 2018 were tagged with the same procedures as the texts in the Tagged Icelandic Corpus. Some flaws were also found in some of the texts. The texts that are now available are tagged with the best available method by utilizing a training corpus with medieval texts as described above and the flaws have been corrected.
  Málföng  
Í málheildinni eru 13 skrár. Í hverri skrá eru textar sem var safnað úr einum textaflokki af 23 textaflokkum í Markaðri íslenskri málheild. Textarnir voru hreinsaðir og síðan markaðir með fimm mismunandi mörkurum og að lokum var kosið á milli markanna með tiltekinni aðferð (Hrafn Loftsson og fl., 2010).
The MIM-GOLD corpus consists of 13 files of tagged Icelandic text sampled from 13 of the 23 text domains of the 25-million-word Tagged Icelandic Corpus (MIM). The texts were cleaned extensively and then run through an automatic tagging process consisting of five taggers and a voting scheme (Loftsson et al., 2010). The tags were then manually corrected. The files are delivered as they stand after the first manual correction pass, which has not been validated, and the tagging accuracy after that pass has not been estimated. The corpus is intended for training data-driven taggers for Icelandic.
  Málföng  
Háskólinn í Reykjavík og Máltæknisetur stóðu fyrir söfnun íslenskra raddsýna í samstarfi við fyrirtækið Google. Gögnin eru aðgengileg fyrir almenning hér á síðunni og er þetta kjörið tækifæri til að þróa ýmsan máltæknibúnað fyrir íslensku, eins og til dæmis talgreini.
Reykjavík University and The Icelandic Centre for Language Technology collected data for an Icelandic speech corpus in collaboration with Google. The data is available on this webpage for everybody and this is a good opportunity to develop language technology tools for Icelandic such as a speech recognizer. Voice samples from 563 individuals were recorded with Android G1 smart-phones, a total of 152 hours of speech. In total 127,286 voice samples were recorded. Of those 108,568 were considered useful and 18,718 were discarded. The 108,568 good voice samples can be downloaded from this webpage.
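The sample counts quoted above are internally consistent, which a short check makes explicit. This is only a worked calculation on the figures in the text; the variable names are illustrative.

```python
# Figures as quoted in the corpus description.
total_recorded = 127_286
useful = 108_568
discarded = 18_718

# The useful and discarded samples should add up to the total.
counts_agree = (useful + discarded == total_recorded)

# Share of recorded samples that were kept, as a percentage.
useful_percent = round(100 * useful / total_recorded, 1)
```

On these figures the counts agree, and about 85.3% of the recorded samples were kept.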
  Málföng  
Leitað var sérstaklega að setningum með sjaldgæfum mynstrum til að tryggja að nógu mörg dæmi um þau yrðu í lestextum. Í skránni patterns_sentences.xls eru annars vegar 182 sjaldgæf stafamynstur og hins vegar 1433 setningar sem valdar voru samkvæmt þeim mynstrum.
The sentences that participants in the Hjal project were to read were extracted automatically from a large corpus of around 100 novels acquired from local publishing houses. The sentences were selected according to a list of n-grams made by Eiríkur Rögnvaldsson, most of them extracted from Íslensk rímorðabók [Icelandic Rhyming Dictionary]. Care was taken to select sentences that include rare patterns, to ensure that these patterns would be amply exemplified in the reading text. The file patterns_sentences.xls contains both 182 rare n-grams and 1433 sentences selected on the basis of these n-grams.
  Málföng  
Leitin er aðgengileg á málheildarsíðu Stofnunar Árna Magnússonar. Nýta má málfræðilegar upplýsingar til þess að skilgreina leitina og jafnframt fást bókfræðilegar upplýsingar um textana sem leitarniðurstöður eru úr.
1. Search. The search is available through the corpus page of the Árni Magnússon Institute. Grammatical information can be used to refine the search. Bibliographic information is displayed for the texts that appear in the search results. Here is a list of the texts that can be searched. On the search page it is possible to choose any of the texts for search. One of the works is Íslendingaþættir, and any of its individual texts can also be chosen for search.
  Málföng  
Íðorðabankinn sinnir þessu hlutverki. Hann getur veitt yfirsýn yfir íslenskan íðorðaforða og nýyrði úr almennu máli, sem eru efst á baugi, og stuðlað með því að auknu samræmi í notkun orðanna svo og skilgreiningunum á þeim.
One of the roles of the Term Bank is to standardize the use of terms within related and unrelated subject fields. The aim is to prevent many different terms from being used for the same concept or phenomenon. The Term Bank provides an overview of Icelandic terminology and topical neologisms and thereby makes it easier to coordinate and standardize term usage. Additionally, the Term Bank provides access to Icelandic translations of foreign terms, and to definitions of terms in Icelandic and other languages. The Term Bank thus benefits all those who write about specialized topics, such as translators, teachers, students, journalists, government agencies, businesses and other interested people, and last but not least compilers of dictionaries.
  Málföng  
Hér er veittur aðgangur að rafrænum textum Íslendingasagna, Sturlungu, Heimskringlu og Landnámabókar. Að undanskilinni Landnámabók eru textarnir úr útgáfum Svarts á hvítu og Máls og menningar sem komu út á árunum 1985 til 1991.
This page gives access to electronic texts of some Old Icelandic narrative texts: Family Sagas (Íslendingasögur), Sturlunga Saga, Sagas of the Kings of Norway (Heimskringla) and the Book of Settlement (Landnámabók). With the exception of Landnámabók, the texts are from the editions of Svart á hvítu and Mál og menning published between 1985 and 1991.
  Málföng  
Í safninu eru upptökur frá umræðutímum á Alþingi veturinn 2004-2005, alls tæplega 21 klukkustund, ásamt nákvæmri umritun þeirra í textaskrám. Auk þess fylgja ýmsar grunnupplýsingar um upptökurnar og þá sem taka til máls, s.s. aldur þeirra og kyn.
The corpus contains recordings from discussion periods at the Icelandic Parliament during the winter of 2004-2005. The recordings are nearly 21 hours in total and come with detailed transcriptions in text files. Information about the recordings and the speakers, such as their age and gender, is provided as well. The data is intended to reflect natural spoken Icelandic under formal conditions. The discussion periods were chosen because they consist primarily of unprepared speeches that are unlikely to have been written in advance and read out loud. In addition, the aim was diversity of topics and speakers (with respect to their origin, age and gender).
  Málföng  
Vinnsluhlutinn skiptist í mismunandi svæði sem hvert og eitt tilheyrir höfundi tiltekins orðasafns (einstaklingi eða t.d. orðanefnd) og engum öðrum. Þegar vinnu við gerð orðasafns lýkur flytur orðabankastjóri það í birtingarhlutann; raunar er unnt að birta einstaka hluta orðasafns jafnóðum og þeir eru tilbúnir.
The general user has no access to the processing section of the Term Bank, which is reserved for compilers of terminologies (individuals and committees). The processing section is further divided into subsections, each belonging to the author of a particular terminology collection. When a terminology collection is completed, the editor of the Term Bank transfers it to the published section; in fact, individual parts of a terminology collection can be displayed as soon as they are ready. The published section is the visible Term Bank, i.e., the part of the bank that the general public has access to. With a tailor-made search interface it is possible to search for Icelandic or foreign terms, in one or more collections at a time, and retrieve various information on the term, both in Icelandic and in other relevant languages.
  Málföng  
Þegar birtar eru niðurstöður sem eru fengnar með því að nota gögn Gullstaðalsins vinsamlegast vitnið í:
When publishing results based on the texts in MIM-GOLD please refer to:
  Málföng  
Eftirfarandi upplýsingar eru um málhafana, aldur þeirra, stað í upphaflegu DAT-upptökunum og línurnar sem þeir lásu.
Here is information about the readers: their age, the position of each recording on the original DAT tape, and the lines read.
  Málföng  
Hægt er að sækja Framburðarorðabókina hér á vefnum. Væntanlegir notendur þurfa að skrá sig og samþykkja notkunarskilmála. Textarnir eru aðgengilegir með CC BY 3.0 leyfi.
You can download the Pronunciation Dictionary here. Prospective users must register and accept the terms and conditions. The texts are accessible under a CC BY 3.0 licence.
  Málföng  
Hægt er að sækja Mynstur og setningar hér á vefnum. Væntanlegir notendur þurfa að skrá sig og samþykkja notkunarskilmála. Textarnir eru aðgengilegir með CC BY 3.0 leyfi.
You can download Patterns and Sentences here. Prospective users must register and accept the terms and conditions. The texts are accessible under a CC BY 3.0 licence.
  Málföng  
Fornmálstextarnir eru aðgengilegir til notkunar á tvenns konar hátt:
The texts of the Saga Corpus are available for use in two different ways:
  Málföng  
Þegar birtar eru niðurstöður sem eru fengnar með því að nota gögn Málróms vinsamlegast vitnið í:
When publishing results based on data in the Málrómur database please refer to:
  Málföng  
Meðal umfjöllunarefna í umræðutímunum eru fjárlög, skattamál, vatnalög, orkumál, skólamál og samgöngur, auk þess sem finna má fundarstjórn forseta.
Among the topics of the discussion sessions are the government budget, taxation, water laws, energy, schools and transportation.