| |||
Дайджест от Диалога -------------------------------------------------------------------------------- Пожалуйста, передайте информацию о рассылке нашего дайджеста всем, для кого она может быть интересной. Мы рассылаем дайджест бесплатно в двух вариантах: - в формате .html (в теле письма): анонсы новостей со ссылками на сайт; - в формате .doc (вложенный файл в архивированном виде): тексты новостей полностью, без иллюстраций. Вы можете в любой момент отказаться от подписки или изменить вид рассылки дайджеста. Мы будем благодарны за отзывы, советы или материалы для дальнейших выпусков. Контактный адрес: digest@speedy.iis.nsk.su МТ Маш перевод ... MT list www.eamt.org ... Mosling /// Рассылка московских лингвистов /// Библиотека МФТИ ... Войдите на стартовую страницу по адресу http:// www.csa.com, введите пароли и можете начинать работу. Для удобства освоения в директории Help Links ( на стартовой странице) кликните на самую нижнюю строку Quick Reference Cards , там находится инструкция (краткая) на русском языке по работе с БД. CSA - это реферативная научная база данных широкой тематики: техника; сельское хозяйство; науки о Земле; медицина; биология; общественные науки; гуманитарные науки, искусство. CSA включает около 70 БД, преимущественно реферативно-библиографических (периодика, книги, справочные издания, диссертации), 4 полнотекстовые БД статей из 80 журналов компании Sage Publications http://www.sagepub.com по коммуникациям, криминалистике, политике, международным отношениям и социологии, БД Web-ресурсов с описаниями около 240 000 научных ресурсов Интернета. Доступ предоставлен до конца 2007 г. С уважением, Наталья Кобзева Библиотека С полным перечнем электронных научных журналов и ресурсов вы можете ознакомиться на сайте: "Электронные Библиотеки в МФТИ" http://mipt.ru/study/net_libr/ На сайт "Электронные Библиотеки в МФТИ" можно также перейти с сайта "Электронна Библиотека Физтеха" http://lib.mipt.ru/ (в разделе Ссылки/Разное http://lib.mipt.ru/?spage=links) ... Диалог .. Самая лучшая у нас .. Linguistic Data Consortium LDC ... Там такие новости - Free Google Data (Web 1T 5-gram) Available - - LangTech 2008 - LDC2007T40 - Arabic Gigaword Third Edition - LDC2007S18 - CSLU Kid''s Speech Version 1.1 - LDC2007T20 - GALE Phase 1 Distillation Training - -------------------------------------------------------------------------------- Free Google Data (Web 1T 5-gram) Available We are pleased to announce that Google Inc. is once again providing financial support for the distribution of its Web 1T 5-gram (LDC2006T13) corpus to universities. As a result, LDC will make the corpus available at no charge to the next 100 non-member universities requesting a copy. Shipping and handling fees are also being covered by Google. We appreciate Google''s continued generosity and its interest in supporting language research. To obtain a free copy, universities will need to sign and submit a copy of the User License Agreement for Web 1T 5-gram Version 1 . This can be faxed to +1 215 573 2175 or scanned and emailed. Complete contact details, including shipping address, phone number, and email are also required. LangTech 2008 The LangTech 2008 conference will be held at the San Michele a Ripa convention center in Rome, February 28-29, 2008. The technical program will consist of eight oral sessions and four poster sessions. The main themes are: HLT applications in e-democracy, e-government intelligence linguistic analysis and web mining translation and multilinguality document engineering and HLT future perspective in European HLT research activities advanced speech and language technologies robotics and complex interaction in HLT HLT and interfaces standards, solutions and emerging applications in speech and language technology Take note that the paper submission deadline is November 30, 2007. Further information is available on the LangTech 2008 website. New Publications (1) Arabic Gigaword ... Corpora List Date: Mon, 9 Jul 2007 23:50:33 -0700 (PDT) From: Niladri Dash <ns_dash@yahoo.com> Reply-To: ns_dash@yahoo.com Subject: [Corpora-List] Call For Papers To: CORPORA@UIB.NO MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Message-ID: <8990.35116.qm@web51403.mail.re2.yahoo.com> X-checked-clean: by exiscan on rolf X-Scanner: e3d144cf198fdbda3885dab7646a9ed0 http://tjinfo.uib.no/virus.html X-UiB-SpamFlag: NO UIB: 6.8 hits, 8.0 required X-UiB-SpamReport: spamassassin found; 5.1 ''From'' yahoo.com does not match ''Received'' headers 1.0 BODY: UIB_MAILWON 0.1 BODY: 20% to 30% of HTML elements are non-standard 0.0 BODY: HTML included in message 0.7 RAW: Contains a line >= 199 long Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: 8bit List-Id: <corpora@uib.no> Sender: owner-corpora@md.lists.uib.no Precedence: bulk Call For Papers For: LANGUAGE FORUM: A JOURNAL OF LANGUAGE AND LITERATURE (Vol. 35, No. 2, July-Dec 2009) Special Number on CORPUS LINGUISTICS Within the last few decades we have experienced a global upsurge for generating language corpus of various types to be directly accessed in various domains of mainstream linguistics, applied linguistics, and language technology. This gives birth to Corpus Linguistics - a new area of linguistic research that can challenge our so-long-practiced traditional linguistics. However, generation and utilization of corpora are interfaced with several theoretical and practical problems that need to be addressed properly before linguistic data and information stored in corpora are utilized properly. Since most of the issues related to Corpus Linguistics are not yet addressed properly with equal emphasis to all the language types, an issue of Language Forum, a peer reviewed international Journal, has been planned on Corpus Linguistics thus. Hence, we invite original research papers related (not restricted) to the following broad areas: Corpus generation (speech corpus, text corpus, multimodal corpus, special corpus, etc.) Corpus classification and visualization Statistical analysis of corpus data Exploratory analysis corpus data Text corpus encoding and annotation Tool-based digital corpus editing Part-of-speech and morpho-syntactic tagging Lemmatisation Text processing (concordance, key-word-in-context study, parsing, etc.) Annotation (grammatical, semantic, anaphoric, discoursal) Methodology and usage study for corpus data analysis Corpus and mainstream linguistics Corpus and natural language processing Corpus and applied linguistics Corpus and machine translation Corpus and culture studies The papers pertaining to the areas mentioned above should be submitted electronically (both in PDF and DOC format) to the Guest Editor/Editors, at their contact addresses given below not later than 30th June 2008. Papers should not exceed 7500 words and should be preceded by an abstract of 200-250 words. The title, the name(s) and full mailing address(es) of the author(s), including e-mail addresses, should appear on the first page of the manuscript The editors will select contributions for the special issue and notify authors of acceptance or otherwise once reports are received from the reviewers not later than 30th Sep 2008. All papers submitted to Language Forum - LF should be original, neither having been previously published nor being considered elsewhere at the time of submission Manuscripts should be in conformity with the Language Forum format, which can be made available on request. Guest Editor: Dr. Niladri Sekhar Dash Linguistic Research Unit Indian Statistical Institute Kolkata, India Email: ns_dash@yahoo.com Email: niladri@isical.ac.in http://www.isical.ac.in/~niladri Editors: Harpreet Kaur Bahri Deepinder Singh Bahri C/o BAHRI PUBLICATIONS 1749A/5, Govindpuri Extension Kalkaji, New Delhi 110019, India Email: bahrius@vsnl.com Email: bahripublications@yahoo.com With best wishes and regards, Sincerely Niladri Sekhar Dash ISI, Kolkata, India |