Language, Speech and Multimedia Technologies Observatory

http://www.clarin.eu/news/3418
01/31/2011 - 15:25

The facet browser of the Virtual Language Observatory has been replaced by a new version. It now is faster, more stable, features direct links to resources, a feature that was broadly requested in a recent survey. It now relies completely on CMDI metadata as information source.



The Google Earth overlay has been updated too: it now features links to relevant information resources per language.

http://www.mail-archive.com/mt-list@eamt.org/msg01587.html
01/31/2011 - 15:25

2011/01/28 -- Mikel L. Forcada

http://www.speechtechmag.com/Articles/N
01/11/2011 - 19:05

Nuance's voice dictation technology enables users to speak text for email, SMS text messages, status updates, and more.

http://semanticweb.com/share-your-knowledge-and-experience-at-the-next-semtech-conference-and-meet-these-organizations_b17259?c=rss
01/05/2011 - 22:00

Next Monday (January 10) is the deadline for making a speaking proposal to the 2011 Semantic Technology Conference. You can start the submission process here.

I’ll reiterate my invitation from a few weeks ago, inviting everyone who has business and/or technical experience working with semantic technologies to put yourself forward and share your expertise. We want – in fact we NEED – your first-hand stories and lessons. We need stories from all industries, in any application area, and from the largest to the smallest scale. We’ve received lots of terrific speaking proposals thus far, but there’s plenty of room left for you and your presentation.

If selected, you and your work will be exposed to the richly diverse group of individuals and organizations that SemTech brings together each year. Here for example is a small sample of the organizations who attended SemTech in 2010:

  • 1060 Research
  • 3 Degrees Holdings
  • AAA
  • Accenture Technology Labs
  • Access Communications
  • AdGlean
  • Adify
  • Adobe Systems
  • AgileDNA
  • ai-one
  • AIFB/KIT
  • Air Force Research Lab
  • Allvoices.com
  • Amdocs
  • American Systems
  • Amgen
  • Anboto
  • artfuture
  • AT&T
  • Austral Capital Partners
  • Band of Angels
  • BBN Technologies
  • BC West Audit Div
  • Bechtel
  • Bell Labs Research
  • BestBuy.com
  • BIKO Publishing House
  • Blue Cross Blue Shield Texas
  • Boeing
  • Booz Allen Hamilton
  • Bosch Research and Tech Ctr
  • Breakpoint Books
  • British Telecom
  • Business Process Engineering
  • CA Technologies
  • Cambridge Semantics, Inc.
  • Cambridge Tech. Ventures
  • Carnegie Mellon Silicon Valley
  • CBS Interactive
  • Cedars-Sinai Medical Center
  • Charles River Ventures
  • Chevron
  • Cirque du Soleil
  • Cisco Systems
  • Clark & Parsia
  • Clear Blue Water
  • Cleveland Clinic
  • CNN
  • Coca Cola
  • CognitionTechnologies, Inc.
  • Computas AS
  • Computas
  • Core Analytics, LLC
  • Creative Commons
  • Department of Defense
  • DERI, NUI Galway
  • Det Norske Veritas
  • DISA TRANSCOM
  • DOCOMO Euro-Labs
  • eBay
  • Edison International
  • El Paso County Health Dept
  • Eli Lilly & Company
  • Elsevier / Collexis
  • EMC Consulting
  • Eqentia
  • Eurescom GmbH
  • European Bioinformatics Inst.
  • European Commission
  • European Environment Agency
  • Evri
  • Exadel Inc
  • Excellus Health Plan Inc
  • Extractiv
  • Facebook
  • Factual, Inc.
  • Fair Isaac
  • FamilySearch
  • FAO of the UN
  • Federal Computer Week
  • Financial Management Service
  • First Retail Inc.
  • Forrester Research
  • Franz Inc.
  • Gartner
  • Genentech, Inc.
  • Glisson Capital
  • Google
  • Guidewire Group, Inc.
  • Heart + Lung Research Institute
  • Heuer Media
  • Hewett Research
  • Hewlett-Packard
  • IBM Research
  • IBM
  • IDG Ventures
  • Impact
  • Indian Inst. of Adv. Research
  • Infochimps
  • Informatica
  • InfoWorld
  • Intel
  • Intellidimension, Inc.
  • Intelligent Software Solutions
  • Internal Revenue Service
  • International Monetary Fund
  • Intuit
  • ion interactive
  • Jafco Ventures
  • Japan Biological Informatics
  • Jet Propulsion Laboratory
  • Johnson & Johnson
  • Kaiser Permanente
  • Kapow Technologies
  • Karlsruhe Institute of Tech.
  • KazzaDrask Media
  • KLM
  • KMI, The Open University
  • Knowledge Based Systems Inc.
  • KONA
  • Kurzweilai.net
  • Language Computer Corp.
  • LexisNexis
  • LH Telecom
  • Library of Congress
  • Lightspeed Venture Partners
  • Link TV/ViewChange.org
  • Linux Gazette
  • Lockheed Martin
  • MarketingProfs
  • MarkLogic Corporation
  • Mashery, Inc.
  • MD Anderson Cancer Center
  • Media Research Associates
  • Menlo Ventures
  • Merck & Co., Inc.
  • MGH / Partners
  • MHS Capital
  • Microsoft Research
  • Microsoft
  • MIT
  • MITRE
  • Morgan Stanley
  • MyContextualAds
  • NASA
  • Nat’l Research Council Canada
  • Nat’l Library of Medicine
  • Nat’l Renewable Energy Lab
  • netlabs.org
  • Nfuse Partners
  • NHS National Innovation Centre
  • NIST
  • Nomura Research Institute
  • Northrop Grumman
  • Norwegian Defence Research
  • Novartis Pharma AG
  • Novo Nordisk A/S
  • NYTimes.com
  • ON24
  • ontoprise GmbH
  • Ontos AG
  • Ontotext
  • Open Data Registry
  • OpenLink Software
  • Oracle
  • Orbis Technologies, Inc.
  • Overstock.com
  • Oxford University
  • Pacific Northwest National Lab
  • Pfizer
  • Phase2 Technology
  • Pitney Bowes
  • Powerset Division of Bing
  • PricewaterhouseCoopers
  • Primal
  • Procter & Gamble
  • Public Library of Science
  • Purdue University
  • Raytheon
  • Razorfish
  • ReadWriteWeb
  • Recognos Financial
  • Recognos
  • Rensselaer Polytechnic Institute
  • Revelytix
  • Ritchie Capital
  • Salary.com
  • Salesforce.com
  • Saltlux
  • San Francisco Examiner
  • San Jose State University
  • Sandia National Laboratories
  • SAP
  • SAS Institute
  • SavantMD
  • Semantic Arts, Inc.
  • Semantic Engines
  • Semantic Seed
  • Semantic Systems
  • SemanticClarity
  • Semantifi
  • STI International
  • Shell
  • ShoppingNotes.com
  • Siemens Ltd. China
  • Silicon Impact
  • SiliconAngle
  • SLAC National Accelerator Lab
  • Smartlogic
  • Social Media Club
  • SocialPulse
  • SocialWhiteboard
  • SRI International
  • Standard & Poor’s
  • Stanford University
  • Structured Dynamics
  • Swedish Defence Research
  • Swissnex San Francisco
  • Syntactica
  • Talis
  • TechWeb
  • Ted Ventures, Inc.
  • Teradata
  • TextWise
  • The Angels’ Forum
  • The Cloud of Data
  • The Idea Travel Company
  • The New York Times
  • Thomson Reuters OpenCalais
  • Thomvest Ventures
  • TigerLogic Corporation
  • Time Inc
  • TopQuadrant
  • TripIt, Inc.
  • True Knowledge
  • Turner Broadcasting
  • UNISYS
  • University of Aberdeen
  • University of Alabama
  • University of Bolton
  • University of Latvia
  • University of Lodz
  • Univ. of Manchester
  • Univ. of New Brunswick
  • Univ. of New Hampshire
  • Univ. of Technology Sydney
  • University of Texas
  • University of Tokyo
  • University of Toronto
  • US Army
  • US EPA
  • US Government
  • VentureClef
  • VentureNe.ws
  • Ventures Technology Watch
  • VI investments
  • Vulcan Inc.
  • W3C
  • Wells Fargo
  • Wikimedia Foundation
  • World Bank Group
  • Yahoo! Research
  • zAgile Inc
  • Zemanta

Go on, add your name to this list. Make a killer speaking proposal that we just can’t turn down! But do it by next Monday, January 10!

Thanks,

Tony Shaw

SemTech Program Chair

New Career Opportunities Daily: The best jobs in media.

http://www.unibertsitatea.net/blogak/ixa/azaleko-sintaxiaren-tratamendua-ikasketa-automatikoko-tekniken-bidez
01/03/2011 - 12:50

Zuzenketa ortografiko automatikoa tresna lagungarria da zalantzarik gabe. Hala ere, oraindik aztertzeko unitatea hitz soltea izaten da. Testuen zuzenketa automatiko sakonago egin ahal izateko sintaxia ere kontuan hartu beharko da. Baina testu errealetan ohiko diren esaldi luze-luzeetan sintaxia ez da erraza.


Bertol Arrieta Kortajarena Ixakideak bere tesian Ikasketa Automatikoko teknikak aztertu eta erabiltzea izan du helburu, euskararen sintaxian eta zuzenketa automatikoan bi urrats aurrera egiteko.

Hau da tesiaren izenburu osoa:
Azaleko sintaxiaren tratamendua ikasketa automatikoko tekniken bidez: euskarako kateen eta perpausen identifikazioa eta bere erabilera koma-zuzentzaile batean.



Hala, euskarako kate- eta perpaus-identifikatzaile automatikoak sortu dira, ikasketa automatikoko teknikak hizkuntzaren ezagutzan oinarritutakoekin uztartuz. Modu honetan, testu bat emanda, makina gai da testu horretako sintagmak, perpausak eta esaldiak modu automatikoan identifikatzeko. Tresna hauek oso baliagarriak dira analisi sintaktiko automatiko osoa edo sakona bideratzeko, eta baita Hizkuntzaren Prozesamenduko hainbat arloetan aurrerapausoak egiteko ere: hala nola, informazioaren erauzketa, laburpenen sorkuntza, itzulpen automatikoa...

Horretaz gain, puntuazioaren erabilera jorratu da hizkuntzalaritza konputazionalaren ikuspegitik. Makinak hizkuntzaren ulermen osoa lor dezan, komak duen garrantzia aztertu da, batez ere. Hala, euskarako koma-zuzentzaile automatiko bat garatu da ikasketa automatikoko teknikak baliatuz. Horretarako, aurrez sortutako kate- eta perpaus-identifikatzaileek ematen duten informazioa erabili da. Koma-zuzentzaile hau XUXENg euskarako estilo- eta gramatika-zuzentzailean txertatu nahi da. Gainera, baliagarria izango da euskarako analizatzaile eta desanbiguatzaile sintaktikoak hobetzeko, eta baita ahotsaren ezagutza sistemetan integratzeko ere.

Tesi osoa helbide honetan jaso daiteke. Pasa den uztailearen 27an aurkeztu izan da, eta zuzendariak Iñaki Alegria eta Arantza Diaz de Ilarraza izan dira.

http://www.speechtechmag.com/Articles/Column/Standards/W3C-Launches-HTML-Speech-Incubator-Group-72974.aspx
01/01/2011 - 19:50

Ultimate goal is to develop tools to better integrate speech with the Web

http://permalink.gmane.org/gmane.science.linguistics.corpora/12189
12/29/2010 - 16:05
Dear List Moderator,

I attach below the call for papers on a special issue of the Computational
Linguistics journal on parsing morphologically rich languages.
Will you be so kind to post our call for papers to the list?

Thanks and Merry Christmas,

http://semanticweb.com/17074_b17074?c=rss
12/27/2010 - 13:00

iGlue? IceCube? It’s not the latest or revised rap music sensations but the names of a new semantic search engine service, and its bookmark widget or plug-in that helps users connect with more information around a subject, be it text or media resources, either from within its own borders or on other sites they’re browsing.

Because it’s semantic, it’s designed to intuit the difference between Ice Cube the real rapper and, well, IceCube the widget. And, users can use the widget to click on entities around the web – including their own pages – and adding annotations to them right from there, which can then be viewed by anyone in real time. “iGlue changes the way of connecting online content. A layer of metadata can be overlaid on web pages where the user can organize knowledge independently from the concrete location of the information,” says founder Peter Vasko, painting his vision of the service. “No matter to which page the user adds value by inserting relevant annotations, his or her changes will appear on every other page where the same content is present. Imagine that instead of scattering comments, opinions etc. over several web sites where a given product is mentioned or a news article is quoted, they can be accessed everywhere in their entirety. So we are able to join the same conversation from any place the article appeared.”

continued…

New Career Opportunities Daily: The best jobs in media.

http://semanticweb.com/semantic-company-anboto-tops-innovate100_b17134?c=rss
12/23/2010 - 21:00

 A year-long global competition for startups has ended, and a company that successfully leverages Semantic Technologies has taken top prize.

Innovate! 2010 - GuidewireGroupOver the last year, the startup analyst and advisory firm Guidewire Group and its partners conducted Pitch Slam events in 30 cities on five continents, evaluating startups based on the G/SCORE(TM) Assessment Methodology. One of these Pitch Slams took place at the SemTech Conference last June.  The G/SCORE measures companies on seven key factors of business execution providing concrete feedback on where the company is today and where it needs to go in order to build value into the business.

More than 500 companies participated in the 2010 program, and the top ones were chosen for the final Innovate!100 list.

Anboto Group - Winner of the Innovate!2010 CompetitionIn this inaugural year, sitting atop that list of 100 was Anboto Group. The Bilbao-based company leverages semantic technology expertise to provide solutions that enable easy and smart interaction in natural language between customers and computers. continued…

New Career Opportunities Daily: The best jobs in media.

http://permalink.gmane.org/gmane.science.linguistics.corpora/12175
12/22/2010 - 08:40
-----------------------------------------------------------------------------------------------------
Generating Questions from Sentences Corpus (Task B QGSTEC2010)
-----------------------------------------------------------------------------------------------------
 
Now available at: http://computing.open.ac.uk/coda/data.html 
 
A corpus of over 1000 questions (both human and machine-generated) paired with declarative sentences (and target question type) from which they were generated. The automatically generated questions are accompanied by ratings from several raters according to five criteria (relevance, question type, syntactic correctness and fluency, ambiguity, and variety). 
 
The corpus is the result of Task B of the 2010 Question Generation Shared Task and Evaluation Challenge (QGSTEC2010). The UK Engineering and Physical Sciences Research Council partially supported the effort on Task B through grant EP/G020981/1 (The CODA project, http://computing.open.ac.uk/coda/).

 
 
 

Syndicate content