Natural language processing (NLP) is a subfield of artificial intelligence and linguistics. It studies the problems of automated generation and understanding of natural human languages. Natural language generation systems convert information from computer databases into normal-sounding human language, and natural language understanding systems convert samples of human language into more formal representations that are easier for computer programs to manipulate.
Early systems such as SHRDLU, working in restricted "blocks worlds" with restricted vocabularies, worked extremely well, leading researchers to excessive optimism which was soon lost when the systems were extended to more realistic situations with real-world ambiguity and complexity.
More on [ Natural language processing ]
Style Guides :: Writers Resources
Machine Learning :: Artificial Intelligence
Linguistics :: Social Sciences
Computational Linguistics :: Linguistics

Book - String Searching Algorithms: exact / approximate string matching, edit distances, common sequences, longest repetitions.
Meta Description: [ String Searching Algorithms:
exact / approximate string matching, edit distances, common sequences,
longest repetitions. ]
404
ATLAS: A Mixed-Initiative, Multimodal Tutoring System - The overall goal of the Atlas project is to develop supplementary software that is added to second generation Intelligent Tutoring Systems (ITS) in order to obtain third generation Intelligent Tutoring System that will carry on a natural language, knowledge-constructing dialogue with students.
Australasian Language Technology Association - Information about language technology in Australia and New Zealand. Includes mailing lists, general information, and links to research and training in Australasia.
Meta Description: [ Australasian Language Technology Association ]
Automated Analysis of Natural Language Texts - Megaputer white paper about popular text analysis methods and possible business opportunities. Part of documentation for TextAnalyst software.
Bibliography of Research in Natural Language Generation - A part of the Computer Science Bibliography Collection, providing references for papers published through 1994. Searchable, browsable.
Meta Description: [ Bibliography of Research in Natural Language Generation. This bibliography is a part of the Computer Science Bibliography Collection. ]
Clyr Inc. - Create custom natural language processing software.
500
CMU AI Repository - NLP area - Machine readable parts of NLP textbooks, NLP corpora and dictionaries, fonts, and software.
Crux Editions - Scientific publisher and consultant specializing in language technology.
500
DISC Best Practice Guide - Information, standards and tools to support the development of Spoken Language Dialogue Systems.
EAGLES: Expert Advisory Group on Language Engineering Standards - A European Commission initiative to provide standards for linguistic engineering applications such as corpora, lexicons, mark-up languages and software. Contains current guidelines.
ELDA: The European Language Resource Distribution Agency - The practical arm of the ELRA agency, dedicated to solving practical and legal problems in the distribution of language resources. Legal information, catalog of resources for sale, current projects.
European Language Resource Association - A nonprofit organization serving the commercial language resource community. Site features quarterly newsletter, official definition of language resource, and member services.
Meta Description: [ ELRA: European Language Resources Association ]
Grammatical Inference - Repository of information on grammatical inference, automata induction, and language acquisition.
Language Technologies Institute - A research program at Carnegie Mellon University, focusing on machine translation and speech processing. Includes news, admissions procedures, staff profiles and current projects.
Language Technology World - A comprehensive portal on the wide range of technologies that deal with human language. News, conferences, projects, organisations, systems, and resources.
Meta Description: [ Language Technology World is an ontology-based virtual
information center on the wide spectrum of technologies for dealing with human
languages. It is a free service provided to the R & D community, potential users
of language technologies, students and other interested... ]
Link Grammar - A formalism for the computational parsing of English. Includes parser with downloadable source code, English-to-German translator, documentation, bibliography.
Meta Description: [ The link grammar system for
parsing English is explained. The implementation is available for
download. ]
MIETTA-II : A Multilingual Information Environment - Project funded by the European Commission which aims at developing a system for setting up multi-lingual information portals using natural language technology. Site is available in English, German, Italian and French, and includes product papers and annual reports.
Meta Description: [ MIETTA-II is a take-up action project funded by the European Commission within the 5th Framework Programme for Research and Development. MIETTA-II wird als Take-up Action von der Europischen Kommission im Rahmen des 5. Rahmenprogramms fr Forschung und Entwicklung gefrdert. MIETTA-II una azio... ]
Mingsee, Inc. - Develops systems which enable computers to analyse and understand text by using proprietary algorithms.
MITRE Language Technology Projects - Descriptions of projects covering a wide range of language technology applications. Includes a slide presentation and summary chart for each project.
Meta Description: [ MITRE is a not-for-profit national technology resource that provides systems engineering, research and development, and information technology support to the government. It operates federally funded research and development centers for the DOD, the FAA, and the IRS. ]
500
Natural Language FAQs - Selected FAQ lists from Usenet groups related to natural language processing.
PetaMem - Corporation developing natural language technology solutions for global business. Includes corporate information, some online demos and catalog of services.
References on Zipf's Law - An academic bibliography on this relation between a word's frequency in a text and its place in a ranking of words by frequency. Includes some online texts.
SULTRY: The Sydney University Language Technology Research Laboratory - An institute which applies natural language processing research to problems of human-computer interaction. Selected publications, academic information, and descriptions of current projects.
Meta Description: [ A computational linguistics research centre in Sydney, Australia ]
Survey of the State of the Art in Human Language Technology - A 1996 high-level review of: spoken/written input, analysis and understanding, generation, speech output, discourse and dialogue, document processing, multiple languages and modes, transmission and storage, mathematical methods, other resources, how to evalate an NLP program.
The Natural Language Software Registry - A concise summary of the capabilities and sources of language processing software available to researchers. It comprises academic, commercial and proprietary software with theory, specifications and terms on which it can be acquired clearly indicated.
VerbMobil - Mobile translation system for the translation of spontaneous speech in face-to-face situations.
| Lattice Uncertainty Visualization:Lattice graphs are used as underlying data structures in many statistical processing systems, including natural language processing. Lattices compactly represent multiple possible outputs and are usually hidden from users. We present a novel visualization intended to reveal the uncertainty and variability inherent in statistically-derived lattice structures. Applications such as machine translation and automated speech recognition typically present users with a best-guess about the appropriate output, with apparent complete confidence. |