搜索结果: 1-13 共查到“计算语言学 Large”相关记录13条 . 查询时间(0.158 秒)
Large Linguistic Corpus Reduction with SCP Algorithms
Large Linguistic Corpus Reduction SCP Algorithms
2015/9/16
Linguistic corpus design is a critical concern for building rich annotated corpora useful in different domains of applications. For example, speech technologies such as ASR (Automatic Speech Recogniti...
A Large-Scale Pseudoword-Based Evaluation Framework for State-of-the-Art Word Sense Disambiguation
Large-Scale Pseudoword State-of-the-Art Word Sense Disambiguation
2015/9/14
The evaluation of several tasks in lexical semantics is often limited by the lack of large amounts of manual annotations, not only for training purposes, but also for testing purposes. Word Sense Disa...
Large-Scale Induction and Evaluation of Lexical Resources from the Penn-II and Penn-III Treebanks
Large-Scale Induction Evaluation of Lexical Resources
2015/8/31
We present a methodology for extracting subcategorization frames based on an automaticlexical-functional grammar (LFG) f-structure annotation algorithm for the Penn-II and Penn-III Treebanks. We extra...
The need to correct garbled strings arises in many areas of natural language processing. If a dictionary is available that covers all possible input tokens, a natural set of candidates for correcting ...
Exclamatives and heightened emotion: Extracting pragmatic generalizations from large corpora
corpus pragmatics exclamatives expressives logistic regression
2015/6/15
Exclamatives like What a dump!, Wow!, and Boy, you’ve grown! are, when uttered in context, rich in information about the speaker’s attitudes. Drawing on evidence from about 100, 000 online product rev...
AUTOMATIC ACQUISITION OF A LARGE SUBCATEGORIZATION DICTIONARY FROM CORPORA
AUTOMATIC ACQUISITION LARGE SUBCATEGORIZATION DICTIONARY CORPORA
2015/6/12
This paper presents a new method for producing a dictionary of subcategorization frames from unlabelled text corpora. It is shown that statistical filtering of the results of a finite state parser run...
Large Multimedia Archive for World Languages
Multimedia Archive Lifecycle Management Metadata Longterm Preservation Data Replication Standards Multimedia Access
2015/4/9
In this paper, we describe the core pillars of a large archive of language material recorded worldwide partly about languages that are highly endangered. The bases for the documentation of these langu...
Managing very large Multimedia Archives and their Integration into Federations
Managing very large Multimedia Archives Integration into Federations
2015/4/3
While the natural sciences are used to deal with terabytes and even petabytes of data, such dimensions are new in the domain of linguistics resp. in the humanities. The reasons for this are manifold; ...
EVALUATION OF MULTI-LEVEL CONTEXT-DEPENDENT ACOUSTIC MODEL FOR LARGE VOCABULARY SPEAKER ADAPTATION TASKS
Multi-level acoustic model contextdependent model speaker adaptation discriminative training LVCSR
2014/11/27
In this paper, we investigate the ability of a recently proposed discriminatively trained, multi-level context-dependent acoustic model to adapt to a new speaker in both supervised and unsupervised ad...
DISCRIMINATIVE TRAINING OF HIERARCHICAL ACOUSTIC MODELS FOR LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION
hierarchical acoustic modeling discriminative training LVCSR
2014/11/27
In this paper we propose discriminative training of hierarchical acoustic models for large vocabulary continuous speech recognition tasks. After presenting our hierarchical modeling framework, we desc...
HIERARCHICAL LARGE-MARGIN GAUSSIAN MIXTURE MODELS FOR PHONETIC CLASSIFICATION
hierarchical classifier committee classifier large margin GMM phonetic classification
2014/11/27
In this paper we present a hierarchical large-margin Gaussian mixture modeling framework and evaluate it on the task of phonetic classification. A two-stage hierarchical classifier is trained by alter...
Large-Scale CCG Induction from the Groningen Meaning Bank
Large-Scale CCG Groningen Meaning Bank
2015/1/24
Large-Scale CCG Induction from the Groningen Meaning Bank.
Extract Chinese Unknown Words from a Large-scale Corpus Using Morphological and Distributional Evidences
a Large-scale Corpus Using Morphological Distributional Evidences
2015/1/24
Extract Chinese Unknown Words from a Large-scale Corpus Using Morphological and Distributional Evidences.