TY - GEN
T1 - Identifying strategic information from scientific articles through sentence classification
AU - Ibekwe-Sanjuan, Fidelia
AU - Chen, Chaomei
AU - Pinho, Roberto
PY - 2008
Y1 - 2008
N2 - We address here the need to assist users in rapidly accessing the most important or strategic information in the text corpus by identifying sentences carrying specific information. More precisely, we want to identify contribution of authors of scientific papers through a categorization of sentences using rhetorical and lexical cues. We built local grammars to annotate sentences in the corpus according to their rhetorical status: objective, new things, results, findings, hypotheses, conclusion, related-word, future work. The annotation is automatically projected automatically onto two other corpora to test their portability across several domains. The local grammars are implemented in the Unitex system. After sentence categorization, the annotated sentences are clustered and users can navigate the result by accessing specific information types. The results can be used for advanced information retrieval purposes.
AB - We address here the need to assist users in rapidly accessing the most important or strategic information in the text corpus by identifying sentences carrying specific information. More precisely, we want to identify contribution of authors of scientific papers through a categorization of sentences using rhetorical and lexical cues. We built local grammars to annotate sentences in the corpus according to their rhetorical status: objective, new things, results, findings, hypotheses, conclusion, related-word, future work. The annotation is automatically projected automatically onto two other corpora to test their portability across several domains. The local grammars are implemented in the Unitex system. After sentence categorization, the annotated sentences are clustered and users can navigate the result by accessing specific information types. The results can be used for advanced information retrieval purposes.
UR - http://www.scopus.com/inward/record.url?scp=84889796765&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84889796765&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84889796765
T3 - Proceedings of the 6th International Conference on Language Resources and Evaluation, LREC 2008
SP - 1518
EP - 1522
BT - Proceedings of the 6th International Conference on Language Resources and Evaluation, LREC 2008
PB - European Language Resources Association (ELRA)
T2 - 6th International Conference on Language Resources and Evaluation, LREC 2008
Y2 - 28 May 2008 through 30 May 2008
ER -