论文简介 |
It is a challenging task to discover information from a large amount of data in an open domain∗. To address this problem, an event network framework has been proposed in this paper to address this challenge. It is in fact an empirical construct for exploring open information, which is composed of three steps: document event detection, event network construction and event network analysis. First, documents are clustered or classified into document events, which reduces the impact of noisy and heterogeneous resources on information extraction. In the second step, linguistic units are ex- tracted from each document event and combined into an event network. The event network contains structural knowledge of a document event and enables content-oriented retrieval. Final step involves development of the, techniques to analyze the event network. We give examples of exploring open information via event network. |