HyTex Phase IIEvaluation of the previously implemented strategies of text-to-hypertext conversion Development of topically motivated strategies of text-to-hypertext conversion Extension of the terminological and lexical resources Automation of pre-processing for automatic strategies of text-to-hypertext conversion The second phase of the HyTex project began in August 2005. The foci of the second phase are: the evaluation of previously implemented strategies of text-to-hypertext conversion and especially developing topically motivated strategies of text-to-hypertext conversion as well as further extending our terminological or lexical resources respectively. Evaluation of the previously implemented strategies of text-to-hypertext conversion By means of a user study, we want to examine if the Development of topically motivated strategies of text-to-hypertext conversionFurthermore, we examine in which ways methods of lexical chaining can be applied to German specialized texts. We are therefore currently implementing a lexical chainer which—in contrast to classic approaches for the English—is emphasized by several steps of (pre-)processing. The lexical chainer will access various semantic resources. We are especially interested in the question, by means of which extensions can we achieve a transition between lexical and topical chaining. The following publication covers the results of an annotation pre-study: Web-based Annotation of Anaphoric Relations and Lexical Chains (project publication with our partners in Bielefeld; PDF 163 KB) Extension of the terminological and lexical resources In order to combine the Automation of pre-processing for automatic strategies of text-to-hypertext conversionOur pilot study on automatic annotation of definitional text segments has already supplied interesting results and search patterns. Based on these results, we want to examine how these patterns may be employed for the (fully) automatic annotation of definitional text segments. Additionally, we plan to use these results as a basis for the automatic reconstruction and annotation of topical structures. ( Deutsch ) |