Classification along Genre Dimensions

Exploring a Multidisciplinary Problem

Mikael Gunnarsson

Academic dissertation for the Degree of Doctor of Philosophy in Library and Information Science at the University of Borås, to be publicly defended on Friday 29 April 2011 at 13.00 in lecture room M506, the University of Borås, Allégatan 1, Borås

Swedish School of Library and Information Science, the University of Borås

Title: Classification along Genre Dimensions

Language: English, with a short summary in Swedish

Available: http://hdl.handle.net/2320/7920

ISBN 978-91-85659-72-2; ISSN 1103-6990

This thesis treats the sociotechnical notion of genre as a conflation of a communicative situation and a community of practices involved in producing and using documents. It explores the ways in which documents may be mapped to the sociocultural contexts from which they emanate. In other words, it is concerned with the classification of documents along genre dimensions, with the purpose of supporting information seeking.

The thesis positions itself within Library and Information Science in two parts. Firstly, a theoretical framework for classification along genre dimensions is developed based on relevant theories and practices from Library and Information Science, as well as from sociologically motivated Linguistics and neighbouring domains. Secondly, a setup for experiments, including feature derivation and reannotation of existing corpora, is designed in order to explore the relationship between text documents and genres, and the extent to which a mapping of documents to genres can be realized in real-world applications.

The experimental part of the thesis relies on an existing corpus for genre classification research, used in comparable research, with a slight extension. In the experiments, combinations of feature sets and target genres are evaluated using traditional estimators of classification performance.

The outcome of the first part of the work indicates that the notion of genre with respect to classification is largely undertheorized in Library and Information Science. We need to know more about the nature of different genres, how to robustly identify the documents of a genre, and the impact genres have on information seeking. Interdisciplinary collaborative research would be most beneficial in these efforts. The results of the experiments in the second part are fairly inconclusive with respect to the evaluation of feature sets, but it can be concluded that the optimal combination of feature sets and target genres is a crucial issue for high performance and worthy of more investigation.

Keywords: Genre, Library and information science, Document studies, Classification, Knowledge organization, Library classification, Text linguistics, Sociolinguistics, Speech act theory, Machine learning, Support vector machines, k-NN classification, K-means clustering
