Please use this identifier to cite or link to this item: http://hdl.handle.net/10609/31661
Title: Parameter-free agglomerative hierarchical clustering to model learners' activity in online discussion forums
Author: Cobo Rodríguez, Germán  
Director: Santamaría Pérez, Eugènia
Morán Moreno, Jose Antonio  
Others: Universitat Oberta de Catalunya. Internet Interdisciplinary Institute (IN3)
Abstract: The analysis of learners' activity in online discussion forums leads to a highly context-dependent modelling problem, which can be posed from both theoretical and empirical approaches. When this problem is tackled from the data mining field, a clustering-based perspective is usually adopted, thus giving rise to a clustering scenario where the real number of clusters is a priori unknown. Hence, this approach reveals an underlying problem, which is one of the best-known issues of the clustering paradigm: the estimation of the number of clusters, habitually selected by user according to some kind of subjective criterion that may easily lead to the appearance of undesired biases in the obtained models. With the aim of avoiding any user intervention in the cluster analysis stage, two new cluster merging criteria are proposed in the present thesis, which allow to implement a novel parameter-free agglomerative hierarchical algorithm. A complete set of experiments indicate that the new clustering algorithm is able to provide optimal clustering solutions in the face of a great variety of clustering scenarios, both having the ability to deal with different kinds of data and outperforming clustering algorithms most widely used in practice. Finally, a two-stage analysis strategy based on the subspace clustering paradigm is proposed to properly tackle the issue of modelling learners' participation in the asynchronous discussions. In combination with the new clustering algorithm, the proposed strategy proves to be able to limit user's subjective intervention to the interpretation stages of the analysis process and to lead to a complete modelling of the activity performed by learners in online discussion forums.
Keywords: parameter-free clustering
educational data mining
learner behaviour modelling
Document type: info:eu-repo/semantics/doctoralThesis
Issue Date: 22-Apr-2014
Publication license: http://creativecommons.org/licenses/by-nc-nd/3.0/es/  
Appears in Collections:Tesis doctorals

Files in This Item:
File Description SizeFormat 
thesis_gcobo.pdf6,06 MBAdobe PDFThumbnail
View/Open