Please use this identifier to cite or link to this item:

http://hdl.handle.net/10609/31661
Title: Parameter-free agglomerative hierarchical clustering to model learners' activity in online discussion forums
Author: Cobo Rodríguez, Germán  
Director: Santamaría Pérez, Eugènia
Morán Moreno, José Antonio
Others: Universitat Oberta de Catalunya. Internet Interdisciplinary Institute (IN3)
Keywords: parameter-free clustering
educational data mining
learner behaviour modelling
Issue Date: 22-Apr-2014
Publisher: Universitat Oberta de Catalunya. Internet Interdisciplinary Institute (IN3)
Abstract: The analysis of learners' activity in online discussion forums leads to a highly context-dependent modelling problem, which can be posed from both theoretical and empirical approaches. When this problem is tackled from the data mining field, a clustering-based perspective is usually adopted, thus giving rise to a clustering scenario where the real number of clusters is a priori unknown. Hence, this approach reveals an underlying problem, which is one of the best-known issues of the clustering paradigm: the estimation of the number of clusters, habitually selected by user according to some kind of subjective criterion that may easily lead to the appearance of undesired biases in the obtained models. With the aim of avoiding any user intervention in the cluster analysis stage, two new cluster merging criteria are proposed in the present thesis, which allow to implement a novel parameter-free agglomerative hierarchical algorithm. A complete set of experiments indicate that the new clustering algorithm is able to provide optimal clustering solutions in the face of a great variety of clustering scenarios, both having the ability to deal with different kinds of data and outperforming clustering algorithms most widely used in practice. Finally, a two-stage analysis strategy based on the subspace clustering paradigm is proposed to properly tackle the issue of modelling learners' participation in the asynchronous discussions. In combination with the new clustering algorithm, the proposed strategy proves to be able to limit user's subjective intervention to the interpretation stages of the analysis process and to lead to a complete modelling of the activity performed by learners in online discussion forums.
Language: English
URI: http://hdl.handle.net/10609/31661
Appears in Collections:Doctoral Thesis

Share:
Export:
Files in This Item:
File Description SizeFormat 
thesis_gcobo.pdf6.06 MBAdobe PDFView/Open

This item is licensed under a Creative Commons License Creative Commons