Please use this identifier to cite or link to this item:
http://hdl.handle.net/10609/31661
Title: | Parameter-free agglomerative hierarchical clustering to model learners' activity in online discussion forums |
Author: | Cobo Rodríguez, Germán |
Director: | Santamaría Pérez, Eugènia Morán Moreno, Jose Antonio |
Others: | Universitat Oberta de Catalunya. Internet Interdisciplinary Institute (IN3) |
Abstract: | The analysis of learners' activity in online discussion forums leads to a highly context-dependent modelling problem, which can be posed from both theoretical and empirical approaches. When this problem is tackled from the data mining field, a clustering-based perspective is usually adopted, thus giving rise to a clustering scenario where the real number of clusters is a priori unknown. Hence, this approach reveals an underlying problem, which is one of the best-known issues of the clustering paradigm: the estimation of the number of clusters, habitually selected by user according to some kind of subjective criterion that may easily lead to the appearance of undesired biases in the obtained models. With the aim of avoiding any user intervention in the cluster analysis stage, two new cluster merging criteria are proposed in the present thesis, which allow to implement a novel parameter-free agglomerative hierarchical algorithm. A complete set of experiments indicate that the new clustering algorithm is able to provide optimal clustering solutions in the face of a great variety of clustering scenarios, both having the ability to deal with different kinds of data and outperforming clustering algorithms most widely used in practice. Finally, a two-stage analysis strategy based on the subspace clustering paradigm is proposed to properly tackle the issue of modelling learners' participation in the asynchronous discussions. In combination with the new clustering algorithm, the proposed strategy proves to be able to limit user's subjective intervention to the interpretation stages of the analysis process and to lead to a complete modelling of the activity performed by learners in online discussion forums. |
Keywords: | parameter-free clustering educational data mining learner behaviour modelling |
Document type: | info:eu-repo/semantics/doctoralThesis |
Issue Date: | 22-Apr-2014 |
Publication license: | http://creativecommons.org/licenses/by-nc-nd/3.0/es/ |
Appears in Collections: | Tesis doctorals |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
thesis_gcobo.pdf | 6,06 MB | Adobe PDF | View/Open |
Share:
This item is licensed under a Creative Commons License