Por favor, use este identificador para citar o enlazar este ítem: http://hdl.handle.net/10609/109800
Registro completo de metadatos
Campo DC Valor Lengua/Idioma
dc.contributor.authorCaballé, Santi-
dc.contributor.authorXHAFA, FATOS-
dc.contributor.otherUniversitat Oberta de Catalunya. Internet Interdisciplinary Institute (IN3)-
dc.contributor.otherUniversitat Politècnica de Catalunya (UPC)-
dc.date.accessioned2020-02-18T08:23:45Z-
dc.date.available2020-02-18T08:23:45Z-
dc.date.issued2013-03-18-
dc.identifier.citationCaballé, S. & Xhafa, F. (2013). Distributed-based Massive Processing of Activity Logs for Efficient User Modeling in a Virtual Campus. Cluster Computing, 16(4), 829-844. doi: 10.1007/s10586-013-0256-9es
dc.identifier.issn1386-7857MIAR
-
dc.identifier.urihttp://hdl.handle.net/10609/109800-
dc.description.abstractThis paper reports on a multi-fold approach for the building of user models based on the identification of navigation patterns in a virtual campus, allowing for adapting the campus' usability to the actual learners' needs, thus resulting in a great stimulation of the learning experience. However, user modeling in this context implies a constant processing and analysis of user interaction data during longterm learning activities, which produces huge amounts of valuable data stored typically in server log files. Due to the large or very large size of log files generated daily, the massive processing is a foremost step in extracting useful information. To this end, this work studies, first, the viability of processing large log data files of a real Virtual Campus using different distributed infrastructures. More precisely, we study the time performance of massive processing of daily log files implemented following the master-slave paradigm and evaluated using Cluster Computing and PlanetLab platforms. The study reveals the complexity and challenges of massive processing in the big data era, such as the need to carefully tune the log file processing in terms of chunk log data size to be processed at slave nodes as well as the bottleneck in processing in truly geographically distributed infrastructures due to the overhead caused by the communication time among the master and slave nodes. Then, an application of the massive processing approach resulting in log data processed and stored in a well-structured format is presented. We show how to extract knowledge from the log data analysis by using the WEKA framework for data mining purposes showing its usefulness to effectively build user models in terms of identifying interesting navigation patters of on-line learners. The study is motivated and conducted in the context of the actual data logs of the Virtual Campus of the Open University of Catalonia.en
dc.format.mimetypeapplication/pdf-
dc.language.isoeng-
dc.publisherCluster Computing-
dc.relation.ispartofCluster Computing, 2013, 16(4)-
dc.relation.urihttps://doi.org/10.1007/s10586-013-0256-9-
dc.rightsCC BY-NC-ND-
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/es/-
dc.subjectmassive processingen
dc.subjectlog filesen
dc.subjectcluster computingen
dc.subjectPlanetLaben
dc.subjectweb mining usageen
dc.subjectWEKA frameworken
dc.subjectnavigation patternsen
dc.subjectvirtual campusen
dc.subjectprocessament massiuca
dc.subjectprocesamiento masivoes
dc.subjectarxius de registreca
dc.subjectarchivos de registroes
dc.subjectinformàtica en clústerca
dc.subjectinformática en clústeres
dc.subjectPlanetLabca
dc.subjectPlanetLabes
dc.subjectminería del uso de la webes
dc.subjectmineria de l'ús de la webca
dc.subjectentorn WEKAca
dc.subjectentorno WEKAes
dc.subjectpatrons de navegacióca
dc.subjectpatrones de navegaciónes
dc.subjectcampus virtualca
dc.subjectcampus virtuales
dc.subject.lcshWeb usage miningen
dc.titleDistributed-based massive processing of activity logs for efficient user modeling in a virtual campus-
dc.typeinfo:eu-repo/semantics/article-
dc.subject.lemacMineria de webca
dc.subject.lcshesMinería de webes
dc.rights.accessRightsinfo:eu-repo/semantics/openAccess-
dc.identifier.doi10.1007/s10586-013-0256-9-
dc.gir.idAR/0000003182-
dc.type.versioninfo:eu-repo/semantics/acceptedVersion-
Aparece en las colecciones: Articles cientÍfics
Articles

Ficheros en este ítem:
Fichero Descripción Tamaño Formato  
Caballe_Xhafa_CC_Distributed.pdfPost-print1,2 MBAdobe PDFVista previa
Visualizar/Abrir