SlideShare a Scribd company logo
1 of 13
Download to read offline
Backup
Analysing User Knowledge, Competence and Learning
during Online Activities
Stefan Dietze, L3S Research Center, Hannover
26.09.2017
CNR Institute for Educational Technologies, Palermo
27/09/17 1Stefan Dietze
Research areas
 Web science, Information Retrieval, Semantic Web, Social Web
Analytics, Knowledge Discovery, Human Computation
 Interdisciplinary application areas: digital humanities,
TEL/education, Web archiving, mobility, ...
Some projects
Research @ L3S
27/09/17 2
 See also: http://www.l3s.de
Stefan Dietze
 Pavlos Fafalios (L3S)
 Besnik Fetahu (L3S)
 Elena Demidova (L3S)
 Ujwal Gadiraju (L3S)
 Eelco Herder (L3S)
 Ivana Marenzi (L3S)
 Nicolas Tempelmeier (L3S)
 Ran Yu (L3S)
 Markus Rokicki (L3S)
 Renato Joao (L3S, PUC Rio)
Acknowledgements: Team
27/09/17 3Stefan Dietze
 Mathieu d‘Aquin (The Open University, UK)
 Mohamed Ben Ellefi (LIRMM, France)
 Davide Taibi (CNR, Italy)
 Konstantin Todorov (LIRMM, France)
 ...
27/09/17 4
Learning Analytics on the Web/for online learning ?
Stefan Dietze
 Anything can be a learning resource
 The activity makes the difference (not the
resource): i.e. how a resource is being used
 Learning Analytics in online/non-learning
environments?
o Activity streams,
o Social graphs (and their evolution),
o Behavioural traces (mouse movements,
keystrokes)
o ...
 Research challenges:
o How to detect „learning“?
o How to detect learning-specific notions
such as „competences“, „learning
performance“ etc?
27/09/17 5
„AFEL – Analytics for Everyday (Online) Learning“
Stefan Dietze
Examples of AFEL data sources:
• Activity streams and behavioral traces
• L3S Twitter Crawl: 6 bn tweets
• Common Crawl (2015): 2 bn documents
• Web Data Commons (2016): 44 bn quads
• „German Academic Web“: 6 TB Web crawl
• Web search query logs
• Wikipedia edit history: 3 M edits/month
(engl.)
• ....
 H2020 project (since 12/2015) aimed at understanding/supporting learning in social Web environments
Challenges/Tasks in AFEL & beyond: some examples
27/09/17 6Stefan Dietze
I Efficient data capture
 Crawling & extracting activity data
 Crawling, extracting and indexing learning
resources (eg Common Crawl)
II Efficient data analysis
 Understanding learning resources: entity
extraction & clustering on large Web crawls of
resources
 “Search as learning”: detecting learning in
heterogeneous search query logs & click streams
 Detecting learning activities: detection of learning
pattern (eg competent behavior) in absence of
learning objectives & assessments (!)
o Obtaining performance indicators from
behavioral traces?
o Quasi-experiments in crowdsourcing
platforms to obtain training data
Gadiraju, U., Demartini, G., Kawase, R., Dietze, S. Human beyond the
Machine: Challenges and Opportunities of Microtask
Crowdsourcing. In: IEEE Intelligent Systems, Volume 30 Issue 4 –
Jul/Aug 2015.
Gadiraju, U., Kawase, R., Dietze, S, Demartini, G., Understanding
Malicious Behavior in Crowdsourcing Platforms: The Case of
Online Surveys. ACM CHI Conference on Human Factors in Computing
Systems (CHI2015), April 18-23, Seoul, Korea.
Gadiraju, U., Demartini, G., Kawase, R., Dietze, S. Human beyond
the Machine: Challenges and Opportunities of Microtask
Crowdsourcing. In: IEEE Intelligent Systems, Volume 30 Issue 4 –
Jul/Aug 2015.
Gadiraju, U., Fetahu, B., Kawase, R., Siehndel, P., Dietze, S., Crowd
Anatomy Beyond the Good and Bad -- Behavioral Traces for
Crowd Worker Modeling and Pre-selection. The Journal of
Collaborative Computing and Work Practices (CSCW), under review.
27/09/17 7Stefan Dietze
Predicting competence in online users?
Capturing assessment data: microtasks in Crowdflower
 “Content Creation (CC)”: transcription of captchas
 “Information Finding (IF)”: middle name of famous persons
 1800 assessments: 2 tasks * 3 durations * 3 difficulty levels
* 100 users (per assessment)
Level 1
„Daniel Craig“
Level 2
„George Lucas“
(profession: Archbishop)
Level 3
„Brian Smith“
(profession: Ice Hockey, born: 1972)
Behavioral Traces: keystrokes- and mouse movements
 timeBeforeInput, timeBeforeClick
 tabSwitchFreq
 windowToggleFreq
 openNewTabFreq
 WindowFocusFrequency
 totalMouseMovements
 scrollUpFreq, scrollDownFreq
 ….
 Total amount of events: 893.285 (CC Tasks), 736.664 (IF Tasks)
Find the middle name of:
27/09/17 8Stefan Dietze
Behavioural traces to predict competence?
Training data
 Manual annotation of 1800 assessments
 Performance types [CHI15]:
o “Competent Worker” ,
o “Diligent Worker”
o “Fast Deceiver”
o “Incompetent Worker”
o “Rule Breaker”
o “Smart Deceiver”
o “Sloppy Worker”
 Prediction of performance types from
behavioral traces?
Predicting learner types from behavioral traces
 “Random Forest Classifier” (per task)
 10-fold cross validation
 Prediction performance: Accuracy, F-Measure
Gadiraju, U., Fetahu, B., Kawase, R., Siehndel, P., Dietze, S.,
Crowd Anatomy Beyond the Good and Bad -- Behavioral
Traces for Crowd Worker Modeling and Pre-selection. The
Journal of Collaborative Computing and Work Practices
(CSCW), under review.
Results
 Longer assessments  more signals
 Simpler assessments  more conclusive signals
 “Competent Workers” (CW, DW): accuracy of 91% respectively 87%
 Most significant features: “TotalTime”, “TippingPoint”,
“MouseMovementFrequency”, “WindowFocusFrequency”
27/09/17 9Stefan Dietze
Other features to predict competence in learning/assessments?
“Dunning-Kruger Effect”
 Incompetence in task/domain reduces capacity to
recognise/assess own incompetence
Research question
 Self-assessment as feature to predict competence?
Results
 Self-assessment as (additional) reliable indicator of
competence (94% accuracy), superior to mere
performance measurement
 Tendency to over-estimate own competence
increases with increasing difficulty level
David Dunning. 2011. The Dunning-Kruger Effect: On Being Ignorant of
One’s Own Ignorance. Advances in experimental social psychology 44
(2011), 247.
Performance („Accuracy“) of users classified as „competent“
Gadiraju, U., Fetahu, B., Kawase, R., Siehndel, P., Dietze, S., Using
Worker Self-Assessments for Competence-based Pre-Selection in
Crowdsourcing Microtasks. In: ACM Transactions on Computer-
Human Interaction (ACM TOCHI) Vol. 24, Issue 4, August 2017.
27/09/17 10Stefan Dietze
“Search As Learning”: predicting learning/knowledge in Web search
Challenges
 Detecting individual search missions in large
query logs
 Detecting “informational” search missions (as
opposed to “transactional” or “navigational”
missions, see [Broder, 2002])
 Predict knowledge state of users in absence of
assessment data
 Predict knowledge gain (or “learning”)
throughout search missions
27/09/17 11Stefan Dietze
“Search As Learning”: predicting learning/knowledge in Web search
Challenges
 Detecting individual search missions in large
query logs
 Detecting “informational” search missions (as
opposed to “transactional” or “navigational”
missions, see [Broder, 2002])
 Predict knowledge state of users in absence of
assessment data
 Predict knowledge gain (or “learning”)
throughout search missions
Initial results
 Search mission detection with average F1 score
75% (experiments based on AOL query logs)
 Quasi experiments to generate search mission
data (queries, behavioral traces, pre- and post-
tests) for 400 search missions
 Ongoing: prediction of knowledge gain/state
27/09/17 12Stefan Dietze
Summary & outlook
 Learning analytics in online & Web-based settings
o Detection of learning & learning-related notions in
absence of assessment/performance indicators
o Analysis of range of data, including behavioral
traces, activity streams, self assessment etc
o Actual big data (dynamics/velocity)
 Positive results from initial models and classifiers
 Other tasks (e.g. detection of learning during Web
search)
 Application of developed models and classifiers in
online (learning) environments (e.g. AFEL project), such
as GNOSS/Didactalia (200.000 users), “LearnWeb”,
“Bibsonomy” etc
 Several ongoing research initiatives, eg research
initiative at LUH on “Digital Higher Education in MINT
Subjects”
27/09/17 13Stefan Dietze
Summary & outlook
 Learning analytics in online & Web-based settings
o Detection of learning & learning-related notions in
absence of assessment/performance indicators
o Analysis of range of data, including behavioral
traces, activity streams, self assessment etc
o Actual big data (dynamics/velocity)
 Positive results from initial models and classifiers
 Other tasks (e.g. detection of learning during Web
search)
 Application of developed models and classifiers in
online (learning) environments (e.g. AFEL project), such
as GNOSS/Didactalia (200.000 users), “LearnWeb”,
“Bibsonomy” etc
 Several ongoing research initiatives, eg research
initiative at LUH on “Digital Higher Education in MINT
Subjects”
?http://stefandietze.net

More Related Content

Similar to Analysing User Knowledge, Competence and Learning during Online Activities

Using AI to understand everyday learning on the Web
Using AI to understand everyday learning on the WebUsing AI to understand everyday learning on the Web
Using AI to understand everyday learning on the WebStefan Dietze
 
Learning analytics workshop
Learning analytics workshop Learning analytics workshop
Learning analytics workshop Eisa Rezaei
 
Advancing Foundation and Practice of Software Analytics
Advancing Foundation and Practice of Software AnalyticsAdvancing Foundation and Practice of Software Analytics
Advancing Foundation and Practice of Software AnalyticsTao Xie
 
Overview of Data Science and AI
Overview of Data Science and AIOverview of Data Science and AI
Overview of Data Science and AIjohnstamford
 
2021_01_15 «Learning Analytics for Large Scale Data».
2021_01_15 «Learning Analytics for Large Scale Data».2021_01_15 «Learning Analytics for Large Scale Data».
2021_01_15 «Learning Analytics for Large Scale Data».eMadrid network
 
The Connected Intelligence Centre: Human-Centered Analytics for UTS Data Chal...
The Connected Intelligence Centre: Human-Centered Analytics for UTS Data Chal...The Connected Intelligence Centre: Human-Centered Analytics for UTS Data Chal...
The Connected Intelligence Centre: Human-Centered Analytics for UTS Data Chal...Simon Buckingham Shum
 
A Blind Date With (Big) Data: Student Data in (Higher) Education
A Blind Date With (Big) Data: Student Data in (Higher) EducationA Blind Date With (Big) Data: Student Data in (Higher) Education
A Blind Date With (Big) Data: Student Data in (Higher) EducationUniversity of South Africa (Unisa)
 
Learning Analytics - A New Discipline and Bits of Semantics
Learning Analytics - A New Discipline and Bits of SemanticsLearning Analytics - A New Discipline and Bits of Semantics
Learning Analytics - A New Discipline and Bits of SemanticsDragan Gasevic
 
Learning Analytics for Self-Regulated Learning (2019)
Learning Analytics for Self-Regulated Learning (2019)Learning Analytics for Self-Regulated Learning (2019)
Learning Analytics for Self-Regulated Learning (2019)Wolfgang Greller
 
27_06_2019 Wolfgang Greller, from University of Teacher Education (Viena), on...
27_06_2019 Wolfgang Greller, from University of Teacher Education (Viena), on...27_06_2019 Wolfgang Greller, from University of Teacher Education (Viena), on...
27_06_2019 Wolfgang Greller, from University of Teacher Education (Viena), on...eMadrid network
 
Dagstuhl14 intro-v1
Dagstuhl14 intro-v1Dagstuhl14 intro-v1
Dagstuhl14 intro-v1CS, NcState
 
Web analytics presentation
Web analytics presentationWeb analytics presentation
Web analytics presentationJim Jansen
 
Learning Analytics: Today, Tomorrow, and When We Get Flying Cars #psuweb Conf...
Learning Analytics: Today, Tomorrow, and When We Get Flying Cars #psuweb Conf...Learning Analytics: Today, Tomorrow, and When We Get Flying Cars #psuweb Conf...
Learning Analytics: Today, Tomorrow, and When We Get Flying Cars #psuweb Conf...Megan Bowe
 
Web analytics webinar
Web analytics webinarWeb analytics webinar
Web analytics webinarJim Jansen
 
The Hidden Stories of Missing Data
The Hidden Stories of Missing DataThe Hidden Stories of Missing Data
The Hidden Stories of Missing DataMaria Wolters
 
Analyzing Social Media with Digital Methods. Possibilities, Requirements, and...
Analyzing Social Media with Digital Methods. Possibilities, Requirements, and...Analyzing Social Media with Digital Methods. Possibilities, Requirements, and...
Analyzing Social Media with Digital Methods. Possibilities, Requirements, and...Bernhard Rieder
 

Similar to Analysing User Knowledge, Competence and Learning during Online Activities (20)

Using AI to understand everyday learning on the Web
Using AI to understand everyday learning on the WebUsing AI to understand everyday learning on the Web
Using AI to understand everyday learning on the Web
 
Learning analytics workshop
Learning analytics workshop Learning analytics workshop
Learning analytics workshop
 
Advancing Foundation and Practice of Software Analytics
Advancing Foundation and Practice of Software AnalyticsAdvancing Foundation and Practice of Software Analytics
Advancing Foundation and Practice of Software Analytics
 
Overview of Data Science and AI
Overview of Data Science and AIOverview of Data Science and AI
Overview of Data Science and AI
 
2021_01_15 «Learning Analytics for Large Scale Data».
2021_01_15 «Learning Analytics for Large Scale Data».2021_01_15 «Learning Analytics for Large Scale Data».
2021_01_15 «Learning Analytics for Large Scale Data».
 
Data collection
Data collectionData collection
Data collection
 
The Connected Intelligence Centre: Human-Centered Analytics for UTS Data Chal...
The Connected Intelligence Centre: Human-Centered Analytics for UTS Data Chal...The Connected Intelligence Centre: Human-Centered Analytics for UTS Data Chal...
The Connected Intelligence Centre: Human-Centered Analytics for UTS Data Chal...
 
A Blind Date With (Big) Data: Student Data in (Higher) Education
A Blind Date With (Big) Data: Student Data in (Higher) EducationA Blind Date With (Big) Data: Student Data in (Higher) Education
A Blind Date With (Big) Data: Student Data in (Higher) Education
 
Learning Analytics - A New Discipline and Bits of Semantics
Learning Analytics - A New Discipline and Bits of SemanticsLearning Analytics - A New Discipline and Bits of Semantics
Learning Analytics - A New Discipline and Bits of Semantics
 
Learning Analytics for Self-Regulated Learning (2019)
Learning Analytics for Self-Regulated Learning (2019)Learning Analytics for Self-Regulated Learning (2019)
Learning Analytics for Self-Regulated Learning (2019)
 
27_06_2019 Wolfgang Greller, from University of Teacher Education (Viena), on...
27_06_2019 Wolfgang Greller, from University of Teacher Education (Viena), on...27_06_2019 Wolfgang Greller, from University of Teacher Education (Viena), on...
27_06_2019 Wolfgang Greller, from University of Teacher Education (Viena), on...
 
Lecture_1_Intro_toDS&AI.pptx
Lecture_1_Intro_toDS&AI.pptxLecture_1_Intro_toDS&AI.pptx
Lecture_1_Intro_toDS&AI.pptx
 
2 Data-mining process
2   Data-mining process2   Data-mining process
2 Data-mining process
 
Dagstuhl14 intro-v1
Dagstuhl14 intro-v1Dagstuhl14 intro-v1
Dagstuhl14 intro-v1
 
Web analytics presentation
Web analytics presentationWeb analytics presentation
Web analytics presentation
 
Learning Analytics: Today, Tomorrow, and When We Get Flying Cars #psuweb Conf...
Learning Analytics: Today, Tomorrow, and When We Get Flying Cars #psuweb Conf...Learning Analytics: Today, Tomorrow, and When We Get Flying Cars #psuweb Conf...
Learning Analytics: Today, Tomorrow, and When We Get Flying Cars #psuweb Conf...
 
Web analytics webinar
Web analytics webinarWeb analytics webinar
Web analytics webinar
 
Too Difficult
Too DifficultToo Difficult
Too Difficult
 
The Hidden Stories of Missing Data
The Hidden Stories of Missing DataThe Hidden Stories of Missing Data
The Hidden Stories of Missing Data
 
Analyzing Social Media with Digital Methods. Possibilities, Requirements, and...
Analyzing Social Media with Digital Methods. Possibilities, Requirements, and...Analyzing Social Media with Digital Methods. Possibilities, Requirements, and...
Analyzing Social Media with Digital Methods. Possibilities, Requirements, and...
 

More from Stefan Dietze

AI in between online and offline discourse - and what has ChatGPT to do with ...
AI in between online and offline discourse - and what has ChatGPT to do with ...AI in between online and offline discourse - and what has ChatGPT to do with ...
AI in between online and offline discourse - and what has ChatGPT to do with ...Stefan Dietze
 
Research Knowledge Graphs at NFDI4DS & GESIS
Research Knowledge Graphs at NFDI4DS & GESISResearch Knowledge Graphs at NFDI4DS & GESIS
Research Knowledge Graphs at NFDI4DS & GESISStefan Dietze
 
Research Knowledge Graphs at GESIS & NFDI4DataScience
Research Knowledge Graphs at GESIS & NFDI4DataScienceResearch Knowledge Graphs at GESIS & NFDI4DataScience
Research Knowledge Graphs at GESIS & NFDI4DataScienceStefan Dietze
 
Human-in-the-loop: the Web as Foundation for interdisciplinary Data Science M...
Human-in-the-loop: the Web as Foundation for interdisciplinary Data Science M...Human-in-the-loop: the Web as Foundation for interdisciplinary Data Science M...
Human-in-the-loop: the Web as Foundation for interdisciplinary Data Science M...Stefan Dietze
 
Human-in-the-Loop: das Web als Grundlage interdisziplinärer Data Science Meth...
Human-in-the-Loop: das Web als Grundlage interdisziplinärer Data Science Meth...Human-in-the-Loop: das Web als Grundlage interdisziplinärer Data Science Meth...
Human-in-the-Loop: das Web als Grundlage interdisziplinärer Data Science Meth...Stefan Dietze
 
Towards research data knowledge graphs
Towards research data knowledge graphsTowards research data knowledge graphs
Towards research data knowledge graphsStefan Dietze
 
Beyond research data infrastructures: exploiting artificial & crowd intellige...
Beyond research data infrastructures: exploiting artificial & crowd intellige...Beyond research data infrastructures: exploiting artificial & crowd intellige...
Beyond research data infrastructures: exploiting artificial & crowd intellige...Stefan Dietze
 
From Web Data to Knowledge: on the Complementarity of Human and Artificial In...
From Web Data to Knowledge: on the Complementarity of Human and Artificial In...From Web Data to Knowledge: on the Complementarity of Human and Artificial In...
From Web Data to Knowledge: on the Complementarity of Human and Artificial In...Stefan Dietze
 
Analysing & Improving Learning Resources Markup on the Web
Analysing & Improving Learning Resources Markup on the WebAnalysing & Improving Learning Resources Markup on the Web
Analysing & Improving Learning Resources Markup on the WebStefan Dietze
 
Beyond Linked Data - Exploiting Entity-Centric Knowledge on the Web
Beyond Linked Data - Exploiting Entity-Centric Knowledge on the WebBeyond Linked Data - Exploiting Entity-Centric Knowledge on the Web
Beyond Linked Data - Exploiting Entity-Centric Knowledge on the WebStefan Dietze
 
Retrieval, Crawling and Fusion of Entity-centric Data on the Web
Retrieval, Crawling and Fusion of Entity-centric Data on the WebRetrieval, Crawling and Fusion of Entity-centric Data on the Web
Retrieval, Crawling and Fusion of Entity-centric Data on the WebStefan Dietze
 
Towards embedded Markup of Learning Resources on the Web
Towards embedded Markup of Learning Resources on the WebTowards embedded Markup of Learning Resources on the Web
Towards embedded Markup of Learning Resources on the WebStefan Dietze
 
Semantic Linking & Retrieval for Digital Libraries
Semantic Linking & Retrieval for Digital LibrariesSemantic Linking & Retrieval for Digital Libraries
Semantic Linking & Retrieval for Digital LibrariesStefan Dietze
 
Linked Data for Architecture, Engineering and Construction (AEC)
Linked Data for Architecture, Engineering and Construction (AEC)Linked Data for Architecture, Engineering and Construction (AEC)
Linked Data for Architecture, Engineering and Construction (AEC)Stefan Dietze
 
Dietze linked data-vr-es
Dietze linked data-vr-esDietze linked data-vr-es
Dietze linked data-vr-esStefan Dietze
 
Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...
Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...
Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...Stefan Dietze
 
Turning Data into Knowledge (KESW2014 Keynote)
Turning Data into Knowledge (KESW2014 Keynote)Turning Data into Knowledge (KESW2014 Keynote)
Turning Data into Knowledge (KESW2014 Keynote)Stefan Dietze
 
From Data to Knowledge - Profiling & Interlinking Web Datasets
From Data to Knowledge - Profiling & Interlinking Web DatasetsFrom Data to Knowledge - Profiling & Interlinking Web Datasets
From Data to Knowledge - Profiling & Interlinking Web DatasetsStefan Dietze
 
WWW2014 Tutorial: Online Learning & Linked Data - Lessons Learned
WWW2014 Tutorial: Online Learning & Linked Data - Lessons LearnedWWW2014 Tutorial: Online Learning & Linked Data - Lessons Learned
WWW2014 Tutorial: Online Learning & Linked Data - Lessons LearnedStefan Dietze
 
What's all the data about? - Linking and Profiling of Linked Datasets
What's all the data about? - Linking and Profiling of Linked DatasetsWhat's all the data about? - Linking and Profiling of Linked Datasets
What's all the data about? - Linking and Profiling of Linked DatasetsStefan Dietze
 

More from Stefan Dietze (20)

AI in between online and offline discourse - and what has ChatGPT to do with ...
AI in between online and offline discourse - and what has ChatGPT to do with ...AI in between online and offline discourse - and what has ChatGPT to do with ...
AI in between online and offline discourse - and what has ChatGPT to do with ...
 
Research Knowledge Graphs at NFDI4DS & GESIS
Research Knowledge Graphs at NFDI4DS & GESISResearch Knowledge Graphs at NFDI4DS & GESIS
Research Knowledge Graphs at NFDI4DS & GESIS
 
Research Knowledge Graphs at GESIS & NFDI4DataScience
Research Knowledge Graphs at GESIS & NFDI4DataScienceResearch Knowledge Graphs at GESIS & NFDI4DataScience
Research Knowledge Graphs at GESIS & NFDI4DataScience
 
Human-in-the-loop: the Web as Foundation for interdisciplinary Data Science M...
Human-in-the-loop: the Web as Foundation for interdisciplinary Data Science M...Human-in-the-loop: the Web as Foundation for interdisciplinary Data Science M...
Human-in-the-loop: the Web as Foundation for interdisciplinary Data Science M...
 
Human-in-the-Loop: das Web als Grundlage interdisziplinärer Data Science Meth...
Human-in-the-Loop: das Web als Grundlage interdisziplinärer Data Science Meth...Human-in-the-Loop: das Web als Grundlage interdisziplinärer Data Science Meth...
Human-in-the-Loop: das Web als Grundlage interdisziplinärer Data Science Meth...
 
Towards research data knowledge graphs
Towards research data knowledge graphsTowards research data knowledge graphs
Towards research data knowledge graphs
 
Beyond research data infrastructures: exploiting artificial & crowd intellige...
Beyond research data infrastructures: exploiting artificial & crowd intellige...Beyond research data infrastructures: exploiting artificial & crowd intellige...
Beyond research data infrastructures: exploiting artificial & crowd intellige...
 
From Web Data to Knowledge: on the Complementarity of Human and Artificial In...
From Web Data to Knowledge: on the Complementarity of Human and Artificial In...From Web Data to Knowledge: on the Complementarity of Human and Artificial In...
From Web Data to Knowledge: on the Complementarity of Human and Artificial In...
 
Analysing & Improving Learning Resources Markup on the Web
Analysing & Improving Learning Resources Markup on the WebAnalysing & Improving Learning Resources Markup on the Web
Analysing & Improving Learning Resources Markup on the Web
 
Beyond Linked Data - Exploiting Entity-Centric Knowledge on the Web
Beyond Linked Data - Exploiting Entity-Centric Knowledge on the WebBeyond Linked Data - Exploiting Entity-Centric Knowledge on the Web
Beyond Linked Data - Exploiting Entity-Centric Knowledge on the Web
 
Retrieval, Crawling and Fusion of Entity-centric Data on the Web
Retrieval, Crawling and Fusion of Entity-centric Data on the WebRetrieval, Crawling and Fusion of Entity-centric Data on the Web
Retrieval, Crawling and Fusion of Entity-centric Data on the Web
 
Towards embedded Markup of Learning Resources on the Web
Towards embedded Markup of Learning Resources on the WebTowards embedded Markup of Learning Resources on the Web
Towards embedded Markup of Learning Resources on the Web
 
Semantic Linking & Retrieval for Digital Libraries
Semantic Linking & Retrieval for Digital LibrariesSemantic Linking & Retrieval for Digital Libraries
Semantic Linking & Retrieval for Digital Libraries
 
Linked Data for Architecture, Engineering and Construction (AEC)
Linked Data for Architecture, Engineering and Construction (AEC)Linked Data for Architecture, Engineering and Construction (AEC)
Linked Data for Architecture, Engineering and Construction (AEC)
 
Dietze linked data-vr-es
Dietze linked data-vr-esDietze linked data-vr-es
Dietze linked data-vr-es
 
Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...
Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...
Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...
 
Turning Data into Knowledge (KESW2014 Keynote)
Turning Data into Knowledge (KESW2014 Keynote)Turning Data into Knowledge (KESW2014 Keynote)
Turning Data into Knowledge (KESW2014 Keynote)
 
From Data to Knowledge - Profiling & Interlinking Web Datasets
From Data to Knowledge - Profiling & Interlinking Web DatasetsFrom Data to Knowledge - Profiling & Interlinking Web Datasets
From Data to Knowledge - Profiling & Interlinking Web Datasets
 
WWW2014 Tutorial: Online Learning & Linked Data - Lessons Learned
WWW2014 Tutorial: Online Learning & Linked Data - Lessons LearnedWWW2014 Tutorial: Online Learning & Linked Data - Lessons Learned
WWW2014 Tutorial: Online Learning & Linked Data - Lessons Learned
 
What's all the data about? - Linking and Profiling of Linked Datasets
What's all the data about? - Linking and Profiling of Linked DatasetsWhat's all the data about? - Linking and Profiling of Linked Datasets
What's all the data about? - Linking and Profiling of Linked Datasets
 

Recently uploaded

Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Scott Andery
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesThousandEyes
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentPim van der Noll
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesKari Kakkonen
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...panagenda
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 

Recently uploaded (20)

Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examples
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 

Analysing User Knowledge, Competence and Learning during Online Activities

  • 1. Backup Analysing User Knowledge, Competence and Learning during Online Activities Stefan Dietze, L3S Research Center, Hannover 26.09.2017 CNR Institute for Educational Technologies, Palermo 27/09/17 1Stefan Dietze
  • 2. Research areas  Web science, Information Retrieval, Semantic Web, Social Web Analytics, Knowledge Discovery, Human Computation  Interdisciplinary application areas: digital humanities, TEL/education, Web archiving, mobility, ... Some projects Research @ L3S 27/09/17 2  See also: http://www.l3s.de Stefan Dietze
  • 3.  Pavlos Fafalios (L3S)  Besnik Fetahu (L3S)  Elena Demidova (L3S)  Ujwal Gadiraju (L3S)  Eelco Herder (L3S)  Ivana Marenzi (L3S)  Nicolas Tempelmeier (L3S)  Ran Yu (L3S)  Markus Rokicki (L3S)  Renato Joao (L3S, PUC Rio) Acknowledgements: Team 27/09/17 3Stefan Dietze  Mathieu d‘Aquin (The Open University, UK)  Mohamed Ben Ellefi (LIRMM, France)  Davide Taibi (CNR, Italy)  Konstantin Todorov (LIRMM, France)  ...
  • 4. 27/09/17 4 Learning Analytics on the Web/for online learning ? Stefan Dietze  Anything can be a learning resource  The activity makes the difference (not the resource): i.e. how a resource is being used  Learning Analytics in online/non-learning environments? o Activity streams, o Social graphs (and their evolution), o Behavioural traces (mouse movements, keystrokes) o ...  Research challenges: o How to detect „learning“? o How to detect learning-specific notions such as „competences“, „learning performance“ etc?
  • 5. 27/09/17 5 „AFEL – Analytics for Everyday (Online) Learning“ Stefan Dietze Examples of AFEL data sources: • Activity streams and behavioral traces • L3S Twitter Crawl: 6 bn tweets • Common Crawl (2015): 2 bn documents • Web Data Commons (2016): 44 bn quads • „German Academic Web“: 6 TB Web crawl • Web search query logs • Wikipedia edit history: 3 M edits/month (engl.) • ....  H2020 project (since 12/2015) aimed at understanding/supporting learning in social Web environments
  • 6. Challenges/Tasks in AFEL & beyond: some examples 27/09/17 6Stefan Dietze I Efficient data capture  Crawling & extracting activity data  Crawling, extracting and indexing learning resources (eg Common Crawl) II Efficient data analysis  Understanding learning resources: entity extraction & clustering on large Web crawls of resources  “Search as learning”: detecting learning in heterogeneous search query logs & click streams  Detecting learning activities: detection of learning pattern (eg competent behavior) in absence of learning objectives & assessments (!) o Obtaining performance indicators from behavioral traces? o Quasi-experiments in crowdsourcing platforms to obtain training data Gadiraju, U., Demartini, G., Kawase, R., Dietze, S. Human beyond the Machine: Challenges and Opportunities of Microtask Crowdsourcing. In: IEEE Intelligent Systems, Volume 30 Issue 4 – Jul/Aug 2015. Gadiraju, U., Kawase, R., Dietze, S, Demartini, G., Understanding Malicious Behavior in Crowdsourcing Platforms: The Case of Online Surveys. ACM CHI Conference on Human Factors in Computing Systems (CHI2015), April 18-23, Seoul, Korea.
  • 7. Gadiraju, U., Demartini, G., Kawase, R., Dietze, S. Human beyond the Machine: Challenges and Opportunities of Microtask Crowdsourcing. In: IEEE Intelligent Systems, Volume 30 Issue 4 – Jul/Aug 2015. Gadiraju, U., Fetahu, B., Kawase, R., Siehndel, P., Dietze, S., Crowd Anatomy Beyond the Good and Bad -- Behavioral Traces for Crowd Worker Modeling and Pre-selection. The Journal of Collaborative Computing and Work Practices (CSCW), under review. 27/09/17 7Stefan Dietze Predicting competence in online users? Capturing assessment data: microtasks in Crowdflower  “Content Creation (CC)”: transcription of captchas  “Information Finding (IF)”: middle name of famous persons  1800 assessments: 2 tasks * 3 durations * 3 difficulty levels * 100 users (per assessment) Level 1 „Daniel Craig“ Level 2 „George Lucas“ (profession: Archbishop) Level 3 „Brian Smith“ (profession: Ice Hockey, born: 1972) Behavioral Traces: keystrokes- and mouse movements  timeBeforeInput, timeBeforeClick  tabSwitchFreq  windowToggleFreq  openNewTabFreq  WindowFocusFrequency  totalMouseMovements  scrollUpFreq, scrollDownFreq  ….  Total amount of events: 893.285 (CC Tasks), 736.664 (IF Tasks) Find the middle name of:
  • 8. 27/09/17 8Stefan Dietze Behavioural traces to predict competence? Training data  Manual annotation of 1800 assessments  Performance types [CHI15]: o “Competent Worker” , o “Diligent Worker” o “Fast Deceiver” o “Incompetent Worker” o “Rule Breaker” o “Smart Deceiver” o “Sloppy Worker”  Prediction of performance types from behavioral traces? Predicting learner types from behavioral traces  “Random Forest Classifier” (per task)  10-fold cross validation  Prediction performance: Accuracy, F-Measure Gadiraju, U., Fetahu, B., Kawase, R., Siehndel, P., Dietze, S., Crowd Anatomy Beyond the Good and Bad -- Behavioral Traces for Crowd Worker Modeling and Pre-selection. The Journal of Collaborative Computing and Work Practices (CSCW), under review. Results  Longer assessments  more signals  Simpler assessments  more conclusive signals  “Competent Workers” (CW, DW): accuracy of 91% respectively 87%  Most significant features: “TotalTime”, “TippingPoint”, “MouseMovementFrequency”, “WindowFocusFrequency”
  • 9. 27/09/17 9Stefan Dietze Other features to predict competence in learning/assessments? “Dunning-Kruger Effect”  Incompetence in task/domain reduces capacity to recognise/assess own incompetence Research question  Self-assessment as feature to predict competence? Results  Self-assessment as (additional) reliable indicator of competence (94% accuracy), superior to mere performance measurement  Tendency to over-estimate own competence increases with increasing difficulty level David Dunning. 2011. The Dunning-Kruger Effect: On Being Ignorant of One’s Own Ignorance. Advances in experimental social psychology 44 (2011), 247. Performance („Accuracy“) of users classified as „competent“ Gadiraju, U., Fetahu, B., Kawase, R., Siehndel, P., Dietze, S., Using Worker Self-Assessments for Competence-based Pre-Selection in Crowdsourcing Microtasks. In: ACM Transactions on Computer- Human Interaction (ACM TOCHI) Vol. 24, Issue 4, August 2017.
  • 10. 27/09/17 10Stefan Dietze “Search As Learning”: predicting learning/knowledge in Web search Challenges  Detecting individual search missions in large query logs  Detecting “informational” search missions (as opposed to “transactional” or “navigational” missions, see [Broder, 2002])  Predict knowledge state of users in absence of assessment data  Predict knowledge gain (or “learning”) throughout search missions
  • 11. 27/09/17 11Stefan Dietze “Search As Learning”: predicting learning/knowledge in Web search Challenges  Detecting individual search missions in large query logs  Detecting “informational” search missions (as opposed to “transactional” or “navigational” missions, see [Broder, 2002])  Predict knowledge state of users in absence of assessment data  Predict knowledge gain (or “learning”) throughout search missions Initial results  Search mission detection with average F1 score 75% (experiments based on AOL query logs)  Quasi experiments to generate search mission data (queries, behavioral traces, pre- and post- tests) for 400 search missions  Ongoing: prediction of knowledge gain/state
  • 12. 27/09/17 12Stefan Dietze Summary & outlook  Learning analytics in online & Web-based settings o Detection of learning & learning-related notions in absence of assessment/performance indicators o Analysis of range of data, including behavioral traces, activity streams, self assessment etc o Actual big data (dynamics/velocity)  Positive results from initial models and classifiers  Other tasks (e.g. detection of learning during Web search)  Application of developed models and classifiers in online (learning) environments (e.g. AFEL project), such as GNOSS/Didactalia (200.000 users), “LearnWeb”, “Bibsonomy” etc  Several ongoing research initiatives, eg research initiative at LUH on “Digital Higher Education in MINT Subjects”
  • 13. 27/09/17 13Stefan Dietze Summary & outlook  Learning analytics in online & Web-based settings o Detection of learning & learning-related notions in absence of assessment/performance indicators o Analysis of range of data, including behavioral traces, activity streams, self assessment etc o Actual big data (dynamics/velocity)  Positive results from initial models and classifiers  Other tasks (e.g. detection of learning during Web search)  Application of developed models and classifiers in online (learning) environments (e.g. AFEL project), such as GNOSS/Didactalia (200.000 users), “LearnWeb”, “Bibsonomy” etc  Several ongoing research initiatives, eg research initiative at LUH on “Digital Higher Education in MINT Subjects” ?http://stefandietze.net