词条 | Educational data mining |
释义 |
DefinitionEducational data mining refers to techniques, tools, and research designed for automatically extracting meaning from large repositories of data generated by or related to people's learning activities in educational settings. Quite often, this data is extensive, fine-grained, and precise. For example, several learning management systems (LMSs) track information such as when each student accessed each learning object, how many times they accessed it, and how many minutes the learning object was displayed on the user's computer screen. As another example, intelligent tutoring systems record data every time a learner submits a solution to a problem; they may collect the time of the submission, whether or not the solution matches the expected solution, the amount of time that has passed since the last submission, the order in which solution components were entered into the interface, etc. The precision of this data is such that even a fairly short session with a computer-based learning environment (e.g., 30 minutes) may produce a large amount of process data for analysis. In other cases, the data is less fine-grained. For example, a student's university transcript may contain a temporally ordered list of courses taken by the student, the grade that the student earned in each course, and when the student selected or changed his or her academic major. EDM leverages both types of data to discover meaningful information about different types of learners and how they learn, the structure of domain knowledge, and the effect of instructional strategies embedded within various learning environments. These analyses provide new information that would be difficult to discern by looking at the raw data. For example, analyzing data from an LMS may reveal a relationship between the learning objects that a student accessed during the course and their final course grade. Similarly, analyzing student transcript data may reveal a relationship between a student's grade in a particular course and their decision to change their academic major. Such information provides insight into the design of learning environments, which allows students, teachers, school administrators, and educational policy makers to make informed decisions about how to interact with, provide, and manage educational resources. HistoryWhile the analysis of educational data is not itself a new practice, recent advances in educational technology, including the increase in computing power and the ability to log fine-grained data about students' use of a computer-based learning environment, have led to an increased interest in developing techniques for analyzing the large amounts of data generated in educational settings. This interest translated into a series of EDM workshops held from 2000 to 2007 as part of several international research conferences.[4] In 2008, a group of researchers established what has become an annual international research conference on EDM, the first of which took place in Montreal, Quebec, Canada.[5] As interest in EDM continued to increase, EDM researchers established an academic journal in 2009, the Journal of Educational Data Mining, for sharing and disseminating research results. In 2011, EDM researchers established the International Educational Data Mining Society to connect EDM researchers and continue to grow the field. With the introduction of public educational data repositories in 2008, such as the Pittsburgh Science of Learning Centre's (PSLC) DataShop and the National Center for Education Statistics (NCES), public data sets have made educational data mining more accessible and feasible, contributing to its growth.[6] GoalsRyan S. Baker and Kalina Yacef [7] identified the following four goals of EDM:
Users and stakeholdersThere are four main users and stakeholders involved with educational data mining. These include:
PhasesAs research in the field of educational data mining has continued to grow, a myriad of data mining techniques have been applied to a variety of educational contexts. In each case, the goal is to translate raw data into meaningful information about the learning process in order to make better decisions about the design and trajectory of a learning environment. Thus, EDM generally consists of four phases:[2][4]
During phases 3 and 4, data is often visualized or in some other way distilled for human judgment.[2] A large amount of research has been conducted in best practices for visualizing data. Main approachesOf the general categories of methods mentioned, prediction, clustering and relationship mining are considered universal methods across all types of data mining; however, Discovery with Models and Distillation of Data for Human Judgment are considered more prominent approaches within educational data mining.[6] Discovery with modelsIn the Discovery with Model method, a model is developed via prediction, clustering or by human reasoning knowledge engineering and then used as a component in another analysis, namely in prediction and relationship mining.[6] In the prediction method use, the created model's predictions are used to predict a new variable.[6] For the use of relationship mining, the created model enables the analysis between new predictions and additional variables in the study.[6] In many cases, discovery with models uses validated prediction models that have proven generalizability across contexts. Key applications of this method include discovering relationships between student behaviors, characteristics and contextual variables in the learning environment.[6] Further discovery of broad and specific research questions across a wide range of contexts can also be explored using this method. Distillation of data for human judgmentHumans can make inferences about data that may be beyond the scope in which an automated data mining method provides.[6] For the use of education data mining, data is distilled for human judgment for two key purposes, identification and classification.[6] For the purpose of identification, data is distilled to enable humans to identify well-known patterns, which may otherwise be difficult to interpret. For example, the learning curve, classic to educational studies, is a pattern that clearly reflects the relationship between learning and experience over time. Data is also distilled for the purposes of classifying features of data, which for educational data mining, is used to support the development of the prediction model. Classification helps expedite the development of the prediction model, tremendously. The goal of this method is to summarize and present the information in a useful, interactive and visually appealing way in order to understand the large amounts of education data and to support decision making.[8] In particular, this method is beneficial to educators in understanding usage information and effectiveness in course activities.[8] Key applications for the distillation of data for human judgment include identifying patterns in student learning, behavior, opportunities for collaboration and labeling data for future uses in prediction models.[6] ApplicationsA list of the primary applications of EDM is provided by Cristobal Romero and Sebastian Ventura.[4] In their taxonomy, the areas of EDM application are:
New research on mobile learning environments also suggests that data mining can be useful. Data mining can be used to help provide personalized content to mobile users, despite the differences in managing content between mobile devices and standard PCs and web browsers. New EDM applications will focus on allowing non-technical users use and engage in data mining tools and activities, making data collection and processing more accessible for all users of EDM. Examples include statistical and visualization tools that analyzes social networks and their influence on learning outcomes and productivity.[13] CoursesIn October 2013, Coursera offered a free online course on "Big Data in Education" that taught how and when to use key methods for EDM.[14] This course moved to edX in the summer of 2015,[15] and has continued to run on edX annually since then. A course archive is now available online.[16] Teachers College, Columbia University offers a MS in Learning Analytics.[17]Publication venuesConsiderable amounts of EDM work are published at the peer-reviewed International Conference on Educational Data Mining, organized by the International Educational Data Mining Society.
EDM papers are also published in the Journal of Educational Data Mining (JEDM). Many EDM papers are routinely published in related conferences, such as Artificial Intelligence and Education, Intelligent Tutoring Systems, and User Modeling, Adaptation, and Personalization. In 2011, Chapman & Hall/CRC Press, Taylor and Francis Group published the first Handbook of Educational Data Mining. This resource was created for those that are interested in participating in the educational data mining community.[13] ContestsIn 2010, the Association for Computing Machinery's KDD Cup was conducted using data from an educational setting. The data set was provided by the Pittsburgh Science of Learning Center's [https://pslcdatashop.web.cmu.edu/ DataShop], and it consisted of over 1,000,000 data points from students using a cognitive tutor. Six hundred teams competed for over 8,000 USD in prize money (which was donated by Facebook). The goal for contestants was to design an algorithm that, after learning from the provided data, would make the most accurate predictions from new data. The [https://pslcdatashop.web.cmu.edu/KDDCup/results.jsp winners] submitted an algorithm that utilized feature generation (a form of representation learning), random forests, and Bayesian networks. Costs and challengesAlong with technological advancements are costs and challenges associated with implementing EDM applications. These include the costs to store logged data and the cost associated with hiring staff dedicated to managing data systems.[18] Moreover, data systems may not always integrate seamlessly with one another and even with the support of statistical and visualization tools, creating one simplified version of the data can be difficult.[18] Furthermore, choosing which data to mine and analyze can also be challenging,[18] making the initial stages very time consuming and labor-intensive. From beginning to end, the EDM strategy and implementation requires one to uphold privacy and ethics[18] for all stakeholders involved. Criticisms
See also
References1. ^{{cite web | title = EducationalDataMining.org | year = 2013 | url = http://www.educationaldatamining.org/ | accessdate = 2013-07-15}} {{DEFAULTSORT:Educational Data Mining}}2. ^1 2 R. Baker (2010) Data Mining for Education. In McGaw, B., Peterson, P., Baker, E. (Eds.) International Encyclopedia of Education (3rd edition), vol. 7, pp. 112-118. Oxford, UK: Elsevier. 3. ^{{Cite journal | author = G. Siemens, R.S.j.d. Baker | title=Learning analytics and educational data mining: towards communication and collaboration | journal = Proceedings of the 2nd International Conference on Learning Analytics and Knowledge | year=2012 | pages=252–254}} 4. ^1 2 C. Romero, S. Ventura. Educational Data Mining: A Review of the State-of-the-Art. IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews. 40(6), 601-618, 2010. 5. ^"http://educationaldatamining.org/EDM2008/" Retrieved 2013-09-04 6. ^1 2 3 4 5 6 7 8 {{cite web|last=Baker|first=Ryan|title=Data Mining for Education|url=http://www.columbia.edu/~rsb2162/Encyclopedia%20Chapter%20Draft%20v10%20-fw.pdf|publisher=oxford, UK: Elsevier|accessdate=9 February 2014}} 7. ^1 {{cite journal|last=Baker|first=R.S.|author2=Yacef, K|title=The state of educational data mining in 2009: A review and future visions|journal=JEDM-Journal of Educational Data Mining|year=2009|volume=1|issue=1|page=2017}} 8. ^1 2 3 4 5 {{cite journal|last=Romero|first=Cristobal|author2=Ventura, Sebastian|title=WIREs Data Mining Knowl Discov|journal=Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery|date=January–February 2013|volume=3|issue=1|pages=12–27|doi=10.1002/widm.1075}} 9. ^{{cite journal|last=Romero|first=Cristobal|author2=Ventura, Sebastian|title=Educational data mining: A survey from 1995 to 2005|journal=Expert Systems with Applications|year=2007|volume=33|issue=1|pages=135–146}} 10. ^{{cite web|publisher=Industry Canada|title=Assessing the Economic Impact of Copyright Reform in the Area of Technology-Enhanced Learning|url=https://www.ic.gc.ca/eic/site/ippd-dppi.nsf/eng/ip01102.html|accessdate=6 April 2014}} 11. ^Azarnoush, Bahareh, et al. "Toward a Framework for Learner Segmentation." JEDM-Journal of Educational Data Mining 5.2 (2013): 102-126. 12. ^1 {{cite web|last=U.S. Department of Education|first=Office of Educational Technology|title=Enhancing Teaching and Learning Through Educational Data Mining and Learning Analytics: An Issue Brief|url=https://www.ed.gov/edblogs/technology/files/2012/03/edm-la-brief.pdf|accessdate=30 March 2014}} 13. ^1 {{cite book|author1=Romero, C. |author2= Ventura, S. |author3=Pechenizkiy, M. |author4= Baker, R. S.|title=Handbook of educational data mining|year=2010|publisher=CRC Press}} 14. ^{{cite web|website=Coursera|title=Big Data in Education|url=https://www.coursera.org/course/bigdata-edu|accessdate=30 March 2014}} 15. ^{{cite web|website=edXedxed|title=Big Data in Education|url=https://www.edx.org/course/big-data-education-pennx-bde1x-0|accessdate=13 October 2015}} 16. ^{{cite web|title=Big Data in Education|url=http://www.upenn.edu/learninganalytics/MOOT/bigdataeducation.html|accessdate=17 July 2018}} 17. ^{{Cite web|title = Learning Analytics {{!}} Teachers College Columbia University|url = http://www.tc.columbia.edu/human-development/learning-analytics/|website = www.tc.columbia.edu|accessdate = 2015-10-13}} 18. ^1 2 3 {{cite web|website=EdTechReview|title=How Can Educational Data Mining and Learning Analytics Improve and Personalize Education?|url=http://edtechreview.in/trends-insights/insights/389-data-mining-and-learning-analytics-improving-education|accessdate=9 April 2014}} 2 : Applied data mining|Educational psychology |
随便看 |
|
开放百科全书收录14589846条英语、德语、日语等多语种百科知识,基本涵盖了大多数领域的百科知识,是一部内容自由、开放的电子版国际百科全书。