Building upon an internal data science initiative, GSE IT began investigating innovative ways of using machine learning (ML), specifically around natural language processing (NLP) to analyze large amounts of text.
The goal is to find ways technology and machine learning can help supplement classroom instruction, policymaking, and personalization of learning experiences.
We first began by trying various cloud providers for natural language processing, including Google’s Cloud Natural Language, Microsoft’s Cognitive Services, and IBM Watson. We were able to process simple texts through their service and get back results according to the cloud vendor’s algorithm and dataset.
We then tried building our own algorithm in-house, using the Stanford Question Answering Dataset (SQuAD) to train our model. We also used questions from the Stanford Mobile Inquiry-based Learning Environment to rate and classify questions.
We are continually improving our algorithm to achieve better results when classifying open-ended text, and are expanding our use of NLP and ML to other educational use cases.
We are building chatbots with embedded learning proficiency levels to help tutors of English identify the written proficiency of their trainee, as well as to help our community navigate around our buildings through our digital kiosk.