Introduction to Text Mining in Japanese
This tutorial was created for the workshop Introduction to Text Mining in Japanese, sponsored by the Japan Digital Research Center and Harvard’s Digital Scholarship Support Group (DSSG). For questions, please contact to Jungeun "June" Lim (june.j.lim@gmail.com).
To access the workshop content, please open the Jupyter notebook and play the video tutorials in the Youtube playlist.
What is Jupyter Notebook?
Jupyter Notebook is an open-source application that allows you to write and execute Python (and several other programming languages) in your web browser. This Jupyter Notebook contains Python code that you can run interactively along with instructions that explain what each code cell (a box that contains code) does.
This Jupyter Notebook is hosted on Google Colab, which means that it runs on Google's cloud server with all the configurations set up for you. So you don't need to install anything to write and/or execute Python code here.
If you want to save changes, please create a copy of the notebook in your personal space such as your Google Drive (you can find the options to do so in the File menu).
Click here to access "Introduction to Text Mining in Japanese" (Jupyter notebook on Google Colab)
Playlist: Text Mining for Japanese Studies Video Tutorials on Youtube
Assessment
Workshops taken as part of the certificate coursework requirement include an assessment component to ensure students have developed an adequate understanding of the materials. Assessments will typically require students to apply their skills to a new context, and we are generally aiming for 2-3 hours of student work. Course assessments will be evaluated on a pass/fail basis by DSSG standing committee members.
Sign in to your Google Account to make a copy of the Jupyter notebooks that you can edit. To take the assessment, please click the links below.