Faculty of Science and Engineering

Back to List

COT300XE(計算基盤 / Computing technologies 300)
Multi-modal Information Processing

Shoji KURAKAKE

Class code etc
Faculty/Graduate school Faculty of Science and Engineering
Attached documents
Year 2023
Class code H6107
Previous Class code
Previous Class title
Term 秋学期授業/Fall
Day/Period 火4/Tue.4
Class Type
Campus 小金井
Classroom name 小西館‐W305
Grade 3年
Credit(s)
Notes
Open Program
Open Program (Notes)
Global Open Program
Interdepartmental class taking system for Academic Achievers
Interdepartmental class taking system for Academic Achievers (Notes)
Class taught by instructors with practical experience
SDGs CP
Urban Design CP
Diversity CP
Learning for the Future CP
Carbon Neutral CP
Chiyoda Campus Consortium
Category 応用情報工学科
学科専門科目

Show all

Hide All

Outline (in English)

Multimodal information processing is about technologies for prediction and classification from different modal data, such as image and audio. Students will learn single and multi modal data processing technologies in the first half of this course. For image processing, convolutional neural network is introduced. For speech recognition, hidden Markova model, RNN and LSTM are explained. In the second half of this course, student will learn the applications of those technologies including object detection, image generation.
Student will also have opportunities to try MATLAB code provided by the lecturer and deepen the level of understanding for technologies learned through the course.
[Learning activities outside of classroom]
The review and the preparation of each lesson will take 4 hours. How to use MATLAB should be learnt by students themselves by mainly using web and with the help form the staff at the software center for the setting related things.
[Grading Criteria /Policy]
Grade is determined 60% by the submission of the assignment for each lesson and 40% by the evaluation of reports.

Default language used in class

日本語 / Japanese