Intelligent Visual Computing: A Neural Network Approach
Location & Time:
Hasbrouck 134 (students can also join remotely - check moodle for zoom link)
Tuesdays + Thursdays 5:30-6:45
Instuctor: Evangelos Kalogerakis ()
This course covers deep learning methods for 3D computer graphics and 3D computer vision. It introduces students to a wide range of problems related to the processing and creation of visual content, including 3D models, scenes, animations, and images. For each problem, the course describes state-of-the-art learning-based solutions based on recent research findings. Topics include 3D deep learning, 3D object recognition, segmentation, correspondences, 3D object and scene reconstruction, image synthesis, 3D shape synthesis, animation, and neural rendering. Students review research papers, and complete 5 programming assignments in Python/PyTorch. Students taking the 674 section will also work on an open-ended course project related to the topics covered in this course.
CMPSCI graduate students: there are no prerequisites for CMPSCI graduate students, although familiarity with probability, statistics, linear algebra, undergraduate computer graphics and machine learning is essential!
Undergrads and others: the course has the following prerequisites:
- Introduction to Algorithms (311 or equivalent) with grade B or better
- Artificial intelligence (383 or equivalent) with grade B or better
- Introduction to Computer Graphics (373, 473 or equivalent) with grade B or better
This course counts as a CS Elective toward the CMPSCI major (BA/BS) and can satisfy a second or third AI core requirement in the CMPSCI PhD and MSc program. It offers 3 credits.
Class planLecture material will be available asynchronousy. Course work can be done asychronouly. Students can join in-person or watch live lectures over zoom (attendance is optional). There will be office hours with the instructor and the TAs remotely over zoom. Zoom links, lecture notes, recorded videos, programming resources, and assignments are posted on Moodle. Piazza will be used for asynchronous Q & A with the instructor and TAs.
Week-by-week outline (tentative)
Week 1: Image, Shape, Scene Representation
Week 2: Introduction to Neural Networks
Week 3: 3D Deep Learning: the multi-view approach
Week 4: 3D Deep Learning: the voxel-based approach
Week 5: 3D Deep Learning: the point-based approach
Week 6: 3D Deep Learning: the mesh-based approach
Week 7: Introduction to Generative Models
Week 8: 3D generative models: generating views, voxels, points
Week 9: 3D generative models: generating implicits and primitives
Week 10: 3D generative models: generating meshes and scenes
Week 11: Generative models for rigging and animation
Week 12: Neural Photorealistic Rendering
Week 13: Neural Artistic Rendering
- Complete five assignments
- Research paper presentation
- Complete five assignments
- A term project + presentation. This should be an implementation of a deep learning algorithm related to the topics of the course. The result of the project should be a prototype with some preliminary results, or could lead to a submission of a research paper in the future.
Marking scheme (tentative)
15% research paper presentation
30% project (code, report)
10% project presentation
Scores to letter conversions:
Note: A course grade below C is not a passing grade for graduate students.
Should you take 574 or 674?
Both 574 and 674 sections meet at the same time, same location, and have access to the same lectures and course material. The programming assignments are the same. The difference is that 574 requires students to present an existing research paper towards the end of the course (~15 min presentation), while 674 requires students to develop on their own a deep learning method related to the topics of the course as a final course project. The project can be an implementation of a (small-scale) research idea/prototype, or could lead to a submission of a research paper in the future. The project plan must be first discussed with the instructor (soon after the middle of the semester). Students also present their project work towards the end of the course. As a result, 674 requires more effort and is more targeted for students interested in graphics+vision research.
Accommodation StatementThe University of Massachusetts Amherst is committed to providing an equal educational opportunity for all students. If you have a documented physical, psychological, or learning disability on file with Disability Services (DS), you may be eligible for reasonable academic accommodations to help you succeed in this course. If you have a documented disability that requires an accommodation, please notify me within the first two weeks of the semester so that we may make appropriate arrangements.
Academic Honesty StatementSince the integrity of the academic enterprise of any institution of higher education requires honesty in scholarship and research, academic honesty is required of all students at the University of Massachusetts Amherst. Academic dishonesty is prohibited in all programs of the University. Academic dishonesty includes but is not limited to: cheating, fabrication, plagiarism, and facilitating dishonesty. Appropriate sanctions may be imposed on any student who has committed an act of academic dishonesty. Instructors should take reasonable steps to address academic misconduct. Any person who has reason to believe that a student has committed academic dishonesty should bring such information to the attention of the appropriate course instructor as soon as possible. Instances of academic dishonesty not related to a specific course should be brought to the attention of the appropriate department Head or Chair. Since students are expected to be familiar with this policy and the commonly accepted standards of academic integrity, ignorance of such standards is not normally sufficient evidence of lack of intent (http://www.umass.edu/dean_students/codeofconduct/acadhonesty/).