ISTL7032 | Statistical Natural Language Processing | 3+0+0 | ECTS: 7.5
Year / Semester | Spring Semester
Level of Course | Third Cycle
Status | Elective
Department | Department of Statistics and Computer Sciences
Prerequisites and co-requisites | None
Mode of Delivery | Face to face
Contact Hours | 14 weeks - 3 hours of lectures per week
Lecturer | Prof. Dr. Orhan KESEMEN
Co-Lecturer | None
Language of instruction | Turkish
Professional practice (internship) | None
The aim of the course | To teach statistical processing of Turkish on the computer.
Programme Outcomes | CTPO | TOA
Upon successful completion of the course, the students will be able to:
PO - 1 | Use natural language processing techniques. | 7,8 | 1,3
PO - 2 | Apply theoretical sciences such as mathematics and statistics in applied fields. | 7,8 | 1,3
CTPO: Contribution to programme outcomes; TOA: Type of assessment (1: Written exam, 2: Oral exam, 3: Homework assignment, 4: Laboratory exercise/exam, 5: Seminar / presentation, 6: Term paper); PO: Learning outcome
Natural language processing and its areas of application; structural classification of world languages; string processing; regular expressions; search algorithms; searching in text, databases, and the internet; statistics of search results; vowel and consonant rules; major and minor vowel harmony; spelling algorithms; spelling, punctuation, and grammar errors and techniques for correcting them; word puzzle design; indexing algorithms; encryption algorithms and decryption using statistics; word-formation trees for Turkish; statistical modeling of human speech production; speech-to-text and text-to-speech conversion; morphological analysis of Turkish, including roots, affixes, and their statistics; structural comparison of languages; syntactic analysis of Turkish, including separating a sentence into its elements and word position statistics; semantic analysis of Turkish; classification of word meanings; machine learning; statistical machine translation and positioning.
Course Syllabus
Week | Subject | Related Notes / Files
Week 1 | Introduction to natural language processing and its areas of application |
Week 2 | Structural classification of world languages |
Week 3 | String processing |
Week 4 | Regular expressions; search algorithms; searching in text, databases, and the internet |
Week 5 | Statistics of search results; vowel and consonant rules; major and minor vowel harmony; spelling algorithms |
Week 6 | Spelling, punctuation, and grammar errors and techniques for correcting them |
Week 7 | Word puzzle design |
Week 8 | Indexing algorithms |
Week 9 | Mid-term exam |
Week 10 | Encryption algorithms and decryption using statistics |
Week 11 | Word-formation trees for Turkish |
Week 12 | Statistical modeling of human speech production |
Week 13 | Speech-to-text and text-to-speech conversion |
Week 14 | Morphological analysis of Turkish, including roots, affixes, and their statistics; structural comparison of languages |
Week 15 | Syntactic analysis of Turkish, including separating a sentence into its elements and word position statistics |
Week 16 | End-of-term exam |
1 | Chris Manning and Hinrich Schütze, 1999, Foundations of Statistical Natural Language Processing, MIT Press, Cambridge.
1 | Xuedong Huang, ..., 2001, Spoken Language Processing: A Guide to Theory, Algorithm, and System Development, Prentice-Hall.
2 | Daniel Jurafsky and James H. Martin, 2000, Speech and Language Processing, Prentice-Hall.
Method of Assessment
Type of assessment | Week No | Date | Duration (hours) | Weight (%)
Homework/Assignment/Term-paper | 03, 04, 05, 06, 07, 08, 10, 11, 12 | | 6 | 50
End-of-term exam | 16 | 28/05/2010 | 1 | 50
Student Work Load and its Distribution
Type of work | Duration (hours pw) | No of weeks / Number of activity | Hours in total per term
Homework | 4 | 14 | 56
Total work load | | | 56