CMP4507 Text MiningBahçeşehir UniversityDegree Programs COMPUTER ENGINEERINGGeneral Information For StudentsDiploma SupplementErasmus Policy StatementNational QualificationsBologna Commission
COMPUTER ENGINEERING
Bachelor TR-NQF-HE: Level 6 QF-EHEA: First Cycle EQF-LLL: Level 6

Ders Genel Tanıtım Bilgileri

Course Code: CMP4507
Ders İsmi: Text Mining
Ders Yarıyılı: Fall
Spring
Ders Kredileri:
Theoretical Practical Credit ECTS
3 0 3 6
Language of instruction:
Ders Koşulu:
Ders İş Deneyimini Gerektiriyor mu?: No
Type of course: Departmental Elective
Course Level:
Bachelor TR-NQF-HE:6. Master`s Degree QF-EHEA:First Cycle EQF-LLL:6. Master`s Degree
Mode of Delivery: Face to face
Course Coordinator : Prof. Dr. ÇAĞATAY ÇATAL
Course Lecturer(s):
Course Assistants:

Dersin Amaç ve İçeriği

Course Objectives: Textual data is increasing in many environments such as articles, blogs, tweets, news, publications, and books. To work with such data and to discover knowledge from this huge amount of data; several techniques such as linguistics, machine learning, deep learning, and natural language processing are required and as such, this is indeed a very difficult task. The aim of this course is to provide the application of machine learning in textual documents in order to analyze the textual data quantitatively. Cleaning textual data, representation of this data, and generating textual data in different problems are three important issues that should be known while working with textual data in general.
Course Content: 1) "
Introduction to Text Mining and related topics (natural language processing, machine learning, deep learning, opportunities offered by deep learning)"
2) Explanation of data preparation methods (manual text cleaning, cleaning with NLTK, data preparation with scikit-learn, data preparation with Keras)
3) Explanation of data representation models (Bag-of-words model, preparation of movie review data for sentiment classification problem)
4) Word embeddings used in data representation
5) Explanation of the methods that can be applied in the text classification problem
6) Midterm
7) Explanation of character and word based language models
8) Explanation of the methods that can create textual definition from the picture (Image captioning)
9) Machine translation from one language to another
10) Practical Application: Sentiment analysis with artificial neural network based bag-of-words model
11) Practical Application: Sentiment analysis with word embedding and CNN model, Sentiment analysis with N-gram based CNN model
12) Practical Application: Designing a Neural Network based language model for Text Generation
13) Practical Application: Designing artificial neural network based image captioning model
14) Practical Application: Designing artificial neural network based machine translation model

Learning Outcomes

The students who have succeeded in this course;
Learning Outcomes
1 - Knowledge
Theoretical - Conceptual
2 - Skills
Cognitive - Practical
3 - Competences
Communication and Social Competence
Learning Competence
Field Specific Competence
Competence to Work Independently and Take Responsibility

Ders Akış Planı

Week Subject Related Preparation
1) Introduction to Text Mining and related topics (natural language processing, machine learning, deep learning, opportunities offered by deep learning)
2) Explanation of data preparation methods (manual text cleaning, cleaning with NLTK, data preparation with scikit-learn, data preparation with Keras)
3) Explanation of data representation models (Bag-of-words model, preparation of movie review data for sentiment classification problem)
4) Word embeddings used in data representation
5) Explanation of the methods that can be applied in the text classification problem
6) Explanation of character and word based language models
7) Explanation of character and word based language models
8) Review for the midterm exam
9) Explanation of the methods that can create textual definition from the picture (Image captioning)
10) Explanation of the methods that can create textual definition from the picture (Image captioning)
11) Machine translation from one language to another
12) Machine translation from one language to another
13) Practical applications
14) Practical Applications

Sources

Course Notes / Textbooks: Brownlee, J. (2017). Deep Learning for Natural Language Processing: Develop Deep Learning Models for your Natural Language Problems. Machine Learning Mastery.

References: Ignatow, G., & Mihalcea, R. (2017). An introduction to text mining: Research design, data collection, and analysis. Sage Publications.

Ders - Program Öğrenme Kazanım İlişkisi

Ders Öğrenme Kazanımları
Program Outcomes
1) Adequate knowledge in mathematics, science and computer engineering; the ability to use theoretical and practical knowledge in these areas in complex engineering problems.
2) Ability to identify, formulate, and solve complex engineering problems; ability to select and apply appropriate analysis and modeling methods for this purpose.
3) Ability to design a complex system, process, device or product to meet specific requirements under realistic constraints and conditions; ability to apply modern design methods for this purpose.
4) Ability to develop, select and use modern techniques and tools necessary for the analysis and solution of complex problems encountered in computer engineering applications; ability to use information technologies effectively.
5) Ability to design, conduct experiments, collect data, analyze and interpret results for the study of complex engineering problems or computer engineering research topics.
6) Ability to work effectively within and multi-disciplinary teams; individual study skills.
7) Ability to communicate effectively in verbal and written Turkish; knowledge of at least one foreign language; ability to write active reports and understand written reports, to prepare design and production reports, to make effective presentations, to give and receive clear and understandable instructions.
8) Awareness of the necessity of lifelong learning; ability to access information, to follow developments in science and technology and to renew continuously.
9) To act in accordance with ethical principles, professional and ethical responsibility; information on the standards used in engineering applications.
10) Information on business practices such as project management, risk management and change management; awareness of entrepreneurship and innovation; information about sustainable development.
11) Knowledge of the effects of engineering practices on health, environment and safety in the universal and social scale and the problems of the era reflected in engineering; awareness of the legal consequences of engineering solutions.

Ders - Öğrenme Kazanımı İlişkisi

No Effect 1 Lowest 2 Low 3 Average 4 High 5 Highest
           
Program Outcomes Level of Contribution
1) Adequate knowledge in mathematics, science and computer engineering; the ability to use theoretical and practical knowledge in these areas in complex engineering problems.
2) Ability to identify, formulate, and solve complex engineering problems; ability to select and apply appropriate analysis and modeling methods for this purpose.
3) Ability to design a complex system, process, device or product to meet specific requirements under realistic constraints and conditions; ability to apply modern design methods for this purpose.
4) Ability to develop, select and use modern techniques and tools necessary for the analysis and solution of complex problems encountered in computer engineering applications; ability to use information technologies effectively.
5) Ability to design, conduct experiments, collect data, analyze and interpret results for the study of complex engineering problems or computer engineering research topics.
6) Ability to work effectively within and multi-disciplinary teams; individual study skills.
7) Ability to communicate effectively in verbal and written Turkish; knowledge of at least one foreign language; ability to write active reports and understand written reports, to prepare design and production reports, to make effective presentations, to give and receive clear and understandable instructions.
8) Awareness of the necessity of lifelong learning; ability to access information, to follow developments in science and technology and to renew continuously.
9) To act in accordance with ethical principles, professional and ethical responsibility; information on the standards used in engineering applications.
10) Information on business practices such as project management, risk management and change management; awareness of entrepreneurship and innovation; information about sustainable development.
11) Knowledge of the effects of engineering practices on health, environment and safety in the universal and social scale and the problems of the era reflected in engineering; awareness of the legal consequences of engineering solutions.

Öğrenme Etkinliği ve Öğretme Yöntemleri

Ölçme ve Değerlendirme Yöntemleri ve Kriterleri

Assessment & Grading

Semester Requirements Number of Activities Level of Contribution
Attendance 42 % 0
Project 57 % 40
Midterms 15 % 20
Final 20 % 40
Total % 100
PERCENTAGE OF SEMESTER WORK % 60
PERCENTAGE OF FINAL WORK % 40
Total % 100