Natural Language Processing II (EN)

You will be guided by Jiří Materna

Je specialista na strojové učení se zkušenostmi s jeho aplikacemi v průmyslu od roku 2007. Mezi lety 2008 a 2017 pracoval ve společnosti Seznam.cz, z…

Information

Description

In this course, we will follow up on the basic Natural Language Processing course with more advanced topics.

We will mainly focus on data preprocessing and the latest applications of deep learning in text processing. It will mainly be about the architecture of neural networks built on the so-called Transformers. Using the transfer learning method, we will show how large pre-trained neural networks can be used for various practical applications.

Contents

Preprocessing of text data
  • Character encoding and unicode normalization
  • Traditional tokenization (simple methods, Spacy, Moses)
  • Subword tokenization (byte-pair encoding, wordpiece, sentencepiece)
  • Data cleaning (deduplication, removal of text ballast)
Word embeddings
  • General principles
  • Implementation of the skip-gram model
Machine translation with recurrent networks
  • LSTM and GRU memory cells
  • Implementation of machine translation using recurrent networks
Transformers
  • Attention is all you need
  • Transformer architecture
  • GPT2
  • BERT
  • XLNET
Examples of transfer learning for natural language processing
  • Classification of texts
  • Recognizing named entities
  • Question answering

Prerequisites

  • Basic knowledge of Python programming
  • High School Mathematics
  • Course-level knowledge of machine learning Introduction to Machine Learning
  • Course-level knowledge Natural Language Processing

Natural Language Processing II (EN)

Selected course term

 ONLINE

Price
4 990 CZK + 21% VAT

Contact the supplier


Because of spam protection, please answer the following question how much is five and two ? Write the sum in digits.