Chapter 16

Writes › Book › Deep Learning with PyTorch › Part IV › Chapter 16 ›

Word Embeddings

Natural language models cannot operate directly on words as strings.

Writes › Book › Deep Learning with PyTorch › Part IV › Chapter 16 ›

Subword Tokenization

A language model cannot process raw text directly. Text must first be converted into a sequence of token IDs. The procedure that performs this conversion is called tokenization.

Writes › Book › Deep Learning with PyTorch › Part IV › Chapter 16 ›

Text Classification

Text classification assigns one or more labels to a piece of text.

Writes › Book › Deep Learning with PyTorch › Part IV › Chapter 16 ›

Named Entity Recognition

Named entity recognition, usually abbreviated NER, identifies spans of text that refer to named or typed entities.

Writes › Book › Deep Learning with PyTorch › Part IV › Chapter 16 ›

Machine Translation

Machine translation converts text from one language into another. Given a source sentence in one language, the model generates a semantically equivalent sentence in a target language.

Writes › Book › Deep Learning with PyTorch › Part IV › Chapter 16 ›

Question Answering

Question answering, often abbreviated QA, is the task of producing an answer to a question.

Writes › Book › Deep Learning with PyTorch › Part IV › Chapter 16 ›

Conversational Systems

A conversational system processes dialogue between users and machines.

Writes › Book › Deep Learning with PyTorch › Part IV › Chapter 16 ›

Language Modeling

Language modeling is the task of predicting text sequences. A language model assigns probabilities to sequences of tokens and learns the statistical structure of language.

Sections

Word Embeddings

Subword Tokenization

Text Classification

Named Entity Recognition

Machine Translation

Question Answering

Conversational Systems

Language Modeling