Logo
Pattern

Discover published sets by community

Explore tens of thousands of sets crafted by our community.

NLP Basics

25

Flashcards

0/25

Still learning
StarStarStarStar

NLP

StarStarStarStar

Natural Language Processing, the branch of computer science and artificial intelligence concerned with giving computers the ability to understand text and spoken words in much the same way human beings can.

StarStarStarStar

Sentiment Analysis

StarStarStarStar

The process of computationally determining whether a piece of writing is positive, negative, or neutral.

StarStarStarStar

Tokenization

StarStarStarStar

The process of breaking down text into units of meaning, known as tokens, which can be words, phrases, or symbols.

StarStarStarStar

TF-IDF

StarStarStarStar

Term Frequency-Inverse Document Frequency, a numerical statistic that is intended to reflect how important a word is to a document in a collection or corpus.

StarStarStarStar

Attention Mechanism

StarStarStarStar

A component of neural networks that allows the model to focus on different parts of the input sequence when generating an output, improving the performance of the network on tasks like machine translation.

StarStarStarStar

Word Embeddings

StarStarStarStar

A type of word representation that allows words with similar meaning to have a similar representation.

StarStarStarStar

Corpus

StarStarStarStar

A large and structured set of texts, typically used for linguistic research and language modeling.

StarStarStarStar

Transformer

StarStarStarStar

A type of deep learning model that uses self-attention to boost the speed of training and improve the handling of long-range dependencies in data.

StarStarStarStar

Co-reference Resolution

StarStarStarStar

The task of determining whether and how terms in a text, such as pronouns and nouns, refer to the same entity.

StarStarStarStar

Parsing

StarStarStarStar

The process of analyzing a string of symbols, either in natural language or in computer languages, according to the rules of a formal grammar.

StarStarStarStar

Part-of-Speech Tagging

StarStarStarStar

The process of marking up words in a text as corresponding to a particular part of speech, based on both its definition and context.

StarStarStarStar

Syntax

StarStarStarStar

The arrangement of words and phrases to create well-formed sentences in a language.

StarStarStarStar

Named Entity Recognition (NER)

StarStarStarStar

The process of identifying and classifying named entities in text into predefined categories such as the names of persons, organizations, locations, expressions of times, quantities, monetary values, percentages, etc.

StarStarStarStar

Language Model

StarStarStarStar

A model that predicts the likelihood of a sequence of words.

StarStarStarStar

Token

StarStarStarStar

In NLP, a token usually corresponds to a word or a punctuation marker.

StarStarStarStar

Named Entity

StarStarStarStar

A word or phrase that clearly identifies one item from a set of others that have similar attributes, such as names of people, places, companies, etc.

StarStarStarStar

Stemming

StarStarStarStar

The process of reducing words to their root form or stem, often by removing common prefixes or suffixes.

StarStarStarStar

BiLSTM

StarStarStarStar

Bidirectional Long Short-Term Memory, a type of RNN that can improve model performance on sequence classification problems.

StarStarStarStar

Stop Words

StarStarStarStar

Words that are filtered out before processing of natural language data because they are deemed irrelevant for the task and do not contain important meaning.

StarStarStarStar

BERT

StarStarStarStar

Bidirectional Encoder Representations from Transformers, a method of pre-training language representations which can be fine-tuned for various NLP tasks.

StarStarStarStar

Lemmatization

StarStarStarStar

Similar to stemming, it's the process of reducing a word to its base or dictionary form, called a lemma, often involving a vocabulary and morphological analysis of the word.

StarStarStarStar

Machine Translation

StarStarStarStar

The application of computers to the task of translating texts from one natural language to another.

StarStarStarStar

GPT

StarStarStarStar

Generative Pre-trained Transformer, a type of language model that uses unsupervised learning to generate natural language texts.

StarStarStarStar

Semantics

StarStarStarStar

The study of meaning in language, as opposed to syntax which is the study of sentence structure.

StarStarStarStar

Chatbot

StarStarStarStar

A software application used to conduct an online chat conversation via text or text-to-speech, in lieu of providing direct contact with a live human agent.

Know
0
Still learning
Click to flip
Know
0
Logo

© Hypatia.Tech. 2024 All rights reserved.