Prompt Injection
Prompt Injection is a cybersecurity attack technique in which maliciously crafted inputs manipulate AI models—particularly Large Language Models (LLMs)—to produce harmful, misleading, or confidential outputs. By cleverly altering context or slipping in hidden commands, attackers can bypass content filters, reveal...
Prompt Engineering
Prompt Engineering is the strategic design and refinement of input prompts given to large language models (LLMs) or other generative AI systems. By carefully specifying context, constraints, and desired output formats, prompt engineering enables users to guide these models to...
Pre-training
Pre-training is the process of training a machine learning model on a large, general-purpose dataset before adapting it to a specific task. By learning broad patterns, structures, and representations from unlabeled or widely available data, a model develops a rich...
Post-training
Post-training refers to the stage that follows the initial model training process, focusing on refining, optimizing, and preparing the model for deployment in real-world scenarios. After a model has been trained—often through pre-training and fine-tuning—it may still benefit from additional...
Perplexity
In the context of natural language processing (NLP) and machine learning, perplexity is a metric used to evaluate the performance of language models. It measures how well a model predicts a sequence of words, with lower perplexity indicating better performance....
Open Source
Open source refers to software, datasets, or other resources that are made publicly available, allowing anyone to view, use, modify, and distribute the source code or content. Open source promotes collaboration, transparency, and innovation by fostering a community-driven approach to...
NLP
Natural Language Processing (NLP) is a field of artificial intelligence that focuses on enabling computers to understand, interpret, and generate human language. It combines linguistics, machine learning, and computational techniques to process and analyze natural language data, such as text...