Terms & Definitions
We will be using the following nomenclature throughout this documentation:
Term
Definition
AI
Artificial Intelligence: the ability of machines to perform tasks that are typically associated with human intelligence, such as learning and problem-solving. The traditional goals of AI research include reasoning, knowledge representation, planning, learning, natural language processing, perception, and support for robotics.
ML
Machine Learning: an umbrella term for solving problems for which development of algorithms by human programmers would be cost-prohibitive, and instead the problems are solved by helping machines 'discover' their 'own' algorithms, without needing to be explicitly told what to do by any human-developed algorithms. As a scientific endeavour, machine learning grew out of the quest for AI.
Dataset
A collection of related, discrete items of related data that may be accessed individually or in combination or managed as a whole entity.
ML Algorithm
A mathematical method to find patterns in a set of data.
ML Model Training
The process of running an ML algorithm on a dataset (called training data) and optimising the algorithm to find certain patterns or outputs. The resulting function with rules and data structures is called the trained machine learning model.
ML Model
A computer program that is used to recognise patterns in data or make predictions. ML models are created from ML algorithms, which are trained using either labelled, unlabelled, or mixed data.
Data Labelling
or Data Annotation
A part of the preprocessing stage when developing an ML model. It requires the identification of raw data (i.e., images, text files, videos), and then the addition of one or more labels to that data to specify its context for the models, allowing the machine learning model to make accurate predictions.
Agent
An actor in the data value chain that is responsible for processing data.
Human Agent
A natural person that solves data processing (labelling) tasks as a part of the crowdsourcing community.
AI Agent
A computer program or system that is designed to solve data processing (labelling) autonomously, based on an ML model that has been previously trained using data labelled by Human Agents.
FL
Federated Learning: a sub-field of Machine Learning focusing on settings in which multiple entities (often referred to as clients) collaboratively train an ML model while ensuring that their data remains decentralised.
Last updated