Necessary additional metadata that is applied to training data to provide the meaning necessary to train supervised learning machine learning models. Data labels are specific to the type of data and the required purpose or output of the machine learning system. Data labels can either be applied by people who manually use their knowledge to apply the right labels or by systems that can infer the label based on previously trained supervised or semi-supervised approaches.