Machine Learning Lecture Five

Слайд 2

Main article: Sparse dictionary learning
Sparse dictionary learning is a feature learning method

Main article: Sparse dictionary learning Sparse dictionary learning is a feature learning
where a training example is represented as a linear combination of basis functions, and is assumed to be a sparse matrix. The method is strongly NP-hard and difficult to solve approximately.[57] A popular heuristic method for sparse dictionary learning is the K-SVD algorithm. Sparse dictionary learning has been applied in several contexts. In classification, the problem is to determine the class to which a previously unseen training example belongs. For a dictionary where each class has already been built, a new training example is associated with the class that is best sparsely represented by the corresponding dictionary. Sparse dictionary learning has also been applied in image de-noising. The key idea is that a clean image patch can be sparsely represented by an image dictionary, but the noise cannot.[58]

Слайд 3

Anomaly detection
Main article: Anomaly detection
In data mining, anomaly detection, also known as

Anomaly detection Main article: Anomaly detection In data mining, anomaly detection, also
outlier detection, is the identification of rare items, events or observations which raise suspicions by differing significantly from the majority of the data.[59] Typically, the anomalous items represent an issue such as bank fraud, a structural defect, medical problems or errors in a text. Anomalies are referred to as outliers, novelties, noise, deviations and exceptions.[60]

Слайд 4

In particular, in the context of abuse and network intrusion detection, the

In particular, in the context of abuse and network intrusion detection, the
interesting objects are often not rare objects, but unexpected bursts of inactivity. This pattern does not adhere to the common statistical definition of an outlier as a rare object, and many outlier detection methods (in particular, unsupervised algorithms) will fail on such data unless it has been aggregated appropriately. Instead, a cluster analysis algorithm may be able to detect the micro-clusters formed by these patterns.[61]

Слайд 5

Three broad categories of anomaly detection techniques exist.[62] Unsupervised anomaly detection techniques

Three broad categories of anomaly detection techniques exist.[62] Unsupervised anomaly detection techniques
detect anomalies in an unlabeled test data set under the assumption that the majority of the instances in the data set are normal, by looking for instances that seem to fit least to the remainder of the data set. Supervised anomaly detection techniques require a data set that has been labeled as "normal" and "abnormal" and involves training a classifier (the key difference to many other statistical classification problems is the inherently unbalanced nature of outlier detection). Semi-supervised anomaly detection techniques construct a model representing normal behavior from a given normal training data set and then test the likelihood of a test instance to be generated by the model.

Слайд 6

Robot learning
In developmental robotics, robot learning algorithms generate their own sequences of

Robot learning In developmental robotics, robot learning algorithms generate their own sequences
learning experiences, also known as a curriculum, to cumulatively acquire new skills through self-guided exploration and social interaction with humans. These robots use guidance mechanisms such as active learning, maturation, motor synergies and imitation.

Слайд 7

Association rules
Main article: Association rule learning
See also: Inductive logic programming
Association rule learning

Association rules Main article: Association rule learning See also: Inductive logic programming
is a rule-based machine learning method for discovering relationships between variables in large databases. It is intended to identify strong rules discovered in databases using some measure of "interestingness".[63]

Слайд 8

Rule-based machine learning is a general term for any machine learning method

Rule-based machine learning is a general term for any machine learning method
that identifies, learns, or evolves "rules" to store, manipulate or apply knowledge. The defining characteristic of a rule-based machine learning algorithm is the identification and utilization of a set of relational rules that collectively represent the knowledge captured by the system. This is in contrast to other machine learning algorithms that commonly identify a singular model that can be universally applied to any instance in order to make a prediction.[64] Rule-based machine learning approaches include learning classifier systems, association rule learning, and artificial immune systems.

Слайд 9

Based on the concept of strong rules, Rakesh Agrawal, Tomasz Imieliński and

Based on the concept of strong rules, Rakesh Agrawal, Tomasz Imieliński and
Arun Swami introduced association rules for discovering regularities between products in large-scale transaction data recorded by point-of-sale (POS) systems in supermarkets.[65] For example, the rule {\displaystyle \{\mathrm {onions,potatoes} \}\Rightarrow \{\mathrm {burger} \}}\{{\mathrm {onions,potatoes}}\}\Rightarrow \{{\mathrm {burger}}\} found in the sales data of a supermarket would indicate that if a customer buys onions and potatoes together, they are likely to also buy hamburger meat. Such information can be used as the basis for decisions about marketing activities such as promotional pricing or product placements. In addition to market basket analysis, association rules are employed today in application areas including Web usage mining, intrusion detection, continuous production, and bioinformatics. In contrast with sequence mining, association rule learning typically does not consider the order of items either within a transaction or across transactions.