Learn, Build, Deploy
2023-12-31
The views expressed in this talk are those of the author
and do not necessarily reflect the official policy or position of the
If you want to learn more about Machine Learning and AI in Dayton, please attend a GemCity Machine Learning meetup.
We meet every third Thursday at 6pm as part of GemCity Tech meetup group.
GemCity TECH’s mission is to grow the local industry and the community by providing a centralized destination for technical training, workshops and providing a forum for collaborating.
Currently, support several special interest groups from a variety of technical disciplines.
This presentation will cover the following topics:
The learning outcomes are as follows:
AI is more than Large Language Models (LLM)
LLM: Large Language Models: aka Chat GPT, Claude AI, Bard, Gemini
Transformers:
GPT: Generative Pre-trained Transformers
ML Algorithms learn via supervised or unsupervised learning.
Supervised learning is when you have labels for each data point.
Unsupervised learning is when it “learns” something about the data, but do not have the data truth.
Data | Truth (Label) | Data | Truth (Label) |
---|---|---|---|
![]() |
cat | ![]() |
dog |
x 100’s more Labeled images
Data | Data |
---|---|
![]() |
![]() |
x 500’s more images
Find features that separate the data into groups / clusters. Then hope there is a small number of labeled features so that we can classify those groups.
Labeled data is really expensive. Sometimes “truth” can not be obtained.
Say there is a town with two employers (A and B).
Predict if person works for company A or B, based on where they live.
First stab: See if people are clustered around their work place.
So if we had a clustering algorithm we can predict where a person might work.
K-means is the most common clustering algorithm.
K-means clusters, n data points (e.g. All your data) into k clusters by placing each data point to the nearest k.
Example: point p is 5 ft from \(k_i\) and 10 feet from \(k_j\). K-means would then place point p into the \(k_i\) group. Once, all the points are placed into a cluster or group, the new means for each cluster (\(k_i\) and \(k_j\)) are updated. The location for each mean cluster k is calculated by finding the mean from all the points that belong to that cluster.
Transfer Learning can allow you to use what is learned from a large dataset to your smaller (refined) data similar to the larger model.
Sarkar (2018)
To learn more go to TensorFlow Transfer Learning Tutorial
We are going to build a model that classifies between two (or more) classes.
The label or category:
A category into which something is put.
Tries to predict the correct label (class) of a given input data.
The model is trained using the training data, and then it is evaluated on test data.
Say you are tasked to group monkeys into two classes:
Image from Huffington Post.
Class_1: Biting | Class_2: Non Biting |
---|---|
card c | card d |
card a | card b |
… | … |
What Rule did you use to determine which class the Monkey belongs to?
Does your model hold up?
In this monkey example, we (human) could infer the model (set of rules). Software Engineering is when a human makes a rule that the machine follows.
Most Machine learning classifiers, the “rules” of the model are unknown to us. We only get the output of the machine assigned classes.
Machine learning is when the machine creates the rules for the machine to follow.
There are four basic steps to building a model
Your model, app etc will not work the first time.
We are going to train a ML model to classify sounds.
WITHOUT any coding.
If you want to take this to the classroom.
I have a No Code ML Tutorial for Elementary School Children
Gem City Tech ML/AI