We love analogies

Deep Learning is not magic, it’s more like gardening!

You are the gardener; the seeds are the algorithms we have already built for you; the data you provide are the nutrients and the Deep Learning Models are the beautiful plants that come out of all of this.

Capice handles the heavy lifting for you

Some concepts you’ll need to know

Every new approach to problems invariably has it’s own terminology. AI Deep Learning is no different, but the base set of terminology is thankfully not too large. You do not have to know any of this in depth. For general purposes, it helps to align to a common set of terms.

We will cover these in more detail below. Capice Machine Learning is used to solve for these two broad business needs. What you need to know is that Classification handles the question of “what is this?” and prediction handles the question of “what is the likely outcome”.

This is your only input to our model creation. It can be images, audio, text or a set of numbers. The essential inputs are the data and the result for each piece of data. Also known as “ground truth”. Your dat and your business model is what makes you unique.  So all training is with your data.  ‘Generic’ AI models are worthless.

For instance, you provide an image of a dog and contain the result, that this image is a dog. Do this for as much data you can pull together. The idea is that given some images the system can recognize any dog.

You only provide a relatively small set of training data to teach the system.

Example data for some of business problems and their training data set:

  • Classify customer input – text containing the input and the actual result
  • Classify negative feedback – text containing the input and the actual result
  • Classify type of part on the conveyor line – images of various part types and what those parts are
  • Classify how well a machine is running – set of sounds for machine running well, and for machine not running well
  • Predict if data center is running well – a set of metrics from around the data center like utilization, heat, network traffic
  • Predict the price of a house given 100 factors – those 100 factors and the actual house price for some set of houses

Training is the step you take to run the data you have provided against one of Capice’s machine learning algorithms to create a trained model. Like training a pet. There is nothing to doing this. Just click “Run Training” and off it goes.

An AI model is the output of training.

It is the large set of factors derived by the algorithms that can be run again and again to classify or predict. This is the repeatable use of machine learning. The beauty is that a model can classify with a high degree of accuracy things it has never seen before. Just like a human that has never seen a particular Ferrari before, can certainly classify it as a car — and without too much trouble,  a Ferrari as well.

Problems We Help You With

Image Classification

Identify ‘what is this object or picture or piece’. An image model can have many classifications of answers. For example, “is this a happy person, sad, angry, thoughtful, distressed”, “does this part look acceptable”, “is this fruit ripe or not ripe”, “what type of butterfly is this” “what model year is this car”.. Or “does this text string mean they have issues with web, customer service, shipping, …”

Audio Classification

Identify from an audio recording ‘what am I hearing’…. there can be many classes of answers. For example, is this a “the number 1, the number 2, …”, “is this a jackhammer, is this a truck, is this a bird …”, “does the machine that making parts sound OK”. Anything that makes a sound can be classified.

Text & Documents Classification

Identify ‘what does this text mean’. A machine learning text model can have many classifications of answers. “Is it spam”, “is a positive review or negative or in between”, “is customer reporting a problem and if so what department should receive this”, “does this email mean the customer has issues with web, customer service, or shipping”.  Do this for smaller pieces of text like sentences or even entire documents like news articles.

Binary Classification

Binary classification is used to determine if the new data is ‘in’ or ‘out’ of the trained model. Typically used when there is a large amount of good data (nuclear power plant readings, network health checks and metrics, call center metrics, fraud detection, …) but very little data outside the good band of data. The data points are all numeric in CSV format just like a spreadsheet.

Numeric

Prediction

Provide up to a boatload of related and unrelated numeric data (aka: features) then, the machine learning model will predict the result.

The inference is  to a set of classes you define.  For example, you might want to predict that person will “like or not like a movie” given their demographics (again – sky is the limit), or what is patient quality of life after surgery on a scale of 1 to 5.

Numeric

Classification

Provide up to a boatload of related and unrelated numeric data (aka: features) then, the machine learning model will infer the numeric result.

The inference is calculated as a continuous value.  Like a house price predictor.  You might provide features like rooms, square footage, garage size, age of house, …. (sky is the limit) then, the model will predict the price of any house.

Numeric

Missing Data

Many business processes and BPA requires 100% of the data fields to be filled in first.  They do not work well with partial data sets.  So what do you do if  some of your data is missing ?  Numeric Missing Data will estimate and plug in all missing data values from a single input or for an entire spreadsheet.

Chain

If you need one model to feed another use ‘chains’. For example, say you have a variety of parts coming down a conveyor and you need to determine good/bad for each part type. As opposed to one mammoth model, you can chain together models so model 1 determines a result then feeds in to another based on that result.

For this example, the first model is the part identifier. Which then feeds in to second set of models that is the good/bad identifier – one model per part.

This keeps your models simpler and separated.

How Capice Works

We want to present you what a typical interaction look like, it’s a simple pattern you will get used to in minutes

To make data input easy, we provide you with templates. If you will be training linear regression or text models, then use this link to download the simple CSV template file (open it with Excel or Mac Numbers). For audio and image models, you do not have to do this. In that case, just put your training data in directories where each directory name is a classification name, then create a zip file of those directories.

This is not always required, so it may be an infrequent operation for you

This is the meat and potatoes of machine learning. And is easy as clicking the Train button. It can take several minutes to create a model. It executes a lot of statistics and math to generate a new model for you. We will let you know when your model is finished on the Manage My Models tab.

This is a more frequent operation, as you identify and solve new business problems.

The cool part. Now, the machine learning model is ready to use through the Capice web application, Capice iPhone application or connecting trough the Capice APIs from one of your applications. You can also make your models public for others to use. That will share the model but, the data that created the model will never be shared with others.

This will be the most frequent operation of all.

Industry Templates

Since the Capice platform is so easy to use, has many types of AI deep learning algorithms, and has an Internet API interface, Capice can be used for a wide variety of industry solution.  We supply many industry temp[lates for contact centers, universities, human resources, ….  These are great starting points and can be changed or used as-is as you desire.

For numeric, spreadsheet style data, the Capice platform comes with a variety of industry templates for you to use.  Or create your own.

Image, video, audio, document and text don’t really need templates since they are straight-forward to train.

But spreadsheet data can have hundreds or even thousands of different feature columns.  For spreadsheet data, Capice does classification, binary classification. and continuous numeric prediction.  All using our massive deep learning models.  Very, very accurate results.