clusterization
background

Clustering is a fundamental machine learning task of dividing data into groups with similar properties and without known class labels in a training dataset. Clustering is often performed in the exploratory data analysis phase to get a better intuition about the structure of the dataset, or as a preliminary step for more complicated models.

business challenges

The goal was to recognize and match heterogeneous data from different sources in different formats



value delivered

We've introduced a two-step parallelized algorithm which performed fast clusterization of given data with very high confidence score. Overall presented algorithm was able to speed up data processing by a factor of 10.



approach

A high parallelized complex algorithm was developed with embedded RNN, CNN and DNN architectures for different types of media. Various metrixs were defined based on DTW path, Euclidian and cosine distances. Bloom filters were applied to get final results.


you may interested in other

case studies

Smart
compression

view details

View more

Anomaly
detection

view details

View more

Large Scale
Analytics

view details

View more

Predictive
Analytics

view details

View more


Tensorflow

view details

View more

Biometric
Identification

view details

View more

our expertise in

AI technologies

data mining

PCA
K-means
Decision trees
Linear models
PageRank

digital signal processing

Digital filters
DTW

machine learning

Deep learning
Probabilistic graphical models
CART
ensembles
unsupervised sound segmentation
recurrent models
bayesian approach
probabilistic programming
hmm

image processing and
computer vision

alexnet
vgg
vae

natural language
processing

PCA
TF-IDF
LDA
SVM
Naive bayes
word2vec
attention models

are you ready to see your software project getting real?

contact us


about us

Hi, we are Sciforce - a company where the integration of various branches of science builds up a powerful force to create robust software solutions. Working at the intersection of Computer Science with other technical, natural and humanitarian sciences let us go beyond traditional IT services and become both technical and scientific forces to our customers.