top of page
Image by Paul Jarvis

Wanderson souza

PhD, AI DEVELOPER & DATA SCIENTIST
  • Curriculum Vitae
  • github
  • Gmail

Hi, I'm Wanderson.

​I am a data scientist, Ph.D in Mechanical Engineering, Master and B.S. in Computer Science. I have advanced knowledge in the whole machine learning stack such as: Libraries (NumPy, Pandas, PySpark, scikit-learn, SciPy, NLTK, TensorFlow, Keras, PyTorch, Matplotlib), Container and virtual environments (Docker Container – VirtualEnv – Anaconda), Programming Language, development and versioning environment (Python, SQL, Git, Jupyter Notebook), DBMS (MySQL, MongoDB and PostgresSQL).

Currently I work as a data scientist at NeoPTO, a technology company focused on developing innovative solutions in the field of process automation for patent creation and maintenance using artificial intelligence. Such processes include the classification and automatic analysis of patents. We have recently developed a system in the information retrieval area that assists in the semantic search process for patents based on the evolution of trends passed by the user.

On a daily basis, I always try to update me and refine the maximum knowledge related to new architectures, performance improvements and network optimization during long hours of training. I am an open source software enthusiast and most productivity management tools. I intend to develop applications using machine learning and deep learning models, whether in recognition activities, natural language processing, or computer vision applied to embedded or non-embedded systems; I am also interested in researching new methods and strategies to improve network performance.

0.jpeg
EXPERIENCE
EXPERIENCE

Latest experiences

cropped-neopto-1.png
2022

Data Scientist

NEOPTO

I currently work as a data scientist at the NeoPTO Intellectual Property. We are developing a system in the information retrieval area that assists in the semantic search process for patents based on the evolution of trends passed by the user.

2021

Data Scientist

LACINA

I worked as a data scientist at the Applied Intelligence and Computation Lab (LACINA). We have developed a multilabel supervised system for hierarchical patent classification with more than 270,000 classes.

1614184720055.jpeg
Logo_GPICEEMA_edited.jpg
2020

Researcher and Machine Learning Engineer

GPICEEMA (INSTRUMENTATION AND CONTROL GROUP IN ENERGY AND ENVIRONMENT STUDY)

I worked as a machine learning engineer and researcher at the Solar Energy laboratory at the Center for Alternative and Renewable Energies at UFPB and I am part of the Instrumentation and Control Group in Energy and Environment Studies (GPICEEMA) developing solutions with computer vision.

2015-2019

Machine Learning Specialist

METROREC / CBTU-JP

I worked between 2015 and 2019 as a researcher and machine learning specialist in the partnership between UFPB and METROREC / CBTU-JP. Throughout this period, I developed the VisionBerry project, in which we developed a computer vision system to estimate the multidirectional flow of passengers in subway transportation using embedded devices.

ifpb.png
2017-2019

Research and Professor

IFPB (FEDERAL INSTITUTE OF PARAÍBA)

I was a professor of computer science and scientific methodology, teaching classes on programming basics, advanced Excel, and operating systems for technical education classes.

EDUCATION
EDUCATION
2015-2019

Doctor of Sciences

UFPB (FEDERAL UNIVERSITY OF PARAÍBA)

I am a D.Sc in Mechanical Engineering. Throughout this period I developed a passenger estimation system using deep neural networks. In October 2017, the results of the VisionBerry project earned me participation in the Empreenda Santander contest in the technological innovation category and I reached the semi-final.

2011-2013

Master of Science

2004-2009

Bachelor of Science

UEPB (STATE UNIVERSITY OF PARAÍBA)

B.S. in Computer Science at the State University of Paraíba

SKILLS
SKILLS
binary-code.png
Development

- Experience with Python;

- Expertise on the full stack machine learning development libraries (NumPy, pandas, PySpark, matplotlib, scikit-image, SciPy, tensorflow, Keras and PyTorch) since 2015.

chart.png
Data analysis

Data analysis is an essential component for those working with machine learning. I use linear, non-linear and logistic regression models to predict results using real data, such as estimating a country's GDP or even understanding the progress of a pandemic. I have several data science certificates MOOC's on Coursera.

engineer.png
Software engineering

I take use of design patterns, code review techniques, documentation, respecting the pep8 style guide and other important concepts to improve code readability and maintenance.

battery-with-a-bolt-symbol.png
Proactivity

I'm a proactive person, compromised and responsible with my work. Attributes that I believe that all developers must have in order to be a quality professional.

sport-team.png
Team work

I adapt well in teamwork. I use GitHub and GitLab for version control. I like to share the use of applications for productivity management.

mortarboard.png
High academic background

I have a high qualification, papers published in the area and teaching experience.

SPECIFIC SKILLS
  • MACHINE LEARNING MODELS

    • Supervised Learning

      • Linear and Non linear Regression

      • Logistic Regression

      • Support Vector Machine

      • Naive Bayes

      • Neural Networks

      • XGBoost

      • CatBoost

    • Unsupervised Learning​​​

      • HDBSCAN

      • K-Means

      • LDA

  • DEEP LEARNING ARCHITECTURE

    • VGGNet

    • ShallowNet

    • GoogleNet

    • AlexNet

    • SqueezeNet

    • ResNet

    • RetinaNet

    • Faster R-CNN

    • SSD

    • Mask R-CNN

    • YOLO - Real-Time Object Detection

  • LIBRARIES AND PACKAGES

    • OpenCV

    • matplotlib

    • scikit-image

    • scikit-learn

    • NLTK

    • SciPy

    • Numpy

    • Pandas

    • PySpark

    • BeautifulSoap

    • Tensorflow

    • Mxnet

    • Keras

    • PyTorch

  • METHODS TO IMPROVE PREDICTIONS

    • Data Augmentation

    • Ensemble

    • Fine-tuning Networks with Transfer Learning

    • Advanced Optimization Methods (Adagrad​, Adadelta, RMSprop, Adam, Nadam).

COURSES & CERTIFICATIONS
COURSES
Image by Kiyun Lee
  • Introduction to Artificial Intelligence by IBM

  • Python for Data Science and AI by IBM

  • Machine Learning with Python by IBM

  • Introduction to Deep Learning & Neural Networks with Keras by IBM

  • Deep Neural Networks with PyTorch by IBM

  • Building Deep Learning Models with TensorFlow by IBM

  • AI Capstone Project with Deep Learning by IBM

  • Math for Machine Learning by AWS

  • Natural Language Processing by Kaggle

  • Natural Language Processing with Classification and Vector Spaces by deeplearning.ai

  • Improving Deep Neural Networks: Hyperparameter tuning, Regularization and Optimization by deeplearning.ai

  • Structuring Machine Learning Projects by deeplearning.ai

  • Exploring and Preparing your Data with BigQuery by Google Cloud

  • Scrum Fundamentals Certified by SCRUMstudy

PROJECTS
CONTACT
CONTACT

Wanderson Souza

MACHINE LEARNING ENGINEER

Phone:

+55 (83) 98857-9083

E-mail:

wandersonsouza.info@gmail.com

  • github

Thanks for your submission

© 2020 por Wanderson Souza.

bottom of page