Career Profile

Data scientist, software architect, and big data engineer working in operations research, cloud solutions, big data, optimization, statistical inference, agent-based modelling and simulation, and machine learning. Backend architect and full-stack engineer.

Experience

APPLIED SCIENTIST

2018 - Present
Amazon

Work within the NSI-Science team at the Amazon Selection and Catalog System

SOFTWARE ARCHITECT

2017 - 2018
Whiz Home

Designed and implemented web and mobile e-commerce solutions using Spring Cloud, Magento, React Native, React, AngularJS, WebSocket, and Elasticsearch.

SOFTWARE ARCHITECT

2016 - 2017
CLEF

System architect responsible for the design and implementation of CLEF's web applications and distributed computing platform, built using Spring Cloud, Spring Boot, Apache Spark with GraphX, Hadoop, Apache Mesos, and MariaDB Cluster.

SOFTWARE ARCHITECT

2015 - 2017
Meme Analytics Pte Ltd

Managing director and chief technology architect of Meme Analytics Pte Ltd, specializing in data analytics, machine learning, simulation and optimization, and web and mobile applications. Worked on intelligent decision support, analytics, and optimization systems that support large-scale real-world engineering problems with complex dynamics through modeling and simulation. Offered consultancy as well as customized system implementation and integration of turnkey solutions for clients.

DATA SCIENTIST

2015 - 2017
Neuro10

Data scientist at Neuro10, specializing in big-data machine learning for log and time series analytics. Responsible for designing and implementing big-data machine learning infrastructure as well as algorithms for anomaly detection, text mining, natural language processing, and time series predictive modeling. Actively involved in the development of Java-based software solutions using Apache Spark, Mesos, Akka, Cassandra, Kafka, and HDFS (Hadoop Distributed File System).

RESEARCH FELLOW

2013 - 2014
Nanyang Technological University

Researcher working on machine learning and agent-based simulations, actively involved in various industrial projects related to Machine Learning, Data Mining, Supply Chain Optimization, Agent-Based Modeling, etc.

PROJECT OFFICER

2011 - 2012
Nanyang Technological University

Researcher at NTU. Main areas of research included meta-heuristics, continuous and combinatorial optimization, memetic computation, vehicle routing, and supply chain risk management. Research work involved genetic programming, multi-objective optimization, robust search, time series forecasting, agent-based modeling, neural networks, Gaussian processes, and PID and MPC controllers.

TECHNICAL CONSULTANT

2009 - 2017
CodeZone Pte Ltd

Software developer and technical consultant at CodeZone, specializing in delivering custom solutions for scheduling and optimization as well as simulation applications.

SOFTWARE DEVELOPER

2007 - 2007
Autodesk Singapore Pte Ltd

Software developer for office tool automation using VBA. The automation included a code generator that dynamically generates and injects VBA code into Excel based on instructions retrieved from a database.

SOFTWARE DEVELOPER

2006 - 2006
Agilent Technologies Singapore Pte Ltd

Software developer for office tool automation using VBA, as well as for Web 2.0 application development using ASP.NET. The office tool automation included automatically processing data from various sources such as Excel and Access, compiling the results into auto-generated PowerPoint presentations, and sending them via Outlook automation. The Customer Satisfaction Dashboard is a web app I designed and implemented to automatically retrieve and process customer satisfaction data and send individually customized reports to various levels of management, based on their roles and the tasks involved.

Inventions

“Large Engineering Supply Chain Adaptive System (LesCaS)”, Technical Disclosure & commercialization through NTU NIEO, TD/193/12, 1st Inventor
AIRDesigner, Technical Disclosure & commercialization through NTU NIEO, TD/185/14, 3rd Inventor
“Crowd Simulation Plugin (FAME)”, Research Commercialization Via Deployable APIs, TD/03713, 3rd Inventor

Journal Publications

L. Feng, Y. S. Ong, X. S. Chen and C. Chen, “Conceptual Modeling of Evolvable Local Searches in Memetic Algorithms using Linear Genetic Programming: A Case Study on Capacitated Vehicle Routing Problem”, Soft Computing Journal, No. 9, 2016.
X. S. Chen, “An algorithm development environment for problem-solving: software review”, Memetic Computing Journal, Vol. 4, No. 2, pp. 149-161, 2012.
X. S. Chen, L. Feng and Y. S. Ong, “Self-Adaptive Memeplexes Robust Search Scheme for solving Stochastic Demands Vehicle Routing Problem”, International Journal of Systems Science, Vol. 43, No. 7, pp. 1347-1366, 2012.
X. S. Chen and Y. S. Ong, “A Conceptual Modeling of Meme Complexes in Stochastic Search”, IEEE Transactions on Systems, Man and Cybernetics, Part C, Vol. 42, No. 3, 2012.
X. S. Chen, Y. S. Ong, M. H. Lim and K. C. Tan, “A Multi-Facet Survey on Memetic Computation”, IEEE Transactions on Evolutionary Computation, Vol. 15, No. 5, pp. 591-607, 2011.
X. S. Chen, Y. S. Ong and M. H. Lim, “Cooperating Memes for Robust Vehicle Routing”, International Journal of Innovative Computing, Information and Control, Vol. 7, No. 11, pp. 6483-6506, 2011.
Y. S. Ong, M. H. Lim and X. S. Chen, “Research Frontier: Memetic Computation - Past, Present & Future”, IEEE Computational Intelligence Magazine, Vol. 5, No. 2, pp. 24-36, 2010.
K. K. Lim, Y. S. Ong, M. H. Lim, X. S. Chen and A. Agarwal, “Hybrid ant colony algorithms for path planning in sparse graphs”, Soft Computing, Vol. 12, No. 10, pp. 981-994, 2008.

Conference Publications

X. S. Chen, Liang Feng, Meng Hiot Lim, Caishun Chen, Choon Sing Ho. “Towards Believable Resource Gathering Behaviors in Real-time Strategy Games with a Memetic Ant Colony System”, In The Asia Pacific Symposium of Intelligent and Evolutionary Systems, 2013.
X. S. Chen, Yew Soon Ong, Puay Siew Tan, NengSheng Zhang, Zhengping Li. “Agent-Based Modeling and Simulation for Supply Chain Risk Management - A Survey of the State-of-the-Art”, In IEEE Systems, Man & Cybernetics Conference, 2013.
H. E. Huang, Y. S. Ong, C. S. Ho, X. S. Chen, “IGA Flock Brush for Non-Photorealistic Rendering”, In SEAL’12: Proceedings of the 9th International Conference on Simulated Evolution and Learning, Vol. 7673, pp. 480-490, 2012.
L. Feng, Y. S. Ong, A. H. Tan and X. S. Chen, “Towards Human-like Social Multi-agents with Memetic Automaton”, In IEEE Congress on Evolutionary Computation, pp. 1092-1099, 2011.
C. S. Ho, Q. H. Nguyen, Y. S. Ong, X. S. Chen, “Autonomous Multi-agents in Flexible Flock Formation”, In Proceedings of the Third International Conference on Motion in Games, pp. 375-385, 2012.
X. S. Chen, M. H. Lim, and Y. S. Ong. “An Ant colony system algorithm for path planning in sparse graphs”, In ICIAS2007 (International Conference on Intelligent & Advanced Systems 2007), pp. 31-36, 2007.
X. S. Chen, M. H. Lim, and D. C. Wunsch II. “A memetic algorithm configured via a problem solving environment for the Hamiltonian cycle problems”, In IEEE Congress on Evolutionary Computation, pp. 2766-2773, 2007.

Open Source Contributions

Video demos of some of the projects I have done in my professional work:

My personal open source contributions can be found at the following links:

Projects

Real-Time Big-Data Data Analytics System

2016 - 2017

Designed and implemented big-data infrastructure for a rule-based data analytics platform with real-time association rule mining for job matching and company ranking.

  • AI implemented: big data expert system shell based on Spark, real-time distributed association rule miner
  • Programming Languages: Java, Scala, JavaScript
  • Technologies used: Azure, Spark, Redis, Hadoop, MariaDB cluster, Nginx, Snort, Spring framework, Spring Cloud, Angular, Gulp, Mesos, Ansible, Linux shell scripts and PowerShell automation scripts.

Real User Monitoring System

2016 - 2017

Designed and implemented a real user monitoring system that keeps track of users' real-time interactions with the web application and provides behavior analytics.

  • Programming Languages: Java, JavaScript
  • Technologies used: Graphite, Node.js, Spring Boot, AngularJS, WebSocket

Intrusion Detection System

2016 - 2017

Designed and implemented an intrusion detection and prevention system that provides a web view, as well as email and chat notifications, for both network-based and host-based intrusion detection.

  • Programming Languages: Java, JavaScript
  • Technologies used: Snort, iptables, Node.js, Ansible.

Order-Preserving Encryption System for MariaDB

2016 - 2017

Implemented an order-preserving encryption system for MariaDB that provides encryption both in motion and at rest, as well as support for crypto versioning.

  • Programming Languages: Java
  • Technologies used: Spring Data JPA, MariaDB.

Desktop Data Analytics and Visualization Tool

2016 - 2017

Developed a desktop tool that allows users to analyse data from Excel, CSV, or databases using various machine learning algorithms.

  • AI implemented: Decision Tree, Genetic Programming, Clustering, SVM, ANN, GLM, MA
  • Programming Languages: C#
  • Technologies used: Telerik, Winforms, D3

Large-scale Distributed ABMS Platform for Labour Market

2016 - 2016

Designed and developed a large-scale distributed agent-based modelling and simulation platform for the labour market using Akka.NET, with visualization via 3D simulation and modelling in Unity3D, as well as desktop and web visualization.

  • AI implemented: distributed agent-based modelling and simulation based on distributed computational framework
  • Programming Languages: C#
  • Technologies used: Akka.NET, Consul, MS SQL Compact, Telerik, Winforms, Nancy, ASP.NET, Unity3d

Real-Time Big-Data Machine Learning and AI System

2015 - 2017

Designed and implemented a big-data machine learning solution for log analytics and time series prediction. The work involved designing and implementing anomaly detection, text mining, natural language processing, time series predictive modeling, and graph mining.

  • AI implemented: topic modelling (online LDA implementation on both Spark and Akka), Word Collocation, NER, Document Categorizer, Word2Vec, Time Series Forecasting, Non-parametric Regression Analysis, Document Clustering, Isolation Forest Anomaly Detection, LOF Anomaly Detection, Deep Feedforward Neural Network
  • Programming Languages: Java, Scala, JavaScript
  • Technologies used: Spark, GraphX, Redis, Zookeeper, Hadoop, Kafka, Cassandra, ElasticSearch, Akka, Spring framework, Angular, Node.js, DeepLearning4j, Mesos, Ansible, Shell scripts

Mobile OCR Application

2013 - 2014

Designed and implemented iOS and Android mobile applications using OCR (Optical Character Recognition) and the Canny edge detection algorithm for detecting objects and text in video. Implemented fuzzy text matching for corrupted content obtained from OCR.

  • AI implemented: association rule mining, FP-Growth, sequential association pattern mining
  • Programming Languages: Objective-C, Java, PHP, JavaScript
  • Technologies used: iOS, Android, AngularJS, jQuery, Xcode, JavaFX, Spring framework (Spring MVC, Spring Data JPA, Spring Security, Jersey), Bootstrap, Jasper Reporting and Dynamic Report, Tesseract, OpenCV

Network Analytics and Anomaly Detection

2013 - 2014

Developed an intelligent network management application for network analysis and anomaly detection. Currently supported features:

  • Algorithms implemented: (1) centrality measurement, including eigenvector centrality, closeness, betweenness, and degree; (2) community finding with the Walktrap algorithm; (3) link anomaly detection algorithm; (4) community-based graph differential anomaly visualization; (5) data capture and monitoring via Fluke’s Network Time Machine; (6) graph models; (7) simulation and monitoring
  • Programming Languages: C#
  • Technologies used: Fluke’s Network Time Machine, PostgreSQL, Windows Presentation Foundation (WPF), Telerik, MySQL, Windows Communication Foundation (WCF)

Energy Management Simulator and Home Automation Kit

2013 - 2014

Developed an intelligent discrete-event multi-agent simulator, with intelligent multi-agent control, for simulating power consumption and occupant comfort in smart building engineering. The system includes space location modelling using a surface-extraction-from-planar-graph algorithm; discrete event simulation for emulating the IO devices and power units in the building; a simulation model for energy consumption; a model and measurements for occupant comfort; parameter sweeping features that allow users to simulate various scenarios in multi-dimensional feature spaces; GUI interaction including floor plan design and settings for power utilization and occupant comfort; a library/script interface for user interaction and for external modules to be built on top of the simulator; graph-based visualization of state transitions in Markov Chain Monte Carlo simulation; and a unit testing and logging module. The system also serves as a home automation kit that interfaces with electrical and other home appliances as well as sensor units for real-time monitoring and control.

  • Algorithms implemented: ARMA, NSGA-II, surface extraction from planar graph
  • Programming Languages: C#
  • Technologies used: Windows Presentation Foundation (WPF), Telerik, Windows Communication Foundation (WCF), Raspberry Pi, emonCMS, X10

Vehicle Scheduler Library

2013 - 2014

Designed and implemented a framework and platform for solving various vehicle routing problem (VRP) variants, with 16 solvers including memetic algorithms, genetic algorithms, hyper-heuristics, simulated annealing, and constructive heuristics. Each solver is capable of solving one or more VRP variants (or hybrids of them), such as capacitated VRP, VRP with time windows, VRP with backhaul, VRP with pickup and delivery, multiple depot VRP, VRP with stochastic demands, heterogeneous VRP, and dynamic VRP with time windows. The platform supports routing on a map-based viewer, parallel processing, simulation, and algorithm performance analytics. The system is designed to run both as a standalone desktop application and as an enterprise web service, with clients in various environments (web browser, Excel, Java, C#, etc.) accessing the online Vehicle Routing Scheduler Library solvers.

  • Algorithms implemented: memeplex-based memetic computing, hyper-heuristics, constructive heuristics, trajectory-based optimization (tabu search and simulated annealing)
  • Programming Languages: C#
  • Technologies used: Winforms, GMap.NET

Distributed Race Solution for IPICO Readers

2013 - 2014

Designed and developed a race tracking and reporting solution from scratch. The software records participants' race results in real time via connections to IPICO Lite and Elite readers, which form a WAN with a command center through a master-slave network structure. The timing data is processed and streamed to a web site in real time for race participants to view.

  • Algorithms implemented: Huffman compression, priority queue based on binary heap, left-leaning red-black binary search tree
  • Programming Languages: C#, PHP
  • Technologies used: Winforms, GMap.NET

National Strategy Analysis and Proposition System using GP and EA

2013 - 2014

Developed a national strategy analysis and proposition system using a machine learning approach based on Genetic Programming and Evolutionary Algorithms, together with data analytics and time series prediction algorithms such as ANN, RBF, and ARMA.

  • Algorithms implemented: K-means clustering, genetic programming, genetic algorithm, ANN prediction, ARMA, RBF
  • Programming Languages: C#, JavaScript
  • Technologies used: Winforms, WebKit

3D CAD Tool for sculpting meshes using self-adaptive topology

2013 - 2014

Designed and implemented a polygon mesh processing system that allows users to sculpt polygon meshes in real time. The system implements self-adaptive topology for freely sculpting 3D meshes, with several additional improvements: integration with other remeshing and model repairing algorithms, management of multiple unorganized cross-sections, and an offline filter version of the system for integration with other polygon mesh processing tools.

  • Algorithms implemented: Self-adaptive topology algorithm, remeshing algorithm, mesh processing techniques
  • Programming Languages: C++
  • Technologies used: GLUT, GLEE, OpenGL