Data Science Technologist / Sr. Data engineer

Location: Boston, MA, United States
Date Posted: 06-21-2018
Job Title :Data Science Technologist / Sr. Data engineer
Location  : Boston, MA​.



Position Purpose:
The purpose of this position includes, but is not limited to, strategically designing, developing and implementing a unified and integrated big data ecosystem that ultimately supports the delivery of real time advanced analytical and machine learning products and services to our clients. This position will work closely with our data science as well as data engineering team to ensure that this newly developed ecosystem can house raw and transformed data from various source databases, maintain metadata repositories and enable faster delivery of data for consumption by various BI tools. The intent of this position is to build a Big Data platform that can be leveraged to augment the effectiveness of our ongoing engagements while continuously expanding and building new machine learning based products and services for our clients.

Requirements:
  • Expertise in Java/J2EE and big data technologies like Hadoop , Apache Spark and Hive is required. Must have applied these skills continuously in the last 2-3 years.
  • Strong hands-on experience in Spark streaming with Apache Kafka is a MUST for this position.
  • Good knowledge in Python skills and preferably have used Machine learning in solving data science problems in at least 1-2 projects.
  • Must be able to work closely with data science team to ensure there is proper integration of data science tools and techniques within the big data ecosystem. This will include but not limited to data ingestion, transformation, model building & validation, visualization and real time data streaming.
  • Must be able to continuously assess and deploy new big data technologies within the framework of the BDE with an aim to continuously improve our service delivery model.
Qualifications:
  • Work experience as a system, data or information architect for a minimum of 5-10 years.
  • Hands-on experience with data architecting and business requirements gathering/analysis
  • Direct experience in implementing big data management processes, procedures, and decision support.
  • Strong understanding of relational data structures, theories, principles, and practices.
  • Hands-on knowledge of enterprise repository tools, data modeling tools, data mapping tools, and data profiling tools.
  • Ability to manage data and metadata migration
  • Experience with database platforms, including MySQL, ORACLE, MS SQL Server
  • Proficiency and proven experience in programming languages like JAVA, J2EE, UI and server side programming. Good knowledge of web based technologies.
  • Experience with Cloudera Big Data Technologies Hadoop, Spark, Hive, Kafka, Pig, Sqoop, etc. is vital for this position. This should include configuration and deployment of required components and other infrastructure in development, testing and production environment.
  • Experience with Agile methodology preferably in managing technical projects as a scrum master.
  • Experience in python programming language using Spark.
  • Industry experience: Finance/banking industry is preferred but not a must.
  • Education: Technical degree like Engineering/Computer science/IT is preferred.

The above are only minimum skillset and any additional relevant knowledge and experiences in Big Data technologies and visualization tools like Tableau will be a definite plus for this position.




Central Business Solutions, Inc,
37600 Central Ct.
Suite #214
Newark, CA 94560.
Central Business Solutions, Inc(A Certified Minority Owned Organization)
Checkout our excellent assessment tool: http://www.skillexam.com/
Checkout our job board : http://www.job-360.net/
=====================================================
Central Business Solutions, Inc
37600 Central Court Suite 214 Newark CA, 94560
Phone: (510)-713-9900, 510-573-5500 Fax: (510)-740-3677
Web: http://www.cbsinfosys.com
=====================================================
or
this job portal is powered by CATS