Company Info
Akraya Inc


Santa Clara, CA, United States

Phone:
Web Site: http://akraya.com



Data Engineer - 16-00291



Job ID:

14484

Location:

San Francisco, CA, United States 

Category:

Information-Technology

Job Views:

131

Zip Code:

94105

Employment Type:

Contract - Corp-to-Corp, Contract - W2

Posted:

02.18.2016

Key Skills:

Text Mining, NLP, Elasticsearch, Spark, Hive, Pig, MapReduce.

Job Description:

Data Engineer
Primary Skills:
Text Mining, NLP, Elasticsearch, Spark, Hive, Pig, MapReduce.
Location: San Francisco, CA
Duration: 6+ months
Contract Type: W2/C2C

 
Looking for a senior engineer to help build our next-generation data platform.
 
Specific Responsibilities:
  • Build a big data text mining and natural language processing framework.
  • Extract meaningful data from text and unstructured transcripts.
  • Develop text mining machine learning algorithms and data science solutions.
  • Build world-class, high-volume, real-time data ingestion frameworks and automate the ingestion of various data sources into Hadoop.
  • Research, develop, optimize, and innovate on frameworks and related components for enterprise-scale data analysis and computation.
  • Develop validation frameworks and proactive monitoring solutions to detect data ingestion failures in the big data platform and take appropriate remedial action.
  • Develop data adapters to ingest large volumes of unstructured, semi-structured, and structured data from various data sources and types.
  • Collaborate with people working on various technologies and ensure consistency of the data exposed through these different channels.
  • Own the end-to-end development life cycle, deliver high-quality solutions and code, and evangelize test-driven development (tests, code coverage, etc.).
  • Follow a customer-centric approach and ensure the solutions developed actually meet customer requirements.
Requirements:
  • 8+ years of demonstrable experience in requirements analysis, design, development, and testing of distributed, enterprise-class applications/platforms, with particular attention to scalability and high performance.
  • Experience with NLP, Elasticsearch, text mining, Spark, Hive, Pig, and MapReduce.
  • Strong object-oriented programming experience (Java/Python preferred).
  • Experience with NoSQL databases: HBase, MongoDB.
  • Knowledge of and experience with RDBMS, object-relational (O-R) mapping, and the application of distributed caching technologies.


© 2017 Powered by Rootjobs