Senior Data Scientist
CME Group is the world's leading and most diverse derivatives marketplace. But who we are goes deeper than that. Here, you can impact markets worldwide. Transform industries. And build a career shaping tomorrow. We invest in your success and you own it, all while working alongside a team of leading experts who inspire you in ways big and small. Joining our company gives you the opportunity to make a difference in global financial markets every day, whether you work on our industry-leading technology and risk management services, our benchmark products or in a corporate services area that helps us serve our customers better. We're small enough for you and your contributions to be known. But big enough for your ideas to make an impact. The pace is dynamic, the work is unlike any other firm in the business, and the possibilities are endless. Problem solvers, difference makers, trailblazers. Those are our people. And we're looking for more.
To learn more about what a career at CME Group can offer you, visit us at www.wherefuturesaremade.com . Senior Data Scientist The Team
The Data Science team's mission is to provide analytics and insights relating to the economic markets, clients, and products in which CME Group operates. The team is responsible for data science, machine learning, analytics, generating insights, and developing data products and tools. The team brings diverse skill sets together (statistics, derivatives expertise, ingenuity, coding and technical skills) in a highly collaborative, collegial manner. The team is one of the key thought leadership groups at the Exchange and serves business leadership to identify and promote significant market and client opportunities. Job Description:
We are looking for a highly intelligent person who desires autonomy to explore cutting edge cloud-based machine learning to achieve our objectives. We try new things, fail fast, and ultimately deliver valuable insight through the combination of collaboration, new technology, and algorithms. Analyze patterns and correlations among derivatives products, clients, and other data to generate actionable insight. Perform sophisticated, large-scale (cloud-based) analytics, highly scalable machine learning model development, artificial intelligence, data operations, cloud and high performance computing. Brainstorm, design, and implement internal and external data products and applications. Work with IT teams to help operationalize repeatable analysis, tools, and applications.
One example of our work is: www.cmegroup.com/price-action-alerts.html Responsibilities:
Develop applications in machine learning, time series and neural networks to create new data products for the derivatives financial market participants. Use machine learning to create web analytics such as optimizing user journeys leading to product adoption, search ranking, text/sentiment classification enabling user experience personalization. Gather and process both structured and unstructured data at scale (including handling data across multiple formats, creating scripts, data pipeline, schdulers on cloud platform). Work closely with business partners to understand the requirements where data driven solutions are applicable. Visualize results in Tableau and present analytics to high level business leaders. Capable of investigating, familiarizing and mastering new data quickly and combining multiple data sets together (transaction, CRM, and website data) Identify opportunities to leverage big data, cloud based compute and datawarehouse. Stay current with leading edge systems, methods, and best practices for data science, analytics, machine learning and data infrastructure. Education & Experience
- BS/BA in Computer Science or Engineering
- MS or progress towards MS in Computer Science or Engineering
- A minimum of 2 years of experience in programming, machine learning model development
- 1+ year of experience with data operations on Cloud Services (e.g. AWS, Google Cloud, or Microsoft Azure)
- Combination of technical/quantitative and business acumen is strongly preferred
- Strong programming skills and experience using statistics, machine learning and neural networks.
- Strong programming skills in Python (Numpy, SciPy, Pandas, and scikit-learn) and any one of the major object-oriented programming language (C++, Rust, Scala, C# etc.)
- Expertise in data visualizations using packages - Tableau, PowerBI, Matplotlib, or Seaborn.
- Expertise in distributed computing frameworks - Hadoop, Spark, Flink, Ray etc.
- Expertise in highly scalable machine learning applications
- Expertise in applications that can leverage modern parallel environments (e.g. distributed clusters, multicore SMP, and GPU) and big data formats (e.g. Parquet, Arrow)
- Experience with major open source code base for data operations, machine learning, cloud computing
- Experience with AWS, GCP or other similar cloud services
- Experience working with version control tool - Git, Stash, Bitbucket
- Strong communication and organizational skills
- Knowledge of Commodity and Financial Derivatives Markets, especially in futures and options trading is recommended