Jupyter for Data Science Teams Training Course
Jupyter is an open-source, web-based interactive IDE and computing environment.
This instructor-led, live training (online or onsite) introduces the concept of collaborative development in data science and demonstrates how to use Jupyter to track and participate as a team in the "life cycle of a computational idea". It guides participants through the creation of a sample data science project built on the Jupyter ecosystem.
By the end of this training, participants will be able to:
- Install and configure Jupyter, including the creation and integration of a team repository on Git.
- Use Jupyter features such as extensions, interactive widgets, multiuser mode and more to enable project collaboration.
- Create, share and organise Jupyter Notebooks with team members.
- Choose from Scala, Python, R, to write and execute code against big data systems such as Apache Spark, all through the Jupyter interface.
Format of the Course
- Interactive lecture and discussion.
- Plenty of exercises and practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- The Jupyter Notebook supports over 40 languages including R, Python, Scala, Julia, etc. To customize this course to your language(s) of choice, please contact us to arrange.
Course Outline
Introduction to Jupyter
- Overview of Jupyter and its ecosystem
- Installation and setup
- Configuring Jupyter for team collaboration
Collaborative Features
- Using Git for version control
- Extensions and interactive widgets
- Multiuser mode
Creating and Managing Notebooks
- Notebook structure and functionality
- Sharing and organising notebooks
- Best practices for collaboration
Programming with Jupyter
- Choosing and using programming languages (Python, R, Scala)
- Writing and executing code
- Integrating with big data systems (Apache Spark)
Advanced Jupyter Features
- Customizing Jupyter environment
- Automating workflows with Jupyter
- Exploring advanced use cases
Practical Sessions
- Hands-on labs
- Real-world data science projects
- Group exercises and peer reviews
Summary and Next Steps
Requirements
- Programming experience in languages such as Python, R, Scala, etc.
- A background in data science
Audience
- Data science teams
Open Training Courses require 5+ participants.
Jupyter for Data Science Teams Training Course - Booking
Jupyter for Data Science Teams Training Course - Enquiry
Jupyter for Data Science Teams - Consultancy Enquiry
Testimonials (1)
It is great to have the course custom made to the key areas that I have highlighted in the pre-course questionnaire. This really helps to address the questions that I have with the subject matter and to align with my learning goals.
Winnie Chan - Statistics Canada
Course - Jupyter for Data Science Teams
Provisional Upcoming Courses (Require 5+ participants)
Related Courses
Introduction to Data Science and AI using Python
35 HoursThis course explores practical applications of Data Science and Artificial Intelligence leveraging Python. It empowers professionals with the necessary skills to analyse data, construct machine learning models, and implement AI-driven solutions within business environments. Key topics include CRISP-DM workflows, statistical analysis, supervised and unsupervised learning, deep learning with Tensorflow, natural language processing, big data management using Spark, and data-driven storytelling. It is an ideal choice for beginners pursuing a Python data science certification and career-focused analytics training.
Apache Airflow for Data Science: Automating Machine Learning Pipelines
21 HoursThis instructor-led, live training in Australia (online or onsite) is aimed at intermediate-level participants who wish to automate and manage machine learning workflows, including model training, validation, and deployment using Apache Airflow.
By the end of this training, participants will be able to:
- Set up Apache Airflow for machine learning workflow orchestration.
- Automate data preprocessing, model training, and validation tasks.
- Integrate Airflow with machine learning frameworks and tools.
- Deploy machine learning models using automated pipelines.
- Monitor and optimize machine learning workflows in production.
Anaconda Ecosystem for Data Scientists
14 HoursThis instructor-led, live training in Australia (online or onsite) is designed for data scientists who wish to use the Anaconda ecosystem to capture, manage, and deploy packages and data analysis workflows in a single platform.
Upon completion of this training, participants will be able to:
- Install and configure Anaconda components and libraries.
- Gain a clear understanding of Anaconda’s core concepts, features, and benefits.
- Manage packages, environments, and channels using Anaconda Navigator.
- Utilise Conda, R, and Python packages for data science and machine learning applications.
- Explore practical use cases and techniques for managing multiple data environments.
AWS Cloud9 for Data Science
28 HoursThis instructor-led, live training in Australia (online or onsite) is aimed at intermediate-level data scientists and analysts who wish to use AWS Cloud9 for streamlined data science workflows.
By the end of this training, participants will be able to:
- Set up a data science environment in AWS Cloud9.
- Perform data analysis using Python, R, and Jupyter Notebook in Cloud9.
- Integrate AWS Cloud9 with AWS data services like S3, RDS, and Redshift.
- Utilize AWS Cloud9 for machine learning model development and deployment.
- Optimize cloud-based workflows for data analysis and processing.
Introduction to Google Colab for Data Science
14 HoursThis instructor-led, live training in Australia (online or onsite) is aimed at beginner-level data scientists and IT professionals who wish to learn the basics of data science using Google Colab.
By the end of this training, participants will be able to:
- Set up and navigate Google Colab.
- Write and execute basic Python code.
- Import and handle datasets.
- Create visualizations using Python libraries.
Data Science essential for Marketing/Sales professionals
21 HoursThis course is designed for marketing and sales professionals who wish to deepen their understanding of data science applications within these fields. It offers comprehensive coverage of various data science techniques utilised for upselling, cross-selling, market segmentation, branding, and Customer Lifetime Value (CLV).\n
Distinctions Between Marketing and Sales - How do sales and marketing differ?
In simple terms, sales focuses on individuals or small groups. Marketing, conversely, targets larger audiences or the general public. Marketing involves research (identifying customer needs), product development (creating innovative solutions), and promotion (via advertising) to build consumer awareness. Essentially, marketing generates leads or prospects. Once a product is in the market, it becomes the salesperson's responsibility to persuade customers to make a purchase. Sales converts leads into orders, whereas marketing is oriented towards long-term goals, while sales focuses on shorter-term objectives.
Kaggle
14 HoursThis instructor-led live training in Australia (online or in-person) is designed for data scientists and developers who wish to learn and build their careers in Data Science using Kaggle.
By the end of this training, participants will be able to:
- Learn about data science and machine learning.
- Explore data analytics.
- Learn about Kaggle and how it works.
Data Science with KNIME Analytics Platform
21 HoursThe KNIME Analytics Platform is a leading open-source solution for driving data-based innovation. It empowers you to uncover hidden potential within your data, extract fresh insights, and forecast future trends. Featuring over 1,000 modules, hundreds of ready-to-run examples, a comprehensive suite of integrated tools, and the broadest selection of advanced algorithms, KNIME Analytics Platform serves as the ideal toolkit for both data scientists and business analysts.
This course offers an excellent opportunity for beginners, advanced users, and KNIME experts to become familiar with KNIME, learn to utilise it more effectively, and develop clear, comprehensive reports based on KNIME workflows.
This instructor-led live training (available online or onsite) is designed for data professionals aiming to leverage KNIME to address complex business requirements.
It targets participants who may not have programming experience but wish to utilise cutting-edge tools to implement analytics scenarios.
Upon completion of this training, participants will be able to:
- Install and configure KNIME.
- Build Data Science scenarios
- Train, test and validate models
- Implement the end-to-end value chain for data science models
Format of the Course
- Interactive lecture and discussion.
- Plenty of exercises and practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request customized training for this course or to learn more about this program, please contact us to arrange.
Machine Learning for Data Science with Python
21 HoursThis instructor-led, live training in Australia (online or onsite) targets intermediate-level data analysts, developers, or aspiring data scientists keen on leveraging machine learning techniques in Python to extract insights, make predictions, and automate data-driven decisions.
Upon completing this course, participants will be able to:
- Grasp and distinguish between key machine learning paradigms.
- Investigate data preprocessing techniques and model evaluation metrics.
- Implement machine learning algorithms to address real-world data challenges.
- Utilise Python libraries and Jupyter notebooks for practical development.
- Construct models for prediction, classification, recommendation, and clustering.
Introduction to Pre-trained Models
14 HoursThis instructor-led, live training in Australia (online or onsite) is designed for professional beginners who aim to grasp the concept of pre-trained models and learn how to apply them to solve real-world problems without constructing models from the ground up.
Upon completion of this training, participants will be able to:
- Comprehend the concept and advantages of pre-trained models.
- Explore various pre-trained model architectures and their specific use cases.
- Fine-tune a pre-trained model for particular tasks.
- Implement pre-trained models in straightforward machine learning projects.
Python Programming for Finance
35 HoursPython has surged in popularity within the financial sector, adopted by major investment banks and hedge funds to build diverse financial applications, from core trading systems to risk management tools.
Through this instructor-led live training, participants will learn to leverage Python to develop practical applications that address specific finance-related challenges.
Upon completing this course, participants will be able to:
- Grasp the fundamental concepts of the Python programming language
- Download, install, and manage the optimal development tools for building financial applications in Python
- Choose and employ the most appropriate Python packages and programming techniques to organise, visualise, and analyse financial data from various sources (such as CSV, Excel, databases, and web interfaces)
- Develop applications that resolve issues related to asset allocation, risk analysis, investment performance, and more
- Troubleshoot, integrate, deploy, and optimise Python applications
Audience
- Developers
- Analysts
- Quantitative analysts (Quants)
Course Format
- A blend of lectures, discussions, exercises, and extensive hands-on practice
Note
- This training aims to provide solutions for key problems faced by finance professionals. If you have a specific topic, tool, or technique you would like to add or explore further, please contact us to arrange accordingly.
GPU Data Science with NVIDIA RAPIDS
14 HoursThis instructor-led, live training in Australia (online or on-site) is designed for data scientists and developers who wish to use RAPIDS to build GPU-accelerated data pipelines, workflows, and visualizations, applying machine learning algorithms such as XGBoost, cuML, etc.
Upon completion of this training, participants will be able to:
- Configure the required development environment to build data models using NVIDIA RAPIDS.
- Gain an understanding of RAPIDS' features, components, and benefits.
- Utilise GPUs to speed up end-to-end data and analytics pipelines.
- Implement GPU-accelerated data preparation and ETL processes using cuDF and Apache Arrow.
- Learn to perform machine learning tasks using XGBoost and cuML algorithms.
- Create data visualizations and execute graph analysis using cuXfilter and cuGraph.