Voice Search
Voice Search

Introduction to Big Data Training - Classroom


Rated 9.3 out of 10 based on over 6784 reviews

Free course advice
Learn more about how we use your data

What is the Introduction to Big Data Training course?

Big data is a commonly used business term, which defines data sets that could become unmanageable due to rapid expansion. As a way to assist with this potential issue, Big Data movement includes a range of new tools and methods for storing information, which allows for efficient analysis and processing for informed business decision-making.

Through this Introduction to Big Data Course, you’ll learn to utilise big data analysis techniques and tools to adopt better business decision-making.

Available delivery methods for this course


Group booking

Group booking

Key features of the course

  • Delivery Method: Classroom
  • Exam: Included
  • Duration: 3 Days

Browse classroom dates below






Location Date Duration Price Availability
Live OnlineMay 20th 2020  3 Days£1,650 ex. VATAvailable
Enquire now
Book now
LondonJun 10th 2020  3 Days£1,650 ex. VATAvailable
Enquire now
Book now
Live OnlineJun 10th 2020  3 Days£1,650 ex. VATAvailable
Enquire now
Book now
LondonSep 2nd 2020  3 Days£1,650 ex. VATAvailable
Enquire now
Book now
Live OnlineSep 2nd 2020  3 Days£1,650 ex. VATAvailable
Enquire now
Book now

Is the Introduction to Big Data Training course right for me?

Understanding how to work with big data will help you and your business to better understand how to make better decisions with big data. If you’re working in an IT team or have been tasked with the responsibility of looking after your company’s data, this course would be ideal for you. 

Why Choose e-Careers?

e-Careers has partnered with The Learning Tree, to offer a range of courses, delivered via high-tech classrooms or virtual learning, depending on your requirement.

We are an award-winning, established eLearning course provider, with over 16 years’ experience in the industry.  We offer high-quality training courses at competitive prices.

What will I learn on this course?

Through this Introduction to Big Data Course, you’ll learn to utilise big data analysis techniques and tools to adopt better business decision-making and look at industry specific products, such as Hadoop training. Learn how to store data in a way which allows for efficient processing and analysis. Acquire the required skills to store, manage, process, and analyse large volumes of unstructured data, and create a suitable data lake.

Classroom-based Training 

 e-Careers were originally an online learning organisation but over time we’ve established additional learning methods, to provide our delegates with a variety of study options, including:

  • Bespoke training
  • Classroom-based training
  • On-site training
  • LiveOnline (virtual learning)

The Classrooms

Our classroom training centre is in London, Euston, conveniently located directly opposite Euston station, making transport and accessibility easier.

Our clean, high-tech classrooms provide a comfortable learning environment for our delegates, and we pride ourselves on providing a first-class training experience. You’ll notice this from your first steps in our London training centre, right through to your last day on the course, helping you to feel welcomed and comfortable. 

Each classroom has been designed to perfectly suit the courses being offered. For example, our Cyber Security classrooms come kitted out with a range of high specification PC’s (typically i7’s), with monitors for you to work through the practical assignments and an additional vertical screen to view your digital course materials.

Your instructor will use cutting-edge technology to ensure a high-quality learning experience for all delegates, including the latest annotation hardware and software.  

Alternatives to Classroom-based study

We understand that not every delegate has the same date availability or can’t make it to London, so we have created a range of suitable alternatives, including:  

  • LiveOnline – This is our virtual classroom option. Be a fully participating and integrated member of the classroom but from the comfort of your own home or office. We supply you with all the course materials required to fully participate with the class. 
  • eLearning – This is our Online/ Distance learning option. If a classroom or LiveOnline option are unsuitable for your requirements, we do offer a full online course option, where you can study at your own pace and in your own time.

Module 1 – Introduction to Big Data

  • Defining Big Data
    • The four dimensions of Big Data: volume, velocity, variety, veracity
    • Introducing the Storage, MapReduce and Query Stack
  • Delivering business benefit from Big Data
    • Establishing the business importance of Big Data
    • Addressing the challenge of extracting useful data
    • Integrating Big Data with traditional data

Module 2 – Sorting Big Data

  • Analysing your data characteristics
    • Selecting data sources for analysis
    • Eliminating redundant data
    • Establishing the role of NoSQL
  • Overview of Big Data stores
    • Data models: key value, graph, document, column–family
    • Hadoop Distributed File System
    • HBase
    • Hive
    • Cassandra
    • Hypertable
    • Amazon S3
    • BigTable
    • DynamoDB
    • MongoDB
    • Redis
    • Riak
    • Neo4J
  • Selecting Big Data stores
    • Choosing the correct data stores based on your data characteristics
    • Moving code to data
    • Implementing polyglot data store solutions
    • Aligning business goals to the appropriate data store

Module 3 – Processing Big Data

  • Integrating disparate data stores
    • Mapping data to the programming framework
    • Connecting and extracting data from storage
    • Transforming data for processing
    • Subdividing data in preparation for Hadoop MapReduce
  • Employing Hadoop MapReduce
    • Creating the components of Hadoop MapReduce jobs
    • Distributing data processing across server farms
    • Executing Hadoop MapReduce jobs
    • Monitoring the progress of job flows
  • The building blocks of Hadoop MapReduce
    • Distinguishing Hadoop daemons
    • Investigating the Hadoop Distributed File System
    • Selecting appropriate execution modes: local, pseudo–distributed and fully distributed
  • Handling streaming data
    • Comparing real–time processing models
    • Leveraging Storm to extract live events
    • Lightning–fast processing with Spark and Shark

Module 4 – Tools and techniques to analyse Big Data

  • Abstracting Hadoop MapReduce jobs with Pig
    • Communicating with Hadoop in Pig Latin
    • Executing commands using the Grunt Shell
    • Streamlining high–level processing
  • Performing ad hoc Big Data querying with Hive
    • Persisting data in the Hive MegaStore
    • Performing queries with HiveQL
    • Investigating Hive file formats
  • Creating business value from extracted data
    • Mining data with Mahout
    • Visualising processed results with reporting tools
    • Querying in real time with Impala

Module 5 – Developing a Big Data strategy for your organisation

  • Defining a Big Data strategy for your organisation
    • Establishing your Big Data needs
    • Meeting business goals with timely data
    • Evaluating commercial Big Data tools
    • Managing organisational expectations
  • Enabling analytic innovation
    • Focusing on business importance
    • Framing the problem
    • Selecting the correct tools
    • Achieving timely results

Module 6 – Implementing a Big Data solution

  • Selecting suitable vendors and hosting options
  • Balancing costs against business value
  • Keeping ahead of the curve

We’re trusted by

Individuals, small businesses and large corporations who have used e-Careers since 2001. Here are some names you’ll recognise:

Saatchi & Saatchi
American Express

Do you know someone who’d love this course? Tell them about it...