Difference between revisions of "Alpha Centauri"
(→How Intel DAAL Works) |
(→How Intel DAAL Works) |
||
Line 30: | Line 30: | ||
Intel DAAL comes pre-bundled with Intel® Parallel Studio XE and Intel® SystemStudio. It is also available as a stand-alone version and can be installed following these instructions [https://software.intel.com/en-us/get-started-with-daal-for-linux instructions].<br/> | Intel DAAL comes pre-bundled with Intel® Parallel Studio XE and Intel® SystemStudio. It is also available as a stand-alone version and can be installed following these instructions [https://software.intel.com/en-us/get-started-with-daal-for-linux instructions].<br/> | ||
Intel DAAL is a simple and efficient solution to solve problems related to Big Data, Machine Learning, and Deep Learning.<br/> | Intel DAAL is a simple and efficient solution to solve problems related to Big Data, Machine Learning, and Deep Learning.<br/> | ||
− | The reasoning behind that is because it handles all the complex and tedious algorithms for you and | + | The reasoning behind that is because it handles all the complex and tedious algorithms for you and software developers only have to worry about feeding the Data and follow the Data Analytics Ecosystem Flow. |
Intel DAAL provides many functions for Data Management, Algorithms, and Services.<br/> | Intel DAAL provides many functions for Data Management, Algorithms, and Services.<br/> | ||
Revision as of 17:28, 16 December 2017
"The world is one big data problem." -cit. Andrew McAfee
Contents
Intel Data Analytics Acceleration Library
Team Members
Introduction
Intel Data Analytics Acceleration Library, also known as Intel DAAL, is a library created by Intel in 2015 to solve problems associated with Big Data. Intel DAAL is available for Linux, OS X and Windows platforms and it is available for the C++, Python, and Java programming platforms. Intel DAAL is optimized to run on a wide range of devices ranging from home computers to data centers and it uses Vectorization to deliver best performances.
Intel DAAL helps speed big data analytics by providing highly optimized algorithmic building blocks for all data analysis stages and by supporting different processing modes.
The data analysis stages covered are:
- Pre-processing
- Transformation
- Analysis
- Modeling
- Validation
- Decision Making
The different processing modes are:
- Batch processing - Data is stored in memory and processed all at once.
- Online processing - Data is processed in chunks and then the partial results are combined during the finalizing stage. This is also called Streaming.
- Distributed processing - Similarly to MapReduce Consumers in a cluster process local data (map stage), and then the Producer process collects and combines partial results from Consumers (reduce stage). Developers can choose to use the data movement in a framework such as Hadoop or Spark, or explicitly coding communications using MPI.
How Intel DAAL Works
Intel DAAL comes pre-bundled with Intel® Parallel Studio XE and Intel® SystemStudio. It is also available as a stand-alone version and can be installed following these instructions instructions.
Intel DAAL is a simple and efficient solution to solve problems related to Big Data, Machine Learning, and Deep Learning.
The reasoning behind that is because it handles all the complex and tedious algorithms for you and software developers only have to worry about feeding the Data and follow the Data Analytics Ecosystem Flow.
Intel DAAL provides many functions for Data Management, Algorithms, and Services.
This picture shows the Intel DAAL Model.
This picture shows the Data Flow in Intel DAAL.
This picture shows how Intel DAAL processes the data.