This book provides a comprehensive overview and introduction to Big Data Infrastructure technologies, existing cloud-based platforms, and tools for Big Data processing and data analytics, combining both a conceptual approach in architecture design and a practical approach in technology selection and project implementation. Readers will learn the core functionality of major Big Data Infrastructure components and how they integrate to form a coherent solution with business benefits. Specific attention will be given to understanding and using the major Big Data platform Apache Hadoop…mehr
This book provides a comprehensive overview and introduction to Big Data Infrastructure technologies, existing cloud-based platforms, and tools for Big Data processing and data analytics, combining both a conceptual approach in architecture design and a practical approach in technology selection and project implementation.
Readers will learn the core functionality of major Big Data Infrastructure components and how they integrate to form a coherent solution with business benefits. Specific attention will be given to understanding and using the major Big Data platform Apache Hadoop ecosystem, its main functional components MapReduce, HBase, Hive, Pig, Spark and streaming analytics. The book includes topics related to enterprise and research data management and governance and explains modern approaches to cloud and Big Data security and compliance.
The book covers two knowledge areas defined in the EDISON Data Science Framework (EDSF): Data Science Engineering and Data Management and Governance and can be used as a textbook for university courses or provide a basis for practitioners for further self-study and practical use of Big Data technologies and competent evaluation and implementation of practical projects in their organizations.
Dr. Juan José Associate Professor in the Department of Computer Science at the University of Alcalá, in the area of ¿¿Computer Science and Artificial Intelligence and Affiliate Associate Professor in the Department of Computer Science and Software Engineering, of the Faculty of Engineering and Computer Science, of the Concordia University, in Montreal, Canada. Previously, he was a professor at the Spanish Universities Universitat Oberta de Catalunya, in Barcelona, ¿¿from 2004 to 2016, the University of Valladolid, in Segovia, in 2004, and the Universidad Carlos III de Madrid, in Madrid, between 1997 and 2004. He has been Visiting Associate Professor, in the Department of Software and IT Engineering, of the École de Technologie Supérieure, at the Université du Québec à Montréal, in Montreal, Canada, from 2009 to 2015; and Visiting Professor, in the Postgraduate and Research section, of the Faculty of Administration and Management, of the National Polytechnic Institute, inMexico City, Mexico, from 2009 to 2014. He was also a researcher in the Department of Astrophysics and Atmospheric Sciences, from the Faculty of Physical Sciences, of the Complutense University of Madrid, in Madrid, Spain, from 1994 to 1997. Juan José has a degree in Physical Sciences from the Complutense University of Madrid, in 1994; obtained in Recognition of the research sufficiency in the Faculty of Physical Sciences of the Complutense University of Madrid, in 1997; and the Doctorate in Computer Engineering, at the Carlos III University of Madrid, in 2001, with the qualification of A "cum laude" unanimously by the court. It currently has 4 six-year periods and 3 five-year periods. In 2010, she obtained the Outstanding Research Pathway certification by the National Agency for Evaluation and Prospective (ANEP) of the Secretary of State for Universities and Research of the Ministry of Science and Innovation, within the program I3 Program, Incentive for the Incorporation andIntensification of Research Activity. Juan José has carried out research stays at the Universities: University of Amsterdam, Amsterdam, Holland, at the Informatics Institute, of the Faculty of Science, in 2018, funded by a mobility grant from the University of Alcalá; at the Otto-von-Guericke-University, Magdeburg, Germany, at the Institüt für Verteilte Systeme, de la Fakültat für Informatik, in 2013, funded by a mobility grant from the University of Alcalá, in 2012, within a sabbatical year granted by the University of Alcalá, in 2009, funded by a "José Castillejo" for further studies and research, from the University of Alcalá; at the Université du Québec à Montréal, in Montreal, Canada, in the Department of Software and IT Engineering, from the École de Technologie Supérieure, in 2006 and 2005; at the University of Reading, in Reading, United Kingdom, in the Computer Science Department, in 2005 and 2004; and the Università Roma Tre, in Rome, Italy, in the Dipartamento di Informatica e Automatizacione, in 2004 and 2003. Juan José is currently researching in the fields of Artificial Intelligence and Data Science. He has made more than 200 scientific publications, many of which have been in journals indexed in the JRC Science Edition. He has also participated, as principal investigator or researcher, in numerous research projects, both financed with public funding, both European, national, regional or university; as well as with private financing, through contracts made through article 83 of the University Law. He has also directed nine doctoral theses, all of them having received the highest qualification; and has participated in numerous doctoral courts, in Spain, Germany, and Mexico. He is also an External Evaluator of projects in Computer Science, of the Natural Sciences and Engineering Research Council of Canada since 2014 and Evaluator of the National Agency for Evaluation and Prospective, of the General Directorate of Scientific and
Inhaltsangabe
Chapter 1 Introduction. - Chapter 2 Big Data Technologies Foundation: Definition Reference Architecture use cases. - Chapter 3 Cloud Computing Foundation: Definition Reference Architecture Foundational Technologies Use cases. - Chapter 4 Cloud and Big Data Service Providers and Platforms. - Chapter 5 Big Data Algorithms MapReduce and Hadoop ecosystem.- Chapter 6 Streaming Analytics and Spark.- Chapter 7 Data Structures for Big Data Modern Big Data SQL and NoSQL Databases.-Chapter 8 Enterprise Data Governance and Management.- Chapter 9 Research Data Management.- Chapter 10 Big Data Security and Compliance Data Privacy Protection.- Chapter 11 Finding Data on the Web Data sets Web Scraping Web API.- Chapter 12 Data Science Projects Management DataOps MLOPs.- Chapter13 Data Science Projects Development with Amazon SageMaker.- Chapter 14 Data Validation for Data Science Projects.
Chapter 1 Introduction. - Chapter 2 Big Data Technologies Foundation: Definition, Reference Architecture, use cases. - Chapter 3 Cloud Computing Foundation: Definition, Reference Architecture, Foundational Technologies, Use cases. - Chapter 4 Cloud and Big Data Service Providers and Platforms. - Chapter 5 Big Data Algorithms, MapReduce and Hadoop ecosystem.- Chapter 6 Streaming Analytics and Spark.- Chapter 7 Data Structures for Big Data, Modern Big Data SQL and NoSQL Databases.-Chapter 8 Enterprise Data Governance and Management.- Chapter 9 Research Data Management.- Chapter 10 Big Data Security and Compliance, Data Privacy Protection.- Chapter 11 Finding Data on the Web, Data sets, Web Scraping, Web API.- Chapter 12 Data Science Projects Management,DataOps, MLOPs.- Chapter13 Data Science Projects Development with Amazon SageMaker.- Chapter 14 Data Validation for Data Science Projects.
Chapter 1 Introduction. - Chapter 2 Big Data Technologies Foundation: Definition Reference Architecture use cases. - Chapter 3 Cloud Computing Foundation: Definition Reference Architecture Foundational Technologies Use cases. - Chapter 4 Cloud and Big Data Service Providers and Platforms. - Chapter 5 Big Data Algorithms MapReduce and Hadoop ecosystem.- Chapter 6 Streaming Analytics and Spark.- Chapter 7 Data Structures for Big Data Modern Big Data SQL and NoSQL Databases.-Chapter 8 Enterprise Data Governance and Management.- Chapter 9 Research Data Management.- Chapter 10 Big Data Security and Compliance Data Privacy Protection.- Chapter 11 Finding Data on the Web Data sets Web Scraping Web API.- Chapter 12 Data Science Projects Management DataOps MLOPs.- Chapter13 Data Science Projects Development with Amazon SageMaker.- Chapter 14 Data Validation for Data Science Projects.
Chapter 1 Introduction. - Chapter 2 Big Data Technologies Foundation: Definition, Reference Architecture, use cases. - Chapter 3 Cloud Computing Foundation: Definition, Reference Architecture, Foundational Technologies, Use cases. - Chapter 4 Cloud and Big Data Service Providers and Platforms. - Chapter 5 Big Data Algorithms, MapReduce and Hadoop ecosystem.- Chapter 6 Streaming Analytics and Spark.- Chapter 7 Data Structures for Big Data, Modern Big Data SQL and NoSQL Databases.-Chapter 8 Enterprise Data Governance and Management.- Chapter 9 Research Data Management.- Chapter 10 Big Data Security and Compliance, Data Privacy Protection.- Chapter 11 Finding Data on the Web, Data sets, Web Scraping, Web API.- Chapter 12 Data Science Projects Management,DataOps, MLOPs.- Chapter13 Data Science Projects Development with Amazon SageMaker.- Chapter 14 Data Validation for Data Science Projects.
Es gelten unsere Allgemeinen Geschäftsbedingungen: www.buecher.de/agb
Impressum
www.buecher.de ist ein Internetauftritt der buecher.de internetstores GmbH
Geschäftsführung: Monica Sawhney | Roland Kölbl | Günter Hilger
Sitz der Gesellschaft: Batheyer Straße 115 - 117, 58099 Hagen
Postanschrift: Bürgermeister-Wegele-Str. 12, 86167 Augsburg
Amtsgericht Hagen HRB 13257
Steuernummer: 321/5800/1497
USt-IdNr: DE450055826