For over a decade, he has worked for several start-ups in Silicon Valley and Raleigh, North Carolina, in the area of search and analytics. Stefan Will is a computer scientist with a degree in machine learning and pattern recognition from the University of Bonn, Germany. Michael is a father of three, and besides work, he spends most of his time with his family and coaching youth softball. He is currently a development leader for Conversant, where he maintains Flume flows of nearly 100 billion log lines per day. He has also worked on the mission-critical medical device software, e-commerce, transportation, navigation, and advertising domains. He has worked as a software engineer, coding almost exclusively in Java since JDK 1.1. Michael Keane has a BS in computer science from the University of Illinois at Urbana-Champaign. He also actively writes about enterprise application development on his blog ( ). He has contributed code to Apache Camel and developed plugins for Spring Social, which can be found at GitHub ( ). Sachin has a lot of interest in open source projects. He graduated in computer science from the University of Greenwich, London, and currently works for a global consulting company, developing enterprise applications using various open source technologies, such as Apache Camel, ServiceMix, ActiveMQ, and ZooKeeper. Sachin Handiekar is a senior software developer with over 5 years of experience in Java EE development. Their dedication to family and education above all else guides me daily as I attempt to help my own children find their happiness in the world. I also want to give a big thanks to my parents, Alan and Karen, for molding me into the somewhat satisfactory human I've become. My terrific children, Rachel and Noah, are a constant reminder that hard work does pay off and that great things can come from chaos. I couldn't ask for a better friend daily by my side. She puts up with a lot, and that is very much appreciated. I'd again like to dedicate this updated book to my loving and supportive wife, Tracy. More information on Steve can be found at and on Twitter at is the first update to Steve's first book, Apache Flume: Distributed Log Collection for Hadoop, Packt Publishing. He is currently a senior principal engineer at Orbitz Worldwide ( ). Steve holds a BS in computer engineering from the University of Illinois at Urbana-Champaign and an MS in computer science from DePaul University. For the last 5 years, he has focused on infrastructure as code, including automated Hadoop and HBase implementations and data ingestion using Apache Flume. Steve Hoffman has 32 years of experience in software development, ranging from embedded software development to the design and implementation of large-scale, service-oriented, object-oriented systems. However, Packt Publishing cannot guarantee the accuracy of this information. Packt Publishing has endeavored to provide trademark information about all of the companies and products mentioned in this book by the appropriate use of capitals. Neither the author, nor Packt Publishing, and its dealers and distributors will be held liable for any damages caused or alleged to be caused directly or indirectly by this book. However, the information contained in this book is sold without warranty, either express or implied. No part of this book may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, without the prior written permission of the publisher, except in the case of brief quotations embedded in critical articles or reviews.Įvery effort has been made in the preparation of this book to ensure the accuracy of the information presented. Index Apache Flume: Distributed Log Collection for Hadoop Second EditionĪll rights reserved. There Is No Spoon – the Realities of Real-time Distributed Data Collection Setting up a better user interface – Kibanaĩ. Tiered data collection (multiple flows and/or agents)Īn overview of the Flume configuration fileĬonfiguring log rotation to the spool directoryĬreating more search fields with an interceptor Interceptors, channel selectors, and sink processors The problem with HDFS and streaming data/logs Support files, eBooks, discount offers, and more Apache Flume: Distributed Log Collection for Hadoop Second Edition
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |