Apache Flume Course Overview
About Apache Flume
Apache Flume is a distributed and reliable system for efficiently collecting, aggregating, and moving large amounts of log or event data from many sources to a centralized data store like MapR Data Platform.
Flume agents ingest incoming streaming data from one or more sources, including avro, thrift, exec, JMS, netcat, and syslog. Data ingested by a Flume agent is passed to a sink, which is most commonly a distributed file system like Hadoop. Multiple Flume agents can be connected together for more complex workflows by configuring the source of one agent to be the sink of another.
Apache Flume Training Curriculum
Overview, , Architecture, , Data flow mode, , Reliability and Recoverability
Setting up an agent
Configuring individual components, Wiring the pieces together, Data ingestion
Setting multi-agent flow
Consolidation, Multiplexing the flow, Configuration, Defining the flow, Configuring individual components, Adding multiple flows in an agent
Configuring a multi agent flow
Fan out flow, Flume Sources, Avro Source, Exec Source, NetCat Source, Sequence Generator Source, Syslog Sources, Syslog TCP Source, Syslog UDP Source, Legacy Sources, Avro Legacy Source, Thrift Legacy Source, Custom Source
HDFS Sink, Logger Sink, Avro Sink, IRC Sink, File Roll Sink, Null Sink, HbaseSinks, HbaseSink, AsyncHBaseSink, Custom Sink
Memory Channel , JDBC Channel , Recoverable Memory Channel , File Channel , Pseudo Transaction Channel , Custom Channel , Flume Channel Selectors , Replicating Channel Selector , Multiplexing Channel Selector , Custom Channel Selector
Flume Sink Processors
Default Sink Processor , Failover Sink Processor, Load balancing Sink Processor , Custom Sink Processor
Timestamp Interceptor , Host Interceptor , Flume Properties , Property
Monitoring , Troubleshooting, Handling agent failures, Compatibility, HDFS, AVRO
Average Apache Flume Salary in USA is increasing and is much better than other products.
12 July, 2020