Introduction of NiFi

img

Brief of NiFi

Apache NiFi is a software project from the Apache Software Foundation designed to automate the flow of data between software systems.

Software development and the commercial support is currently offered by Hortonworks (now merged into Cloudera), who acquired NiFi’s originator, Onyara Inc.

Apache NiFi supports powerful and scalable directed graphs of data routing, transformation, and system mediation logic. Some of the high-level capabilities and objectives of Apache NiFi include:

  • Web-based user interface
    • Seamless experience between design, control, feedback, and monitoring
  • Highly configurable
    • Loss tolerant vs guaranteed delivery
    • Low latency vs high throughput
    • Dynamic prioritization
    • Flow can be modified at runtime
    • Back pressure
  • Data Provenance
    • Track dataflow from beginning to end
  • Designed for extension
    • Build your own processors and more
    • Enables rapid development and effective testing
  • Secure
    • SSL, SSH, HTTPS, encrypted content, etc…
    • Multi-tenant authorization and internal authorization/policy management

For more information about the usage of NiFi:

https://nifi.apache.org/docs/nifi-docs/html/getting-started.html

Components of NiFi

  1. Processor

    Providing abundant processors for processing distinct data types, connecting processors or other data processing.

    Different purposes of the processor:

    • Data Ingestion Processor such as GetXXX
    • Data Transformation Processor
    • Data Egress/Sending Processor such as PutXXX
    • Routing and Mediation Processor
    • Databases Access Processor
    • Attribute Extraction Processor
    • System Integration Processor
    • Splitting and Aggregation Processor
    • Http and UDP Processor
    • AWS Processor
  2. Processor Group

    Combining disparate processors into one group for easy management.

    2019-12-06 at 13.49