Skip to content

Instantly share code, notes, and snippets.

@wujcheng
Forked from tsusanto/flume_gcp.confg
Created August 26, 2018 00:45
Show Gist options
  • Save wujcheng/8edcc1348aecb58af95c8346fc218ba2 to your computer and use it in GitHub Desktop.
Save wujcheng/8edcc1348aecb58af95c8346fc218ba2 to your computer and use it in GitHub Desktop.

Revisions

  1. @tsusanto tsusanto created this gist Jan 30, 2018.
    30 changes: 30 additions & 0 deletions flume_gcp.confg
    Original file line number Diff line number Diff line change
    @@ -0,0 +1,30 @@
    tier1.sources = filename-source1
    tier1.channels = channel1
    tier1.sinks = hdfs-sink1

    tier1.channels.channel1.type = file
    tier1.channels.channel1.checkpointDir = /mnt/data/tenny/flume/checkpoint
    tier1.channels.channel1.dataDirs = /mnt/data/tenny/flume/data
    tier1.channels.channel1.capacity = 1000000
    tier1.channels.channel1.transactionCapacity = 10000


    tier1.sources.filename-source1.type = spooldir
    tier1.sources.filename-source1.channels = channel1
    tier1.sources.filename-source1.spoolDir = /mnt/data/tenny/gcp_test
    tier1.sources.filename-source1.fileHeader = true
    tier1.sources.filename-source1.fileSuffix = .DONE
    tier1.sources.filename-source1.basenameHeader = true


    tier1.sinks.hdfs-sink1.type = hdfs
    tier1.sinks.hdfs-sink1.hdfs.path = gs://uploadhd01/gcp_test
    tier1.sinks.hdfs-sink1.brokerList = s1n:9092,s2:9092,s3:9092
    tier1.sinks.hdfs-sink1.channel = channel1
    tier1.sinks.hdfs-sink1.hdfs.useLocalTimeStamp = true
    tier1.sinks.hdfs-sink1.hdfs.rollCount = 5000
    tier1.sinks.hdfs-sink1.hdfs.rollInterval = 10
    tier1.sinks.hdfs-sink1.hdfs.rollSize = 10000000
    tier1.sinks.hdfs-sink1.hdfs.fileType = DataStream
    tier1.sinks.hdfs-sink1.hdfs.writeFormat = Text
    tier1.sinks.hdfs-sink1.hdfs.filePrefix = %{basename}