Flume1.4學習問題記錄

一、red hat9 上安裝 flume-1.4.0 ,配置好環境變量:
vi /etc/profile
FLUME_HOME=/usr/flume-1.4.0
PATH = $PATH:$FLUME_HOME/bin
export FLUME_HOME PATH
進入:flume-1.4.0目錄
運行 bin/flume-ng agent -n a1 -c conf -f conf/agent1.conf -Dflume.root.logger=INFO, console
報bin/flume-ng有錯,報錯信息及解決方法以下:java

一、82行
/usr/flume-1.4.0/bin/flume-ng: line 82: conditional binary operator expected
/usr/flume-1.4.0/bin/flume-ng: line 82: syntax error near =~'<br/>/usr/flume-1.4.0/bin/flume-ng: line 82: if [[ $line =~ ^java.library.path=(.)$ ]]; then'
解決:
將:if [[ $line =~ ^java.library.path=(.
)$ ]]; then 修改成:if [[$line =~ "^java.library.path=(.)$" ]]; then
注: [[和$line之間不能有空格 ,^java.library.path=(.
)$用 "" 引發來; apache

二、 102行
bin/flume-ng: line 102: syntax error near unexpected token ('<br/>bin/flume-ng: line 102: if [[$PIECE =~ slf4j-(api|log4j12)..jar ]]; then'
解決:
將if [[$PIECE =~ slf4j-(api|log4j12).
.jar ]]; then修改成:if [[$PIECE =~ "slf4j-(api|log4j12)..jar" ]]; then
三、131行 ,和 82行同樣
四、151行
bin/flume-ng: line 151: syntax error near unexpected token ('<br/>bin/flume-ng: line 151: if [[$PIECE =~ slf4j-(api|log4j12).
.jar ]]; then'
解決:
將if [[$PIECE =~ slf4j-(api|log4j12)..jar ]]; then修改成:if [[$PIECE =~ "slf4j-(api|log4j12)..jar" ]]; thenapi

五、運行flume-ng 後臺出現信息:bin/flume-ng: line 102: [[/usr/hadoop-0.21.0: No such file or directory 能夠忽略.app

六、使用 agent1.sources.r1.type = netcat
報錯:Caused by: java.net.BindException: Address already in use
說明配置文件中配的端口被佔用 ,斷開ssh,從新鏈接便可。ssh

七、
錯誤記錄:
org.apache.flume.ChannelException: Space for commit to queue couldn't be acquired Sinks are likely not keeping up with sources, or the buffer size is too tightide

解決:設置agent1.channels.<channel_name>.keep-alive = 30oop

八、org.apache.flume.channel.file.BadCheckpointException: Configured capacity is 100000 but the checkpoint file capacity is 1000. See FileChannel documentation on how toui

最近flume運行不是很穩定,本次因爲hadoop不能寫入,致使flume報錯,Configured capacity is 100000000 but the checkpoint file capacity is 1000000,重啓flume後問題仍然存在。
1,詳細報錯以下:
22 Jan 2013 11:07:42,568 INFO [pool-7-thread-1] (org.apache.avro.ipc.NettyServer$NettyServerAvroHandler.channelClosed:209) - Connection to /10.4.203.176:60322 disconnected.
22 Jan 2013 11:07:44,617 ERROR [SinkRunner-PollingRunner-DefaultSinkProcessor] (org.apache.flume.SinkRunner$PollingRunner.run:160) - Unable to deliver event. Exception follows.
java.lang.IllegalStateException: Channel closed [channel=file_chan_1]. Due to java.lang.IllegalStateException: Configured capacity is 100000000 but the checkpoint file capacity is 1000000. See FileChannel documentation on how to change a channels capacity.
at org.apache.flume.channel.file.FileChannel.createTransaction(FileChannel.java:321)
at org.apache.flume.channel.BasicChannelSemantics.getTransaction(BasicChannelSemantics.java:122)
at org.apache.flume.sink.hdfs.HDFSEventSink.process(HDFSEventSink.java:385)
at org.apache.flume.sink.DefaultSinkProcessor.process(DefaultSinkProcessor.java:68)
at org.apache.flume.SinkRunner$PollingRunner.run(SinkRunner.java:147)
at java.lang.Thread.run(Thread.java:722)
Caused by: java.lang.IllegalStateException: Configured capacity is 100000000 but the checkpoint file capacity is 1000000. See FileChannel documentation on how to change a channels capacity.
at org.apache.flume.channel.file.EventQueueBackingStoreFile.<init>(EventQueueBackingStoreFile.java:80)
at org.apache.flume.channel.file.EventQueueBackingStoreFileV3.<init>(EventQueueBackingStoreFileV3.java:42)
at org.apache.flume.channel.file.EventQueueBackingStoreFactory.get(EventQueueBackingStoreFactory.java:67)
at org.apache.flume.channel.file.EventQueueBackingStoreFactory.get(EventQueueBackingStoreFactory.java:36)
at org.apache.flume.channel.file.Log.replay(Log.java:339)
at org.apache.flume.channel.file.FileChannel.start(FileChannel.java:271)
at org.apache.flume.lifecycle.LifecycleSupervisor$MonitorRunnable.run(LifecycleSupervisor.java:236)
at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)
at java.util.concurrent.FutureTask$Sync.innerRunAndReset(FutureTask.java:351)
at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:178)
at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$301(ScheduledThreadPoolExecutor.java:178)
at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1110)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:603)
... 1 morethis

2,故障緣由:
FileChannel使用了固定大小的checkpoint file,修改channel的能力,簡單的方法以下
1,關閉agent
2, 刪除或者備份checkpoint目錄,默認 該目錄在當前用戶的根目錄下,好比 root/.flume, 其中root爲當前用戶 ,.flume爲checkpoint目錄。
3,重啓flume agent(若是channel中有大量文件,會致使全面的延遲,建議先關閉數據源,等把file channel中的數據所有寫入sink後,等上2分鐘,等數據文件刪除後,再重啓channel
具體詳見參考資料這段話,這段意思未徹底讀明白,經過解決方法靠譜.net

The FileChannel actually uses a fixed size checkpoint file -- so it is not possible to set
it to unlimited size (the checkpoint file is mmap-ed to a fixed size buffer). To change the
capacity of the channel, the easiest way off the top of my head is:

  • Shutdown the agent.
  • Delete all files in the file channel's checkpoint directory. (not the data directories.
    Also you might want to move them out, rather than delete to be safe)
  • Change your configuration to increase the capacity of the channel.
  • Restart the agent - this will cause full replay, so the agent might take sometime to start
    up if there are a lot of events in the channel (to avoid this - shutdown the source before
    shutting the agent down - so the sink can drain out the channel completely, wait for about
    1-2 mins after the channel is empty so that the data files get deleted (this happens only
    immediately after a checkpoint - you can verify this by making sure each data dir has only
    2 files each), since all events have been sent out - so during restart the channel will be
    quite empty, with very little to replay).
    九、14/05/20 11:44:27 ERROR source.SpoolDirectorySource: Uncaught exception in Runnable
    java.lang.IllegalStateException: Serializer has been closed
    at org.apache.flume.serialization.LineDeserializer.ensureOpen(LineDeserializer.java:124)
    at org.apache.flume.serialization.LineDeserializer.readEvents(LineDeserializer.java:88)
    at org.apache.flume.client.avro.ReliableSpoolingFileEventReader.readEvents(ReliableSpoolingFileEventReader.java:221)
    at org.apache.flume.source.SpoolDirectorySource$SpoolDirectoryRunnable.run(SpoolDirectorySource.java:160)
    at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:441)
    at java.util.concurrent.FutureTask$Sync.innerRunAndReset(FutureTask.java:317)
    at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:150)
    at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$101(ScheduledThreadPoolExecutor.java:98)
    at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.runPeriodic(ScheduledThreadPoolExecutor.java:180)
    at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:204)
    at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
    at java.lang.Thread.run(Thread.java:662)

看了一下spool的目錄下的文件,發現有個文件已經處理了(後綴加上了.COMPLETED)了,而後目錄下還有一個文件跟這個是同名的即,目錄下存在:
123.log.COMPLETED和123.log兩個文件,就會報上訴錯誤。

十、
flume ng報錯File should not roll when commit is outstanding
執行flume ng,後臺日誌報錯
java.lang.IllegalStateException: File should not roll when commit is outstanding.
at org.apache.flume.client.avro.ReliableSpoolingFileEventReader.readEvents(ReliableSpoolingFileEventReader.java:204)
at org.apache.flume.source.SpoolDirectorySource$SpoolDirectoryRunnable.run(SpoolDirectorySource.java:160)
at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:441)
at java.util.concurrent.FutureTask$Sync.innerRunAndReset(FutureTask.java:317)
at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:150)
at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$101(ScheduledThreadPoolExecutor.java:98)
at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.runPeriodic(ScheduledThreadPoolExecutor.java:181)
at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:205)
at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
at java.lang.Thread.run(Thread.java:619)

這是監控的目錄(spooldir)裏面有長度爲0的文件,把他刪了或者更名加後綴.COMPLETE,再重啓flume(有時候直接重啓也沒問題了),貌似是一個bug來的https://issues.apache.org/jira/browse/FLUME-1934

相關文章
相關標籤/搜索