2017-01-13

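Apache Beam exception when running the WordCount example

I believe I followed the documentation closely, but I still hit this exception. (The only difference is that I run it from Eclipse J2EE, but I would not expect that to matter, would it?)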

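Code: (I did not write this; it comes from the Beam project's examples.) I assume you have to specify a Google Cloud Platform project and provide proper credentials to access it. However, I cannot find anywhere in this example project where that is done.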

public static void main(String[] args) {
    // Create a PipelineOptions object. This object lets us set various execution
    // options for our pipeline, such as the runner you wish to use. This example
    // will run with the DirectRunner by default, based on the class path configured
    // in its dependencies.
    PipelineOptions options = PipelineOptionsFactory.create();

    // Create the Pipeline object with the options we defined above.
    Pipeline p = Pipeline.create(options);

    // Apply the pipeline's transforms.

    // Concept #1: Apply a root transform to the pipeline; in this case, TextIO.Read to read a set
    // of input text files. TextIO.Read returns a PCollection where each element is one line from
    // the input text (a set of Shakespeare's texts).

    // This example reads a public data set consisting of the complete works of Shakespeare.
    p.apply(TextIO.Read.from("gs://apache-beam-samples/shakespeare/*"))
    .....
}

Exception:

Exception in thread "main" java.lang.IllegalStateException: Failed to validate gs://apache-beam-samples/shakespeare/* 
at org.apache.beam.sdk.io.TextIO$Read$Bound.expand(TextIO.java:309) 
at org.apache.beam.sdk.io.TextIO$Read$Bound.expand(TextIO.java:205) 
at org.apache.beam.sdk.runners.PipelineRunner.apply(PipelineRunner.java:76) 
at org.apache.beam.runners.direct.DirectRunner.apply(DirectRunner.java:296) 
at org.apache.beam.sdk.Pipeline.applyInternal(Pipeline.java:388) 
at org.apache.beam.sdk.Pipeline.applyTransform(Pipeline.java:302) 
at org.apache.beam.sdk.values.PBegin.apply(PBegin.java:47) 
at org.apache.beam.sdk.Pipeline.apply(Pipeline.java:152) 
at google.dataflow.beam.example.MinimalWordCount.main(MinimalWordCount.java:77) 
Caused by: java.io.IOException: Unable to match files in bucket apache-beam-samples, prefix shakespeare/ against pattern shakespeare/[^/]* 
at org.apache.beam.sdk.util.GcsUtil.expand(GcsUtil.java:234) 
at org.apache.beam.sdk.util.GcsIOChannelFactory.match(GcsIOChannelFactory.java:53) 
at org.apache.beam.sdk.io.TextIO$Read$Bound.expand(TextIO.java:304) 
... 8 more 
Caused by: com.google.api.client.http.HttpResponseException: 400 Bad Request 
{ 


"error" : "invalid_grant" 
} 
    at com.google.api.client.http.HttpRequest.execute(HttpRequest.java:1070) 
    at com.google.auth.oauth2.UserCredentials.refreshAccessToken(UserCredentials.java:207) 
    at com.google.auth.oauth2.OAuth2Credentials.refresh(OAuth2Credentials.java:149) 
    at com.google.auth.oauth2.OAuth2Credentials.getRequestMetadata(OAuth2Credentials.java:135) 
    at com.google.auth.http.HttpCredentialsAdapter.initialize(HttpCredentialsAdapter.java:96) 
    at com.google.cloud.hadoop.util.ChainingHttpRequestInitializer.initialize(ChainingHttpRequestInitializer.java:52) 
    at com.google.api.client.http.HttpRequestFactory.buildRequest(HttpRequestFactory.java:93) 
    at com.google.api.client.googleapis.services.AbstractGoogleClientRequest.buildHttpRequest(AbstractGoogleClientRequest.java:300) 
    at com.google.api.client.googleapis.services.AbstractGoogleClientRequest.executeUnparsed(AbstractGoogleClientRequest.java:419) 
    at com.google.api.client.googleapis.services.AbstractGoogleClientRequest.executeUnparsed(AbstractGoogleClientRequest.java:352) 
    at com.google.api.client.googleapis.services.AbstractGoogleClientRequest.execute(AbstractGoogleClientRequest.java:469) 
    at com.google.cloud.hadoop.util.ResilientOperation$AbstractGoogleClientRequestExecutor.call(ResilientOperation.java:166) 
    at com.google.cloud.hadoop.util.ResilientOperation.retry(ResilientOperation.java:66) 
    at com.google.cloud.hadoop.util.ResilientOperation.retry(ResilientOperation.java:103) 
    at org.apache.beam.sdk.util.GcsUtil.expand(GcsUtil.java:227) 
    ... 10 more 
Please post your code.

The code is there. – foxwendy

Did you authenticate with 'gcloud' on the command line? https://cloud.google.com/dataflow/security-and-permissions
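Following up on the gcloud suggestion: the trace ends in UserCredentials.refreshAccessToken returning "invalid_grant", which suggests that stored user credentials were found but their refresh token was rejected. Below is a minimal sketch for checking which credentials file is likely being picked up. It assumes the usual Application Default Credentials lookup (the GOOGLE_APPLICATION_CREDENTIALS environment variable first, then the gcloud well-known file). The class and method names are hypothetical and not part of Beam, and overrides such as CLOUDSDK_CONFIG are not handled.

```java
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.Locale;
import java.util.Map;

/** Hypothetical diagnostic helper; not part of Beam or the Google auth library. */
class AdcLocator {

    private static final String WELL_KNOWN_FILE = "application_default_credentials.json";

    /**
     * Returns the file that Application Default Credentials are expected to be
     * read from, assuming the usual lookup order (env variable, then gcloud file).
     */
    static Path expectedCredentialsPath(Map<String, String> env, String osName, String userHome) {
        // 1. An explicit path in the environment takes precedence.
        String explicit = env.get("GOOGLE_APPLICATION_CREDENTIALS");
        if (explicit != null && !explicit.isEmpty()) {
            return Paths.get(explicit);
        }
        // 2. On Windows, gcloud keeps its files under %APPDATA%\gcloud.
        String appData = env.get("APPDATA");
        if (osName.toLowerCase(Locale.ROOT).contains("windows") && appData != null) {
            return Paths.get(appData, "gcloud", WELL_KNOWN_FILE);
        }
        // 3. Elsewhere, gcloud keeps them under ~/.config/gcloud.
        return Paths.get(userHome, ".config", "gcloud", WELL_KNOWN_FILE);
    }

    public static void main(String[] args) {
        Path path = expectedCredentialsPath(
                System.getenv(),
                System.getProperty("os.name"),
                System.getProperty("user.home"));
        System.out.println("Expected credentials file: " + path);
        System.out.println("Exists: " + Files.exists(path));
    }
}
```

If the file exists and the refresh still fails with invalid_grant, re-authenticating with gcloud, as the comment above suggests, is the likely next step.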

Answer

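If you are on Windows, try running it from the Command Prompt. Go to the folder that contains the pom.xml file and open cmd there. Then run the command with the appropriate arguments.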

mvn compile exec:java -Dexec.mainClass=org.apache.beam.examples.WordCount -Dexec.args="--output=counts" -Pdirect-runner

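If you want to run it on your own input file, create a .txt file with any name and put it in the folder that contains the pom. Then run the following command.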

mvn compile exec:java -Dexec.mainClass=org.apache.beam.examples.WordCount -Dexec.args="--inputFile=YOURFILENAME.txt --output=counts" -Pdirect-runner

Hope this works. As for the rest, I am still looking into your problem.
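The local-file approach in this answer could also be applied to the hard-coded path in the question's code. The sketch below, in plain Java with no Beam dependency, shows one way to pick the input from an --inputFile argument and fall back to the public bucket otherwise. InputChooser and its methods are hypothetical names for illustration, not part of the Beam examples.

```java
/** Hypothetical helper; not part of the Beam examples. */
class InputChooser {

    // Public sample location used by the example in the question.
    static final String DEFAULT_INPUT = "gs://apache-beam-samples/shakespeare/*";

    private static final String INPUT_FLAG = "--inputFile=";

    /** Returns the value of --inputFile=... if present, otherwise the GCS default. */
    static String chooseInput(String[] args) {
        for (String arg : args) {
            if (arg.startsWith(INPUT_FLAG)) {
                return arg.substring(INPUT_FLAG.length());
            }
        }
        return DEFAULT_INPUT;
    }

    /** True when the input is a gs:// location and would be read through Google Cloud Storage. */
    static boolean readsFromGcs(String input) {
        return input.startsWith("gs://");
    }

    public static void main(String[] args) {
        String input = chooseInput(args);
        System.out.println("Input: " + input);
        System.out.println("Reads from GCS: " + readsFromGcs(input));
        // In the pipeline from the question, this value would replace the literal path:
        // p.apply(TextIO.Read.from(input))
    }
}
```

With a local input, the read does not go through the GCS code path that appears in the stack trace, so the credential refresh that failed there should not be triggered for the input.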
