获得"文件不存在"运行一个Amazon EMR作业时出错作业、不存在、文件、QUOT

2023-09-11 12:23:14 作者:国际男王

我已经上传我的数据 genotype1_large_ind_large.txt phenotype1_large_ind_large_1.txt

I have uploaded my data genotype1_large_ind_large.txt phenotype1_large_ind_large_1.txt

到S3系统,并在电子病历的用户界面,我喜欢设置参数如下

to the S3 system, and in the EMR UI, I set the parameter like below

RunDear.run S3N://scalability/genotype1_large_ind_large.txt S3N://scalability/phenotype1_large_ind_large_1.txt S3N://可扩展性/ output_1phe 33 10 4

RunDear.run s3n://scalability/genotype1_large_ind_large.txt s3n://scalability/phenotype1_large_ind_large_1.txt s3n://scalability/output_1phe 33 10 4

在我的课RunDear.run我将文件genotype1_large_ind_large.txt和phenotype1_large_ind_large_1.txt分发到高速缓存

In my class RunDear.run I will distribute the file genotype1_large_ind_large.txt and phenotype1_large_ind_large_1.txt to the cache

然而,在运行电子病历后,我收到以下错误: java.io.FileNotFoundException:文件不存在:/genotype1_large_ind_large.txt

However, after running the EMR, I get the following error: java.io.FileNotFoundException: File does not exist: /genotype1_large_ind_large.txt

我想知道为什么会出现在文件名称前斜杠/? 如何使它工作?

I am wondering why there is slash '/' in front of the file name? how to make it work?

我也尝试过使用类似下面,但我的计划将采取-cacheFile作为参数,因此也不能正常工作,

I also tried to use like below, but my program will take -cacheFile as an argument, thus also does not work,

RunDear.run -cacheFile S3N://scalability/genotype1_large_ind_large.txt#genotype.txt -cacheFile S3N://scalability/phenotype1_large_ind_large_1.txt#phenotype.txt S3N://可扩展性/ output_1phe 33 280 4

RunDear.run -cacheFile s3n://scalability/genotype1_large_ind_large.txt#genotype.txt -cacheFile s3n://scalability/phenotype1_large_ind_large_1.txt#phenotype.txt s3n://scalability/output_1phe 33 280 4

推荐答案

我终于意识到这是一个使用文件系统的问题,所以我加了code在节目中像下面 文件系统FS = FileSystem.get(URI.create(S3://可扩展性),CONF);

I finally realize it is the problem of using the filesystem, so I add a code in the program like below FileSystem fs = FileSystem.get( URI.create("s3://scalability"), conf);