DataEngineerTest

sbt
run {s3_input_file_path} {s3_output_file_path} {aws_profile_name}
{aws_profile_name} is the name of a profile set up in your ~/.aws/credentials file.
Example command: run s3a://spark-test-files/input-files s3a://spark-test-files/output-files/ default

input_file_path
output_file_path
local or {aws_profile_name} => if the third argument is local, files are read from and written to the local file system; otherwise the accessKey and secretKey are read from the named profile.
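
The three arguments described above could be interpreted roughly as in the sketch below. This is an illustration only, not the project's actual code: the names StorageMode, LocalFs, S3Profile and JobArgs are hypothetical, and the exact, case-sensitive match on "local" is an assumption. The sketch covers argument handling only; loading the keys from the profile and passing them to Spark are left out.

```scala
// Hypothetical sketch: none of these names come from the project source.
sealed trait StorageMode
case object LocalFs extends StorageMode
final case class S3Profile(profileName: String) extends StorageMode

final case class JobArgs(inputPath: String, outputPath: String, mode: StorageMode)

object JobArgs {
  // Expects exactly three arguments: input path, output path,
  // and either the literal "local" or an AWS profile name.
  def parse(args: Array[String]): Either[String, JobArgs] = args match {
    case Array(input, output, third) =>
      // Assumption: "local" is matched exactly; any other value is
      // treated as a profile name from ~/.aws/credentials.
      val mode = if (third == "local") LocalFs else S3Profile(third)
      Right(JobArgs(input, output, mode))
    case _ =>
      Left("usage: run {input_file_path} {output_file_path} {local | aws_profile_name}")
  }
}
```

With the example command above, JobArgs.parse would return an S3Profile("default") mode, while passing local as the third argument would return LocalFs.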

sbt assembly
Once the assembly completes, it creates a jar file at target/scala-2.13/DataEngineerTest-assembly-0.1.0-SNAPSHOT.jar
You can use this jar file to run the job on an EMR cluster or on your own Spark cluster.
