Due to storage and efficiency drawbacks with Singer data, hotglue supports automatically converting the singer output of a standard tap into Parquet, which is more digestible for transformation scripts. See intermediary formats for more information.
target-parquet is a singer target which takes in singer data and outputs the data’s corresponding parquet.When developing locally with the SingerSDK, we recommend using target-parquet to easily inspect your data.
Suppose you have a file in your local directory called data.singer that you want to convert.First enter your virtual environment:
Copy
Ask AI
source ~/env/target-parquet/bin/activate
Then simply run:
Copy
Ask AI
cat data.singer | target-parquet
If your singer data is malformed, you should see an error message explaining the issue.If the Parquet conversion succeeds, you should see a Parquet file for each stream in your singer data.