Chore: Add PyAirbyte performance profiling option without caching #323
Walkthrough
The recent changes enhance the caching functionality in the PyAirbyte throughput tests by introducing a command-line option to disable caching.
Adds the ability to quickly profile performance using the helper script in the examples directory.

The previous version (with cache) tests our max throughput from Source to Cache (approx. 50K records/second, or ~10 MB/s):

    poetry run python ./examples/run_perf_test_reads.py -e=5

The new version, with the --no-cache flag, can test the max throughput from Source to (No-Op) Destination (approx. 115K records/second, or ~21 MB/s):

    poetry run python ./examples/run_perf_test_reads.py -e=5 --destination=e2e --no-cache

Caveats/notes:
Summary by CodeRabbit
New Features
--no-cache command-line argument, allowing users to disable caching during throughput tests.
Bug Fixes
These changes improve the flexibility and usability of performance testing scripts.
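
To put the two throughput figures quoted in the description side by side, here is a rough back-of-the-envelope sketch. It uses only the approximate numbers from this PR (50K records/s at ~10 MB/s with cache, 115K records/s at ~21 MB/s with --no-cache). The helper names are hypothetical, and treating 1 MB as 1,000,000 bytes is an assumption, since the PR does not say which convention it uses.

```python
# Approximate figures quoted in the PR description (not re-measured here).
CACHED = {"records_per_sec": 50_000, "mb_per_sec": 10.0}     # Source -> Cache
NO_CACHE = {"records_per_sec": 115_000, "mb_per_sec": 21.0}  # Source -> no-op Destination

# Assumption: 1 MB = 1,000,000 bytes; the PR does not state the convention.
BYTES_PER_MB = 1_000_000


def bytes_per_record(run: dict) -> float:
    """Average record size implied by a run's MB/s and records/s figures."""
    return run["mb_per_sec"] * BYTES_PER_MB / run["records_per_sec"]


def speedup(baseline: dict, candidate: dict) -> float:
    """Ratio of candidate to baseline throughput, in records per second."""
    return candidate["records_per_sec"] / baseline["records_per_sec"]


if __name__ == "__main__":
    print(f"cached:   ~{bytes_per_record(CACHED):.0f} bytes/record")
    print(f"no-cache: ~{bytes_per_record(NO_CACHE):.0f} bytes/record")
    print(f"speedup:  ~{speedup(CACHED, NO_CACHE):.1f}x records/second")
```

Under these assumptions, both runs imply records of roughly 200 bytes each, and the no-cache path moves a little over twice as many records per second. Since the quoted figures are approximate, the ratio is only indicative.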