We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Describe the bug
Semantic Dedup often gets stuck at the state when we call semantic_cluster_dedup.extract_dedup_data.
semantic_cluster_dedup.extract_dedup_data
Steps/Code to reproduce bug
Run semantic dedup when the client = get_client(device_type='gpu', protocol='ucx')
client = get_client(device_type='gpu', protocol='ucx')
Environment overview
Tried on cudf-cu12=24.8.*, cudf-cu12==24.10.a and cudf-cu12==24.12.a
cudf-cu12=24.8.*
cudf-cu12==24.10.a
cudf-cu12==24.12.a
Succeeds when protocol='tcp'
protocol='tcp'
The text was updated successfully, but these errors were encountered:
Also from a quick experiment it seems like classifiers (domain / quality) are about 30% slower when using UCX.
Sorry, something went wrong.
We should try if the PR #80 Patch Distributed UCX comms to allow configuring connect timeout (docs here) help solve this issue
VibhuJawa
No branches or pull requests
Describe the bug
Semantic Dedup often gets stuck at the state when we call
semantic_cluster_dedup.extract_dedup_data
.Steps/Code to reproduce bug
Run semantic dedup when the
client = get_client(device_type='gpu', protocol='ucx')
Environment overview
Tried on
cudf-cu12=24.8.*
,cudf-cu12==24.10.a
andcudf-cu12==24.12.a
Succeeds when
protocol='tcp'
The text was updated successfully, but these errors were encountered: