What's Changed
- fix multicategorical stype inference and add test case by @yiweny in #420
- coorectly infer boolean stypes by @yiweny in #421
- support xgboost early stopping by @yiweny in #424
- Update testing torch version by @zechengz in #428
- Update Excelformer benchmark results on small binary and regression tasks by @zechengz in #427
- update xgboost numbers by @yiweny in #425
- Update excelformer benchmark results by @zechengz in #431
- Remove CUDA synchronizations by slicing input tensor with
int
instead of CUDA tensors innn.LinearEmbeddingEncoder
by @akihironitta in #432 - Don't put assertions on N/A imputation correctness by @akihironitta in #433
- Don't create the same tensor every iteration in N/A handling by @akihironitta in #434
- chore: Update pre-commit by @akihironitta in #435
- Add benchmark results for large-scale multiclass classification task by @akihironitta in #436
- Fixed warning and added safe globals by @NeelKondapalli in #423
- fix error in xgboost by @puririshi98 in #443
- Add
is_floating_point()
to multi tensors by @akihironitta in #445 - Fix size mismatch error when
CatToNumTransform
sees only a subset of labels at test time by @akihironitta in #446 - add pytorch tabular benchmark by @yiweny in #398
- Compare more models across frame and tabular by @wsad1 in #444
- Add benchmark result from
ExcelFormer
on a large-scale multi-class classification task by @akihironitta in #447 - Fail
torch.load(weights=True)
gracefully by @akihironitta in #448 - Fix offset in LinearEmbeddingEncoder by @toenshoff in #455
- Fix docs build in CI by @akihironitta in #456
- Removing the deprecated
categorical_feature
parameter fromlightgbm.train(...)
function calls. by @drivanov in #454 - Tighten assert condition in graph break tests by @akihironitta in #458
- Update pytorch_tabular_benchmark.py by @wsad1 in #457
- Drop support for Python 3.8 by @akihironitta in #462
- [pre-commit.ci] pre-commit suggestions by @pre-commit-ci in #461
- Update benchmark numbers by @yiweny in #411
- Add support for PyTorch 2.5 by @akihironitta in #464
- Allow empty
TensorFrame
with non-zero number of rows by @rusty1s in #466 - Support index select for empty
TensorFrame
by @rusty1s in #467 - Consistent PyPI name
pytorch-frame
by @akihironitta in #468 - Raise a friendly message when a
str
is provided toTensorFrame(col_names_dict)
instead of alist[str]
by @akihironitta in #469 - Update README.md by @akihironitta in #471
- Materialize train test by @HoustonJ2013 in #472
- Add an example of training a tabular model on multiple GPUs by @akihironitta in #474
- Support
pin_memory()
inMulti{Embedding,Nested}Tensor
andTensorFrame
by @akihironitta in #437 - Run
MultiNestedTensor
tests on both CPU and GPU by @akihironitta in #476 - Optimize the
Trompt
example to reduce training time by ~30% by @akihironitta in #477 - Add dependabot and auto-merge PRs by dependabot once CI passes by @akihironitta in #478
- Bump tj-actions/changed-files from 41 to 45 by @dependabot in #479
- Bump codecov/codecov-action from 2 to 5 by @dependabot in #481
- Bump dangoslen/changelog-enforcer from 2 to 3 by @dependabot in #480
- Bump actions/labeler from 4 to 5 by @dependabot in #482
- [pre-commit.ci] pre-commit suggestions by @pre-commit-ci in #483
- Update
.pre-commit-config.yaml
weekly by @akihironitta in #484 - Fix documentation build by @akihironitta in #486
- Label bot PRs
skip-changelog
by @akihironitta in #487 - [pre-commit.ci] pre-commit suggestions by @pre-commit-ci in #485
- update version to
0.2.4
by @weihua916 in #488
New Contributors
- @NeelKondapalli made their first contribution in #423
- @puririshi98 made their first contribution in #443
- @wsad1 made their first contribution in #444
- @HoustonJ2013 made their first contribution in #472
Full Changelog: 0.2.3...0.2.4