This is a question related to issue #51. Placing the question here because it is general in nature. I have defined the following function:

```scala
def crossEntropy[
    I <: BFloat16 | Float32 | Float64,
    O <: NumericRealNN
](
    input: Tensor[I],
    target: Tensor[O]
): Tensor[I] =
  Tensor(
    torchNative.cross_entropy(
      input.native,
      target.native
    )
  )
```

But I noticed that the existing softmax is defined differently:

```scala
def softmax[In <: DType, Out <: DType](input: Tensor[In], dim: Long)(
    dtype: Out = input.dtype
): Tensor[Out] =
  val nativeDType =
    if dtype == input.dtype then ScalarTypeOptional() else ScalarTypeOptional(dtype.toScalarType)
  Tensor(torchNative.softmax(input.native, dim, nativeDType))
```

I was wondering if there is some rule to use when encoding the types. So my questions are:

TIA
Replies: 1 comment 1 reply
-
We want to be as specific as possible to avoid runtime errors, as many ops only support floating point inputs, for example. It's not always easy to find out though, as it is encoded in the C++ kernels. The existing softmax for instance is too generic, but the updated one in #51 (comment) only accepts floats.

One way to find out is to actually run the method with different input dtypes. If it fails, we get a nice stacktrace that often leads us to the kernel implementation where we can see the dtype restrictions; for softmax that's SoftMaxKernel.cpp. We have a way to do that more systematically via property-based tests thanks to @davoclavo, see #23 (comment). It's just not applied to all operations yet.
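As an illustration (a minimal sketch, not the actual signature from #51, and using only the type and binding names that already appear in the snippets above), narrowing the type parameter to floating point dtypes moves the failure from runtime to compile time:

```scala
// Sketch only: assumes the same imports/scope as the snippets above.
// With the dtype bound narrowed to floating point types, calling this
// with e.g. an integer tensor no longer compiles.
def softmaxFloatOnly[D <: BFloat16 | Float32 | Float64](
    input: Tensor[D],
    dim: Long
): Tensor[D] =
  // No dtype override, so pass an empty ScalarTypeOptional to the native call.
  Tensor(torchNative.softmax(input.native, dim, ScalarTypeOptional()))
```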
For these cases it's often useful to look into the Python API docs for orientation. E.g. for softmax you can explicitly override the output type via the dtype parameter. For cross_entropy, as you've said, the output type is the same as the input and there's no parameter to override it.
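Concretely, with the crossEntropy from the question, the result dtype just follows the input dtype at the type level. A small usage sketch with hypothetical names (logits and labels are assumed tensors, and Int64 is assumed to satisfy NumericRealNN):

```scala
// Hypothetical example: `logits` is a Tensor[Float32] of raw scores and
// `labels` is a Tensor[Int64] of class indices (Int64 assumed to be a
// subtype of NumericRealNN). The loss dtype matches the input dtype.
val loss: Tensor[Float32] = crossEntropy(logits, labels)
```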