Expected docstring: https://github.com/huggingface/transformers/blob/11b1906d5c0dae39c13270e47cc02c4cde70e548/src/transformers/models/layoutlmv2/modeling_layoutlmv2.py#L897-L901 or https://github.com/huggingface/transformers/blob/11b1906d5c0dae39c13270e47cc02c4cde70e548/src/transformers/masking_utils.py#L715-L716

transformers - ✅(Solved) Fix Wrong docstring for position_ids [2 pull requests, 3 comments, 4 participants]

RmZeta2718 · 2026-03-01T15:46:43Z



huggingface/transformers#44373 • Fetched 2026-04-08 00:28:53


Fix Action

Linked PRs (per the PR notes below, neither was merged when this page was fetched)

PR #44547: Fix position_ids docstring in modeling_flash_attention_utils.py (https://github.com/huggingface/transformers/pull/44547)
PR #44590: Fix incorrect docstring for position_ids (https://github.com/huggingface/transformers/pull/44590)

PR fix notes

PR #44547: Fix position_ids docstring in modeling_flash_attention_utils.py

Repository: huggingface/transformers
Author: mvanhorn
State: open | merged: False
Link: https://github.com/huggingface/transformers/pull/44547

Description (problem / solution / changelog)

Fixes #44373

Summary

- Corrected the docstring for `position_ids` parameter in `prepare_fa_kwargs_from_position_ids` and `_prepare_from_posids` which incorrectly described attention mask semantics ("Boolean or int tensor... 1 means valid and 0 means not valid")
- The docstring now accurately describes position indices behavior

Testing

Docstring-only change, no code behavior affected

This contribution was developed with AI assistance (Claude Code).

Changed files

src/transformers/modeling_flash_attention_utils.py (modified, +2/-2)

PR #44590: Fix incorrect docstring for position_ids

Repository: huggingface/transformers
Author: pranay-3108
State: closed | merged: False
Link: https://github.com/huggingface/transformers/pull/44590

Description (problem / solution / changelog)

Fixes incorrect documentation for position_ids in masking_utils.py.

The docstring previously described position_ids as torch.Tensor.
This PR updates it to torch.LongTensor and aligns the description with the standard wording used across the Transformers codebase.

Fixes #44373

Changed files

src/transformers/masking_utils.py (modified, +2/-2)

System Info

latest commit, see link below

Who can help?

@stevhliu

Reproduction

Wrong docstring:

https://github.com/huggingface/transformers/blob/11b1906d5c0dae39c13270e47cc02c4cde70e548/src/transformers/modeling_flash_attention_utils.py#L359-L360

https://github.com/huggingface/transformers/blob/11b1906d5c0dae39c13270e47cc02c4cde70e548/src/transformers/modeling_flash_attention_utils.py#L412-L413

The text at both locations describes attention_mask, not position_ids; see:

https://github.com/huggingface/transformers/blob/11b1906d5c0dae39c13270e47cc02c4cde70e548/src/transformers/modeling_flash_attention_utils.py#L201
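
To make the distinction in this report concrete, here is a minimal pure-Python sketch (plain lists instead of tensors) contrasting the two inputs. The helper names are hypothetical and this is not the transformers implementation. It assumes packed sequences are flattened into one non-empty list that begins at position 0, and that every new sequence restarts at position 0.

```python
from typing import List, Tuple


def cu_seqlens_from_position_ids(position_ids: List[int]) -> Tuple[List[int], int]:
    """Hypothetical helper: recover sequence boundaries from packed position ids.

    Position ids are indices (0, 1, 2, ...), so a value of 0 marks the start
    of a new sequence. It does NOT mean "not valid" as it would in a mask.
    """
    starts = [i for i, p in enumerate(position_ids) if p == 0]
    cu_seqlens = starts + [len(position_ids)]
    max_len = max(end - start for start, end in zip(cu_seqlens, cu_seqlens[1:]))
    return cu_seqlens, max_len


def valid_length_from_attention_mask(attention_mask: List[int]) -> int:
    """Hypothetical helper: in a mask, 1 means valid and 0 means not valid."""
    return sum(attention_mask)


# Three packed sequences of lengths 3, 2 and 4.
packed_position_ids = [0, 1, 2, 0, 1, 0, 1, 2, 3]
cu_seqlens, max_len = cu_seqlens_from_position_ids(packed_position_ids)

# One padded sequence with three real tokens and two padding slots.
padded_attention_mask = [1, 1, 1, 0, 0]
valid_tokens = valid_length_from_attention_mask(padded_attention_mask)
```

Reading `packed_position_ids` with mask semantics would treat every sequence start as padding, which is why the mask-style docstring is misleading on this parameter.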

Expected behavior

Expected docstring:

https://github.com/huggingface/transformers/blob/11b1906d5c0dae39c13270e47cc02c4cde70e548/src/transformers/models/layoutlmv2/modeling_layoutlmv2.py#L897-L901

https://github.com/huggingface/transformers/blob/11b1906d5c0dae39c13270e47cc02c4cde70e548/src/transformers/masking_utils.py#L715-L716

extent analysis

Fix Plan

Fix Name

Update the docstring for position_ids in modeling_flash_attention_utils.py

Steps

The stubs below are illustrative only. Their names and wording are placeholders, not functions or docstrings from transformers. Per PR #44547, the affected functions are prepare_fa_kwargs_from_position_ids and _prepare_from_posids.

1. Confirm that the misplaced text describes `attention_mask`

import torch

def describe_attention_mask(attention_mask: torch.Tensor) -> torch.Tensor:
    """
    Placeholder stub showing mask-style wording.

    Args:
        attention_mask (torch.Tensor): Mask wording as quoted in PR #44547:
            "Boolean or int tensor... 1 means valid and 0 means not valid".

    Returns:
        torch.Tensor: The attention mask, unchanged.
    """
    return attention_mask

2. Update docstring for `position_ids`

def describe_position_ids(position_ids: torch.Tensor) -> torch.Tensor:
    """
    Placeholder stub showing position-index wording.

    Args:
        position_ids (torch.Tensor): Index of each token's position within
            its sequence. This is not a validity mask.

    Returns:
        torch.Tensor: The position ids, unchanged.
    """
    return position_ids

3. Apply the `position_ids` wording at both locations linked under Reproduction (lines 359-360 and 412-413 of modeling_flash_attention_utils.py at commit 11b1906d5c0dae39c13270e47cc02c4cde70e548)

4. Use the docstrings linked under Expected behavior (modeling_layoutlmv2.py lines 897-901 and masking_utils.py lines 715-716) as the reference wording

Verification

This is a docstring-only change, so no code behavior is affected (as PR #44547 notes).
Verify that the updated docstrings describe position indices rather than mask semantics.
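
One way to automate the docstring check is sketched below, using only the standard library. The two stub functions and the checker name are hypothetical stand-ins; the searched phrase is the mask wording quoted in PR #44547. In a real checkout you would pass the actual functions instead of the stubs, assuming they are importable in your environment.

```python
import inspect

# Mask-style wording that should not appear in a position_ids docstring.
MASK_PHRASE = "1 means valid and 0 means not valid"


def stub_with_wrong_docstring(position_ids):
    """
    Hypothetical stand-in for the reported state.

    Args:
        position_ids: Mask-style text where 1 means valid and 0 means not valid.
    """
    return position_ids


def stub_with_fixed_docstring(position_ids):
    """
    Hypothetical stand-in for the corrected state.

    Args:
        position_ids: Position index of each token within its sequence.
    """
    return position_ids


def docstring_mentions_mask_semantics(func) -> bool:
    """Return True if the function's docstring contains the mask wording."""
    doc = inspect.getdoc(func) or ""
    return MASK_PHRASE in doc


wrong_flagged = docstring_mentions_mask_semantics(stub_with_wrong_docstring)
fixed_flagged = docstring_mentions_mask_semantics(stub_with_fixed_docstring)
```

A plain substring match is deliberately simple; it only catches the exact quoted phrase and would miss reworded mask descriptions.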
