-
-

Full Publications/Events (50)

+
+

Full Publications/Events (51)

-

2024 (14)

+

2024 (15)

    -
  • Blog published on Intel information: 解决方案为最新Meta Llama 3.1模型提供加速

  • -
  • Blog published on Intel Developer News: Intel AI Solutions Boost LLMs: Unleashing the Power of Meta* Llama 3.1

  • -
  • Blog published on digit.in: AI hallucination in LLM and beyond: Will it ever be fixed?

  • +
  • Blog published on Intel Developer News: Intel AI Solutions Support the New Llama 3.2 Models (Sep 2024)

  • +
  • Blog published on Intel information: 解决方案为最新Meta Llama 3.1模型提供加速 (July 2024)

  • +
  • Blog published on Intel Developer News: [Intel AI Solutions Boost LLMs: Unleashing the Power of Meta* Llama 3.1] (https://www.intel.com/content/www/us/en/developer/articles/technical/intel-ai-solutions-support-meta-llama-3-1-launch.html) (July 2024)

  • +
  • Blog published on digit.in: AI hallucination in LLM and beyond: Will it ever be fixed? (July 2024)

  • Blog published on Medium: Accelerating Qwen2 Models with Intel Extension for Transformers (June 2024)

  • Blog published on Huggingface: Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon (May 2024)

  • Blog published on Intel Developer News: Efficient Natural Language Embedding Models with Intel® Extension for Transformers (May 2024)

  • @@ -181,7 +182,7 @@

    2021 (1)Sphinx using a theme provided by Read the Docs. - +

    diff --git a/latest/docs/qbits.html b/latest/docs/qbits.html index fba5cf5fcc0..ef83bfd5c5b 100644 --- a/latest/docs/qbits.html +++ b/latest/docs/qbits.html @@ -4,7 +4,7 @@ - QBits — Intel® Extension for Transformers 0.1.dev1+g408e5f1 documentation + QBits — Intel® Extension for Transformers 0.1.dev1+ge6cbde1 documentation @@ -205,7 +205,7 @@

    Pytorch version constrainSphinx using a theme provided by Read the Docs. - +

    diff --git a/latest/docs/qloracpu.html b/latest/docs/qloracpu.html index 76247a96d30..e27fbc2a8fb 100644 --- a/latest/docs/qloracpu.html +++ b/latest/docs/qloracpu.html @@ -4,7 +4,7 @@ - QLoRA on CPU — Intel® Extension for Transformers 0.1.dev1+g408e5f1 documentation + QLoRA on CPU — Intel® Extension for Transformers 0.1.dev1+ge6cbde1 documentation @@ -171,7 +171,7 @@

    Neural Chat ExampleSphinx using a theme provided by Read the Docs. - +

    diff --git a/latest/docs/quantization.html b/latest/docs/quantization.html index 3851f552579..fb1eccc4c53 100644 --- a/latest/docs/quantization.html +++ b/latest/docs/quantization.html @@ -4,7 +4,7 @@ - Quantization — Intel® Extension for Transformers 0.1.dev1+g408e5f1 documentation + Quantization — Intel® Extension for Transformers 0.1.dev1+ge6cbde1 documentation @@ -314,7 +314,7 @@

    Quantization with TrainerSphinx using a theme provided by Read the Docs. - +

    diff --git a/latest/docs/release.html b/latest/docs/release.html index 0a9930c9cc5..15a016d4322 100644 --- a/latest/docs/release.html +++ b/latest/docs/release.html @@ -4,7 +4,7 @@ - Release — Intel® Extension for Transformers 0.1.dev1+g408e5f1 documentation + Release — Intel® Extension for Transformers 0.1.dev1+ge6cbde1 documentation @@ -126,7 +126,7 @@

    Release NotesSphinx using a theme provided by Read the Docs. - +

    diff --git a/latest/docs/release_data.html b/latest/docs/release_data.html index a4586fcbf3e..fcd3f712c9f 100644 --- a/latest/docs/release_data.html +++ b/latest/docs/release_data.html @@ -4,7 +4,7 @@ - Validated Model Performance — Intel® Extension for Transformers 0.1.dev1+g408e5f1 documentation + Validated Model Performance — Intel® Extension for Transformers 0.1.dev1+ge6cbde1 documentation @@ -2811,7 +2811,7 @@

    LLM FinetuningSphinx using a theme provided by Read the Docs. - +

    diff --git a/latest/docs/reproduce/efficient_LLM_inference_on_cpus.html b/latest/docs/reproduce/efficient_LLM_inference_on_cpus.html index 88afa6b864b..4023550a105 100644 --- a/latest/docs/reproduce/efficient_LLM_inference_on_cpus.html +++ b/latest/docs/reproduce/efficient_LLM_inference_on_cpus.html @@ -4,7 +4,7 @@ - Efficient LLM Inference on CPUs — Intel® Extension for Transformers 0.1.dev1+g408e5f1 documentation + Efficient LLM Inference on CPUs — Intel® Extension for Transformers 0.1.dev1+ge6cbde1 documentation @@ -202,7 +202,7 @@

    INT4 AccuracySphinx using a theme provided by Read the Docs. - +

    diff --git a/latest/docs/reproduce/neural_chat_v3-3_workflow.html b/latest/docs/reproduce/neural_chat_v3-3_workflow.html index bfc629c04b5..9cb4f220ea6 100644 --- a/latest/docs/reproduce/neural_chat_v3-3_workflow.html +++ b/latest/docs/reproduce/neural_chat_v3-3_workflow.html @@ -4,7 +4,7 @@ - Step-by-Step — Intel® Extension for Transformers 0.1.dev1+g408e5f1 documentation + Step-by-Step — Intel® Extension for Transformers 0.1.dev1+ge6cbde1 documentation @@ -200,7 +200,7 @@

    FP32 AccuracySphinx using a theme provided by Read the Docs. - +

    diff --git a/latest/docs/smoothquant.html b/latest/docs/smoothquant.html index 11b6b5c6fd6..deb374b8482 100644 --- a/latest/docs/smoothquant.html +++ b/latest/docs/smoothquant.html @@ -4,7 +4,7 @@ - Smooth Quant — Intel® Extension for Transformers 0.1.dev1+g408e5f1 documentation + Smooth Quant — Intel® Extension for Transformers 0.1.dev1+ge6cbde1 documentation @@ -416,7 +416,7 @@

    Supported Framework MatrixSphinx using a theme provided by Read the Docs. - +

    diff --git a/latest/docs/streamingllm.html b/latest/docs/streamingllm.html index dc0beac4551..45d631f5a49 100644 --- a/latest/docs/streamingllm.html +++ b/latest/docs/streamingllm.html @@ -4,7 +4,7 @@ - Streaming LLM — Intel® Extension for Transformers 0.1.dev1+g408e5f1 documentation + Streaming LLM — Intel® Extension for Transformers 0.1.dev1+ge6cbde1 documentation @@ -136,7 +136,7 @@

    Example Built with Sphinx using a theme provided by Read the Docs. - +

    diff --git a/latest/docs/tutorials/README.html b/latest/docs/tutorials/README.html index dcb642863ed..702d18a8b48 100644 --- a/latest/docs/tutorials/README.html +++ b/latest/docs/tutorials/README.html @@ -4,7 +4,7 @@ - Tutorials — Intel® Extension for Transformers 0.1.dev1+g408e5f1 documentation + Tutorials — Intel® Extension for Transformers 0.1.dev1+ge6cbde1 documentation @@ -233,7 +233,7 @@

    TutorialsSphinx using a theme provided by Read the Docs. - +

    diff --git a/latest/docs/user_guide.html b/latest/docs/user_guide.html index 21609824437..8f320a768e1 100644 --- a/latest/docs/user_guide.html +++ b/latest/docs/user_guide.html @@ -4,7 +4,7 @@ - User Guide — Intel® Extension for Transformers 0.1.dev1+g408e5f1 documentation + User Guide — Intel® Extension for Transformers 0.1.dev1+ge6cbde1 documentation @@ -108,7 +108,7 @@

    User GuideSphinx using a theme provided by Read the Docs. - +

    diff --git a/latest/docs/weightonlyquant.html b/latest/docs/weightonlyquant.html index 668fa9129b3..bcb0ab6d1ef 100644 --- a/latest/docs/weightonlyquant.html +++ b/latest/docs/weightonlyquant.html @@ -4,7 +4,7 @@ - Weight Only Quantization (WOQ) — Intel® Extension for Transformers 0.1.dev1+g408e5f1 documentation + Weight Only Quantization (WOQ) — Intel® Extension for Transformers 0.1.dev1+ge6cbde1 documentation @@ -442,7 +442,7 @@

    Llama3 on MTLSphinx using a theme provided by Read the Docs. - +

    diff --git a/latest/example.html b/latest/example.html index 1d0b6dace0e..e7bb58080cf 100644 --- a/latest/example.html +++ b/latest/example.html @@ -4,7 +4,7 @@ - Example — Intel® Extension for Transformers 0.1.dev1+g408e5f1 documentation + Example — Intel® Extension for Transformers 0.1.dev1+ge6cbde1 documentation @@ -122,7 +122,7 @@

    Example Built with Sphinx using a theme provided by Read the Docs. - +

    diff --git a/latest/feature.html b/latest/feature.html index 522dfc72065..cc5dbf6afe3 100644 --- a/latest/feature.html +++ b/latest/feature.html @@ -4,7 +4,7 @@ - Features — Intel® Extension for Transformers 0.1.dev1+g408e5f1 documentation + Features — Intel® Extension for Transformers 0.1.dev1+ge6cbde1 documentation @@ -141,7 +141,7 @@

    Features Built with Sphinx using a theme provided by Read the Docs. - +

    diff --git a/latest/genindex.html b/latest/genindex.html index 1f72036a666..d3f014a3588 100644 --- a/latest/genindex.html +++ b/latest/genindex.html @@ -3,7 +3,7 @@ - Index — Intel® Extension for Transformers 0.1.dev1+g408e5f1 documentation + Index — Intel® Extension for Transformers 0.1.dev1+ge6cbde1 documentation @@ -4916,8 +4916,6 @@

    S

    • (intel_extension_for_transformers.transformers.runtime.compile.ops.assert.Assert method) -
    • -
    • (intel_extension_for_transformers.transformers.runtime.compile.ops.baddbmm.Baddbmm method)
    • (intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul.BatchMatMul method)
    • @@ -4954,6 +4952,8 @@

      S

    • (intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_get_next.IteratorGetNext method)
    • (intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_v2.IteratorV2 method) +
    • +
    • (intel_extension_for_transformers.transformers.runtime.compile.ops.layer_normalization.LayerNorm method)
    • (intel_extension_for_transformers.transformers.runtime.compile.ops.layer_normalization.LayerNormalization method)
    • @@ -5499,7 +5499,7 @@

      Z

      Built with Sphinx using a theme provided by Read the Docs. - +

      diff --git a/latest/kernel.html b/latest/kernel.html index b54171d482c..83756c10bb4 100644 --- a/latest/kernel.html +++ b/latest/kernel.html @@ -4,7 +4,7 @@ - Kernels — Intel® Extension for Transformers 0.1.dev1+g408e5f1 documentation + Kernels — Intel® Extension for Transformers 0.1.dev1+ge6cbde1 documentation @@ -131,7 +131,7 @@

      Kernels Built with Sphinx using a theme provided by Read the Docs. - +

      diff --git a/latest/kernel_desc.html b/latest/kernel_desc.html index 0f1194fc273..ef99f17daf4 100644 --- a/latest/kernel_desc.html +++ b/latest/kernel_desc.html @@ -4,7 +4,7 @@ - Implementation Details — Intel® Extension for Transformers 0.1.dev1+g408e5f1 documentation + Implementation Details — Intel® Extension for Transformers 0.1.dev1+ge6cbde1 documentation @@ -150,7 +150,7 @@

      Implementation DetailsSphinx using a theme provided by Read the Docs. - +

      diff --git a/latest/kernel_perf.html b/latest/kernel_perf.html index 9c8b0ec5dc5..c65c8892368 100644 --- a/latest/kernel_perf.html +++ b/latest/kernel_perf.html @@ -4,7 +4,7 @@ - Performance — Intel® Extension for Transformers 0.1.dev1+g408e5f1 documentation + Performance — Intel® Extension for Transformers 0.1.dev1+ge6cbde1 documentation @@ -135,7 +135,7 @@

      PerformanceSphinx using a theme provided by Read the Docs. - +

      diff --git a/latest/neural_engine.html b/latest/neural_engine.html index a8ba44efc49..896e426544c 100644 --- a/latest/neural_engine.html +++ b/latest/neural_engine.html @@ -4,7 +4,7 @@ - Neural Engine — Intel® Extension for Transformers 0.1.dev1+g408e5f1 documentation + Neural Engine — Intel® Extension for Transformers 0.1.dev1+ge6cbde1 documentation @@ -143,7 +143,7 @@

      Neural EngineSphinx using a theme provided by Read the Docs. - +

      diff --git a/latest/objects.inv b/latest/objects.inv index e5c1685b407c0713260e20b21d37cb196648e724..d8a9b5c508e3346772c2bfc18168e578cc146446 100644 GIT binary patch delta 1120 zcmV-m1fToHsshKV0+3(^Wj14CWM#2tW-EV|_p!OuS1ewwU9te|Aqb=O0pUnH2#w(6d9B4IuL>^5~j&ANElZ>WX|KKPT6;4r>XOW$bH{48Cid} zB*y9TB{8pj*ij(JnF<2}*U2-7j>Lv8b2gNDvk}s7$Z}>wBX;7nglr*kKO#$&h%i$k z&^)5blZZrT;W2%Z5ctklLHwfA%|Gw#ljX|}K_Tk^*ODOco@$gR$Nx|~BJVMQ_m0QH zmwqSF2eE~odGQrc_|3jwvVixS$;K3S*Em<)o{zR7a!cIp?vr~hUI9;&+AT;tZD60dw}|^Zg6sc)`rhevQNHhaLHKjc zLf(4<)v4hxd9RX3r<}9+GQRlU^(a*c`jiOu}^yd z;#?FEX^ej6RL4J)=`JS$S(6bja|fykutas@e3OeWJAVq=-8)W@X*Eb>8CQd)d=~GC znP8g4J3vnYFX}uzKf^+bUSkONZ(T(!{e>R+cDzb=UQyI3djz_=WRD^4g}vVsBFMmc zK9oerGmVnciUqoO#+^QNv-V4v2y(3!i!AGE(Ue;jyz|m$nxyNs4sw-Ck~&AF(ItGg zM*LJQdw&K+`6!L*aZ|x{`cxt!!5LM@T$rZI8!)ciX3i7jOv!g-r^&O0#{EZE-*@n+ zvZeq+mp{G*7KmH+Jcb-5%B^@s+lDu>qNYP$XlRM~9@n!S{RFv|StrY~%)U65;1{Z6 z32Hb{I8U8NgfzTo-PxY2N0LNa0Kz=lLMXp|`IEjd5IHT#WLZT>C!k8GfG&{&<+p(H zJi4Oq=uV3ZZuTcLQX~wq|HSD|>^*fGK;!=Qf^|LQ3kS02h32M{2QeZbELno=6Q;>B zP8e72$aZL#eMfehI&Y}lPfo_uqwKs6lSwfm0!*Y-^t`zqSZTo5L1- zdk*66kW-VKF(pZVO?P(}uo2`-356_oN;t|_(kt>>Skcm<&a|@;qr3(1%%rX-W@Ibd za|rY`#75f%h-zhf10PBv(z1+_lBMT&vnT7ktio-llL|5$2Pni~YIN-}L^cN`J` delta 1107 zcmV-Z1g!hVsshHU0+3(^G%z@2HD1_~o5t5hYm-4I2N+6N-$W}ZveVT0LgatG z@0pA&TN2}R`I4AdKI|wES>E_XQ_Q~>PhoF#kfNMz*cuzG-l;eLW9`*N_ zzg&~s>EiyBxMNj?z%;Il7A5?#KqWHu1e7c_V_}vSb`PgR? zz9011{ag0PKgIiH%Dtr#J#tIjQSXy>EnWeElhrLq0Kv6um+;9G@iV9F2@U0gCaQmjF$LG@Q;CQKXH?x* zVVW*)z_@aoIZu!?CEt;qCeIcc_a9w--@&8GngR%2{`eMHAa2?77;=~>x8fCT8{Wi< znhtrPp(W;fT+jBq6XaTEoh-{T`(o4nFI1cMYdBCiPn}1EG`wfs*`7K_l0;ho!aUkS zD8GIA@;{%x-2FrTK$T5(WawBZ*Zc)&W+Qf>@*9LN--h=EdY~lF&P%KGO03bYo9K^wg^d^!xnsdO5pAYP?Mc8B}tu3 zcX!v55#&n=g)DbUILcSj3*=f@(bA#Lw6hVTyd&?-q^>7sWb3|j2=q0?M%x95YTb7O zA4($9vW$|FrRVp0C+obd!d;+~3o;uA3t&8p`oh?gEix()S0U3S7ywY`7eHiDA-gHO zlV>t^520OU7_uBH11b0E8?@(klk75185?i(AX|T=I~G}gr~*WnM-?a~x!8yNdfAgx zGe`pa9h03iG7i0xDAiKqWI2|a&o1Dklif2S4(|k|%9^cxy3E-k?C&>|4K$qs^re%m ZG$R7{&a=@p*F*t|v#EKS8w9g{Cjx}F8_56w diff --git a/latest/py-modindex.html b/latest/py-modindex.html index c75ab30cb70..66174435741 100644 --- a/latest/py-modindex.html +++ b/latest/py-modindex.html @@ -3,7 +3,7 @@ - Python Module Index — Intel® Extension for Transformers 0.1.dev1+g408e5f1 documentation + Python Module Index — Intel® Extension for Transformers 0.1.dev1+ge6cbde1 documentation @@ -1506,7 +1506,7 @@

      Python Module Index

      Built with Sphinx using a theme provided by Read the Docs. - +

      diff --git a/latest/search.html b/latest/search.html index 0758058b4d0..a372f7101a1 100644 --- a/latest/search.html +++ b/latest/search.html @@ -3,7 +3,7 @@ - Search — Intel® Extension for Transformers 0.1.dev1+g408e5f1 documentation + Search — Intel® Extension for Transformers 0.1.dev1+ge6cbde1 documentation @@ -125,7 +125,7 @@ Built with Sphinx using a theme provided by Read the Docs. - +

      diff --git a/latest/searchindex.js b/latest/searchindex.js index 5e08007e21a..773695ee75b 100644 --- a/latest/searchindex.js +++ b/latest/searchindex.js @@ -1 +1 @@ -Search.setIndex({"alltitles": {"1 Setup Environment": [[361, "setup-environment"]], "1. Accuracies $\\uparrow$ across 11 tasks(0-shot) of LLaMA and Mistral models at W4G-1.": [[288, "accuracies-uparrow-across-11-tasks-0-shot-of-llama-and-mistral-models-at-w4g-1"]], "1. Add *.h of the customized operator to executor/include/operators": [[394, "add-h-of-the-customized-operator-to-executor-include-operators"]], "1. Architecture": [[388, "architecture"]], "1. Docker Image Setup": [[315, "docker-image-setup"]], "1. Download the Workflow Repository": [[354, "download-the-workflow-repository"]], "1. Download the pre-finetuned VITS model": [[353, "download-the-pre-finetuned-vits-model"]], "1. Environment": [[346, "environment"], [352, "environment"]], "1. Environment\u200b": [[348, "environment"], [349, "environment"]], "1. Install requirements": [[353, "install-requirements"]], "1. Introduction": [[376, "introduction"]], "1. Prepare Dataset": [[314, "prepare-dataset"]], "1. Prepare the data": [[353, "prepare-the-data"]], "1. Prepare the sparse model": [[412, "prepare-the-sparse-model"]], "1. Prerequisites": [[302, "prerequisites"]], "1. Quantization": [[427, "quantization"]], "1. Setup Environment": [[362, "setup-environment"]], "1. Single Card Fine-tuning": [[349, "single-card-fine-tuning"]], "1. Single Card Fine-tuning in Habana DL1": [[349, "single-card-fine-tuning-in-habana-dl1"]], "1. Single Node Fine-tuning in Xeon SPR": [[314, "single-node-fine-tuning-in-xeon-spr"]], "1. Single Node Fine-tuning in Xeon SPR": [[349, "single-node-fine-tuning-in-xeon-spr"]], "1. To get the tuned model and its accuracy:": [[393, "to-get-the-tuned-model-and-its-accuracy"]], "1.1 Install intel-extension-for-transformers": [[361, "install-intel-extension-for-transformers"], [362, "install-intel-extension-for-transformers"]], "1.2 Install neural-chat and retrieval dependency": [[362, "install-neural-chat-and-retrieval-dependency"]], "1.2 Install neural-chat dependency": [[361, "install-neural-chat-dependency"]], "1.2.1 CPU Platform": [[361, "cpu-platform"]], "1.2.2 GPU Platform": [[361, "gpu-platform"]], "2 Run the chatbot in command mode": [[361, "run-the-chatbot-in-command-mode"]], "2 samples regarding Eiffel Tower": [[377, "samples-regarding-eiffel-tower"], [377, "id1"]], "2 samples regarding prime minister of the United Kingdom": [[377, "samples-regarding-prime-minister-of-the-united-kingdom"], [377, "id2"]], "2. Accuracies $\\uparrow$ across 11 tasks(0-shot) of LLaMA and Mistral models at W4G128.": [[288, "accuracies-uparrow-across-11-tasks-0-shot-of-llama-and-mistral-models-at-w4g128"]], "2. Add *.cpp of the customized operator to executor/src/operators": [[394, "add-cpp-of-the-customized-operator-to-executor-src-operators"]], "2. Create Docker Container": [[315, "create-docker-container"]], "2. Create environment and install software packages": [[354, "create-environment-and-install-software-packages"]], "2. Deploy a TF/ONNX model using Engine inference": [[388, "deploy-a-tf-onnx-model-using-engine-inference"]], "2. Do finetuning of the Shanghainese Audio -> Shanghainese text ASR model": [[353, "do-finetuning-of-the-shanghainese-audio-shanghainese-text-asr-model"]], "2. Inference": [[427, "inference"]], "2. Installation": [[302, "installation"]], "2. Multi Card Fine-tuning in Habana DL1": [[349, "multi-card-fine-tuning-in-habana-dl1"]], "2. Multi-node Fine-tuning in Xeon SPR": [[314, "multi-node-fine-tuning-in-xeon-spr"], [349, "multi-node-fine-tuning-in-xeon-spr"]], "2. Prepare Docker Image": [[314, "prepare-docker-image"]], "2. Prepare reference dataset": [[346, "prepare-reference-dataset"], [352, "prepare-reference-dataset"]], "2. Prepare the Model": [[348, "prepare-the-model"], [349, "prepare-the-model"]], "2. Requirements": [[376, "requirements"]], "2. Run below commands": [[412, "run-below-commands"]], "2. Run the RAG in command mode": [[362, "run-the-rag-in-command-mode"]], "2. To get the benchmark of tuned model:": [[393, "to-get-the-benchmark-of-tuned-model"]], "2.1 Build Docker Image": [[314, "build-docker-image"]], "2.1 Install from PyPi": [[302, "install-from-pypi"]], "2.2 Docker Pull from Docker Hub": [[314, "docker-pull-from-docker-hub"]], "2.2 Install from Conda": [[302, "install-from-conda"]], "2.3 Install from Source": [[302, "install-from-source"]], "2021 (1)": [[420, "id4"]], "2022 (5)": [[420, "id3"]], "2023 (34)": [[420, "id2"]], "2024 (14)": [[420, "id1"]], "3. Accuracies $\\uparrow$ across 11 tasks(0-shot) of LLaMA and Mistral models at W3G128.": [[288, "accuracies-uparrow-across-11-tasks-0-shot-of-llama-and-mistral-models-at-w3g128"]], "3. Accuracy": [[427, "accuracy"]], "3. Analysis results": [[412, "analysis-results"]], "3. Create Docker Container": [[314, "create-docker-container"]], "3. Do finetuning of the Shanghainese text -> Mandarian text translation model": [[353, "do-finetuning-of-the-shanghainese-text-mandarian-text-translation-model"]], "3. Finetune the Mandarian text -> Shanghainese text translation model": [[353, "finetune-the-mandarian-text-shanghainese-text-translation-model"]], "3. How To Run": [[302, "how-to-run"]], "3. Manual customized yaml and weight binary to use Engine inference": [[388, "manual-customized-yaml-and-weight-binary-to-use-engine-inference"]], "3. Multi-node Fine-tuning in AWS m7i SPR instances": [[349, "multi-node-fine-tuning-in-aws-m7i-spr-instances"]], "3. Prepare Dataset": [[348, "prepare-dataset"], [349, "prepare-dataset"]], "3. Prepare dataset": [[354, "prepare-dataset"]], "3. Run chatbot in server mode with UI": [[361, "run-chatbot-in-server-mode-with-ui"]], "3. Simple Test using Docker Container": [[315, "simple-test-using-docker-container"]], "3. Single Node Fine-tuning in Habana DL1": [[314, "single-node-fine-tuning-in-habana-dl1"]], "3. Supervised Fine-tuning (SFT)": [[352, "supervised-fine-tuning-sft"]], "3. Training": [[346, "training"]], "3. Training Data Construction": [[376, "training-data-construction"]], "3.1 Install Requirements": [[302, "install-requirements"]], "3.1 Start the service": [[361, "start-the-service"]], "3.1.1 Verify the client connection to server is OK.": [[361, "verify-the-client-connection-to-server-is-ok"]], "3.1.2 Test request command at client side": [[361, "test-request-command-at-client-side"]], "3.2 Prepare Datasets": [[302, "prepare-datasets"]], "3.2 Set up Server mode UI": [[361, "set-up-server-mode-ui"]], "3.3 Model Compression": [[302, "model-compression"]], "3.3 Start the web service": [[361, "start-the-web-service"]], "3.4 Model Inference": [[302, "model-inference"]], "3D Inference": [[399, "d-inference"]], "4. Accuracies $\\uparrow$ across 11 tasks(0-shot) of LLaMA and Mistral models at W2G128.": [[288, "accuracies-uparrow-across-11-tasks-0-shot-of-llama-and-mistral-models-at-w2g128"]], "4. Evaluation": [[346, "evaluation"]], "4. Integrate Neural Engine as Backend": [[388, "integrate-neural-engine-as-backend"]], "4. Reward / preference modeling (RM) Fine-tuning": [[352, "reward-preference-modeling-rm-fine-tuning"]], "4. Simple Test using Docker Container": [[314, "simple-test-using-docker-container"]], "4. Training Example": [[376, "training-example"]], "5. Evaluation": [[376, "evaluation"]], "5. Reinforcement Fine-tuning": [[352, "reinforcement-fine-tuning"]], "6. Verified Models": [[376, "verified-models"]], "API": [[273, "api"]], "API reference for users": [[398, "api-reference-for-users"]], "API usage": [[305, "api-usage"]], "ASR": [[353, "asr"], [353, "id1"], [353, "id3"]], "Access retrieval service": [[375, "access-retrieval-service"]], "Access text chat service": [[375, "access-text-chat-service"]], "Access the Server using the RESTful API": [[321, "access-the-server-using-the-restful-api"]], "Access the Service": [[309, "access-the-service"]], "Access voice chat service": [[375, "access-voice-chat-service"]], "Accuracy Aware Tuning": [[423, "accuracy-aware-tuning"]], "Acknowledgements": [[353, "acknowledgements"], [359, "acknowledgements"], [360, "acknowledgements"], [374, "acknowledgements"]], "Add Customized Pattern": [[387, "add-customized-pattern"]], "Add one AWS inbound rule for distributed training": [[349, "add-one-aws-inbound-rule-for-distributed-training"]], "Additional useful RESTful APIs": [[321, "additional-useful-restful-apis"]], "Advanced Topics": [[319, "advanced-topics"]], "After knowledge editing": [[377, "after-knowledge-editing"]], "Architecture of Intel\u00ae Extension for Transformers": [[287, "architecture-of-intel-extension-for-transformers"]], "Attributes": [[243, "attributes"]], "Attribution": [[298, "attribution"]], "Automatic Mixed Precision (AMP)": [[319, "automatic-mixed-precision-amp"]], "Bare Metal": [[348, "bare-metal"], [349, "bare-metal"]], "Baremetal": [[322, "baremetal"]], "Before knowledge editing": [[377, "before-knowledge-editing"]], "Benchmark": [[289, "benchmark"]], "Benchmark Output": [[289, "benchmark-output"]], "Benchmark for Kernels": [[413, "benchmark-for-kernels"]], "Binary Injectors": [[400, "binary-injectors"]], "Brief introduction for ISAs": [[403, "brief-introduction-for-isas"]], "Build": [[398, "build"], [413, "build"]], "Build Docker image with customized SSH server port from scratch": [[349, "build-docker-image-with-customized-ssh-server-port-from-scratch"]], "Build RAG (retriveval augment generation) example with Intel\u00ae Extension for Transformers neural-chat on Intel GPU": [[362, "build-rag-retriveval-augment-generation-example-with-intel-extension-for-transformers-neural-chat-on-intel-gpu"]], "Build Your Chatbot with Intel\u00ae Extension for Transformers neural-chat": [[361, "build-your-chatbot-with-intel-extension-for-transformers-neural-chat"]], "Build the chatbot and interact with the chatbot:": [[358, "build-the-chatbot-and-interact-with-the-chatbot"], [372, "build-the-chatbot-and-interact-with-the-chatbot"]], "Build the yaml and weight binary": [[388, "build-the-yaml-and-weight-binary"]], "Building RESTful API Server": [[321, "building-restful-api-server"]], "CI Introduction": [[269, "ci-introduction"], [300, "ci-introduction"]], "CPU Usage": [[361, "cpu-usage"]], "Cache Issues": [[399, "cache-issues"]], "Caching": [[319, "caching"]], "Caching Data": [[370, "caching-data"]], "Calculation": [[408, "calculation"]], "Call the audio plugin service": [[336, "call-the-audio-plugin-service"]], "Call the image2image plugin service": [[337, "call-the-image2image-plugin-service"]], "Candidate patterns": [[409, "candidate-patterns"]], "Centos 8": [[308, "centos-8"]], "Chatbot with Multimodal": [[319, "chatbot-with-multimodal"]], "Chatbot with RAG": [[319, "chatbot-with-rag"]], "ChildParentRetriever": [[372, "childparentretriever"]], "Chroma": [[372, "chroma"]], "Citation": [[415, "citation"]], "Class Kernel": [[279, "class-kernel"]], "Class engine": [[278, "class-engine"]], "Class operator_desc": [[280, "class-operator-desc"]], "Classes": [[0, "classes"], [2, "classes"], [5, "classes"], [9, "classes"], [14, "classes"], [22, "classes"], [24, "classes"], [25, "classes"], [28, "classes"], [30, "classes"], [32, "classes"], [33, "classes"], [35, "classes"], [36, "classes"], [37, "classes"], [40, "classes"], [44, "classes"], [47, "classes"], [50, "classes"], [52, "classes"], [53, "classes"], [54, "classes"], [55, "classes"], [57, "classes"], [60, "classes"], [61, "classes"], [63, "classes"], [64, "classes"], [65, "classes"], [66, "classes"], [67, "classes"], [68, "classes"], [69, "classes"], [70, "classes"], [71, "classes"], [72, "classes"], [73, "classes"], [74, "classes"], [76, "classes"], [77, "classes"], [78, "classes"], [79, "classes"], [80, "classes"], [81, "classes"], [82, "classes"], [84, "classes"], [85, "classes"], [86, "classes"], [87, "classes"], [88, "classes"], [89, "classes"], [90, "classes"], [92, "classes"], [93, "classes"], [94, "classes"], [95, "classes"], [96, "classes"], [97, "classes"], [98, "classes"], [99, "classes"], [100, "classes"], [101, "classes"], [102, "classes"], [103, "classes"], [105, "classes"], [106, "classes"], [107, "classes"], [108, "classes"], [109, "classes"], [110, "classes"], [111, "classes"], [112, "classes"], [113, "classes"], [114, "classes"], [115, "classes"], [116, "classes"], [117, "classes"], [118, "classes"], [119, "classes"], [120, "classes"], [121, "classes"], [122, "classes"], [123, "classes"], [124, "classes"], [125, "classes"], [126, "classes"], [127, "classes"], [128, "classes"], [129, "classes"], [130, "classes"], [131, "classes"], [132, "classes"], [133, "classes"], [134, "classes"], [135, "classes"], [136, "classes"], [137, "classes"], [138, "classes"], [139, "classes"], [140, "classes"], [141, "classes"], [142, "classes"], [143, "classes"], [144, "classes"], [145, "classes"], [146, "classes"], [147, "classes"], [148, "classes"], [149, "classes"], [151, "classes"], [152, "classes"], [153, "classes"], [154, "classes"], [155, "classes"], [156, "classes"], [157, "classes"], [158, "classes"], [159, "classes"], [160, "classes"], [161, "classes"], [162, "classes"], [163, "classes"], [164, "classes"], [165, "classes"], [166, "classes"], [167, "classes"], [168, "classes"], [169, "classes"], [170, "classes"], [171, "classes"], [172, "classes"], [173, "classes"], [174, "classes"], [175, "classes"], [176, "classes"], [177, "classes"], [178, "classes"], [179, "classes"], [180, "classes"], [181, "classes"], [182, "classes"], [183, "classes"], [184, "classes"], [185, "classes"], [186, "classes"], [187, "classes"], [188, "classes"], [189, "classes"], [190, "classes"], [191, "classes"], [192, "classes"], [193, "classes"], [194, "classes"], [195, "classes"], [196, "classes"], [197, "classes"], [198, "classes"], [199, "classes"], [200, "classes"], [201, "classes"], [202, "classes"], [203, "classes"], [204, "classes"], [205, "classes"], [206, "classes"], [207, "classes"], [208, "classes"], [209, "classes"], [210, "classes"], [211, "classes"], [212, "classes"], [213, "classes"], [214, "classes"], [215, "classes"], [216, "classes"], [217, "classes"], [218, "classes"], [219, "classes"], [220, "classes"], [221, "classes"], [222, "classes"], [223, "classes"], [224, "classes"], [225, "classes"], [226, "classes"], [227, "classes"], [228, "classes"], [229, "classes"], [230, "classes"], [231, "classes"], [232, "classes"], [233, "classes"], [234, "classes"], [235, "classes"], [236, "classes"], [237, "classes"], [238, "classes"], [239, "classes"], [240, "classes"], [241, "classes"], [242, "classes"], [246, "classes"], [247, "classes"], [250, "classes"], [251, "classes"], [255, "classes"], [256, "classes"], [257, "classes"], [258, "classes"], [259, "classes"], [260, "classes"], [264, "classes"]], "Code Generation": [[319, "code-generation"]], "CodeLlama": [[349, "codellama"]], "Compile": [[275, "compile"]], "Compile Examples": [[392, "compile-examples"]], "Compile Models": [[337, "compile-models"]], "Compile an ONNX model to Engine IR": [[392, "compile-an-onnx-model-to-engine-ir"]], "Compile to IR": [[392, "compile-to-ir"]], "Config": [[283, "config"]], "Configure Environment Variables": [[334, "configure-environment-variables"]], "Configure Multi-Nodes": [[332, "configure-multi-nodes"]], "Configure Multi-NumaNodes": [[332, "configure-multi-numanodes"]], "Configure Multi-node": [[330, "configure-multi-node"]], "Configure OpenAI keys": [[324, "configure-openai-keys"]], "Configure SSH between Servers": [[330, "configure-ssh-between-servers"], [332, "configure-ssh-between-servers"]], "Configure YAML": [[324, "configure-yaml"], [336, "configure-yaml"], [338, "configure-yaml"], [363, "configure-yaml"]], "Configure photoai.yaml": [[334, "configure-photoai-yaml"]], "Configure the assisted_gen.yaml": [[323, "configure-the-assisted-gen-yaml"]], "Configure the codegen.yaml": [[326, "configure-the-codegen-yaml"], [327, "configure-the-codegen-yaml"], [328, "configure-the-codegen-yaml"], [329, "configure-the-codegen-yaml"], [330, "configure-the-codegen-yaml"], [331, "configure-the-codegen-yaml"], [332, "configure-the-codegen-yaml"]], "Configure the textbot.yaml": [[343, "configure-the-textbot-yaml"], [344, "configure-the-textbot-yaml"]], "Configure the voicebot.yaml": [[340, "configure-the-voicebot-yaml"]], "Consume Chat Q&A Service": [[316, "consume-chat-q-a-service"]], "Consume Chat Service": [[316, "consume-chat-service"]], "Consume Summary Service": [[316, "consume-summary-service"]], "Consume the Service": [[317, "consume-the-service"], [318, "consume-the-service"]], "Consume the Service with Simple Test": [[313, "consume-the-service-with-simple-test"], [316, "consume-the-service-with-simple-test"]], "Consume the Services": [[363, "consume-the-services"]], "Contents:": [[434, null], [439, null]], "Contribution Guidelines": [[300, "contribution-guidelines"]], "Contribution and Legal Documentation": [[270, "contribution-and-legal-documentation"]], "Contributor Covenant Code of Conduct": [[298, "contributor-covenant-code-of-conduct"], [300, "contributor-covenant-code-of-conduct"]], "Create Docker Image for HPU": [[366, "create-docker-image-for-hpu"]], "Create Image Database": [[334, "create-image-database"]], "Create Nodes and Establish Connections": [[391, "create-nodes-and-establish-connections"]], "Create Tables": [[334, "create-tables"]], "Create an Instance of Criterion(Optional)": [[303, "create-an-instance-of-criterion-optional"]], "Create an Instance of DistillationConfig": [[303, "create-an-instance-of-distillationconfig"]], "Create an Instance of Metric": [[303, "create-an-instance-of-metric"]], "Create an Instance of Objective(Optional)": [[423, "create-an-instance-of-objective-optional"]], "Create an Instance of QuantizationConfig": [[423, "create-an-instance-of-quantizationconfig"]], "Create an instance of Metric": [[419, "create-an-instance-of-metric"]], "Create an instance of WeightPruningConfig": [[419, "create-an-instance-of-weightpruningconfig"]], "Create and activate conda environment": [[322, "create-and-activate-conda-environment"]], "Customized Operators Register": [[394, "customized-operators-register"]], "Customized endpoints of a audio-input-audio-output pipeline": [[340, "customized-endpoints-of-a-audio-input-audio-output-pipeline"]], "Customizing the NeuralChat Service": [[309, "customizing-the-neuralchat-service"]], "Dataset": [[354, "dataset"]], "Dataset related arguments": [[348, "dataset-related-arguments"], [349, "dataset-related-arguments"]], "Demo": [[353, "demo"]], "Dense Reference Deployment on Neural Engine": [[304, "dense-reference-deployment-on-neural-engine"]], "Dense and Sparse": [[402, "dense-and-sparse"]], "Dependencies Installation": [[369, "dependencies-installation"]], "Deploy NeuralChat Service": [[309, "deploy-neuralchat-service"]], "Deploy a textbot with vllm": [[367, "deploy-a-textbot-with-vllm"]], "Deploy and Integration": [[388, "deploy-and-integration"]], "Deploy it as a server": [[360, "deploy-it-as-a-server"]], "Deploy on Huggingface Space": [[345, "deploy-on-huggingface-space"], [383, "deploy-on-huggingface-space"], [384, "deploy-on-huggingface-space"]], "Deploy on your server": [[345, "deploy-on-your-server"], [383, "deploy-on-your-server"], [384, "deploy-on-your-server"]], "Design": [[393, "design"]], "Details": [[408, "details"]], "Developer\u2019s Perspective": [[400, "developer-s-perspective"]], "Developer\u2019s Perspective.": [[401, "developer-s-perspective"]], "Direct Layernorm_ba": [[406, "direct-layernorm-ba"]], "Direct Preference Optimization (DPO)": [[346, "direct-preference-optimization-dpo"], [347, "direct-preference-optimization-dpo"]], "Distill with Trainer": [[303, "distill-with-trainer"]], "Distillation": [[303, "distillation"], [303, "id1"], [304, "distillation"], [306, "distillation"]], "Do chatbot inference with Docker": [[315, "do-chatbot-inference-with-docker"]], "Do inference of the Mandarian text -> Shanghainese text translation model": [[353, "do-inference-of-the-mandarian-text-shanghainese-text-translation-model"]], "Do inference of the Shanghainese Audio -> Shanghainese text ASR model": [[353, "do-inference-of-the-shanghainese-audio-shanghainese-text-asr-model"]], "Do inference of the Shanghainese text -> Mandarian text translation model": [[353, "do-inference-of-the-shanghainese-text-mandarian-text-translation-model"]], "Do inference of the Shanghainese text -> Shanghainese audio TTS model": [[353, "do-inference-of-the-shanghainese-text-shanghainese-audio-tts-model"]], "Docker": [[322, "docker"], [348, "docker"], [349, "docker"]], "Documentation Overview and Installation": [[270, "documentation-overview-and-installation"]], "Dolly-V2-3B": [[425, "dolly-v2-3b"]], "Download Models": [[338, "download-models"], [363, "download-models"]], "Dynamic Quant Matmul": [[405, "dynamic-quant-matmul"]], "Early-Exit": [[304, "early-exit"]], "Edit Knowledge of LLMs": [[377, "edit-knowledge-of-llms"]], "Editing knowledge with 2 samples": [[377, "editing-knowledge-with-2-samples"]], "Efficient LLM Inference on CPUs": [[426, "efficient-llm-inference-on-cpus"]], "Efficient kernel": [[402, "efficient-kernel"]], "Electra": [[425, "electra"]], "Element-wise Injector": [[401, "element-wise-injector"]], "Enforcement": [[298, "enforcement"]], "Engine API": [[277, "engine-api"]], "Engine Tuning": [[390, "engine-tuning"]], "English Text-to-Speech (TTS)": [[369, "english-text-to-speech-tts"]], "Environment Setup": [[313, "environment-setup"], [316, "environment-setup"], [317, "environment-setup"], [318, "environment-setup"]], "Environment\u200b": [[377, "environment"]], "Evaluation": [[351, "evaluation"]], "Evaluation Guidelines": [[351, "evaluation-guidelines"]], "Evaluation Metrics": [[349, "evaluation-metrics"]], "Evaluation Only": [[351, "evaluation-only"]], "Example": [[290, "example"], [421, "example"], [428, "example"], [429, "example"], [433, "example"]], "Example for CPU device": [[432, "example-for-cpu-device"]], "Example for CUDA GPU device": [[432, "example-for-cuda-gpu-device"]], "Example of AutoRound on Intel GPU": [[432, "example-of-autoround-on-intel-gpu"]], "Example of Chat Q&A Service.": [[316, "example-of-chat-q-a-service"]], "Example of Chat Service.": [[316, "example-of-chat-service"]], "Example of Summary Service.": [[316, "example-of-summary-service"]], "Examples": [[289, "examples"], [304, "examples"], [305, "examples"], [385, "examples"], [413, "examples"], [413, "id1"], [413, "id2"], [413, "id3"], [413, "id4"], [413, "id5"], [413, "id6"], [413, "id7"], [413, "id8"], [413, "id9"], [413, "id10"], [413, "id11"], [418, "examples"], [422, "examples"]], "Examples For CPU AND CUDA": [[432, "examples-for-cpu-and-cuda"]], "Examples For Intel GPU": [[432, "examples-for-intel-gpu"]], "Examples:": [[417, "examples"]], "Exceptions": [[24, "exceptions"]], "Expected Output": [[302, "expected-output"]], "Export to BF16 ONNX Model": [[305, "export-to-bf16-onnx-model"]], "Export to FP32 ONNX Model": [[305, "export-to-fp32-onnx-model"]], "Export to INT8 ONNX Model": [[305, "export-to-int8-onnx-model"]], "Export to ONNX": [[305, "export-to-onnx"]], "Extract Tables From PDF File": [[359, "extract-tables-from-pdf-file"]], "FAQ": [[269, "faq"], [300, "faq"]], "FLAN-T5": [[349, "flan-t5"]], "FP32 Accuracy": [[427, "fp32-accuracy"]], "FP32 Accuracy (Baseline)": [[426, "fp32-accuracy-baseline"]], "FP32 Inference": [[427, "fp32-inference"]], "FP32 Inference (Baseline)": [[426, "fp32-inference-baseline"]], "FP32/BF16 Inference": [[371, "fp32-bf16-inference"]], "Face Animation": [[360, "face-animation"], [374, "face-animation"]], "Falcon": [[349, "falcon"]], "Falcon-7B": [[425, "falcon-7b"]], "Features": [[291, "features"], [434, "features"]], "Fine-tuning": [[319, "fine-tuning"], [354, "fine-tuning"]], "Fine-tuning and Inference": [[354, "fine-tuning-and-inference"]], "Fine-tuning on Intel Arc GPUs": [[349, "fine-tuning-on-intel-arc-gpus"]], "Finetune": [[314, "finetune"], [348, "finetune"], [349, "finetune"]], "Finetune Embedding Model on Task-Specific Datasets": [[376, "finetune-embedding-model-on-task-specific-datasets"]], "Finetuning": [[353, "finetuning"], [355, "finetuning"]], "For LLaMA2": [[349, "for-llama2"]], "For developers": [[413, "for-developers"]], "For executor backend": [[305, "for-executor-backend"]], "Framework Features": [[400, "framework-features"], [401, "framework-features"]], "Full Publications/Events (50)": [[420, "full-publications-events-50"]], "Functions": [[0, "functions"], [1, "functions"], [4, "functions"], [6, "functions"], [15, "functions"], [17, "functions"], [20, "functions"], [21, "functions"], [23, "functions"], [24, "functions"], [25, "functions"], [27, "functions"], [28, "functions"], [29, "functions"], [30, "functions"], [32, "functions"], [36, "functions"], [37, "functions"], [39, "functions"], [40, "functions"], [41, "functions"], [42, "functions"], [44, "functions"], [45, "functions"], [49, "functions"], [57, "functions"], [61, "functions"], [62, "functions"], [95, "functions"], [184, "functions"], [243, "functions"], [244, "functions"], [245, "functions"], [252, "functions"], [260, "functions"], [262, "functions"], [263, "functions"], [264, "functions"], [265, "functions"], [266, "functions"], [267, "functions"], [268, "functions"]], "Fuse Pattern and Set Attributes of New Pattern after Fusion": [[387, "fuse-pattern-and-set-attributes-of-new-pattern-after-fusion"]], "GPT-J fine-tuning and inference": [[354, "gpt-j-fine-tuning-and-inference"]], "GPT-NEOX-20B": [[425, "gpt-neox-20b"]], "GPT-j-6B": [[425, "gpt-j-6b"]], "GPU Usage": [[361, "gpu-usage"]], "General": [[300, "general"]], "Generate the Engine Graph through TF/ONNX model": [[388, "generate-the-engine-graph-through-tf-onnx-model"]], "Get Start with Metrics": [[416, "get-start-with-metrics"]], "Get Started": [[302, "get-started"], [354, "get-started"], [423, "get-started"]], "Get Started with Benchmark API": [[289, "get-started-with-benchmark-api"]], "Get Started with NeuralChat": [[322, "get-started-with-neuralchat"]], "Get the result": [[367, "get-the-result"]], "Getting Started": [[306, "getting-started"], [309, "getting-started"]], "Graph": [[276, "graph"]], "Graph Fusion": [[391, "graph-fusion"]], "Graph Tuning for Dispatching Best Graph": [[390, "graph-tuning-for-dispatching-best-graph"]], "H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models": [[307, "h2o-heavy-hitter-oracle-for-efficient-generative-inference-of-large-language-models"]], "Help": [[311, "help"], [375, "help"], [375, "id1"]], "How it Works": [[302, "how-it-works"]], "How to Turn on Op Tuning Mechanism": [[390, "how-to-turn-on-op-tuning-mechanism"]], "How to Turn on Static Compressed Buffer": [[396, "how-to-turn-on-static-compressed-buffer"]], "How to train Intel/neural-chat-7b-v3-1 on Intel Gaudi2": [[347, "how-to-train-intel-neural-chat-7b-v3-1-on-intel-gaudi2"]], "How to visualize weights distribution of sparse model": [[412, "how-to-visualize-weights-distribution-of-sparse-model"]], "INT4 Accuracy": [[426, "int4-accuracy"], [427, "int4-accuracy"]], "INT4 Inference": [[426, "int4-inference"]], "INT4 inference": [[427, "int4-inference"]], "INT8/INT4 Inference": [[371, "int8-int4-inference"]], "IPEX Model": [[289, "ipex-model"]], "Implementation Details": [[294, "implementation-details"], [437, "implementation-details"]], "Import the module and build the chatbot instance:": [[356, "import-the-module-and-build-the-chatbot-instance"]], "Import the module and set the retrieval config:": [[358, "import-the-module-and-set-the-retrieval-config"], [372, "import-the-module-and-set-the-retrieval-config"]], "Inference": [[353, "inference"], [355, "inference"]], "Inference Parameters": [[371, "inference-parameters"]], "Inference with Docker": [[319, "inference-with-docker"]], "Inference with FP32/BF16": [[371, "inference-with-fp32-bf16"]], "Inference with INT8/INT4": [[371, "inference-with-int8-int4"]], "Initializing": [[370, "initializing"]], "Inputs format": [[414, "inputs-format"]], "Install": [[386, "install"]], "Install ITREX": [[331, "install-itrex"], [332, "install-itrex"]], "Install Intel Extension for Transformers": [[308, "install-intel-extension-for-transformers"], [357, "install-intel-extension-for-transformers"], [368, "install-intel-extension-for-transformers"]], "Install Intel\u00ae Extension for Transformers* from source": [[322, "install-intel-extension-for-transformers-from-source"]], "Install Models": [[334, "install-models"]], "Install MySQL": [[334, "install-mysql"]], "Install Neural Engine binary to deploy bare metal engine": [[386, "install-neural-engine-binary-to-deploy-bare-metal-engine"]], "Install NeuralChat Python Dependencies": [[331, "install-neuralchat-python-dependencies"], [332, "install-neuralchat-python-dependencies"]], "Install Python Dependencies": [[323, "install-python-dependencies"], [330, "install-python-dependencies"]], "Install Python dependencies": [[324, "install-python-dependencies"], [326, "install-python-dependencies"], [327, "install-python-dependencies"], [328, "install-python-dependencies"], [329, "install-python-dependencies"], [331, "install-python-dependencies"], [334, "install-python-dependencies"], [336, "install-python-dependencies"], [337, "install-python-dependencies"], [338, "install-python-dependencies"], [340, "install-python-dependencies"], [343, "install-python-dependencies"], [344, "install-python-dependencies"], [345, "install-python-dependencies"], [357, "install-python-dependencies"], [363, "install-python-dependencies"], [368, "install-python-dependencies"], [383, "install-python-dependencies"], [384, "install-python-dependencies"]], "Install System Dependency": [[369, "install-system-dependency"]], "Install environment": [[393, "install-environment"]], "Install from Pypi": [[308, "install-from-pypi"]], "Install from Source": [[308, "install-from-source"]], "Install intel extension for transformers": [[327, "install-intel-extension-for-transformers"], [328, "install-intel-extension-for-transformers"], [329, "install-intel-extension-for-transformers"]], "Install numactl": [[323, "install-numactl"], [324, "install-numactl"], [330, "install-numactl"], [331, "install-numactl"], [332, "install-numactl"], [334, "install-numactl"], [336, "install-numactl"], [337, "install-numactl"], [338, "install-numactl"], [340, "install-numactl"], [343, "install-numactl"], [344, "install-numactl"], [357, "install-numactl"], [363, "install-numactl"], [368, "install-numactl"]], "Install stable version intel_extension_for_transformers from pip": [[386, "install-stable-version-intel-extension-for-transformers-from-pip"]], "Install visual cpp build tools": [[327, "install-visual-cpp-build-tools"], [328, "install-visual-cpp-build-tools"], [329, "install-visual-cpp-build-tools"]], "Installation": [[308, "installation"], [309, "installation"], [356, "installation"], [370, "installation"], [386, "installation"], [398, "installation"]], "Intel Extension for Pytorch (IPEX) examples": [[304, "intel-extension-for-pytorch-ipex-examples"]], "Intel Neural Chat Dockerfile": [[312, "intel-neural-chat-dockerfile"]], "Intel TensorFlow Examples": [[304, "intel-tensorflow-examples"]], "Intel\u00ae Extension for Transformers": [[302, "intel-extension-for-transformers"]], "Intel\u00ae Extension for Transformers: Accelerating Transformer-based Models on Intel Platforms": [[272, "intel-extension-for-transformers-accelerating-transformer-based-models-on-intel-platforms"]], "Interact with the chatbot:": [[356, "interact-with-the-chatbot"]], "Intermediate Layer Knowledge Distillation": [[303, "intermediate-layer-knowledge-distillation"]], "Introduction": [[289, "introduction"], [303, "introduction"], [305, "introduction"], [307, "introduction"], [309, "introduction"], [336, "introduction"], [337, "introduction"], [338, "introduction"], [354, "introduction"], [357, "introduction"], [358, "introduction"], [363, "introduction"], [372, "introduction"], [373, "introduction"], [387, "introduction"], [389, "introduction"], [390, "introduction"], [391, "introduction"], [392, "introduction"], [395, "introduction"], [396, "introduction"], [398, "introduction"], [400, "introduction"], [401, "introduction"], [402, "introduction"], [407, "introduction"], [412, "introduction"], [416, "introduction"], [417, "introduction"], [418, "introduction"], [419, "introduction"], [421, "introduction"], [422, "introduction"], [423, "introduction"], [428, "introduction"], [429, "introduction"], [432, "introduction"]], "Iteration Level": [[389, "iteration-level"]], "Kernel APIs": [[282, "kernel-apis"]], "Kernel details": [[405, "kernel-details"]], "Kernels": [[293, "kernels"], [436, "kernels"]], "Key Instruction": [[404, "key-instruction"]], "Knowledge Distillation": [[303, "knowledge-distillation"], [304, "knowledge-distillation"]], "LLM Carbon Calculator": [[385, "llm-carbon-calculator"]], "LLM Finetuning": [[425, "llm-finetuning"]], "LLM Quantization": [[425, "llm-quantization"]], "LLM Runtime (GGML-Compatible)": [[425, "llm-runtime-ggml-compatible"]], "LLM Runtime Inference based on Pytorch Mode": [[425, "llm-runtime-inference-based-on-pytorch-mode"]], "LLMs": [[425, "llms"]], "Langchain Extension": [[372, "langchain-extension"]], "Langchain Extension APIs": [[309, "langchain-extension-apis"]], "Launch OpenAI-compatible Service": [[309, "launch-openai-compatible-service"]], "Launch and Run the Client": [[366, "launch-and-run-the-client"]], "Launch the Triton Server": [[366, "launch-the-triton-server"]], "Learn More": [[302, "learn-more"]], "Legal Information": [[415, "legal-information"]], "Length Adaptive Transformers": [[304, "length-adaptive-transformers"]], "Levels of JSON Profiling": [[389, "levels-of-json-profiling"]], "License": [[415, "license"]], "Llama3 on MTL": [[432, "llama3-on-mtl"]], "Loops": [[404, "loops"]], "MMMU Evaluation on Gaudi2": [[350, "mmmu-evaluation-on-gaudi2"]], "MPT": [[349, "mpt"]], "MPT-7B": [[425, "mpt-7b"]], "Matmul_avx512f_p2031_p2013": [[407, "matmul-avx512f-p2031-p2013"]], "Matmul_noperm_p2031_p1302": [[407, "matmul-noperm-p2031-p1302"]], "Matmul_p2031_2013": [[407, "matmul-p2031-2013"]], "Matmul_vnni_noperm_p2013_p1302": [[407, "matmul-vnni-noperm-p2013-p1302"]], "Memory Layout in SPMM_VNNI": [[399, "memory-layout-in-spmm-vnni"]], "Merge the lora weights": [[347, "merge-the-lora-weights"]], "Metric Class Summary": [[416, "metric-class-summary"]], "Metrics": [[416, "metrics"]], "Mine Hard Negatives": [[376, "mine-hard-negatives"]], "Mistral": [[349, "mistral"]], "Model": [[284, "model"]], "Model Level": [[389, "model-level"]], "Model\u2019s output 1 after editing the knowledge": [[377, "model-s-output-1-after-editing-the-knowledge"]], "Model\u2019s output 1 before editing the knowledge": [[377, "model-s-output-1-before-editing-the-knowledge"]], "Model\u2019s output 2 after editing the knowledge": [[377, "model-s-output-2-after-editing-the-knowledge"]], "Model\u2019s output 2 before editing the knowledge": [[377, "model-s-output-2-before-editing-the-knowledge"]], "Model\u2019s output 3 after editing the knowledge": [[377, "model-s-output-3-after-editing-the-knowledge"]], "Model\u2019s output 3 before editing the knowledge": [[377, "model-s-output-3-before-editing-the-knowledge"]], "Model\u2019s output 4 after editing the knowledge": [[377, "model-s-output-4-after-editing-the-knowledge"]], "Model\u2019s output 4 before editing the knowledge": [[377, "model-s-output-4-before-editing-the-knowledge"]], "Modify hostfile": [[330, "modify-hostfile"], [332, "modify-hostfile"], [332, "id1"], [332, "id2"]], "Module Contents": [[0, "module-contents"], [1, "module-contents"], [2, "module-contents"], [4, "module-contents"], [5, "module-contents"], [6, "module-contents"], [9, "module-contents"], [14, "module-contents"], [15, "module-contents"], [17, "module-contents"], [20, "module-contents"], [21, "module-contents"], [22, "module-contents"], [23, "module-contents"], [24, "module-contents"], [25, "module-contents"], [27, "module-contents"], [28, "module-contents"], [29, "module-contents"], [30, "module-contents"], [32, "module-contents"], [33, "module-contents"], [35, "module-contents"], [36, "module-contents"], [37, "module-contents"], [39, "module-contents"], [40, "module-contents"], [41, "module-contents"], [42, "module-contents"], [44, "module-contents"], [45, "module-contents"], [47, "module-contents"], [49, "module-contents"], [50, "module-contents"], [52, "module-contents"], [53, "module-contents"], [54, "module-contents"], [55, "module-contents"], [57, "module-contents"], [60, "module-contents"], [61, "module-contents"], [62, "module-contents"], [63, "module-contents"], [64, "module-contents"], [65, "module-contents"], [66, "module-contents"], [67, "module-contents"], [68, "module-contents"], [69, "module-contents"], [70, "module-contents"], [71, "module-contents"], [72, "module-contents"], [73, "module-contents"], [74, "module-contents"], [76, "module-contents"], [77, "module-contents"], [78, "module-contents"], [79, "module-contents"], [80, "module-contents"], [81, "module-contents"], [82, "module-contents"], [84, "module-contents"], [85, "module-contents"], [86, "module-contents"], [87, "module-contents"], [88, "module-contents"], [89, "module-contents"], [90, "module-contents"], [92, "module-contents"], [93, "module-contents"], [94, "module-contents"], [95, "module-contents"], [96, "module-contents"], [97, "module-contents"], [98, "module-contents"], [99, "module-contents"], [100, "module-contents"], [101, "module-contents"], [102, "module-contents"], [103, "module-contents"], [105, "module-contents"], [106, "module-contents"], [107, "module-contents"], [108, "module-contents"], [109, "module-contents"], [110, "module-contents"], [111, "module-contents"], [112, "module-contents"], [113, "module-contents"], [114, "module-contents"], [115, "module-contents"], [116, "module-contents"], [117, "module-contents"], [118, "module-contents"], [119, "module-contents"], [120, "module-contents"], [121, "module-contents"], [122, "module-contents"], [123, "module-contents"], [124, "module-contents"], [125, "module-contents"], [126, "module-contents"], [127, "module-contents"], [128, "module-contents"], [129, "module-contents"], [130, "module-contents"], [131, "module-contents"], [132, "module-contents"], [133, "module-contents"], [134, "module-contents"], [135, "module-contents"], [136, "module-contents"], [137, "module-contents"], [138, "module-contents"], [139, "module-contents"], [140, "module-contents"], [141, "module-contents"], [142, "module-contents"], [143, "module-contents"], [144, "module-contents"], [145, "module-contents"], [146, "module-contents"], [147, "module-contents"], [148, "module-contents"], [149, "module-contents"], [151, "module-contents"], [152, "module-contents"], [153, "module-contents"], [154, "module-contents"], [155, "module-contents"], [156, "module-contents"], [157, "module-contents"], [158, "module-contents"], [159, "module-contents"], [160, "module-contents"], [161, "module-contents"], [162, "module-contents"], [163, "module-contents"], [164, "module-contents"], [165, "module-contents"], [166, "module-contents"], [167, "module-contents"], [168, "module-contents"], [169, "module-contents"], [170, "module-contents"], [171, "module-contents"], [172, "module-contents"], [173, "module-contents"], [174, "module-contents"], [175, "module-contents"], [176, "module-contents"], [177, "module-contents"], [178, "module-contents"], [179, "module-contents"], [180, "module-contents"], [181, "module-contents"], [182, "module-contents"], [183, "module-contents"], [184, "module-contents"], [185, "module-contents"], [186, "module-contents"], [187, "module-contents"], [188, "module-contents"], [189, "module-contents"], [190, "module-contents"], [191, "module-contents"], [192, "module-contents"], [193, "module-contents"], [194, "module-contents"], [195, "module-contents"], [196, "module-contents"], [197, "module-contents"], [198, "module-contents"], [199, "module-contents"], [200, "module-contents"], [201, "module-contents"], [202, "module-contents"], [203, "module-contents"], [204, "module-contents"], [205, "module-contents"], [206, "module-contents"], [207, "module-contents"], [208, "module-contents"], [209, "module-contents"], [210, "module-contents"], [211, "module-contents"], [212, "module-contents"], [213, "module-contents"], [214, "module-contents"], [215, "module-contents"], [216, "module-contents"], [217, "module-contents"], [218, "module-contents"], [219, "module-contents"], [220, "module-contents"], [221, "module-contents"], [222, "module-contents"], [223, "module-contents"], [224, "module-contents"], [225, "module-contents"], [226, "module-contents"], [227, "module-contents"], [228, "module-contents"], [229, "module-contents"], [230, "module-contents"], [231, "module-contents"], [232, "module-contents"], [233, "module-contents"], [234, "module-contents"], [235, "module-contents"], [236, "module-contents"], [237, "module-contents"], [238, "module-contents"], [239, "module-contents"], [240, "module-contents"], [241, "module-contents"], [242, "module-contents"], [243, "module-contents"], [244, "module-contents"], [246, "module-contents"], [247, "module-contents"], [250, "module-contents"], [251, "module-contents"], [252, "module-contents"], [255, "module-contents"], [256, "module-contents"], [257, "module-contents"], [258, "module-contents"], [259, "module-contents"], [260, "module-contents"], [263, "module-contents"], [264, "module-contents"], [265, "module-contents"], [266, "module-contents"], [267, "module-contents"], [268, "module-contents"]], "Module Owner Matrix": [[299, "module-owner-matrix"]], "More Options": [[396, "more-options"]], "More Tuning Options": [[390, "more-tuning-options"]], "More work per thread": [[402, "more-work-per-thread"]], "Multi Language Automatic Speech Recognition (ASR)": [[369, "multi-language-automatic-speech-recognition-asr"]], "Multi Language Text-to-Speech (TTS)": [[369, "multi-language-text-to-speech-tts"]], "Multi Thread (Thread = 4)": [[411, "multi-thread-thread-4"]], "Multi-Modal": [[350, "multi-modal"]], "Multi-card serving (optional)": [[365, "multi-card-serving-optional"]], "Multimodal APIs": [[309, "multimodal-apis"]], "Naive": [[402, "naive"]], "Neural Chat Example": [[422, "neural-chat-example"]], "Neural Engine": [[296, "neural-engine"], [439, "neural-engine"]], "Neural Engine Support Matrix": [[397, "neural-engine-support-matrix"]], "NeuralChat": [[309, "neuralchat"], [311, "neuralchat"]], "NeuralChat Client": [[375, "neuralchat-client"]], "NeuralChat Command Line": [[311, "neuralchat-command-line"]], "NeuralChat Fine-tuning": [[348, "neuralchat-fine-tuning"], [349, "neuralchat-fine-tuning"]], "NeuralChat Notebooks": [[320, "neuralchat-notebooks"]], "NeuralChat Server": [[375, "neuralchat-server"]], "NeuralChat Server Command Line": [[375, "neuralchat-server-command-line"]], "OP Tuning for Dispatching Best Kernel and Related Runtime Config": [[390, "op-tuning-for-dispatching-best-kernel-and-related-runtime-config"]], "OPT-1.3B": [[425, "opt-1-3b"]], "Objective": [[417, "objective"]], "Obtain the Necessary Information for New Pattern Construction": [[391, "obtain-the-necessary-information-for-new-pattern-construction"]], "On Habana Gaudi Environment": [[314, "on-habana-gaudi-environment"], [314, "id2"], [315, "on-habana-gaudi-environment"], [315, "id2"]], "On Nvidia GPU Environment": [[314, "on-nvidia-gpu-environment"], [314, "id3"], [315, "on-nvidia-gpu-environment"]], "On Xeon SPR Environment": [[314, "on-xeon-spr-environment"], [314, "id1"], [315, "on-xeon-spr-environment"], [315, "id1"]], "On the fly activation reordering": [[409, "on-the-fly-activation-reordering"]], "OpenAI Official SDK": [[321, "openai-official-sdk"]], "OpenAI-Compatible RESTful APIs": [[309, "openai-compatible-restful-apis"], [321, "openai-compatible-restful-apis"]], "OpenSSF Badge": [[271, "openssf-badge"], [271, "id1"]], "Operator Level": [[389, "operator-level"]], "Operator Profiling Part": [[389, "operator-profiling-part"]], "Operator Specific Types": [[281, "operator-specific-types"]], "Optimization": [[319, "optimization"]], "Optimization and Inference Documentation": [[270, "optimization-and-inference-documentation"]], "Option 1 : Build Docker image from scratch": [[348, "option-1-build-docker-image-from-scratch"], [349, "option-1-build-docker-image-from-scratch"]], "Option 1: Build Docker Image": [[315, "option-1-build-docker-image"]], "Option 2: Docker Pull from Docker Hub": [[315, "option-2-docker-pull-from-docker-hub"]], "Option 2: Pull existing Docker image": [[348, "option-2-pull-existing-docker-image"], [349, "option-2-pull-existing-docker-image"]], "Orchestrate": [[304, "orchestrate"]], "Other Functionalities Documentation": [[270, "other-functionalities-documentation"]], "Our Pledge": [[298, "our-pledge"]], "Our Responsibilities": [[298, "our-responsibilities"]], "Our Standards": [[298, "our-standards"]], "Output file": [[351, "output-file"]], "Output folder structure": [[351, "output-folder-structure"]], "Overview": [[302, "overview"]], "Package Contents": [[245, "package-contents"], [262, "package-contents"]], "Parameters": [[372, "parameters"]], "Parse Pattern Representation List": [[395, "parse-pattern-representation-list"]], "Parse and Evaluation": [[351, "parse-and-evaluation"]], "Parts of CSV Profiling": [[389, "parts-of-csv-profiling"]], "Pattern": [[403, "pattern"]], "Pattern Mapping Dict": [[391, "pattern-mapping-dict"]], "Pattern Recognize": [[395, "pattern-recognize"]], "Pattern Representation": [[395, "pattern-representation"]], "Pattern Tuning for Dispatching Best Pattern": [[390, "pattern-tuning-for-dispatching-best-pattern"]], "Performance": [[295, "performance"], [397, "performance"], [398, "performance"], [438, "performance"]], "Performance acceleration on Intel\u00ae Xeon SPR": [[358, "performance-acceleration-on-intel-xeon-spr"]], "Performance and Profiling": [[410, "performance-and-profiling"]], "Pipeline": [[418, "pipeline"]], "Pipeline Inference for Executor Backend": [[418, "pipeline-inference-for-executor-backend"]], "Pipeline Inference for INT8 Model": [[418, "pipeline-inference-for-int8-model"]], "Platform Configuration": [[411, "platform-configuration"]], "Please clone a ITREX repo to this path.": [[314, "please-clone-a-itrex-repo-to-this-path"], [315, "please-clone-a-itrex-repo-to-this-path"]], "Plugin Parameters": [[371, "plugin-parameters"]], "Plugins": [[319, "plugins"]], "Post Training Dynamic Quantization": [[423, "post-training-dynamic-quantization"]], "Post Training Static Quantization": [[423, "post-training-static-quantization"]], "Pre-compute SPMM": [[406, "pre-compute-spmm"]], "Prefetch": [[402, "prefetch"]], "Prepare Configuration File and Documents": [[313, "prepare-configuration-file-and-documents"], [316, "prepare-configuration-file-and-documents"]], "Prepare Dataset": [[393, "prepare-dataset"]], "Prepare Dependency Packages": [[432, "prepare-dependency-packages"]], "Prepare Docker Image": [[316, "prepare-docker-image"]], "Prepare Environment": [[322, "prepare-environment"], [347, "prepare-environment"], [353, "prepare-environment"], [359, "prepare-environment"], [360, "prepare-environment"], [374, "prepare-environment"], [426, "id1"]], "Prepare Models": [[359, "prepare-models"], [360, "prepare-models"]], "Prepare ONNX Model": [[392, "prepare-onnx-model"]], "Prepare ONNX model": [[393, "prepare-onnx-model"]], "Prepare Python Environment": [[337, "prepare-python-environment"]], "Prepare Stable Diffusion Models": [[337, "prepare-stable-diffusion-models"]], "Prepare data": [[350, "prepare-data"], [350, "id1"], [355, "prepare-data"]], "Prepare environment": [[426, "prepare-environment"]], "Prepare serving scripts": [[364, "prepare-serving-scripts"], [365, "prepare-serving-scripts"], [366, "prepare-serving-scripts"]], "Prepare the environment": [[367, "prepare-the-environment"]], "Preprocessing of weight matrix": [[405, "preprocessing-of-weight-matrix"]], "Prerequisite": [[393, "prerequisite"]], "Prerequisites": [[308, "prerequisites"], [386, "prerequisites"]], "Prerequisites for using dynamic quant matmul": [[405, "prerequisites-for-using-dynamic-quant-matmul"]], "Prerequisite\u200b": [[314, "prerequisite"], [348, "prerequisite"], [349, "prerequisite"], [377, "prerequisite"], [427, "prerequisite"]], "Pretraining": [[350, "pretraining"]], "Print Results": [[351, "print-results"]], "Problem Description": [[406, "problem-description"]], "Problem Statements": [[407, "problem-statements"]], "Problem description": [[408, "problem-description"]], "Profiling": [[389, "profiling"]], "Profiling API": [[389, "profiling-api"]], "Profiling Examples": [[389, "profiling-examples"]], "Prune with Trainer": [[419, "prune-with-trainer"]], "Pruning": [[304, "pruning"], [306, "pruning"], [419, "pruning"]], "Pull Request Acceptance Criteria": [[300, "pull-request-acceptance-criteria"]], "Pull Request Checklist": [[300, "pull-request-checklist"]], "Pull Request Template": [[300, "pull-request-template"]], "Python API": [[422, "python-api"]], "Python APIs": [[274, "python-apis"]], "Pytorch Script:": [[303, "pytorch-script"]], "Pytorch version constrain": [[421, "pytorch-version-constrain"]], "QBits": [[421, "qbits"]], "QLoRA on CPU": [[422, "qlora-on-cpu"]], "Qdrant": [[372, "qdrant"]], "Quantization": [[304, "quantization"], [306, "quantization"], [423, "quantization"]], "Quantization Approach": [[423, "quantization-approach"]], "Quantization Aware Training": [[423, "quantization-aware-training"]], "Quantization Fundamentals": [[423, "quantization-fundamentals"]], "Quantization with Trainer": [[423, "quantization-with-trainer"]], "Quantize a ONNX model to engine low precision/int8 IR": [[393, "quantize-a-onnx-model-to-engine-low-precision-int8-ir"]], "Quantized Length Adaptive Transformer": [[306, "quantized-length-adaptive-transformer"]], "Quick check whether the server is up": [[365, "quick-check-whether-the-server-is-up"]], "Quick test with OpenAI compatible endpoints (audio)": [[340, "quick-test-with-openai-compatible-endpoints-audio"]], "QuickStart: Intel\u00ae Extension For Transformers*: NeuralChat on 4th Generation Intel\u00ae Xeon\u00ae Scalable Processors": [[322, "quickstart-intel-extension-for-transformers-neuralchat-on-4th-generation-intel-xeon-scalable-processors"]], "RAG Mode": [[372, "rag-mode"]], "Recommended Hardware": [[302, "recommended-hardware"]], "Reference Deployment on Neural Engine": [[304, "reference-deployment-on-neural-engine"]], "Register the Nodes\u2019 Op Types": [[387, "register-the-nodes-op-types"]], "Reinforcement Learning from Human Feedback (RLHF)": [[352, "reinforcement-learning-from-human-feedback-rlhf"]], "Related models": [[353, "related-models"]], "Release": [[424, "release"]], "Release Notes": [[424, "release-notes"]], "Remove the Old Pattern and Insert the New Pattern": [[391, "remove-the-old-pattern-and-insert-the-new-pattern"]], "Reorder": [[403, "reorder"]], "Reorder beforehand": [[407, "reorder-beforehand"]], "Reordering": [[408, "reordering"]], "Report a Vulnerability": [[271, "report-a-vulnerability"]], "Result": [[377, "result"]], "Retrievers": [[309, "retrievers"], [372, "retrievers"]], "Retrieving Cached Data": [[370, "retrieving-cached-data"]], "Rich Plugins": [[309, "rich-plugins"]], "Run": [[427, "run"]], "Run Accuracy Step by Step": [[426, "run-accuracy-step-by-step"]], "Run Llava": [[351, "run-llava"]], "Run Performance Step by Step": [[426, "run-performance-step-by-step"]], "Run the AskDoc server": [[338, "run-the-askdoc-server"]], "Run the Backend Container": [[366, "run-the-backend-container"]], "Run the Code Generation Chatbot Server": [[323, "run-the-code-generation-chatbot-server"], [330, "run-the-code-generation-chatbot-server"], [331, "run-the-code-generation-chatbot-server"], [332, "run-the-code-generation-chatbot-server"], [332, "id3"]], "Run the Code Generation Chatbot server": [[326, "run-the-code-generation-chatbot-server"], [327, "run-the-code-generation-chatbot-server"], [328, "run-the-code-generation-chatbot-server"], [329, "run-the-code-generation-chatbot-server"]], "Run the Inference": [[315, "run-the-inference"]], "Run the Inference on Habana Gaudi": [[315, "run-the-inference-on-habana-gaudi"]], "Run the Inference on Xeon SPR": [[315, "run-the-inference-on-xeon-spr"]], "Run the NeuralChat server with TGI framework": [[363, "run-the-neuralchat-server-with-tgi-framework"]], "Run the PhotoAI server": [[334, "run-the-photoai-server"]], "Run the TextChat server": [[324, "run-the-textchat-server"], [343, "run-the-textchat-server"], [344, "run-the-textchat-server"]], "Run the VoiceChat server": [[340, "run-the-voicechat-server"]], "Run the audio service server": [[336, "run-the-audio-service-server"]], "Run the complete code": [[358, "run-the-complete-code"]], "Run the frontend": [[345, "run-the-frontend"], [383, "run-the-frontend"], [384, "run-the-frontend"]], "Run the image2image service server": [[337, "run-the-image2image-service-server"]], "Run the inference by Engine": [[388, "run-the-inference-by-engine"], [388, "id1"]], "Run the script to set up the environment": [[322, "run-the-script-to-set-up-the-environment"]], "Run the table extraction script": [[359, "run-the-table-extraction-script"]], "Run tuning and benchmark": [[393, "run-tuning-and-benchmark"]], "SDE": [[410, "sde"]], "SPMM_VNNI 3D Inference": [[399, "spmm-vnni-3d-inference"]], "Safety Checker": [[319, "safety-checker"]], "Same Instructions as Multi-node Fine-tuning in Xeon SPR session": [[349, "same-instructions-as-multi-node-fine-tuning-in-xeon-spr-session"]], "Scope": [[298, "scope"]], "Script:": [[419, "script"], [423, "script"]], "Search Each Straight Chain Pattern": [[395, "search-each-straight-chain-pattern"]], "Sections": [[292, "sections"], [435, "sections"]], "Security Policy": [[271, "security-policy"]], "Selected Publications/Events": [[272, "selected-publications-events"]], "Sentence 1": [[377, "sentence-1"]], "Sentence 2": [[377, "sentence-2"]], "Serving NeuralChat Text Generation with Triton Inference Server": [[364, "serving-neuralchat-text-generation-with-triton-inference-server"]], "Serving NeuralChat Text Generation with Triton Inference Server (CUDA)": [[365, "serving-neuralchat-text-generation-with-triton-inference-server-cuda"]], "Serving NeuralChat Text Generation with Triton Inference Server on HPU": [[366, "serving-neuralchat-text-generation-with-triton-inference-server-on-hpu"]], "Set the Pattern Mapping Config and Register the Pattern": [[387, "set-the-pattern-mapping-config-and-register-the-pattern"]], "Setup Conda": [[323, "setup-conda"], [324, "setup-conda"], [326, "setup-conda"], [327, "setup-conda"], [328, "setup-conda"], [329, "setup-conda"], [330, "setup-conda"], [331, "setup-conda"], [332, "setup-conda"], [334, "setup-conda"], [336, "setup-conda"], [337, "setup-conda"], [338, "setup-conda"], [340, "setup-conda"], [343, "setup-conda"], [344, "setup-conda"], [345, "setup-conda"], [357, "setup-conda"], [363, "setup-conda"], [368, "setup-conda"], [383, "setup-conda"], [384, "setup-conda"]], "Setup Database": [[334, "setup-database"]], "Setup Environment": [[334, "setup-environment"], [336, "setup-environment"], [337, "setup-environment"], [338, "setup-environment"], [357, "setup-environment"], [363, "setup-environment"], [368, "setup-environment"]], "Setup NVIDIA GPU environment": [[318, "setup-nvidia-gpu-environment"]], "Setup Xeon SPR Environment": [[313, "setup-xeon-spr-environment"], [317, "setup-xeon-spr-environment"]], "Setups": [[412, "setups"]], "Shanghainese ASR (Audio-Speech-Recognition) and TTS (Text-To-Speech) finetuning/inference": [[353, "shanghainese-asr-audio-speech-recognition-and-tts-text-to-speech-finetuning-inference"]], "Simply run the test script": [[360, "simply-run-the-test-script"]], "Single Thread": [[411, "single-thread"]], "Single-node fine-tuning": [[354, "single-node-fine-tuning"]], "Smooth Quant": [[428, "smooth-quant"]], "Sparse GEMM AMX": [[403, "sparse-gemm-amx"]], "Sparse GEMM AVX512F": [[404, "sparse-gemm-avx512f"]], "Sparse GEMM VNNI": [[409, "sparse-gemm-vnni"]], "Sparse GEMM with Layer-Normalize": [[406, "sparse-gemm-with-layer-normalize"]], "Sparse Pattern & Data Format": [[404, "sparse-pattern-data-format"]], "Sparse Ratio Setting Part": [[389, "sparse-ratio-setting-part"]], "Sparse Reference Deployment on Neural Engine": [[304, "sparse-reference-deployment-on-neural-engine"]], "Sparse acceleration": [[402, "sparse-acceleration"]], "Splice Sub-chains with the Main Chain and Remove Duplicate Results": [[395, "splice-sub-chains-with-the-main-chain-and-remove-duplicate-results"]], "Stable Diffusion": [[425, "stable-diffusion"]], "StarCoder": [[349, "starcoder"]], "StarCoder-3B": [[425, "starcoder-3b"]], "Start NeuralChat Service": [[313, "start-neuralchat-service"], [316, "start-neuralchat-service"], [317, "start-neuralchat-service"], [318, "start-neuralchat-service"]], "Start NeuralChat Text Generation Service with Docker": [[316, "start-neuralchat-text-generation-service-with-docker"]], "Start NeuralChat and Code Generation Service with Docker": [[313, "start-neuralchat-and-code-generation-service-with-docker"]], "Start NeuralChat and TGI serving with Docker": [[317, "start-neuralchat-and-tgi-serving-with-docker"]], "Start NeuralChat and vLLM serving with Docker": [[318, "start-neuralchat-and-vllm-serving-with-docker"]], "Start Triton Inference Server": [[364, "start-triton-inference-server"], [365, "start-triton-inference-server"]], "Start the server": [[375, "start-the-server"]], "Start training!": [[350, "start-training"]], "Static Compressed Buffer": [[396, "static-compressed-buffer"]], "Static MHA": [[413, "static-mha"]], "Step-by-Step": [[427, "step-by-step"]], "Stock PyTorch Examples": [[304, "stock-pytorch-examples"]], "Stock Pytorch Model": [[289, "stock-pytorch-model"]], "Streaming LLM": [[429, "streaming-llm"]], "Submodules": [[18, "submodules"], [31, "submodules"], [34, "submodules"], [46, "submodules"], [51, "submodules"], [56, "submodules"], [58, "submodules"], [59, "submodules"], [83, "submodules"], [150, "submodules"], [249, "submodules"]], "Subpackages": [[58, "subpackages"], [245, "subpackages"]], "Summary and Next Steps": [[302, "summary-and-next-steps"]], "Supervised Fine-Tuning (SFT)": [[347, "supervised-fine-tuning-sft"]], "Support": [[300, "support"], [302, "support"]], "Supported Algorithms": [[432, "supported-algorithms"]], "Supported Feature Matrix": [[423, "supported-feature-matrix"]], "Supported Framework Matrix": [[428, "supported-framework-matrix"]], "Supported Matrix": [[398, "supported-matrix"]], "Supported Metric": [[416, "supported-metric"]], "Supported Model Export Matrix": [[305, "supported-model-export-matrix"]], "Supported Models": [[309, "supported-models"]], "Supported ONNX Format": [[392, "supported-onnx-format"]], "Supported Objectives Matrix:": [[417, "supported-objectives-matrix"]], "System Requirements": [[308, "system-requirements"], [309, "system-requirements"]], "System Summary": [[426, "system-summary"]], "TTS": [[353, "tts"], [353, "id2"], [353, "id4"]], "Test": [[356, "test"], [357, "test"], [368, "test"], [398, "test"]], "Test the TextChat server": [[324, "test-the-textchat-server"]], "Text Chat": [[311, "text-chat"]], "Tile": [[402, "tile"]], "Total Profiling Part": [[389, "total-profiling-part"]], "Trademarks": [[415, "trademarks"]], "Train": [[350, "train"]], "Trainer": [[285, "trainer"]], "Training": [[350, "training"]], "Training on CPU (SPR)": [[346, "training-on-cpu-spr"]], "Training on CUDA": [[352, "training-on-cuda"], [352, "id1"], [352, "id3"]], "Training on GPU": [[346, "training-on-gpu"]], "Training on Habana": [[346, "training-on-habana"], [352, "training-on-habana"], [352, "id2"], [352, "id4"]], "Transformers-Accelerated Libraries": [[398, "transformers-accelerated-libraries"]], "Transformers-accelerated Neural Engine": [[306, "transformers-accelerated-neural-engine"]], "Transposed MHA": [[408, "transposed-mha"]], "Transposed MatMul": [[407, "transposed-matmul"]], "Tutorials": [[430, "tutorials"]], "Ubuntu 20.04/22.04": [[308, "ubuntu-20-04-22-04"]], "Usage": [[307, "usage"], [356, "usage"], [358, "usage"], [359, "usage"], [360, "usage"], [362, "usage"], [369, "usage"], [369, "id1"], [369, "id2"], [370, "usage"], [372, "usage"], [373, "usage"], [374, "usage"], [377, "usage"], [400, "usage"], [401, "usage"], [413, "usage"], [419, "usage"]], "Usages": [[385, "usages"]], "Use Triton client to send inference request": [[364, "use-triton-client-to-send-inference-request"], [365, "use-triton-client-to-send-inference-request"]], "User Guide": [[297, "user-guide"], [431, "user-guide"], [440, "user-guide"]], "User-facing API": [[286, "user-facing-api"]], "User\u2019s Perspective": [[400, "user-s-perspective"], [401, "user-s-perspective"]], "Using Curl": [[309, "using-curl"]], "Using OpenAI Client Library": [[309, "using-openai-client-library"]], "Using Python Requests Library": [[309, "using-python-requests-library"]], "Using Single NumaNode": [[332, "using-single-numanode"]], "VTune": [[410, "vtune"]], "Validated Environment": [[308, "validated-environment"]], "Validated Hardware Environment": [[302, "validated-hardware-environment"], [308, "validated-hardware-environment"]], "Validated Model List": [[349, "validated-model-list"], [350, "validated-model-list"]], "Validated Model Performance": [[425, "validated-model-performance"]], "Validated Models": [[428, "validated-models"]], "Validated Performance Data": [[411, "validated-performance-data"]], "Validated Software Environment": [[308, "validated-software-environment"]], "Vector Stores": [[309, "vector-stores"], [372, "vector-stores"]], "VectorStoreRetriever": [[372, "vectorstoreretriever"]], "Verbose": [[410, "verbose"]], "Visual Instruction Tuning": [[350, "visual-instruction-tuning"]], "Voice Chat": [[311, "voice-chat"]], "Voice Cloning by finetuning a Text-To-Speech (TTS) model": [[355, "voice-cloning-by-finetuning-a-text-to-speech-tts-model"]], "Weight Only Quantization": [[319, "weight-only-quantization"]], "Weight Only Quantization (WOQ)": [[432, "weight-only-quantization-woq"]], "Weight Only Quantization with LLM Runtime": [[319, "weight-only-quantization-with-llm-runtime"]], "Welcome to Intel\u00ae Extension for Transformers\u2019 documentation!": [[292, "welcome-to-intel-extension-for-transformers-documentation"], [435, "welcome-to-intel-extension-for-transformers-documentation"]], "You can get profile only with ENGINE_PROFILING=1 before running model by python/c++ API.": [[389, "you-can-get-profile-only-with-engine-profiling-1-before-running-model-by-python-c-api"]], "alpha,beta,scale meaning": [[401, "alpha-beta-scale-meaning"]], "attention": [[413, "attention"]], "cURL": [[321, "curl"]], "conversation": [[0, "module-conversation"]], "different jit-paths for different weight size": [[405, "different-jit-paths-for-different-weight-size"]], "dynamic_quant": [[413, "dynamic-quant"]], "dynamic_quant_matmul": [[413, "dynamic-quant-matmul"]], "eltwiseop": [[413, "eltwiseop"]], "gaudi_spawn": [[1, "module-gaudi_spawn"]], "intel_extension_for_transformers.langchain.langchain_community.retrievers.child_parent_retriever": [[2, "module-intel_extension_for_transformers.langchain.langchain_community.retrievers.child_parent_retriever"]], "intel_extension_for_transformers.langchain.langchain_community.vectorstores.chroma": [[3, "module-intel_extension_for_transformers.langchain.langchain_community.vectorstores.chroma"]], "intel_extension_for_transformers.neural_chat.chatbot": [[4, "module-intel_extension_for_transformers.neural_chat.chatbot"]], "intel_extension_for_transformers.neural_chat.config": [[5, "module-intel_extension_for_transformers.neural_chat.config"]], "intel_extension_for_transformers.neural_chat.config_logging": [[6, "module-intel_extension_for_transformers.neural_chat.config_logging"]], "intel_extension_for_transformers.neural_chat.errorcode": [[7, "module-intel_extension_for_transformers.neural_chat.errorcode"]], "intel_extension_for_transformers.neural_chat.pipeline": [[8, "module-intel_extension_for_transformers.neural_chat.pipeline"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.image2image.instructpix2pix_pipeline": [[9, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.image2image.instructpix2pix_pipeline"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.memory.memory": [[10, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.memory.memory"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.detector.intent_detection": [[11, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.detector.intent_detection"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.detector.query_explainer": [[12, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.detector.query_explainer"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.parser.parser": [[13, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.parser.parser"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.retriever_adapter": [[14, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.retriever_adapter"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.security.safety_checker": [[15, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.security.safety_checker"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.bfm": [[16, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.bfm"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks": [[17, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util": [[18, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.load_mats": [[19, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.load_mats"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.preprocess": [[20, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.preprocess"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.util": [[21, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.util"]], "intel_extension_for_transformers.neural_chat.server.restful.openai_protocol": [[22, "module-intel_extension_for_transformers.neural_chat.server.restful.openai_protocol"]], "intel_extension_for_transformers.neural_chat.tools.rome.repr_tools": [[23, "module-intel_extension_for_transformers.neural_chat.tools.rome.repr_tools"]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook": [[24, "module-intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook"]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats": [[25, "module-intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats"]], "intel_extension_for_transformers.tools.utils": [[26, "module-intel_extension_for_transformers.tools.utils"]], "intel_extension_for_transformers.transformers.benchmark": [[27, "module-intel_extension_for_transformers.transformers.benchmark"]], "intel_extension_for_transformers.transformers.config": [[28, "module-intel_extension_for_transformers.transformers.config"]], "intel_extension_for_transformers.transformers.dynamic": [[31, "module-intel_extension_for_transformers.transformers.dynamic"]], "intel_extension_for_transformers.transformers.dynamic.drop_and_restore_utils": [[29, "module-intel_extension_for_transformers.transformers.dynamic.drop_and_restore_utils"]], "intel_extension_for_transformers.transformers.dynamic.evolution": [[30, "module-intel_extension_for_transformers.transformers.dynamic.evolution"]], "intel_extension_for_transformers.transformers.kv_cache_compression.models.modeling_llama": [[32, "module-intel_extension_for_transformers.transformers.kv_cache_compression.models.modeling_llama"]], "intel_extension_for_transformers.transformers.modeling": [[34, "module-intel_extension_for_transformers.transformers.modeling"]], "intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode": [[33, "module-intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode"]], "intel_extension_for_transformers.transformers.modeling.model": [[35, "module-intel_extension_for_transformers.transformers.modeling.model"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic": [[36, "module-intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic"]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.bart.modeling_bart": [[37, "module-intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.bart.modeling_bart"]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.llama.pos_shift_llama": [[38, "module-intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.llama.pos_shift_llama"]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mistral.modeling_mistral": [[39, "module-intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mistral.modeling_mistral"]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral.modeling_mixtral": [[40, "module-intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral.modeling_mixtral"]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.phi.modeling_phi": [[41, "module-intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.phi.modeling_phi"]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.swin.modeling_swin": [[42, "module-intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.swin.modeling_swin"]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.streaming_llm": [[43, "module-intel_extension_for_transformers.transformers.modeling.modeling_gaudi.streaming_llm"]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic": [[44, "module-intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic"]], "intel_extension_for_transformers.transformers.pipeline": [[45, "module-intel_extension_for_transformers.transformers.pipeline"]], "intel_extension_for_transformers.transformers.pruner": [[46, "module-intel_extension_for_transformers.transformers.pruner"]], "intel_extension_for_transformers.transformers.pruner.pruning": [[47, "module-intel_extension_for_transformers.transformers.pruner.pruning"]], "intel_extension_for_transformers.transformers.quantization": [[48, "module-intel_extension_for_transformers.transformers.quantization"]], "intel_extension_for_transformers.transformers.runtime": [[245, "module-intel_extension_for_transformers.transformers.runtime"]], "intel_extension_for_transformers.transformers.runtime.compile": [[58, "module-intel_extension_for_transformers.transformers.runtime.compile"]], "intel_extension_for_transformers.transformers.runtime.compile.compile": [[49, "module-intel_extension_for_transformers.transformers.runtime.compile.compile"]], "intel_extension_for_transformers.transformers.runtime.compile.extractors": [[51, "module-intel_extension_for_transformers.transformers.runtime.compile.extractors"]], "intel_extension_for_transformers.transformers.runtime.compile.extractors.extractor": [[50, "module-intel_extension_for_transformers.transformers.runtime.compile.extractors.extractor"]], "intel_extension_for_transformers.transformers.runtime.compile.extractors.onnx_extractor": [[52, "module-intel_extension_for_transformers.transformers.runtime.compile.extractors.onnx_extractor"]], "intel_extension_for_transformers.transformers.runtime.compile.extractors.tf_extractor": [[53, "module-intel_extension_for_transformers.transformers.runtime.compile.extractors.tf_extractor"]], "intel_extension_for_transformers.transformers.runtime.compile.extractors.torch_extractor": [[54, "module-intel_extension_for_transformers.transformers.runtime.compile.extractors.torch_extractor"]], "intel_extension_for_transformers.transformers.runtime.compile.graph": [[56, "module-intel_extension_for_transformers.transformers.runtime.compile.graph"]], "intel_extension_for_transformers.transformers.runtime.compile.graph.graph": [[55, "module-intel_extension_for_transformers.transformers.runtime.compile.graph.graph"]], "intel_extension_for_transformers.transformers.runtime.compile.graph_utils": [[57, "module-intel_extension_for_transformers.transformers.runtime.compile.graph_utils"]], "intel_extension_for_transformers.transformers.runtime.compile.loaders": [[59, "module-intel_extension_for_transformers.transformers.runtime.compile.loaders"]], "intel_extension_for_transformers.transformers.runtime.compile.loaders.loader": [[60, "module-intel_extension_for_transformers.transformers.runtime.compile.loaders.loader"]], "intel_extension_for_transformers.transformers.runtime.compile.logger": [[61, "module-intel_extension_for_transformers.transformers.runtime.compile.logger"]], "intel_extension_for_transformers.transformers.runtime.compile.onnx_utils": [[62, "module-intel_extension_for_transformers.transformers.runtime.compile.onnx_utils"]], "intel_extension_for_transformers.transformers.runtime.compile.ops": [[83, "module-intel_extension_for_transformers.transformers.runtime.compile.ops"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.all": [[63, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.all"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.assert": [[64, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.assert"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.baddbmm": [[65, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.baddbmm"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul": [[66, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul_v2": [[67, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul_v2"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.bias_add": [[68, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.bias_add"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.cast": [[69, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.cast"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.concat": [[70, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.concat"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.conv": [[71, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.conv"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.cos": [[72, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.cos"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops": [[73, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.expand_dims": [[74, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.expand_dims"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_batch_matmul_v2": [[75, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.fused_batch_matmul_v2"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_batch_norm_v3": [[76, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.fused_batch_norm_v3"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_gemm": [[77, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.fused_gemm"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_matmul": [[78, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.fused_matmul"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.gather": [[79, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.gather"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.gather_elements": [[80, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.gather_elements"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.gelu": [[81, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.gelu"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.gemm": [[82, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.gemm"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_get_next": [[84, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_get_next"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_v2": [[85, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_v2"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.layer_normalization": [[86, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.layer_normalization"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.log_softmax": [[87, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.log_softmax"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.map_and_batch_dataset": [[88, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.map_and_batch_dataset"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.matmul": [[89, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.matmul"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.mean": [[90, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.mean"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.mkl_layer_norm": [[91, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.mkl_layer_norm"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.model_dataset": [[92, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.model_dataset"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.one_hot": [[93, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.one_hot"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.onnx_input": [[94, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.onnx_input"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.op": [[95, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.op"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.optimize_dataset": [[96, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.optimize_dataset"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.pack": [[97, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.pack"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.padding_sequence": [[98, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.padding_sequence"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.placeholder": [[99, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.placeholder"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.pos_embed": [[100, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.pos_embed"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.pow": [[101, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.pow"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_linear": [[102, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_linear"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_v2": [[103, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_v2"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.quantized_fused_matmul_and_dequantize": [[104, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.quantized_fused_matmul_and_dequantize"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.quantized_matmul_with_bias_and_dequantize": [[105, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.quantized_matmul_with_bias_and_dequantize"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_mean": [[106, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_mean"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_sum": [[107, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_sum"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.reorder": [[108, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.reorder"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.reshape": [[109, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.reshape"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.resize": [[110, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.resize"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.rsub": [[111, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.rsub"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.scatter_elements": [[112, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.scatter_elements"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.shape": [[113, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.shape"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.sin": [[114, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.sin"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.size": [[115, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.size"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.slice_position_ids": [[116, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.slice_position_ids"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.softmax": [[117, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.softmax"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.split": [[118, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.split"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.squeeze": [[119, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.squeeze"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.strided_slice": [[120, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.strided_slice"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.tensor": [[121, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.tensor"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.top_k": [[122, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.top_k"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.transpose": [[123, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.transpose"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.unpack": [[124, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.unpack"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.unsqueeze": [[125, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.unsqueeze"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.view": [[126, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.view"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.where": [[127, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.where"]], "intel_extension_for_transformers.transformers.runtime.compile.optimizer": [[128, "module-intel_extension_for_transformers.transformers.runtime.compile.optimizer"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph": [[150, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.InnerproductReshapeFusion": [[129, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.InnerproductReshapeFusion"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.add_cls_token": [[130, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.add_cls_token"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.add_embeddings": [[131, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.add_embeddings"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.arangewithreciprocal": [[132, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.arangewithreciprocal"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_AttentionMaskAddReshape": [[133, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_AttentionMaskAddReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_ConstantOfShapeWithMul": [[134, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_ConstantOfShapeWithMul"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_QKVPreReshape": [[135, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_QKVPreReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_QKVReshape": [[136, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_QKVReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_WeightReshapeTo4D": [[137, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_WeightReshapeTo4D"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_mask_length_adaptive_keep_indices": [[138, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_mask_length_adaptive_keep_indices"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_output_layer_norm_length_adaptive_keep_indices": [[139, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_output_layer_norm_length_adaptive_keep_indices"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_reshape": [[140, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_reshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.cast_to": [[141, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.cast_to"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.collect_quant_info": [[142, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.collect_quant_info"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.conv_reshape": [[143, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.conv_reshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.decoder_attn_reshape": [[144, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.decoder_attn_reshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.einsumwitharange": [[145, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.einsumwitharange"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.embeddingbag": [[146, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.embeddingbag"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.embeddings_to_2d_before_inner_product": [[147, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.embeddings_to_2d_before_inner_product"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.gelu": [[148, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.gelu"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.generate_sequence": [[149, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.generate_sequence"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithbiasgelu": [[151, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithbiasgelu"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithslice": [[152, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithslice"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithswish": [[153, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithswish"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.input_data": [[154, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.input_data"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.input_file": [[155, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.input_file"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.insert_bf16_node": [[156, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.insert_bf16_node"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.insert_quant_node": [[157, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.insert_quant_node"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.int8_bf16_mixed_precision_checker": [[158, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.int8_bf16_mixed_precision_checker"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.interact_features": [[159, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.interact_features"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.last_layer_shape": [[160, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.last_layer_shape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm": [[161, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm_with_reduce_mean": [[162, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm_with_reduce_mean"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm_with_transpose": [[163, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm_with_transpose"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_embeding": [[164, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_embeding"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_matmulwithtranspose": [[165, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_matmulwithtranspose"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_postprocess": [[166, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_postprocess"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_rotary_pos_emb": [[167, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_rotary_pos_emb"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.lower_all_tuples": [[168, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.lower_all_tuples"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias": [[169, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_add": [[170, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_add"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_gelu": [[171, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_gelu"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_relu": [[172, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_relu"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_sigmoid": [[173, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_sigmoid"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_tanh": [[174, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_tanh"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_unsqueeze": [[175, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_unsqueeze"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_transpose": [[176, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_transpose"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_transpose_scale_add": [[177, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_transpose_scale_add"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.merged_embeddingbag": [[178, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.merged_embeddingbag"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.neox_reorder_change": [[179, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.neox_reorder_change"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.neox_rotary_pos_emb": [[180, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.neox_rotary_pos_emb"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.operator_adaptor": [[181, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.operator_adaptor"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.output_data": [[182, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.output_data"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.padding_sequence": [[183, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.padding_sequence"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.pattern": [[184, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.pattern"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.position_embeddings": [[185, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.position_embeddings"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.position_embeddings_v1": [[186, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.position_embeddings_v1"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.qkv_merge": [[187, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.qkv_merge"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.qkv_reshape": [[188, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.qkv_reshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quant_gather_to_bf16": [[189, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quant_gather_to_bf16"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quantize_fusion": [[190, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quantize_fusion"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quantized_graph_dtype_refactor": [[191, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quantized_graph_dtype_refactor"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_constant_op": [[192, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_constant_op"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_last_view": [[193, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_last_view"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_range": [[194, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_range"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_unused_operator": [[195, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_unused_operator"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_zeros": [[196, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_zeros"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.removeslice": [[197, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.removeslice"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_after_restore_hidden_states": [[198, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_after_restore_hidden_states"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_before_and_after_attention_out_layer_norm_gather_elements": [[199, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_before_and_after_attention_out_layer_norm_gather_elements"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_before_restore_hidden_states": [[200, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_before_restore_hidden_states"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_fusion": [[201, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_fusion"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.restore_hidden_states_in_length_adaptive_update_indices": [[202, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.restore_hidden_states_in_length_adaptive_update_indices"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.rms_norm": [[203, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.rms_norm"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.rotary_pos_emb": [[204, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.rotary_pos_emb"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.slicemask": [[205, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.slicemask"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_ExplicitNHWCTranspose": [[206, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_ExplicitNHWCTranspose"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_ExplicitNHWCTransposeQAT": [[207, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_ExplicitNHWCTransposeQAT"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_MHAReshape": [[208, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_MHAReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_QuantizeFusion": [[209, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_QuantizeFusion"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_ReshapeFusion": [[210, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_ReshapeFusion"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_bf16Convert": [[211, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_bf16Convert"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_collectQDQInfo": [[212, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_collectQDQInfo"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_insertQuantNode": [[213, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_insertQuantNode"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.start_end_logits": [[214, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.start_end_logits"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.subgraph_matcher": [[215, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.subgraph_matcher"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncdoer_word_embedding": [[216, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncdoer_word_embedding"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_AttentionMaskAddReshape": [[217, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_AttentionMaskAddReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_AttentionReshape": [[218, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_AttentionReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_KVReshape": [[219, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_KVReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_MulReshape": [[220, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_MulReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_QReshape": [[221, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_QReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_SoftmaxReshape": [[222, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_SoftmaxReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_causal_attention_mask": [[223, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_causal_attention_mask"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.token_type_embeddings": [[224, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.token_type_embeddings"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.token_type_embeddings_v1": [[225, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.token_type_embeddings_v1"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_embedding": [[226, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_embedding"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_ip_insert_bias": [[227, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_ip_insert_bias"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_unpack_baddbmm": [[228, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_unpack_baddbmm"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torchinsertbf16node": [[229, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torchinsertbf16node"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torchpaddingsquence": [[230, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torchpaddingsquence"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_AttentionMaskAddReshape": [[231, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_AttentionMaskAddReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_ConstantOfShapeWithMul": [[232, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_ConstantOfShapeWithMul"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_FFNSlice": [[233, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_FFNSlice"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_FFNSlice_1": [[234, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_FFNSlice_1"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_QKVPreReshape": [[235, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_QKVPreReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_QKVReshape": [[236, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_QKVReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_QKVReshape4D": [[237, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_QKVReshape4D"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_encoderHiddenStatesReshape": [[238, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_encoderHiddenStatesReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_getSampleBatch": [[239, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_getSampleBatch"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_sampleSlice": [[240, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_sampleSlice"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transpose_batch_matmul": [[241, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transpose_batch_matmul"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.word_embeddings": [[242, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.word_embeddings"]], "intel_extension_for_transformers.transformers.runtime.compile.tf_utils": [[243, "module-intel_extension_for_transformers.transformers.runtime.compile.tf_utils"]], "intel_extension_for_transformers.transformers.runtime.compile.torch_utils": [[244, "module-intel_extension_for_transformers.transformers.runtime.compile.torch_utils"]], "intel_extension_for_transformers.transformers.trainer": [[246, "module-intel_extension_for_transformers.transformers.trainer"]], "intel_extension_for_transformers.transformers.utils": [[249, "module-intel_extension_for_transformers.transformers.utils"]], "intel_extension_for_transformers.transformers.utils.config": [[247, "module-intel_extension_for_transformers.transformers.utils.config"]], "intel_extension_for_transformers.transformers.utils.get_throughput": [[248, "module-intel_extension_for_transformers.transformers.utils.get_throughput"]], "intel_extension_for_transformers.transformers.utils.metrics": [[250, "module-intel_extension_for_transformers.transformers.utils.metrics"]], "intel_extension_for_transformers.transformers.utils.objectives": [[251, "module-intel_extension_for_transformers.transformers.utils.objectives"]], "intel_extension_for_transformers.transformers.utils.utility": [[252, "module-intel_extension_for_transformers.transformers.utils.utility"]], "jit_binaryop_injector.hpp": [[400, "jit-binaryop-injector-hpp"]], "jit_eltwise_injector.hpp": [[401, "jit-eltwise-injector-hpp"]], "layernorm_ba": [[413, "layernorm-ba"]], "layernormalized sparse matmul": [[406, "layernormalized-sparse-matmul"]], "main_eval_only": [[253, "module-main_eval_only"]], "main_parse_and_eval": [[254, "module-main_parse_and_eval"]], "matmul_avx512f_p2031_p2013": [[413, "matmul-avx512f-p2031-p2013"]], "matmul_vnni_noperm_p2031_p1302": [[413, "matmul-vnni-noperm-p2031-p1302"]], "meta-llama/Llama-2-7b-hf": [[349, "meta-llama-llama-2-7b-hf"]], "microsoft/git-base": [[348, "microsoft-git-base"]], "models.backbone": [[255, "module-models.backbone"]], "models.detr": [[256, "module-models.detr"]], "models.detr_multi": [[257, "module-models.detr_multi"]], "models.matcher": [[258, "module-models.matcher"]], "models.position_encoding": [[259, "module-models.position_encoding"]], "models.segmentation": [[260, "module-models.segmentation"]], "models.transformer": [[261, "module-models.transformer"]], "mpt architecture": [[346, "mpt-architecture"]], "one-stage jit-path": [[405, "one-stage-jit-path"]], "operator_desc.hpp": [[400, "operator-desc-hpp"], [401, "operator-desc-hpp"]], "param_type.hpp": [[400, "param-type-hpp"]], "param_types.hpp": [[401, "param-types-hpp"]], "platform configuration": [[397, "platform-configuration"]], "prerequisite": [[361, "prerequisite"], [362, "prerequisite"]], "problem description": [[405, "problem-description"]], "references": [[432, "references"]], "softmax": [[413, "softmax"]], "sparse_matmul": [[413, "sparse-matmul"]], "sparse_matmul kernel:": [[398, "sparse-matmul-kernel"]], "spmm_amx_bf16_x16": [[413, "spmm-amx-bf16-x16"]], "spmm_avx512f": [[413, "spmm-avx512f"]], "spmm_vnni": [[413, "spmm-vnni"]], "text": [[262, "module-text"]], "transpose_matmul": [[413, "transpose-matmul"]], "two-stage jit-path": [[405, "two-stage-jit-path"]], "usage": [[303, "usage"]], "util.box_ops": [[263, "module-util.box_ops"]], "util.misc": [[264, "module-util.misc"]], "util.plot_utils": [[265, "module-util.plot_utils"]], "util.postprocess": [[266, "module-util.postprocess"]], "utils.data_utils": [[267, "module-utils.data_utils"]], "utils.eval_utils": [[268, "module-utils.eval_utils"]], "vllm serving for NeuralChat": [[367, "vllm-serving-for-neuralchat"]], "\ud83c\udf99\ufe0f Talking Bot": [[378, "talking-bot"]], "\ud83c\udfe0Introduction": [[371, "introduction"]], "\ud83d\udcf8 Project Screenshots": [[341, "project-screenshots"], [378, "project-screenshots"], [378, "id1"], [378, "id2"], [378, "id3"], [379, "project-screenshots"], [381, "project-screenshots"], [382, "project-screenshots"]], "\ud83d\udd21 TextBot": [[378, "textbot"]], "\ud83d\udd27Install dependencies": [[371, "install-dependencies"]], "\ud83d\ude0e What can this help with?": [[370, "what-can-this-help-with"]], "\ud83d\ude4c SideBySide": [[378, "sidebyside"]], "\ud83d\ude80 Check configuration": [[345, "check-configuration"], [383, "check-configuration"], [384, "check-configuration"]], "\ud83d\ude80 Create a new space on Huggingface": [[345, "create-a-new-space-on-huggingface"], [383, "create-a-new-space-on-huggingface"], [384, "create-a-new-space-on-huggingface"]], "\ud83d\ude80 Setup application": [[345, "setup-application"], [383, "setup-application"], [384, "setup-application"]], "\ud83d\ude80 What is caching plugin?": [[370, "what-is-caching-plugin"]], "\ud83d\ude80Usage": [[371, "usage"]], "\ud83d\ude97Parameters": [[371, "parameters"]], "\ud83e\udd14 How does it work?": [[370, "how-does-it-work"]], "\ud83e\udd16 AI Talking Photo": [[378, "ai-talking-photo"]]}, "docnames": ["autoapi/conversation/index", "autoapi/gaudi_spawn/index", "autoapi/intel_extension_for_transformers/langchain/langchain_community/retrievers/child_parent_retriever/index", "autoapi/intel_extension_for_transformers/langchain/langchain_community/vectorstores/chroma/index", "autoapi/intel_extension_for_transformers/neural_chat/chatbot/index", "autoapi/intel_extension_for_transformers/neural_chat/config/index", "autoapi/intel_extension_for_transformers/neural_chat/config_logging/index", "autoapi/intel_extension_for_transformers/neural_chat/errorcode/index", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/index", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/image2image/instructpix2pix_pipeline/index", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/memory/memory/index", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/retrieval/detector/intent_detection/index", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/retrieval/detector/query_explainer/index", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/retrieval/parser/parser/index", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/retrieval/retriever_adapter/index", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/security/safety_checker/index", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/video/face_animation/src/face3d/models/bfm/index", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/video/face_animation/src/face3d/models/networks/index", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/video/face_animation/src/face3d/util/index", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/video/face_animation/src/face3d/util/load_mats/index", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/video/face_animation/src/face3d/util/preprocess/index", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/video/face_animation/src/face3d/util/util/index", "autoapi/intel_extension_for_transformers/neural_chat/server/restful/openai_protocol/index", "autoapi/intel_extension_for_transformers/neural_chat/tools/rome/repr_tools/index", "autoapi/intel_extension_for_transformers/neural_chat/tools/rome/utils/nethook/index", "autoapi/intel_extension_for_transformers/neural_chat/tools/rome/utils/runningstats/index", "autoapi/intel_extension_for_transformers/tools/utils/index", "autoapi/intel_extension_for_transformers/transformers/benchmark/index", "autoapi/intel_extension_for_transformers/transformers/config/index", "autoapi/intel_extension_for_transformers/transformers/dynamic/drop_and_restore_utils/index", "autoapi/intel_extension_for_transformers/transformers/dynamic/evolution/index", "autoapi/intel_extension_for_transformers/transformers/dynamic/index", "autoapi/intel_extension_for_transformers/transformers/kv_cache_compression/models/modeling_llama/index", "autoapi/intel_extension_for_transformers/transformers/modeling/gpt_bigcode/modeling_gpt_bigcode/index", "autoapi/intel_extension_for_transformers/transformers/modeling/index", "autoapi/intel_extension_for_transformers/transformers/modeling/model/index", "autoapi/intel_extension_for_transformers/transformers/modeling/modeling_bert_dynamic/index", "autoapi/intel_extension_for_transformers/transformers/modeling/modeling_gaudi/models/bart/modeling_bart/index", "autoapi/intel_extension_for_transformers/transformers/modeling/modeling_gaudi/models/llama/pos_shift_llama/index", "autoapi/intel_extension_for_transformers/transformers/modeling/modeling_gaudi/models/mistral/modeling_mistral/index", "autoapi/intel_extension_for_transformers/transformers/modeling/modeling_gaudi/models/mixtral/modeling_mixtral/index", "autoapi/intel_extension_for_transformers/transformers/modeling/modeling_gaudi/models/phi/modeling_phi/index", "autoapi/intel_extension_for_transformers/transformers/modeling/modeling_gaudi/models/swin/modeling_swin/index", "autoapi/intel_extension_for_transformers/transformers/modeling/modeling_gaudi/streaming_llm/index", "autoapi/intel_extension_for_transformers/transformers/modeling/modeling_roberta_dynamic/index", "autoapi/intel_extension_for_transformers/transformers/pipeline/index", "autoapi/intel_extension_for_transformers/transformers/pruner/index", "autoapi/intel_extension_for_transformers/transformers/pruner/pruning/index", "autoapi/intel_extension_for_transformers/transformers/quantization/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/compile/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/extractors/extractor/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/extractors/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/extractors/onnx_extractor/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/extractors/tf_extractor/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/extractors/torch_extractor/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/graph/graph/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/graph/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/graph_utils/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/loaders/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/loaders/loader/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/logger/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/onnx_utils/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/all/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/assert/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/baddbmm/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/batch_matmul/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/batch_matmul_v2/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/bias_add/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/cast/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/concat/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/conv/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/cos/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/empty_ops/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/expand_dims/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/fused_batch_matmul_v2/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/fused_batch_norm_v3/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/fused_gemm/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/fused_matmul/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/gather/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/gather_elements/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/gelu/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/gemm/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/iterator_get_next/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/iterator_v2/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/layer_normalization/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/log_softmax/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/map_and_batch_dataset/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/matmul/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/mean/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/mkl_layer_norm/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/model_dataset/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/one_hot/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/onnx_input/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/op/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/optimize_dataset/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/pack/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/padding_sequence/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/placeholder/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/pos_embed/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/pow/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/quantize_linear/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/quantize_v2/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/quantized_fused_matmul_and_dequantize/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/quantized_matmul_with_bias_and_dequantize/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/reduce_mean/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/reduce_sum/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/reorder/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/reshape/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/resize/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/rsub/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/scatter_elements/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/shape/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/sin/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/size/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/slice_position_ids/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/softmax/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/split/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/squeeze/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/strided_slice/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/tensor/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/top_k/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/transpose/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/unpack/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/unsqueeze/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/view/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/where/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/optimizer/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/InnerproductReshapeFusion/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/add_cls_token/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/add_embeddings/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/arangewithreciprocal/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/attentionBlock_AttentionMaskAddReshape/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/attentionBlock_ConstantOfShapeWithMul/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/attentionBlock_QKVPreReshape/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/attentionBlock_QKVReshape/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/attentionBlock_WeightReshapeTo4D/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/attention_mask_length_adaptive_keep_indices/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/attention_output_layer_norm_length_adaptive_keep_indices/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/attention_reshape/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/cast_to/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/collect_quant_info/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/conv_reshape/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/decoder_attn_reshape/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/einsumwitharange/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/embeddingbag/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/embeddings_to_2d_before_inner_product/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/gelu/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/generate_sequence/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/innerproductwithbiasgelu/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/innerproductwithslice/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/innerproductwithswish/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/input_data/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/input_file/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/insert_bf16_node/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/insert_quant_node/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/int8_bf16_mixed_precision_checker/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/interact_features/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/last_layer_shape/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/layer_norm/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/layer_norm_with_reduce_mean/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/layer_norm_with_transpose/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/llama_embeding/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/llama_matmulwithtranspose/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/llama_postprocess/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/llama_rotary_pos_emb/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/lower_all_tuples/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/matmul_with_bias/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/matmul_with_bias_add/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/matmul_with_bias_gelu/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/matmul_with_bias_relu/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/matmul_with_bias_sigmoid/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/matmul_with_bias_tanh/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/matmul_with_bias_unsqueeze/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/matmul_with_transpose/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/matmul_with_transpose_scale_add/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/merged_embeddingbag/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/neox_reorder_change/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/neox_rotary_pos_emb/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/operator_adaptor/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/output_data/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/padding_sequence/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/pattern/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/position_embeddings/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/position_embeddings_v1/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/qkv_merge/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/qkv_reshape/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/quant_gather_to_bf16/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/quantize_fusion/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/quantized_graph_dtype_refactor/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/remove_constant_op/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/remove_last_view/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/remove_range/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/remove_unused_operator/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/remove_zeros/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/removeslice/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/reshape_after_restore_hidden_states/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/reshape_before_and_after_attention_out_layer_norm_gather_elements/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/reshape_before_restore_hidden_states/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/reshape_fusion/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/restore_hidden_states_in_length_adaptive_update_indices/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/rms_norm/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/rotary_pos_emb/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/slicemask/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/stableDiffusion_ExplicitNHWCTranspose/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/stableDiffusion_ExplicitNHWCTransposeQAT/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/stableDiffusion_MHAReshape/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/stableDiffusion_QuantizeFusion/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/stableDiffusion_ReshapeFusion/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/stableDiffusion_bf16Convert/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/stableDiffusion_collectQDQInfo/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/stableDiffusion_insertQuantNode/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/start_end_logits/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/subgraph_matcher/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/textEncdoer_word_embedding/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/textEncoder_AttentionMaskAddReshape/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/textEncoder_AttentionReshape/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/textEncoder_KVReshape/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/textEncoder_MulReshape/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/textEncoder_QReshape/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/textEncoder_SoftmaxReshape/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/textEncoder_causal_attention_mask/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/token_type_embeddings/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/token_type_embeddings_v1/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/torch_embedding/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/torch_ip_insert_bias/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/torch_unpack_baddbmm/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/torchinsertbf16node/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/torchpaddingsquence/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transformer2Dmodel_AttentionMaskAddReshape/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transformer2Dmodel_ConstantOfShapeWithMul/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transformer2Dmodel_FFNSlice/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transformer2Dmodel_FFNSlice_1/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transformer2Dmodel_QKVPreReshape/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transformer2Dmodel_QKVReshape/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transformer2Dmodel_QKVReshape4D/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transformer2Dmodel_encoderHiddenStatesReshape/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transformer2Dmodel_getSampleBatch/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transformer2Dmodel_sampleSlice/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transpose_batch_matmul/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/word_embeddings/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/tf_utils/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/torch_utils/index", "autoapi/intel_extension_for_transformers/transformers/runtime/index", "autoapi/intel_extension_for_transformers/transformers/trainer/index", "autoapi/intel_extension_for_transformers/transformers/utils/config/index", "autoapi/intel_extension_for_transformers/transformers/utils/get_throughput/index", "autoapi/intel_extension_for_transformers/transformers/utils/index", "autoapi/intel_extension_for_transformers/transformers/utils/metrics/index", "autoapi/intel_extension_for_transformers/transformers/utils/objectives/index", "autoapi/intel_extension_for_transformers/transformers/utils/utility/index", "autoapi/main_eval_only/index", "autoapi/main_parse_and_eval/index", "autoapi/models/backbone/index", "autoapi/models/detr/index", "autoapi/models/detr_multi/index", "autoapi/models/matcher/index", "autoapi/models/position_encoding/index", "autoapi/models/segmentation/index", "autoapi/models/transformer/index", "autoapi/text/index", "autoapi/util/box_ops/index", "autoapi/util/misc/index", "autoapi/util/plot_utils/index", "autoapi/util/postprocess/index", "autoapi/utils/data_utils/index", "autoapi/utils/eval_utils/index", "docs/CI_introduction", "docs/README", "docs/SECURITY", "docs/Welcome", "docs/api_doc/api", "docs/api_doc/engine/api_py_engine", "docs/api_doc/engine/compile", "docs/api_doc/engine/graph", "docs/api_doc/engine_api", "docs/api_doc/kernel/engine", "docs/api_doc/kernel/interface", "docs/api_doc/kernel/operator_desc", "docs/api_doc/kernel/types", "docs/api_doc/kernel_api", "docs/api_doc/optimization/config", "docs/api_doc/optimization/model", "docs/api_doc/optimization/trainer", "docs/api_doc/user_api", "docs/architecture", "docs/autoround_comparative_analysis", "docs/benchmark", "docs/build_docs/source/example", "docs/build_docs/source/feature", "docs/build_docs/source/index", "docs/build_docs/source/kernel", "docs/build_docs/source/kernel_desc", "docs/build_docs/source/kernel_perf", "docs/build_docs/source/neural_engine", "docs/build_docs/source/user_guide", "docs/code_of_conduct", "docs/component_owner", "docs/contributions", "docs/contributors", "docs/devcatalog", "docs/distillation", "docs/examples", "docs/export", "docs/get_started", "docs/h2o", "docs/installation", "docs/intel_extension_for_transformers/neural_chat/README", "docs/intel_extension_for_transformers/neural_chat/assets/docs/sample", "docs/intel_extension_for_transformers/neural_chat/cli/README", "docs/intel_extension_for_transformers/neural_chat/docker/README", "docs/intel_extension_for_transformers/neural_chat/docker/code_generation/README", "docs/intel_extension_for_transformers/neural_chat/docker/finetuning/README", "docs/intel_extension_for_transformers/neural_chat/docker/inference/README", "docs/intel_extension_for_transformers/neural_chat/docker/text_generation/README", "docs/intel_extension_for_transformers/neural_chat/docker/tgi_serving/README", "docs/intel_extension_for_transformers/neural_chat/docker/vllm_serving/README", "docs/intel_extension_for_transformers/neural_chat/docs/advanced_features", "docs/intel_extension_for_transformers/neural_chat/docs/full_notebooks", "docs/intel_extension_for_transformers/neural_chat/docs/neuralchat_api", "docs/intel_extension_for_transformers/neural_chat/docs/notebooks/workshop/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/assisted_generation/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/chatgpt_rag/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/codegen/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/codegen/backend/gaudi/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/codegen/backend/pc/gguf/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/codegen/backend/pc/gptq/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/codegen/backend/pc/woq/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/codegen/backend/xeon/deepspeed/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/codegen/backend/xeon/ipex/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/codegen/backend/xeon/tpp/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/photo_ai/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/photo_ai/backend/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/photo_ai/frontend/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/plugin/audio/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/plugin/image2image/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/rag/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/talkingbot/server/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/talkingbot/server/backend/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/talkingbot/server/frontend/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/textbot/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/textbot/backend/xeon/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/textbot/backend_with_cache/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/textbot/frontend/README", "docs/intel_extension_for_transformers/neural_chat/examples/finetuning/dpo_pipeline/README", "docs/intel_extension_for_transformers/neural_chat/examples/finetuning/finetune_neuralchat_v3/README", "docs/intel_extension_for_transformers/neural_chat/examples/finetuning/image_to_text/README", "docs/intel_extension_for_transformers/neural_chat/examples/finetuning/instruction/README", "docs/intel_extension_for_transformers/neural_chat/examples/finetuning/multi_modal/README", "docs/intel_extension_for_transformers/neural_chat/examples/finetuning/multi_modal/eval/mmmu_eval/README", "docs/intel_extension_for_transformers/neural_chat/examples/finetuning/ppo_pipeline/README", "docs/intel_extension_for_transformers/neural_chat/examples/finetuning/shanghainese_asr_tts/README", "docs/intel_extension_for_transformers/neural_chat/examples/finetuning/text_generation/README", "docs/intel_extension_for_transformers/neural_chat/examples/finetuning/tts/README", "docs/intel_extension_for_transformers/neural_chat/examples/helloworld/README", "docs/intel_extension_for_transformers/neural_chat/examples/langchain_extension/README", "docs/intel_extension_for_transformers/neural_chat/examples/plugins/retrieval/README", "docs/intel_extension_for_transformers/neural_chat/examples/plugins/table_extraction/README", "docs/intel_extension_for_transformers/neural_chat/examples/plugins/video/README", "docs/intel_extension_for_transformers/neural_chat/examples/quick_start/chatbot/README", "docs/intel_extension_for_transformers/neural_chat/examples/quick_start/rag/README", "docs/intel_extension_for_transformers/neural_chat/examples/serving/TGI/README", "docs/intel_extension_for_transformers/neural_chat/examples/serving/triton_inference_sever/cpu/README", "docs/intel_extension_for_transformers/neural_chat/examples/serving/triton_inference_sever/cuda/README", "docs/intel_extension_for_transformers/neural_chat/examples/serving/triton_inference_sever/hpu/README", "docs/intel_extension_for_transformers/neural_chat/examples/serving/vllm/README", "docs/intel_extension_for_transformers/neural_chat/examples/sql_generation/README", "docs/intel_extension_for_transformers/neural_chat/pipeline/plugins/audio/README", "docs/intel_extension_for_transformers/neural_chat/pipeline/plugins/caching/README", "docs/intel_extension_for_transformers/neural_chat/pipeline/plugins/ner/README", "docs/intel_extension_for_transformers/neural_chat/pipeline/plugins/retrieval/README", "docs/intel_extension_for_transformers/neural_chat/pipeline/plugins/security/README", "docs/intel_extension_for_transformers/neural_chat/pipeline/plugins/video/face_animation/README", "docs/intel_extension_for_transformers/neural_chat/server/README", "docs/intel_extension_for_transformers/neural_chat/tools/embedding_finetune/README", "docs/intel_extension_for_transformers/neural_chat/tools/rome/examples/README", "docs/intel_extension_for_transformers/neural_chat/ui/README", "docs/intel_extension_for_transformers/neural_chat/ui/customized/side_by_side/README", "docs/intel_extension_for_transformers/neural_chat/ui/customized/talking_photo/README", "docs/intel_extension_for_transformers/neural_chat/ui/customized/talkingbot/README", "docs/intel_extension_for_transformers/neural_chat/ui/customized/vision_demo/README", "docs/intel_extension_for_transformers/neural_chat/ui/gradio/basic/README", "docs/intel_extension_for_transformers/neural_chat/ui/gradio/side_by_side/README", "docs/intel_extension_for_transformers/tools/llm_carbon_calc_readme", "docs/intel_extension_for_transformers/transformers/runtime/docs/Installation", "docs/intel_extension_for_transformers/transformers/runtime/docs/add_customized_pattern", "docs/intel_extension_for_transformers/transformers/runtime/docs/deploy_and_integration", "docs/intel_extension_for_transformers/transformers/runtime/docs/engine_profiling", "docs/intel_extension_for_transformers/transformers/runtime/docs/engine_tuning", "docs/intel_extension_for_transformers/transformers/runtime/docs/graph_fusion", "docs/intel_extension_for_transformers/transformers/runtime/docs/onnx_compile", "docs/intel_extension_for_transformers/transformers/runtime/docs/onnx_quantize", "docs/intel_extension_for_transformers/transformers/runtime/docs/operator_register", "docs/intel_extension_for_transformers/transformers/runtime/docs/pattern_recognize", "docs/intel_extension_for_transformers/transformers/runtime/docs/static_compressed_buffer", "docs/intel_extension_for_transformers/transformers/runtime/docs/validated_model", "docs/intel_extension_for_transformers/transformers/runtime/kernels/README", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/3D_inference", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/binaryop_injector", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/eltwise_injector", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/gpu/sparse_gemm_gpu", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/kernel_amx", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/kernel_avx512f", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/kernel_dynamic_quant_matmul", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/kernel_layernormalized_spmm", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/kernel_transpose_matmul", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/kernel_transpose_mha", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/kernel_vnni", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/profiling", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/validated_data", "docs/intel_extension_for_transformers/transformers/runtime/kernels/scripts/README", "docs/intel_extension_for_transformers/transformers/runtime/test/kernels/benchmark/benchmark", "docs/intel_extension_for_transformers/transformers/runtime/test/kernels/benchmark/ci/inputs/README", "docs/legal", "docs/metrics", "docs/objectives", "docs/pipeline", "docs/pruning", "docs/publication", "docs/qbits", "docs/qloracpu", "docs/quantization", "docs/release", "docs/release_data", "docs/reproduce/efficient_LLM_inference_on_cpus", "docs/reproduce/neural_chat_v3-3_workflow", "docs/smoothquant", "docs/streamingllm", "docs/tutorials/README", "docs/user_guide", "docs/weightonlyquant", "example", "feature", "index", "kernel", "kernel_desc", "kernel_perf", "neural_engine", "user_guide"], "envversion": {"sphinx": 61, "sphinx.domains.c": 3, "sphinx.domains.changeset": 1, "sphinx.domains.citation": 1, "sphinx.domains.cpp": 9, "sphinx.domains.index": 1, "sphinx.domains.javascript": 3, "sphinx.domains.math": 2, "sphinx.domains.python": 4, "sphinx.domains.rst": 2, "sphinx.domains.std": 2}, "filenames": ["autoapi/conversation/index.rst", "autoapi/gaudi_spawn/index.rst", "autoapi/intel_extension_for_transformers/langchain/langchain_community/retrievers/child_parent_retriever/index.rst", "autoapi/intel_extension_for_transformers/langchain/langchain_community/vectorstores/chroma/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/chatbot/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/config/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/config_logging/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/errorcode/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/image2image/instructpix2pix_pipeline/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/memory/memory/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/retrieval/detector/intent_detection/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/retrieval/detector/query_explainer/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/retrieval/parser/parser/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/retrieval/retriever_adapter/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/security/safety_checker/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/video/face_animation/src/face3d/models/bfm/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/video/face_animation/src/face3d/models/networks/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/video/face_animation/src/face3d/util/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/video/face_animation/src/face3d/util/load_mats/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/video/face_animation/src/face3d/util/preprocess/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/video/face_animation/src/face3d/util/util/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/server/restful/openai_protocol/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/tools/rome/repr_tools/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/tools/rome/utils/nethook/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/tools/rome/utils/runningstats/index.rst", "autoapi/intel_extension_for_transformers/tools/utils/index.rst", "autoapi/intel_extension_for_transformers/transformers/benchmark/index.rst", "autoapi/intel_extension_for_transformers/transformers/config/index.rst", "autoapi/intel_extension_for_transformers/transformers/dynamic/drop_and_restore_utils/index.rst", "autoapi/intel_extension_for_transformers/transformers/dynamic/evolution/index.rst", "autoapi/intel_extension_for_transformers/transformers/dynamic/index.rst", "autoapi/intel_extension_for_transformers/transformers/kv_cache_compression/models/modeling_llama/index.rst", "autoapi/intel_extension_for_transformers/transformers/modeling/gpt_bigcode/modeling_gpt_bigcode/index.rst", "autoapi/intel_extension_for_transformers/transformers/modeling/index.rst", "autoapi/intel_extension_for_transformers/transformers/modeling/model/index.rst", "autoapi/intel_extension_for_transformers/transformers/modeling/modeling_bert_dynamic/index.rst", "autoapi/intel_extension_for_transformers/transformers/modeling/modeling_gaudi/models/bart/modeling_bart/index.rst", "autoapi/intel_extension_for_transformers/transformers/modeling/modeling_gaudi/models/llama/pos_shift_llama/index.rst", "autoapi/intel_extension_for_transformers/transformers/modeling/modeling_gaudi/models/mistral/modeling_mistral/index.rst", "autoapi/intel_extension_for_transformers/transformers/modeling/modeling_gaudi/models/mixtral/modeling_mixtral/index.rst", "autoapi/intel_extension_for_transformers/transformers/modeling/modeling_gaudi/models/phi/modeling_phi/index.rst", "autoapi/intel_extension_for_transformers/transformers/modeling/modeling_gaudi/models/swin/modeling_swin/index.rst", "autoapi/intel_extension_for_transformers/transformers/modeling/modeling_gaudi/streaming_llm/index.rst", "autoapi/intel_extension_for_transformers/transformers/modeling/modeling_roberta_dynamic/index.rst", "autoapi/intel_extension_for_transformers/transformers/pipeline/index.rst", "autoapi/intel_extension_for_transformers/transformers/pruner/index.rst", "autoapi/intel_extension_for_transformers/transformers/pruner/pruning/index.rst", "autoapi/intel_extension_for_transformers/transformers/quantization/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/compile/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/extractors/extractor/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/extractors/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/extractors/onnx_extractor/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/extractors/tf_extractor/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/extractors/torch_extractor/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/graph/graph/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/graph/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/graph_utils/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/loaders/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/loaders/loader/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/logger/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/onnx_utils/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/all/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/assert/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/baddbmm/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/batch_matmul/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/batch_matmul_v2/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/bias_add/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/cast/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/concat/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/conv/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/cos/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/empty_ops/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/expand_dims/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/fused_batch_matmul_v2/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/fused_batch_norm_v3/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/fused_gemm/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/fused_matmul/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/gather/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/gather_elements/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/gelu/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/gemm/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/iterator_get_next/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/iterator_v2/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/layer_normalization/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/log_softmax/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/map_and_batch_dataset/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/matmul/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/mean/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/mkl_layer_norm/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/model_dataset/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/one_hot/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/onnx_input/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/op/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/optimize_dataset/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/pack/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/padding_sequence/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/placeholder/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/pos_embed/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/pow/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/quantize_linear/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/quantize_v2/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/quantized_fused_matmul_and_dequantize/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/quantized_matmul_with_bias_and_dequantize/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/reduce_mean/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/reduce_sum/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/reorder/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/reshape/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/resize/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/rsub/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/scatter_elements/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/shape/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/sin/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/size/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/slice_position_ids/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/softmax/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/split/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/squeeze/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/strided_slice/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/tensor/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/top_k/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/transpose/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/unpack/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/unsqueeze/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/view/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/where/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/optimizer/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/InnerproductReshapeFusion/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/add_cls_token/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/add_embeddings/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/arangewithreciprocal/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/attentionBlock_AttentionMaskAddReshape/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/attentionBlock_ConstantOfShapeWithMul/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/attentionBlock_QKVPreReshape/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/attentionBlock_QKVReshape/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/attentionBlock_WeightReshapeTo4D/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/attention_mask_length_adaptive_keep_indices/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/attention_output_layer_norm_length_adaptive_keep_indices/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/attention_reshape/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/cast_to/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/collect_quant_info/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/conv_reshape/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/decoder_attn_reshape/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/einsumwitharange/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/embeddingbag/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/embeddings_to_2d_before_inner_product/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/gelu/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/generate_sequence/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/innerproductwithbiasgelu/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/innerproductwithslice/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/innerproductwithswish/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/input_data/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/input_file/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/insert_bf16_node/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/insert_quant_node/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/int8_bf16_mixed_precision_checker/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/interact_features/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/last_layer_shape/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/layer_norm/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/layer_norm_with_reduce_mean/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/layer_norm_with_transpose/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/llama_embeding/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/llama_matmulwithtranspose/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/llama_postprocess/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/llama_rotary_pos_emb/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/lower_all_tuples/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/matmul_with_bias/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/matmul_with_bias_add/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/matmul_with_bias_gelu/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/matmul_with_bias_relu/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/matmul_with_bias_sigmoid/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/matmul_with_bias_tanh/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/matmul_with_bias_unsqueeze/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/matmul_with_transpose/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/matmul_with_transpose_scale_add/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/merged_embeddingbag/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/neox_reorder_change/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/neox_rotary_pos_emb/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/operator_adaptor/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/output_data/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/padding_sequence/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/pattern/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/position_embeddings/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/position_embeddings_v1/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/qkv_merge/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/qkv_reshape/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/quant_gather_to_bf16/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/quantize_fusion/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/quantized_graph_dtype_refactor/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/remove_constant_op/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/remove_last_view/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/remove_range/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/remove_unused_operator/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/remove_zeros/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/removeslice/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/reshape_after_restore_hidden_states/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/reshape_before_and_after_attention_out_layer_norm_gather_elements/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/reshape_before_restore_hidden_states/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/reshape_fusion/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/restore_hidden_states_in_length_adaptive_update_indices/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/rms_norm/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/rotary_pos_emb/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/slicemask/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/stableDiffusion_ExplicitNHWCTranspose/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/stableDiffusion_ExplicitNHWCTransposeQAT/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/stableDiffusion_MHAReshape/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/stableDiffusion_QuantizeFusion/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/stableDiffusion_ReshapeFusion/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/stableDiffusion_bf16Convert/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/stableDiffusion_collectQDQInfo/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/stableDiffusion_insertQuantNode/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/start_end_logits/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/subgraph_matcher/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/textEncdoer_word_embedding/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/textEncoder_AttentionMaskAddReshape/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/textEncoder_AttentionReshape/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/textEncoder_KVReshape/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/textEncoder_MulReshape/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/textEncoder_QReshape/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/textEncoder_SoftmaxReshape/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/textEncoder_causal_attention_mask/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/token_type_embeddings/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/token_type_embeddings_v1/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/torch_embedding/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/torch_ip_insert_bias/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/torch_unpack_baddbmm/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/torchinsertbf16node/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/torchpaddingsquence/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transformer2Dmodel_AttentionMaskAddReshape/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transformer2Dmodel_ConstantOfShapeWithMul/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transformer2Dmodel_FFNSlice/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transformer2Dmodel_FFNSlice_1/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transformer2Dmodel_QKVPreReshape/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transformer2Dmodel_QKVReshape/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transformer2Dmodel_QKVReshape4D/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transformer2Dmodel_encoderHiddenStatesReshape/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transformer2Dmodel_getSampleBatch/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transformer2Dmodel_sampleSlice/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transpose_batch_matmul/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/word_embeddings/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/tf_utils/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/torch_utils/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/index.rst", "autoapi/intel_extension_for_transformers/transformers/trainer/index.rst", "autoapi/intel_extension_for_transformers/transformers/utils/config/index.rst", "autoapi/intel_extension_for_transformers/transformers/utils/get_throughput/index.rst", "autoapi/intel_extension_for_transformers/transformers/utils/index.rst", "autoapi/intel_extension_for_transformers/transformers/utils/metrics/index.rst", "autoapi/intel_extension_for_transformers/transformers/utils/objectives/index.rst", "autoapi/intel_extension_for_transformers/transformers/utils/utility/index.rst", "autoapi/main_eval_only/index.rst", "autoapi/main_parse_and_eval/index.rst", "autoapi/models/backbone/index.rst", "autoapi/models/detr/index.rst", "autoapi/models/detr_multi/index.rst", "autoapi/models/matcher/index.rst", "autoapi/models/position_encoding/index.rst", "autoapi/models/segmentation/index.rst", "autoapi/models/transformer/index.rst", "autoapi/text/index.rst", "autoapi/util/box_ops/index.rst", "autoapi/util/misc/index.rst", "autoapi/util/plot_utils/index.rst", "autoapi/util/postprocess/index.rst", "autoapi/utils/data_utils/index.rst", "autoapi/utils/eval_utils/index.rst", "docs/CI_introduction.md", "docs/README.md", "docs/SECURITY.md", "docs/Welcome.md", "docs/api_doc/api.rst", "docs/api_doc/engine/api_py_engine.rst", "docs/api_doc/engine/compile.rst", "docs/api_doc/engine/graph.rst", "docs/api_doc/engine_api.rst", "docs/api_doc/kernel/engine.rst", "docs/api_doc/kernel/interface.rst", "docs/api_doc/kernel/operator_desc.rst", "docs/api_doc/kernel/types.rst", "docs/api_doc/kernel_api.rst", "docs/api_doc/optimization/config.rst", "docs/api_doc/optimization/model.rst", "docs/api_doc/optimization/trainer.rst", "docs/api_doc/user_api.rst", "docs/architecture.md", "docs/autoround_comparative_analysis.md", "docs/benchmark.md", "docs/build_docs/source/example.rst", "docs/build_docs/source/feature.rst", "docs/build_docs/source/index.rst", "docs/build_docs/source/kernel.rst", "docs/build_docs/source/kernel_desc.rst", "docs/build_docs/source/kernel_perf.rst", "docs/build_docs/source/neural_engine.rst", "docs/build_docs/source/user_guide.rst", "docs/code_of_conduct.md", "docs/component_owner.md", "docs/contributions.md", "docs/contributors.md", "docs/devcatalog.md", "docs/distillation.md", "docs/examples.md", "docs/export.md", "docs/get_started.md", "docs/h2o.md", "docs/installation.md", "docs/intel_extension_for_transformers/neural_chat/README.md", "docs/intel_extension_for_transformers/neural_chat/assets/docs/sample.md", "docs/intel_extension_for_transformers/neural_chat/cli/README.md", "docs/intel_extension_for_transformers/neural_chat/docker/README.md", "docs/intel_extension_for_transformers/neural_chat/docker/code_generation/README.md", "docs/intel_extension_for_transformers/neural_chat/docker/finetuning/README.md", "docs/intel_extension_for_transformers/neural_chat/docker/inference/README.md", "docs/intel_extension_for_transformers/neural_chat/docker/text_generation/README.md", "docs/intel_extension_for_transformers/neural_chat/docker/tgi_serving/README.md", "docs/intel_extension_for_transformers/neural_chat/docker/vllm_serving/README.md", "docs/intel_extension_for_transformers/neural_chat/docs/advanced_features.md", "docs/intel_extension_for_transformers/neural_chat/docs/full_notebooks.md", "docs/intel_extension_for_transformers/neural_chat/docs/neuralchat_api.md", "docs/intel_extension_for_transformers/neural_chat/docs/notebooks/workshop/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/assisted_generation/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/chatgpt_rag/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/codegen/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/codegen/backend/gaudi/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/codegen/backend/pc/gguf/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/codegen/backend/pc/gptq/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/codegen/backend/pc/woq/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/codegen/backend/xeon/deepspeed/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/codegen/backend/xeon/ipex/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/codegen/backend/xeon/tpp/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/photo_ai/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/photo_ai/backend/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/photo_ai/frontend/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/plugin/audio/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/plugin/image2image/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/rag/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/talkingbot/server/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/talkingbot/server/backend/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/talkingbot/server/frontend/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/textbot/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/textbot/backend/xeon/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/textbot/backend_with_cache/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/textbot/frontend/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/finetuning/dpo_pipeline/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/finetuning/finetune_neuralchat_v3/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/finetuning/image_to_text/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/finetuning/instruction/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/finetuning/multi_modal/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/finetuning/multi_modal/eval/mmmu_eval/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/finetuning/ppo_pipeline/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/finetuning/shanghainese_asr_tts/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/finetuning/text_generation/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/finetuning/tts/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/helloworld/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/langchain_extension/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/plugins/retrieval/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/plugins/table_extraction/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/plugins/video/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/quick_start/chatbot/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/quick_start/rag/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/serving/TGI/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/serving/triton_inference_sever/cpu/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/serving/triton_inference_sever/cuda/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/serving/triton_inference_sever/hpu/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/serving/vllm/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/sql_generation/README.md", "docs/intel_extension_for_transformers/neural_chat/pipeline/plugins/audio/README.md", "docs/intel_extension_for_transformers/neural_chat/pipeline/plugins/caching/README.md", "docs/intel_extension_for_transformers/neural_chat/pipeline/plugins/ner/README.md", "docs/intel_extension_for_transformers/neural_chat/pipeline/plugins/retrieval/README.md", "docs/intel_extension_for_transformers/neural_chat/pipeline/plugins/security/README.md", "docs/intel_extension_for_transformers/neural_chat/pipeline/plugins/video/face_animation/README.md", "docs/intel_extension_for_transformers/neural_chat/server/README.md", "docs/intel_extension_for_transformers/neural_chat/tools/embedding_finetune/README.md", "docs/intel_extension_for_transformers/neural_chat/tools/rome/examples/README.md", "docs/intel_extension_for_transformers/neural_chat/ui/README.md", "docs/intel_extension_for_transformers/neural_chat/ui/customized/side_by_side/README.md", "docs/intel_extension_for_transformers/neural_chat/ui/customized/talking_photo/README.md", "docs/intel_extension_for_transformers/neural_chat/ui/customized/talkingbot/README.md", "docs/intel_extension_for_transformers/neural_chat/ui/customized/vision_demo/README.md", "docs/intel_extension_for_transformers/neural_chat/ui/gradio/basic/README.md", "docs/intel_extension_for_transformers/neural_chat/ui/gradio/side_by_side/README.md", "docs/intel_extension_for_transformers/tools/llm_carbon_calc_readme.md", "docs/intel_extension_for_transformers/transformers/runtime/docs/Installation.md", "docs/intel_extension_for_transformers/transformers/runtime/docs/add_customized_pattern.md", "docs/intel_extension_for_transformers/transformers/runtime/docs/deploy_and_integration.md", "docs/intel_extension_for_transformers/transformers/runtime/docs/engine_profiling.md", "docs/intel_extension_for_transformers/transformers/runtime/docs/engine_tuning.md", "docs/intel_extension_for_transformers/transformers/runtime/docs/graph_fusion.md", "docs/intel_extension_for_transformers/transformers/runtime/docs/onnx_compile.md", "docs/intel_extension_for_transformers/transformers/runtime/docs/onnx_quantize.md", "docs/intel_extension_for_transformers/transformers/runtime/docs/operator_register.md", "docs/intel_extension_for_transformers/transformers/runtime/docs/pattern_recognize.md", "docs/intel_extension_for_transformers/transformers/runtime/docs/static_compressed_buffer.md", "docs/intel_extension_for_transformers/transformers/runtime/docs/validated_model.md", "docs/intel_extension_for_transformers/transformers/runtime/kernels/README.md", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/3D_inference.md", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/binaryop_injector.md", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/eltwise_injector.md", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/gpu/sparse_gemm_gpu.md", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/kernel_amx.md", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/kernel_avx512f.md", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/kernel_dynamic_quant_matmul.md", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/kernel_layernormalized_spmm.md", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/kernel_transpose_matmul.md", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/kernel_transpose_mha.md", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/kernel_vnni.md", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/profiling.md", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/validated_data.md", "docs/intel_extension_for_transformers/transformers/runtime/kernels/scripts/README.md", "docs/intel_extension_for_transformers/transformers/runtime/test/kernels/benchmark/benchmark.md", "docs/intel_extension_for_transformers/transformers/runtime/test/kernels/benchmark/ci/inputs/README.md", "docs/legal.md", "docs/metrics.md", "docs/objectives.md", "docs/pipeline.md", "docs/pruning.md", "docs/publication.md", "docs/qbits.md", "docs/qloracpu.md", "docs/quantization.md", "docs/release.md", "docs/release_data.md", "docs/reproduce/efficient_LLM_inference_on_cpus.md", "docs/reproduce/neural_chat_v3-3_workflow.md", "docs/smoothquant.md", "docs/streamingllm.md", "docs/tutorials/README.md", "docs/user_guide.md", "docs/weightonlyquant.md", "example.rst", "feature.rst", "index.rst", "kernel.rst", "kernel_desc.rst", "kernel_perf.rst", "neural_engine.rst", "user_guide.rst"], "indexentries": {"accuracy() (in module util.misc)": [[264, "util.misc.accuracy", false]], "add (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Add", false]], "add() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.bincount method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Bincount.add", false]], "add() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.combinedstat method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.CombinedStat.add", false]], "add() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.covariance method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Covariance.add", false]], "add() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.crosscovariance method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.CrossCovariance.add", false]], "add() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.crossiou method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.CrossIoU.add", false]], "add() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.history method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.History.add", false]], "add() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.iou method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.IoU.add", false]], "add() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.mean method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Mean.add", false]], "add() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.normmean method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.NormMean.add", false]], "add() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.quantile method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Quantile.add", false]], "add() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.secondmoment method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.SecondMoment.add", false]], "add() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.stat method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Stat.add", false]], "add() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.topk method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.TopK.add", false]], "add() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.variance method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Variance.add", false]], "add_config_item() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.add_config_item", false]], "add_gene() (intel_extension_for_transformers.transformers.dynamic.evolution.evolution method)": [[30, "intel_extension_for_transformers.transformers.dynamic.evolution.Evolution.add_gene", false]], "addclstoken (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.add_cls_token)": [[130, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.add_cls_token.AddClsToken", false]], "addembeddings (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.add_embeddings)": [[131, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.add_embeddings.AddEmbeddings", false]], "addv2 (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.AddV2", false]], "align_columns() (in module util.postprocess)": [[266, "util.postprocess.align_columns", false]], "align_headers() (in module util.postprocess)": [[266, "util.postprocess.align_headers", false]], "align_img() (in module intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.preprocess)": [[20, "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.preprocess.align_img", false]], "align_rows() (in module util.postprocess)": [[266, "util.postprocess.align_rows", false]], "align_supercells() (in module util.postprocess)": [[266, "util.postprocess.align_supercells", false]], "all (class in intel_extension_for_transformers.transformers.runtime.compile.ops.all)": [[63, "intel_extension_for_transformers.transformers.runtime.compile.ops.all.All", false]], "all_gather() (in module util.misc)": [[264, "util.misc.all_gather", false]], "apierrorcode (class in intel_extension_for_transformers.neural_chat.server.restful.openai_protocol)": [[22, "intel_extension_for_transformers.neural_chat.server.restful.openai_protocol.ApiErrorCode", false]], "append_message() (conversation.conversation method)": [[0, "conversation.Conversation.append_message", false]], "apply_class_thresholds() (in module util.postprocess)": [[266, "util.postprocess.apply_class_thresholds", false]], "apply_rotary_pos_emb() (in module intel_extension_for_transformers.transformers.kv_cache_compression.models.modeling_llama)": [[32, "intel_extension_for_transformers.transformers.kv_cache_compression.models.modeling_llama.apply_rotary_pos_emb", false]], "apply_threshold() (in module util.postprocess)": [[266, "util.postprocess.apply_threshold", false]], "approx_ratio() (in module intel_extension_for_transformers.transformers.dynamic.evolution)": [[30, "intel_extension_for_transformers.transformers.dynamic.evolution.approx_ratio", false]], "arange (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Arange", false]], "arangewithreciprocal (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.arangewithreciprocal)": [[132, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.arangewithreciprocal.ArangewithReciprocal", false]], "assert (class in intel_extension_for_transformers.transformers.runtime.compile.ops.assert)": [[64, "intel_extension_for_transformers.transformers.runtime.compile.ops.assert.Assert", false]], "attentionblock_attentionmaskaddreshape (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionblock_attentionmaskaddreshape)": [[133, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_AttentionMaskAddReshape.AttentionBlock_AttentionMaskAddReshape", false]], "attentionblock_constantofshapewithmul (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionblock_constantofshapewithmul)": [[134, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_ConstantOfShapeWithMul.AttentionBlock_ConstantOfShapeWithMul", false]], "attentionblock_qkvprereshape (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionblock_qkvprereshape)": [[135, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_QKVPreReshape.AttentionBlock_QKVPreReshape", false]], "attentionblock_qkvreshape (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionblock_qkvreshape)": [[136, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_QKVReshape.AttentionBlock_QKVReshape", false]], "attentionblock_weightreshapeto4d (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionblock_weightreshapeto4d)": [[137, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_WeightReshapeTo4D.AttentionBlock_WeightReshapeTo4D", false]], "attentionmasklengthadaptiveexpandindices (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_mask_length_adaptive_keep_indices)": [[138, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_mask_length_adaptive_keep_indices.AttentionMaskLengthAdaptiveExpandIndices", false]], "attentionoutputlayernormlengthadaptiveexpandindices (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_output_layer_norm_length_adaptive_keep_indices)": [[139, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_output_layer_norm_length_adaptive_keep_indices.AttentionOutputLayerNormLengthAdaptiveExpandIndices", false]], "attentionreshape (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_reshape)": [[140, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_reshape.AttentionReshape", false]], "audiolanguageoptions (class in intel_extension_for_transformers.neural_chat.config)": [[5, "intel_extension_for_transformers.neural_chat.config.AudioLanguageOptions", false]], "autocast_init() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.autocast_init", false]], "autoroundconfig (class in intel_extension_for_transformers.transformers.utils.config)": [[247, "intel_extension_for_transformers.transformers.utils.config.AutoRoundConfig", false]], "awqconfig (class in intel_extension_for_transformers.transformers.utils.config)": [[247, "intel_extension_for_transformers.transformers.utils.config.AwqConfig", false]], "backbone (class in models.backbone)": [[255, "models.backbone.Backbone", false]], "backendoptions (class in intel_extension_for_transformers.neural_chat.config)": [[5, "intel_extension_for_transformers.neural_chat.config.BackendOptions", false]], "baddbmm (class in intel_extension_for_transformers.transformers.runtime.compile.ops.baddbmm)": [[65, "intel_extension_for_transformers.transformers.runtime.compile.ops.baddbmm.Baddbmm", false]], "basetrainer (class in intel_extension_for_transformers.transformers.trainer)": [[246, "intel_extension_for_transformers.transformers.trainer.BaseTrainer", false]], "batchmatmul (class in intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul)": [[66, "intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul.BatchMatMul", false]], "batchmatmulv2 (class in intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul_v2)": [[67, "intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul_v2.BatchMatMulV2", false]], "benchmark() (in module intel_extension_for_transformers.transformers.benchmark)": [[27, "intel_extension_for_transformers.transformers.benchmark.benchmark", false]], "benchmark() (intel_extension_for_transformers.transformers.trainer.basetrainer method)": [[246, "intel_extension_for_transformers.transformers.trainer.BaseTrainer.benchmark", false]], "benchmarkconfig (class in intel_extension_for_transformers.transformers.config)": [[28, "intel_extension_for_transformers.transformers.config.BenchmarkConfig", false]], "bertattention (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertAttention", false]], "bertembeddings (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertEmbeddings", false]], "bertencoder (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertEncoder", false]], "bertformaskedlm (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForMaskedLM", false]], "bertformultiplechoice (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForMultipleChoice", false]], "bertfornextsentenceprediction (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForNextSentencePrediction", false]], "bertforpretraining (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForPreTraining", false]], "bertforpretrainingoutput (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForPreTrainingOutput", false]], "bertforquestionanswering (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForQuestionAnswering", false]], "bertforsequenceclassification (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForSequenceClassification", false]], "bertfortokenclassification (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForTokenClassification", false]], "bertintermediate (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertIntermediate", false]], "bertlayer (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertLayer", false]], "bertlmheadmodel (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertLMHeadModel", false]], "bertlmpredictionhead (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertLMPredictionHead", false]], "bertmodel (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertModel", false]], "bertonlymlmhead (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertOnlyMLMHead", false]], "bertonlynsphead (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertOnlyNSPHead", false]], "bertoutput (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertOutput", false]], "bertpooler (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertPooler", false]], "bertpredictionheadtransform (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertPredictionHeadTransform", false]], "bertpretrainedmodel (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertPreTrainedModel", false]], "bertpretrainingheads (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertPreTrainingHeads", false]], "bertselfattention (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertSelfAttention", false]], "bertselfoutput (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertSelfOutput", false]], "bias_to_int32() (in module intel_extension_for_transformers.transformers.runtime.compile.onnx_utils)": [[62, "intel_extension_for_transformers.transformers.runtime.compile.onnx_utils.bias_to_int32", false]], "biasadd (class in intel_extension_for_transformers.transformers.runtime.compile.ops.bias_add)": [[68, "intel_extension_for_transformers.transformers.runtime.compile.ops.bias_add.BiasAdd", false]], "binaryadd (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.BinaryAdd", false]], "bincount (class in intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Bincount", false]], "box_numpy_null() (in module intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.box_numpy_null", false]], "build_chatbot() (in module intel_extension_for_transformers.neural_chat.chatbot)": [[4, "intel_extension_for_transformers.neural_chat.chatbot.build_chatbot", false]], "builtin_eval_func() (intel_extension_for_transformers.transformers.trainer.basetrainer method)": [[246, "intel_extension_for_transformers.transformers.trainer.BaseTrainer.builtin_eval_func", false]], "builtin_eval_func() (intel_extension_for_transformers.transformers.trainer.nlpseq2seqtrainer method)": [[246, "intel_extension_for_transformers.transformers.trainer.NLPSeq2SeqTrainer.builtin_eval_func", false]], "builtin_train_func() (intel_extension_for_transformers.transformers.trainer.basetrainer method)": [[246, "intel_extension_for_transformers.transformers.trainer.BaseTrainer.builtin_train_func", false]], "cache_load_enabled (class in intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.cache_load_enabled", false]], "calculate_ins_level_acc() (in module utils.eval_utils)": [[268, "utils.eval_utils.calculate_ins_level_acc", false]], "cast (class in intel_extension_for_transformers.transformers.runtime.compile.ops.cast)": [[69, "intel_extension_for_transformers.transformers.runtime.compile.ops.cast.Cast", false]], "castto (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.cast_to)": [[141, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.cast_to.CastTo", false]], "change_node_input_tensors() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.change_node_input_tensors", false]], "change_node_output_tensors() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.change_node_output_tensors", false]], "change_num_name() (in module intel_extension_for_transformers.transformers.runtime.compile.onnx_utils)": [[62, "intel_extension_for_transformers.transformers.runtime.compile.onnx_utils.change_num_name", false]], "check_is_number() (in module utils.eval_utils)": [[268, "utils.eval_utils.check_is_number", false]], "check_value() (in module intel_extension_for_transformers.transformers.config)": [[28, "intel_extension_for_transformers.transformers.config.check_value", false]], "childparentretriever (class in intel_extension_for_transformers.langchain.langchain_community.retrievers.child_parent_retriever)": [[2, "intel_extension_for_transformers.langchain.langchain_community.retrievers.child_parent_retriever.ChildParentRetriever", false]], "class_subset() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.fixedrandomsubsetsampler method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.FixedRandomSubsetSampler.class_subset", false]], "collectquantinfo (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.collect_quant_info)": [[142, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.collect_quant_info.CollectQuantInfo", false]], "combinedstat (class in intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.CombinedStat", false]], "compile() (in module intel_extension_for_transformers.transformers.runtime.compile.compile)": [[49, "intel_extension_for_transformers.transformers.runtime.compile.compile.compile", false]], "compute_loss() (intel_extension_for_transformers.transformers.trainer.basetrainer method)": [[246, "intel_extension_for_transformers.transformers.trainer.BaseTrainer.compute_loss", false]], "concat (class in intel_extension_for_transformers.transformers.runtime.compile.ops.concat)": [[70, "intel_extension_for_transformers.transformers.runtime.compile.ops.concat.Concat", false]], "config_file_path (intel_extension_for_transformers.transformers.pruner.pruning.pruning attribute)": [[47, "intel_extension_for_transformers.transformers.pruner.pruning.Pruning.config_file_path", false]], "configure_logging() (in module intel_extension_for_transformers.neural_chat.config_logging)": [[6, "intel_extension_for_transformers.neural_chat.config_logging.configure_logging", false]], "constant (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Constant", false]], "constantofshape (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.ConstantOfShape", false]], "construct() (intel_extension_for_transformers.transformers.runtime.compile.ops.op.operator method)": [[95, "intel_extension_for_transformers.transformers.runtime.compile.ops.op.Operator.construct", false]], "construct_node() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.construct_node", false]], "conv (class in intel_extension_for_transformers.transformers.runtime.compile.ops.conv)": [[71, "intel_extension_for_transformers.transformers.runtime.compile.ops.conv.Conv", false]], "conversation": [[0, "module-conversation", false]], "conversation (class in conversation)": [[0, "conversation.Conversation", false]], "convert_fullwidth_to_halfwidth() (in module intel_extension_for_transformers.neural_chat.pipeline.plugins.security.safety_checker)": [[15, "intel_extension_for_transformers.neural_chat.pipeline.plugins.security.safety_checker.convert_fullwidth_to_halfwidth", false]], "convert_image_to_base64() (conversation.conversation method)": [[0, "conversation.Conversation.convert_image_to_base64", false]], "convex_hull() (intel_extension_for_transformers.transformers.dynamic.evolution.evolution method)": [[30, "intel_extension_for_transformers.transformers.dynamic.evolution.Evolution.convex_hull", false]], "convolution (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Convolution", false]], "convreshape (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.conv_reshape)": [[143, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.conv_reshape.ConvReshape", false]], "cos (class in intel_extension_for_transformers.transformers.runtime.compile.ops.cos)": [[72, "intel_extension_for_transformers.transformers.runtime.compile.ops.cos.Cos", false]], "covariance (class in intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Covariance", false]], "cpu_() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.stat method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Stat.cpu_", false]], "cpu_instance (c macro)": [[278, "c.CPU_INSTANCE", false]], "create_position_ids_from_input_ids() (in module intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.create_position_ids_from_input_ids", false]], "create_position_ids_from_inputs_embeds() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertaembeddings method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaEmbeddings.create_position_ids_from_inputs_embeds", false]], "create_tf_node() (in module intel_extension_for_transformers.transformers.runtime.compile.tf_utils)": [[243, "intel_extension_for_transformers.transformers.runtime.compile.tf_utils.create_tf_node", false]], "crosscovariance (class in intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.CrossCovariance", false]], "crossiou (class in intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.CrossIoU", false]], "crossover() (intel_extension_for_transformers.transformers.dynamic.evolution.evolution method)": [[30, "intel_extension_for_transformers.transformers.dynamic.evolution.Evolution.crossover", false]], "cuda_() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.stat method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Stat.cuda_", false]], "cumsum (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.CumSum", false]], "dataarguments (class in intel_extension_for_transformers.neural_chat.config)": [[5, "intel_extension_for_transformers.neural_chat.config.DataArguments", false]], "debug() (in module intel_extension_for_transformers.transformers.runtime.compile.logger)": [[61, "intel_extension_for_transformers.transformers.runtime.compile.logger.debug", false]], "decoderattnreshape (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.decoder_attn_reshape)": [[144, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.decoder_attn_reshape.DecoderAttnReshape", false]], "del_environ_var() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.del_environ_var", false]], "del_environ_vars() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.del_environ_vars", false]], "dequantize (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Dequantize", false]], "dequantizelinear (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.DequantizeLinear", false]], "dereference() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.fixedsubsetsampler method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.FixedSubsetSampler.dereference", false]], "detr (class in models.detr)": [[256, "models.detr.DETR", false]], "detrmulti (class in models.detr_multi)": [[257, "models.detr_multi.DETRMulti", false]], "deviceoptions (class in intel_extension_for_transformers.neural_chat.config)": [[5, "intel_extension_for_transformers.neural_chat.config.DeviceOptions", false]], "dice_loss() (in module models.segmentation)": [[260, "models.segmentation.dice_loss", false]], "distill() (intel_extension_for_transformers.transformers.trainer.basetrainer method)": [[246, "intel_extension_for_transformers.transformers.trainer.BaseTrainer.distill", false]], "distributed_init() (in module intel_extension_for_transformers.transformers.utils.utility)": [[252, "intel_extension_for_transformers.transformers.utils.utility.distributed_init", false]], "draw_landmarks() (in module intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.util)": [[21, "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.util.draw_landmarks", false]], "dump_tensor() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.dump_tensor", false]], "dynamiclengthconfig (class in intel_extension_for_transformers.transformers.config)": [[28, "intel_extension_for_transformers.transformers.config.DynamicLengthConfig", false]], "dynamicquantconfig (class in intel_extension_for_transformers.transformers.utils.config)": [[247, "intel_extension_for_transformers.transformers.utils.config.DynamicQuantConfig", false]], "einsum (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Einsum", false]], "einsumwitharange (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.einsumwitharange)": [[145, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.einsumwitharange.EinsumwithArange", false]], "embeddingbag (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.EmbeddingBag", false]], "embeddingbag (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.embeddingbag)": [[146, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.embeddingbag.EmbeddingBag", false]], "embeddingsto2dbeforeinnerproduct (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.embeddings_to_2d_before_inner_product)": [[147, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.embeddings_to_2d_before_inner_product.EmbeddingsTo2DBeforeInnerProduct", false]], "enable_sequential_cpu_offload() (intel_extension_for_transformers.neural_chat.pipeline.plugins.image2image.instructpix2pix_pipeline.stablediffusioninstructpix2pixpipeline method)": [[9, "intel_extension_for_transformers.neural_chat.pipeline.plugins.image2image.instructpix2pix_pipeline.StableDiffusionInstructPix2PixPipeline.enable_sequential_cpu_offload", false]], "engine_init() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.engine_init", false]], "environ_info_init() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.environ_info_init", false]], "erf (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Erf", false]], "error() (in module intel_extension_for_transformers.transformers.runtime.compile.logger)": [[61, "intel_extension_for_transformers.transformers.runtime.compile.logger.error", false]], "eval_multi_choice() (in module utils.eval_utils)": [[268, "utils.eval_utils.eval_multi_choice", false]], "eval_open() (in module utils.eval_utils)": [[268, "utils.eval_utils.eval_open", false]], "evaluate() (in module utils.eval_utils)": [[268, "utils.eval_utils.evaluate", false]], "evolution (class in intel_extension_for_transformers.transformers.dynamic.evolution)": [[30, "intel_extension_for_transformers.transformers.dynamic.evolution.Evolution", false]], "expand (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Expand", false]], "expand_gather() (in module intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.expand_gather", false]], "expand_gather() (in module intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.expand_gather", false]], "expanddims (class in intel_extension_for_transformers.transformers.runtime.compile.ops.expand_dims)": [[74, "intel_extension_for_transformers.transformers.runtime.compile.ops.expand_dims.ExpandDims", false]], "expandindices (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.ExpandIndices", false]], "explicitnhwctransposeforconv (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stablediffusion_explicitnhwctranspose)": [[206, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_ExplicitNHWCTranspose.ExplicitNHWCTransposeForConv", false]], "explicitnhwctransposeforconvqat (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stablediffusion_explicitnhwctransposeqat)": [[207, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_ExplicitNHWCTransposeQAT.ExplicitNHWCTransposeForConvQAT", false]], "export_to_bf16_onnx() (intel_extension_for_transformers.transformers.trainer.basetrainer method)": [[246, "intel_extension_for_transformers.transformers.trainer.BaseTrainer.export_to_bf16_onnx", false]], "export_to_fp32_onnx() (intel_extension_for_transformers.transformers.trainer.basetrainer method)": [[246, "intel_extension_for_transformers.transformers.trainer.BaseTrainer.export_to_fp32_onnx", false]], "export_to_int8_onnx() (intel_extension_for_transformers.transformers.trainer.basetrainer method)": [[246, "intel_extension_for_transformers.transformers.trainer.BaseTrainer.export_to_int8_onnx", false]], "export_to_jit() (intel_extension_for_transformers.transformers.trainer.basetrainer method)": [[246, "intel_extension_for_transformers.transformers.trainer.BaseTrainer.export_to_jit", false]], "export_to_onnx() (intel_extension_for_transformers.transformers.trainer.basetrainer method)": [[246, "intel_extension_for_transformers.transformers.trainer.BaseTrainer.export_to_onnx", false]], "extract() (intel_extension_for_transformers.transformers.runtime.compile.ops.onnx_input.onnxinput method)": [[94, "intel_extension_for_transformers.transformers.runtime.compile.ops.onnx_input.ONNXINPUT.extract", false]], "extract() (intel_extension_for_transformers.transformers.runtime.compile.ops.op.operator method)": [[95, "intel_extension_for_transformers.transformers.runtime.compile.ops.op.Operator.extract", false]], "extract_numbers() (in module utils.eval_utils)": [[268, "utils.eval_utils.extract_numbers", false]], "extract_text_from_spans() (in module util.postprocess)": [[266, "util.postprocess.extract_text_from_spans", false]], "extract_text_inside_bbox() (in module util.postprocess)": [[266, "util.postprocess.extract_text_inside_bbox", false]], "extractor (class in intel_extension_for_transformers.transformers.runtime.compile.extractors.extractor)": [[50, "intel_extension_for_transformers.transformers.runtime.compile.extractors.extractor.Extractor", false]], "fatal() (in module intel_extension_for_transformers.transformers.runtime.compile.logger)": [[61, "intel_extension_for_transformers.transformers.runtime.compile.logger.fatal", false]], "feed_forward_chunk() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertlayer method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertLayer.feed_forward_chunk", false]], "feed_forward_chunk() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertalayer method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaLayer.feed_forward_chunk", false]], "fill (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Fill", false]], "finetune_model() (in module intel_extension_for_transformers.neural_chat.chatbot)": [[4, "intel_extension_for_transformers.neural_chat.chatbot.finetune_model", false]], "finetuningarguments (class in intel_extension_for_transformers.neural_chat.config)": [[5, "intel_extension_for_transformers.neural_chat.config.FinetuningArguments", false]], "fixedrandomsubsetsampler (class in intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.FixedRandomSubsetSampler", false]], "fixedsubsetsampler (class in intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.FixedSubsetSampler", false]], "flatmapdataset (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.FlatMapDataset", false]], "flatten (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Flatten", false]], "floor_divide (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Floor_divide", false]], "forward() (intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode.gptbigcodeforcausallm method)": [[33, "intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode.GPTBigCodeForCausalLM.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode.gptbigcodeforsequenceclassification method)": [[33, "intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode.GPTBigCodeForSequenceClassification.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode.gptbigcodefortokenclassification method)": [[33, "intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode.GPTBigCodeForTokenClassification.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertattention method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertAttention.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertembeddings method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertEmbeddings.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertencoder method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertEncoder.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertformaskedlm method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForMaskedLM.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertformultiplechoice method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForMultipleChoice.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertfornextsentenceprediction method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForNextSentencePrediction.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertforpretraining method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForPreTraining.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertforquestionanswering method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForQuestionAnswering.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertforsequenceclassification method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForSequenceClassification.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertfortokenclassification method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForTokenClassification.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertintermediate method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertIntermediate.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertlayer method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertLayer.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertlmheadmodel method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertLMHeadModel.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertlmpredictionhead method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertLMPredictionHead.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertmodel method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertModel.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertonlymlmhead method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertOnlyMLMHead.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertonlynsphead method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertOnlyNSPHead.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertoutput method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertOutput.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertpooler method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertPooler.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertpredictionheadtransform method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertPredictionHeadTransform.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertpretrainingheads method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertPreTrainingHeads.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertselfattention method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertSelfAttention.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertselfoutput method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertSelfOutput.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.bart.modeling_bart.gaudi_bartlearnedpositionalembedding method)": [[37, "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.bart.modeling_bart.gaudi_BartLearnedPositionalEmbedding.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertaattention method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaAttention.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertaclassificationhead method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaClassificationHead.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertaembeddings method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaEmbeddings.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertaencoder method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaEncoder.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertaforcausallm method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForCausalLM.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertaformaskedlm method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForMaskedLM.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertaformultiplechoice method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForMultipleChoice.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertaforquestionanswering method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForQuestionAnswering.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertaforsequenceclassification method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForSequenceClassification.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertafortokenclassification method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForTokenClassification.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertaintermediate method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaIntermediate.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertalayer method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaLayer.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertalmhead method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaLMHead.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertamodel method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaModel.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertaoutput method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaOutput.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertapooler method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaPooler.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertaselfattention method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaSelfAttention.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertaselfoutput method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaSelfOutput.forward", false]], "forward() (models.detr.detr method)": [[256, "models.detr.DETR.forward", false]], "forward() (models.detr.postprocess method)": [[256, "models.detr.PostProcess.forward", false]], "forward() (models.detr.setcriterion method)": [[256, "models.detr.SetCriterion.forward", false]], "forward() (models.detr_multi.detrmulti method)": [[257, "models.detr_multi.DETRMulti.forward", false]], "forward() (models.detr_multi.postprocess method)": [[257, "models.detr_multi.PostProcess.forward", false]], "forward() (models.detr_multi.setcriterion method)": [[257, "models.detr_multi.SetCriterion.forward", false]], "forward() (models.matcher.hungarianmatcher method)": [[258, "models.matcher.HungarianMatcher.forward", false]], "forward() (models.segmentation.postprocesspanoptic method)": [[260, "models.segmentation.PostProcessPanoptic.forward", false]], "from_pretrained() (intel_extension_for_transformers.transformers.modeling.model.optimizedmodel class method)": [[35, "intel_extension_for_transformers.transformers.modeling.model.OptimizedModel.from_pretrained", false]], "frozenbatchnorm2d (class in models.backbone)": [[255, "models.backbone.FrozenBatchNorm2d", false]], "fusedbatchnormv3 (class in intel_extension_for_transformers.transformers.runtime.compile.ops.fused_batch_norm_v3)": [[76, "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_batch_norm_v3.FusedBatchNormV3", false]], "fusedgemm (class in intel_extension_for_transformers.transformers.runtime.compile.ops.fused_gemm)": [[77, "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_gemm.FusedGemm", false]], "fusedmatmul (class in intel_extension_for_transformers.transformers.runtime.compile.ops.fused_matmul)": [[78, "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_matmul.FusedMatMul", false]], "gather (class in intel_extension_for_transformers.transformers.runtime.compile.ops.gather)": [[79, "intel_extension_for_transformers.transformers.runtime.compile.ops.gather.Gather", false]], "gatherelements (class in intel_extension_for_transformers.transformers.runtime.compile.ops.gather_elements)": [[80, "intel_extension_for_transformers.transformers.runtime.compile.ops.gather_elements.GatherElements", false]], "gatherv2 (class in intel_extension_for_transformers.transformers.runtime.compile.ops.gather)": [[79, "intel_extension_for_transformers.transformers.runtime.compile.ops.gather.GatherV2", false]], "gaudi_bartattention_forward() (in module intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.bart.modeling_bart)": [[37, "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.bart.modeling_bart.gaudi_BartAttention_forward", false]], "gaudi_bartlearnedpositionalembedding (class in intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.bart.modeling_bart)": [[37, "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.bart.modeling_bart.gaudi_BartLearnedPositionalEmbedding", false]], "gaudi_mistral_repeat_kv() (in module intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mistral.modeling_mistral)": [[39, "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mistral.modeling_mistral.gaudi_mistral_repeat_kv", false]], "gaudi_mistral_rmsnorm_forward() (in module intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mistral.modeling_mistral)": [[39, "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mistral.modeling_mistral.gaudi_mistral_rmsnorm_forward", false]], "gaudi_mixtral_attention_forward() (in module intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral.modeling_mixtral)": [[40, "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral.modeling_mixtral.gaudi_mixtral_attention_forward", false]], "gaudi_mixtral_block_sparse_moe_forward() (in module intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral.modeling_mixtral)": [[40, "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral.modeling_mixtral.gaudi_mixtral_block_sparse_moe_forward", false]], "gaudi_mixtral_decoder_layer_forward() (in module intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral.modeling_mixtral)": [[40, "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral.modeling_mixtral.gaudi_mixtral_decoder_layer_forward", false]], "gaudi_mixtral_model_forward() (in module intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral.modeling_mixtral)": [[40, "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral.modeling_mixtral.gaudi_mixtral_model_forward", false]], "gaudi_mixtral_repeat_kv() (in module intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral.modeling_mixtral)": [[40, "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral.modeling_mixtral.gaudi_mixtral_repeat_kv", false]], "gaudi_mixtral_rmsnorm_forward() (in module intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral.modeling_mixtral)": [[40, "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral.modeling_mixtral.gaudi_mixtral_rmsnorm_forward", false]], "gaudi_phi_attention_forward() (in module intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.phi.modeling_phi)": [[41, "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.phi.modeling_phi.gaudi_phi_attention_forward", false]], "gaudi_phi_decoder_layer_forward() (in module intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.phi.modeling_phi)": [[41, "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.phi.modeling_phi.gaudi_phi_decoder_layer_forward", false]], "gaudi_phi_model_forward() (in module intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.phi.modeling_phi)": [[41, "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.phi.modeling_phi.gaudi_phi_model_forward", false]], "gaudi_spawn": [[1, "module-gaudi_spawn", false]], "gaudi_swin_get_attn_mask() (in module intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.swin.modeling_swin)": [[42, "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.swin.modeling_swin.gaudi_swin_get_attn_mask", false]], "gaudimixtralforcausallm (class in intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral.modeling_mixtral)": [[40, "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral.modeling_mixtral.GaudiMixtralForCausalLM", false]], "gelu (class in intel_extension_for_transformers.transformers.runtime.compile.ops.gelu)": [[81, "intel_extension_for_transformers.transformers.runtime.compile.ops.gelu.Gelu", false]], "gelu (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.gelu)": [[148, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.gelu.Gelu", false]], "gemm (class in intel_extension_for_transformers.transformers.runtime.compile.ops.gemm)": [[82, "intel_extension_for_transformers.transformers.runtime.compile.ops.gemm.Gemm", false]], "generalized_box_iou() (in module util.box_ops)": [[263, "util.box_ops.generalized_box_iou", false]], "generate() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.generate", false]], "generatesequence (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.generate_sequence)": [[149, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.generate_sequence.GenerateSequence", false]], "get_autocast_info() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.get_autocast_info", false]], "get_bbox_span_subset() (in module util.postprocess)": [[266, "util.postprocess.get_bbox_span_subset", false]], "get_children() (in module intel_extension_for_transformers.transformers.runtime.compile.onnx_utils)": [[62, "intel_extension_for_transformers.transformers.runtime.compile.onnx_utils.get_children", false]], "get_conv_template() (in module conversation)": [[0, "conversation.get_conv_template", false]], "get_data_dtype() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.get_data_dtype", false]], "get_environ_info() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.get_environ_info", false]], "get_example_inputs() (in module intel_extension_for_transformers.transformers.benchmark)": [[27, "intel_extension_for_transformers.transformers.benchmark.get_example_inputs", false]], "get_export_args() (intel_extension_for_transformers.transformers.trainer.basetrainer method)": [[246, "intel_extension_for_transformers.transformers.trainer.BaseTrainer.get_export_args", false]], "get_initializer_children_names() (in module intel_extension_for_transformers.transformers.runtime.compile.onnx_utils)": [[62, "intel_extension_for_transformers.transformers.runtime.compile.onnx_utils.get_initializer_children_names", false]], "get_input_embeddings() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertmodel method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertModel.get_input_embeddings", false]], "get_input_embeddings() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertamodel method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaModel.get_input_embeddings", false]], "get_logger() (intel_extension_for_transformers.transformers.runtime.compile.logger.logger method)": [[61, "intel_extension_for_transformers.transformers.runtime.compile.logger.Logger.get_logger", false]], "get_model_fwk_name() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.get_model_fwk_name", false]], "get_module() (in module intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook)": [[24, "intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook.get_module", false]], "get_multi_choice_info() (in module utils.data_utils)": [[267, "utils.data_utils.get_multi_choice_info", false]], "get_next_node_names() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.get_next_node_names", false]], "get_node_by_name() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.get_node_by_name", false]], "get_node_children_names() (in module intel_extension_for_transformers.transformers.runtime.compile.onnx_utils)": [[62, "intel_extension_for_transformers.transformers.runtime.compile.onnx_utils.get_node_children_names", false]], "get_node_id() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.get_node_id", false]], "get_output_embeddings() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertformaskedlm method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForMaskedLM.get_output_embeddings", false]], "get_output_embeddings() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertforpretraining method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForPreTraining.get_output_embeddings", false]], "get_output_embeddings() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertlmheadmodel method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertLMHeadModel.get_output_embeddings", false]], "get_output_embeddings() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertaforcausallm method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForCausalLM.get_output_embeddings", false]], "get_output_embeddings() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertaformaskedlm method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForMaskedLM.get_output_embeddings", false]], "get_parameter() (in module intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook)": [[24, "intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook.get_parameter", false]], "get_pre_node_names() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.get_pre_node_names", false]], "get_prompt() (conversation.conversation method)": [[0, "conversation.Conversation.get_prompt", false]], "get_quant_info() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.get_quant_info", false]], "get_reprs_at_idxs() (in module intel_extension_for_transformers.neural_chat.tools.rome.repr_tools)": [[23, "intel_extension_for_transformers.neural_chat.tools.rome.repr_tools.get_reprs_at_idxs", false]], "get_reprs_at_word_tokens() (in module intel_extension_for_transformers.neural_chat.tools.rome.repr_tools)": [[23, "intel_extension_for_transformers.neural_chat.tools.rome.repr_tools.get_reprs_at_word_tokens", false]], "get_sparse_nodes_name() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.get_sparse_nodes_name", false]], "get_sparsity_ratio() (intel_extension_for_transformers.transformers.pruner.pruning.pruning method)": [[47, "intel_extension_for_transformers.transformers.pruner.pruning.Pruning.get_sparsity_ratio", false]], "get_store() (intel_extension_for_transformers.transformers.dynamic.evolution.evolution method)": [[30, "intel_extension_for_transformers.transformers.dynamic.evolution.Evolution.get_store", false]], "get_tensor_dest_op() (in module intel_extension_for_transformers.transformers.runtime.compile.tf_utils)": [[243, "intel_extension_for_transformers.transformers.runtime.compile.tf_utils.get_tensor_dest_op", false]], "get_tensor_idx() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.get_tensor_idx", false]], "get_words_idxs_in_templates() (in module intel_extension_for_transformers.neural_chat.tools.rome.repr_tools)": [[23, "intel_extension_for_transformers.neural_chat.tools.rome.repr_tools.get_words_idxs_in_templates", false]], "gptbigcodeforcausallm (class in intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode)": [[33, "intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode.GPTBigCodeForCausalLM", false]], "gptbigcodeforsequenceclassification (class in intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode)": [[33, "intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode.GPTBigCodeForSequenceClassification", false]], "gptbigcodefortokenclassification (class in intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode)": [[33, "intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode.GPTBigCodeForTokenClassification", false]], "gptbigcodemodel (class in intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode)": [[33, "intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode.GPTBigCodeModel", false]], "gptbigcodepretrainedmodel (class in intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode)": [[33, "intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode.GPTBigCodePreTrainedModel", false]], "gptqconfig (class in intel_extension_for_transformers.transformers.utils.config)": [[247, "intel_extension_for_transformers.transformers.utils.config.GPTQConfig", false]], "graph (class in intel_extension_for_transformers.transformers.runtime.compile.graph.graph)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph", false]], "graph_dispatch() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.graph_dispatch", false]], "graph_init() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.graph_init", false]], "graph_node_names_details() (in module intel_extension_for_transformers.transformers.runtime.compile.onnx_utils)": [[62, "intel_extension_for_transformers.transformers.runtime.compile.onnx_utils.graph_node_names_details", false]], "graph_node_names_details() (in module intel_extension_for_transformers.transformers.runtime.compile.tf_utils)": [[243, "intel_extension_for_transformers.transformers.runtime.compile.tf_utils.graph_node_names_details", false]], "header_supercell_tree() (in module util.postprocess)": [[266, "util.postprocess.header_supercell_tree", false]], "hierarchical_subsequence() (in module intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook)": [[24, "intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook.hierarchical_subsequence", false]], "history (class in intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.History", false]], "hungarianmatcher (class in models.matcher)": [[258, "models.matcher.HungarianMatcher", false]], "identity (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Identity", false]], "infer_framework_load_model() (in module intel_extension_for_transformers.transformers.pipeline)": [[45, "intel_extension_for_transformers.transformers.pipeline.infer_framework_load_model", false]], "infer_task() (intel_extension_for_transformers.transformers.trainer.basetrainer method)": [[246, "intel_extension_for_transformers.transformers.trainer.BaseTrainer.infer_task", false]], "inference() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.inference", false]], "info() (in module intel_extension_for_transformers.transformers.runtime.compile.logger)": [[61, "intel_extension_for_transformers.transformers.runtime.compile.logger.info", false]], "innerproduct (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.InnerProduct", false]], "innerproductreshapefusion (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductreshapefusion)": [[129, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.InnerproductReshapeFusion.InnerproductReshapeFusion", false]], "innerproductwithbiasgelu (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithbiasgelu)": [[151, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithbiasgelu.InnerproductWithBiasGelu", false]], "innerproductwithslice (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithslice)": [[152, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithslice.InnerproductwithSlice", false]], "innerproductwithswish (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithswish)": [[153, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithswish.InnerproductWithSwish", false]], "input (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Input", false]], "inputdata (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.input_data)": [[154, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.input_data.InputData", false]], "inputfile (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.input_file)": [[155, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.input_file.InputFile", false]], "inquire_config_item() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.inquire_config_item", false]], "insert_environ_info() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.insert_environ_info", false]], "insert_nodes() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.insert_nodes", false]], "insert_pattern() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.insert_pattern", false]], "insert_quant_info() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.insert_quant_info", false]], "insertbf16node (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.insert_bf16_node)": [[156, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.insert_bf16_node.InsertBF16Node", false]], "insertquantnode (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.insert_quant_node)": [[157, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.insert_quant_node.InsertQuantNode", false]], "int8bf16mixedprecisionchecker (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.int8_bf16_mixed_precision_checker)": [[158, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.int8_bf16_mixed_precision_checker.Int8BF16MixedPrecisionChecker", false]], "intel_extension_for_transformers.langchain.langchain_community.retrievers.child_parent_retriever": [[2, "module-intel_extension_for_transformers.langchain.langchain_community.retrievers.child_parent_retriever", false]], "intel_extension_for_transformers.langchain.langchain_community.vectorstores.chroma": [[3, "module-intel_extension_for_transformers.langchain.langchain_community.vectorstores.chroma", false]], "intel_extension_for_transformers.neural_chat.chatbot": [[4, "module-intel_extension_for_transformers.neural_chat.chatbot", false]], "intel_extension_for_transformers.neural_chat.config": [[5, "module-intel_extension_for_transformers.neural_chat.config", false]], "intel_extension_for_transformers.neural_chat.config_logging": [[6, "module-intel_extension_for_transformers.neural_chat.config_logging", false]], "intel_extension_for_transformers.neural_chat.errorcode": [[7, "module-intel_extension_for_transformers.neural_chat.errorcode", false]], "intel_extension_for_transformers.neural_chat.pipeline": [[8, "module-intel_extension_for_transformers.neural_chat.pipeline", false]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.image2image.instructpix2pix_pipeline": [[9, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.image2image.instructpix2pix_pipeline", false]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.memory.memory": [[10, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.memory.memory", false]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.detector.intent_detection": [[11, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.detector.intent_detection", false]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.detector.query_explainer": [[12, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.detector.query_explainer", false]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.parser.parser": [[13, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.parser.parser", false]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.retriever_adapter": [[14, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.retriever_adapter", false]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.security.safety_checker": [[15, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.security.safety_checker", false]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.bfm": [[16, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.bfm", false]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks": [[17, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks", false]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util": [[18, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util", false]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.load_mats": [[19, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.load_mats", false]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.preprocess": [[20, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.preprocess", false]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.util": [[21, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.util", false]], "intel_extension_for_transformers.neural_chat.server.restful.openai_protocol": [[22, "module-intel_extension_for_transformers.neural_chat.server.restful.openai_protocol", false]], "intel_extension_for_transformers.neural_chat.tools.rome.repr_tools": [[23, "module-intel_extension_for_transformers.neural_chat.tools.rome.repr_tools", false]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook": [[24, "module-intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook", false]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats": [[25, "module-intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats", false]], "intel_extension_for_transformers.tools.utils": [[26, "module-intel_extension_for_transformers.tools.utils", false]], "intel_extension_for_transformers.transformers.benchmark": [[27, "module-intel_extension_for_transformers.transformers.benchmark", false]], "intel_extension_for_transformers.transformers.config": [[28, "module-intel_extension_for_transformers.transformers.config", false]], "intel_extension_for_transformers.transformers.dynamic": [[31, "module-intel_extension_for_transformers.transformers.dynamic", false]], "intel_extension_for_transformers.transformers.dynamic.drop_and_restore_utils": [[29, "module-intel_extension_for_transformers.transformers.dynamic.drop_and_restore_utils", false]], "intel_extension_for_transformers.transformers.dynamic.evolution": [[30, "module-intel_extension_for_transformers.transformers.dynamic.evolution", false]], "intel_extension_for_transformers.transformers.kv_cache_compression.models.modeling_llama": [[32, "module-intel_extension_for_transformers.transformers.kv_cache_compression.models.modeling_llama", false]], "intel_extension_for_transformers.transformers.modeling": [[34, "module-intel_extension_for_transformers.transformers.modeling", false]], "intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode": [[33, "module-intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode", false]], "intel_extension_for_transformers.transformers.modeling.model": [[35, "module-intel_extension_for_transformers.transformers.modeling.model", false]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic": [[36, "module-intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic", false]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.bart.modeling_bart": [[37, "module-intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.bart.modeling_bart", false]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.llama.pos_shift_llama": [[38, "module-intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.llama.pos_shift_llama", false]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mistral.modeling_mistral": [[39, "module-intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mistral.modeling_mistral", false]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral.modeling_mixtral": [[40, "module-intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral.modeling_mixtral", false]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.phi.modeling_phi": [[41, "module-intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.phi.modeling_phi", false]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.swin.modeling_swin": [[42, "module-intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.swin.modeling_swin", false]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.streaming_llm": [[43, "module-intel_extension_for_transformers.transformers.modeling.modeling_gaudi.streaming_llm", false]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic": [[44, "module-intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic", false]], "intel_extension_for_transformers.transformers.pipeline": [[45, "module-intel_extension_for_transformers.transformers.pipeline", false]], "intel_extension_for_transformers.transformers.pruner": [[46, "module-intel_extension_for_transformers.transformers.pruner", false]], "intel_extension_for_transformers.transformers.pruner.pruning": [[47, "module-intel_extension_for_transformers.transformers.pruner.pruning", false]], "intel_extension_for_transformers.transformers.quantization": [[48, "module-intel_extension_for_transformers.transformers.quantization", false]], "intel_extension_for_transformers.transformers.runtime": [[245, "module-intel_extension_for_transformers.transformers.runtime", false]], "intel_extension_for_transformers.transformers.runtime.compile": [[58, "module-intel_extension_for_transformers.transformers.runtime.compile", false]], "intel_extension_for_transformers.transformers.runtime.compile.compile": [[49, "module-intel_extension_for_transformers.transformers.runtime.compile.compile", false]], "intel_extension_for_transformers.transformers.runtime.compile.extractors": [[51, "module-intel_extension_for_transformers.transformers.runtime.compile.extractors", false]], "intel_extension_for_transformers.transformers.runtime.compile.extractors.extractor": [[50, "module-intel_extension_for_transformers.transformers.runtime.compile.extractors.extractor", false]], "intel_extension_for_transformers.transformers.runtime.compile.extractors.onnx_extractor": [[52, "module-intel_extension_for_transformers.transformers.runtime.compile.extractors.onnx_extractor", false]], "intel_extension_for_transformers.transformers.runtime.compile.extractors.tf_extractor": [[53, "module-intel_extension_for_transformers.transformers.runtime.compile.extractors.tf_extractor", false]], "intel_extension_for_transformers.transformers.runtime.compile.extractors.torch_extractor": [[54, "module-intel_extension_for_transformers.transformers.runtime.compile.extractors.torch_extractor", false]], "intel_extension_for_transformers.transformers.runtime.compile.graph": [[56, "module-intel_extension_for_transformers.transformers.runtime.compile.graph", false]], "intel_extension_for_transformers.transformers.runtime.compile.graph.graph": [[55, "module-intel_extension_for_transformers.transformers.runtime.compile.graph.graph", false]], "intel_extension_for_transformers.transformers.runtime.compile.graph_utils": [[57, "module-intel_extension_for_transformers.transformers.runtime.compile.graph_utils", false]], "intel_extension_for_transformers.transformers.runtime.compile.loaders": [[59, "module-intel_extension_for_transformers.transformers.runtime.compile.loaders", false]], "intel_extension_for_transformers.transformers.runtime.compile.loaders.loader": [[60, "module-intel_extension_for_transformers.transformers.runtime.compile.loaders.loader", false]], "intel_extension_for_transformers.transformers.runtime.compile.logger": [[61, "module-intel_extension_for_transformers.transformers.runtime.compile.logger", false]], "intel_extension_for_transformers.transformers.runtime.compile.onnx_utils": [[62, "module-intel_extension_for_transformers.transformers.runtime.compile.onnx_utils", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops": [[83, "module-intel_extension_for_transformers.transformers.runtime.compile.ops", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.all": [[63, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.all", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.assert": [[64, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.assert", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.baddbmm": [[65, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.baddbmm", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul": [[66, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul_v2": [[67, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul_v2", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.bias_add": [[68, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.bias_add", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.cast": [[69, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.cast", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.concat": [[70, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.concat", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.conv": [[71, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.conv", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.cos": [[72, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.cos", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops": [[73, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.expand_dims": [[74, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.expand_dims", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_batch_matmul_v2": [[75, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.fused_batch_matmul_v2", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_batch_norm_v3": [[76, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.fused_batch_norm_v3", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_gemm": [[77, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.fused_gemm", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_matmul": [[78, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.fused_matmul", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.gather": [[79, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.gather", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.gather_elements": [[80, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.gather_elements", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.gelu": [[81, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.gelu", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.gemm": [[82, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.gemm", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_get_next": [[84, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_get_next", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_v2": [[85, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_v2", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.layer_normalization": [[86, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.layer_normalization", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.log_softmax": [[87, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.log_softmax", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.map_and_batch_dataset": [[88, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.map_and_batch_dataset", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.matmul": [[89, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.matmul", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.mean": [[90, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.mean", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.mkl_layer_norm": [[91, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.mkl_layer_norm", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.model_dataset": [[92, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.model_dataset", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.one_hot": [[93, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.one_hot", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.onnx_input": [[94, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.onnx_input", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.op": [[95, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.op", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.optimize_dataset": [[96, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.optimize_dataset", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.pack": [[97, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.pack", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.padding_sequence": [[98, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.padding_sequence", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.placeholder": [[99, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.placeholder", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.pos_embed": [[100, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.pos_embed", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.pow": [[101, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.pow", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_linear": [[102, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_linear", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_v2": [[103, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_v2", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.quantized_fused_matmul_and_dequantize": [[104, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.quantized_fused_matmul_and_dequantize", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.quantized_matmul_with_bias_and_dequantize": [[105, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.quantized_matmul_with_bias_and_dequantize", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_mean": [[106, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_mean", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_sum": [[107, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_sum", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.reorder": [[108, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.reorder", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.reshape": [[109, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.reshape", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.resize": [[110, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.resize", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.rsub": [[111, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.rsub", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.scatter_elements": [[112, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.scatter_elements", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.shape": [[113, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.shape", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.sin": [[114, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.sin", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.size": [[115, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.size", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.slice_position_ids": [[116, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.slice_position_ids", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.softmax": [[117, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.softmax", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.split": [[118, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.split", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.squeeze": [[119, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.squeeze", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.strided_slice": [[120, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.strided_slice", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.tensor": [[121, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.tensor", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.top_k": [[122, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.top_k", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.transpose": [[123, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.transpose", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.unpack": [[124, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.unpack", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.unsqueeze": [[125, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.unsqueeze", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.view": [[126, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.view", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.where": [[127, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.where", false]], "intel_extension_for_transformers.transformers.runtime.compile.optimizer": [[128, "module-intel_extension_for_transformers.transformers.runtime.compile.optimizer", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph": [[150, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.add_cls_token": [[130, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.add_cls_token", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.add_embeddings": [[131, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.add_embeddings", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.arangewithreciprocal": [[132, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.arangewithreciprocal", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_mask_length_adaptive_keep_indices": [[138, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_mask_length_adaptive_keep_indices", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_output_layer_norm_length_adaptive_keep_indices": [[139, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_output_layer_norm_length_adaptive_keep_indices", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_reshape": [[140, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_reshape", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionblock_attentionmaskaddreshape": [[133, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_AttentionMaskAddReshape", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionblock_constantofshapewithmul": [[134, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_ConstantOfShapeWithMul", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionblock_qkvprereshape": [[135, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_QKVPreReshape", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionblock_qkvreshape": [[136, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_QKVReshape", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionblock_weightreshapeto4d": [[137, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_WeightReshapeTo4D", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.cast_to": [[141, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.cast_to", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.collect_quant_info": [[142, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.collect_quant_info", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.conv_reshape": [[143, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.conv_reshape", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.decoder_attn_reshape": [[144, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.decoder_attn_reshape", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.einsumwitharange": [[145, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.einsumwitharange", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.embeddingbag": [[146, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.embeddingbag", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.embeddings_to_2d_before_inner_product": [[147, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.embeddings_to_2d_before_inner_product", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.gelu": [[148, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.gelu", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.generate_sequence": [[149, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.generate_sequence", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductreshapefusion": [[129, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.InnerproductReshapeFusion", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithbiasgelu": [[151, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithbiasgelu", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithslice": [[152, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithslice", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithswish": [[153, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithswish", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.input_data": [[154, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.input_data", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.input_file": [[155, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.input_file", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.insert_bf16_node": [[156, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.insert_bf16_node", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.insert_quant_node": [[157, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.insert_quant_node", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.int8_bf16_mixed_precision_checker": [[158, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.int8_bf16_mixed_precision_checker", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.interact_features": [[159, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.interact_features", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.last_layer_shape": [[160, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.last_layer_shape", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm": [[161, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm_with_reduce_mean": [[162, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm_with_reduce_mean", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm_with_transpose": [[163, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm_with_transpose", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_embeding": [[164, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_embeding", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_matmulwithtranspose": [[165, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_matmulwithtranspose", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_postprocess": [[166, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_postprocess", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_rotary_pos_emb": [[167, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_rotary_pos_emb", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.lower_all_tuples": [[168, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.lower_all_tuples", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias": [[169, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_add": [[170, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_add", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_gelu": [[171, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_gelu", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_relu": [[172, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_relu", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_sigmoid": [[173, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_sigmoid", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_tanh": [[174, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_tanh", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_unsqueeze": [[175, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_unsqueeze", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_transpose": [[176, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_transpose", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_transpose_scale_add": [[177, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_transpose_scale_add", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.merged_embeddingbag": [[178, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.merged_embeddingbag", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.neox_reorder_change": [[179, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.neox_reorder_change", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.neox_rotary_pos_emb": [[180, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.neox_rotary_pos_emb", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.operator_adaptor": [[181, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.operator_adaptor", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.output_data": [[182, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.output_data", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.padding_sequence": [[183, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.padding_sequence", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.pattern": [[184, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.pattern", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.position_embeddings": [[185, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.position_embeddings", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.position_embeddings_v1": [[186, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.position_embeddings_v1", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.qkv_merge": [[187, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.qkv_merge", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.qkv_reshape": [[188, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.qkv_reshape", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quant_gather_to_bf16": [[189, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quant_gather_to_bf16", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quantize_fusion": [[190, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quantize_fusion", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quantized_graph_dtype_refactor": [[191, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quantized_graph_dtype_refactor", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_constant_op": [[192, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_constant_op", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_last_view": [[193, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_last_view", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_range": [[194, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_range", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_unused_operator": [[195, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_unused_operator", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_zeros": [[196, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_zeros", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.removeslice": [[197, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.removeslice", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_after_restore_hidden_states": [[198, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_after_restore_hidden_states", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_before_and_after_attention_out_layer_norm_gather_elements": [[199, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_before_and_after_attention_out_layer_norm_gather_elements", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_before_restore_hidden_states": [[200, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_before_restore_hidden_states", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_fusion": [[201, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_fusion", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.restore_hidden_states_in_length_adaptive_update_indices": [[202, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.restore_hidden_states_in_length_adaptive_update_indices", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.rms_norm": [[203, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.rms_norm", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.rotary_pos_emb": [[204, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.rotary_pos_emb", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.slicemask": [[205, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.slicemask", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stablediffusion_bf16convert": [[211, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_bf16Convert", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stablediffusion_collectqdqinfo": [[212, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_collectQDQInfo", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stablediffusion_explicitnhwctranspose": [[206, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_ExplicitNHWCTranspose", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stablediffusion_explicitnhwctransposeqat": [[207, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_ExplicitNHWCTransposeQAT", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stablediffusion_insertquantnode": [[213, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_insertQuantNode", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stablediffusion_mhareshape": [[208, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_MHAReshape", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stablediffusion_quantizefusion": [[209, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_QuantizeFusion", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stablediffusion_reshapefusion": [[210, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_ReshapeFusion", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.start_end_logits": [[214, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.start_end_logits", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.subgraph_matcher": [[215, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.subgraph_matcher", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textencdoer_word_embedding": [[216, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncdoer_word_embedding", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textencoder_attentionmaskaddreshape": [[217, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_AttentionMaskAddReshape", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textencoder_attentionreshape": [[218, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_AttentionReshape", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textencoder_causal_attention_mask": [[223, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_causal_attention_mask", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textencoder_kvreshape": [[219, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_KVReshape", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textencoder_mulreshape": [[220, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_MulReshape", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textencoder_qreshape": [[221, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_QReshape", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textencoder_softmaxreshape": [[222, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_SoftmaxReshape", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.token_type_embeddings": [[224, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.token_type_embeddings", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.token_type_embeddings_v1": [[225, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.token_type_embeddings_v1", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_embedding": [[226, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_embedding", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_ip_insert_bias": [[227, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_ip_insert_bias", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_unpack_baddbmm": [[228, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_unpack_baddbmm", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torchinsertbf16node": [[229, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torchinsertbf16node", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torchpaddingsquence": [[230, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torchpaddingsquence", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2dmodel_attentionmaskaddreshape": [[231, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_AttentionMaskAddReshape", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2dmodel_constantofshapewithmul": [[232, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_ConstantOfShapeWithMul", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2dmodel_encoderhiddenstatesreshape": [[238, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_encoderHiddenStatesReshape", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2dmodel_ffnslice": [[233, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_FFNSlice", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2dmodel_ffnslice_1": [[234, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_FFNSlice_1", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2dmodel_getsamplebatch": [[239, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_getSampleBatch", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2dmodel_qkvprereshape": [[235, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_QKVPreReshape", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2dmodel_qkvreshape": [[236, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_QKVReshape", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2dmodel_qkvreshape4d": [[237, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_QKVReshape4D", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2dmodel_sampleslice": [[240, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_sampleSlice", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transpose_batch_matmul": [[241, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transpose_batch_matmul", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.word_embeddings": [[242, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.word_embeddings", false]], "intel_extension_for_transformers.transformers.runtime.compile.tf_utils": [[243, "module-intel_extension_for_transformers.transformers.runtime.compile.tf_utils", false]], "intel_extension_for_transformers.transformers.runtime.compile.torch_utils": [[244, "module-intel_extension_for_transformers.transformers.runtime.compile.torch_utils", false]], "intel_extension_for_transformers.transformers.trainer": [[246, "module-intel_extension_for_transformers.transformers.trainer", false]], "intel_extension_for_transformers.transformers.utils": [[249, "module-intel_extension_for_transformers.transformers.utils", false]], "intel_extension_for_transformers.transformers.utils.config": [[247, "module-intel_extension_for_transformers.transformers.utils.config", false]], "intel_extension_for_transformers.transformers.utils.get_throughput": [[248, "module-intel_extension_for_transformers.transformers.utils.get_throughput", false]], "intel_extension_for_transformers.transformers.utils.metrics": [[250, "module-intel_extension_for_transformers.transformers.utils.metrics", false]], "intel_extension_for_transformers.transformers.utils.objectives": [[251, "module-intel_extension_for_transformers.transformers.utils.objectives", false]], "intel_extension_for_transformers.transformers.utils.utility": [[252, "module-intel_extension_for_transformers.transformers.utils.utility", false]], "interactfeatures (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.interact_features)": [[159, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.interact_features.InteractFeatures", false]], "interpolate() (in module util.misc)": [[264, "util.misc.interpolate", false]], "inverse() (in module intel_extension_for_transformers.transformers.dynamic.evolution)": [[30, "intel_extension_for_transformers.transformers.dynamic.evolution.inverse", false]], "invoke_with_optional_args() (in module intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook)": [[24, "intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook.invoke_with_optional_args", false]], "iob() (in module util.postprocess)": [[266, "util.postprocess.iob", false]], "iou (class in intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.IoU", false]], "iou() (in module util.postprocess)": [[266, "util.postprocess.iou", false]], "is_null_numpy_value() (in module intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.is_null_numpy_value", false]], "is_supported_onnx_graph() (in module intel_extension_for_transformers.transformers.runtime.compile.onnx_utils)": [[62, "intel_extension_for_transformers.transformers.runtime.compile.onnx_utils.is_supported_onnx_graph", false]], "is_supported_onnx_node() (in module intel_extension_for_transformers.transformers.runtime.compile.onnx_utils)": [[62, "intel_extension_for_transformers.transformers.runtime.compile.onnx_utils.is_supported_onnx_node", false]], "iteratorgetnext (class in intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_get_next)": [[84, "intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_get_next.IteratorGetNext", false]], "iteratorv2 (class in intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_v2)": [[85, "intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_v2.IteratorV2", false]], "itrexquantizationconfigmixin (class in intel_extension_for_transformers.transformers.utils.config)": [[247, "intel_extension_for_transformers.transformers.utils.config.ITREXQuantizationConfigMixin", false]], "jd (c++ type)": [[278, "_CPPv42jd", false], [279, "_CPPv42jd", false], [280, "_CPPv42jd", false], [281, "_CPPv42jd", false]], "jd::attention (c++ class)": [[279, "_CPPv4N2jd9attentionE", false]], "jd::attention::attention (c++ function)": [[279, "_CPPv4N2jd9attention9attentionERK17kernel_desc_proxy", false], [279, "_CPPv4N2jd9attention9attentionEv", false]], "jd::attention::~attention (c++ function)": [[279, "_CPPv4N2jd9attentionD0Ev", false]], "jd::attention_desc (c++ class)": [[279, "_CPPv4N2jd14attention_descE", false]], "jd::attention_desc::attention_desc (c++ function)": [[279, "_CPPv4N2jd14attention_desc14attention_descERK13operator_desc", false], [279, "_CPPv4N2jd14attention_desc14attention_descEv", false]], "jd::attention_desc::~attention_desc (c++ function)": [[279, "_CPPv4N2jd14attention_descD0Ev", false]], "jd::attention_io (c++ enum)": [[281, "_CPPv4N2jd12attention_ioE", false]], "jd::attention_io::k_bias (c++ enumerator)": [[281, "_CPPv4N2jd12attention_io6K_BIASE", false]], "jd::attention_io::k_scales (c++ enumerator)": [[281, "_CPPv4N2jd12attention_io8K_SCALESE", false]], "jd::attention_io::k_weight (c++ enumerator)": [[281, "_CPPv4N2jd12attention_io8K_WEIGHTE", false]], "jd::attention_io::merge_dst (c++ enumerator)": [[281, "_CPPv4N2jd12attention_io9MERGE_DSTE", false]], "jd::attention_io::merge_src (c++ enumerator)": [[281, "_CPPv4N2jd12attention_io9MERGE_SRCE", false]], "jd::attention_io::q_bias (c++ enumerator)": [[281, "_CPPv4N2jd12attention_io6Q_BIASE", false]], "jd::attention_io::q_k_scales (c++ enumerator)": [[281, "_CPPv4N2jd12attention_io10Q_K_SCALESE", false]], "jd::attention_io::q_k_src2 (c++ enumerator)": [[281, "_CPPv4N2jd12attention_io8Q_K_SRC2E", false]], "jd::attention_io::q_scales (c++ enumerator)": [[281, "_CPPv4N2jd12attention_io8Q_SCALESE", false]], "jd::attention_io::q_weight (c++ enumerator)": [[281, "_CPPv4N2jd12attention_io8Q_WEIGHTE", false]], "jd::attention_io::qk_v_output_scales (c++ enumerator)": [[281, "_CPPv4N2jd12attention_io18QK_V_OUTPUT_SCALESE", false]], "jd::attention_io::qk_v_output_zero_point (c++ enumerator)": [[281, "_CPPv4N2jd12attention_io22QK_V_OUTPUT_ZERO_POINTE", false]], "jd::attention_io::reshape_input (c++ enumerator)": [[281, "_CPPv4N2jd12attention_io13RESHAPE_INPUTE", false]], "jd::attention_io::v_bias (c++ enumerator)": [[281, "_CPPv4N2jd12attention_io6V_BIASE", false]], "jd::attention_io::v_scales (c++ enumerator)": [[281, "_CPPv4N2jd12attention_io8V_SCALESE", false]], "jd::attention_io::v_weight (c++ enumerator)": [[281, "_CPPv4N2jd12attention_io8V_WEIGHTE", false]], "jd::cpu_engine_t (c++ class)": [[278, "_CPPv4N2jd12cpu_engine_tE", false]], "jd::cpu_engine_t::cpu_engine_t (c++ function)": [[278, "_CPPv4N2jd12cpu_engine_t12cpu_engine_tEv", false]], "jd::cpu_engine_t::create_kernel (c++ function)": [[278, "_CPPv4NK2jd12cpu_engine_t13create_kernelERK13operator_descRNSt10shared_ptrI8kernel_tEEPK8stream_t", false]], "jd::cpu_engine_t::create_memory_storage (c++ function)": [[278, "_CPPv4NK2jd12cpu_engine_t21create_memory_storageEPP16memory_storage_t", false]], "jd::cpu_engine_t::create_stream (c++ function)": [[278, "_CPPv4NK2jd12cpu_engine_t13create_streamEPP8stream_t", false]], "jd::cpu_engine_t::empty_list (c++ member)": [[278, "_CPPv4N2jd12cpu_engine_t10empty_listE", false]], "jd::cpu_engine_t::get_implementation_list (c++ function)": [[278, "_CPPv4NK2jd12cpu_engine_t23get_implementation_listERK13operator_desc", false]], "jd::cpu_engine_t::~cpu_engine_t (c++ function)": [[278, "_CPPv4N2jd12cpu_engine_tD0Ev", false]], "jd::dynamic_quant (c++ class)": [[279, "_CPPv4N2jd13dynamic_quantE", false]], "jd::dynamic_quant::dynamic_quant (c++ function)": [[279, "_CPPv4N2jd13dynamic_quant13dynamic_quantERK17kernel_desc_proxy", false], [279, "_CPPv4N2jd13dynamic_quant13dynamic_quantEv", false]], "jd::dynamic_quant::~dynamic_quant (c++ function)": [[279, "_CPPv4N2jd13dynamic_quantD0Ev", false]], "jd::dynamic_quant_desc (c++ class)": [[279, "_CPPv4N2jd18dynamic_quant_descE", false]], "jd::dynamic_quant_desc::dynamic_quant_desc (c++ function)": [[279, "_CPPv4N2jd18dynamic_quant_desc18dynamic_quant_descERK13operator_desc", false], [279, "_CPPv4N2jd18dynamic_quant_desc18dynamic_quant_descEv", false]], "jd::dynamic_quant_desc::~dynamic_quant_desc (c++ function)": [[279, "_CPPv4N2jd18dynamic_quant_descD0Ev", false]], "jd::dynamic_quant_matmul (c++ class)": [[279, "_CPPv4N2jd20dynamic_quant_matmulE", false]], "jd::dynamic_quant_matmul::dynamic_quant_matmul (c++ function)": [[279, "_CPPv4N2jd20dynamic_quant_matmul20dynamic_quant_matmulERK17kernel_desc_proxy", false], [279, "_CPPv4N2jd20dynamic_quant_matmul20dynamic_quant_matmulEv", false]], "jd::dynamic_quant_matmul::~dynamic_quant_matmul (c++ function)": [[279, "_CPPv4N2jd20dynamic_quant_matmulD0Ev", false]], "jd::dynamic_quant_matmul_desc (c++ class)": [[279, "_CPPv4N2jd25dynamic_quant_matmul_descE", false]], "jd::dynamic_quant_matmul_desc::dynamic_quant_matmul_desc (c++ function)": [[279, "_CPPv4N2jd25dynamic_quant_matmul_desc25dynamic_quant_matmul_descERK13operator_desc", false], [279, "_CPPv4N2jd25dynamic_quant_matmul_desc25dynamic_quant_matmul_descEv", false]], "jd::dynamic_quant_matmul_desc::~dynamic_quant_matmul_desc (c++ function)": [[279, "_CPPv4N2jd25dynamic_quant_matmul_descD0Ev", false]], "jd::eltwiseop (c++ class)": [[279, "_CPPv4N2jd9eltwiseopE", false]], "jd::eltwiseop::eltwiseop (c++ function)": [[279, "_CPPv4N2jd9eltwiseop9eltwiseopERK17kernel_desc_proxy", false], [279, "_CPPv4N2jd9eltwiseop9eltwiseopEv", false]], "jd::eltwiseop::~eltwiseop (c++ function)": [[279, "_CPPv4N2jd9eltwiseopD0Ev", false]], "jd::eltwiseop_desc (c++ class)": [[279, "_CPPv4N2jd14eltwiseop_descE", false]], "jd::eltwiseop_desc::eltwiseop_desc (c++ function)": [[279, "_CPPv4N2jd14eltwiseop_desc14eltwiseop_descERK13operator_desc", false], [279, "_CPPv4N2jd14eltwiseop_desc14eltwiseop_descEv", false]], "jd::eltwiseop_desc::~eltwiseop_desc (c++ function)": [[279, "_CPPv4N2jd14eltwiseop_descD0Ev", false]], "jd::engine_t (c++ class)": [[278, "_CPPv4N2jd8engine_tE", false]], "jd::engine_t::create_kernel (c++ function)": [[278, "_CPPv4NK2jd8engine_t13create_kernelERK13operator_descRNSt10shared_ptrI8kernel_tEEPK8stream_t", false]], "jd::engine_t::create_memory_storage (c++ function)": [[278, "_CPPv4NK2jd8engine_t21create_memory_storageEPP16memory_storage_t", false]], "jd::engine_t::create_stream (c++ function)": [[278, "_CPPv4NK2jd8engine_t13create_streamEPP8stream_t", false]], "jd::engine_t::engine_kind_ (c++ member)": [[278, "_CPPv4N2jd8engine_t12engine_kind_E", false]], "jd::engine_t::engine_t (c++ function)": [[278, "_CPPv4N2jd8engine_t8engine_tERK11engine_kindRK12runtime_kind", false]], "jd::engine_t::get_engine_kind (c++ function)": [[278, "_CPPv4NK2jd8engine_t15get_engine_kindEv", false]], "jd::engine_t::get_implementation_list (c++ function)": [[278, "_CPPv4NK2jd8engine_t23get_implementation_listERK13operator_desc", false]], "jd::engine_t::get_runtime_kind (c++ function)": [[278, "_CPPv4NK2jd8engine_t16get_runtime_kindEv", false]], "jd::engine_t::runtime_kind_ (c++ member)": [[278, "_CPPv4N2jd8engine_t13runtime_kind_E", false]], "jd::engine_t::~engine_t (c++ function)": [[278, "_CPPv4N2jd8engine_tD0Ev", false]], "jd::gather (c++ class)": [[279, "_CPPv4N2jd6gatherE", false]], "jd::gather::gather (c++ function)": [[279, "_CPPv4N2jd6gather6gatherERK17kernel_desc_proxy", false], [279, "_CPPv4N2jd6gather6gatherEv", false]], "jd::gather::~gather (c++ function)": [[279, "_CPPv4N2jd6gatherD0Ev", false]], "jd::gather_desc (c++ class)": [[279, "_CPPv4N2jd11gather_descE", false]], "jd::gather_desc::gather_desc (c++ function)": [[279, "_CPPv4N2jd11gather_desc11gather_descERK13operator_desc", false], [279, "_CPPv4N2jd11gather_desc11gather_descEv", false]], "jd::gather_desc::~gather_desc (c++ function)": [[279, "_CPPv4N2jd11gather_descD0Ev", false]], "jd::groupnorm (c++ class)": [[279, "_CPPv4N2jd9groupnormE", false]], "jd::groupnorm::groupnorm (c++ function)": [[279, "_CPPv4N2jd9groupnorm9groupnormERK17kernel_desc_proxy", false], [279, "_CPPv4N2jd9groupnorm9groupnormEv", false]], "jd::groupnorm::~groupnorm (c++ function)": [[279, "_CPPv4N2jd9groupnormD0Ev", false]], "jd::groupnorm_desc (c++ class)": [[279, "_CPPv4N2jd14groupnorm_descE", false]], "jd::groupnorm_desc::groupnorm_desc (c++ function)": [[279, "_CPPv4N2jd14groupnorm_desc14groupnorm_descERK13operator_desc", false], [279, "_CPPv4N2jd14groupnorm_desc14groupnorm_descEv", false]], "jd::groupnorm_desc::~groupnorm_desc (c++ function)": [[279, "_CPPv4N2jd14groupnorm_descD0Ev", false]], "jd::kernel_desc_proxy (c++ class)": [[279, "_CPPv4N2jd17kernel_desc_proxyE", false]], "jd::kernel_desc_proxy::create_proxy_object (c++ function)": [[279, "_CPPv4N2jd17kernel_desc_proxy19create_proxy_objectERNSt10shared_ptrIK13kernel_desc_tEERK13operator_desc", false]], "jd::kernel_desc_proxy::impl_list_ (c++ member)": [[279, "_CPPv4N2jd17kernel_desc_proxy10impl_list_E", false]], "jd::kernel_desc_proxy::kernel_desc_proxy (c++ function)": [[279, "_CPPv4N2jd17kernel_desc_proxy17kernel_desc_proxyERK13operator_desc", false], [279, "_CPPv4N2jd17kernel_desc_proxy17kernel_desc_proxyEv", false]], "jd::kernel_desc_proxy::kernel_kind (c++ function)": [[279, "_CPPv4NK2jd17kernel_desc_proxy11kernel_kindEv", false]], "jd::kernel_desc_proxy::~kernel_desc_proxy (c++ function)": [[279, "_CPPv4N2jd17kernel_desc_proxyD0Ev", false]], "jd::kernel_proxy (c++ class)": [[279, "_CPPv4N2jd12kernel_proxyE", false]], "jd::kernel_proxy::create_proxy_object (c++ function)": [[279, "_CPPv4N2jd12kernel_proxy19create_proxy_objectERNSt10shared_ptrIK8kernel_tEERKNSt10shared_ptrIK13kernel_desc_tEE", false]], "jd::kernel_proxy::execute (c++ function)": [[279, "_CPPv4NK2jd12kernel_proxy7executeERK14exec_context_t", false], [279, "_CPPv4NK2jd12kernel_proxy7executeERKNSt6vectorIPKvEE", false]], "jd::kernel_proxy::get_workspace_size (c++ function)": [[279, "_CPPv4NK2jd12kernel_proxy18get_workspace_sizeEv", false]], "jd::kernel_proxy::kernel_kind (c++ function)": [[279, "_CPPv4NK2jd12kernel_proxy11kernel_kindEv", false]], "jd::kernel_proxy::kernel_proxy (c++ function)": [[279, "_CPPv4N2jd12kernel_proxy12kernel_proxyERK17kernel_desc_proxy", false], [279, "_CPPv4N2jd12kernel_proxy12kernel_proxyEv", false]], "jd::kernel_proxy::~kernel_proxy (c++ function)": [[279, "_CPPv4N2jd12kernel_proxyD0Ev", false]], "jd::layernorm_ba (c++ class)": [[279, "_CPPv4N2jd12layernorm_baE", false]], "jd::layernorm_ba::layernorm_ba (c++ function)": [[279, "_CPPv4N2jd12layernorm_ba12layernorm_baERK17kernel_desc_proxy", false], [279, "_CPPv4N2jd12layernorm_ba12layernorm_baEv", false]], "jd::layernorm_ba::~layernorm_ba (c++ function)": [[279, "_CPPv4N2jd12layernorm_baD0Ev", false]], "jd::layernorm_ba_desc (c++ class)": [[279, "_CPPv4N2jd17layernorm_ba_descE", false]], "jd::layernorm_ba_desc::layernorm_ba_desc (c++ function)": [[279, "_CPPv4N2jd17layernorm_ba_desc17layernorm_ba_descERK13operator_desc", false], [279, "_CPPv4N2jd17layernorm_ba_desc17layernorm_ba_descEv", false]], "jd::layernorm_ba_desc::~layernorm_ba_desc (c++ function)": [[279, "_CPPv4N2jd17layernorm_ba_descD0Ev", false]], "jd::layernormalized_spmm (c++ class)": [[279, "_CPPv4N2jd20layernormalized_spmmE", false]], "jd::layernormalized_spmm::layernormalized_spmm (c++ function)": [[279, "_CPPv4N2jd20layernormalized_spmm20layernormalized_spmmERK17kernel_desc_proxy", false], [279, "_CPPv4N2jd20layernormalized_spmm20layernormalized_spmmEv", false]], "jd::layernormalized_spmm::~layernormalized_spmm (c++ function)": [[279, "_CPPv4N2jd20layernormalized_spmmD0Ev", false]], "jd::layernormalized_spmm_desc (c++ class)": [[279, "_CPPv4N2jd25layernormalized_spmm_descE", false]], "jd::layernormalized_spmm_desc::layernormalized_spmm_desc (c++ function)": [[279, "_CPPv4N2jd25layernormalized_spmm_desc25layernormalized_spmm_descERK13operator_desc", false], [279, "_CPPv4N2jd25layernormalized_spmm_desc25layernormalized_spmm_descEv", false]], "jd::layernormalized_spmm_desc::~layernormalized_spmm_desc (c++ function)": [[279, "_CPPv4N2jd25layernormalized_spmm_descD0Ev", false]], "jd::logsoftmax (c++ class)": [[279, "_CPPv4N2jd10logsoftmaxE", false]], "jd::logsoftmax::logsoftmax (c++ function)": [[279, "_CPPv4N2jd10logsoftmax10logsoftmaxERK17kernel_desc_proxy", false], [279, "_CPPv4N2jd10logsoftmax10logsoftmaxEv", false]], "jd::logsoftmax::~logsoftmax (c++ function)": [[279, "_CPPv4N2jd10logsoftmaxD0Ev", false]], "jd::logsoftmax_desc (c++ class)": [[279, "_CPPv4N2jd15logsoftmax_descE", false]], "jd::logsoftmax_desc::logsoftmax_desc (c++ function)": [[279, "_CPPv4N2jd15logsoftmax_desc15logsoftmax_descERK13operator_desc", false], [279, "_CPPv4N2jd15logsoftmax_desc15logsoftmax_descEv", false]], "jd::logsoftmax_desc::~logsoftmax_desc (c++ function)": [[279, "_CPPv4N2jd15logsoftmax_descD0Ev", false]], "jd::mha_dense (c++ class)": [[279, "_CPPv4N2jd9mha_denseE", false]], "jd::mha_dense::mha_dense (c++ function)": [[279, "_CPPv4N2jd9mha_dense9mha_denseERK17kernel_desc_proxy", false], [279, "_CPPv4N2jd9mha_dense9mha_denseEv", false]], "jd::mha_dense::~mha_dense (c++ function)": [[279, "_CPPv4N2jd9mha_denseD0Ev", false]], "jd::mha_dense_desc (c++ class)": [[279, "_CPPv4N2jd14mha_dense_descE", false]], "jd::mha_dense_desc::mha_dense_desc (c++ function)": [[279, "_CPPv4N2jd14mha_dense_desc14mha_dense_descERK13operator_desc", false], [279, "_CPPv4N2jd14mha_dense_desc14mha_dense_descEv", false]], "jd::mha_dense_desc::~mha_dense_desc (c++ function)": [[279, "_CPPv4N2jd14mha_dense_descD0Ev", false]], "jd::operator_desc (c++ class)": [[280, "_CPPv4N2jd13operator_descE", false]], "jd::operator_desc::apply_postops_list (c++ function)": [[280, "_CPPv4NK2jd13operator_desc18apply_postops_listEv", false]], "jd::operator_desc::apply_postops_list_ (c++ member)": [[280, "_CPPv4N2jd13operator_desc19apply_postops_list_E", false]], "jd::operator_desc::attrs (c++ function)": [[280, "_CPPv4NK2jd13operator_desc5attrsEv", false]], "jd::operator_desc::attrs_ (c++ member)": [[280, "_CPPv4N2jd13operator_desc6attrs_E", false]], "jd::operator_desc::binaryop_list_ (c++ member)": [[280, "_CPPv4N2jd13operator_desc14binaryop_list_E", false]], "jd::operator_desc::engine_kind (c++ function)": [[280, "_CPPv4NK2jd13operator_desc11engine_kindEv", false]], "jd::operator_desc::engine_kind_ (c++ member)": [[280, "_CPPv4N2jd13operator_desc12engine_kind_E", false]], "jd::operator_desc::get_binaryop_list (c++ function)": [[280, "_CPPv4NK2jd13operator_desc17get_binaryop_listEv", false]], "jd::operator_desc::impl_nthr (c++ function)": [[280, "_CPPv4NK2jd13operator_desc9impl_nthrEv", false]], "jd::operator_desc::impl_nthr_ (c++ member)": [[280, "_CPPv4N2jd13operator_desc10impl_nthr_E", false]], "jd::operator_desc::ker_kind_ (c++ member)": [[280, "_CPPv4N2jd13operator_desc9ker_kind_E", false]], "jd::operator_desc::ker_prop_ (c++ member)": [[280, "_CPPv4N2jd13operator_desc9ker_prop_E", false]], "jd::operator_desc::kernel_kind (c++ function)": [[280, "_CPPv4NK2jd13operator_desc11kernel_kindEv", false]], "jd::operator_desc::kernel_prop (c++ function)": [[280, "_CPPv4NK2jd13operator_desc11kernel_propEv", false]], "jd::operator_desc::operator== (c++ function)": [[280, "_CPPv4NK2jd13operator_desceqERK13operator_desc", false]], "jd::operator_desc::operator_desc (c++ function)": [[280, "_CPPv4N2jd13operator_desc13operator_descERK11kernel_kindRK11kernel_propRK11engine_kindRK12runtime_kindRKNSt6vectorI11tensor_descEERKNSt13unordered_mapINSt6stringENSt6stringEEERKNSt6vectorI11postop_attrEE", false], [280, "_CPPv4N2jd13operator_desc13operator_descERK11kernel_kindRK11kernel_propRK11engine_kindRKNSt6vectorI11tensor_descEERKNSt13unordered_mapINSt6stringENSt6stringEEERKNSt6vectorI11postop_attrEE", false], [280, "_CPPv4N2jd13operator_desc13operator_descEv", false]], "jd::operator_desc::runtime_kind (c++ function)": [[280, "_CPPv4NK2jd13operator_desc12runtime_kindEv", false]], "jd::operator_desc::runtime_kind_ (c++ member)": [[280, "_CPPv4N2jd13operator_desc13runtime_kind_E", false]], "jd::operator_desc::set_binaryop_list (c++ function)": [[280, "_CPPv4N2jd13operator_desc17set_binaryop_listERKNSt6vectorI13binaryop_attrEE", false]], "jd::operator_desc::tensor_descs (c++ function)": [[280, "_CPPv4NK2jd13operator_desc12tensor_descsEv", false]], "jd::operator_desc::tensor_dtypes (c++ function)": [[280, "_CPPv4NK2jd13operator_desc13tensor_dtypesEv", false]], "jd::operator_desc::tensor_ftypes (c++ function)": [[280, "_CPPv4NK2jd13operator_desc13tensor_ftypesEv", false]], "jd::operator_desc::tensor_shapes (c++ function)": [[280, "_CPPv4NK2jd13operator_desc13tensor_shapesEv", false]], "jd::operator_desc::ts_descs_ (c++ member)": [[280, "_CPPv4N2jd13operator_desc9ts_descs_E", false]], "jd::operator_desc::~operator_desc (c++ function)": [[280, "_CPPv4N2jd13operator_descD0Ev", false]], "jd::proxy_base (c++ class)": [[279, "_CPPv4I00EN2jd10proxy_baseE", false]], "jd::proxy_base::create_proxy_object (c++ function)": [[279, "_CPPv4N2jd10proxy_base19create_proxy_objectERNSt10shared_ptrIK1TEERK5arg_t", false]], "jd::proxy_base::data_handle_ (c++ member)": [[279, "_CPPv4N2jd10proxy_base12data_handle_E", false]], "jd::proxy_base::get_sp (c++ function)": [[279, "_CPPv4NK2jd10proxy_base6get_spEv", false]], "jd::proxy_base::proxy_base (c++ function)": [[279, "_CPPv4N2jd10proxy_base10proxy_baseEv", false]], "jd::proxy_base::reset_sp (c++ function)": [[279, "_CPPv4N2jd10proxy_base8reset_spERKNSt10shared_ptrIK1TEE", false]], "jd::proxy_base::~proxy_base (c++ function)": [[279, "_CPPv4N2jd10proxy_baseD0Ev", false]], "jd::slice (c++ class)": [[279, "_CPPv4N2jd5sliceE", false]], "jd::slice::slice (c++ function)": [[279, "_CPPv4N2jd5slice5sliceERK17kernel_desc_proxy", false], [279, "_CPPv4N2jd5slice5sliceEv", false]], "jd::slice::~slice (c++ function)": [[279, "_CPPv4N2jd5sliceD0Ev", false]], "jd::slice_desc (c++ class)": [[279, "_CPPv4N2jd10slice_descE", false]], "jd::slice_desc::slice_desc (c++ function)": [[279, "_CPPv4N2jd10slice_desc10slice_descERK13operator_desc", false], [279, "_CPPv4N2jd10slice_desc10slice_descEv", false]], "jd::slice_desc::~slice_desc (c++ function)": [[279, "_CPPv4N2jd10slice_descD0Ev", false]], "jd::softmax (c++ class)": [[279, "_CPPv4N2jd7softmaxE", false]], "jd::softmax::softmax (c++ function)": [[279, "_CPPv4N2jd7softmax7softmaxERK17kernel_desc_proxy", false], [279, "_CPPv4N2jd7softmax7softmaxEv", false]], "jd::softmax::~softmax (c++ function)": [[279, "_CPPv4N2jd7softmaxD0Ev", false]], "jd::softmax_desc (c++ class)": [[279, "_CPPv4N2jd12softmax_descE", false]], "jd::softmax_desc::softmax_desc (c++ function)": [[279, "_CPPv4N2jd12softmax_desc12softmax_descERK13operator_desc", false], [279, "_CPPv4N2jd12softmax_desc12softmax_descEv", false]], "jd::softmax_desc::~softmax_desc (c++ function)": [[279, "_CPPv4N2jd12softmax_descD0Ev", false]], "jd::sparse_matmul (c++ class)": [[279, "_CPPv4N2jd13sparse_matmulE", false]], "jd::sparse_matmul::sparse_matmul (c++ function)": [[279, "_CPPv4N2jd13sparse_matmul13sparse_matmulERK17kernel_desc_proxy", false], [279, "_CPPv4N2jd13sparse_matmul13sparse_matmulEv", false]], "jd::sparse_matmul::~sparse_matmul (c++ function)": [[279, "_CPPv4N2jd13sparse_matmulD0Ev", false]], "jd::sparse_matmul_desc (c++ class)": [[279, "_CPPv4N2jd18sparse_matmul_descE", false]], "jd::sparse_matmul_desc::sparse_matmul_desc (c++ function)": [[279, "_CPPv4N2jd18sparse_matmul_desc18sparse_matmul_descERK13operator_desc", false], [279, "_CPPv4N2jd18sparse_matmul_desc18sparse_matmul_descEv", false]], "jd::sparse_matmul_desc::~sparse_matmul_desc (c++ function)": [[279, "_CPPv4N2jd18sparse_matmul_descD0Ev", false]], "jd::ssd (c++ type)": [[281, "_CPPv4N2jd3ssdE", false]], "jd::ssd::amx_bf16_params_t (c++ type)": [[281, "_CPPv4N2jd3ssd17amx_bf16_params_tE", false]], "jd::ssd::amx_bf16bf16_inputs_t (c++ type)": [[281, "_CPPv4N2jd3ssd21amx_bf16bf16_inputs_tE", false]], "jd::ssd::amx_bf16f32_inputs_t (c++ type)": [[281, "_CPPv4N2jd3ssd20amx_bf16f32_inputs_tE", false]], "jd::ssd::amx_inputs_t (c++ struct)": [[281, "_CPPv4I0000EN2jd3ssd12amx_inputs_tE", false]], "jd::ssd::amx_inputs_t::bias (c++ member)": [[281, "_CPPv4N2jd3ssd12amx_inputs_t4biasE", false]], "jd::ssd::amx_inputs_t::dst (c++ member)": [[281, "_CPPv4N2jd3ssd12amx_inputs_t3dstE", false]], "jd::ssd::amx_inputs_t::src (c++ member)": [[281, "_CPPv4N2jd3ssd12amx_inputs_t3srcE", false]], "jd::ssd::amx_inputs_t::weight (c++ member)": [[281, "_CPPv4N2jd3ssd12amx_inputs_t6weightE", false]], "jd::ssd::amx_int8_params_t (c++ type)": [[281, "_CPPv4N2jd3ssd17amx_int8_params_tE", false]], "jd::ssd::amx_params_t (c++ struct)": [[281, "_CPPv4I0EN2jd3ssd12amx_params_tE", false]], "jd::ssd::amx_params_t::blocks_per_group (c++ member)": [[281, "_CPPv4N2jd3ssd12amx_params_t16blocks_per_groupE", false]], "jd::ssd::amx_params_t::blocksize (c++ member)": [[281, "_CPPv4N2jd3ssd12amx_params_t9blocksizeE", false]], "jd::ssd::amx_params_t::colidxs (c++ member)": [[281, "_CPPv4N2jd3ssd12amx_params_t7colidxsE", false]], "jd::ssd::amx_params_t::group_rowptr (c++ member)": [[281, "_CPPv4N2jd3ssd12amx_params_t12group_rowptrE", false]], "jd::ssd::amx_params_t::has_bias (c++ member)": [[281, "_CPPv4N2jd3ssd12amx_params_t8has_biasE", false]], "jd::ssd::amx_params_t::nnz_group (c++ member)": [[281, "_CPPv4N2jd3ssd12amx_params_t9nnz_groupE", false]], "jd::ssd::amx_params_t::nrowptr (c++ member)": [[281, "_CPPv4N2jd3ssd12amx_params_t7nrowptrE", false]], "jd::ssd::amx_params_t::num_tilem (c++ member)": [[281, "_CPPv4N2jd3ssd12amx_params_t9num_tileME", false]], "jd::ssd::amx_params_t::postop_attrs (c++ member)": [[281, "_CPPv4N2jd3ssd12amx_params_t12postop_attrsE", false]], "jd::ssd::amx_params_t::same_src_dtype (c++ member)": [[281, "_CPPv4N2jd3ssd12amx_params_t14same_src_dtypeE", false]], "jd::ssd::amx_params_t::shape (c++ member)": [[281, "_CPPv4N2jd3ssd12amx_params_t5shapeE", false]], "jd::ssd::amx_params_t::tilem (c++ member)": [[281, "_CPPv4N2jd3ssd12amx_params_t5tileME", false]], "jd::ssd::amx_params_t::tilen (c++ member)": [[281, "_CPPv4N2jd3ssd12amx_params_t5tileNE", false]], "jd::ssd::amx_params_t::weight (c++ member)": [[281, "_CPPv4N2jd3ssd12amx_params_t6weightE", false]], "jd::ssd::avx512_data_t (c++ struct)": [[281, "_CPPv4N2jd3ssd13avx512_data_tE", false]], "jd::ssd::avx512_data_t::bias (c++ member)": [[281, "_CPPv4N2jd3ssd13avx512_data_t4biasE", false]], "jd::ssd::avx512_data_t::dense (c++ member)": [[281, "_CPPv4N2jd3ssd13avx512_data_t5denseE", false]], "jd::ssd::avx512_data_t::dst (c++ member)": [[281, "_CPPv4N2jd3ssd13avx512_data_t3dstE", false]], "jd::ssd::avx512_data_t::sparse (c++ member)": [[281, "_CPPv4N2jd3ssd13avx512_data_t6sparseE", false]], "jd::ssd::avx512_fp32_params_t (c++ struct)": [[281, "_CPPv4N2jd3ssd20avx512_fp32_params_tE", false]], "jd::ssd::avx512_fp32_params_t::has_bias (c++ member)": [[281, "_CPPv4N2jd3ssd20avx512_fp32_params_t8has_biasE", false]], "jd::ssd::avx512_fp32_params_t::im_end (c++ member)": [[281, "_CPPv4N2jd3ssd20avx512_fp32_params_t6im_endE", false]], "jd::ssd::avx512_fp32_params_t::im_start (c++ member)": [[281, "_CPPv4N2jd3ssd20avx512_fp32_params_t8im_startE", false]], "jd::ssd::avx512_fp32_params_t::in_end (c++ member)": [[281, "_CPPv4N2jd3ssd20avx512_fp32_params_t6in_endE", false]], "jd::ssd::avx512_fp32_params_t::in_start (c++ member)": [[281, "_CPPv4N2jd3ssd20avx512_fp32_params_t8in_startE", false]], "jd::ssd::avx512_fp32_params_t::k (c++ member)": [[281, "_CPPv4N2jd3ssd20avx512_fp32_params_t1KE", false]], "jd::ssd::avx512_fp32_params_t::m (c++ member)": [[281, "_CPPv4N2jd3ssd20avx512_fp32_params_t1ME", false]], "jd::ssd::avx512_fp32_params_t::n (c++ member)": [[281, "_CPPv4N2jd3ssd20avx512_fp32_params_t1NE", false]], "jd::ssd::avx512_fp32_params_t::postop_attrs (c++ member)": [[281, "_CPPv4N2jd3ssd20avx512_fp32_params_t12postop_attrsE", false]], "jd::ssd::avx512_fp32_params_t::sparse_ptr (c++ member)": [[281, "_CPPv4N2jd3ssd20avx512_fp32_params_t10sparse_ptrE", false]], "jd::ssd::bias (c++ member)": [[281, "_CPPv4N2jd3ssd4BIASE", false]], "jd::ssd::dst (c++ member)": [[281, "_CPPv4N2jd3ssd3DSTE", false]], "jd::ssd::dst_m1 (c++ member)": [[281, "_CPPv4N2jd3ssd6DST_M1E", false]], "jd::ssd::dst_m2 (c++ member)": [[281, "_CPPv4N2jd3ssd6DST_M2E", false]], "jd::ssd::eltwiseop_data_t (c++ struct)": [[281, "_CPPv4N2jd3ssd16eltwiseop_data_tE", false]], "jd::ssd::eltwiseop_data_t::dst (c++ member)": [[281, "_CPPv4N2jd3ssd16eltwiseop_data_t3dstE", false]], "jd::ssd::eltwiseop_data_t::element_num (c++ member)": [[281, "_CPPv4N2jd3ssd16eltwiseop_data_t11element_numE", false]], "jd::ssd::eltwiseop_data_t::src (c++ member)": [[281, "_CPPv4N2jd3ssd16eltwiseop_data_t3srcE", false]], "jd::ssd::eltwiseop_param_t (c++ struct)": [[281, "_CPPv4N2jd3ssd17eltwiseop_param_tE", false]], "jd::ssd::eltwiseop_param_t::element_num (c++ member)": [[281, "_CPPv4N2jd3ssd17eltwiseop_param_t11element_numE", false]], "jd::ssd::eltwiseop_param_t::element_num_each_th (c++ member)": [[281, "_CPPv4N2jd3ssd17eltwiseop_param_t19element_num_each_thE", false]], "jd::ssd::eltwiseop_param_t::in_dt (c++ member)": [[281, "_CPPv4N2jd3ssd17eltwiseop_param_t5in_dtE", false]], "jd::ssd::eltwiseop_param_t::out_dt (c++ member)": [[281, "_CPPv4N2jd3ssd17eltwiseop_param_t6out_dtE", false]], "jd::ssd::eltwiseop_param_t::postop_attrs (c++ member)": [[281, "_CPPv4N2jd3ssd17eltwiseop_param_t12postop_attrsE", false]], "jd::ssd::eltwiseop_param_t::remain_element (c++ member)": [[281, "_CPPv4N2jd3ssd17eltwiseop_param_t14remain_elementE", false]], "jd::ssd::layernorm_ba_data_t (c++ struct)": [[281, "_CPPv4N2jd3ssd19layernorm_ba_data_tE", false]], "jd::ssd::layernorm_ba_data_t::[anonymous] (c++ member)": [[281, "_CPPv4N2jd3ssd19layernorm_ba_data_tUt1_3E", false]], "jd::ssd::layernorm_ba_data_t::alpha (c++ member)": [[281, "_CPPv4N2jd3ssd19layernorm_ba_data_t5alphaE", false]], "jd::ssd::layernorm_ba_data_t::beta (c++ member)": [[281, "_CPPv4N2jd3ssd19layernorm_ba_data_t4betaE", false]], "jd::ssd::layernorm_ba_data_t::dst (c++ member)": [[281, "_CPPv4N2jd3ssd19layernorm_ba_data_t3dstE", false]], "jd::ssd::layernorm_ba_data_t::dst2 (c++ member)": [[281, "_CPPv4N2jd3ssd19layernorm_ba_data_t4dst2E", false]], "jd::ssd::layernorm_ba_data_t::eps (c++ member)": [[281, "_CPPv4N2jd3ssd19layernorm_ba_data_t3epsE", false]], "jd::ssd::layernorm_ba_data_t::mean (c++ member)": [[281, "_CPPv4N2jd3ssd19layernorm_ba_data_t4meanE", false]], "jd::ssd::layernorm_ba_data_t::n (c++ member)": [[281, "_CPPv4N2jd3ssd19layernorm_ba_data_t1nE", false]], "jd::ssd::layernorm_ba_data_t::one (c++ member)": [[281, "_CPPv4N2jd3ssd19layernorm_ba_data_t3oneE", false]], "jd::ssd::layernorm_ba_data_t::process_row (c++ member)": [[281, "_CPPv4N2jd3ssd19layernorm_ba_data_t11process_rowE", false]], "jd::ssd::layernorm_ba_data_t::src (c++ member)": [[281, "_CPPv4N2jd3ssd19layernorm_ba_data_t3srcE", false]], "jd::ssd::layernorm_ba_data_t::var (c++ member)": [[281, "_CPPv4N2jd3ssd19layernorm_ba_data_t3varE", false]], "jd::ssd::layernorm_ba_param_t (c++ struct)": [[281, "_CPPv4N2jd3ssd20layernorm_ba_param_tE", false]], "jd::ssd::layernorm_ba_param_t::batch_num (c++ member)": [[281, "_CPPv4N2jd3ssd20layernorm_ba_param_t9batch_numE", false]], "jd::ssd::layernorm_ba_param_t::binaryop_attrs (c++ member)": [[281, "_CPPv4N2jd3ssd20layernorm_ba_param_t14binaryop_attrsE", false]], "jd::ssd::layernorm_ba_param_t::col_num (c++ member)": [[281, "_CPPv4N2jd3ssd20layernorm_ba_param_t7col_numE", false]], "jd::ssd::layernorm_ba_param_t::direct_process_row (c++ member)": [[281, "_CPPv4N2jd3ssd20layernorm_ba_param_t18direct_process_rowE", false]], "jd::ssd::layernorm_ba_param_t::input_dt (c++ member)": [[281, "_CPPv4N2jd3ssd20layernorm_ba_param_t8input_dtE", false]], "jd::ssd::layernorm_ba_param_t::ker_per_batch (c++ member)": [[281, "_CPPv4N2jd3ssd20layernorm_ba_param_t13ker_per_batchE", false]], "jd::ssd::layernorm_ba_param_t::output2_dt (c++ member)": [[281, "_CPPv4N2jd3ssd20layernorm_ba_param_t10output2_dtE", false]], "jd::ssd::layernorm_ba_param_t::output_dt (c++ member)": [[281, "_CPPv4N2jd3ssd20layernorm_ba_param_t9output_dtE", false]], "jd::ssd::layernorm_ba_param_t::postop_attrs (c++ member)": [[281, "_CPPv4N2jd3ssd20layernorm_ba_param_t12postop_attrsE", false]], "jd::ssd::layernorm_ba_param_t::process_batch_per_ker (c++ member)": [[281, "_CPPv4N2jd3ssd20layernorm_ba_param_t21process_batch_per_kerE", false]], "jd::ssd::layernorm_ba_param_t::process_col (c++ member)": [[281, "_CPPv4N2jd3ssd20layernorm_ba_param_t11process_colE", false]], "jd::ssd::layernorm_ba_param_t::row_num (c++ member)": [[281, "_CPPv4N2jd3ssd20layernorm_ba_param_t7row_numE", false]], "jd::ssd::layernorm_ba_param_t::spec_type (c++ member)": [[281, "_CPPv4N2jd3ssd20layernorm_ba_param_t9spec_typeE", false]], "jd::ssd::layernorm_ba_param_t::split_output (c++ member)": [[281, "_CPPv4N2jd3ssd20layernorm_ba_param_t12split_outputE", false]], "jd::ssd::layernorm_ba_param_t::thread_elt_offset (c++ member)": [[281, "_CPPv4N2jd3ssd20layernorm_ba_param_t17thread_elt_offsetE", false]], "jd::ssd::matmul_data_t (c++ struct)": [[281, "_CPPv4N2jd3ssd13matmul_data_tE", false]], "jd::ssd::matmul_data_t::dst (c++ member)": [[281, "_CPPv4N2jd3ssd13matmul_data_t3dstE", false]], "jd::ssd::matmul_data_t::src0 (c++ member)": [[281, "_CPPv4N2jd3ssd13matmul_data_t4src0E", false]], "jd::ssd::matmul_data_t::src1 (c++ member)": [[281, "_CPPv4N2jd3ssd13matmul_data_t4src1E", false]], "jd::ssd::matmul_data_t::src2 (c++ member)": [[281, "_CPPv4N2jd3ssd13matmul_data_t4src2E", false]], "jd::ssd::matmul_fp8_data_t (c++ struct)": [[281, "_CPPv4N2jd3ssd17matmul_fp8_data_tE", false]], "jd::ssd::matmul_fp8_data_t::alpha (c++ member)": [[281, "_CPPv4N2jd3ssd17matmul_fp8_data_t5alphaE", false]], "jd::ssd::matmul_fp8_data_t::astep (c++ member)": [[281, "_CPPv4N2jd3ssd17matmul_fp8_data_t5astepE", false]], "jd::ssd::matmul_fp8_data_t::beta (c++ member)": [[281, "_CPPv4N2jd3ssd17matmul_fp8_data_t4betaE", false]], "jd::ssd::matmul_fp8_data_t::bstep (c++ member)": [[281, "_CPPv4N2jd3ssd17matmul_fp8_data_t5bstepE", false]], "jd::ssd::matmul_fp8_data_t::cstep (c++ member)": [[281, "_CPPv4N2jd3ssd17matmul_fp8_data_t5cstepE", false]], "jd::ssd::matmul_fp8_data_t::dstep (c++ member)": [[281, "_CPPv4N2jd3ssd17matmul_fp8_data_t5dstepE", false]], "jd::ssd::matmul_fp8_data_t::k (c++ member)": [[281, "_CPPv4N2jd3ssd17matmul_fp8_data_t1kE", false]], "jd::ssd::matmul_fp8_data_t::kpos (c++ member)": [[281, "_CPPv4N2jd3ssd17matmul_fp8_data_t4kposE", false]], "jd::ssd::matmul_fp8_data_t::mata (c++ member)": [[281, "_CPPv4N2jd3ssd17matmul_fp8_data_t4matAE", false]], "jd::ssd::matmul_fp8_data_t::matb (c++ member)": [[281, "_CPPv4N2jd3ssd17matmul_fp8_data_t4matBE", false]], "jd::ssd::matmul_fp8_data_t::matc (c++ member)": [[281, "_CPPv4N2jd3ssd17matmul_fp8_data_t4matCE", false]], "jd::ssd::matmul_fp8_data_t::matd (c++ member)": [[281, "_CPPv4N2jd3ssd17matmul_fp8_data_t4matDE", false]], "jd::ssd::matmul_fp8_data_t::mate (c++ member)": [[281, "_CPPv4N2jd3ssd17matmul_fp8_data_t4matEE", false]], "jd::ssd::matmul_fp8_data_t::n (c++ member)": [[281, "_CPPv4N2jd3ssd17matmul_fp8_data_t1nE", false]], "jd::ssd::matmul_fp8_data_t::scale (c++ member)": [[281, "_CPPv4N2jd3ssd17matmul_fp8_data_t5scaleE", false]], "jd::ssd::matmul_fp8_param_t (c++ struct)": [[281, "_CPPv4N2jd3ssd18matmul_fp8_param_tE", false]], "jd::ssd::matmul_fp8_param_t::[anonymous] (c++ member)": [[281, "_CPPv4N2jd3ssd18matmul_fp8_param_tUt1_5E", false]], "jd::ssd::matmul_fp8_param_t::alpha (c++ member)": [[281, "_CPPv4N2jd3ssd18matmul_fp8_param_t5alphaE", false]], "jd::ssd::matmul_fp8_param_t::beta (c++ member)": [[281, "_CPPv4N2jd3ssd18matmul_fp8_param_t4betaE", false]], "jd::ssd::matmul_fp8_param_t::has_append_sum (c++ member)": [[281, "_CPPv4N2jd3ssd18matmul_fp8_param_t14has_append_sumE", false]], "jd::ssd::matmul_fp8_param_t::has_scale0 (c++ member)": [[281, "_CPPv4N2jd3ssd18matmul_fp8_param_t10has_scale0E", false]], "jd::ssd::matmul_fp8_param_t::k (c++ member)": [[281, "_CPPv4N2jd3ssd18matmul_fp8_param_t1KE", false]], "jd::ssd::matmul_fp8_param_t::m (c++ member)": [[281, "_CPPv4N2jd3ssd18matmul_fp8_param_t1ME", false]], "jd::ssd::matmul_fp8_param_t::n (c++ member)": [[281, "_CPPv4N2jd3ssd18matmul_fp8_param_t1NE", false]], "jd::ssd::matmul_fp8_param_t::postop_attrs (c++ member)": [[281, "_CPPv4N2jd3ssd18matmul_fp8_param_t12postop_attrsE", false]], "jd::ssd::matmul_fp8_param_t::thread_num (c++ member)": [[281, "_CPPv4N2jd3ssd18matmul_fp8_param_t10thread_numE", false]], "jd::ssd::matmul_fp8_param_t::weight_8bit (c++ member)": [[281, "_CPPv4N2jd3ssd18matmul_fp8_param_t11weight_8bitE", false]], "jd::ssd::matmul_fp8_param_t::weight_bf16 (c++ member)": [[281, "_CPPv4N2jd3ssd18matmul_fp8_param_t11weight_bf16E", false]], "jd::ssd::matmul_fp8_param_t::weight_f8_e4m3 (c++ member)": [[281, "_CPPv4N2jd3ssd18matmul_fp8_param_t14weight_f8_e4m3E", false]], "jd::ssd::matmul_fp8_param_t::weight_f8_e5m2 (c++ member)": [[281, "_CPPv4N2jd3ssd18matmul_fp8_param_t14weight_f8_e5m2E", false]], "jd::ssd::matmul_fp8_param_t::weight_int8 (c++ member)": [[281, "_CPPv4N2jd3ssd18matmul_fp8_param_t11weight_int8E", false]], "jd::ssd::matmul_fp8_param_t::weight_type (c++ member)": [[281, "_CPPv4N2jd3ssd18matmul_fp8_param_t11weight_typeE", false]], "jd::ssd::matmul_input (c++ type)": [[281, "_CPPv4N2jd3ssd12matmul_inputE", false]], "jd::ssd::matmul_input::input (c++ enum)": [[281, "_CPPv4N2jd3ssd12matmul_input5inputE", false]], "jd::ssd::matmul_input::input::append_sum (c++ enumerator)": [[281, "_CPPv4N2jd3ssd12matmul_input5input10APPEND_SUME", false]], "jd::ssd::matmul_input::input::matmul_io_max (c++ enumerator)": [[281, "_CPPv4N2jd3ssd12matmul_input5input13matmul_io_MAXE", false]], "jd::ssd::matmul_input::input::scale0 (c++ enumerator)": [[281, "_CPPv4N2jd3ssd12matmul_input5input6SCALE0E", false]], "jd::ssd::matmul_input::input::src0 (c++ enumerator)": [[281, "_CPPv4N2jd3ssd12matmul_input5input4SRC0E", false]], "jd::ssd::matmul_input::input::src1 (c++ enumerator)": [[281, "_CPPv4N2jd3ssd12matmul_input5input4SRC1E", false]], "jd::ssd::matmul_input::input::src2 (c++ enumerator)": [[281, "_CPPv4N2jd3ssd12matmul_input5input4SRC2E", false]], "jd::ssd::matmul_input::input::zp0 (c++ enumerator)": [[281, "_CPPv4N2jd3ssd12matmul_input5input3ZP0E", false]], "jd::ssd::matmul_io (c++ type)": [[281, "_CPPv4N2jd3ssd9matmul_ioE", false]], "jd::ssd::matmul_io::io (c++ enum)": [[281, "_CPPv4N2jd3ssd9matmul_io2ioE", false]], "jd::ssd::matmul_io::io::append_sum (c++ enumerator)": [[281, "_CPPv4N2jd3ssd9matmul_io2io10APPEND_SUME", false]], "jd::ssd::matmul_io::io::dst0 (c++ enumerator)": [[281, "_CPPv4N2jd3ssd9matmul_io2io4DST0E", false]], "jd::ssd::matmul_io::io::matmul_io_max (c++ enumerator)": [[281, "_CPPv4N2jd3ssd9matmul_io2io13matmul_io_MAXE", false]], "jd::ssd::matmul_io::io::scale0 (c++ enumerator)": [[281, "_CPPv4N2jd3ssd9matmul_io2io6SCALE0E", false]], "jd::ssd::matmul_io::io::src0 (c++ enumerator)": [[281, "_CPPv4N2jd3ssd9matmul_io2io4SRC0E", false]], "jd::ssd::matmul_io::io::src1 (c++ enumerator)": [[281, "_CPPv4N2jd3ssd9matmul_io2io4SRC1E", false]], "jd::ssd::matmul_io::io::src2 (c++ enumerator)": [[281, "_CPPv4N2jd3ssd9matmul_io2io4SRC2E", false]], "jd::ssd::matmul_io::io::zp0 (c++ enumerator)": [[281, "_CPPv4N2jd3ssd9matmul_io2io3ZP0E", false]], "jd::ssd::matmul_output (c++ type)": [[281, "_CPPv4N2jd3ssd13matmul_outputE", false]], "jd::ssd::matmul_output::output (c++ enum)": [[281, "_CPPv4N2jd3ssd13matmul_output6outputE", false]], "jd::ssd::matmul_output::output::dst0 (c++ enumerator)": [[281, "_CPPv4N2jd3ssd13matmul_output6output4DST0E", false]], "jd::ssd::matmul_param_t (c++ struct)": [[281, "_CPPv4N2jd3ssd14matmul_param_tE", false]], "jd::ssd::matmul_param_t::alpha (c++ member)": [[281, "_CPPv4N2jd3ssd14matmul_param_t5alphaE", false]], "jd::ssd::matmul_param_t::batch (c++ member)": [[281, "_CPPv4N2jd3ssd14matmul_param_t5batchE", false]], "jd::ssd::matmul_param_t::beta (c++ member)": [[281, "_CPPv4N2jd3ssd14matmul_param_t4betaE", false]], "jd::ssd::matmul_param_t::k (c++ member)": [[281, "_CPPv4N2jd3ssd14matmul_param_t1KE", false]], "jd::ssd::matmul_param_t::m (c++ member)": [[281, "_CPPv4N2jd3ssd14matmul_param_t1ME", false]], "jd::ssd::matmul_param_t::m_tile (c++ member)": [[281, "_CPPv4N2jd3ssd14matmul_param_t6m_tileE", false]], "jd::ssd::matmul_param_t::n (c++ member)": [[281, "_CPPv4N2jd3ssd14matmul_param_t1NE", false]], "jd::ssd::matmul_param_t::n_tile (c++ member)": [[281, "_CPPv4N2jd3ssd14matmul_param_t6n_tileE", false]], "jd::ssd::matmul_u8_data_t (c++ struct)": [[281, "_CPPv4N2jd3ssd16matmul_u8_data_tE", false]], "jd::ssd::matmul_u8_data_t::dst (c++ member)": [[281, "_CPPv4N2jd3ssd16matmul_u8_data_t3dstE", false]], "jd::ssd::matmul_u8_data_t::scale (c++ member)": [[281, "_CPPv4N2jd3ssd16matmul_u8_data_t5scaleE", false]], "jd::ssd::matmul_u8_data_t::src0 (c++ member)": [[281, "_CPPv4N2jd3ssd16matmul_u8_data_t4src0E", false]], "jd::ssd::matmul_u8_data_t::src1 (c++ member)": [[281, "_CPPv4N2jd3ssd16matmul_u8_data_t4src1E", false]], "jd::ssd::matmul_u8_data_t::zp (c++ member)": [[281, "_CPPv4N2jd3ssd16matmul_u8_data_t2zpE", false]], "jd::ssd::mean_var_reduce_data_t (c++ struct)": [[281, "_CPPv4N2jd3ssd22mean_var_reduce_data_tE", false]], "jd::ssd::mean_var_reduce_data_t::mean_in (c++ member)": [[281, "_CPPv4N2jd3ssd22mean_var_reduce_data_t7mean_inE", false]], "jd::ssd::mean_var_reduce_data_t::mean_out (c++ member)": [[281, "_CPPv4N2jd3ssd22mean_var_reduce_data_t8mean_outE", false]], "jd::ssd::mean_var_reduce_data_t::var_in (c++ member)": [[281, "_CPPv4N2jd3ssd22mean_var_reduce_data_t6var_inE", false]], "jd::ssd::mean_var_reduce_data_t::var_out (c++ member)": [[281, "_CPPv4N2jd3ssd22mean_var_reduce_data_t7var_outE", false]], "jd::ssd::mean_var_reduce_param_t (c++ struct)": [[281, "_CPPv4N2jd3ssd23mean_var_reduce_param_tE", false]], "jd::ssd::mean_var_reduce_param_t::bm (c++ member)": [[281, "_CPPv4N2jd3ssd23mean_var_reduce_param_t2BME", false]], "jd::ssd::mean_var_reduce_param_t::bn (c++ member)": [[281, "_CPPv4N2jd3ssd23mean_var_reduce_param_t2BNE", false]], "jd::ssd::mean_var_reduce_param_t::element_num (c++ member)": [[281, "_CPPv4N2jd3ssd23mean_var_reduce_param_t11element_numE", false]], "jd::ssd::mean_var_reduce_param_t::m (c++ member)": [[281, "_CPPv4N2jd3ssd23mean_var_reduce_param_t1ME", false]], "jd::ssd::mean_var_reduce_param_t::n (c++ member)": [[281, "_CPPv4N2jd3ssd23mean_var_reduce_param_t1NE", false]], "jd::ssd::scales (c++ member)": [[281, "_CPPv4N2jd3ssd6SCALESE", false]], "jd::ssd::seq_vnni_copy_params (c++ struct)": [[281, "_CPPv4N2jd3ssd20seq_vnni_copy_paramsE", false]], "jd::ssd::seq_vnni_copy_params::dstptr (c++ member)": [[281, "_CPPv4N2jd3ssd20seq_vnni_copy_params6dstptrE", false]], "jd::ssd::seq_vnni_copy_params::dststride (c++ member)": [[281, "_CPPv4N2jd3ssd20seq_vnni_copy_params9dststrideE", false]], "jd::ssd::seq_vnni_copy_params::k (c++ member)": [[281, "_CPPv4N2jd3ssd20seq_vnni_copy_params1kE", false]], "jd::ssd::seq_vnni_copy_params::srcptr (c++ member)": [[281, "_CPPv4N2jd3ssd20seq_vnni_copy_params6srcptrE", false]], "jd::ssd::seq_vnni_copy_params::srcstride (c++ member)": [[281, "_CPPv4N2jd3ssd20seq_vnni_copy_params9srcstrideE", false]], "jd::ssd::softmax_data_t (c++ struct)": [[281, "_CPPv4N2jd3ssd14softmax_data_tE", false]], "jd::ssd::softmax_data_t::dst (c++ member)": [[281, "_CPPv4N2jd3ssd14softmax_data_t3dstE", false]], "jd::ssd::softmax_data_t::one (c++ member)": [[281, "_CPPv4N2jd3ssd14softmax_data_t3oneE", false]], "jd::ssd::softmax_data_t::process_vec_num (c++ member)": [[281, "_CPPv4N2jd3ssd14softmax_data_t15process_vec_numE", false]], "jd::ssd::softmax_data_t::src (c++ member)": [[281, "_CPPv4N2jd3ssd14softmax_data_t3srcE", false]], "jd::ssd::softmax_data_t::tmp (c++ member)": [[281, "_CPPv4N2jd3ssd14softmax_data_t3tmpE", false]], "jd::ssd::softmax_param_t (c++ struct)": [[281, "_CPPv4N2jd3ssd15softmax_param_tE", false]], "jd::ssd::softmax_param_t::get_lut_exp_attrs (c++ member)": [[281, "_CPPv4N2jd3ssd15softmax_param_t17get_lut_exp_attrsE", false]], "jd::ssd::softmax_param_t::input_dt (c++ member)": [[281, "_CPPv4N2jd3ssd15softmax_param_t8input_dtE", false]], "jd::ssd::softmax_param_t::output_dt (c++ member)": [[281, "_CPPv4N2jd3ssd15softmax_param_t9output_dtE", false]], "jd::ssd::softmax_param_t::postop_attrs (c++ member)": [[281, "_CPPv4N2jd3ssd15softmax_param_t12postop_attrsE", false]], "jd::ssd::softmax_param_t::scalar_num (c++ member)": [[281, "_CPPv4N2jd3ssd15softmax_param_t10scalar_numE", false]], "jd::ssd::softmax_param_t::sepc_type (c++ member)": [[281, "_CPPv4N2jd3ssd15softmax_param_t9sepc_typeE", false]], "jd::ssd::softmax_param_t::vec_align_len (c++ member)": [[281, "_CPPv4N2jd3ssd15softmax_param_t13vec_align_lenE", false]], "jd::ssd::softmax_param_t::vec_num_per_thr (c++ member)": [[281, "_CPPv4N2jd3ssd15softmax_param_t15vec_num_per_thrE", false]], "jd::ssd::softmax_param_t::vec_num_tail_thr (c++ member)": [[281, "_CPPv4N2jd3ssd15softmax_param_t16vec_num_tail_thrE", false]], "jd::ssd::softmax_param_t::vec_tail_len (c++ member)": [[281, "_CPPv4N2jd3ssd15softmax_param_t12vec_tail_lenE", false]], "jd::ssd::sparse_scheme (c++ enum)": [[281, "_CPPv4N2jd3ssd13sparse_schemeE", false]], "jd::ssd::sparse_scheme::dense_x_sparse (c++ enumerator)": [[281, "_CPPv4N2jd3ssd13sparse_scheme14dense_x_sparseE", false]], "jd::ssd::sparse_scheme::sparse_x_dense (c++ enumerator)": [[281, "_CPPv4N2jd3ssd13sparse_scheme14sparse_x_denseE", false]], "jd::ssd::sparse_scheme::sparse_x_sparse (c++ enumerator)": [[281, "_CPPv4N2jd3ssd13sparse_scheme15sparse_x_sparseE", false]], "jd::ssd::sparse_scheme::undef (c++ enumerator)": [[281, "_CPPv4N2jd3ssd13sparse_scheme5undefE", false]], "jd::ssd::spec_softmax_type (c++ enum)": [[281, "_CPPv4N2jd3ssd17spec_softmax_typeE", false]], "jd::ssd::spec_softmax_type::lut (c++ enumerator)": [[281, "_CPPv4N2jd3ssd17spec_softmax_type3lutE", false]], "jd::ssd::spec_translnorm_type (c++ enum)": [[281, "_CPPv4N2jd3ssd20spec_translnorm_typeE", false]], "jd::ssd::spec_translnorm_type::direct (c++ enumerator)": [[281, "_CPPv4N2jd3ssd20spec_translnorm_type6directE", false]], "jd::ssd::spec_translnorm_type::normal (c++ enumerator)": [[281, "_CPPv4N2jd3ssd20spec_translnorm_type6normalE", false]], "jd::ssd::src (c++ member)": [[281, "_CPPv4N2jd3ssd3SRCE", false]], "jd::ssd::subfunc_level (c++ enum)": [[281, "_CPPv4N2jd3ssd13subfunc_levelE", false]], "jd::ssd::subfunc_level::kdims (c++ enumerator)": [[281, "_CPPv4N2jd3ssd13subfunc_level5kdimsE", false]], "jd::ssd::subfunc_level::non_kdims (c++ enumerator)": [[281, "_CPPv4N2jd3ssd13subfunc_level9non_kdimsE", false]], "jd::ssd::subfunc_level::none (c++ enumerator)": [[281, "_CPPv4N2jd3ssd13subfunc_level4noneE", false]], "jd::ssd::subfunc_level::subfunc_level_max (c++ enumerator)": [[281, "_CPPv4N2jd3ssd13subfunc_level17subfunc_level_MAXE", false]], "jd::ssd::transpose_copy_params (c++ struct)": [[281, "_CPPv4N2jd3ssd21transpose_copy_paramsE", false]], "jd::ssd::transpose_copy_params::dstptr (c++ member)": [[281, "_CPPv4N2jd3ssd21transpose_copy_params6dstptrE", false]], "jd::ssd::transpose_copy_params::dststride (c++ member)": [[281, "_CPPv4N2jd3ssd21transpose_copy_params9dststrideE", false]], "jd::ssd::transpose_copy_params::k (c++ member)": [[281, "_CPPv4N2jd3ssd21transpose_copy_params1kE", false]], "jd::ssd::transpose_copy_params::srcptr (c++ member)": [[281, "_CPPv4N2jd3ssd21transpose_copy_params6srcptrE", false]], "jd::ssd::transpose_copy_params::srcstride (c++ member)": [[281, "_CPPv4N2jd3ssd21transpose_copy_params9srcstrideE", false]], "jd::ssd::transpose_mha_io (c++ type)": [[281, "_CPPv4N2jd3ssd16transpose_mha_ioE", false]], "jd::ssd::transpose_mha_io::io (c++ enum)": [[281, "_CPPv4N2jd3ssd16transpose_mha_io2ioE", false]], "jd::ssd::transpose_mha_io::io::batch (c++ enumerator)": [[281, "_CPPv4N2jd3ssd16transpose_mha_io2io5BATCHE", false]], "jd::ssd::transpose_mha_io::io::dst (c++ enumerator)": [[281, "_CPPv4N2jd3ssd16transpose_mha_io2io3DSTE", false]], "jd::ssd::transpose_mha_io::io::head_num (c++ enumerator)": [[281, "_CPPv4N2jd3ssd16transpose_mha_io2io8HEAD_NUME", false]], "jd::ssd::transpose_mha_io::io::head_size (c++ enumerator)": [[281, "_CPPv4N2jd3ssd16transpose_mha_io2io9HEAD_SIZEE", false]], "jd::ssd::transpose_mha_io::io::mask (c++ enumerator)": [[281, "_CPPv4N2jd3ssd16transpose_mha_io2io4MASKE", false]], "jd::ssd::transpose_mha_io::io::scale_dst (c++ enumerator)": [[281, "_CPPv4N2jd3ssd16transpose_mha_io2io9SCALE_DSTE", false]], "jd::ssd::transpose_mha_io::io::scale_k (c++ enumerator)": [[281, "_CPPv4N2jd3ssd16transpose_mha_io2io7SCALE_KE", false]], "jd::ssd::transpose_mha_io::io::scale_q (c++ enumerator)": [[281, "_CPPv4N2jd3ssd16transpose_mha_io2io7SCALE_QE", false]], "jd::ssd::transpose_mha_io::io::scale_v (c++ enumerator)": [[281, "_CPPv4N2jd3ssd16transpose_mha_io2io7SCALE_VE", false]], "jd::ssd::transpose_mha_io::io::seq_len (c++ enumerator)": [[281, "_CPPv4N2jd3ssd16transpose_mha_io2io7SEQ_LENE", false]], "jd::ssd::transpose_mha_io::io::sl_pad (c++ enumerator)": [[281, "_CPPv4N2jd3ssd16transpose_mha_io2io6SL_PADE", false]], "jd::ssd::transpose_mha_io::io::src_k (c++ enumerator)": [[281, "_CPPv4N2jd3ssd16transpose_mha_io2io5SRC_KE", false]], "jd::ssd::transpose_mha_io::io::src_q (c++ enumerator)": [[281, "_CPPv4N2jd3ssd16transpose_mha_io2io5SRC_QE", false]], "jd::ssd::transpose_mha_io::io::src_v (c++ enumerator)": [[281, "_CPPv4N2jd3ssd16transpose_mha_io2io5SRC_VE", false]], "jd::ssd::transpose_mha_io::io::tmp2m (c++ enumerator)": [[281, "_CPPv4N2jd3ssd16transpose_mha_io2io5TMP2ME", false]], "jd::ssd::transpose_mha_io::io::transpose_mha_io_max (c++ enumerator)": [[281, "_CPPv4N2jd3ssd16transpose_mha_io2io20transpose_mha_io_MAXE", false]], "jd::ssd::transpose_mha_io::io::zp_dst (c++ enumerator)": [[281, "_CPPv4N2jd3ssd16transpose_mha_io2io6ZP_DSTE", false]], "jd::ssd::transpose_mha_step1_params (c++ struct)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step1_paramsE", false]], "jd::ssd::transpose_mha_step1_params::astep (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step1_params5astepE", false]], "jd::ssd::transpose_mha_step1_params::batchk (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step1_params6batchkE", false]], "jd::ssd::transpose_mha_step1_params::cbatchstep (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step1_params10cbatchstepE", false]], "jd::ssd::transpose_mha_step1_params::cfg (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step1_params3cfgE", false]], "jd::ssd::transpose_mha_step1_params::cstep (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step1_params5cstepE", false]], "jd::ssd::transpose_mha_step1_params::expsum (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step1_params6expsumE", false]], "jd::ssd::transpose_mha_step1_params::k (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step1_params1kE", false]], "jd::ssd::transpose_mha_step1_params::m (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step1_params1mE", false]], "jd::ssd::transpose_mha_step1_params::mata (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step1_params4matAE", false]], "jd::ssd::transpose_mha_step1_params::matb (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step1_params4matBE", false]], "jd::ssd::transpose_mha_step1_params::matc (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step1_params4matCE", false]], "jd::ssd::transpose_mha_step1_params::matd (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step1_params4matDE", false]], "jd::ssd::transpose_mha_step1_params::scaleab (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step1_params7scaleABE", false]], "jd::ssd::transpose_mha_step1_params::sumstep (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step1_params7sumstepE", false]], "jd::ssd::transpose_mha_step2_params (c++ struct)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step2_paramsE", false]], "jd::ssd::transpose_mha_step2_params::dstptr (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step2_params6dstptrE", false]], "jd::ssd::transpose_mha_step2_params::dststride (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step2_params9dststrideE", false]], "jd::ssd::transpose_mha_step2_params::k (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step2_params1kE", false]], "jd::ssd::transpose_mha_step2_params::srcptr (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step2_params6srcptrE", false]], "jd::ssd::transpose_mha_step2_params::srcstride (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step2_params9srcstrideE", false]], "jd::ssd::transpose_mha_step2_params::sumptr (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step2_params6sumptrE", false]], "jd::ssd::transpose_mha_step3_params (c++ struct)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step3_paramsE", false]], "jd::ssd::transpose_mha_step3_params::astep (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step3_params5astepE", false]], "jd::ssd::transpose_mha_step3_params::cfg (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step3_params3cfgE", false]], "jd::ssd::transpose_mha_step3_params::cstep (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step3_params5cstepE", false]], "jd::ssd::transpose_mha_step3_params::k (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step3_params1kE", false]], "jd::ssd::transpose_mha_step3_params::mata (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step3_params4matAE", false]], "jd::ssd::transpose_mha_step3_params::matb (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step3_params4matBE", false]], "jd::ssd::transpose_mha_step3_params::matc (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step3_params4matCE", false]], "jd::ssd::transpose_mha_step3_params::scaleab (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step3_params7scaleABE", false]], "jd::ssd::transpose_mha_step3_params::scalec (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step3_params6scaleCE", false]], "jd::ssd::transpose_mha_step3_params::zeropointc (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step3_params10zeropointCE", false]], "jd::ssd::vnni_data_t (c++ struct)": [[281, "_CPPv4I0EN2jd3ssd11vnni_data_tE", false]], "jd::ssd::vnni_data_t::ptr_bias (c++ member)": [[281, "_CPPv4N2jd3ssd11vnni_data_t8ptr_biasE", false]], "jd::ssd::vnni_data_t::ptr_dense (c++ member)": [[281, "_CPPv4N2jd3ssd11vnni_data_t9ptr_denseE", false]], "jd::ssd::vnni_data_t::ptr_dst (c++ member)": [[281, "_CPPv4N2jd3ssd11vnni_data_t7ptr_dstE", false]], "jd::ssd::vnni_data_t::ptr_dst_m1 (c++ member)": [[281, "_CPPv4N2jd3ssd11vnni_data_t10ptr_dst_m1E", false]], "jd::ssd::vnni_data_t::ptr_dst_m2 (c++ member)": [[281, "_CPPv4N2jd3ssd11vnni_data_t10ptr_dst_m2E", false]], "jd::ssd::vnni_data_t::ptr_scales (c++ member)": [[281, "_CPPv4N2jd3ssd11vnni_data_t10ptr_scalesE", false]], "jd::ssd::vnni_param_t (c++ struct)": [[281, "_CPPv4N2jd3ssd12vnni_param_tE", false]], "jd::ssd::vnni_param_t::append_sum (c++ member)": [[281, "_CPPv4N2jd3ssd12vnni_param_t10append_sumE", false]], "jd::ssd::vnni_param_t::blocksize (c++ member)": [[281, "_CPPv4N2jd3ssd12vnni_param_t9blocksizeE", false]], "jd::ssd::vnni_param_t::bm (c++ member)": [[281, "_CPPv4N2jd3ssd12vnni_param_t2BME", false]], "jd::ssd::vnni_param_t::bn (c++ member)": [[281, "_CPPv4N2jd3ssd12vnni_param_t2BNE", false]], "jd::ssd::vnni_param_t::has_bias (c++ member)": [[281, "_CPPv4N2jd3ssd12vnni_param_t8has_biasE", false]], "jd::ssd::vnni_param_t::im_start (c++ member)": [[281, "_CPPv4N2jd3ssd12vnni_param_t8im_startE", false]], "jd::ssd::vnni_param_t::indices (c++ member)": [[281, "_CPPv4N2jd3ssd12vnni_param_t7indicesE", false]], "jd::ssd::vnni_param_t::indptr (c++ member)": [[281, "_CPPv4N2jd3ssd12vnni_param_t6indptrE", false]], "jd::ssd::vnni_param_t::output_type (c++ member)": [[281, "_CPPv4N2jd3ssd12vnni_param_t11output_typeE", false]], "jd::ssd::vnni_param_t::postop_attrs (c++ member)": [[281, "_CPPv4N2jd3ssd12vnni_param_t12postop_attrsE", false]], "jd::ssd::vnni_param_t::sub_func (c++ member)": [[281, "_CPPv4N2jd3ssd12vnni_param_t8sub_funcE", false]], "jd::ssd::vnni_param_t::tile_w (c++ member)": [[281, "_CPPv4N2jd3ssd12vnni_param_t6tile_wE", false]], "jd::ssd::vnni_param_t::weight (c++ member)": [[281, "_CPPv4N2jd3ssd12vnni_param_t6weightE", false]], "jd::ssd::vnni_param_t::welford (c++ member)": [[281, "_CPPv4N2jd3ssd12vnni_param_t7welfordE", false]], "jd::ssd::wei (c++ member)": [[281, "_CPPv4N2jd3ssd3WEIE", false]], "jd::ssd::work_space (c++ member)": [[281, "_CPPv4N2jd3ssd10WORK_SPACEE", false]], "jd::transpose_matmul (c++ class)": [[279, "_CPPv4N2jd16transpose_matmulE", false]], "jd::transpose_matmul::transpose_matmul (c++ function)": [[279, "_CPPv4N2jd16transpose_matmul16transpose_matmulERK17kernel_desc_proxy", false], [279, "_CPPv4N2jd16transpose_matmul16transpose_matmulEv", false]], "jd::transpose_matmul::~transpose_matmul (c++ function)": [[279, "_CPPv4N2jd16transpose_matmulD0Ev", false]], "jd::transpose_matmul_desc (c++ class)": [[279, "_CPPv4N2jd21transpose_matmul_descE", false]], "jd::transpose_matmul_desc::transpose_matmul_desc (c++ function)": [[279, "_CPPv4N2jd21transpose_matmul_desc21transpose_matmul_descERK13operator_desc", false], [279, "_CPPv4N2jd21transpose_matmul_desc21transpose_matmul_descEv", false]], "jd::transpose_matmul_desc::~transpose_matmul_desc (c++ function)": [[279, "_CPPv4N2jd21transpose_matmul_descD0Ev", false]], "jd::transpose_mha (c++ class)": [[279, "_CPPv4N2jd13transpose_mhaE", false]], "jd::transpose_mha::transpose_mha (c++ function)": [[279, "_CPPv4N2jd13transpose_mha13transpose_mhaERK17kernel_desc_proxy", false], [279, "_CPPv4N2jd13transpose_mha13transpose_mhaEv", false]], "jd::transpose_mha::~transpose_mha (c++ function)": [[279, "_CPPv4N2jd13transpose_mhaD0Ev", false]], "jd::transpose_mha_desc (c++ class)": [[279, "_CPPv4N2jd18transpose_mha_descE", false]], "jd::transpose_mha_desc::transpose_mha_desc (c++ function)": [[279, "_CPPv4N2jd18transpose_mha_desc18transpose_mha_descERK13operator_desc", false], [279, "_CPPv4N2jd18transpose_mha_desc18transpose_mha_descEv", false]], "jd::transpose_mha_desc::~transpose_mha_desc (c++ function)": [[279, "_CPPv4N2jd18transpose_mha_descD0Ev", false]], "lastlayershape (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.last_layer_shape)": [[160, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.last_layer_shape.LastLayerShape", false]], "latrange (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.LatRange", false]], "layernorm (class in intel_extension_for_transformers.transformers.runtime.compile.ops.layer_normalization)": [[86, "intel_extension_for_transformers.transformers.runtime.compile.ops.layer_normalization.LayerNorm", false]], "layernorm (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm)": [[161, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm.LayerNorm", false]], "layernormalization (class in intel_extension_for_transformers.transformers.runtime.compile.ops.layer_normalization)": [[86, "intel_extension_for_transformers.transformers.runtime.compile.ops.layer_normalization.LayerNormalization", false]], "layernormwithreducemean (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm_with_reduce_mean)": [[162, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm_with_reduce_mean.LayerNormWithReduceMean", false]], "layernormwithtranspose (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm_with_transpose)": [[163, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm_with_transpose.LayerNormWithTranspose", false]], "lazyimport (class in intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.LazyImport", false]], "list2str() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.list2str", false]], "listconstruct (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.ListConstruct", false]], "listunpack (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.ListUnpack", false]], "llamaattention (class in intel_extension_for_transformers.transformers.kv_cache_compression.models.modeling_llama)": [[32, "intel_extension_for_transformers.transformers.kv_cache_compression.models.modeling_llama.LlamaAttention", false]], "llamaembeddings (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_embeding)": [[164, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_embeding.LlamaEmbeddings", false]], "llamaflashattention2 (class in intel_extension_for_transformers.transformers.kv_cache_compression.models.modeling_llama)": [[32, "intel_extension_for_transformers.transformers.kv_cache_compression.models.modeling_llama.LlamaFlashAttention2", false]], "llamamatmulwithtranspose (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_matmulwithtranspose)": [[165, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_matmulwithtranspose.LlamaMatMulWithTranspose", false]], "llamapostprocess (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_postprocess)": [[166, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_postprocess.LlamaPostprocess", false]], "llamaroraryposemb (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_rotary_pos_emb)": [[167, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_rotary_pos_emb.LlamaRoraryPosEmb", false]], "llamasdpaattention (class in intel_extension_for_transformers.transformers.kv_cache_compression.models.modeling_llama)": [[32, "intel_extension_for_transformers.transformers.kv_cache_compression.models.modeling_llama.LlamaSdpaAttention", false]], "load() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.stat method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Stat.load", false]], "load_cached_state() (in module intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.load_cached_state", false]], "load_state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.bincount method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Bincount.load_state_dict", false]], "load_state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.combinedstat method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.CombinedStat.load_state_dict", false]], "load_state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.covariance method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Covariance.load_state_dict", false]], "load_state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.crosscovariance method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.CrossCovariance.load_state_dict", false]], "load_state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.crossiou method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.CrossIoU.load_state_dict", false]], "load_state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.history method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.History.load_state_dict", false]], "load_state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.iou method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.IoU.load_state_dict", false]], "load_state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.mean method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Mean.load_state_dict", false]], "load_state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.quantile method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Quantile.load_state_dict", false]], "load_state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.secondmoment method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.SecondMoment.load_state_dict", false]], "load_state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.stat method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Stat.load_state_dict", false]], "load_state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.variance method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Variance.load_state_dict", false]], "load_store() (intel_extension_for_transformers.transformers.dynamic.evolution.evolution method)": [[30, "intel_extension_for_transformers.transformers.dynamic.evolution.Evolution.load_store", false]], "load_tf_weights_in_bert() (in module intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.load_tf_weights_in_bert", false]], "loader (class in intel_extension_for_transformers.transformers.runtime.compile.loaders.loader)": [[60, "intel_extension_for_transformers.transformers.runtime.compile.loaders.loader.Loader", false]], "log() (in module intel_extension_for_transformers.transformers.runtime.compile.logger)": [[61, "intel_extension_for_transformers.transformers.runtime.compile.logger.log", false]], "logger (class in intel_extension_for_transformers.transformers.runtime.compile.logger)": [[61, "intel_extension_for_transformers.transformers.runtime.compile.logger.Logger", false]], "logsoftmax (class in intel_extension_for_transformers.transformers.runtime.compile.ops.log_softmax)": [[87, "intel_extension_for_transformers.transformers.runtime.compile.ops.log_softmax.LogSoftmax", false]], "loop (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Loop", false]], "loss_boxes() (models.detr.setcriterion method)": [[256, "models.detr.SetCriterion.loss_boxes", false]], "loss_boxes() (models.detr_multi.setcriterion method)": [[257, "models.detr_multi.SetCriterion.loss_boxes", false]], "loss_cardinality() (models.detr.setcriterion method)": [[256, "models.detr.SetCriterion.loss_cardinality", false]], "loss_cardinality() (models.detr_multi.setcriterion method)": [[257, "models.detr_multi.SetCriterion.loss_cardinality", false]], "loss_labels() (models.detr.setcriterion method)": [[256, "models.detr.SetCriterion.loss_labels", false]], "loss_labels() (models.detr_multi.setcriterion method)": [[257, "models.detr_multi.SetCriterion.loss_labels", false]], "loss_masks() (models.detr.setcriterion method)": [[256, "models.detr.SetCriterion.loss_masks", false]], "loss_masks() (models.detr_multi.setcriterion method)": [[257, "models.detr_multi.SetCriterion.loss_masks", false]], "loweralltuples (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.lower_all_tuples)": [[168, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.lower_all_tuples.LowerAllTuples", false]], "main_eval_only": [[253, "module-main_eval_only", false]], "main_parse_and_eval": [[254, "module-main_parse_and_eval", false]], "make_loader() (in module intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.make_loader", false]], "makeiterator (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.MakeIterator", false]], "mapandbatchdataset (class in intel_extension_for_transformers.transformers.runtime.compile.ops.map_and_batch_dataset)": [[88, "intel_extension_for_transformers.transformers.runtime.compile.ops.map_and_batch_dataset.MapAndBatchDataset", false]], "masked_fill (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Masked_fill", false]], "maskheadsmallconv (class in models.segmentation)": [[260, "models.segmentation.MaskHeadSmallConv", false]], "masks_to_boxes() (in module util.box_ops)": [[263, "util.box_ops.masks_to_boxes", false]], "matmul (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Matmul", false]], "matmul (class in intel_extension_for_transformers.transformers.runtime.compile.ops.matmul)": [[89, "intel_extension_for_transformers.transformers.runtime.compile.ops.matmul.MatMul", false]], "matmulwithbias (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.MatMulWithBias", false]], "matmulwithbias (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias)": [[169, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias.MatMulWithBias", false]], "matmulwithbiasadd (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.MatMulWithBiasAdd", false]], "matmulwithbiasadd (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_add)": [[170, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_add.MatMulWithBiasAdd", false]], "matmulwithbiasgelu (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.MatMulWithBiasGelu", false]], "matmulwithbiasgelu (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_gelu)": [[171, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_gelu.MatMulWithBiasGelu", false]], "matmulwithbiasrelu (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.MatMulWithBiasRelu", false]], "matmulwithbiasrelu (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_relu)": [[172, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_relu.MatMulWithBiasRelu", false]], "matmulwithbiassigmoid (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.MatMulWithBiasSigmoid", false]], "matmulwithbiassigmoid (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_sigmoid)": [[173, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_sigmoid.MatMulWithBiasSigmoid", false]], "matmulwithbiastanh (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.MatMulWithBiasTanh", false]], "matmulwithbiastanh (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_tanh)": [[174, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_tanh.MatmulWithBiasTanh", false]], "matmulwithbiasunsqueeze (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_unsqueeze)": [[175, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_unsqueeze.MatMulWithBiasUnsqueeze", false]], "matmulwithtranspose (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_transpose)": [[176, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_transpose.MatMulWithTranspose", false]], "matmulwithtranspose (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_transpose_scale_add)": [[177, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_transpose_scale_add.MatMulWithTranspose", false]], "max (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Max", false]], "mean (class in intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Mean", false]], "mean (class in intel_extension_for_transformers.transformers.runtime.compile.ops.mean)": [[90, "intel_extension_for_transformers.transformers.runtime.compile.ops.mean.Mean", false]], "mergedembeddingbag (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.MergedEmbeddingbag", false]], "mergedembeddingbag (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.merged_embeddingbag)": [[178, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.merged_embeddingbag.MergedEmbeddingbag", false]], "metric (class in intel_extension_for_transformers.transformers.utils.metrics)": [[250, "intel_extension_for_transformers.transformers.utils.metrics.Metric", false]], "mhattentionmap (class in models.segmentation)": [[260, "models.segmentation.MHAttentionMap", false]], "mkdir() (in module intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.util)": [[21, "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.util.mkdir", false]], "mkdirs() (in module intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.util)": [[21, "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.util.mkdirs", false]], "mlp (class in models.detr)": [[256, "models.detr.MLP", false]], "mlp (class in models.detr_multi)": [[257, "models.detr_multi.MLP", false]], "mmr (intel_extension_for_transformers.langchain.langchain_community.retrievers.child_parent_retriever.searchtype attribute)": [[2, "intel_extension_for_transformers.langchain.langchain_community.retrievers.child_parent_retriever.SearchType.mmr", false]], "model (intel_extension_for_transformers.transformers.pruner.pruning.pruning attribute)": [[47, "intel_extension_for_transformers.transformers.pruner.pruning.Pruning.model", false]], "modelarguments (class in intel_extension_for_transformers.neural_chat.config)": [[5, "intel_extension_for_transformers.neural_chat.config.ModelArguments", false]], "modeldataset (class in intel_extension_for_transformers.transformers.runtime.compile.ops.model_dataset)": [[92, "intel_extension_for_transformers.transformers.runtime.compile.ops.model_dataset.ModelDataset", false]], "models.backbone": [[255, "module-models.backbone", false]], "models.detr": [[256, "module-models.detr", false]], "models.detr_multi": [[257, "module-models.detr_multi", false]], "models.matcher": [[258, "module-models.matcher", false]], "models.position_encoding": [[259, "module-models.position_encoding", false]], "models.segmentation": [[260, "module-models.segmentation", false]], "models.transformer": [[261, "module-models.transformer", false]], "modelsize() (intel_extension_for_transformers.transformers.utils.objectives.objective static method)": [[251, "intel_extension_for_transformers.transformers.utils.objectives.Objective.modelsize", false]], "modify_node_connections() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.modify_node_connections", false]], "module": [[0, "module-conversation", false], [1, "module-gaudi_spawn", false], [2, "module-intel_extension_for_transformers.langchain.langchain_community.retrievers.child_parent_retriever", false], [3, "module-intel_extension_for_transformers.langchain.langchain_community.vectorstores.chroma", false], [4, "module-intel_extension_for_transformers.neural_chat.chatbot", false], [5, "module-intel_extension_for_transformers.neural_chat.config", false], [6, "module-intel_extension_for_transformers.neural_chat.config_logging", false], [7, "module-intel_extension_for_transformers.neural_chat.errorcode", false], [8, "module-intel_extension_for_transformers.neural_chat.pipeline", false], [9, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.image2image.instructpix2pix_pipeline", false], [10, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.memory.memory", false], [11, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.detector.intent_detection", false], [12, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.detector.query_explainer", false], [13, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.parser.parser", false], [14, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.retriever_adapter", false], [15, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.security.safety_checker", false], [16, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.bfm", false], [17, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks", false], [18, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util", false], [19, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.load_mats", false], [20, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.preprocess", false], [21, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.util", false], [22, "module-intel_extension_for_transformers.neural_chat.server.restful.openai_protocol", false], [23, "module-intel_extension_for_transformers.neural_chat.tools.rome.repr_tools", false], [24, "module-intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook", false], [25, "module-intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats", false], [26, "module-intel_extension_for_transformers.tools.utils", false], [27, "module-intel_extension_for_transformers.transformers.benchmark", false], [28, "module-intel_extension_for_transformers.transformers.config", false], [29, "module-intel_extension_for_transformers.transformers.dynamic.drop_and_restore_utils", false], [30, "module-intel_extension_for_transformers.transformers.dynamic.evolution", false], [31, "module-intel_extension_for_transformers.transformers.dynamic", false], [32, "module-intel_extension_for_transformers.transformers.kv_cache_compression.models.modeling_llama", false], [33, "module-intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode", false], [34, "module-intel_extension_for_transformers.transformers.modeling", false], [35, "module-intel_extension_for_transformers.transformers.modeling.model", false], [36, "module-intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic", false], [37, "module-intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.bart.modeling_bart", false], [38, "module-intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.llama.pos_shift_llama", false], [39, "module-intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mistral.modeling_mistral", false], [40, "module-intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral.modeling_mixtral", false], [41, "module-intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.phi.modeling_phi", false], [42, "module-intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.swin.modeling_swin", false], [43, "module-intel_extension_for_transformers.transformers.modeling.modeling_gaudi.streaming_llm", false], [44, "module-intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic", false], [45, "module-intel_extension_for_transformers.transformers.pipeline", false], [46, "module-intel_extension_for_transformers.transformers.pruner", false], [47, "module-intel_extension_for_transformers.transformers.pruner.pruning", false], [48, "module-intel_extension_for_transformers.transformers.quantization", false], [49, "module-intel_extension_for_transformers.transformers.runtime.compile.compile", false], [50, "module-intel_extension_for_transformers.transformers.runtime.compile.extractors.extractor", false], [51, "module-intel_extension_for_transformers.transformers.runtime.compile.extractors", false], [52, "module-intel_extension_for_transformers.transformers.runtime.compile.extractors.onnx_extractor", false], [53, "module-intel_extension_for_transformers.transformers.runtime.compile.extractors.tf_extractor", false], [54, "module-intel_extension_for_transformers.transformers.runtime.compile.extractors.torch_extractor", false], [55, "module-intel_extension_for_transformers.transformers.runtime.compile.graph.graph", false], [56, "module-intel_extension_for_transformers.transformers.runtime.compile.graph", false], [57, "module-intel_extension_for_transformers.transformers.runtime.compile.graph_utils", false], [58, "module-intel_extension_for_transformers.transformers.runtime.compile", false], [59, "module-intel_extension_for_transformers.transformers.runtime.compile.loaders", false], [60, "module-intel_extension_for_transformers.transformers.runtime.compile.loaders.loader", false], [61, "module-intel_extension_for_transformers.transformers.runtime.compile.logger", false], [62, "module-intel_extension_for_transformers.transformers.runtime.compile.onnx_utils", false], [63, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.all", false], [64, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.assert", false], [65, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.baddbmm", false], [66, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul", false], [67, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul_v2", false], [68, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.bias_add", false], [69, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.cast", false], [70, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.concat", false], [71, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.conv", false], [72, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.cos", false], [73, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops", false], [74, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.expand_dims", false], [75, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.fused_batch_matmul_v2", false], [76, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.fused_batch_norm_v3", false], [77, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.fused_gemm", false], [78, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.fused_matmul", false], [79, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.gather", false], [80, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.gather_elements", false], [81, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.gelu", false], [82, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.gemm", false], [83, "module-intel_extension_for_transformers.transformers.runtime.compile.ops", false], [84, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_get_next", false], [85, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_v2", false], [86, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.layer_normalization", false], [87, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.log_softmax", false], [88, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.map_and_batch_dataset", false], [89, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.matmul", false], [90, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.mean", false], [91, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.mkl_layer_norm", false], [92, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.model_dataset", false], [93, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.one_hot", false], [94, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.onnx_input", false], [95, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.op", false], [96, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.optimize_dataset", false], [97, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.pack", false], [98, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.padding_sequence", false], [99, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.placeholder", false], [100, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.pos_embed", false], [101, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.pow", false], [102, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_linear", false], [103, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_v2", false], [104, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.quantized_fused_matmul_and_dequantize", false], [105, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.quantized_matmul_with_bias_and_dequantize", false], [106, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_mean", false], [107, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_sum", false], [108, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.reorder", false], [109, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.reshape", false], [110, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.resize", false], [111, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.rsub", false], [112, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.scatter_elements", false], [113, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.shape", false], [114, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.sin", false], [115, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.size", false], [116, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.slice_position_ids", false], [117, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.softmax", false], [118, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.split", false], [119, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.squeeze", false], [120, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.strided_slice", false], [121, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.tensor", false], [122, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.top_k", false], [123, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.transpose", false], [124, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.unpack", false], [125, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.unsqueeze", false], [126, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.view", false], [127, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.where", false], [128, "module-intel_extension_for_transformers.transformers.runtime.compile.optimizer", false], [129, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.InnerproductReshapeFusion", false], [130, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.add_cls_token", false], [131, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.add_embeddings", false], [132, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.arangewithreciprocal", false], [133, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_AttentionMaskAddReshape", false], [134, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_ConstantOfShapeWithMul", false], [135, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_QKVPreReshape", false], [136, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_QKVReshape", false], [137, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_WeightReshapeTo4D", false], [138, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_mask_length_adaptive_keep_indices", false], [139, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_output_layer_norm_length_adaptive_keep_indices", false], [140, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_reshape", false], [141, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.cast_to", false], [142, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.collect_quant_info", false], [143, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.conv_reshape", false], [144, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.decoder_attn_reshape", false], [145, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.einsumwitharange", false], [146, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.embeddingbag", false], [147, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.embeddings_to_2d_before_inner_product", false], [148, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.gelu", false], [149, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.generate_sequence", false], [150, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph", false], [151, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithbiasgelu", false], [152, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithslice", false], [153, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithswish", false], [154, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.input_data", false], [155, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.input_file", false], [156, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.insert_bf16_node", false], [157, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.insert_quant_node", false], [158, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.int8_bf16_mixed_precision_checker", false], [159, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.interact_features", false], [160, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.last_layer_shape", false], [161, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm", false], [162, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm_with_reduce_mean", false], [163, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm_with_transpose", false], [164, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_embeding", false], [165, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_matmulwithtranspose", false], [166, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_postprocess", false], [167, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_rotary_pos_emb", false], [168, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.lower_all_tuples", false], [169, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias", false], [170, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_add", false], [171, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_gelu", false], [172, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_relu", false], [173, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_sigmoid", false], [174, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_tanh", false], [175, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_unsqueeze", false], [176, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_transpose", false], [177, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_transpose_scale_add", false], [178, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.merged_embeddingbag", false], [179, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.neox_reorder_change", false], [180, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.neox_rotary_pos_emb", false], [181, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.operator_adaptor", false], [182, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.output_data", false], [183, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.padding_sequence", false], [184, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.pattern", false], [185, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.position_embeddings", false], [186, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.position_embeddings_v1", false], [187, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.qkv_merge", false], [188, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.qkv_reshape", false], [189, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quant_gather_to_bf16", false], [190, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quantize_fusion", false], [191, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quantized_graph_dtype_refactor", false], [192, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_constant_op", false], [193, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_last_view", false], [194, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_range", false], [195, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_unused_operator", false], [196, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_zeros", false], [197, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.removeslice", false], [198, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_after_restore_hidden_states", false], [199, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_before_and_after_attention_out_layer_norm_gather_elements", false], [200, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_before_restore_hidden_states", false], [201, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_fusion", false], [202, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.restore_hidden_states_in_length_adaptive_update_indices", false], [203, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.rms_norm", false], [204, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.rotary_pos_emb", false], [205, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.slicemask", false], [206, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_ExplicitNHWCTranspose", false], [207, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_ExplicitNHWCTransposeQAT", false], [208, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_MHAReshape", false], [209, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_QuantizeFusion", false], [210, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_ReshapeFusion", false], [211, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_bf16Convert", false], [212, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_collectQDQInfo", false], [213, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_insertQuantNode", false], [214, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.start_end_logits", false], [215, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.subgraph_matcher", false], [216, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncdoer_word_embedding", false], [217, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_AttentionMaskAddReshape", false], [218, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_AttentionReshape", false], [219, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_KVReshape", false], [220, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_MulReshape", false], [221, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_QReshape", false], [222, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_SoftmaxReshape", false], [223, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_causal_attention_mask", false], [224, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.token_type_embeddings", false], [225, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.token_type_embeddings_v1", false], [226, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_embedding", false], [227, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_ip_insert_bias", false], [228, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_unpack_baddbmm", false], [229, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torchinsertbf16node", false], [230, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torchpaddingsquence", false], [231, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_AttentionMaskAddReshape", false], [232, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_ConstantOfShapeWithMul", false], [233, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_FFNSlice", false], [234, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_FFNSlice_1", false], [235, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_QKVPreReshape", false], [236, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_QKVReshape", false], [237, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_QKVReshape4D", false], [238, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_encoderHiddenStatesReshape", false], [239, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_getSampleBatch", false], [240, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_sampleSlice", false], [241, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transpose_batch_matmul", false], [242, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.word_embeddings", false], [243, "module-intel_extension_for_transformers.transformers.runtime.compile.tf_utils", false], [244, "module-intel_extension_for_transformers.transformers.runtime.compile.torch_utils", false], [245, "module-intel_extension_for_transformers.transformers.runtime", false], [246, "module-intel_extension_for_transformers.transformers.trainer", false], [247, "module-intel_extension_for_transformers.transformers.utils.config", false], [248, "module-intel_extension_for_transformers.transformers.utils.get_throughput", false], [249, "module-intel_extension_for_transformers.transformers.utils", false], [250, "module-intel_extension_for_transformers.transformers.utils.metrics", false], [251, "module-intel_extension_for_transformers.transformers.utils.objectives", false], [252, "module-intel_extension_for_transformers.transformers.utils.utility", false], [253, "module-main_eval_only", false], [254, "module-main_parse_and_eval", false], [255, "module-models.backbone", false], [256, "module-models.detr", false], [257, "module-models.detr_multi", false], [258, "module-models.matcher", false], [259, "module-models.position_encoding", false], [260, "module-models.segmentation", false], [261, "module-models.transformer", false], [262, "module-text", false], [263, "module-util.box_ops", false], [264, "module-util.misc", false], [265, "module-util.plot_utils", false], [266, "module-util.postprocess", false], [267, "module-utils.data_utils", false], [268, "module-utils.eval_utils", false]], "multiheadattenion (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.MultiHeadAttenion", false]], "mutate() (intel_extension_for_transformers.transformers.dynamic.evolution.evolution method)": [[30, "intel_extension_for_transformers.transformers.dynamic.evolution.Evolution.mutate", false]], "names_from_input() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.names_from_input", false]], "neoxreorderchange (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.neox_reorder_change)": [[179, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.neox_reorder_change.NeoxReorderChange", false]], "neoxroraryposemb (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.neox_rotary_pos_emb)": [[180, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.neox_rotary_pos_emb.NeoxRoraryPosEmb", false]], "neural_engine_bin() (in module intel_extension_for_transformers.transformers.runtime)": [[245, "intel_extension_for_transformers.transformers.runtime.neural_engine_bin", false]], "nlpseq2seqtrainer (class in intel_extension_for_transformers.transformers.trainer)": [[246, "intel_extension_for_transformers.transformers.trainer.NLPSeq2SeqTrainer", false]], "nlptrainer (class in intel_extension_for_transformers.transformers.trainer)": [[246, "intel_extension_for_transformers.transformers.trainer.NLPTrainer", false]], "nms() (in module util.postprocess)": [[266, "util.postprocess.nms", false]], "nms_by_containment() (in module util.postprocess)": [[266, "util.postprocess.nms_by_containment", false]], "nms_supercells() (in module util.postprocess)": [[266, "util.postprocess.nms_supercells", false]], "normalize() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.quantile method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Quantile.normalize", false]], "normalize_str() (in module utils.eval_utils)": [[268, "utils.eval_utils.normalize_str", false]], "normmean (class in intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.NormMean", false]], "null_instance (c macro)": [[278, "c.NULL_INSTANCE", false]], "objective (class in intel_extension_for_transformers.transformers.utils.objectives)": [[251, "intel_extension_for_transformers.transformers.utils.objectives.Objective", false]], "objects_to_cells() (in module util.postprocess)": [[266, "util.postprocess.objects_to_cells", false]], "objects_to_table_structures() (in module util.postprocess)": [[266, "util.postprocess.objects_to_table_structures", false]], "on_after_eval() (intel_extension_for_transformers.transformers.pruner.pruning.pruning method)": [[47, "intel_extension_for_transformers.transformers.pruner.pruning.Pruning.on_after_eval", false]], "on_after_optimizer_step() (intel_extension_for_transformers.transformers.pruner.pruning.pruning method)": [[47, "intel_extension_for_transformers.transformers.pruner.pruning.Pruning.on_after_optimizer_step", false]], "on_before_eval() (intel_extension_for_transformers.transformers.pruner.pruning.pruning method)": [[47, "intel_extension_for_transformers.transformers.pruner.pruning.Pruning.on_before_eval", false]], "on_before_optimizer_step() (intel_extension_for_transformers.transformers.pruner.pruning.pruning method)": [[47, "intel_extension_for_transformers.transformers.pruner.pruning.Pruning.on_before_optimizer_step", false]], "on_epoch_begin() (intel_extension_for_transformers.transformers.pruner.pruning.pruning method)": [[47, "intel_extension_for_transformers.transformers.pruner.pruning.Pruning.on_epoch_begin", false]], "on_epoch_end() (intel_extension_for_transformers.transformers.pruner.pruning.pruning method)": [[47, "intel_extension_for_transformers.transformers.pruner.pruning.Pruning.on_epoch_end", false]], "on_step_begin() (intel_extension_for_transformers.transformers.pruner.pruning.pruning method)": [[47, "intel_extension_for_transformers.transformers.pruner.pruning.Pruning.on_step_begin", false]], "on_step_end() (intel_extension_for_transformers.transformers.pruner.pruning.pruning method)": [[47, "intel_extension_for_transformers.transformers.pruner.pruning.Pruning.on_step_end", false]], "on_train_begin() (intel_extension_for_transformers.transformers.pruner.pruning.pruning method)": [[47, "intel_extension_for_transformers.transformers.pruner.pruning.Pruning.on_train_begin", false]], "on_train_end() (intel_extension_for_transformers.transformers.pruner.pruning.pruning method)": [[47, "intel_extension_for_transformers.transformers.pruner.pruning.Pruning.on_train_end", false]], "onehot (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Onehot", false]], "onehot (class in intel_extension_for_transformers.transformers.runtime.compile.ops.one_hot)": [[93, "intel_extension_for_transformers.transformers.runtime.compile.ops.one_hot.OneHot", false]], "onnx_extract_operator() (in module intel_extension_for_transformers.transformers.runtime.compile.onnx_utils)": [[62, "intel_extension_for_transformers.transformers.runtime.compile.onnx_utils.onnx_extract_operator", false]], "onnxextractor (class in intel_extension_for_transformers.transformers.runtime.compile.extractors.onnx_extractor)": [[52, "intel_extension_for_transformers.transformers.runtime.compile.extractors.onnx_extractor.ONNXExtractor", false]], "onnxinput (class in intel_extension_for_transformers.transformers.runtime.compile.ops.onnx_input)": [[94, "intel_extension_for_transformers.transformers.runtime.compile.ops.onnx_input.ONNXINPUT", false]], "opany (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.OpAny", false]], "operator (class in intel_extension_for_transformers.transformers.runtime.compile.ops.op)": [[95, "intel_extension_for_transformers.transformers.runtime.compile.ops.op.Operator", false]], "operator_registry() (in module intel_extension_for_transformers.transformers.runtime.compile.ops.op)": [[95, "intel_extension_for_transformers.transformers.runtime.compile.ops.op.operator_registry", false]], "operatoradaptor (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.operator_adaptor)": [[181, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.operator_adaptor.OperatorAdaptor", false]], "optimize() (intel_extension_for_transformers.transformers.runtime.compile.optimizer.optimizer method)": [[128, "intel_extension_for_transformers.transformers.runtime.compile.optimizer.Optimizer.optimize", false]], "optimize_model() (in module intel_extension_for_transformers.neural_chat.chatbot)": [[4, "intel_extension_for_transformers.neural_chat.chatbot.optimize_model", false]], "optimizedataset (class in intel_extension_for_transformers.transformers.runtime.compile.ops.optimize_dataset)": [[96, "intel_extension_for_transformers.transformers.runtime.compile.ops.optimize_dataset.OptimizeDataset", false]], "optimizedmodel (class in intel_extension_for_transformers.transformers.modeling.model)": [[35, "intel_extension_for_transformers.transformers.modeling.model.OptimizedModel", false]], "optimizer (class in intel_extension_for_transformers.transformers.runtime.compile.optimizer)": [[128, "intel_extension_for_transformers.transformers.runtime.compile.optimizer.Optimizer", false]], "orchestrate_optimizations() (intel_extension_for_transformers.transformers.trainer.basetrainer method)": [[246, "intel_extension_for_transformers.transformers.trainer.BaseTrainer.orchestrate_optimizations", false]], "output (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Output", false]], "outputdata (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.output_data)": [[182, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.output_data.OutputData", false]], "overlaps() (in module util.postprocess)": [[266, "util.postprocess.overlaps", false]], "pack (class in intel_extension_for_transformers.transformers.runtime.compile.ops.pack)": [[97, "intel_extension_for_transformers.transformers.runtime.compile.ops.pack.Pack", false]], "packagepositionembedding (class in intel_extension_for_transformers.transformers.runtime.compile.ops.pos_embed)": [[100, "intel_extension_for_transformers.transformers.runtime.compile.ops.pos_embed.PackagePositionEmbedding", false]], "paddingsequence (class in intel_extension_for_transformers.transformers.runtime.compile.ops.padding_sequence)": [[98, "intel_extension_for_transformers.transformers.runtime.compile.ops.padding_sequence.PaddingSequence", false]], "paddingsequence (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.padding_sequence)": [[183, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.padding_sequence.PaddingSequence", false]], "pareto_frontier() (intel_extension_for_transformers.transformers.dynamic.evolution.evolution method)": [[30, "intel_extension_for_transformers.transformers.dynamic.evolution.Evolution.pareto_frontier", false]], "parse_args() (in module gaudi_spawn)": [[1, "gaudi_spawn.parse_args", false]], "parse_multi_choice_response() (in module utils.eval_utils)": [[268, "utils.eval_utils.parse_multi_choice_response", false]], "parse_open_response() (in module utils.eval_utils)": [[268, "utils.eval_utils.parse_open_response", false]], "pattern (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.pattern)": [[184, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.pattern.Pattern", false]], "pattern_mapping() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.pattern_mapping", false]], "pattern_mapping_conf_validation() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.pattern_mapping_conf_validation", false]], "pattern_registry() (in module intel_extension_for_transformers.transformers.runtime.compile.sub_graph.pattern)": [[184, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.pattern.pattern_registry", false]], "performance() (intel_extension_for_transformers.transformers.utils.objectives.objective static method)": [[251, "intel_extension_for_transformers.transformers.utils.objectives.Objective.performance", false]], "placeholder (class in intel_extension_for_transformers.transformers.runtime.compile.ops.placeholder)": [[99, "intel_extension_for_transformers.transformers.runtime.compile.ops.placeholder.Placeholder", false]], "plot_logs() (in module util.plot_utils)": [[265, "util.plot_utils.plot_logs", false]], "positionembeddinglearned (class in models.position_encoding)": [[259, "models.position_encoding.PositionEmbeddingLearned", false]], "positionembeddings (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.position_embeddings)": [[185, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.position_embeddings.PositionEmbeddings", false]], "positionembeddingsine (class in models.position_encoding)": [[259, "models.position_encoding.PositionEmbeddingSine", false]], "positionembeddingsv1 (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.position_embeddings_v1)": [[186, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.position_embeddings_v1.PositionEmbeddingsV1", false]], "positionids (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.PositionIds", false]], "post_init_cpu() (intel_extension_for_transformers.transformers.utils.config.itrexquantizationconfigmixin method)": [[247, "intel_extension_for_transformers.transformers.utils.config.ITREXQuantizationConfigMixin.post_init_cpu", false]], "post_init_gptq() (intel_extension_for_transformers.transformers.utils.config.gptqconfig method)": [[247, "intel_extension_for_transformers.transformers.utils.config.GPTQConfig.post_init_gptq", false]], "post_init_runtime() (intel_extension_for_transformers.transformers.utils.config.itrexquantizationconfigmixin method)": [[247, "intel_extension_for_transformers.transformers.utils.config.ITREXQuantizationConfigMixin.post_init_runtime", false]], "post_init_xpu() (intel_extension_for_transformers.transformers.utils.config.itrexquantizationconfigmixin method)": [[247, "intel_extension_for_transformers.transformers.utils.config.ITREXQuantizationConfigMixin.post_init_xpu", false]], "postprocess (class in models.detr)": [[256, "models.detr.PostProcess", false]], "postprocess (class in models.detr_multi)": [[257, "models.detr_multi.PostProcess", false]], "postprocesspanoptic (class in models.segmentation)": [[260, "models.segmentation.PostProcessPanoptic", false]], "pow (class in intel_extension_for_transformers.transformers.runtime.compile.ops.pow)": [[101, "intel_extension_for_transformers.transformers.runtime.compile.ops.pow.Pow", false]], "prepare_inputs_for_generation() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertformaskedlm method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForMaskedLM.prepare_inputs_for_generation", false]], "prepare_inputs_for_generation() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertlmheadmodel method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertLMHeadModel.prepare_inputs_for_generation", false]], "prepare_inputs_for_generation() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertaforcausallm method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForCausalLM.prepare_inputs_for_generation", false]], "preprocess_model() (in module intel_extension_for_transformers.transformers.benchmark)": [[27, "intel_extension_for_transformers.transformers.benchmark.preprocess_model", false]], "provider (class in intel_extension_for_transformers.transformers.config)": [[28, "intel_extension_for_transformers.transformers.config.Provider", false]], "prune() (intel_extension_for_transformers.transformers.trainer.basetrainer method)": [[246, "intel_extension_for_transformers.transformers.trainer.BaseTrainer.prune", false]], "prune_heads() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertattention method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertAttention.prune_heads", false]], "prune_heads() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertaattention method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaAttention.prune_heads", false]], "pruner_info (intel_extension_for_transformers.transformers.pruner.pruning.pruning attribute)": [[47, "intel_extension_for_transformers.transformers.pruner.pruning.Pruning.pruner_info", false]], "pruners (intel_extension_for_transformers.transformers.pruner.pruning.pruning attribute)": [[47, "intel_extension_for_transformers.transformers.pruner.pruning.Pruning.pruners", false]], "prunerv2 (class in intel_extension_for_transformers.transformers.config)": [[28, "intel_extension_for_transformers.transformers.config.PrunerV2", false]], "pruning (class in intel_extension_for_transformers.transformers.pruner.pruning)": [[47, "intel_extension_for_transformers.transformers.pruner.pruning.Pruning", false]], "pull_key_prefix() (in module intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.pull_key_prefix", false]], "push_key_prefix() (in module intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.push_key_prefix", false]], "qkvmerge (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.qkv_merge)": [[187, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.qkv_merge.QKVMerge", false]], "qkvreshape (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.qkv_reshape)": [[188, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.qkv_reshape.QKVReshape", false]], "qlinearadd (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.QLinearAdd", false]], "qlinearmatmul (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.QLinearMatMul", false]], "qlinearmul (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.QLinearMul", false]], "quant_info_init() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.quant_info_init", false]], "quantawaretrainingconfig (class in intel_extension_for_transformers.transformers.utils.config)": [[247, "intel_extension_for_transformers.transformers.utils.config.QuantAwareTrainingConfig", false]], "quantile (class in intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Quantile", false]], "quantizationmethod (class in intel_extension_for_transformers.transformers.utils.config)": [[247, "intel_extension_for_transformers.transformers.utils.config.QuantizationMethod", false]], "quantize (class in intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_linear)": [[102, "intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_linear.Quantize", false]], "quantize() (intel_extension_for_transformers.transformers.trainer.basetrainer method)": [[246, "intel_extension_for_transformers.transformers.trainer.BaseTrainer.quantize", false]], "quantizedgraphdtypecheck (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quantized_graph_dtype_refactor)": [[191, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quantized_graph_dtype_refactor.QuantizedGraphDtypeCheck", false]], "quantizedmatmulwithbiasanddequantize (class in intel_extension_for_transformers.transformers.runtime.compile.ops.quantized_matmul_with_bias_and_dequantize)": [[105, "intel_extension_for_transformers.transformers.runtime.compile.ops.quantized_matmul_with_bias_and_dequantize.QuantizedMatMulWithBiasAndDequantize", false]], "quantizefusion (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quantize_fusion)": [[190, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quantize_fusion.QuantizeFusion", false]], "quantizelinear (class in intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_linear)": [[102, "intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_linear.QuantizeLinear", false]], "quantizev2 (class in intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_v2)": [[103, "intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_v2.QuantizeV2", false]], "range (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Range", false]], "realdiv (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.RealDiv", false]], "reciprocal (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Reciprocal", false]], "recursive_copy() (in module intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook)": [[24, "intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook.recursive_copy", false]], "reduce_dict() (in module util.misc)": [[264, "util.misc.reduce_dict", false]], "reducemean (class in intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_mean)": [[106, "intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_mean.ReduceMean", false]], "reducesum (class in intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_sum)": [[107, "intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_sum.ReduceSum", false]], "refactor_batch_size() (in module intel_extension_for_transformers.transformers.benchmark)": [[27, "intel_extension_for_transformers.transformers.benchmark.refactor_batch_size", false]], "refine_columns() (in module util.postprocess)": [[266, "util.postprocess.refine_columns", false]], "refine_rows() (in module util.postprocess)": [[266, "util.postprocess.refine_rows", false]], "refine_table_structures() (in module util.postprocess)": [[266, "util.postprocess.refine_table_structures", false]], "register_conv_template() (in module conversation)": [[0, "conversation.register_conv_template", false]], "relu (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Relu", false]], "remove_environ_info_item() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.remove_environ_info_item", false]], "remove_environ_info_items() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.remove_environ_info_items", false]], "remove_nodes() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.remove_nodes", false]], "remove_objects_without_content() (in module util.postprocess)": [[266, "util.postprocess.remove_objects_without_content", false]], "remove_supercell_overlap() (in module util.postprocess)": [[266, "util.postprocess.remove_supercell_overlap", false]], "removeconstantop (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_constant_op)": [[192, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_constant_op.RemoveConstantOP", false]], "removelastview (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_last_view)": [[193, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_last_view.RemoveLastView", false]], "removerange (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_range)": [[194, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_range.RemoveRange", false]], "removeslice (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.removeslice)": [[197, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.removeslice.RemoveSlice", false]], "removeunusedoperator (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_unused_operator)": [[195, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_unused_operator.RemoveUnusedOperator", false]], "removezeros (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_zeros)": [[196, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_zeros.RemoveZeros", false]], "rename_node() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.rename_node", false]], "reorder (class in intel_extension_for_transformers.transformers.runtime.compile.ops.reorder)": [[108, "intel_extension_for_transformers.transformers.runtime.compile.ops.reorder.Reorder", false]], "repeat (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Repeat", false]], "replace_module() (in module intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook)": [[24, "intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook.replace_module", false]], "reshape (class in intel_extension_for_transformers.transformers.runtime.compile.ops.reshape)": [[109, "intel_extension_for_transformers.transformers.runtime.compile.ops.reshape.Reshape", false]], "reshapeafterrestorehiddenstates (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_after_restore_hidden_states)": [[198, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_after_restore_hidden_states.ReshapeAfterRestoreHiddenStates", false]], "reshapebeforeandafterattentionoutlayernormgatherelements (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_before_and_after_attention_out_layer_norm_gather_elements)": [[199, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_before_and_after_attention_out_layer_norm_gather_elements.ReshapeBeforeAndAfterAttentionOutLayerNormGatherElements", false]], "reshapebeforerestorehiddenstates (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_before_restore_hidden_states)": [[200, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_before_restore_hidden_states.ReshapeBeforeRestoreHiddenStates", false]], "reshapefusion (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_fusion)": [[201, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_fusion.ReshapeFusion", false]], "resize (class in intel_extension_for_transformers.transformers.runtime.compile.ops.resize)": [[110, "intel_extension_for_transformers.transformers.runtime.compile.ops.resize.Resize", false]], "resnet101() (in module intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks)": [[17, "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks.resnet101", false]], "resnet152() (in module intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks)": [[17, "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks.resnet152", false]], "resnet18() (in module intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks)": [[17, "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks.resnet18", false]], "resnet34() (in module intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks)": [[17, "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks.resnet34", false]], "resnet50() (in module intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks)": [[17, "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks.resnet50", false]], "resnext101_32x8d() (in module intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks)": [[17, "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks.resnext101_32x8d", false]], "resnext50_32x4d() (in module intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks)": [[17, "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks.resnext50_32x4d", false]], "resolve_state_dict() (in module intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.resolve_state_dict", false]], "restorehiddenstatesinlengthadaptive (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.restore_hidden_states_in_length_adaptive_update_indices)": [[202, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.restore_hidden_states_in_length_adaptive_update_indices.RestoreHiddenStatesInLengthAdaptive", false]], "retrievaltypeoptions (class in intel_extension_for_transformers.neural_chat.config)": [[5, "intel_extension_for_transformers.neural_chat.config.RetrievalTypeOptions", false]], "retrieveradapter (class in intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.retriever_adapter)": [[14, "intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.retriever_adapter.RetrieverAdapter", false]], "rmsnorm (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.rms_norm)": [[203, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.rms_norm.RmsNorm", false]], "robertaattention (class in intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaAttention", false]], "robertaclassificationhead (class in intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaClassificationHead", false]], "robertaembeddings (class in intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaEmbeddings", false]], "robertaencoder (class in intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaEncoder", false]], "robertaforcausallm (class in intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForCausalLM", false]], "robertaformaskedlm (class in intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForMaskedLM", false]], "robertaformultiplechoice (class in intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForMultipleChoice", false]], "robertaforquestionanswering (class in intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForQuestionAnswering", false]], "robertaforsequenceclassification (class in intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForSequenceClassification", false]], "robertafortokenclassification (class in intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForTokenClassification", false]], "robertaintermediate (class in intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaIntermediate", false]], "robertalayer (class in intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaLayer", false]], "robertalmhead (class in intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaLMHead", false]], "robertamodel (class in intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaModel", false]], "robertaoutput (class in intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaOutput", false]], "robertapooler (class in intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaPooler", false]], "robertapretrainedmodel (class in intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaPreTrainedModel", false]], "robertaselfattention (class in intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaSelfAttention", false]], "robertaselfoutput (class in intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaSelfOutput", false]], "roraryposemb (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.rotary_pos_emb)": [[204, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.rotary_pos_emb.RoraryPosEmb", false]], "rsqrt (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Rsqrt", false]], "rsub (class in intel_extension_for_transformers.transformers.runtime.compile.ops.rsub)": [[111, "intel_extension_for_transformers.transformers.runtime.compile.ops.rsub.Rsub", false]], "rtnconfig (class in intel_extension_for_transformers.transformers.utils.config)": [[247, "intel_extension_for_transformers.transformers.utils.config.RtnConfig", false]], "run_evolutionary_search() (intel_extension_for_transformers.transformers.trainer.basetrainer method)": [[246, "intel_extension_for_transformers.transformers.trainer.BaseTrainer.run_evolutionary_search", false]], "sample_layer_configuration() (in module intel_extension_for_transformers.transformers.dynamic.drop_and_restore_utils)": [[29, "intel_extension_for_transformers.transformers.dynamic.drop_and_restore_utils.sample_layer_configuration", false]], "sample_length_configuration() (in module intel_extension_for_transformers.transformers.dynamic.drop_and_restore_utils)": [[29, "intel_extension_for_transformers.transformers.dynamic.drop_and_restore_utils.sample_length_configuration", false]], "sample_portion() (in module intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.sample_portion", false]], "save() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.stat method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Stat.save", false]], "save() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.save", false]], "save_cached_state() (in module intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.save_cached_state", false]], "save_jsonl() (in module utils.data_utils)": [[267, "utils.data_utils.save_jsonl", false]], "save_population() (intel_extension_for_transformers.transformers.dynamic.evolution.evolution method)": [[30, "intel_extension_for_transformers.transformers.dynamic.evolution.Evolution.save_population", false]], "save_pretrained() (intel_extension_for_transformers.transformers.utils.config.itrexquantizationconfigmixin method)": [[247, "intel_extension_for_transformers.transformers.utils.config.ITREXQuantizationConfigMixin.save_pretrained", false]], "save_store() (intel_extension_for_transformers.transformers.dynamic.evolution.evolution method)": [[30, "intel_extension_for_transformers.transformers.dynamic.evolution.Evolution.save_store", false]], "scatterelements (class in intel_extension_for_transformers.transformers.runtime.compile.ops.scatter_elements)": [[112, "intel_extension_for_transformers.transformers.runtime.compile.ops.scatter_elements.ScatterElements", false]], "search_kwargs (intel_extension_for_transformers.langchain.langchain_community.retrievers.child_parent_retriever.childparentretriever attribute)": [[2, "intel_extension_for_transformers.langchain.langchain_community.retrievers.child_parent_retriever.ChildParentRetriever.search_kwargs", false]], "search_pattern() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.search_pattern", false]], "search_straight_pattern() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.search_straight_pattern", false]], "search_type (intel_extension_for_transformers.langchain.langchain_community.retrievers.child_parent_retriever.childparentretriever attribute)": [[2, "intel_extension_for_transformers.langchain.langchain_community.retrievers.child_parent_retriever.ChildParentRetriever.search_type", false]], "searchtype (class in intel_extension_for_transformers.langchain.langchain_community.retrievers.child_parent_retriever)": [[2, "intel_extension_for_transformers.langchain.langchain_community.retrievers.child_parent_retriever.SearchType", false]], "secondmoment (class in intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.SecondMoment", false]], "separatorstyle (class in conversation)": [[0, "conversation.SeparatorStyle", false]], "sequencelength (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.SequenceLength", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.all.all method)": [[63, "intel_extension_for_transformers.transformers.runtime.compile.ops.all.All.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.assert.assert method)": [[64, "intel_extension_for_transformers.transformers.runtime.compile.ops.assert.Assert.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.baddbmm.baddbmm method)": [[65, "intel_extension_for_transformers.transformers.runtime.compile.ops.baddbmm.Baddbmm.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul.batchmatmul method)": [[66, "intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul.BatchMatMul.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul_v2.batchmatmulv2 method)": [[67, "intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul_v2.BatchMatMulV2.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.bias_add.biasadd method)": [[68, "intel_extension_for_transformers.transformers.runtime.compile.ops.bias_add.BiasAdd.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.cast.cast method)": [[69, "intel_extension_for_transformers.transformers.runtime.compile.ops.cast.Cast.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.concat.concat method)": [[70, "intel_extension_for_transformers.transformers.runtime.compile.ops.concat.Concat.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.conv.conv method)": [[71, "intel_extension_for_transformers.transformers.runtime.compile.ops.conv.Conv.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.cos.cos method)": [[72, "intel_extension_for_transformers.transformers.runtime.compile.ops.cos.Cos.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.expand_dims.expanddims method)": [[74, "intel_extension_for_transformers.transformers.runtime.compile.ops.expand_dims.ExpandDims.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.fused_batch_norm_v3.fusedbatchnormv3 method)": [[76, "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_batch_norm_v3.FusedBatchNormV3.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.fused_gemm.fusedgemm method)": [[77, "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_gemm.FusedGemm.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.fused_matmul.fusedmatmul method)": [[78, "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_matmul.FusedMatMul.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.gather.gather method)": [[79, "intel_extension_for_transformers.transformers.runtime.compile.ops.gather.Gather.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.gather.gatherv2 method)": [[79, "intel_extension_for_transformers.transformers.runtime.compile.ops.gather.GatherV2.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.gather_elements.gatherelements method)": [[80, "intel_extension_for_transformers.transformers.runtime.compile.ops.gather_elements.GatherElements.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.gelu.gelu method)": [[81, "intel_extension_for_transformers.transformers.runtime.compile.ops.gelu.Gelu.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.gemm.gemm method)": [[82, "intel_extension_for_transformers.transformers.runtime.compile.ops.gemm.Gemm.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_get_next.iteratorgetnext method)": [[84, "intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_get_next.IteratorGetNext.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_v2.iteratorv2 method)": [[85, "intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_v2.IteratorV2.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.layer_normalization.layernormalization method)": [[86, "intel_extension_for_transformers.transformers.runtime.compile.ops.layer_normalization.LayerNormalization.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.log_softmax.logsoftmax method)": [[87, "intel_extension_for_transformers.transformers.runtime.compile.ops.log_softmax.LogSoftmax.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.map_and_batch_dataset.mapandbatchdataset method)": [[88, "intel_extension_for_transformers.transformers.runtime.compile.ops.map_and_batch_dataset.MapAndBatchDataset.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.matmul.matmul method)": [[89, "intel_extension_for_transformers.transformers.runtime.compile.ops.matmul.MatMul.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.mean.mean method)": [[90, "intel_extension_for_transformers.transformers.runtime.compile.ops.mean.Mean.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.model_dataset.modeldataset method)": [[92, "intel_extension_for_transformers.transformers.runtime.compile.ops.model_dataset.ModelDataset.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.one_hot.onehot method)": [[93, "intel_extension_for_transformers.transformers.runtime.compile.ops.one_hot.OneHot.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.op.operator method)": [[95, "intel_extension_for_transformers.transformers.runtime.compile.ops.op.Operator.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.optimize_dataset.optimizedataset method)": [[96, "intel_extension_for_transformers.transformers.runtime.compile.ops.optimize_dataset.OptimizeDataset.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.pack.pack method)": [[97, "intel_extension_for_transformers.transformers.runtime.compile.ops.pack.Pack.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.padding_sequence.paddingsequence method)": [[98, "intel_extension_for_transformers.transformers.runtime.compile.ops.padding_sequence.PaddingSequence.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.placeholder.placeholder method)": [[99, "intel_extension_for_transformers.transformers.runtime.compile.ops.placeholder.Placeholder.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.pos_embed.packagepositionembedding method)": [[100, "intel_extension_for_transformers.transformers.runtime.compile.ops.pos_embed.PackagePositionEmbedding.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.pow.pow method)": [[101, "intel_extension_for_transformers.transformers.runtime.compile.ops.pow.Pow.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_linear.quantize method)": [[102, "intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_linear.Quantize.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_linear.quantizelinear method)": [[102, "intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_linear.QuantizeLinear.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_v2.quantizev2 method)": [[103, "intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_v2.QuantizeV2.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.quantized_matmul_with_bias_and_dequantize.quantizedmatmulwithbiasanddequantize method)": [[105, "intel_extension_for_transformers.transformers.runtime.compile.ops.quantized_matmul_with_bias_and_dequantize.QuantizedMatMulWithBiasAndDequantize.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_mean.reducemean method)": [[106, "intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_mean.ReduceMean.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_sum.reducesum method)": [[107, "intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_sum.ReduceSum.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.reorder.reorder method)": [[108, "intel_extension_for_transformers.transformers.runtime.compile.ops.reorder.Reorder.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.reshape.reshape method)": [[109, "intel_extension_for_transformers.transformers.runtime.compile.ops.reshape.Reshape.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.resize.resize method)": [[110, "intel_extension_for_transformers.transformers.runtime.compile.ops.resize.Resize.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.rsub.rsub method)": [[111, "intel_extension_for_transformers.transformers.runtime.compile.ops.rsub.Rsub.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.scatter_elements.scatterelements method)": [[112, "intel_extension_for_transformers.transformers.runtime.compile.ops.scatter_elements.ScatterElements.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.shape.shape method)": [[113, "intel_extension_for_transformers.transformers.runtime.compile.ops.shape.Shape.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.sin.sin method)": [[114, "intel_extension_for_transformers.transformers.runtime.compile.ops.sin.Sin.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.size.size method)": [[115, "intel_extension_for_transformers.transformers.runtime.compile.ops.size.Size.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.slice_position_ids.slicepositionids method)": [[116, "intel_extension_for_transformers.transformers.runtime.compile.ops.slice_position_ids.SlicePositionIds.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.softmax.softmax method)": [[117, "intel_extension_for_transformers.transformers.runtime.compile.ops.softmax.Softmax.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.split.split method)": [[118, "intel_extension_for_transformers.transformers.runtime.compile.ops.split.Split.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.squeeze.squeeze method)": [[119, "intel_extension_for_transformers.transformers.runtime.compile.ops.squeeze.Squeeze.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.strided_slice.stridedslice method)": [[120, "intel_extension_for_transformers.transformers.runtime.compile.ops.strided_slice.StridedSlice.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.top_k.topk method)": [[122, "intel_extension_for_transformers.transformers.runtime.compile.ops.top_k.TopK.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.transpose.transpose method)": [[123, "intel_extension_for_transformers.transformers.runtime.compile.ops.transpose.Transpose.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.unpack.unpack method)": [[124, "intel_extension_for_transformers.transformers.runtime.compile.ops.unpack.Unpack.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.unsqueeze.unsqueeze method)": [[125, "intel_extension_for_transformers.transformers.runtime.compile.ops.unsqueeze.Unsqueeze.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.view.view method)": [[126, "intel_extension_for_transformers.transformers.runtime.compile.ops.view.View.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.where.where method)": [[127, "intel_extension_for_transformers.transformers.runtime.compile.ops.where.Where.set_attr", false]], "set_autocast() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.set_autocast", false]], "set_dynamic_config() (intel_extension_for_transformers.transformers.trainer.basetrainer method)": [[246, "intel_extension_for_transformers.transformers.trainer.BaseTrainer.set_dynamic_config", false]], "set_environ_var() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.set_environ_var", false]], "set_environ_vars() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.set_environ_vars", false]], "set_input_embeddings() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertmodel method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertModel.set_input_embeddings", false]], "set_input_embeddings() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertamodel method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaModel.set_input_embeddings", false]], "set_length_config() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertmodel method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertModel.set_length_config", false]], "set_length_config() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertamodel method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaModel.set_length_config", false]], "set_lower_constraint() (intel_extension_for_transformers.transformers.dynamic.evolution.evolution method)": [[30, "intel_extension_for_transformers.transformers.dynamic.evolution.Evolution.set_lower_constraint", false]], "set_output_attentions() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertmodel method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertModel.set_output_attentions", false]], "set_output_attentions() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertamodel method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaModel.set_output_attentions", false]], "set_output_embeddings() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertformaskedlm method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForMaskedLM.set_output_embeddings", false]], "set_output_embeddings() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertforpretraining method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForPreTraining.set_output_embeddings", false]], "set_output_embeddings() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertlmheadmodel method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertLMHeadModel.set_output_embeddings", false]], "set_output_embeddings() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertaforcausallm method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForCausalLM.set_output_embeddings", false]], "set_output_embeddings() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertaformaskedlm method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForMaskedLM.set_output_embeddings", false]], "set_requires_grad() (in module intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook)": [[24, "intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook.set_requires_grad", false]], "set_system_message() (conversation.conversation method)": [[0, "conversation.Conversation.set_system_message", false]], "set_upper_constraint() (intel_extension_for_transformers.transformers.dynamic.evolution.evolution method)": [[30, "intel_extension_for_transformers.transformers.dynamic.evolution.Evolution.set_upper_constraint", false]], "setcriterion (class in models.detr)": [[256, "models.detr.SetCriterion", false]], "setcriterion (class in models.detr_multi)": [[257, "models.detr_multi.SetCriterion", false]], "setup_for_distributed() (in module util.misc)": [[264, "util.misc.setup_for_distributed", false]], "shape (class in intel_extension_for_transformers.transformers.runtime.compile.ops.shape)": [[113, "intel_extension_for_transformers.transformers.runtime.compile.ops.shape.Shape", false]], "sigmoid (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Sigmoid", false]], "sigmoid_focal_loss() (in module models.segmentation)": [[260, "models.segmentation.sigmoid_focal_loss", false]], "silu (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Silu", false]], "similarity (intel_extension_for_transformers.langchain.langchain_community.retrievers.child_parent_retriever.searchtype attribute)": [[2, "intel_extension_for_transformers.langchain.langchain_community.retrievers.child_parent_retriever.SearchType.similarity", false]], "sin (class in intel_extension_for_transformers.transformers.runtime.compile.ops.sin)": [[114, "intel_extension_for_transformers.transformers.runtime.compile.ops.sin.Sin", false]], "size (class in intel_extension_for_transformers.transformers.runtime.compile.ops.size)": [[115, "intel_extension_for_transformers.transformers.runtime.compile.ops.size.Size", false]], "slicemask (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.slicemask)": [[205, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.slicemask.SliceMask", false]], "slicepositionids (class in intel_extension_for_transformers.transformers.runtime.compile.ops.slice_position_ids)": [[116, "intel_extension_for_transformers.transformers.runtime.compile.ops.slice_position_ids.SlicePositionIds", false]], "slot_into_containers() (in module util.postprocess)": [[266, "util.postprocess.slot_into_containers", false]], "smoothedvalue (class in util.misc)": [[264, "util.misc.SmoothedValue", false]], "smoothquantconfig (class in intel_extension_for_transformers.transformers.utils.config)": [[247, "intel_extension_for_transformers.transformers.utils.config.SmoothQuantConfig", false]], "softmax (class in intel_extension_for_transformers.transformers.runtime.compile.ops.softmax)": [[117, "intel_extension_for_transformers.transformers.runtime.compile.ops.softmax.Softmax", false]], "sort_objects_by_score() (in module util.postprocess)": [[266, "util.postprocess.sort_objects_by_score", false]], "sort_objects_left_to_right() (in module util.postprocess)": [[266, "util.postprocess.sort_objects_left_to_right", false]], "sort_objects_top_to_bottom() (in module util.postprocess)": [[266, "util.postprocess.sort_objects_top_to_bottom", false]], "split (class in intel_extension_for_transformers.transformers.runtime.compile.ops.split)": [[118, "intel_extension_for_transformers.transformers.runtime.compile.ops.split.Split", false]], "sqrt (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Sqrt", false]], "square (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Square", false]], "squareddifference (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.SquaredDifference", false]], "squeeze (class in intel_extension_for_transformers.transformers.runtime.compile.ops.squeeze)": [[119, "intel_extension_for_transformers.transformers.runtime.compile.ops.squeeze.Squeeze", false]], "stablediffusion_bf16convert (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stablediffusion_bf16convert)": [[211, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_bf16Convert.StableDiffusion_bf16Convert", false]], "stablediffusion_collectquantinfo (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stablediffusion_collectqdqinfo)": [[212, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_collectQDQInfo.StableDiffusion_CollectQuantInfo", false]], "stablediffusion_insertquantnode (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stablediffusion_insertquantnode)": [[213, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_insertQuantNode.StableDiffusion_InsertQuantNode", false]], "stablediffusion_mhareshape (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stablediffusion_mhareshape)": [[208, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_MHAReshape.StableDiffusion_MHAReshape", false]], "stablediffusion_quantizefusion (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stablediffusion_quantizefusion)": [[209, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_QuantizeFusion.StableDiffusion_QuantizeFusion", false]], "stablediffusion_reshapefusion (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stablediffusion_reshapefusion)": [[210, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_ReshapeFusion.StableDiffusion_ReshapeFusion", false]], "stablediffusioninstructpix2pixpipeline (class in intel_extension_for_transformers.neural_chat.pipeline.plugins.image2image.instructpix2pix_pipeline)": [[9, "intel_extension_for_transformers.neural_chat.pipeline.plugins.image2image.instructpix2pix_pipeline.StableDiffusionInstructPix2PixPipeline", false]], "stack (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Stack", false]], "start_pipeline() (in module intel_extension_for_transformers.transformers.runtime.compile.compile)": [[49, "intel_extension_for_transformers.transformers.runtime.compile.compile.start_pipeline", false]], "startendlogits (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.start_end_logits)": [[214, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.start_end_logits.StartEndLogits", false]], "stat (class in intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Stat", false]], "state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.bincount method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Bincount.state_dict", false]], "state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.combinedstat method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.CombinedStat.state_dict", false]], "state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.covariance method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Covariance.state_dict", false]], "state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.crosscovariance method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.CrossCovariance.state_dict", false]], "state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.crossiou method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.CrossIoU.state_dict", false]], "state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.history method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.History.state_dict", false]], "state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.iou method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.IoU.state_dict", false]], "state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.mean method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Mean.state_dict", false]], "state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.quantile method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Quantile.state_dict", false]], "state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.secondmoment method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.SecondMoment.state_dict", false]], "state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.stat method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Stat.state_dict", false]], "state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.variance method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Variance.state_dict", false]], "staticquantconfig (class in intel_extension_for_transformers.transformers.utils.config)": [[247, "intel_extension_for_transformers.transformers.utils.config.StaticQuantConfig", false]], "stopforward": [[24, "intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook.StopForward", false]], "stopgradient (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.StopGradient", false]], "store2str() (in module intel_extension_for_transformers.transformers.dynamic.evolution)": [[30, "intel_extension_for_transformers.transformers.dynamic.evolution.store2str", false]], "str2list() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.str2list", false]], "stridedslice (class in intel_extension_for_transformers.transformers.runtime.compile.ops.strided_slice)": [[120, "intel_extension_for_transformers.transformers.runtime.compile.ops.strided_slice.StridedSlice", false]], "subgraphmatcher (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.subgraph_matcher)": [[215, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.subgraph_matcher.SubGraphMatcher", false]], "subsequence() (in module intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook)": [[24, "intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook.subsequence", false]], "synchronize_between_processes() (util.misc.smoothedvalue method)": [[264, "util.misc.SmoothedValue.synchronize_between_processes", false]], "table_structure_to_cells() (in module util.postprocess)": [[266, "util.postprocess.table_structure_to_cells", false]], "tally() (in module intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.tally", false]], "tanh (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Tanh", false]], "tensor (class in intel_extension_for_transformers.transformers.runtime.compile.ops.tensor)": [[121, "intel_extension_for_transformers.transformers.runtime.compile.ops.tensor.Tensor", false]], "tensorflowextractor (class in intel_extension_for_transformers.transformers.runtime.compile.extractors.tf_extractor)": [[53, "intel_extension_for_transformers.transformers.runtime.compile.extractors.tf_extractor.TensorflowExtractor", false]], "tensorslicedataset (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.TensorSliceDataset", false]], "teqconfig (class in intel_extension_for_transformers.transformers.utils.config)": [[247, "intel_extension_for_transformers.transformers.utils.config.TeqConfig", false]], "text": [[262, "module-text", false]], "text_to_sequence() (in module text)": [[262, "text.text_to_sequence", false]], "textencoder_attentionmaskaddreshape (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textencoder_attentionmaskaddreshape)": [[217, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_AttentionMaskAddReshape.TextEncoder_AttentionMaskAddReshape", false]], "textencoder_attentionreshape (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textencoder_attentionreshape)": [[218, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_AttentionReshape.TextEncoder_AttentionReshape", false]], "textencoder_casualattentionmask (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textencoder_causal_attention_mask)": [[223, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_causal_attention_mask.TextEncoder_CasualAttentionMask", false]], "textencoder_kvreshape (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textencoder_kvreshape)": [[219, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_KVReshape.TextEncoder_KVReshape", false]], "textencoder_mulreshape (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textencoder_mulreshape)": [[220, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_MulReshape.TextEncoder_MulReshape", false]], "textencoder_qreshape (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textencoder_qreshape)": [[221, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_QReshape.TextEncoder_QReshape", false]], "textencoder_softmaxreshape (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textencoder_softmaxreshape)": [[222, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_SoftmaxReshape.TextEncoder_SoftmaxReshape", false]], "textencoder_wordembedding (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textencdoer_word_embedding)": [[216, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncdoer_word_embedding.TextEncoder_WordEmbedding", false]], "tf_dtype_id (in module intel_extension_for_transformers.transformers.runtime.compile.tf_utils)": [[243, "intel_extension_for_transformers.transformers.runtime.compile.tf_utils.TF_DTYPE_ID", false]], "tf_extract_operator() (in module intel_extension_for_transformers.transformers.runtime.compile.tf_utils)": [[243, "intel_extension_for_transformers.transformers.runtime.compile.tf_utils.tf_extract_operator", false]], "tile (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Tile", false]], "to_() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.bincount method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Bincount.to_", false]], "to_() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.combinedstat method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.CombinedStat.to_", false]], "to_() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.covariance method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Covariance.to_", false]], "to_() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.crosscovariance method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.CrossCovariance.to_", false]], "to_() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.crossiou method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.CrossIoU.to_", false]], "to_() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.history method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.History.to_", false]], "to_() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.iou method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.IoU.to_", false]], "to_() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.mean method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Mean.to_", false]], "to_() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.quantile method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Quantile.to_", false]], "to_() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.secondmoment method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.SecondMoment.to_", false]], "to_() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.stat method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Stat.to_", false]], "to_() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.variance method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Variance.to_", false]], "to_diff_dict() (intel_extension_for_transformers.transformers.utils.config.autoroundconfig method)": [[247, "intel_extension_for_transformers.transformers.utils.config.AutoRoundConfig.to_diff_dict", false]], "to_diff_dict() (intel_extension_for_transformers.transformers.utils.config.awqconfig method)": [[247, "intel_extension_for_transformers.transformers.utils.config.AwqConfig.to_diff_dict", false]], "to_diff_dict() (intel_extension_for_transformers.transformers.utils.config.gptqconfig method)": [[247, "intel_extension_for_transformers.transformers.utils.config.GPTQConfig.to_diff_dict", false]], "to_diff_dict() (intel_extension_for_transformers.transformers.utils.config.rtnconfig method)": [[247, "intel_extension_for_transformers.transformers.utils.config.RtnConfig.to_diff_dict", false]], "to_diff_dict() (intel_extension_for_transformers.transformers.utils.config.teqconfig method)": [[247, "intel_extension_for_transformers.transformers.utils.config.TeqConfig.to_diff_dict", false]], "to_gradio_chatbot() (conversation.conversation method)": [[0, "conversation.Conversation.to_gradio_chatbot", false]], "to_json_file() (intel_extension_for_transformers.transformers.utils.config.itrexquantizationconfigmixin method)": [[247, "intel_extension_for_transformers.transformers.utils.config.ITREXQuantizationConfigMixin.to_json_file", false]], "to_openai_api_messages() (conversation.conversation method)": [[0, "conversation.Conversation.to_openai_api_messages", false]], "tokentypeembeddings (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.token_type_embeddings)": [[224, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.token_type_embeddings.TokenTypeEmbeddings", false]], "tokentypeembeddingsv1 (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.token_type_embeddings_v1)": [[225, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.token_type_embeddings_v1.TokenTypeEmbeddingsV1", false]], "tokentypeids (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.TokenTypeIds", false]], "topk (class in intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.TopK", false]], "topk (class in intel_extension_for_transformers.transformers.runtime.compile.ops.top_k)": [[122, "intel_extension_for_transformers.transformers.runtime.compile.ops.top_k.TopK", false]], "topk() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.topk method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.TopK.topk", false]], "torch_extract_operator() (in module intel_extension_for_transformers.transformers.runtime.compile.torch_utils)": [[244, "intel_extension_for_transformers.transformers.runtime.compile.torch_utils.torch_extract_operator", false]], "torchembedding (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_embedding)": [[226, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_embedding.TorchEmbedding", false]], "torchextractor (class in intel_extension_for_transformers.transformers.runtime.compile.extractors.torch_extractor)": [[54, "intel_extension_for_transformers.transformers.runtime.compile.extractors.torch_extractor.TorchExtractor", false]], "torchinnerproductinsertbias (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_ip_insert_bias)": [[227, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_ip_insert_bias.TorchInnerProductInsertBias", false]], "torchinsertbf16node (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quant_gather_to_bf16)": [[189, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quant_gather_to_bf16.TorchInsertBF16Node", false]], "torchinsertbf16node (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torchinsertbf16node)": [[229, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torchinsertbf16node.TorchInsertBF16Node", false]], "torchpaddingsequence (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torchpaddingsquence)": [[230, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torchpaddingsquence.TorchPaddingSequence", false]], "torchunpackbaddbmm (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_unpack_baddbmm)": [[228, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_unpack_baddbmm.TorchUnpackBaddbmm", false]], "trace (class in intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook)": [[24, "intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook.Trace", false]], "tracedict (class in intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook)": [[24, "intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook.TraceDict", false]], "train() (intel_extension_for_transformers.transformers.trainer.basetrainer method)": [[246, "intel_extension_for_transformers.transformers.trainer.BaseTrainer.train", false]], "training_step() (intel_extension_for_transformers.transformers.trainer.basetrainer method)": [[246, "intel_extension_for_transformers.transformers.trainer.BaseTrainer.training_step", false]], "training_step_length_adaptive() (intel_extension_for_transformers.transformers.trainer.basetrainer method)": [[246, "intel_extension_for_transformers.transformers.trainer.BaseTrainer.training_step_length_adaptive", false]], "transformer2dmodel_attentionmaskaddreshape (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2dmodel_attentionmaskaddreshape)": [[231, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_AttentionMaskAddReshape.Transformer2Dmodel_AttentionMaskAddReshape", false]], "transformer2dmodel_constantofshapewithmul (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2dmodel_constantofshapewithmul)": [[232, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_ConstantOfShapeWithMul.Transformer2Dmodel_ConstantOfShapeWithMul", false]], "transformer2dmodel_encoderhiddenstatesreshape (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2dmodel_encoderhiddenstatesreshape)": [[238, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_encoderHiddenStatesReshape.Transformer2Dmodel_EncoderHiddenStatesReshape", false]], "transformer2dmodel_ffninputslice (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2dmodel_ffnslice)": [[233, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_FFNSlice.Transformer2Dmodel_FFNInputSlice", false]], "transformer2dmodel_ffninputslice_1 (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2dmodel_ffnslice_1)": [[234, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_FFNSlice_1.Transformer2Dmodel_FFNInputSlice_1", false]], "transformer2dmodel_getsamplebatch (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2dmodel_getsamplebatch)": [[239, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_getSampleBatch.Transformer2Dmodel_GetSampleBatch", false]], "transformer2dmodel_qkvprereshape (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2dmodel_qkvprereshape)": [[235, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_QKVPreReshape.Transformer2Dmodel_QKVPreReshape", false]], "transformer2dmodel_qkvreshape (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2dmodel_qkvreshape)": [[236, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_QKVReshape.Transformer2Dmodel_QKVReshape", false]], "transformer2dmodel_qkvreshapeto4d (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2dmodel_qkvreshape4d)": [[237, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_QKVReshape4D.Transformer2Dmodel_QKVReshapeTo4D", false]], "transformer2dmodel_sampleslice (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2dmodel_sampleslice)": [[240, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_sampleSlice.Transformer2Dmodel_SampleSlice", false]], "transpose (class in intel_extension_for_transformers.transformers.runtime.compile.ops.transpose)": [[123, "intel_extension_for_transformers.transformers.runtime.compile.ops.transpose.Transpose", false]], "transpose_for_scores() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertselfattention method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertSelfAttention.transpose_for_scores", false]], "transpose_for_scores() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertaselfattention method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaSelfAttention.transpose_for_scores", false]], "transpose_mode_int8() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.transpose_mode_int8", false]], "transposebatchmatmul (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.TransposeBatchMatMul", false]], "transposebatchmatmul (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transpose_batch_matmul)": [[241, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transpose_batch_matmul.TransposeBatchMatMul", false]], "unbox_numpy_null() (in module intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.unbox_numpy_null", false]], "unpack (class in intel_extension_for_transformers.transformers.runtime.compile.ops.unpack)": [[124, "intel_extension_for_transformers.transformers.runtime.compile.ops.unpack.Unpack", false]], "unsqueeze (class in intel_extension_for_transformers.transformers.runtime.compile.ops.unsqueeze)": [[125, "intel_extension_for_transformers.transformers.runtime.compile.ops.unsqueeze.Unsqueeze", false]], "update() (intel_extension_for_transformers.transformers.utils.config.itrexquantizationconfigmixin method)": [[247, "intel_extension_for_transformers.transformers.utils.config.ITREXQuantizationConfigMixin.update", false]], "update_config() (intel_extension_for_transformers.transformers.pruner.pruning.pruning method)": [[47, "intel_extension_for_transformers.transformers.pruner.pruning.Pruning.update_config", false]], "update_keys_to_ignore() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertapretrainedmodel method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaPreTrainedModel.update_keys_to_ignore", false]], "update_last_message() (conversation.conversation method)": [[0, "conversation.Conversation.update_last_message", false]], "util.box_ops": [[263, "module-util.box_ops", false]], "util.misc": [[264, "module-util.misc", false]], "util.plot_utils": [[265, "module-util.plot_utils", false]], "util.postprocess": [[266, "module-util.postprocess", false]], "utils.data_utils": [[267, "module-utils.data_utils", false]], "utils.eval_utils": [[268, "module-utils.eval_utils", false]], "variance (class in intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Variance", false]], "view (class in intel_extension_for_transformers.transformers.runtime.compile.ops.view)": [[126, "intel_extension_for_transformers.transformers.runtime.compile.ops.view.View", false]], "warn() (in module intel_extension_for_transformers.transformers.runtime.compile.logger)": [[61, "intel_extension_for_transformers.transformers.runtime.compile.logger.warn", false]], "warning() (in module intel_extension_for_transformers.transformers.runtime.compile.logger)": [[61, "intel_extension_for_transformers.transformers.runtime.compile.logger.warning", false]], "weight_optimization() (intel_extension_for_transformers.transformers.runtime.compile.optimizer.optimizer method)": [[128, "intel_extension_for_transformers.transformers.runtime.compile.optimizer.Optimizer.weight_optimization", false]], "weightpruningconfig (class in intel_extension_for_transformers.transformers.config)": [[28, "intel_extension_for_transformers.transformers.config.WeightPruningConfig", false]], "where (class in intel_extension_for_transformers.transformers.runtime.compile.ops.where)": [[127, "intel_extension_for_transformers.transformers.runtime.compile.ops.where.Where", false]], "wide_resnet101_2() (in module intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks)": [[17, "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks.wide_resnet101_2", false]], "wide_resnet50_2() (in module intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks)": [[17, "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks.wide_resnet50_2", false]], "wordembeddings (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.word_embeddings)": [[242, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.word_embeddings.WordEmbeddings", false]], "zeros (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Zeros", false]]}, "objects": {"": [[278, 0, 1, "c.CPU_INSTANCE", "CPU_INSTANCE"], [278, 0, 1, "c.NULL_INSTANCE", "NULL_INSTANCE"], [278, 1, 1, "_CPPv42jd", "jd"], [278, 1, 1, "_CPPv42jd", "jd"], [279, 1, 1, "_CPPv42jd", "jd"], [280, 1, 1, "_CPPv42jd", "jd"], [281, 1, 1, "_CPPv42jd", "jd"], [281, 1, 1, "_CPPv42jd", "jd"], [281, 1, 1, "_CPPv42jd", "jd"], [281, 1, 1, "_CPPv42jd", "jd"], [281, 1, 1, "_CPPv42jd", "jd"], [281, 1, 1, "_CPPv42jd", "jd"], [281, 1, 1, "_CPPv42jd", "jd"], [281, 1, 1, "_CPPv42jd", "jd"], [281, 2, 1, "_CPPv4N2jd12attention_io6K_BIASE", "jd::K_BIAS"], [281, 2, 1, "_CPPv4N2jd12attention_io8K_SCALESE", "jd::K_SCALES"], [281, 2, 1, "_CPPv4N2jd12attention_io8K_WEIGHTE", "jd::K_WEIGHT"], [281, 2, 1, "_CPPv4N2jd12attention_io9MERGE_DSTE", "jd::MERGE_DST"], [281, 2, 1, "_CPPv4N2jd12attention_io9MERGE_SRCE", "jd::MERGE_SRC"], [281, 2, 1, "_CPPv4N2jd12attention_io18QK_V_OUTPUT_SCALESE", "jd::QK_V_OUTPUT_SCALES"], [281, 2, 1, "_CPPv4N2jd12attention_io22QK_V_OUTPUT_ZERO_POINTE", "jd::QK_V_OUTPUT_ZERO_POINT"], [281, 2, 1, "_CPPv4N2jd12attention_io6Q_BIASE", "jd::Q_BIAS"], [281, 2, 1, "_CPPv4N2jd12attention_io10Q_K_SCALESE", "jd::Q_K_SCALES"], [281, 2, 1, "_CPPv4N2jd12attention_io8Q_K_SRC2E", "jd::Q_K_SRC2"], [281, 2, 1, "_CPPv4N2jd12attention_io8Q_SCALESE", "jd::Q_SCALES"], [281, 2, 1, "_CPPv4N2jd12attention_io8Q_WEIGHTE", "jd::Q_WEIGHT"], [281, 2, 1, "_CPPv4N2jd12attention_io13RESHAPE_INPUTE", "jd::RESHAPE_INPUT"], [281, 2, 1, "_CPPv4N2jd12attention_io6V_BIASE", "jd::V_BIAS"], [281, 2, 1, "_CPPv4N2jd12attention_io8V_SCALESE", "jd::V_SCALES"], [281, 2, 1, "_CPPv4N2jd12attention_io8V_WEIGHTE", "jd::V_WEIGHT"], [279, 3, 1, "_CPPv4N2jd9attentionE", "jd::attention"], [279, 4, 1, "_CPPv4N2jd9attention9attentionERK17kernel_desc_proxy", "jd::attention::attention"], [279, 4, 1, "_CPPv4N2jd9attention9attentionEv", "jd::attention::attention"], [279, 5, 1, "_CPPv4N2jd9attention9attentionERK17kernel_desc_proxy", "jd::attention::attention::kdp"], [279, 4, 1, "_CPPv4N2jd9attentionD0Ev", "jd::attention::~attention"], [279, 3, 1, "_CPPv4N2jd14attention_descE", "jd::attention_desc"], [279, 4, 1, "_CPPv4N2jd14attention_desc14attention_descERK13operator_desc", "jd::attention_desc::attention_desc"], [279, 4, 1, "_CPPv4N2jd14attention_desc14attention_descEv", "jd::attention_desc::attention_desc"], [279, 5, 1, "_CPPv4N2jd14attention_desc14attention_descERK13operator_desc", "jd::attention_desc::attention_desc::op_desc"], [279, 4, 1, "_CPPv4N2jd14attention_descD0Ev", "jd::attention_desc::~attention_desc"], [281, 6, 1, "_CPPv4N2jd12attention_ioE", "jd::attention_io"], [281, 2, 1, "_CPPv4N2jd12attention_io6K_BIASE", "jd::attention_io::K_BIAS"], [281, 2, 1, "_CPPv4N2jd12attention_io8K_SCALESE", "jd::attention_io::K_SCALES"], [281, 2, 1, "_CPPv4N2jd12attention_io8K_WEIGHTE", "jd::attention_io::K_WEIGHT"], [281, 2, 1, "_CPPv4N2jd12attention_io9MERGE_DSTE", "jd::attention_io::MERGE_DST"], [281, 2, 1, "_CPPv4N2jd12attention_io9MERGE_SRCE", "jd::attention_io::MERGE_SRC"], [281, 2, 1, "_CPPv4N2jd12attention_io18QK_V_OUTPUT_SCALESE", "jd::attention_io::QK_V_OUTPUT_SCALES"], [281, 2, 1, "_CPPv4N2jd12attention_io22QK_V_OUTPUT_ZERO_POINTE", "jd::attention_io::QK_V_OUTPUT_ZERO_POINT"], [281, 2, 1, "_CPPv4N2jd12attention_io6Q_BIASE", "jd::attention_io::Q_BIAS"], [281, 2, 1, "_CPPv4N2jd12attention_io10Q_K_SCALESE", "jd::attention_io::Q_K_SCALES"], [281, 2, 1, "_CPPv4N2jd12attention_io8Q_K_SRC2E", "jd::attention_io::Q_K_SRC2"], [281, 2, 1, "_CPPv4N2jd12attention_io8Q_SCALESE", "jd::attention_io::Q_SCALES"], [281, 2, 1, "_CPPv4N2jd12attention_io8Q_WEIGHTE", "jd::attention_io::Q_WEIGHT"], [281, 2, 1, "_CPPv4N2jd12attention_io13RESHAPE_INPUTE", "jd::attention_io::RESHAPE_INPUT"], [281, 2, 1, "_CPPv4N2jd12attention_io6V_BIASE", "jd::attention_io::V_BIAS"], [281, 2, 1, "_CPPv4N2jd12attention_io8V_SCALESE", "jd::attention_io::V_SCALES"], [281, 2, 1, "_CPPv4N2jd12attention_io8V_WEIGHTE", "jd::attention_io::V_WEIGHT"], [278, 3, 1, "_CPPv4N2jd12cpu_engine_tE", "jd::cpu_engine_t"], [278, 4, 1, "_CPPv4N2jd12cpu_engine_t12cpu_engine_tEv", "jd::cpu_engine_t::cpu_engine_t"], [278, 4, 1, "_CPPv4NK2jd12cpu_engine_t13create_kernelERK13operator_descRNSt10shared_ptrI8kernel_tEEPK8stream_t", "jd::cpu_engine_t::create_kernel"], [278, 4, 1, "_CPPv4NK2jd12cpu_engine_t21create_memory_storageEPP16memory_storage_t", "jd::cpu_engine_t::create_memory_storage"], [278, 5, 1, "_CPPv4NK2jd12cpu_engine_t21create_memory_storageEPP16memory_storage_t", "jd::cpu_engine_t::create_memory_storage::storage"], [278, 4, 1, "_CPPv4NK2jd12cpu_engine_t13create_streamEPP8stream_t", "jd::cpu_engine_t::create_stream"], [278, 7, 1, "_CPPv4N2jd12cpu_engine_t10empty_listE", "jd::cpu_engine_t::empty_list"], [278, 4, 1, "_CPPv4NK2jd12cpu_engine_t23get_implementation_listERK13operator_desc", "jd::cpu_engine_t::get_implementation_list"], [278, 5, 1, "_CPPv4NK2jd12cpu_engine_t23get_implementation_listERK13operator_desc", "jd::cpu_engine_t::get_implementation_list::op_desc"], [278, 4, 1, "_CPPv4N2jd12cpu_engine_tD0Ev", "jd::cpu_engine_t::~cpu_engine_t"], [279, 3, 1, "_CPPv4N2jd13dynamic_quantE", "jd::dynamic_quant"], [279, 4, 1, "_CPPv4N2jd13dynamic_quant13dynamic_quantERK17kernel_desc_proxy", "jd::dynamic_quant::dynamic_quant"], [279, 4, 1, "_CPPv4N2jd13dynamic_quant13dynamic_quantEv", "jd::dynamic_quant::dynamic_quant"], [279, 5, 1, "_CPPv4N2jd13dynamic_quant13dynamic_quantERK17kernel_desc_proxy", "jd::dynamic_quant::dynamic_quant::kdp"], [279, 4, 1, "_CPPv4N2jd13dynamic_quantD0Ev", "jd::dynamic_quant::~dynamic_quant"], [279, 3, 1, "_CPPv4N2jd18dynamic_quant_descE", "jd::dynamic_quant_desc"], [279, 4, 1, "_CPPv4N2jd18dynamic_quant_desc18dynamic_quant_descERK13operator_desc", "jd::dynamic_quant_desc::dynamic_quant_desc"], [279, 4, 1, "_CPPv4N2jd18dynamic_quant_desc18dynamic_quant_descEv", "jd::dynamic_quant_desc::dynamic_quant_desc"], [279, 5, 1, "_CPPv4N2jd18dynamic_quant_desc18dynamic_quant_descERK13operator_desc", "jd::dynamic_quant_desc::dynamic_quant_desc::op_desc"], [279, 4, 1, "_CPPv4N2jd18dynamic_quant_descD0Ev", "jd::dynamic_quant_desc::~dynamic_quant_desc"], [279, 3, 1, "_CPPv4N2jd20dynamic_quant_matmulE", "jd::dynamic_quant_matmul"], [279, 4, 1, "_CPPv4N2jd20dynamic_quant_matmul20dynamic_quant_matmulERK17kernel_desc_proxy", "jd::dynamic_quant_matmul::dynamic_quant_matmul"], [279, 4, 1, "_CPPv4N2jd20dynamic_quant_matmul20dynamic_quant_matmulEv", "jd::dynamic_quant_matmul::dynamic_quant_matmul"], [279, 5, 1, "_CPPv4N2jd20dynamic_quant_matmul20dynamic_quant_matmulERK17kernel_desc_proxy", "jd::dynamic_quant_matmul::dynamic_quant_matmul::kdp"], [279, 4, 1, "_CPPv4N2jd20dynamic_quant_matmulD0Ev", "jd::dynamic_quant_matmul::~dynamic_quant_matmul"], [279, 3, 1, "_CPPv4N2jd25dynamic_quant_matmul_descE", "jd::dynamic_quant_matmul_desc"], [279, 4, 1, "_CPPv4N2jd25dynamic_quant_matmul_desc25dynamic_quant_matmul_descERK13operator_desc", "jd::dynamic_quant_matmul_desc::dynamic_quant_matmul_desc"], [279, 4, 1, "_CPPv4N2jd25dynamic_quant_matmul_desc25dynamic_quant_matmul_descEv", "jd::dynamic_quant_matmul_desc::dynamic_quant_matmul_desc"], [279, 5, 1, "_CPPv4N2jd25dynamic_quant_matmul_desc25dynamic_quant_matmul_descERK13operator_desc", "jd::dynamic_quant_matmul_desc::dynamic_quant_matmul_desc::op_desc"], [279, 4, 1, "_CPPv4N2jd25dynamic_quant_matmul_descD0Ev", "jd::dynamic_quant_matmul_desc::~dynamic_quant_matmul_desc"], [279, 3, 1, "_CPPv4N2jd9eltwiseopE", "jd::eltwiseop"], [279, 4, 1, "_CPPv4N2jd9eltwiseop9eltwiseopERK17kernel_desc_proxy", "jd::eltwiseop::eltwiseop"], [279, 4, 1, "_CPPv4N2jd9eltwiseop9eltwiseopEv", "jd::eltwiseop::eltwiseop"], [279, 5, 1, "_CPPv4N2jd9eltwiseop9eltwiseopERK17kernel_desc_proxy", "jd::eltwiseop::eltwiseop::kdp"], [279, 4, 1, "_CPPv4N2jd9eltwiseopD0Ev", "jd::eltwiseop::~eltwiseop"], [279, 3, 1, "_CPPv4N2jd14eltwiseop_descE", "jd::eltwiseop_desc"], [279, 4, 1, "_CPPv4N2jd14eltwiseop_desc14eltwiseop_descERK13operator_desc", "jd::eltwiseop_desc::eltwiseop_desc"], [279, 4, 1, "_CPPv4N2jd14eltwiseop_desc14eltwiseop_descEv", "jd::eltwiseop_desc::eltwiseop_desc"], [279, 5, 1, "_CPPv4N2jd14eltwiseop_desc14eltwiseop_descERK13operator_desc", "jd::eltwiseop_desc::eltwiseop_desc::op_desc"], [279, 4, 1, "_CPPv4N2jd14eltwiseop_descD0Ev", "jd::eltwiseop_desc::~eltwiseop_desc"], [278, 3, 1, "_CPPv4N2jd8engine_tE", "jd::engine_t"], [278, 4, 1, "_CPPv4NK2jd8engine_t13create_kernelERK13operator_descRNSt10shared_ptrI8kernel_tEEPK8stream_t", "jd::engine_t::create_kernel"], [278, 4, 1, "_CPPv4NK2jd8engine_t21create_memory_storageEPP16memory_storage_t", "jd::engine_t::create_memory_storage"], [278, 4, 1, "_CPPv4NK2jd8engine_t13create_streamEPP8stream_t", "jd::engine_t::create_stream"], [278, 7, 1, "_CPPv4N2jd8engine_t12engine_kind_E", "jd::engine_t::engine_kind_"], [278, 4, 1, "_CPPv4N2jd8engine_t8engine_tERK11engine_kindRK12runtime_kind", "jd::engine_t::engine_t"], [278, 5, 1, "_CPPv4N2jd8engine_t8engine_tERK11engine_kindRK12runtime_kind", "jd::engine_t::engine_t::engine_kind"], [278, 5, 1, "_CPPv4N2jd8engine_t8engine_tERK11engine_kindRK12runtime_kind", "jd::engine_t::engine_t::runtime_kind"], [278, 4, 1, "_CPPv4NK2jd8engine_t15get_engine_kindEv", "jd::engine_t::get_engine_kind"], [278, 4, 1, "_CPPv4NK2jd8engine_t23get_implementation_listERK13operator_desc", "jd::engine_t::get_implementation_list"], [278, 5, 1, "_CPPv4NK2jd8engine_t23get_implementation_listERK13operator_desc", "jd::engine_t::get_implementation_list::op_desc"], [278, 4, 1, "_CPPv4NK2jd8engine_t16get_runtime_kindEv", "jd::engine_t::get_runtime_kind"], [278, 7, 1, "_CPPv4N2jd8engine_t13runtime_kind_E", "jd::engine_t::runtime_kind_"], [278, 4, 1, "_CPPv4N2jd8engine_tD0Ev", "jd::engine_t::~engine_t"], [279, 3, 1, "_CPPv4N2jd6gatherE", "jd::gather"], [279, 4, 1, "_CPPv4N2jd6gather6gatherERK17kernel_desc_proxy", "jd::gather::gather"], [279, 4, 1, "_CPPv4N2jd6gather6gatherEv", "jd::gather::gather"], [279, 5, 1, "_CPPv4N2jd6gather6gatherERK17kernel_desc_proxy", "jd::gather::gather::kdp"], [279, 4, 1, "_CPPv4N2jd6gatherD0Ev", "jd::gather::~gather"], [279, 3, 1, "_CPPv4N2jd11gather_descE", "jd::gather_desc"], [279, 4, 1, "_CPPv4N2jd11gather_desc11gather_descERK13operator_desc", "jd::gather_desc::gather_desc"], [279, 4, 1, "_CPPv4N2jd11gather_desc11gather_descEv", "jd::gather_desc::gather_desc"], [279, 5, 1, "_CPPv4N2jd11gather_desc11gather_descERK13operator_desc", "jd::gather_desc::gather_desc::op_desc"], [279, 4, 1, "_CPPv4N2jd11gather_descD0Ev", "jd::gather_desc::~gather_desc"], [279, 3, 1, "_CPPv4N2jd9groupnormE", "jd::groupnorm"], [279, 4, 1, "_CPPv4N2jd9groupnorm9groupnormERK17kernel_desc_proxy", "jd::groupnorm::groupnorm"], [279, 4, 1, "_CPPv4N2jd9groupnorm9groupnormEv", "jd::groupnorm::groupnorm"], [279, 5, 1, "_CPPv4N2jd9groupnorm9groupnormERK17kernel_desc_proxy", "jd::groupnorm::groupnorm::kdp"], [279, 4, 1, "_CPPv4N2jd9groupnormD0Ev", "jd::groupnorm::~groupnorm"], [279, 3, 1, "_CPPv4N2jd14groupnorm_descE", "jd::groupnorm_desc"], [279, 4, 1, "_CPPv4N2jd14groupnorm_desc14groupnorm_descERK13operator_desc", "jd::groupnorm_desc::groupnorm_desc"], [279, 4, 1, "_CPPv4N2jd14groupnorm_desc14groupnorm_descEv", "jd::groupnorm_desc::groupnorm_desc"], [279, 5, 1, "_CPPv4N2jd14groupnorm_desc14groupnorm_descERK13operator_desc", "jd::groupnorm_desc::groupnorm_desc::op_desc"], [279, 4, 1, "_CPPv4N2jd14groupnorm_descD0Ev", "jd::groupnorm_desc::~groupnorm_desc"], [279, 3, 1, "_CPPv4N2jd17kernel_desc_proxyE", "jd::kernel_desc_proxy"], [279, 4, 1, "_CPPv4N2jd17kernel_desc_proxy19create_proxy_objectERNSt10shared_ptrIK13kernel_desc_tEERK13operator_desc", "jd::kernel_desc_proxy::create_proxy_object"], [279, 5, 1, "_CPPv4N2jd17kernel_desc_proxy19create_proxy_objectERNSt10shared_ptrIK13kernel_desc_tEERK13operator_desc", "jd::kernel_desc_proxy::create_proxy_object::op_desc"], [279, 5, 1, "_CPPv4N2jd17kernel_desc_proxy19create_proxy_objectERNSt10shared_ptrIK13kernel_desc_tEERK13operator_desc", "jd::kernel_desc_proxy::create_proxy_object::result_ref"], [279, 7, 1, "_CPPv4N2jd17kernel_desc_proxy10impl_list_E", "jd::kernel_desc_proxy::impl_list_"], [279, 4, 1, "_CPPv4N2jd17kernel_desc_proxy17kernel_desc_proxyERK13operator_desc", "jd::kernel_desc_proxy::kernel_desc_proxy"], [279, 4, 1, "_CPPv4N2jd17kernel_desc_proxy17kernel_desc_proxyEv", "jd::kernel_desc_proxy::kernel_desc_proxy"], [279, 5, 1, "_CPPv4N2jd17kernel_desc_proxy17kernel_desc_proxyERK13operator_desc", "jd::kernel_desc_proxy::kernel_desc_proxy::op_desc"], [279, 4, 1, "_CPPv4NK2jd17kernel_desc_proxy11kernel_kindEv", "jd::kernel_desc_proxy::kernel_kind"], [279, 4, 1, "_CPPv4N2jd17kernel_desc_proxyD0Ev", "jd::kernel_desc_proxy::~kernel_desc_proxy"], [279, 3, 1, "_CPPv4N2jd12kernel_proxyE", "jd::kernel_proxy"], [279, 4, 1, "_CPPv4N2jd12kernel_proxy19create_proxy_objectERNSt10shared_ptrIK8kernel_tEERKNSt10shared_ptrIK13kernel_desc_tEE", "jd::kernel_proxy::create_proxy_object"], [279, 5, 1, "_CPPv4N2jd12kernel_proxy19create_proxy_objectERNSt10shared_ptrIK8kernel_tEERKNSt10shared_ptrIK13kernel_desc_tEE", "jd::kernel_proxy::create_proxy_object::kd"], [279, 5, 1, "_CPPv4N2jd12kernel_proxy19create_proxy_objectERNSt10shared_ptrIK8kernel_tEERKNSt10shared_ptrIK13kernel_desc_tEE", "jd::kernel_proxy::create_proxy_object::result_ref"], [279, 4, 1, "_CPPv4NK2jd12kernel_proxy7executeERK14exec_context_t", "jd::kernel_proxy::execute"], [279, 4, 1, "_CPPv4NK2jd12kernel_proxy7executeERKNSt6vectorIPKvEE", "jd::kernel_proxy::execute"], [279, 5, 1, "_CPPv4NK2jd12kernel_proxy7executeERK14exec_context_t", "jd::kernel_proxy::execute::ctx"], [279, 5, 1, "_CPPv4NK2jd12kernel_proxy7executeERKNSt6vectorIPKvEE", "jd::kernel_proxy::execute::rt_data"], [279, 4, 1, "_CPPv4NK2jd12kernel_proxy18get_workspace_sizeEv", "jd::kernel_proxy::get_workspace_size"], [279, 4, 1, "_CPPv4NK2jd12kernel_proxy11kernel_kindEv", "jd::kernel_proxy::kernel_kind"], [279, 4, 1, "_CPPv4N2jd12kernel_proxy12kernel_proxyERK17kernel_desc_proxy", "jd::kernel_proxy::kernel_proxy"], [279, 4, 1, "_CPPv4N2jd12kernel_proxy12kernel_proxyEv", "jd::kernel_proxy::kernel_proxy"], [279, 5, 1, "_CPPv4N2jd12kernel_proxy12kernel_proxyERK17kernel_desc_proxy", "jd::kernel_proxy::kernel_proxy::kdp"], [279, 4, 1, "_CPPv4N2jd12kernel_proxyD0Ev", "jd::kernel_proxy::~kernel_proxy"], [279, 3, 1, "_CPPv4N2jd12layernorm_baE", "jd::layernorm_ba"], [279, 4, 1, "_CPPv4N2jd12layernorm_ba12layernorm_baERK17kernel_desc_proxy", "jd::layernorm_ba::layernorm_ba"], [279, 4, 1, "_CPPv4N2jd12layernorm_ba12layernorm_baEv", "jd::layernorm_ba::layernorm_ba"], [279, 5, 1, "_CPPv4N2jd12layernorm_ba12layernorm_baERK17kernel_desc_proxy", "jd::layernorm_ba::layernorm_ba::kdp"], [279, 4, 1, "_CPPv4N2jd12layernorm_baD0Ev", "jd::layernorm_ba::~layernorm_ba"], [279, 3, 1, "_CPPv4N2jd17layernorm_ba_descE", "jd::layernorm_ba_desc"], [279, 4, 1, "_CPPv4N2jd17layernorm_ba_desc17layernorm_ba_descERK13operator_desc", "jd::layernorm_ba_desc::layernorm_ba_desc"], [279, 4, 1, "_CPPv4N2jd17layernorm_ba_desc17layernorm_ba_descEv", "jd::layernorm_ba_desc::layernorm_ba_desc"], [279, 5, 1, "_CPPv4N2jd17layernorm_ba_desc17layernorm_ba_descERK13operator_desc", "jd::layernorm_ba_desc::layernorm_ba_desc::op_desc"], [279, 4, 1, "_CPPv4N2jd17layernorm_ba_descD0Ev", "jd::layernorm_ba_desc::~layernorm_ba_desc"], [279, 3, 1, "_CPPv4N2jd20layernormalized_spmmE", "jd::layernormalized_spmm"], [279, 4, 1, "_CPPv4N2jd20layernormalized_spmm20layernormalized_spmmERK17kernel_desc_proxy", "jd::layernormalized_spmm::layernormalized_spmm"], [279, 4, 1, "_CPPv4N2jd20layernormalized_spmm20layernormalized_spmmEv", "jd::layernormalized_spmm::layernormalized_spmm"], [279, 5, 1, "_CPPv4N2jd20layernormalized_spmm20layernormalized_spmmERK17kernel_desc_proxy", "jd::layernormalized_spmm::layernormalized_spmm::kdp"], [279, 4, 1, "_CPPv4N2jd20layernormalized_spmmD0Ev", "jd::layernormalized_spmm::~layernormalized_spmm"], [279, 3, 1, "_CPPv4N2jd25layernormalized_spmm_descE", "jd::layernormalized_spmm_desc"], [279, 4, 1, "_CPPv4N2jd25layernormalized_spmm_desc25layernormalized_spmm_descERK13operator_desc", "jd::layernormalized_spmm_desc::layernormalized_spmm_desc"], [279, 4, 1, "_CPPv4N2jd25layernormalized_spmm_desc25layernormalized_spmm_descEv", "jd::layernormalized_spmm_desc::layernormalized_spmm_desc"], [279, 5, 1, "_CPPv4N2jd25layernormalized_spmm_desc25layernormalized_spmm_descERK13operator_desc", "jd::layernormalized_spmm_desc::layernormalized_spmm_desc::op_desc"], [279, 4, 1, "_CPPv4N2jd25layernormalized_spmm_descD0Ev", "jd::layernormalized_spmm_desc::~layernormalized_spmm_desc"], [279, 3, 1, "_CPPv4N2jd10logsoftmaxE", "jd::logsoftmax"], [279, 4, 1, "_CPPv4N2jd10logsoftmax10logsoftmaxERK17kernel_desc_proxy", "jd::logsoftmax::logsoftmax"], [279, 4, 1, "_CPPv4N2jd10logsoftmax10logsoftmaxEv", "jd::logsoftmax::logsoftmax"], [279, 5, 1, "_CPPv4N2jd10logsoftmax10logsoftmaxERK17kernel_desc_proxy", "jd::logsoftmax::logsoftmax::kdp"], [279, 4, 1, "_CPPv4N2jd10logsoftmaxD0Ev", "jd::logsoftmax::~logsoftmax"], [279, 3, 1, "_CPPv4N2jd15logsoftmax_descE", "jd::logsoftmax_desc"], [279, 4, 1, "_CPPv4N2jd15logsoftmax_desc15logsoftmax_descERK13operator_desc", "jd::logsoftmax_desc::logsoftmax_desc"], [279, 4, 1, "_CPPv4N2jd15logsoftmax_desc15logsoftmax_descEv", "jd::logsoftmax_desc::logsoftmax_desc"], [279, 5, 1, "_CPPv4N2jd15logsoftmax_desc15logsoftmax_descERK13operator_desc", "jd::logsoftmax_desc::logsoftmax_desc::op_desc"], [279, 4, 1, "_CPPv4N2jd15logsoftmax_descD0Ev", "jd::logsoftmax_desc::~logsoftmax_desc"], [279, 3, 1, "_CPPv4N2jd9mha_denseE", "jd::mha_dense"], [279, 4, 1, "_CPPv4N2jd9mha_dense9mha_denseERK17kernel_desc_proxy", "jd::mha_dense::mha_dense"], [279, 4, 1, "_CPPv4N2jd9mha_dense9mha_denseEv", "jd::mha_dense::mha_dense"], [279, 5, 1, "_CPPv4N2jd9mha_dense9mha_denseERK17kernel_desc_proxy", "jd::mha_dense::mha_dense::kdp"], [279, 4, 1, "_CPPv4N2jd9mha_denseD0Ev", "jd::mha_dense::~mha_dense"], [279, 3, 1, "_CPPv4N2jd14mha_dense_descE", "jd::mha_dense_desc"], [279, 4, 1, "_CPPv4N2jd14mha_dense_desc14mha_dense_descERK13operator_desc", "jd::mha_dense_desc::mha_dense_desc"], [279, 4, 1, "_CPPv4N2jd14mha_dense_desc14mha_dense_descEv", "jd::mha_dense_desc::mha_dense_desc"], [279, 5, 1, "_CPPv4N2jd14mha_dense_desc14mha_dense_descERK13operator_desc", "jd::mha_dense_desc::mha_dense_desc::op_desc"], [279, 4, 1, "_CPPv4N2jd14mha_dense_descD0Ev", "jd::mha_dense_desc::~mha_dense_desc"], [280, 3, 1, "_CPPv4N2jd13operator_descE", "jd::operator_desc"], [280, 4, 1, "_CPPv4NK2jd13operator_desc18apply_postops_listEv", "jd::operator_desc::apply_postops_list"], [280, 7, 1, "_CPPv4N2jd13operator_desc19apply_postops_list_E", "jd::operator_desc::apply_postops_list_"], [280, 4, 1, "_CPPv4NK2jd13operator_desc5attrsEv", "jd::operator_desc::attrs"], [280, 7, 1, "_CPPv4N2jd13operator_desc6attrs_E", "jd::operator_desc::attrs_"], [280, 7, 1, "_CPPv4N2jd13operator_desc14binaryop_list_E", "jd::operator_desc::binaryop_list_"], [280, 4, 1, "_CPPv4NK2jd13operator_desc11engine_kindEv", "jd::operator_desc::engine_kind"], [280, 7, 1, "_CPPv4N2jd13operator_desc12engine_kind_E", "jd::operator_desc::engine_kind_"], [280, 4, 1, "_CPPv4NK2jd13operator_desc17get_binaryop_listEv", "jd::operator_desc::get_binaryop_list"], [280, 4, 1, "_CPPv4NK2jd13operator_desc9impl_nthrEv", "jd::operator_desc::impl_nthr"], [280, 7, 1, "_CPPv4N2jd13operator_desc10impl_nthr_E", "jd::operator_desc::impl_nthr_"], [280, 7, 1, "_CPPv4N2jd13operator_desc9ker_kind_E", "jd::operator_desc::ker_kind_"], [280, 7, 1, "_CPPv4N2jd13operator_desc9ker_prop_E", "jd::operator_desc::ker_prop_"], [280, 4, 1, "_CPPv4NK2jd13operator_desc11kernel_kindEv", "jd::operator_desc::kernel_kind"], [280, 4, 1, "_CPPv4NK2jd13operator_desc11kernel_propEv", "jd::operator_desc::kernel_prop"], [280, 4, 1, "_CPPv4NK2jd13operator_desceqERK13operator_desc", "jd::operator_desc::operator=="], [280, 5, 1, "_CPPv4NK2jd13operator_desceqERK13operator_desc", "jd::operator_desc::operator==::rhs"], [280, 4, 1, "_CPPv4N2jd13operator_desc13operator_descERK11kernel_kindRK11kernel_propRK11engine_kindRK12runtime_kindRKNSt6vectorI11tensor_descEERKNSt13unordered_mapINSt6stringENSt6stringEEERKNSt6vectorI11postop_attrEE", "jd::operator_desc::operator_desc"], [280, 4, 1, "_CPPv4N2jd13operator_desc13operator_descERK11kernel_kindRK11kernel_propRK11engine_kindRKNSt6vectorI11tensor_descEERKNSt13unordered_mapINSt6stringENSt6stringEEERKNSt6vectorI11postop_attrEE", "jd::operator_desc::operator_desc"], [280, 4, 1, "_CPPv4N2jd13operator_desc13operator_descEv", "jd::operator_desc::operator_desc"], [280, 5, 1, "_CPPv4N2jd13operator_desc13operator_descERK11kernel_kindRK11kernel_propRK11engine_kindRK12runtime_kindRKNSt6vectorI11tensor_descEERKNSt13unordered_mapINSt6stringENSt6stringEEERKNSt6vectorI11postop_attrEE", "jd::operator_desc::operator_desc::apply_postops_list"], [280, 5, 1, "_CPPv4N2jd13operator_desc13operator_descERK11kernel_kindRK11kernel_propRK11engine_kindRKNSt6vectorI11tensor_descEERKNSt13unordered_mapINSt6stringENSt6stringEEERKNSt6vectorI11postop_attrEE", "jd::operator_desc::operator_desc::apply_postops_list"], [280, 5, 1, "_CPPv4N2jd13operator_desc13operator_descERK11kernel_kindRK11kernel_propRK11engine_kindRK12runtime_kindRKNSt6vectorI11tensor_descEERKNSt13unordered_mapINSt6stringENSt6stringEEERKNSt6vectorI11postop_attrEE", "jd::operator_desc::operator_desc::attrs"], [280, 5, 1, "_CPPv4N2jd13operator_desc13operator_descERK11kernel_kindRK11kernel_propRK11engine_kindRKNSt6vectorI11tensor_descEERKNSt13unordered_mapINSt6stringENSt6stringEEERKNSt6vectorI11postop_attrEE", "jd::operator_desc::operator_desc::attrs"], [280, 5, 1, "_CPPv4N2jd13operator_desc13operator_descERK11kernel_kindRK11kernel_propRK11engine_kindRK12runtime_kindRKNSt6vectorI11tensor_descEERKNSt13unordered_mapINSt6stringENSt6stringEEERKNSt6vectorI11postop_attrEE", "jd::operator_desc::operator_desc::eng_kind"], [280, 5, 1, "_CPPv4N2jd13operator_desc13operator_descERK11kernel_kindRK11kernel_propRK11engine_kindRKNSt6vectorI11tensor_descEERKNSt13unordered_mapINSt6stringENSt6stringEEERKNSt6vectorI11postop_attrEE", "jd::operator_desc::operator_desc::eng_kind"], [280, 5, 1, "_CPPv4N2jd13operator_desc13operator_descERK11kernel_kindRK11kernel_propRK11engine_kindRK12runtime_kindRKNSt6vectorI11tensor_descEERKNSt13unordered_mapINSt6stringENSt6stringEEERKNSt6vectorI11postop_attrEE", "jd::operator_desc::operator_desc::ker_kind"], [280, 5, 1, "_CPPv4N2jd13operator_desc13operator_descERK11kernel_kindRK11kernel_propRK11engine_kindRKNSt6vectorI11tensor_descEERKNSt13unordered_mapINSt6stringENSt6stringEEERKNSt6vectorI11postop_attrEE", "jd::operator_desc::operator_desc::ker_kind"], [280, 5, 1, "_CPPv4N2jd13operator_desc13operator_descERK11kernel_kindRK11kernel_propRK11engine_kindRK12runtime_kindRKNSt6vectorI11tensor_descEERKNSt13unordered_mapINSt6stringENSt6stringEEERKNSt6vectorI11postop_attrEE", "jd::operator_desc::operator_desc::ker_prop"], [280, 5, 1, "_CPPv4N2jd13operator_desc13operator_descERK11kernel_kindRK11kernel_propRK11engine_kindRKNSt6vectorI11tensor_descEERKNSt13unordered_mapINSt6stringENSt6stringEEERKNSt6vectorI11postop_attrEE", "jd::operator_desc::operator_desc::ker_prop"], [280, 5, 1, "_CPPv4N2jd13operator_desc13operator_descERK11kernel_kindRK11kernel_propRK11engine_kindRK12runtime_kindRKNSt6vectorI11tensor_descEERKNSt13unordered_mapINSt6stringENSt6stringEEERKNSt6vectorI11postop_attrEE", "jd::operator_desc::operator_desc::runtime_kind"], [280, 5, 1, "_CPPv4N2jd13operator_desc13operator_descERK11kernel_kindRK11kernel_propRK11engine_kindRK12runtime_kindRKNSt6vectorI11tensor_descEERKNSt13unordered_mapINSt6stringENSt6stringEEERKNSt6vectorI11postop_attrEE", "jd::operator_desc::operator_desc::ts_descs"], [280, 5, 1, "_CPPv4N2jd13operator_desc13operator_descERK11kernel_kindRK11kernel_propRK11engine_kindRKNSt6vectorI11tensor_descEERKNSt13unordered_mapINSt6stringENSt6stringEEERKNSt6vectorI11postop_attrEE", "jd::operator_desc::operator_desc::ts_descs"], [280, 4, 1, "_CPPv4NK2jd13operator_desc12runtime_kindEv", "jd::operator_desc::runtime_kind"], [280, 7, 1, "_CPPv4N2jd13operator_desc13runtime_kind_E", "jd::operator_desc::runtime_kind_"], [280, 4, 1, "_CPPv4N2jd13operator_desc17set_binaryop_listERKNSt6vectorI13binaryop_attrEE", "jd::operator_desc::set_binaryop_list"], [280, 5, 1, "_CPPv4N2jd13operator_desc17set_binaryop_listERKNSt6vectorI13binaryop_attrEE", "jd::operator_desc::set_binaryop_list::binaryop_list"], [280, 4, 1, "_CPPv4NK2jd13operator_desc12tensor_descsEv", "jd::operator_desc::tensor_descs"], [280, 4, 1, "_CPPv4NK2jd13operator_desc13tensor_dtypesEv", "jd::operator_desc::tensor_dtypes"], [280, 4, 1, "_CPPv4NK2jd13operator_desc13tensor_ftypesEv", "jd::operator_desc::tensor_ftypes"], [280, 4, 1, "_CPPv4NK2jd13operator_desc13tensor_shapesEv", "jd::operator_desc::tensor_shapes"], [280, 7, 1, "_CPPv4N2jd13operator_desc9ts_descs_E", "jd::operator_desc::ts_descs_"], [280, 4, 1, "_CPPv4N2jd13operator_descD0Ev", "jd::operator_desc::~operator_desc"], [279, 3, 1, "_CPPv4I00EN2jd10proxy_baseE", "jd::proxy_base"], [279, 8, 1, "_CPPv4I00EN2jd10proxy_baseE", "jd::proxy_base::T"], [279, 8, 1, "_CPPv4I00EN2jd10proxy_baseE", "jd::proxy_base::arg_t"], [279, 4, 1, "_CPPv4N2jd10proxy_base19create_proxy_objectERNSt10shared_ptrIK1TEERK5arg_t", "jd::proxy_base::create_proxy_object"], [279, 5, 1, "_CPPv4N2jd10proxy_base19create_proxy_objectERNSt10shared_ptrIK1TEERK5arg_t", "jd::proxy_base::create_proxy_object::arg"], [279, 5, 1, "_CPPv4N2jd10proxy_base19create_proxy_objectERNSt10shared_ptrIK1TEERK5arg_t", "jd::proxy_base::create_proxy_object::result_ref"], [279, 7, 1, "_CPPv4N2jd10proxy_base12data_handle_E", "jd::proxy_base::data_handle_"], [279, 4, 1, "_CPPv4NK2jd10proxy_base6get_spEv", "jd::proxy_base::get_sp"], [279, 4, 1, "_CPPv4N2jd10proxy_base10proxy_baseEv", "jd::proxy_base::proxy_base"], [279, 4, 1, "_CPPv4N2jd10proxy_base8reset_spERKNSt10shared_ptrIK1TEE", "jd::proxy_base::reset_sp"], [279, 5, 1, "_CPPv4N2jd10proxy_base8reset_spERKNSt10shared_ptrIK1TEE", "jd::proxy_base::reset_sp::sp"], [279, 4, 1, "_CPPv4N2jd10proxy_baseD0Ev", "jd::proxy_base::~proxy_base"], [279, 3, 1, "_CPPv4N2jd5sliceE", "jd::slice"], [279, 4, 1, "_CPPv4N2jd5slice5sliceERK17kernel_desc_proxy", "jd::slice::slice"], [279, 4, 1, "_CPPv4N2jd5slice5sliceEv", "jd::slice::slice"], [279, 5, 1, "_CPPv4N2jd5slice5sliceERK17kernel_desc_proxy", "jd::slice::slice::kdp"], [279, 4, 1, "_CPPv4N2jd5sliceD0Ev", "jd::slice::~slice"], [279, 3, 1, "_CPPv4N2jd10slice_descE", "jd::slice_desc"], [279, 4, 1, "_CPPv4N2jd10slice_desc10slice_descERK13operator_desc", "jd::slice_desc::slice_desc"], [279, 4, 1, "_CPPv4N2jd10slice_desc10slice_descEv", "jd::slice_desc::slice_desc"], [279, 5, 1, "_CPPv4N2jd10slice_desc10slice_descERK13operator_desc", "jd::slice_desc::slice_desc::op_desc"], [279, 4, 1, "_CPPv4N2jd10slice_descD0Ev", "jd::slice_desc::~slice_desc"], [279, 3, 1, "_CPPv4N2jd7softmaxE", "jd::softmax"], [279, 4, 1, "_CPPv4N2jd7softmax7softmaxERK17kernel_desc_proxy", "jd::softmax::softmax"], [279, 4, 1, "_CPPv4N2jd7softmax7softmaxEv", "jd::softmax::softmax"], [279, 5, 1, "_CPPv4N2jd7softmax7softmaxERK17kernel_desc_proxy", "jd::softmax::softmax::kdp"], [279, 4, 1, "_CPPv4N2jd7softmaxD0Ev", "jd::softmax::~softmax"], [279, 3, 1, "_CPPv4N2jd12softmax_descE", "jd::softmax_desc"], [279, 4, 1, "_CPPv4N2jd12softmax_desc12softmax_descERK13operator_desc", "jd::softmax_desc::softmax_desc"], [279, 4, 1, "_CPPv4N2jd12softmax_desc12softmax_descEv", "jd::softmax_desc::softmax_desc"], [279, 5, 1, "_CPPv4N2jd12softmax_desc12softmax_descERK13operator_desc", "jd::softmax_desc::softmax_desc::op_desc"], [279, 4, 1, "_CPPv4N2jd12softmax_descD0Ev", "jd::softmax_desc::~softmax_desc"], [279, 3, 1, "_CPPv4N2jd13sparse_matmulE", "jd::sparse_matmul"], [279, 4, 1, "_CPPv4N2jd13sparse_matmul13sparse_matmulERK17kernel_desc_proxy", "jd::sparse_matmul::sparse_matmul"], [279, 4, 1, "_CPPv4N2jd13sparse_matmul13sparse_matmulEv", "jd::sparse_matmul::sparse_matmul"], [279, 5, 1, "_CPPv4N2jd13sparse_matmul13sparse_matmulERK17kernel_desc_proxy", "jd::sparse_matmul::sparse_matmul::kdp"], [279, 4, 1, "_CPPv4N2jd13sparse_matmulD0Ev", "jd::sparse_matmul::~sparse_matmul"], [279, 3, 1, "_CPPv4N2jd18sparse_matmul_descE", "jd::sparse_matmul_desc"], [279, 4, 1, "_CPPv4N2jd18sparse_matmul_desc18sparse_matmul_descERK13operator_desc", "jd::sparse_matmul_desc::sparse_matmul_desc"], [279, 4, 1, "_CPPv4N2jd18sparse_matmul_desc18sparse_matmul_descEv", "jd::sparse_matmul_desc::sparse_matmul_desc"], [279, 5, 1, "_CPPv4N2jd18sparse_matmul_desc18sparse_matmul_descERK13operator_desc", "jd::sparse_matmul_desc::sparse_matmul_desc::op_desc"], [279, 4, 1, "_CPPv4N2jd18sparse_matmul_descD0Ev", "jd::sparse_matmul_desc::~sparse_matmul_desc"], [281, 1, 1, "_CPPv4N2jd3ssdE", "jd::ssd"], [281, 1, 1, "_CPPv4N2jd3ssdE", "jd::ssd"], [281, 1, 1, "_CPPv4N2jd3ssdE", "jd::ssd"], [281, 1, 1, "_CPPv4N2jd3ssdE", "jd::ssd"], [281, 1, 1, "_CPPv4N2jd3ssdE", "jd::ssd"], [281, 1, 1, "_CPPv4N2jd3ssdE", "jd::ssd"], [281, 1, 1, "_CPPv4N2jd3ssdE", "jd::ssd"], [281, 7, 1, "_CPPv4N2jd3ssd4BIASE", "jd::ssd::BIAS"], [281, 7, 1, "_CPPv4N2jd3ssd3DSTE", "jd::ssd::DST"], [281, 7, 1, "_CPPv4N2jd3ssd6DST_M1E", "jd::ssd::DST_M1"], [281, 7, 1, "_CPPv4N2jd3ssd6DST_M2E", "jd::ssd::DST_M2"], [281, 7, 1, "_CPPv4N2jd3ssd6SCALESE", "jd::ssd::SCALES"], [281, 7, 1, "_CPPv4N2jd3ssd3SRCE", "jd::ssd::SRC"], [281, 7, 1, "_CPPv4N2jd3ssd3WEIE", "jd::ssd::WEI"], [281, 7, 1, "_CPPv4N2jd3ssd10WORK_SPACEE", "jd::ssd::WORK_SPACE"], [281, 1, 1, "_CPPv4N2jd3ssd17amx_bf16_params_tE", "jd::ssd::amx_bf16_params_t"], [281, 1, 1, "_CPPv4N2jd3ssd21amx_bf16bf16_inputs_tE", "jd::ssd::amx_bf16bf16_inputs_t"], [281, 1, 1, "_CPPv4N2jd3ssd20amx_bf16f32_inputs_tE", "jd::ssd::amx_bf16f32_inputs_t"], [281, 3, 1, "_CPPv4I0000EN2jd3ssd12amx_inputs_tE", "jd::ssd::amx_inputs_t"], [281, 8, 1, "_CPPv4I0000EN2jd3ssd12amx_inputs_tE", "jd::ssd::amx_inputs_t::bia_t"], [281, 7, 1, "_CPPv4N2jd3ssd12amx_inputs_t4biasE", "jd::ssd::amx_inputs_t::bias"], [281, 7, 1, "_CPPv4N2jd3ssd12amx_inputs_t3dstE", "jd::ssd::amx_inputs_t::dst"], [281, 8, 1, "_CPPv4I0000EN2jd3ssd12amx_inputs_tE", "jd::ssd::amx_inputs_t::dst_t"], [281, 7, 1, "_CPPv4N2jd3ssd12amx_inputs_t3srcE", "jd::ssd::amx_inputs_t::src"], [281, 8, 1, "_CPPv4I0000EN2jd3ssd12amx_inputs_tE", "jd::ssd::amx_inputs_t::src_t"], [281, 7, 1, "_CPPv4N2jd3ssd12amx_inputs_t6weightE", "jd::ssd::amx_inputs_t::weight"], [281, 8, 1, "_CPPv4I0000EN2jd3ssd12amx_inputs_tE", "jd::ssd::amx_inputs_t::wgt_t"], [281, 1, 1, "_CPPv4N2jd3ssd17amx_int8_params_tE", "jd::ssd::amx_int8_params_t"], [281, 3, 1, "_CPPv4I0EN2jd3ssd12amx_params_tE", "jd::ssd::amx_params_t"], [281, 8, 1, "_CPPv4I0EN2jd3ssd12amx_params_tE", "jd::ssd::amx_params_t::T"], [281, 7, 1, "_CPPv4N2jd3ssd12amx_params_t16blocks_per_groupE", "jd::ssd::amx_params_t::blocks_per_group"], [281, 7, 1, "_CPPv4N2jd3ssd12amx_params_t9blocksizeE", "jd::ssd::amx_params_t::blocksize"], [281, 7, 1, "_CPPv4N2jd3ssd12amx_params_t7colidxsE", "jd::ssd::amx_params_t::colidxs"], [281, 7, 1, "_CPPv4N2jd3ssd12amx_params_t12group_rowptrE", "jd::ssd::amx_params_t::group_rowptr"], [281, 7, 1, "_CPPv4N2jd3ssd12amx_params_t8has_biasE", "jd::ssd::amx_params_t::has_bias"], [281, 7, 1, "_CPPv4N2jd3ssd12amx_params_t9nnz_groupE", "jd::ssd::amx_params_t::nnz_group"], [281, 7, 1, "_CPPv4N2jd3ssd12amx_params_t7nrowptrE", "jd::ssd::amx_params_t::nrowptr"], [281, 7, 1, "_CPPv4N2jd3ssd12amx_params_t9num_tileME", "jd::ssd::amx_params_t::num_tileM"], [281, 7, 1, "_CPPv4N2jd3ssd12amx_params_t12postop_attrsE", "jd::ssd::amx_params_t::postop_attrs"], [281, 7, 1, "_CPPv4N2jd3ssd12amx_params_t14same_src_dtypeE", "jd::ssd::amx_params_t::same_src_dtype"], [281, 7, 1, "_CPPv4N2jd3ssd12amx_params_t5shapeE", "jd::ssd::amx_params_t::shape"], [281, 7, 1, "_CPPv4N2jd3ssd12amx_params_t5tileME", "jd::ssd::amx_params_t::tileM"], [281, 7, 1, "_CPPv4N2jd3ssd12amx_params_t5tileNE", "jd::ssd::amx_params_t::tileN"], [281, 7, 1, "_CPPv4N2jd3ssd12amx_params_t6weightE", "jd::ssd::amx_params_t::weight"], [281, 3, 1, "_CPPv4N2jd3ssd13avx512_data_tE", "jd::ssd::avx512_data_t"], [281, 7, 1, "_CPPv4N2jd3ssd13avx512_data_t4biasE", "jd::ssd::avx512_data_t::bias"], [281, 7, 1, "_CPPv4N2jd3ssd13avx512_data_t5denseE", "jd::ssd::avx512_data_t::dense"], [281, 7, 1, "_CPPv4N2jd3ssd13avx512_data_t3dstE", "jd::ssd::avx512_data_t::dst"], [281, 7, 1, "_CPPv4N2jd3ssd13avx512_data_t6sparseE", "jd::ssd::avx512_data_t::sparse"], [281, 3, 1, "_CPPv4N2jd3ssd20avx512_fp32_params_tE", "jd::ssd::avx512_fp32_params_t"], [281, 7, 1, "_CPPv4N2jd3ssd20avx512_fp32_params_t1KE", "jd::ssd::avx512_fp32_params_t::K"], [281, 7, 1, "_CPPv4N2jd3ssd20avx512_fp32_params_t1ME", "jd::ssd::avx512_fp32_params_t::M"], [281, 7, 1, "_CPPv4N2jd3ssd20avx512_fp32_params_t1NE", "jd::ssd::avx512_fp32_params_t::N"], [281, 7, 1, "_CPPv4N2jd3ssd20avx512_fp32_params_t8has_biasE", "jd::ssd::avx512_fp32_params_t::has_bias"], [281, 7, 1, "_CPPv4N2jd3ssd20avx512_fp32_params_t6im_endE", "jd::ssd::avx512_fp32_params_t::im_end"], [281, 7, 1, "_CPPv4N2jd3ssd20avx512_fp32_params_t8im_startE", "jd::ssd::avx512_fp32_params_t::im_start"], [281, 7, 1, "_CPPv4N2jd3ssd20avx512_fp32_params_t6in_endE", "jd::ssd::avx512_fp32_params_t::in_end"], [281, 7, 1, "_CPPv4N2jd3ssd20avx512_fp32_params_t8in_startE", "jd::ssd::avx512_fp32_params_t::in_start"], [281, 7, 1, "_CPPv4N2jd3ssd20avx512_fp32_params_t12postop_attrsE", "jd::ssd::avx512_fp32_params_t::postop_attrs"], [281, 7, 1, "_CPPv4N2jd3ssd20avx512_fp32_params_t10sparse_ptrE", "jd::ssd::avx512_fp32_params_t::sparse_ptr"], [281, 2, 1, "_CPPv4N2jd3ssd20spec_translnorm_type6directE", "jd::ssd::direct"], [281, 3, 1, "_CPPv4N2jd3ssd16eltwiseop_data_tE", "jd::ssd::eltwiseop_data_t"], [281, 7, 1, "_CPPv4N2jd3ssd16eltwiseop_data_t3dstE", "jd::ssd::eltwiseop_data_t::dst"], [281, 7, 1, "_CPPv4N2jd3ssd16eltwiseop_data_t11element_numE", "jd::ssd::eltwiseop_data_t::element_num"], [281, 7, 1, "_CPPv4N2jd3ssd16eltwiseop_data_t3srcE", "jd::ssd::eltwiseop_data_t::src"], [281, 3, 1, "_CPPv4N2jd3ssd17eltwiseop_param_tE", "jd::ssd::eltwiseop_param_t"], [281, 7, 1, "_CPPv4N2jd3ssd17eltwiseop_param_t11element_numE", "jd::ssd::eltwiseop_param_t::element_num"], [281, 7, 1, "_CPPv4N2jd3ssd17eltwiseop_param_t19element_num_each_thE", "jd::ssd::eltwiseop_param_t::element_num_each_th"], [281, 7, 1, "_CPPv4N2jd3ssd17eltwiseop_param_t5in_dtE", "jd::ssd::eltwiseop_param_t::in_dt"], [281, 7, 1, "_CPPv4N2jd3ssd17eltwiseop_param_t6out_dtE", "jd::ssd::eltwiseop_param_t::out_dt"], [281, 7, 1, "_CPPv4N2jd3ssd17eltwiseop_param_t12postop_attrsE", "jd::ssd::eltwiseop_param_t::postop_attrs"], [281, 7, 1, "_CPPv4N2jd3ssd17eltwiseop_param_t14remain_elementE", "jd::ssd::eltwiseop_param_t::remain_element"], [281, 3, 1, "_CPPv4N2jd3ssd19layernorm_ba_data_tE", "jd::ssd::layernorm_ba_data_t"], [281, 7, 1, "_CPPv4N2jd3ssd19layernorm_ba_data_tUt1_3E", "jd::ssd::layernorm_ba_data_t::[anonymous]"], [281, 7, 1, "_CPPv4N2jd3ssd19layernorm_ba_data_t5alphaE", "jd::ssd::layernorm_ba_data_t::alpha"], [281, 7, 1, "_CPPv4N2jd3ssd19layernorm_ba_data_t4betaE", "jd::ssd::layernorm_ba_data_t::beta"], [281, 7, 1, "_CPPv4N2jd3ssd19layernorm_ba_data_t3dstE", "jd::ssd::layernorm_ba_data_t::dst"], [281, 7, 1, "_CPPv4N2jd3ssd19layernorm_ba_data_t4dst2E", "jd::ssd::layernorm_ba_data_t::dst2"], [281, 7, 1, "_CPPv4N2jd3ssd19layernorm_ba_data_t3epsE", "jd::ssd::layernorm_ba_data_t::eps"], [281, 7, 1, "_CPPv4N2jd3ssd19layernorm_ba_data_t4meanE", "jd::ssd::layernorm_ba_data_t::mean"], [281, 7, 1, "_CPPv4N2jd3ssd19layernorm_ba_data_t1nE", "jd::ssd::layernorm_ba_data_t::n"], [281, 7, 1, "_CPPv4N2jd3ssd19layernorm_ba_data_t3oneE", "jd::ssd::layernorm_ba_data_t::one"], [281, 7, 1, "_CPPv4N2jd3ssd19layernorm_ba_data_t11process_rowE", "jd::ssd::layernorm_ba_data_t::process_row"], [281, 7, 1, "_CPPv4N2jd3ssd19layernorm_ba_data_t3srcE", "jd::ssd::layernorm_ba_data_t::src"], [281, 7, 1, "_CPPv4N2jd3ssd19layernorm_ba_data_t3varE", "jd::ssd::layernorm_ba_data_t::var"], [281, 3, 1, "_CPPv4N2jd3ssd20layernorm_ba_param_tE", "jd::ssd::layernorm_ba_param_t"], [281, 7, 1, "_CPPv4N2jd3ssd20layernorm_ba_param_t9batch_numE", "jd::ssd::layernorm_ba_param_t::batch_num"], [281, 7, 1, "_CPPv4N2jd3ssd20layernorm_ba_param_t14binaryop_attrsE", "jd::ssd::layernorm_ba_param_t::binaryop_attrs"], [281, 7, 1, "_CPPv4N2jd3ssd20layernorm_ba_param_t7col_numE", "jd::ssd::layernorm_ba_param_t::col_num"], [281, 7, 1, "_CPPv4N2jd3ssd20layernorm_ba_param_t18direct_process_rowE", "jd::ssd::layernorm_ba_param_t::direct_process_row"], [281, 7, 1, "_CPPv4N2jd3ssd20layernorm_ba_param_t8input_dtE", "jd::ssd::layernorm_ba_param_t::input_dt"], [281, 7, 1, "_CPPv4N2jd3ssd20layernorm_ba_param_t13ker_per_batchE", "jd::ssd::layernorm_ba_param_t::ker_per_batch"], [281, 7, 1, "_CPPv4N2jd3ssd20layernorm_ba_param_t10output2_dtE", "jd::ssd::layernorm_ba_param_t::output2_dt"], [281, 7, 1, "_CPPv4N2jd3ssd20layernorm_ba_param_t9output_dtE", "jd::ssd::layernorm_ba_param_t::output_dt"], [281, 7, 1, "_CPPv4N2jd3ssd20layernorm_ba_param_t12postop_attrsE", "jd::ssd::layernorm_ba_param_t::postop_attrs"], [281, 7, 1, "_CPPv4N2jd3ssd20layernorm_ba_param_t21process_batch_per_kerE", "jd::ssd::layernorm_ba_param_t::process_batch_per_ker"], [281, 7, 1, "_CPPv4N2jd3ssd20layernorm_ba_param_t11process_colE", "jd::ssd::layernorm_ba_param_t::process_col"], [281, 7, 1, "_CPPv4N2jd3ssd20layernorm_ba_param_t7row_numE", "jd::ssd::layernorm_ba_param_t::row_num"], [281, 7, 1, "_CPPv4N2jd3ssd20layernorm_ba_param_t9spec_typeE", "jd::ssd::layernorm_ba_param_t::spec_type"], [281, 7, 1, "_CPPv4N2jd3ssd20layernorm_ba_param_t12split_outputE", "jd::ssd::layernorm_ba_param_t::split_output"], [281, 7, 1, "_CPPv4N2jd3ssd20layernorm_ba_param_t17thread_elt_offsetE", "jd::ssd::layernorm_ba_param_t::thread_elt_offset"], [281, 2, 1, "_CPPv4N2jd3ssd17spec_softmax_type3lutE", "jd::ssd::lut"], [281, 3, 1, "_CPPv4N2jd3ssd13matmul_data_tE", "jd::ssd::matmul_data_t"], [281, 7, 1, "_CPPv4N2jd3ssd13matmul_data_t3dstE", "jd::ssd::matmul_data_t::dst"], [281, 7, 1, "_CPPv4N2jd3ssd13matmul_data_t4src0E", "jd::ssd::matmul_data_t::src0"], [281, 7, 1, "_CPPv4N2jd3ssd13matmul_data_t4src1E", "jd::ssd::matmul_data_t::src1"], [281, 7, 1, "_CPPv4N2jd3ssd13matmul_data_t4src2E", "jd::ssd::matmul_data_t::src2"], [281, 3, 1, "_CPPv4N2jd3ssd17matmul_fp8_data_tE", "jd::ssd::matmul_fp8_data_t"], [281, 7, 1, "_CPPv4N2jd3ssd17matmul_fp8_data_t5alphaE", "jd::ssd::matmul_fp8_data_t::alpha"], [281, 7, 1, "_CPPv4N2jd3ssd17matmul_fp8_data_t5astepE", "jd::ssd::matmul_fp8_data_t::astep"], [281, 7, 1, "_CPPv4N2jd3ssd17matmul_fp8_data_t4betaE", "jd::ssd::matmul_fp8_data_t::beta"], [281, 7, 1, "_CPPv4N2jd3ssd17matmul_fp8_data_t5bstepE", "jd::ssd::matmul_fp8_data_t::bstep"], [281, 7, 1, "_CPPv4N2jd3ssd17matmul_fp8_data_t5cstepE", "jd::ssd::matmul_fp8_data_t::cstep"], [281, 7, 1, "_CPPv4N2jd3ssd17matmul_fp8_data_t5dstepE", "jd::ssd::matmul_fp8_data_t::dstep"], [281, 7, 1, "_CPPv4N2jd3ssd17matmul_fp8_data_t1kE", "jd::ssd::matmul_fp8_data_t::k"], [281, 7, 1, "_CPPv4N2jd3ssd17matmul_fp8_data_t4kposE", "jd::ssd::matmul_fp8_data_t::kpos"], [281, 7, 1, "_CPPv4N2jd3ssd17matmul_fp8_data_t4matAE", "jd::ssd::matmul_fp8_data_t::matA"], [281, 7, 1, "_CPPv4N2jd3ssd17matmul_fp8_data_t4matBE", "jd::ssd::matmul_fp8_data_t::matB"], [281, 7, 1, "_CPPv4N2jd3ssd17matmul_fp8_data_t4matCE", "jd::ssd::matmul_fp8_data_t::matC"], [281, 7, 1, "_CPPv4N2jd3ssd17matmul_fp8_data_t4matDE", "jd::ssd::matmul_fp8_data_t::matD"], [281, 7, 1, "_CPPv4N2jd3ssd17matmul_fp8_data_t4matEE", "jd::ssd::matmul_fp8_data_t::matE"], [281, 7, 1, "_CPPv4N2jd3ssd17matmul_fp8_data_t1nE", "jd::ssd::matmul_fp8_data_t::n"], [281, 7, 1, "_CPPv4N2jd3ssd17matmul_fp8_data_t5scaleE", "jd::ssd::matmul_fp8_data_t::scale"], [281, 3, 1, "_CPPv4N2jd3ssd18matmul_fp8_param_tE", "jd::ssd::matmul_fp8_param_t"], [281, 7, 1, "_CPPv4N2jd3ssd18matmul_fp8_param_tUt1_5E", "jd::ssd::matmul_fp8_param_t::[anonymous]"], [281, 7, 1, "_CPPv4N2jd3ssd18matmul_fp8_param_t1KE", "jd::ssd::matmul_fp8_param_t::K"], [281, 7, 1, "_CPPv4N2jd3ssd18matmul_fp8_param_t1ME", "jd::ssd::matmul_fp8_param_t::M"], [281, 7, 1, "_CPPv4N2jd3ssd18matmul_fp8_param_t1NE", "jd::ssd::matmul_fp8_param_t::N"], [281, 7, 1, "_CPPv4N2jd3ssd18matmul_fp8_param_t5alphaE", "jd::ssd::matmul_fp8_param_t::alpha"], [281, 7, 1, "_CPPv4N2jd3ssd18matmul_fp8_param_t4betaE", "jd::ssd::matmul_fp8_param_t::beta"], [281, 7, 1, "_CPPv4N2jd3ssd18matmul_fp8_param_t14has_append_sumE", "jd::ssd::matmul_fp8_param_t::has_append_sum"], [281, 7, 1, "_CPPv4N2jd3ssd18matmul_fp8_param_t10has_scale0E", "jd::ssd::matmul_fp8_param_t::has_scale0"], [281, 7, 1, "_CPPv4N2jd3ssd18matmul_fp8_param_t12postop_attrsE", "jd::ssd::matmul_fp8_param_t::postop_attrs"], [281, 7, 1, "_CPPv4N2jd3ssd18matmul_fp8_param_t10thread_numE", "jd::ssd::matmul_fp8_param_t::thread_num"], [281, 7, 1, "_CPPv4N2jd3ssd18matmul_fp8_param_t11weight_8bitE", "jd::ssd::matmul_fp8_param_t::weight_8bit"], [281, 7, 1, "_CPPv4N2jd3ssd18matmul_fp8_param_t11weight_bf16E", "jd::ssd::matmul_fp8_param_t::weight_bf16"], [281, 7, 1, "_CPPv4N2jd3ssd18matmul_fp8_param_t14weight_f8_e4m3E", "jd::ssd::matmul_fp8_param_t::weight_f8_e4m3"], [281, 7, 1, "_CPPv4N2jd3ssd18matmul_fp8_param_t14weight_f8_e5m2E", "jd::ssd::matmul_fp8_param_t::weight_f8_e5m2"], [281, 7, 1, "_CPPv4N2jd3ssd18matmul_fp8_param_t11weight_int8E", "jd::ssd::matmul_fp8_param_t::weight_int8"], [281, 7, 1, "_CPPv4N2jd3ssd18matmul_fp8_param_t11weight_typeE", "jd::ssd::matmul_fp8_param_t::weight_type"], [281, 1, 1, "_CPPv4N2jd3ssd12matmul_inputE", "jd::ssd::matmul_input"], [281, 2, 1, "_CPPv4N2jd3ssd12matmul_input5input10APPEND_SUME", "jd::ssd::matmul_input::APPEND_SUM"], [281, 2, 1, "_CPPv4N2jd3ssd12matmul_input5input6SCALE0E", "jd::ssd::matmul_input::SCALE0"], [281, 2, 1, "_CPPv4N2jd3ssd12matmul_input5input4SRC0E", "jd::ssd::matmul_input::SRC0"], [281, 2, 1, "_CPPv4N2jd3ssd12matmul_input5input4SRC1E", "jd::ssd::matmul_input::SRC1"], [281, 2, 1, "_CPPv4N2jd3ssd12matmul_input5input4SRC2E", "jd::ssd::matmul_input::SRC2"], [281, 2, 1, "_CPPv4N2jd3ssd12matmul_input5input3ZP0E", "jd::ssd::matmul_input::ZP0"], [281, 6, 1, "_CPPv4N2jd3ssd12matmul_input5inputE", "jd::ssd::matmul_input::input"], [281, 2, 1, "_CPPv4N2jd3ssd12matmul_input5input10APPEND_SUME", "jd::ssd::matmul_input::input::APPEND_SUM"], [281, 2, 1, "_CPPv4N2jd3ssd12matmul_input5input6SCALE0E", "jd::ssd::matmul_input::input::SCALE0"], [281, 2, 1, "_CPPv4N2jd3ssd12matmul_input5input4SRC0E", "jd::ssd::matmul_input::input::SRC0"], [281, 2, 1, "_CPPv4N2jd3ssd12matmul_input5input4SRC1E", "jd::ssd::matmul_input::input::SRC1"], [281, 2, 1, "_CPPv4N2jd3ssd12matmul_input5input4SRC2E", "jd::ssd::matmul_input::input::SRC2"], [281, 2, 1, "_CPPv4N2jd3ssd12matmul_input5input3ZP0E", "jd::ssd::matmul_input::input::ZP0"], [281, 2, 1, "_CPPv4N2jd3ssd12matmul_input5input13matmul_io_MAXE", "jd::ssd::matmul_input::input::matmul_io_MAX"], [281, 2, 1, "_CPPv4N2jd3ssd12matmul_input5input13matmul_io_MAXE", "jd::ssd::matmul_input::matmul_io_MAX"], [281, 1, 1, "_CPPv4N2jd3ssd9matmul_ioE", "jd::ssd::matmul_io"], [281, 2, 1, "_CPPv4N2jd3ssd9matmul_io2io10APPEND_SUME", "jd::ssd::matmul_io::APPEND_SUM"], [281, 2, 1, "_CPPv4N2jd3ssd9matmul_io2io4DST0E", "jd::ssd::matmul_io::DST0"], [281, 2, 1, "_CPPv4N2jd3ssd9matmul_io2io6SCALE0E", "jd::ssd::matmul_io::SCALE0"], [281, 2, 1, "_CPPv4N2jd3ssd9matmul_io2io4SRC0E", "jd::ssd::matmul_io::SRC0"], [281, 2, 1, "_CPPv4N2jd3ssd9matmul_io2io4SRC1E", "jd::ssd::matmul_io::SRC1"], [281, 2, 1, "_CPPv4N2jd3ssd9matmul_io2io4SRC2E", "jd::ssd::matmul_io::SRC2"], [281, 2, 1, "_CPPv4N2jd3ssd9matmul_io2io3ZP0E", "jd::ssd::matmul_io::ZP0"], [281, 6, 1, "_CPPv4N2jd3ssd9matmul_io2ioE", "jd::ssd::matmul_io::io"], [281, 2, 1, "_CPPv4N2jd3ssd9matmul_io2io10APPEND_SUME", "jd::ssd::matmul_io::io::APPEND_SUM"], [281, 2, 1, "_CPPv4N2jd3ssd9matmul_io2io4DST0E", "jd::ssd::matmul_io::io::DST0"], [281, 2, 1, "_CPPv4N2jd3ssd9matmul_io2io6SCALE0E", "jd::ssd::matmul_io::io::SCALE0"], [281, 2, 1, "_CPPv4N2jd3ssd9matmul_io2io4SRC0E", "jd::ssd::matmul_io::io::SRC0"], [281, 2, 1, "_CPPv4N2jd3ssd9matmul_io2io4SRC1E", "jd::ssd::matmul_io::io::SRC1"], [281, 2, 1, "_CPPv4N2jd3ssd9matmul_io2io4SRC2E", "jd::ssd::matmul_io::io::SRC2"], [281, 2, 1, "_CPPv4N2jd3ssd9matmul_io2io3ZP0E", "jd::ssd::matmul_io::io::ZP0"], [281, 2, 1, "_CPPv4N2jd3ssd9matmul_io2io13matmul_io_MAXE", "jd::ssd::matmul_io::io::matmul_io_MAX"], [281, 2, 1, "_CPPv4N2jd3ssd9matmul_io2io13matmul_io_MAXE", "jd::ssd::matmul_io::matmul_io_MAX"], [281, 1, 1, "_CPPv4N2jd3ssd13matmul_outputE", "jd::ssd::matmul_output"], [281, 2, 1, "_CPPv4N2jd3ssd13matmul_output6output4DST0E", "jd::ssd::matmul_output::DST0"], [281, 6, 1, "_CPPv4N2jd3ssd13matmul_output6outputE", "jd::ssd::matmul_output::output"], [281, 2, 1, "_CPPv4N2jd3ssd13matmul_output6output4DST0E", "jd::ssd::matmul_output::output::DST0"], [281, 3, 1, "_CPPv4N2jd3ssd14matmul_param_tE", "jd::ssd::matmul_param_t"], [281, 7, 1, "_CPPv4N2jd3ssd14matmul_param_t1KE", "jd::ssd::matmul_param_t::K"], [281, 7, 1, "_CPPv4N2jd3ssd14matmul_param_t1ME", "jd::ssd::matmul_param_t::M"], [281, 7, 1, "_CPPv4N2jd3ssd14matmul_param_t1NE", "jd::ssd::matmul_param_t::N"], [281, 7, 1, "_CPPv4N2jd3ssd14matmul_param_t5alphaE", "jd::ssd::matmul_param_t::alpha"], [281, 7, 1, "_CPPv4N2jd3ssd14matmul_param_t5batchE", "jd::ssd::matmul_param_t::batch"], [281, 7, 1, "_CPPv4N2jd3ssd14matmul_param_t4betaE", "jd::ssd::matmul_param_t::beta"], [281, 7, 1, "_CPPv4N2jd3ssd14matmul_param_t6m_tileE", "jd::ssd::matmul_param_t::m_tile"], [281, 7, 1, "_CPPv4N2jd3ssd14matmul_param_t6n_tileE", "jd::ssd::matmul_param_t::n_tile"], [281, 3, 1, "_CPPv4N2jd3ssd16matmul_u8_data_tE", "jd::ssd::matmul_u8_data_t"], [281, 7, 1, "_CPPv4N2jd3ssd16matmul_u8_data_t3dstE", "jd::ssd::matmul_u8_data_t::dst"], [281, 7, 1, "_CPPv4N2jd3ssd16matmul_u8_data_t5scaleE", "jd::ssd::matmul_u8_data_t::scale"], [281, 7, 1, "_CPPv4N2jd3ssd16matmul_u8_data_t4src0E", "jd::ssd::matmul_u8_data_t::src0"], [281, 7, 1, "_CPPv4N2jd3ssd16matmul_u8_data_t4src1E", "jd::ssd::matmul_u8_data_t::src1"], [281, 7, 1, "_CPPv4N2jd3ssd16matmul_u8_data_t2zpE", "jd::ssd::matmul_u8_data_t::zp"], [281, 3, 1, "_CPPv4N2jd3ssd22mean_var_reduce_data_tE", "jd::ssd::mean_var_reduce_data_t"], [281, 7, 1, "_CPPv4N2jd3ssd22mean_var_reduce_data_t7mean_inE", "jd::ssd::mean_var_reduce_data_t::mean_in"], [281, 7, 1, "_CPPv4N2jd3ssd22mean_var_reduce_data_t8mean_outE", "jd::ssd::mean_var_reduce_data_t::mean_out"], [281, 7, 1, "_CPPv4N2jd3ssd22mean_var_reduce_data_t6var_inE", "jd::ssd::mean_var_reduce_data_t::var_in"], [281, 7, 1, "_CPPv4N2jd3ssd22mean_var_reduce_data_t7var_outE", "jd::ssd::mean_var_reduce_data_t::var_out"], [281, 3, 1, "_CPPv4N2jd3ssd23mean_var_reduce_param_tE", "jd::ssd::mean_var_reduce_param_t"], [281, 7, 1, "_CPPv4N2jd3ssd23mean_var_reduce_param_t2BME", "jd::ssd::mean_var_reduce_param_t::BM"], [281, 7, 1, "_CPPv4N2jd3ssd23mean_var_reduce_param_t2BNE", "jd::ssd::mean_var_reduce_param_t::BN"], [281, 7, 1, "_CPPv4N2jd3ssd23mean_var_reduce_param_t1ME", "jd::ssd::mean_var_reduce_param_t::M"], [281, 7, 1, "_CPPv4N2jd3ssd23mean_var_reduce_param_t1NE", "jd::ssd::mean_var_reduce_param_t::N"], [281, 7, 1, "_CPPv4N2jd3ssd23mean_var_reduce_param_t11element_numE", "jd::ssd::mean_var_reduce_param_t::element_num"], [281, 2, 1, "_CPPv4N2jd3ssd20spec_translnorm_type6normalE", "jd::ssd::normal"], [281, 3, 1, "_CPPv4N2jd3ssd20seq_vnni_copy_paramsE", "jd::ssd::seq_vnni_copy_params"], [281, 7, 1, "_CPPv4N2jd3ssd20seq_vnni_copy_params6dstptrE", "jd::ssd::seq_vnni_copy_params::dstptr"], [281, 7, 1, "_CPPv4N2jd3ssd20seq_vnni_copy_params9dststrideE", "jd::ssd::seq_vnni_copy_params::dststride"], [281, 7, 1, "_CPPv4N2jd3ssd20seq_vnni_copy_params1kE", "jd::ssd::seq_vnni_copy_params::k"], [281, 7, 1, "_CPPv4N2jd3ssd20seq_vnni_copy_params6srcptrE", "jd::ssd::seq_vnni_copy_params::srcptr"], [281, 7, 1, "_CPPv4N2jd3ssd20seq_vnni_copy_params9srcstrideE", "jd::ssd::seq_vnni_copy_params::srcstride"], [281, 3, 1, "_CPPv4N2jd3ssd14softmax_data_tE", "jd::ssd::softmax_data_t"], [281, 7, 1, "_CPPv4N2jd3ssd14softmax_data_t3dstE", "jd::ssd::softmax_data_t::dst"], [281, 7, 1, "_CPPv4N2jd3ssd14softmax_data_t3oneE", "jd::ssd::softmax_data_t::one"], [281, 7, 1, "_CPPv4N2jd3ssd14softmax_data_t15process_vec_numE", "jd::ssd::softmax_data_t::process_vec_num"], [281, 7, 1, "_CPPv4N2jd3ssd14softmax_data_t3srcE", "jd::ssd::softmax_data_t::src"], [281, 7, 1, "_CPPv4N2jd3ssd14softmax_data_t3tmpE", "jd::ssd::softmax_data_t::tmp"], [281, 3, 1, "_CPPv4N2jd3ssd15softmax_param_tE", "jd::ssd::softmax_param_t"], [281, 7, 1, "_CPPv4N2jd3ssd15softmax_param_t17get_lut_exp_attrsE", "jd::ssd::softmax_param_t::get_lut_exp_attrs"], [281, 7, 1, "_CPPv4N2jd3ssd15softmax_param_t8input_dtE", "jd::ssd::softmax_param_t::input_dt"], [281, 7, 1, "_CPPv4N2jd3ssd15softmax_param_t9output_dtE", "jd::ssd::softmax_param_t::output_dt"], [281, 7, 1, "_CPPv4N2jd3ssd15softmax_param_t12postop_attrsE", "jd::ssd::softmax_param_t::postop_attrs"], [281, 7, 1, "_CPPv4N2jd3ssd15softmax_param_t10scalar_numE", "jd::ssd::softmax_param_t::scalar_num"], [281, 7, 1, "_CPPv4N2jd3ssd15softmax_param_t9sepc_typeE", "jd::ssd::softmax_param_t::sepc_type"], [281, 7, 1, "_CPPv4N2jd3ssd15softmax_param_t13vec_align_lenE", "jd::ssd::softmax_param_t::vec_align_len"], [281, 7, 1, "_CPPv4N2jd3ssd15softmax_param_t15vec_num_per_thrE", "jd::ssd::softmax_param_t::vec_num_per_thr"], [281, 7, 1, "_CPPv4N2jd3ssd15softmax_param_t16vec_num_tail_thrE", "jd::ssd::softmax_param_t::vec_num_tail_thr"], [281, 7, 1, "_CPPv4N2jd3ssd15softmax_param_t12vec_tail_lenE", "jd::ssd::softmax_param_t::vec_tail_len"], [281, 6, 1, "_CPPv4N2jd3ssd13sparse_schemeE", "jd::ssd::sparse_scheme"], [281, 2, 1, "_CPPv4N2jd3ssd13sparse_scheme14dense_x_sparseE", "jd::ssd::sparse_scheme::dense_x_sparse"], [281, 2, 1, "_CPPv4N2jd3ssd13sparse_scheme14sparse_x_denseE", "jd::ssd::sparse_scheme::sparse_x_dense"], [281, 2, 1, "_CPPv4N2jd3ssd13sparse_scheme15sparse_x_sparseE", "jd::ssd::sparse_scheme::sparse_x_sparse"], [281, 2, 1, "_CPPv4N2jd3ssd13sparse_scheme5undefE", "jd::ssd::sparse_scheme::undef"], [281, 6, 1, "_CPPv4N2jd3ssd17spec_softmax_typeE", "jd::ssd::spec_softmax_type"], [281, 2, 1, "_CPPv4N2jd3ssd17spec_softmax_type3lutE", "jd::ssd::spec_softmax_type::lut"], [281, 6, 1, "_CPPv4N2jd3ssd20spec_translnorm_typeE", "jd::ssd::spec_translnorm_type"], [281, 2, 1, "_CPPv4N2jd3ssd20spec_translnorm_type6directE", "jd::ssd::spec_translnorm_type::direct"], [281, 2, 1, "_CPPv4N2jd3ssd20spec_translnorm_type6normalE", "jd::ssd::spec_translnorm_type::normal"], [281, 6, 1, "_CPPv4N2jd3ssd13subfunc_levelE", "jd::ssd::subfunc_level"], [281, 2, 1, "_CPPv4N2jd3ssd13subfunc_level5kdimsE", "jd::ssd::subfunc_level::kdims"], [281, 2, 1, "_CPPv4N2jd3ssd13subfunc_level9non_kdimsE", "jd::ssd::subfunc_level::non_kdims"], [281, 2, 1, "_CPPv4N2jd3ssd13subfunc_level4noneE", "jd::ssd::subfunc_level::none"], [281, 2, 1, "_CPPv4N2jd3ssd13subfunc_level17subfunc_level_MAXE", "jd::ssd::subfunc_level::subfunc_level_MAX"], [281, 3, 1, "_CPPv4N2jd3ssd21transpose_copy_paramsE", "jd::ssd::transpose_copy_params"], [281, 7, 1, "_CPPv4N2jd3ssd21transpose_copy_params6dstptrE", "jd::ssd::transpose_copy_params::dstptr"], [281, 7, 1, "_CPPv4N2jd3ssd21transpose_copy_params9dststrideE", "jd::ssd::transpose_copy_params::dststride"], [281, 7, 1, "_CPPv4N2jd3ssd21transpose_copy_params1kE", "jd::ssd::transpose_copy_params::k"], [281, 7, 1, "_CPPv4N2jd3ssd21transpose_copy_params6srcptrE", "jd::ssd::transpose_copy_params::srcptr"], [281, 7, 1, "_CPPv4N2jd3ssd21transpose_copy_params9srcstrideE", "jd::ssd::transpose_copy_params::srcstride"], [281, 1, 1, "_CPPv4N2jd3ssd16transpose_mha_ioE", "jd::ssd::transpose_mha_io"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io5BATCHE", "jd::ssd::transpose_mha_io::BATCH"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io3DSTE", "jd::ssd::transpose_mha_io::DST"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io8HEAD_NUME", "jd::ssd::transpose_mha_io::HEAD_NUM"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io9HEAD_SIZEE", "jd::ssd::transpose_mha_io::HEAD_SIZE"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io4MASKE", "jd::ssd::transpose_mha_io::MASK"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io9SCALE_DSTE", "jd::ssd::transpose_mha_io::SCALE_DST"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io7SCALE_KE", "jd::ssd::transpose_mha_io::SCALE_K"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io7SCALE_QE", "jd::ssd::transpose_mha_io::SCALE_Q"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io7SCALE_VE", "jd::ssd::transpose_mha_io::SCALE_V"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io7SEQ_LENE", "jd::ssd::transpose_mha_io::SEQ_LEN"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io6SL_PADE", "jd::ssd::transpose_mha_io::SL_PAD"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io5SRC_KE", "jd::ssd::transpose_mha_io::SRC_K"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io5SRC_QE", "jd::ssd::transpose_mha_io::SRC_Q"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io5SRC_VE", "jd::ssd::transpose_mha_io::SRC_V"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io5TMP2ME", "jd::ssd::transpose_mha_io::TMP2M"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io6ZP_DSTE", "jd::ssd::transpose_mha_io::ZP_DST"], [281, 6, 1, "_CPPv4N2jd3ssd16transpose_mha_io2ioE", "jd::ssd::transpose_mha_io::io"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io5BATCHE", "jd::ssd::transpose_mha_io::io::BATCH"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io3DSTE", "jd::ssd::transpose_mha_io::io::DST"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io8HEAD_NUME", "jd::ssd::transpose_mha_io::io::HEAD_NUM"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io9HEAD_SIZEE", "jd::ssd::transpose_mha_io::io::HEAD_SIZE"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io4MASKE", "jd::ssd::transpose_mha_io::io::MASK"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io9SCALE_DSTE", "jd::ssd::transpose_mha_io::io::SCALE_DST"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io7SCALE_KE", "jd::ssd::transpose_mha_io::io::SCALE_K"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io7SCALE_QE", "jd::ssd::transpose_mha_io::io::SCALE_Q"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io7SCALE_VE", "jd::ssd::transpose_mha_io::io::SCALE_V"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io7SEQ_LENE", "jd::ssd::transpose_mha_io::io::SEQ_LEN"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io6SL_PADE", "jd::ssd::transpose_mha_io::io::SL_PAD"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io5SRC_KE", "jd::ssd::transpose_mha_io::io::SRC_K"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io5SRC_QE", "jd::ssd::transpose_mha_io::io::SRC_Q"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io5SRC_VE", "jd::ssd::transpose_mha_io::io::SRC_V"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io5TMP2ME", "jd::ssd::transpose_mha_io::io::TMP2M"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io6ZP_DSTE", "jd::ssd::transpose_mha_io::io::ZP_DST"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io20transpose_mha_io_MAXE", "jd::ssd::transpose_mha_io::io::transpose_mha_io_MAX"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io20transpose_mha_io_MAXE", "jd::ssd::transpose_mha_io::transpose_mha_io_MAX"], [281, 3, 1, "_CPPv4N2jd3ssd26transpose_mha_step1_paramsE", "jd::ssd::transpose_mha_step1_params"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step1_params5astepE", "jd::ssd::transpose_mha_step1_params::astep"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step1_params6batchkE", "jd::ssd::transpose_mha_step1_params::batchk"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step1_params10cbatchstepE", "jd::ssd::transpose_mha_step1_params::cbatchstep"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step1_params3cfgE", "jd::ssd::transpose_mha_step1_params::cfg"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step1_params5cstepE", "jd::ssd::transpose_mha_step1_params::cstep"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step1_params6expsumE", "jd::ssd::transpose_mha_step1_params::expsum"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step1_params1kE", "jd::ssd::transpose_mha_step1_params::k"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step1_params1mE", "jd::ssd::transpose_mha_step1_params::m"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step1_params4matAE", "jd::ssd::transpose_mha_step1_params::matA"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step1_params4matBE", "jd::ssd::transpose_mha_step1_params::matB"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step1_params4matCE", "jd::ssd::transpose_mha_step1_params::matC"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step1_params4matDE", "jd::ssd::transpose_mha_step1_params::matD"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step1_params7scaleABE", "jd::ssd::transpose_mha_step1_params::scaleAB"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step1_params7sumstepE", "jd::ssd::transpose_mha_step1_params::sumstep"], [281, 3, 1, "_CPPv4N2jd3ssd26transpose_mha_step2_paramsE", "jd::ssd::transpose_mha_step2_params"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step2_params6dstptrE", "jd::ssd::transpose_mha_step2_params::dstptr"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step2_params9dststrideE", "jd::ssd::transpose_mha_step2_params::dststride"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step2_params1kE", "jd::ssd::transpose_mha_step2_params::k"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step2_params6srcptrE", "jd::ssd::transpose_mha_step2_params::srcptr"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step2_params9srcstrideE", "jd::ssd::transpose_mha_step2_params::srcstride"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step2_params6sumptrE", "jd::ssd::transpose_mha_step2_params::sumptr"], [281, 3, 1, "_CPPv4N2jd3ssd26transpose_mha_step3_paramsE", "jd::ssd::transpose_mha_step3_params"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step3_params5astepE", "jd::ssd::transpose_mha_step3_params::astep"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step3_params3cfgE", "jd::ssd::transpose_mha_step3_params::cfg"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step3_params5cstepE", "jd::ssd::transpose_mha_step3_params::cstep"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step3_params1kE", "jd::ssd::transpose_mha_step3_params::k"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step3_params4matAE", "jd::ssd::transpose_mha_step3_params::matA"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step3_params4matBE", "jd::ssd::transpose_mha_step3_params::matB"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step3_params4matCE", "jd::ssd::transpose_mha_step3_params::matC"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step3_params7scaleABE", "jd::ssd::transpose_mha_step3_params::scaleAB"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step3_params6scaleCE", "jd::ssd::transpose_mha_step3_params::scaleC"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step3_params10zeropointCE", "jd::ssd::transpose_mha_step3_params::zeropointC"], [281, 3, 1, "_CPPv4I0EN2jd3ssd11vnni_data_tE", "jd::ssd::vnni_data_t"], [281, 8, 1, "_CPPv4I0EN2jd3ssd11vnni_data_tE", "jd::ssd::vnni_data_t::dst_t"], [281, 7, 1, "_CPPv4N2jd3ssd11vnni_data_t8ptr_biasE", "jd::ssd::vnni_data_t::ptr_bias"], [281, 7, 1, "_CPPv4N2jd3ssd11vnni_data_t9ptr_denseE", "jd::ssd::vnni_data_t::ptr_dense"], [281, 7, 1, "_CPPv4N2jd3ssd11vnni_data_t7ptr_dstE", "jd::ssd::vnni_data_t::ptr_dst"], [281, 7, 1, "_CPPv4N2jd3ssd11vnni_data_t10ptr_dst_m1E", "jd::ssd::vnni_data_t::ptr_dst_m1"], [281, 7, 1, "_CPPv4N2jd3ssd11vnni_data_t10ptr_dst_m2E", "jd::ssd::vnni_data_t::ptr_dst_m2"], [281, 7, 1, "_CPPv4N2jd3ssd11vnni_data_t10ptr_scalesE", "jd::ssd::vnni_data_t::ptr_scales"], [281, 3, 1, "_CPPv4N2jd3ssd12vnni_param_tE", "jd::ssd::vnni_param_t"], [281, 7, 1, "_CPPv4N2jd3ssd12vnni_param_t2BME", "jd::ssd::vnni_param_t::BM"], [281, 7, 1, "_CPPv4N2jd3ssd12vnni_param_t2BNE", "jd::ssd::vnni_param_t::BN"], [281, 7, 1, "_CPPv4N2jd3ssd12vnni_param_t10append_sumE", "jd::ssd::vnni_param_t::append_sum"], [281, 7, 1, "_CPPv4N2jd3ssd12vnni_param_t9blocksizeE", "jd::ssd::vnni_param_t::blocksize"], [281, 7, 1, "_CPPv4N2jd3ssd12vnni_param_t8has_biasE", "jd::ssd::vnni_param_t::has_bias"], [281, 7, 1, "_CPPv4N2jd3ssd12vnni_param_t8im_startE", "jd::ssd::vnni_param_t::im_start"], [281, 7, 1, "_CPPv4N2jd3ssd12vnni_param_t7indicesE", "jd::ssd::vnni_param_t::indices"], [281, 7, 1, "_CPPv4N2jd3ssd12vnni_param_t6indptrE", "jd::ssd::vnni_param_t::indptr"], [281, 7, 1, "_CPPv4N2jd3ssd12vnni_param_t11output_typeE", "jd::ssd::vnni_param_t::output_type"], [281, 7, 1, "_CPPv4N2jd3ssd12vnni_param_t12postop_attrsE", "jd::ssd::vnni_param_t::postop_attrs"], [281, 7, 1, "_CPPv4N2jd3ssd12vnni_param_t8sub_funcE", "jd::ssd::vnni_param_t::sub_func"], [281, 7, 1, "_CPPv4N2jd3ssd12vnni_param_t6tile_wE", "jd::ssd::vnni_param_t::tile_w"], [281, 7, 1, "_CPPv4N2jd3ssd12vnni_param_t6weightE", "jd::ssd::vnni_param_t::weight"], [281, 7, 1, "_CPPv4N2jd3ssd12vnni_param_t7welfordE", "jd::ssd::vnni_param_t::welford"], [279, 3, 1, "_CPPv4N2jd16transpose_matmulE", "jd::transpose_matmul"], [279, 4, 1, "_CPPv4N2jd16transpose_matmul16transpose_matmulERK17kernel_desc_proxy", "jd::transpose_matmul::transpose_matmul"], [279, 4, 1, "_CPPv4N2jd16transpose_matmul16transpose_matmulEv", "jd::transpose_matmul::transpose_matmul"], [279, 5, 1, "_CPPv4N2jd16transpose_matmul16transpose_matmulERK17kernel_desc_proxy", "jd::transpose_matmul::transpose_matmul::kdp"], [279, 4, 1, "_CPPv4N2jd16transpose_matmulD0Ev", "jd::transpose_matmul::~transpose_matmul"], [279, 3, 1, "_CPPv4N2jd21transpose_matmul_descE", "jd::transpose_matmul_desc"], [279, 4, 1, "_CPPv4N2jd21transpose_matmul_desc21transpose_matmul_descERK13operator_desc", "jd::transpose_matmul_desc::transpose_matmul_desc"], [279, 4, 1, "_CPPv4N2jd21transpose_matmul_desc21transpose_matmul_descEv", "jd::transpose_matmul_desc::transpose_matmul_desc"], [279, 5, 1, "_CPPv4N2jd21transpose_matmul_desc21transpose_matmul_descERK13operator_desc", "jd::transpose_matmul_desc::transpose_matmul_desc::op_desc"], [279, 4, 1, "_CPPv4N2jd21transpose_matmul_descD0Ev", "jd::transpose_matmul_desc::~transpose_matmul_desc"], [279, 3, 1, "_CPPv4N2jd13transpose_mhaE", "jd::transpose_mha"], [279, 4, 1, "_CPPv4N2jd13transpose_mha13transpose_mhaERK17kernel_desc_proxy", "jd::transpose_mha::transpose_mha"], [279, 4, 1, "_CPPv4N2jd13transpose_mha13transpose_mhaEv", "jd::transpose_mha::transpose_mha"], [279, 5, 1, "_CPPv4N2jd13transpose_mha13transpose_mhaERK17kernel_desc_proxy", "jd::transpose_mha::transpose_mha::kdp"], [279, 4, 1, "_CPPv4N2jd13transpose_mhaD0Ev", "jd::transpose_mha::~transpose_mha"], [279, 3, 1, "_CPPv4N2jd18transpose_mha_descE", "jd::transpose_mha_desc"], [279, 4, 1, "_CPPv4N2jd18transpose_mha_desc18transpose_mha_descERK13operator_desc", "jd::transpose_mha_desc::transpose_mha_desc"], [279, 4, 1, "_CPPv4N2jd18transpose_mha_desc18transpose_mha_descEv", "jd::transpose_mha_desc::transpose_mha_desc"], [279, 5, 1, "_CPPv4N2jd18transpose_mha_desc18transpose_mha_descERK13operator_desc", "jd::transpose_mha_desc::transpose_mha_desc::op_desc"], [279, 4, 1, "_CPPv4N2jd18transpose_mha_descD0Ev", "jd::transpose_mha_desc::~transpose_mha_desc"], [0, 9, 0, "-", "conversation"], [1, 9, 0, "-", "gaudi_spawn"], [253, 9, 0, "-", "main_eval_only"], [254, 9, 0, "-", "main_parse_and_eval"], [262, 9, 0, "-", "text"]], "conversation": [[0, 10, 1, "", "Conversation"], [0, 10, 1, "", "SeparatorStyle"], [0, 12, 1, "", "get_conv_template"], [0, 12, 1, "", "register_conv_template"]], "conversation.Conversation": [[0, 11, 1, "", "append_message"], [0, 11, 1, "", "convert_image_to_base64"], [0, 11, 1, "", "get_prompt"], [0, 11, 1, "", "set_system_message"], [0, 11, 1, "", "to_gradio_chatbot"], [0, 11, 1, "", "to_openai_api_messages"], [0, 11, 1, "", "update_last_message"]], "gaudi_spawn": [[1, 12, 1, "", "parse_args"]], "intel_extension_for_transformers.langchain.langchain_community.retrievers": [[2, 9, 0, "-", "child_parent_retriever"]], "intel_extension_for_transformers.langchain.langchain_community.retrievers.child_parent_retriever": [[2, 10, 1, "", "ChildParentRetriever"], [2, 10, 1, "", "SearchType"]], "intel_extension_for_transformers.langchain.langchain_community.retrievers.child_parent_retriever.ChildParentRetriever": [[2, 13, 1, "", "search_kwargs"], [2, 13, 1, "", "search_type"]], "intel_extension_for_transformers.langchain.langchain_community.retrievers.child_parent_retriever.SearchType": [[2, 13, 1, "", "mmr"], [2, 13, 1, "", "similarity"]], "intel_extension_for_transformers.langchain.langchain_community.vectorstores": [[3, 9, 0, "-", "chroma"]], "intel_extension_for_transformers.neural_chat": [[4, 9, 0, "-", "chatbot"], [5, 9, 0, "-", "config"], [6, 9, 0, "-", "config_logging"], [7, 9, 0, "-", "errorcode"], [8, 9, 0, "-", "pipeline"]], "intel_extension_for_transformers.neural_chat.chatbot": [[4, 12, 1, "", "build_chatbot"], [4, 12, 1, "", "finetune_model"], [4, 12, 1, "", "optimize_model"]], "intel_extension_for_transformers.neural_chat.config": [[5, 10, 1, "", "AudioLanguageOptions"], [5, 10, 1, "", "BackendOptions"], [5, 10, 1, "", "DataArguments"], [5, 10, 1, "", "DeviceOptions"], [5, 10, 1, "", "FinetuningArguments"], [5, 10, 1, "", "ModelArguments"], [5, 10, 1, "", "RetrievalTypeOptions"]], "intel_extension_for_transformers.neural_chat.config_logging": [[6, 12, 1, "", "configure_logging"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.image2image": [[9, 9, 0, "-", "instructpix2pix_pipeline"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.image2image.instructpix2pix_pipeline": [[9, 10, 1, "", "StableDiffusionInstructPix2PixPipeline"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.image2image.instructpix2pix_pipeline.StableDiffusionInstructPix2PixPipeline": [[9, 11, 1, "", "enable_sequential_cpu_offload"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.memory": [[10, 9, 0, "-", "memory"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval": [[14, 9, 0, "-", "retriever_adapter"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.detector": [[11, 9, 0, "-", "intent_detection"], [12, 9, 0, "-", "query_explainer"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.parser": [[13, 9, 0, "-", "parser"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.retriever_adapter": [[14, 10, 1, "", "RetrieverAdapter"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.security": [[15, 9, 0, "-", "safety_checker"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.security.safety_checker": [[15, 12, 1, "", "convert_fullwidth_to_halfwidth"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d": [[18, 9, 0, "-", "util"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models": [[16, 9, 0, "-", "bfm"], [17, 9, 0, "-", "networks"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks": [[17, 12, 1, "", "resnet101"], [17, 12, 1, "", "resnet152"], [17, 12, 1, "", "resnet18"], [17, 12, 1, "", "resnet34"], [17, 12, 1, "", "resnet50"], [17, 12, 1, "", "resnext101_32x8d"], [17, 12, 1, "", "resnext50_32x4d"], [17, 12, 1, "", "wide_resnet101_2"], [17, 12, 1, "", "wide_resnet50_2"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util": [[19, 9, 0, "-", "load_mats"], [20, 9, 0, "-", "preprocess"], [21, 9, 0, "-", "util"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.preprocess": [[20, 12, 1, "", "align_img"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.util": [[21, 12, 1, "", "draw_landmarks"], [21, 12, 1, "", "mkdir"], [21, 12, 1, "", "mkdirs"]], "intel_extension_for_transformers.neural_chat.server.restful": [[22, 9, 0, "-", "openai_protocol"]], "intel_extension_for_transformers.neural_chat.server.restful.openai_protocol": [[22, 10, 1, "", "ApiErrorCode"]], "intel_extension_for_transformers.neural_chat.tools.rome": [[23, 9, 0, "-", "repr_tools"]], "intel_extension_for_transformers.neural_chat.tools.rome.repr_tools": [[23, 12, 1, "", "get_reprs_at_idxs"], [23, 12, 1, "", "get_reprs_at_word_tokens"], [23, 12, 1, "", "get_words_idxs_in_templates"]], "intel_extension_for_transformers.neural_chat.tools.rome.utils": [[24, 9, 0, "-", "nethook"], [25, 9, 0, "-", "runningstats"]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook": [[24, 14, 1, "", "StopForward"], [24, 10, 1, "", "Trace"], [24, 10, 1, "", "TraceDict"], [24, 12, 1, "", "get_module"], [24, 12, 1, "", "get_parameter"], [24, 12, 1, "", "hierarchical_subsequence"], [24, 12, 1, "", "invoke_with_optional_args"], [24, 12, 1, "", "recursive_copy"], [24, 12, 1, "", "replace_module"], [24, 12, 1, "", "set_requires_grad"], [24, 12, 1, "", "subsequence"]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats": [[25, 10, 1, "", "Bincount"], [25, 10, 1, "", "CombinedStat"], [25, 10, 1, "", "Covariance"], [25, 10, 1, "", "CrossCovariance"], [25, 10, 1, "", "CrossIoU"], [25, 10, 1, "", "FixedRandomSubsetSampler"], [25, 10, 1, "", "FixedSubsetSampler"], [25, 10, 1, "", "History"], [25, 10, 1, "", "IoU"], [25, 10, 1, "", "Mean"], [25, 10, 1, "", "NormMean"], [25, 10, 1, "", "Quantile"], [25, 10, 1, "", "SecondMoment"], [25, 10, 1, "", "Stat"], [25, 10, 1, "", "TopK"], [25, 10, 1, "", "Variance"], [25, 12, 1, "", "box_numpy_null"], [25, 10, 1, "", "cache_load_enabled"], [25, 12, 1, "", "is_null_numpy_value"], [25, 12, 1, "", "load_cached_state"], [25, 12, 1, "", "make_loader"], [25, 12, 1, "", "pull_key_prefix"], [25, 12, 1, "", "push_key_prefix"], [25, 12, 1, "", "resolve_state_dict"], [25, 12, 1, "", "sample_portion"], [25, 12, 1, "", "save_cached_state"], [25, 12, 1, "", "tally"], [25, 12, 1, "", "unbox_numpy_null"]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Bincount": [[25, 11, 1, "", "add"], [25, 11, 1, "", "load_state_dict"], [25, 11, 1, "", "state_dict"], [25, 11, 1, "", "to_"]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.CombinedStat": [[25, 11, 1, "", "add"], [25, 11, 1, "", "load_state_dict"], [25, 11, 1, "", "state_dict"], [25, 11, 1, "", "to_"]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Covariance": [[25, 11, 1, "", "add"], [25, 11, 1, "", "load_state_dict"], [25, 11, 1, "", "state_dict"], [25, 11, 1, "", "to_"]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.CrossCovariance": [[25, 11, 1, "", "add"], [25, 11, 1, "", "load_state_dict"], [25, 11, 1, "", "state_dict"], [25, 11, 1, "", "to_"]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.CrossIoU": [[25, 11, 1, "", "add"], [25, 11, 1, "", "load_state_dict"], [25, 11, 1, "", "state_dict"], [25, 11, 1, "", "to_"]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.FixedRandomSubsetSampler": [[25, 11, 1, "", "class_subset"]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.FixedSubsetSampler": [[25, 11, 1, "", "dereference"]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.History": [[25, 11, 1, "", "add"], [25, 11, 1, "", "load_state_dict"], [25, 11, 1, "", "state_dict"], [25, 11, 1, "", "to_"]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.IoU": [[25, 11, 1, "", "add"], [25, 11, 1, "", "load_state_dict"], [25, 11, 1, "", "state_dict"], [25, 11, 1, "", "to_"]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Mean": [[25, 11, 1, "", "add"], [25, 11, 1, "", "load_state_dict"], [25, 11, 1, "", "state_dict"], [25, 11, 1, "", "to_"]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.NormMean": [[25, 11, 1, "", "add"]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Quantile": [[25, 11, 1, "", "add"], [25, 11, 1, "", "load_state_dict"], [25, 11, 1, "", "normalize"], [25, 11, 1, "", "state_dict"], [25, 11, 1, "", "to_"]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.SecondMoment": [[25, 11, 1, "", "add"], [25, 11, 1, "", "load_state_dict"], [25, 11, 1, "", "state_dict"], [25, 11, 1, "", "to_"]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Stat": [[25, 11, 1, "", "add"], [25, 11, 1, "", "cpu_"], [25, 11, 1, "", "cuda_"], [25, 11, 1, "", "load"], [25, 11, 1, "", "load_state_dict"], [25, 11, 1, "", "save"], [25, 11, 1, "", "state_dict"], [25, 11, 1, "", "to_"]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.TopK": [[25, 11, 1, "", "add"], [25, 11, 1, "", "topk"]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Variance": [[25, 11, 1, "", "add"], [25, 11, 1, "", "load_state_dict"], [25, 11, 1, "", "state_dict"], [25, 11, 1, "", "to_"]], "intel_extension_for_transformers.tools": [[26, 9, 0, "-", "utils"]], "intel_extension_for_transformers.transformers": [[27, 9, 0, "-", "benchmark"], [28, 9, 0, "-", "config"], [31, 9, 0, "-", "dynamic"], [34, 9, 0, "-", "modeling"], [45, 9, 0, "-", "pipeline"], [46, 9, 0, "-", "pruner"], [48, 9, 0, "-", "quantization"], [245, 9, 0, "-", "runtime"], [246, 9, 0, "-", "trainer"], [249, 9, 0, "-", "utils"]], "intel_extension_for_transformers.transformers.benchmark": [[27, 12, 1, "", "benchmark"], [27, 12, 1, "", "get_example_inputs"], [27, 12, 1, "", "preprocess_model"], [27, 12, 1, "", "refactor_batch_size"]], "intel_extension_for_transformers.transformers.config": [[28, 10, 1, "", "BenchmarkConfig"], [28, 10, 1, "", "DynamicLengthConfig"], [28, 10, 1, "", "Provider"], [28, 10, 1, "", "PrunerV2"], [28, 10, 1, "", "WeightPruningConfig"], [28, 12, 1, "", "check_value"]], "intel_extension_for_transformers.transformers.dynamic": [[29, 9, 0, "-", "drop_and_restore_utils"], [30, 9, 0, "-", "evolution"]], "intel_extension_for_transformers.transformers.dynamic.drop_and_restore_utils": [[29, 12, 1, "", "sample_layer_configuration"], [29, 12, 1, "", "sample_length_configuration"]], "intel_extension_for_transformers.transformers.dynamic.evolution": [[30, 10, 1, "", "Evolution"], [30, 12, 1, "", "approx_ratio"], [30, 12, 1, "", "inverse"], [30, 12, 1, "", "store2str"]], "intel_extension_for_transformers.transformers.dynamic.evolution.Evolution": [[30, 11, 1, "", "add_gene"], [30, 11, 1, "", "convex_hull"], [30, 11, 1, "", "crossover"], [30, 11, 1, "", "get_store"], [30, 11, 1, "", "load_store"], [30, 11, 1, "", "mutate"], [30, 11, 1, "", "pareto_frontier"], [30, 11, 1, "", "save_population"], [30, 11, 1, "", "save_store"], [30, 11, 1, "", "set_lower_constraint"], [30, 11, 1, "", "set_upper_constraint"]], "intel_extension_for_transformers.transformers.kv_cache_compression.models": [[32, 9, 0, "-", "modeling_llama"]], "intel_extension_for_transformers.transformers.kv_cache_compression.models.modeling_llama": [[32, 10, 1, "", "LlamaAttention"], [32, 10, 1, "", "LlamaFlashAttention2"], [32, 10, 1, "", "LlamaSdpaAttention"], [32, 12, 1, "", "apply_rotary_pos_emb"]], "intel_extension_for_transformers.transformers.modeling": [[35, 9, 0, "-", "model"], [36, 9, 0, "-", "modeling_bert_dynamic"], [44, 9, 0, "-", "modeling_roberta_dynamic"]], "intel_extension_for_transformers.transformers.modeling.gpt_bigcode": [[33, 9, 0, "-", "modeling_gpt_bigcode"]], "intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode": [[33, 10, 1, "", "GPTBigCodeForCausalLM"], [33, 10, 1, "", "GPTBigCodeForSequenceClassification"], [33, 10, 1, "", "GPTBigCodeForTokenClassification"], [33, 10, 1, "", "GPTBigCodeModel"], [33, 10, 1, "", "GPTBigCodePreTrainedModel"]], "intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode.GPTBigCodeForCausalLM": [[33, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode.GPTBigCodeForSequenceClassification": [[33, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode.GPTBigCodeForTokenClassification": [[33, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.model": [[35, 10, 1, "", "OptimizedModel"]], "intel_extension_for_transformers.transformers.modeling.model.OptimizedModel": [[35, 11, 1, "", "from_pretrained"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic": [[36, 10, 1, "", "BertAttention"], [36, 10, 1, "", "BertEmbeddings"], [36, 10, 1, "", "BertEncoder"], [36, 10, 1, "", "BertForMaskedLM"], [36, 10, 1, "", "BertForMultipleChoice"], [36, 10, 1, "", "BertForNextSentencePrediction"], [36, 10, 1, "", "BertForPreTraining"], [36, 10, 1, "", "BertForPreTrainingOutput"], [36, 10, 1, "", "BertForQuestionAnswering"], [36, 10, 1, "", "BertForSequenceClassification"], [36, 10, 1, "", "BertForTokenClassification"], [36, 10, 1, "", "BertIntermediate"], [36, 10, 1, "", "BertLMHeadModel"], [36, 10, 1, "", "BertLMPredictionHead"], [36, 10, 1, "", "BertLayer"], [36, 10, 1, "", "BertModel"], [36, 10, 1, "", "BertOnlyMLMHead"], [36, 10, 1, "", "BertOnlyNSPHead"], [36, 10, 1, "", "BertOutput"], [36, 10, 1, "", "BertPooler"], [36, 10, 1, "", "BertPreTrainedModel"], [36, 10, 1, "", "BertPreTrainingHeads"], [36, 10, 1, "", "BertPredictionHeadTransform"], [36, 10, 1, "", "BertSelfAttention"], [36, 10, 1, "", "BertSelfOutput"], [36, 12, 1, "", "expand_gather"], [36, 12, 1, "", "load_tf_weights_in_bert"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertAttention": [[36, 11, 1, "", "forward"], [36, 11, 1, "", "prune_heads"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertEmbeddings": [[36, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertEncoder": [[36, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForMaskedLM": [[36, 11, 1, "", "forward"], [36, 11, 1, "", "get_output_embeddings"], [36, 11, 1, "", "prepare_inputs_for_generation"], [36, 11, 1, "", "set_output_embeddings"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForMultipleChoice": [[36, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForNextSentencePrediction": [[36, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForPreTraining": [[36, 11, 1, "", "forward"], [36, 11, 1, "", "get_output_embeddings"], [36, 11, 1, "", "set_output_embeddings"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForQuestionAnswering": [[36, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForSequenceClassification": [[36, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForTokenClassification": [[36, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertIntermediate": [[36, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertLMHeadModel": [[36, 11, 1, "", "forward"], [36, 11, 1, "", "get_output_embeddings"], [36, 11, 1, "", "prepare_inputs_for_generation"], [36, 11, 1, "", "set_output_embeddings"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertLMPredictionHead": [[36, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertLayer": [[36, 11, 1, "", "feed_forward_chunk"], [36, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertModel": [[36, 11, 1, "", "forward"], [36, 11, 1, "", "get_input_embeddings"], [36, 11, 1, "", "set_input_embeddings"], [36, 11, 1, "", "set_length_config"], [36, 11, 1, "", "set_output_attentions"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertOnlyMLMHead": [[36, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertOnlyNSPHead": [[36, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertOutput": [[36, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertPooler": [[36, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertPreTrainingHeads": [[36, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertPredictionHeadTransform": [[36, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertSelfAttention": [[36, 11, 1, "", "forward"], [36, 11, 1, "", "transpose_for_scores"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertSelfOutput": [[36, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi": [[43, 9, 0, "-", "streaming_llm"]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.bart": [[37, 9, 0, "-", "modeling_bart"]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.bart.modeling_bart": [[37, 12, 1, "", "gaudi_BartAttention_forward"], [37, 10, 1, "", "gaudi_BartLearnedPositionalEmbedding"]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.bart.modeling_bart.gaudi_BartLearnedPositionalEmbedding": [[37, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.llama": [[38, 9, 0, "-", "pos_shift_llama"]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mistral": [[39, 9, 0, "-", "modeling_mistral"]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mistral.modeling_mistral": [[39, 12, 1, "", "gaudi_mistral_repeat_kv"], [39, 12, 1, "", "gaudi_mistral_rmsnorm_forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral": [[40, 9, 0, "-", "modeling_mixtral"]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral.modeling_mixtral": [[40, 10, 1, "", "GaudiMixtralForCausalLM"], [40, 12, 1, "", "gaudi_mixtral_attention_forward"], [40, 12, 1, "", "gaudi_mixtral_block_sparse_moe_forward"], [40, 12, 1, "", "gaudi_mixtral_decoder_layer_forward"], [40, 12, 1, "", "gaudi_mixtral_model_forward"], [40, 12, 1, "", "gaudi_mixtral_repeat_kv"], [40, 12, 1, "", "gaudi_mixtral_rmsnorm_forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.phi": [[41, 9, 0, "-", "modeling_phi"]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.phi.modeling_phi": [[41, 12, 1, "", "gaudi_phi_attention_forward"], [41, 12, 1, "", "gaudi_phi_decoder_layer_forward"], [41, 12, 1, "", "gaudi_phi_model_forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.swin": [[42, 9, 0, "-", "modeling_swin"]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.swin.modeling_swin": [[42, 12, 1, "", "gaudi_swin_get_attn_mask"]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic": [[44, 10, 1, "", "RobertaAttention"], [44, 10, 1, "", "RobertaClassificationHead"], [44, 10, 1, "", "RobertaEmbeddings"], [44, 10, 1, "", "RobertaEncoder"], [44, 10, 1, "", "RobertaForCausalLM"], [44, 10, 1, "", "RobertaForMaskedLM"], [44, 10, 1, "", "RobertaForMultipleChoice"], [44, 10, 1, "", "RobertaForQuestionAnswering"], [44, 10, 1, "", "RobertaForSequenceClassification"], [44, 10, 1, "", "RobertaForTokenClassification"], [44, 10, 1, "", "RobertaIntermediate"], [44, 10, 1, "", "RobertaLMHead"], [44, 10, 1, "", "RobertaLayer"], [44, 10, 1, "", "RobertaModel"], [44, 10, 1, "", "RobertaOutput"], [44, 10, 1, "", "RobertaPooler"], [44, 10, 1, "", "RobertaPreTrainedModel"], [44, 10, 1, "", "RobertaSelfAttention"], [44, 10, 1, "", "RobertaSelfOutput"], [44, 12, 1, "", "create_position_ids_from_input_ids"], [44, 12, 1, "", "expand_gather"]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaAttention": [[44, 11, 1, "", "forward"], [44, 11, 1, "", "prune_heads"]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaClassificationHead": [[44, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaEmbeddings": [[44, 11, 1, "", "create_position_ids_from_inputs_embeds"], [44, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaEncoder": [[44, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForCausalLM": [[44, 11, 1, "", "forward"], [44, 11, 1, "", "get_output_embeddings"], [44, 11, 1, "", "prepare_inputs_for_generation"], [44, 11, 1, "", "set_output_embeddings"]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForMaskedLM": [[44, 11, 1, "", "forward"], [44, 11, 1, "", "get_output_embeddings"], [44, 11, 1, "", "set_output_embeddings"]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForMultipleChoice": [[44, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForQuestionAnswering": [[44, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForSequenceClassification": [[44, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForTokenClassification": [[44, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaIntermediate": [[44, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaLMHead": [[44, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaLayer": [[44, 11, 1, "", "feed_forward_chunk"], [44, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaModel": [[44, 11, 1, "", "forward"], [44, 11, 1, "", "get_input_embeddings"], [44, 11, 1, "", "set_input_embeddings"], [44, 11, 1, "", "set_length_config"], [44, 11, 1, "", "set_output_attentions"]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaOutput": [[44, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaPooler": [[44, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaPreTrainedModel": [[44, 11, 1, "", "update_keys_to_ignore"]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaSelfAttention": [[44, 11, 1, "", "forward"], [44, 11, 1, "", "transpose_for_scores"]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaSelfOutput": [[44, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.pipeline": [[45, 12, 1, "", "infer_framework_load_model"]], "intel_extension_for_transformers.transformers.pruner": [[47, 9, 0, "-", "pruning"]], "intel_extension_for_transformers.transformers.pruner.pruning": [[47, 10, 1, "", "Pruning"]], "intel_extension_for_transformers.transformers.pruner.pruning.Pruning": [[47, 13, 1, "", "config_file_path"], [47, 11, 1, "", "get_sparsity_ratio"], [47, 13, 1, "", "model"], [47, 11, 1, "", "on_after_eval"], [47, 11, 1, "", "on_after_optimizer_step"], [47, 11, 1, "", "on_before_eval"], [47, 11, 1, "", "on_before_optimizer_step"], [47, 11, 1, "", "on_epoch_begin"], [47, 11, 1, "", "on_epoch_end"], [47, 11, 1, "", "on_step_begin"], [47, 11, 1, "", "on_step_end"], [47, 11, 1, "", "on_train_begin"], [47, 11, 1, "", "on_train_end"], [47, 13, 1, "", "pruner_info"], [47, 13, 1, "", "pruners"], [47, 11, 1, "", "update_config"]], "intel_extension_for_transformers.transformers.runtime": [[58, 9, 0, "-", "compile"], [245, 12, 1, "", "neural_engine_bin"]], "intel_extension_for_transformers.transformers.runtime.compile": [[49, 9, 0, "-", "compile"], [51, 9, 0, "-", "extractors"], [56, 9, 0, "-", "graph"], [57, 9, 0, "-", "graph_utils"], [59, 9, 0, "-", "loaders"], [61, 9, 0, "-", "logger"], [62, 9, 0, "-", "onnx_utils"], [83, 9, 0, "-", "ops"], [128, 9, 0, "-", "optimizer"], [150, 9, 0, "-", "sub_graph"], [243, 9, 0, "-", "tf_utils"], [244, 9, 0, "-", "torch_utils"]], "intel_extension_for_transformers.transformers.runtime.compile.compile": [[49, 12, 1, "", "compile"], [49, 12, 1, "", "start_pipeline"]], "intel_extension_for_transformers.transformers.runtime.compile.extractors": [[50, 9, 0, "-", "extractor"], [52, 9, 0, "-", "onnx_extractor"], [53, 9, 0, "-", "tf_extractor"], [54, 9, 0, "-", "torch_extractor"]], "intel_extension_for_transformers.transformers.runtime.compile.extractors.extractor": [[50, 10, 1, "", "Extractor"]], "intel_extension_for_transformers.transformers.runtime.compile.extractors.onnx_extractor": [[52, 10, 1, "", "ONNXExtractor"]], "intel_extension_for_transformers.transformers.runtime.compile.extractors.tf_extractor": [[53, 10, 1, "", "TensorflowExtractor"]], "intel_extension_for_transformers.transformers.runtime.compile.extractors.torch_extractor": [[54, 10, 1, "", "TorchExtractor"]], "intel_extension_for_transformers.transformers.runtime.compile.graph": [[55, 9, 0, "-", "graph"]], "intel_extension_for_transformers.transformers.runtime.compile.graph.graph": [[55, 10, 1, "", "Graph"]], "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph": [[55, 11, 1, "", "add_config_item"], [55, 11, 1, "", "change_node_input_tensors"], [55, 11, 1, "", "change_node_output_tensors"], [55, 11, 1, "", "dump_tensor"], [55, 11, 1, "", "engine_init"], [55, 11, 1, "", "generate"], [55, 11, 1, "", "get_next_node_names"], [55, 11, 1, "", "get_node_by_name"], [55, 11, 1, "", "get_node_id"], [55, 11, 1, "", "get_pre_node_names"], [55, 11, 1, "", "get_sparse_nodes_name"], [55, 11, 1, "", "get_tensor_idx"], [55, 11, 1, "", "graph_dispatch"], [55, 11, 1, "", "graph_init"], [55, 11, 1, "", "inference"], [55, 11, 1, "", "inquire_config_item"], [55, 11, 1, "", "insert_nodes"], [55, 11, 1, "", "modify_node_connections"], [55, 11, 1, "", "remove_nodes"], [55, 11, 1, "", "rename_node"], [55, 11, 1, "", "save"], [55, 11, 1, "", "transpose_mode_int8"]], "intel_extension_for_transformers.transformers.runtime.compile.graph_utils": [[57, 10, 1, "", "LazyImport"], [57, 12, 1, "", "autocast_init"], [57, 12, 1, "", "construct_node"], [57, 12, 1, "", "del_environ_var"], [57, 12, 1, "", "del_environ_vars"], [57, 12, 1, "", "environ_info_init"], [57, 12, 1, "", "get_autocast_info"], [57, 12, 1, "", "get_data_dtype"], [57, 12, 1, "", "get_environ_info"], [57, 12, 1, "", "get_model_fwk_name"], [57, 12, 1, "", "get_quant_info"], [57, 12, 1, "", "insert_environ_info"], [57, 12, 1, "", "insert_pattern"], [57, 12, 1, "", "insert_quant_info"], [57, 12, 1, "", "list2str"], [57, 12, 1, "", "names_from_input"], [57, 12, 1, "", "pattern_mapping"], [57, 12, 1, "", "pattern_mapping_conf_validation"], [57, 12, 1, "", "quant_info_init"], [57, 12, 1, "", "remove_environ_info_item"], [57, 12, 1, "", "remove_environ_info_items"], [57, 12, 1, "", "search_pattern"], [57, 12, 1, "", "search_straight_pattern"], [57, 12, 1, "", "set_autocast"], [57, 12, 1, "", "set_environ_var"], [57, 12, 1, "", "set_environ_vars"], [57, 12, 1, "", "str2list"]], "intel_extension_for_transformers.transformers.runtime.compile.loaders": [[60, 9, 0, "-", "loader"]], "intel_extension_for_transformers.transformers.runtime.compile.loaders.loader": [[60, 10, 1, "", "Loader"]], "intel_extension_for_transformers.transformers.runtime.compile.logger": [[61, 10, 1, "", "Logger"], [61, 12, 1, "", "debug"], [61, 12, 1, "", "error"], [61, 12, 1, "", "fatal"], [61, 12, 1, "", "info"], [61, 12, 1, "", "log"], [61, 12, 1, "", "warn"], [61, 12, 1, "", "warning"]], "intel_extension_for_transformers.transformers.runtime.compile.logger.Logger": [[61, 11, 1, "", "get_logger"]], "intel_extension_for_transformers.transformers.runtime.compile.onnx_utils": [[62, 12, 1, "", "bias_to_int32"], [62, 12, 1, "", "change_num_name"], [62, 12, 1, "", "get_children"], [62, 12, 1, "", "get_initializer_children_names"], [62, 12, 1, "", "get_node_children_names"], [62, 12, 1, "", "graph_node_names_details"], [62, 12, 1, "", "is_supported_onnx_graph"], [62, 12, 1, "", "is_supported_onnx_node"], [62, 12, 1, "", "onnx_extract_operator"]], "intel_extension_for_transformers.transformers.runtime.compile.ops": [[63, 9, 0, "-", "all"], [64, 9, 0, "-", "assert"], [65, 9, 0, "-", "baddbmm"], [66, 9, 0, "-", "batch_matmul"], [67, 9, 0, "-", "batch_matmul_v2"], [68, 9, 0, "-", "bias_add"], [69, 9, 0, "-", "cast"], [70, 9, 0, "-", "concat"], [71, 9, 0, "-", "conv"], [72, 9, 0, "-", "cos"], [73, 9, 0, "-", "empty_ops"], [74, 9, 0, "-", "expand_dims"], [75, 9, 0, "-", "fused_batch_matmul_v2"], [76, 9, 0, "-", "fused_batch_norm_v3"], [77, 9, 0, "-", "fused_gemm"], [78, 9, 0, "-", "fused_matmul"], [79, 9, 0, "-", "gather"], [80, 9, 0, "-", "gather_elements"], [81, 9, 0, "-", "gelu"], [82, 9, 0, "-", "gemm"], [84, 9, 0, "-", "iterator_get_next"], [85, 9, 0, "-", "iterator_v2"], [86, 9, 0, "-", "layer_normalization"], [87, 9, 0, "-", "log_softmax"], [88, 9, 0, "-", "map_and_batch_dataset"], [89, 9, 0, "-", "matmul"], [90, 9, 0, "-", "mean"], [91, 9, 0, "-", "mkl_layer_norm"], [92, 9, 0, "-", "model_dataset"], [93, 9, 0, "-", "one_hot"], [94, 9, 0, "-", "onnx_input"], [95, 9, 0, "-", "op"], [96, 9, 0, "-", "optimize_dataset"], [97, 9, 0, "-", "pack"], [98, 9, 0, "-", "padding_sequence"], [99, 9, 0, "-", "placeholder"], [100, 9, 0, "-", "pos_embed"], [101, 9, 0, "-", "pow"], [102, 9, 0, "-", "quantize_linear"], [103, 9, 0, "-", "quantize_v2"], [104, 9, 0, "-", "quantized_fused_matmul_and_dequantize"], [105, 9, 0, "-", "quantized_matmul_with_bias_and_dequantize"], [106, 9, 0, "-", "reduce_mean"], [107, 9, 0, "-", "reduce_sum"], [108, 9, 0, "-", "reorder"], [109, 9, 0, "-", "reshape"], [110, 9, 0, "-", "resize"], [111, 9, 0, "-", "rsub"], [112, 9, 0, "-", "scatter_elements"], [113, 9, 0, "-", "shape"], [114, 9, 0, "-", "sin"], [115, 9, 0, "-", "size"], [116, 9, 0, "-", "slice_position_ids"], [117, 9, 0, "-", "softmax"], [118, 9, 0, "-", "split"], [119, 9, 0, "-", "squeeze"], [120, 9, 0, "-", "strided_slice"], [121, 9, 0, "-", "tensor"], [122, 9, 0, "-", "top_k"], [123, 9, 0, "-", "transpose"], [124, 9, 0, "-", "unpack"], [125, 9, 0, "-", "unsqueeze"], [126, 9, 0, "-", "view"], [127, 9, 0, "-", "where"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.all": [[63, 10, 1, "", "All"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.all.All": [[63, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.assert": [[64, 10, 1, "", "Assert"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.assert.Assert": [[64, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.baddbmm": [[65, 10, 1, "", "Baddbmm"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.baddbmm.Baddbmm": [[65, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul": [[66, 10, 1, "", "BatchMatMul"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul.BatchMatMul": [[66, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul_v2": [[67, 10, 1, "", "BatchMatMulV2"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul_v2.BatchMatMulV2": [[67, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.bias_add": [[68, 10, 1, "", "BiasAdd"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.bias_add.BiasAdd": [[68, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.cast": [[69, 10, 1, "", "Cast"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.cast.Cast": [[69, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.concat": [[70, 10, 1, "", "Concat"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.concat.Concat": [[70, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.conv": [[71, 10, 1, "", "Conv"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.conv.Conv": [[71, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.cos": [[72, 10, 1, "", "Cos"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.cos.Cos": [[72, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops": [[73, 10, 1, "", "Add"], [73, 10, 1, "", "AddV2"], [73, 10, 1, "", "Arange"], [73, 10, 1, "", "BinaryAdd"], [73, 10, 1, "", "Constant"], [73, 10, 1, "", "ConstantOfShape"], [73, 10, 1, "", "Convolution"], [73, 10, 1, "", "CumSum"], [73, 10, 1, "", "Dequantize"], [73, 10, 1, "", "DequantizeLinear"], [73, 10, 1, "", "Einsum"], [73, 10, 1, "", "EmbeddingBag"], [73, 10, 1, "", "Erf"], [73, 10, 1, "", "Expand"], [73, 10, 1, "", "ExpandIndices"], [73, 10, 1, "", "Fill"], [73, 10, 1, "", "FlatMapDataset"], [73, 10, 1, "", "Flatten"], [73, 10, 1, "", "Floor_divide"], [73, 10, 1, "", "Identity"], [73, 10, 1, "", "InnerProduct"], [73, 10, 1, "", "Input"], [73, 10, 1, "", "LatRange"], [73, 10, 1, "", "ListConstruct"], [73, 10, 1, "", "ListUnpack"], [73, 10, 1, "", "Loop"], [73, 10, 1, "", "MakeIterator"], [73, 10, 1, "", "Masked_fill"], [73, 10, 1, "", "MatMulWithBias"], [73, 10, 1, "", "MatMulWithBiasAdd"], [73, 10, 1, "", "MatMulWithBiasGelu"], [73, 10, 1, "", "MatMulWithBiasRelu"], [73, 10, 1, "", "MatMulWithBiasSigmoid"], [73, 10, 1, "", "MatMulWithBiasTanh"], [73, 10, 1, "", "Matmul"], [73, 10, 1, "", "Max"], [73, 10, 1, "", "MergedEmbeddingbag"], [73, 10, 1, "", "MultiHeadAttenion"], [73, 10, 1, "", "Onehot"], [73, 10, 1, "", "OpAny"], [73, 10, 1, "", "Output"], [73, 10, 1, "", "PositionIds"], [73, 10, 1, "", "QLinearAdd"], [73, 10, 1, "", "QLinearMatMul"], [73, 10, 1, "", "QLinearMul"], [73, 10, 1, "", "Range"], [73, 10, 1, "", "RealDiv"], [73, 10, 1, "", "Reciprocal"], [73, 10, 1, "", "Relu"], [73, 10, 1, "", "Repeat"], [73, 10, 1, "", "Rsqrt"], [73, 10, 1, "", "SequenceLength"], [73, 10, 1, "", "Sigmoid"], [73, 10, 1, "", "Silu"], [73, 10, 1, "", "Sqrt"], [73, 10, 1, "", "Square"], [73, 10, 1, "", "SquaredDifference"], [73, 10, 1, "", "Stack"], [73, 10, 1, "", "StopGradient"], [73, 10, 1, "", "Tanh"], [73, 10, 1, "", "TensorSliceDataset"], [73, 10, 1, "", "Tile"], [73, 10, 1, "", "TokenTypeIds"], [73, 10, 1, "", "TransposeBatchMatMul"], [73, 10, 1, "", "Zeros"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.expand_dims": [[74, 10, 1, "", "ExpandDims"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.expand_dims.ExpandDims": [[74, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_batch_norm_v3": [[76, 10, 1, "", "FusedBatchNormV3"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_batch_norm_v3.FusedBatchNormV3": [[76, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_gemm": [[77, 10, 1, "", "FusedGemm"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_gemm.FusedGemm": [[77, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_matmul": [[78, 10, 1, "", "FusedMatMul"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_matmul.FusedMatMul": [[78, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.gather": [[79, 10, 1, "", "Gather"], [79, 10, 1, "", "GatherV2"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.gather.Gather": [[79, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.gather.GatherV2": [[79, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.gather_elements": [[80, 10, 1, "", "GatherElements"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.gather_elements.GatherElements": [[80, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.gelu": [[81, 10, 1, "", "Gelu"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.gelu.Gelu": [[81, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.gemm": [[82, 10, 1, "", "Gemm"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.gemm.Gemm": [[82, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_get_next": [[84, 10, 1, "", "IteratorGetNext"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_get_next.IteratorGetNext": [[84, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_v2": [[85, 10, 1, "", "IteratorV2"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_v2.IteratorV2": [[85, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.layer_normalization": [[86, 10, 1, "", "LayerNorm"], [86, 10, 1, "", "LayerNormalization"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.layer_normalization.LayerNormalization": [[86, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.log_softmax": [[87, 10, 1, "", "LogSoftmax"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.log_softmax.LogSoftmax": [[87, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.map_and_batch_dataset": [[88, 10, 1, "", "MapAndBatchDataset"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.map_and_batch_dataset.MapAndBatchDataset": [[88, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.matmul": [[89, 10, 1, "", "MatMul"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.matmul.MatMul": [[89, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.mean": [[90, 10, 1, "", "Mean"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.mean.Mean": [[90, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.model_dataset": [[92, 10, 1, "", "ModelDataset"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.model_dataset.ModelDataset": [[92, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.one_hot": [[93, 10, 1, "", "OneHot"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.one_hot.OneHot": [[93, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.onnx_input": [[94, 10, 1, "", "ONNXINPUT"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.onnx_input.ONNXINPUT": [[94, 11, 1, "", "extract"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.op": [[95, 10, 1, "", "Operator"], [95, 12, 1, "", "operator_registry"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.op.Operator": [[95, 11, 1, "", "construct"], [95, 11, 1, "", "extract"], [95, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.optimize_dataset": [[96, 10, 1, "", "OptimizeDataset"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.optimize_dataset.OptimizeDataset": [[96, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.pack": [[97, 10, 1, "", "Pack"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.pack.Pack": [[97, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.padding_sequence": [[98, 10, 1, "", "PaddingSequence"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.padding_sequence.PaddingSequence": [[98, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.placeholder": [[99, 10, 1, "", "Placeholder"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.placeholder.Placeholder": [[99, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.pos_embed": [[100, 10, 1, "", "PackagePositionEmbedding"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.pos_embed.PackagePositionEmbedding": [[100, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.pow": [[101, 10, 1, "", "Pow"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.pow.Pow": [[101, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_linear": [[102, 10, 1, "", "Quantize"], [102, 10, 1, "", "QuantizeLinear"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_linear.Quantize": [[102, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_linear.QuantizeLinear": [[102, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_v2": [[103, 10, 1, "", "QuantizeV2"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_v2.QuantizeV2": [[103, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.quantized_matmul_with_bias_and_dequantize": [[105, 10, 1, "", "QuantizedMatMulWithBiasAndDequantize"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.quantized_matmul_with_bias_and_dequantize.QuantizedMatMulWithBiasAndDequantize": [[105, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_mean": [[106, 10, 1, "", "ReduceMean"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_mean.ReduceMean": [[106, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_sum": [[107, 10, 1, "", "ReduceSum"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_sum.ReduceSum": [[107, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.reorder": [[108, 10, 1, "", "Reorder"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.reorder.Reorder": [[108, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.reshape": [[109, 10, 1, "", "Reshape"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.reshape.Reshape": [[109, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.resize": [[110, 10, 1, "", "Resize"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.resize.Resize": [[110, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.rsub": [[111, 10, 1, "", "Rsub"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.rsub.Rsub": [[111, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.scatter_elements": [[112, 10, 1, "", "ScatterElements"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.scatter_elements.ScatterElements": [[112, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.shape": [[113, 10, 1, "", "Shape"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.shape.Shape": [[113, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.sin": [[114, 10, 1, "", "Sin"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.sin.Sin": [[114, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.size": [[115, 10, 1, "", "Size"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.size.Size": [[115, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.slice_position_ids": [[116, 10, 1, "", "SlicePositionIds"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.slice_position_ids.SlicePositionIds": [[116, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.softmax": [[117, 10, 1, "", "Softmax"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.softmax.Softmax": [[117, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.split": [[118, 10, 1, "", "Split"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.split.Split": [[118, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.squeeze": [[119, 10, 1, "", "Squeeze"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.squeeze.Squeeze": [[119, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.strided_slice": [[120, 10, 1, "", "StridedSlice"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.strided_slice.StridedSlice": [[120, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.tensor": [[121, 10, 1, "", "Tensor"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.top_k": [[122, 10, 1, "", "TopK"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.top_k.TopK": [[122, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.transpose": [[123, 10, 1, "", "Transpose"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.transpose.Transpose": [[123, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.unpack": [[124, 10, 1, "", "Unpack"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.unpack.Unpack": [[124, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.unsqueeze": [[125, 10, 1, "", "Unsqueeze"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.unsqueeze.Unsqueeze": [[125, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.view": [[126, 10, 1, "", "View"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.view.View": [[126, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.where": [[127, 10, 1, "", "Where"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.where.Where": [[127, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.optimizer": [[128, 10, 1, "", "Optimizer"]], "intel_extension_for_transformers.transformers.runtime.compile.optimizer.Optimizer": [[128, 11, 1, "", "optimize"], [128, 11, 1, "", "weight_optimization"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph": [[129, 9, 0, "-", "InnerproductReshapeFusion"], [130, 9, 0, "-", "add_cls_token"], [131, 9, 0, "-", "add_embeddings"], [132, 9, 0, "-", "arangewithreciprocal"], [133, 9, 0, "-", "attentionBlock_AttentionMaskAddReshape"], [134, 9, 0, "-", "attentionBlock_ConstantOfShapeWithMul"], [135, 9, 0, "-", "attentionBlock_QKVPreReshape"], [136, 9, 0, "-", "attentionBlock_QKVReshape"], [137, 9, 0, "-", "attentionBlock_WeightReshapeTo4D"], [138, 9, 0, "-", "attention_mask_length_adaptive_keep_indices"], [139, 9, 0, "-", "attention_output_layer_norm_length_adaptive_keep_indices"], [140, 9, 0, "-", "attention_reshape"], [141, 9, 0, "-", "cast_to"], [142, 9, 0, "-", "collect_quant_info"], [143, 9, 0, "-", "conv_reshape"], [144, 9, 0, "-", "decoder_attn_reshape"], [145, 9, 0, "-", "einsumwitharange"], [146, 9, 0, "-", "embeddingbag"], [147, 9, 0, "-", "embeddings_to_2d_before_inner_product"], [148, 9, 0, "-", "gelu"], [149, 9, 0, "-", "generate_sequence"], [151, 9, 0, "-", "innerproductwithbiasgelu"], [152, 9, 0, "-", "innerproductwithslice"], [153, 9, 0, "-", "innerproductwithswish"], [154, 9, 0, "-", "input_data"], [155, 9, 0, "-", "input_file"], [156, 9, 0, "-", "insert_bf16_node"], [157, 9, 0, "-", "insert_quant_node"], [158, 9, 0, "-", "int8_bf16_mixed_precision_checker"], [159, 9, 0, "-", "interact_features"], [160, 9, 0, "-", "last_layer_shape"], [161, 9, 0, "-", "layer_norm"], [162, 9, 0, "-", "layer_norm_with_reduce_mean"], [163, 9, 0, "-", "layer_norm_with_transpose"], [164, 9, 0, "-", "llama_embeding"], [165, 9, 0, "-", "llama_matmulwithtranspose"], [166, 9, 0, "-", "llama_postprocess"], [167, 9, 0, "-", "llama_rotary_pos_emb"], [168, 9, 0, "-", "lower_all_tuples"], [169, 9, 0, "-", "matmul_with_bias"], [170, 9, 0, "-", "matmul_with_bias_add"], [171, 9, 0, "-", "matmul_with_bias_gelu"], [172, 9, 0, "-", "matmul_with_bias_relu"], [173, 9, 0, "-", "matmul_with_bias_sigmoid"], [174, 9, 0, "-", "matmul_with_bias_tanh"], [175, 9, 0, "-", "matmul_with_bias_unsqueeze"], [176, 9, 0, "-", "matmul_with_transpose"], [177, 9, 0, "-", "matmul_with_transpose_scale_add"], [178, 9, 0, "-", "merged_embeddingbag"], [179, 9, 0, "-", "neox_reorder_change"], [180, 9, 0, "-", "neox_rotary_pos_emb"], [181, 9, 0, "-", "operator_adaptor"], [182, 9, 0, "-", "output_data"], [183, 9, 0, "-", "padding_sequence"], [184, 9, 0, "-", "pattern"], [185, 9, 0, "-", "position_embeddings"], [186, 9, 0, "-", "position_embeddings_v1"], [187, 9, 0, "-", "qkv_merge"], [188, 9, 0, "-", "qkv_reshape"], [189, 9, 0, "-", "quant_gather_to_bf16"], [190, 9, 0, "-", "quantize_fusion"], [191, 9, 0, "-", "quantized_graph_dtype_refactor"], [192, 9, 0, "-", "remove_constant_op"], [193, 9, 0, "-", "remove_last_view"], [194, 9, 0, "-", "remove_range"], [195, 9, 0, "-", "remove_unused_operator"], [196, 9, 0, "-", "remove_zeros"], [197, 9, 0, "-", "removeslice"], [198, 9, 0, "-", "reshape_after_restore_hidden_states"], [199, 9, 0, "-", "reshape_before_and_after_attention_out_layer_norm_gather_elements"], [200, 9, 0, "-", "reshape_before_restore_hidden_states"], [201, 9, 0, "-", "reshape_fusion"], [202, 9, 0, "-", "restore_hidden_states_in_length_adaptive_update_indices"], [203, 9, 0, "-", "rms_norm"], [204, 9, 0, "-", "rotary_pos_emb"], [205, 9, 0, "-", "slicemask"], [206, 9, 0, "-", "stableDiffusion_ExplicitNHWCTranspose"], [207, 9, 0, "-", "stableDiffusion_ExplicitNHWCTransposeQAT"], [208, 9, 0, "-", "stableDiffusion_MHAReshape"], [209, 9, 0, "-", "stableDiffusion_QuantizeFusion"], [210, 9, 0, "-", "stableDiffusion_ReshapeFusion"], [211, 9, 0, "-", "stableDiffusion_bf16Convert"], [212, 9, 0, "-", "stableDiffusion_collectQDQInfo"], [213, 9, 0, "-", "stableDiffusion_insertQuantNode"], [214, 9, 0, "-", "start_end_logits"], [215, 9, 0, "-", "subgraph_matcher"], [216, 9, 0, "-", "textEncdoer_word_embedding"], [217, 9, 0, "-", "textEncoder_AttentionMaskAddReshape"], [218, 9, 0, "-", "textEncoder_AttentionReshape"], [219, 9, 0, "-", "textEncoder_KVReshape"], [220, 9, 0, "-", "textEncoder_MulReshape"], [221, 9, 0, "-", "textEncoder_QReshape"], [222, 9, 0, "-", "textEncoder_SoftmaxReshape"], [223, 9, 0, "-", "textEncoder_causal_attention_mask"], [224, 9, 0, "-", "token_type_embeddings"], [225, 9, 0, "-", "token_type_embeddings_v1"], [226, 9, 0, "-", "torch_embedding"], [227, 9, 0, "-", "torch_ip_insert_bias"], [228, 9, 0, "-", "torch_unpack_baddbmm"], [229, 9, 0, "-", "torchinsertbf16node"], [230, 9, 0, "-", "torchpaddingsquence"], [231, 9, 0, "-", "transformer2Dmodel_AttentionMaskAddReshape"], [232, 9, 0, "-", "transformer2Dmodel_ConstantOfShapeWithMul"], [233, 9, 0, "-", "transformer2Dmodel_FFNSlice"], [234, 9, 0, "-", "transformer2Dmodel_FFNSlice_1"], [235, 9, 0, "-", "transformer2Dmodel_QKVPreReshape"], [236, 9, 0, "-", "transformer2Dmodel_QKVReshape"], [237, 9, 0, "-", "transformer2Dmodel_QKVReshape4D"], [238, 9, 0, "-", "transformer2Dmodel_encoderHiddenStatesReshape"], [239, 9, 0, "-", "transformer2Dmodel_getSampleBatch"], [240, 9, 0, "-", "transformer2Dmodel_sampleSlice"], [241, 9, 0, "-", "transpose_batch_matmul"], [242, 9, 0, "-", "word_embeddings"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.InnerproductReshapeFusion": [[129, 10, 1, "", "InnerproductReshapeFusion"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.add_cls_token": [[130, 10, 1, "", "AddClsToken"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.add_embeddings": [[131, 10, 1, "", "AddEmbeddings"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.arangewithreciprocal": [[132, 10, 1, "", "ArangewithReciprocal"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_AttentionMaskAddReshape": [[133, 10, 1, "", "AttentionBlock_AttentionMaskAddReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_ConstantOfShapeWithMul": [[134, 10, 1, "", "AttentionBlock_ConstantOfShapeWithMul"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_QKVPreReshape": [[135, 10, 1, "", "AttentionBlock_QKVPreReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_QKVReshape": [[136, 10, 1, "", "AttentionBlock_QKVReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_WeightReshapeTo4D": [[137, 10, 1, "", "AttentionBlock_WeightReshapeTo4D"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_mask_length_adaptive_keep_indices": [[138, 10, 1, "", "AttentionMaskLengthAdaptiveExpandIndices"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_output_layer_norm_length_adaptive_keep_indices": [[139, 10, 1, "", "AttentionOutputLayerNormLengthAdaptiveExpandIndices"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_reshape": [[140, 10, 1, "", "AttentionReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.cast_to": [[141, 10, 1, "", "CastTo"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.collect_quant_info": [[142, 10, 1, "", "CollectQuantInfo"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.conv_reshape": [[143, 10, 1, "", "ConvReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.decoder_attn_reshape": [[144, 10, 1, "", "DecoderAttnReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.einsumwitharange": [[145, 10, 1, "", "EinsumwithArange"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.embeddingbag": [[146, 10, 1, "", "EmbeddingBag"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.embeddings_to_2d_before_inner_product": [[147, 10, 1, "", "EmbeddingsTo2DBeforeInnerProduct"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.gelu": [[148, 10, 1, "", "Gelu"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.generate_sequence": [[149, 10, 1, "", "GenerateSequence"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithbiasgelu": [[151, 10, 1, "", "InnerproductWithBiasGelu"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithslice": [[152, 10, 1, "", "InnerproductwithSlice"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithswish": [[153, 10, 1, "", "InnerproductWithSwish"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.input_data": [[154, 10, 1, "", "InputData"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.input_file": [[155, 10, 1, "", "InputFile"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.insert_bf16_node": [[156, 10, 1, "", "InsertBF16Node"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.insert_quant_node": [[157, 10, 1, "", "InsertQuantNode"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.int8_bf16_mixed_precision_checker": [[158, 10, 1, "", "Int8BF16MixedPrecisionChecker"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.interact_features": [[159, 10, 1, "", "InteractFeatures"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.last_layer_shape": [[160, 10, 1, "", "LastLayerShape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm": [[161, 10, 1, "", "LayerNorm"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm_with_reduce_mean": [[162, 10, 1, "", "LayerNormWithReduceMean"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm_with_transpose": [[163, 10, 1, "", "LayerNormWithTranspose"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_embeding": [[164, 10, 1, "", "LlamaEmbeddings"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_matmulwithtranspose": [[165, 10, 1, "", "LlamaMatMulWithTranspose"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_postprocess": [[166, 10, 1, "", "LlamaPostprocess"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_rotary_pos_emb": [[167, 10, 1, "", "LlamaRoraryPosEmb"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.lower_all_tuples": [[168, 10, 1, "", "LowerAllTuples"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias": [[169, 10, 1, "", "MatMulWithBias"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_add": [[170, 10, 1, "", "MatMulWithBiasAdd"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_gelu": [[171, 10, 1, "", "MatMulWithBiasGelu"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_relu": [[172, 10, 1, "", "MatMulWithBiasRelu"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_sigmoid": [[173, 10, 1, "", "MatMulWithBiasSigmoid"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_tanh": [[174, 10, 1, "", "MatmulWithBiasTanh"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_unsqueeze": [[175, 10, 1, "", "MatMulWithBiasUnsqueeze"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_transpose": [[176, 10, 1, "", "MatMulWithTranspose"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_transpose_scale_add": [[177, 10, 1, "", "MatMulWithTranspose"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.merged_embeddingbag": [[178, 10, 1, "", "MergedEmbeddingbag"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.neox_reorder_change": [[179, 10, 1, "", "NeoxReorderChange"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.neox_rotary_pos_emb": [[180, 10, 1, "", "NeoxRoraryPosEmb"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.operator_adaptor": [[181, 10, 1, "", "OperatorAdaptor"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.output_data": [[182, 10, 1, "", "OutputData"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.padding_sequence": [[183, 10, 1, "", "PaddingSequence"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.pattern": [[184, 10, 1, "", "Pattern"], [184, 12, 1, "", "pattern_registry"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.position_embeddings": [[185, 10, 1, "", "PositionEmbeddings"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.position_embeddings_v1": [[186, 10, 1, "", "PositionEmbeddingsV1"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.qkv_merge": [[187, 10, 1, "", "QKVMerge"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.qkv_reshape": [[188, 10, 1, "", "QKVReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quant_gather_to_bf16": [[189, 10, 1, "", "TorchInsertBF16Node"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quantize_fusion": [[190, 10, 1, "", "QuantizeFusion"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quantized_graph_dtype_refactor": [[191, 10, 1, "", "QuantizedGraphDtypeCheck"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_constant_op": [[192, 10, 1, "", "RemoveConstantOP"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_last_view": [[193, 10, 1, "", "RemoveLastView"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_range": [[194, 10, 1, "", "RemoveRange"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_unused_operator": [[195, 10, 1, "", "RemoveUnusedOperator"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_zeros": [[196, 10, 1, "", "RemoveZeros"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.removeslice": [[197, 10, 1, "", "RemoveSlice"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_after_restore_hidden_states": [[198, 10, 1, "", "ReshapeAfterRestoreHiddenStates"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_before_and_after_attention_out_layer_norm_gather_elements": [[199, 10, 1, "", "ReshapeBeforeAndAfterAttentionOutLayerNormGatherElements"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_before_restore_hidden_states": [[200, 10, 1, "", "ReshapeBeforeRestoreHiddenStates"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_fusion": [[201, 10, 1, "", "ReshapeFusion"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.restore_hidden_states_in_length_adaptive_update_indices": [[202, 10, 1, "", "RestoreHiddenStatesInLengthAdaptive"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.rms_norm": [[203, 10, 1, "", "RmsNorm"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.rotary_pos_emb": [[204, 10, 1, "", "RoraryPosEmb"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.slicemask": [[205, 10, 1, "", "SliceMask"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_ExplicitNHWCTranspose": [[206, 10, 1, "", "ExplicitNHWCTransposeForConv"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_ExplicitNHWCTransposeQAT": [[207, 10, 1, "", "ExplicitNHWCTransposeForConvQAT"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_MHAReshape": [[208, 10, 1, "", "StableDiffusion_MHAReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_QuantizeFusion": [[209, 10, 1, "", "StableDiffusion_QuantizeFusion"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_ReshapeFusion": [[210, 10, 1, "", "StableDiffusion_ReshapeFusion"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_bf16Convert": [[211, 10, 1, "", "StableDiffusion_bf16Convert"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_collectQDQInfo": [[212, 10, 1, "", "StableDiffusion_CollectQuantInfo"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_insertQuantNode": [[213, 10, 1, "", "StableDiffusion_InsertQuantNode"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.start_end_logits": [[214, 10, 1, "", "StartEndLogits"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.subgraph_matcher": [[215, 10, 1, "", "SubGraphMatcher"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncdoer_word_embedding": [[216, 10, 1, "", "TextEncoder_WordEmbedding"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_AttentionMaskAddReshape": [[217, 10, 1, "", "TextEncoder_AttentionMaskAddReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_AttentionReshape": [[218, 10, 1, "", "TextEncoder_AttentionReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_KVReshape": [[219, 10, 1, "", "TextEncoder_KVReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_MulReshape": [[220, 10, 1, "", "TextEncoder_MulReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_QReshape": [[221, 10, 1, "", "TextEncoder_QReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_SoftmaxReshape": [[222, 10, 1, "", "TextEncoder_SoftmaxReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_causal_attention_mask": [[223, 10, 1, "", "TextEncoder_CasualAttentionMask"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.token_type_embeddings": [[224, 10, 1, "", "TokenTypeEmbeddings"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.token_type_embeddings_v1": [[225, 10, 1, "", "TokenTypeEmbeddingsV1"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_embedding": [[226, 10, 1, "", "TorchEmbedding"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_ip_insert_bias": [[227, 10, 1, "", "TorchInnerProductInsertBias"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_unpack_baddbmm": [[228, 10, 1, "", "TorchUnpackBaddbmm"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torchinsertbf16node": [[229, 10, 1, "", "TorchInsertBF16Node"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torchpaddingsquence": [[230, 10, 1, "", "TorchPaddingSequence"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_AttentionMaskAddReshape": [[231, 10, 1, "", "Transformer2Dmodel_AttentionMaskAddReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_ConstantOfShapeWithMul": [[232, 10, 1, "", "Transformer2Dmodel_ConstantOfShapeWithMul"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_FFNSlice": [[233, 10, 1, "", "Transformer2Dmodel_FFNInputSlice"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_FFNSlice_1": [[234, 10, 1, "", "Transformer2Dmodel_FFNInputSlice_1"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_QKVPreReshape": [[235, 10, 1, "", "Transformer2Dmodel_QKVPreReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_QKVReshape": [[236, 10, 1, "", "Transformer2Dmodel_QKVReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_QKVReshape4D": [[237, 10, 1, "", "Transformer2Dmodel_QKVReshapeTo4D"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_encoderHiddenStatesReshape": [[238, 10, 1, "", "Transformer2Dmodel_EncoderHiddenStatesReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_getSampleBatch": [[239, 10, 1, "", "Transformer2Dmodel_GetSampleBatch"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_sampleSlice": [[240, 10, 1, "", "Transformer2Dmodel_SampleSlice"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transpose_batch_matmul": [[241, 10, 1, "", "TransposeBatchMatMul"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.word_embeddings": [[242, 10, 1, "", "WordEmbeddings"]], "intel_extension_for_transformers.transformers.runtime.compile.tf_utils": [[243, 15, 1, "", "TF_DTYPE_ID"], [243, 12, 1, "", "create_tf_node"], [243, 12, 1, "", "get_tensor_dest_op"], [243, 12, 1, "", "graph_node_names_details"], [243, 12, 1, "", "tf_extract_operator"]], "intel_extension_for_transformers.transformers.runtime.compile.torch_utils": [[244, 12, 1, "", "torch_extract_operator"]], "intel_extension_for_transformers.transformers.trainer": [[246, 10, 1, "", "BaseTrainer"], [246, 10, 1, "", "NLPSeq2SeqTrainer"], [246, 10, 1, "", "NLPTrainer"]], "intel_extension_for_transformers.transformers.trainer.BaseTrainer": [[246, 11, 1, "", "benchmark"], [246, 11, 1, "", "builtin_eval_func"], [246, 11, 1, "", "builtin_train_func"], [246, 11, 1, "", "compute_loss"], [246, 11, 1, "", "distill"], [246, 11, 1, "", "export_to_bf16_onnx"], [246, 11, 1, "", "export_to_fp32_onnx"], [246, 11, 1, "", "export_to_int8_onnx"], [246, 11, 1, "", "export_to_jit"], [246, 11, 1, "", "export_to_onnx"], [246, 11, 1, "", "get_export_args"], [246, 11, 1, "", "infer_task"], [246, 11, 1, "", "orchestrate_optimizations"], [246, 11, 1, "", "prune"], [246, 11, 1, "", "quantize"], [246, 11, 1, "", "run_evolutionary_search"], [246, 11, 1, "", "set_dynamic_config"], [246, 11, 1, "", "train"], [246, 11, 1, "", "training_step"], [246, 11, 1, "", "training_step_length_adaptive"]], "intel_extension_for_transformers.transformers.trainer.NLPSeq2SeqTrainer": [[246, 11, 1, "", "builtin_eval_func"]], "intel_extension_for_transformers.transformers.utils": [[247, 9, 0, "-", "config"], [248, 9, 0, "-", "get_throughput"], [250, 9, 0, "-", "metrics"], [251, 9, 0, "-", "objectives"], [252, 9, 0, "-", "utility"]], "intel_extension_for_transformers.transformers.utils.config": [[247, 10, 1, "", "AutoRoundConfig"], [247, 10, 1, "", "AwqConfig"], [247, 10, 1, "", "DynamicQuantConfig"], [247, 10, 1, "", "GPTQConfig"], [247, 10, 1, "", "ITREXQuantizationConfigMixin"], [247, 10, 1, "", "QuantAwareTrainingConfig"], [247, 10, 1, "", "QuantizationMethod"], [247, 10, 1, "", "RtnConfig"], [247, 10, 1, "", "SmoothQuantConfig"], [247, 10, 1, "", "StaticQuantConfig"], [247, 10, 1, "", "TeqConfig"]], "intel_extension_for_transformers.transformers.utils.config.AutoRoundConfig": [[247, 11, 1, "", "to_diff_dict"]], "intel_extension_for_transformers.transformers.utils.config.AwqConfig": [[247, 11, 1, "", "to_diff_dict"]], "intel_extension_for_transformers.transformers.utils.config.GPTQConfig": [[247, 11, 1, "", "post_init_gptq"], [247, 11, 1, "", "to_diff_dict"]], "intel_extension_for_transformers.transformers.utils.config.ITREXQuantizationConfigMixin": [[247, 11, 1, "", "post_init_cpu"], [247, 11, 1, "", "post_init_runtime"], [247, 11, 1, "", "post_init_xpu"], [247, 11, 1, "", "save_pretrained"], [247, 11, 1, "", "to_json_file"], [247, 11, 1, "", "update"]], "intel_extension_for_transformers.transformers.utils.config.RtnConfig": [[247, 11, 1, "", "to_diff_dict"]], "intel_extension_for_transformers.transformers.utils.config.TeqConfig": [[247, 11, 1, "", "to_diff_dict"]], "intel_extension_for_transformers.transformers.utils.metrics": [[250, 10, 1, "", "Metric"]], "intel_extension_for_transformers.transformers.utils.objectives": [[251, 10, 1, "", "Objective"]], "intel_extension_for_transformers.transformers.utils.objectives.Objective": [[251, 11, 1, "", "modelsize"], [251, 11, 1, "", "performance"]], "intel_extension_for_transformers.transformers.utils.utility": [[252, 12, 1, "", "distributed_init"]], "models": [[255, 9, 0, "-", "backbone"], [256, 9, 0, "-", "detr"], [257, 9, 0, "-", "detr_multi"], [258, 9, 0, "-", "matcher"], [259, 9, 0, "-", "position_encoding"], [260, 9, 0, "-", "segmentation"], [261, 9, 0, "-", "transformer"]], "models.backbone": [[255, 10, 1, "", "Backbone"], [255, 10, 1, "", "FrozenBatchNorm2d"]], "models.detr": [[256, 10, 1, "", "DETR"], [256, 10, 1, "", "MLP"], [256, 10, 1, "", "PostProcess"], [256, 10, 1, "", "SetCriterion"]], "models.detr.DETR": [[256, 11, 1, "", "forward"]], "models.detr.PostProcess": [[256, 11, 1, "", "forward"]], "models.detr.SetCriterion": [[256, 11, 1, "", "forward"], [256, 11, 1, "", "loss_boxes"], [256, 11, 1, "", "loss_cardinality"], [256, 11, 1, "", "loss_labels"], [256, 11, 1, "", "loss_masks"]], "models.detr_multi": [[257, 10, 1, "", "DETRMulti"], [257, 10, 1, "", "MLP"], [257, 10, 1, "", "PostProcess"], [257, 10, 1, "", "SetCriterion"]], "models.detr_multi.DETRMulti": [[257, 11, 1, "", "forward"]], "models.detr_multi.PostProcess": [[257, 11, 1, "", "forward"]], "models.detr_multi.SetCriterion": [[257, 11, 1, "", "forward"], [257, 11, 1, "", "loss_boxes"], [257, 11, 1, "", "loss_cardinality"], [257, 11, 1, "", "loss_labels"], [257, 11, 1, "", "loss_masks"]], "models.matcher": [[258, 10, 1, "", "HungarianMatcher"]], "models.matcher.HungarianMatcher": [[258, 11, 1, "", "forward"]], "models.position_encoding": [[259, 10, 1, "", "PositionEmbeddingLearned"], [259, 10, 1, "", "PositionEmbeddingSine"]], "models.segmentation": [[260, 10, 1, "", "MHAttentionMap"], [260, 10, 1, "", "MaskHeadSmallConv"], [260, 10, 1, "", "PostProcessPanoptic"], [260, 12, 1, "", "dice_loss"], [260, 12, 1, "", "sigmoid_focal_loss"]], "models.segmentation.PostProcessPanoptic": [[260, 11, 1, "", "forward"]], "text": [[262, 12, 1, "", "text_to_sequence"]], "util": [[263, 9, 0, "-", "box_ops"], [264, 9, 0, "-", "misc"], [265, 9, 0, "-", "plot_utils"], [266, 9, 0, "-", "postprocess"]], "util.box_ops": [[263, 12, 1, "", "generalized_box_iou"], [263, 12, 1, "", "masks_to_boxes"]], "util.misc": [[264, 10, 1, "", "SmoothedValue"], [264, 12, 1, "", "accuracy"], [264, 12, 1, "", "all_gather"], [264, 12, 1, "", "interpolate"], [264, 12, 1, "", "reduce_dict"], [264, 12, 1, "", "setup_for_distributed"]], "util.misc.SmoothedValue": [[264, 11, 1, "", "synchronize_between_processes"]], "util.plot_utils": [[265, 12, 1, "", "plot_logs"]], "util.postprocess": [[266, 12, 1, "", "align_columns"], [266, 12, 1, "", "align_headers"], [266, 12, 1, "", "align_rows"], [266, 12, 1, "", "align_supercells"], [266, 12, 1, "", "apply_class_thresholds"], [266, 12, 1, "", "apply_threshold"], [266, 12, 1, "", "extract_text_from_spans"], [266, 12, 1, "", "extract_text_inside_bbox"], [266, 12, 1, "", "get_bbox_span_subset"], [266, 12, 1, "", "header_supercell_tree"], [266, 12, 1, "", "iob"], [266, 12, 1, "", "iou"], [266, 12, 1, "", "nms"], [266, 12, 1, "", "nms_by_containment"], [266, 12, 1, "", "nms_supercells"], [266, 12, 1, "", "objects_to_cells"], [266, 12, 1, "", "objects_to_table_structures"], [266, 12, 1, "", "overlaps"], [266, 12, 1, "", "refine_columns"], [266, 12, 1, "", "refine_rows"], [266, 12, 1, "", "refine_table_structures"], [266, 12, 1, "", "remove_objects_without_content"], [266, 12, 1, "", "remove_supercell_overlap"], [266, 12, 1, "", "slot_into_containers"], [266, 12, 1, "", "sort_objects_by_score"], [266, 12, 1, "", "sort_objects_left_to_right"], [266, 12, 1, "", "sort_objects_top_to_bottom"], [266, 12, 1, "", "table_structure_to_cells"]], "utils": [[267, 9, 0, "-", "data_utils"], [268, 9, 0, "-", "eval_utils"]], "utils.data_utils": [[267, 12, 1, "", "get_multi_choice_info"], [267, 12, 1, "", "save_jsonl"]], "utils.eval_utils": [[268, 12, 1, "", "calculate_ins_level_acc"], [268, 12, 1, "", "check_is_number"], [268, 12, 1, "", "eval_multi_choice"], [268, 12, 1, "", "eval_open"], [268, 12, 1, "", "evaluate"], [268, 12, 1, "", "extract_numbers"], [268, 12, 1, "", "normalize_str"], [268, 12, 1, "", "parse_multi_choice_response"], [268, 12, 1, "", "parse_open_response"]]}, "objnames": {"0": ["c", "macro", "C macro"], "1": ["cpp", "type", "C++ type"], "2": ["cpp", "enumerator", "C++ enumerator"], "3": ["cpp", "class", "C++ class"], "4": ["cpp", "function", "C++ function"], "5": ["cpp", "functionParam", "C++ function parameter"], "6": ["cpp", "enum", "C++ enum"], "7": ["cpp", "member", "C++ member"], "8": ["cpp", "templateParam", "C++ template parameter"], "9": ["py", "module", "Python module"], "10": ["py", "class", "Python class"], "11": ["py", "method", "Python method"], "12": ["py", "function", "Python function"], "13": ["py", "attribute", "Python attribute"], "14": ["py", "exception", "Python exception"], "15": ["py", "data", "Python data"]}, "objtypes": {"0": "c:macro", "1": "cpp:type", "2": "cpp:enumerator", "3": "cpp:class", "4": "cpp:function", "5": "cpp:functionParam", "6": "cpp:enum", "7": "cpp:member", "8": "cpp:templateParam", "9": "py:module", "10": "py:class", "11": "py:method", "12": "py:function", "13": "py:attribute", "14": "py:exception", "15": "py:data"}, "terms": {"": [22, 24, 25, 28, 44, 57, 62, 95, 147, 243, 246, 247, 256, 257, 260, 265, 269, 270, 272, 279, 298, 302, 303, 309, 313, 314, 316, 319, 321, 323, 324, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 335, 336, 338, 339, 340, 341, 342, 343, 344, 345, 346, 348, 349, 354, 355, 356, 361, 363, 369, 370, 372, 376, 378, 379, 380, 381, 382, 383, 384, 387, 388, 389, 391, 392, 394, 396, 397, 402, 406, 408, 411, 413, 414, 418, 420, 421, 423, 425, 426], "0": [9, 20, 21, 24, 25, 28, 30, 33, 36, 37, 44, 55, 57, 243, 247, 250, 252, 256, 257, 260, 265, 266, 278, 279, 281, 289, 302, 303, 306, 308, 309, 313, 314, 315, 316, 321, 322, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 346, 347, 348, 349, 351, 352, 353, 354, 355, 358, 359, 361, 362, 363, 365, 366, 369, 371, 372, 375, 376, 383, 384, 385, 387, 388, 389, 390, 391, 393, 394, 395, 397, 401, 402, 403, 404, 405, 409, 410, 411, 412, 413, 415, 416, 418, 419, 421, 423, 425, 426, 427, 428, 432], "00": [288, 361, 366, 389, 425], "000": [25, 389], "0003575115963544682": 385, "00035751489124038457": 385, "00163713": 411, "00164658": 411, "00171023": 411, "00179382": 411, "00180316": 411, "00198061": 411, "00203027": 411, "00216633": 411, "00217889": 411, "00223598": 411, "00226557": 411, "00235812": 411, "00241": 411, "00243581": 411, "00245821": 411, "0025": 432, "00252331": 411, "00261406": 411, "00265488": 411, "00269113": 411, "00270954": 411, "00289114": 411, "00291005": 411, "00292684": 411, "00293671": 411, "0029515": 411, "00297233": 411, "00297784": 411, "003": [289, 314, 349], "00308582": 411, "00310676": 411, "00315343": 411, "0031551": 411, "00317296": 411, "00317463": 411, "00332212": 411, "00338962": 411, "00340452": 411, "00341811": 411, "00343822": 411, "00344785": 411, "0034657": 411, "00348609": 411, "00350486": 411, "00367406": 411, "00368479": 411, "00385131": 411, "00389863": 411, "00393276": 411, "00393589": 411, "00394876": 411, "00395783": 411, "00396819": 411, "0040612": 411, "00407149": 411, "00418212": 411, "0042401": 411, "00425647": 411, "00437142": 411, "00443796": 411, "00447544": 411, "00448956": 411, "00449335": 411, "00451855": 411, "00464705": 411, "00466269": 411, "00480098": 411, "00481074": 411, "00483104": 411, "00484669": 411, "00488058": 411, "00493861": 411, "004m": 389, "00500361": 411, "00502113": 411, "00502473": 411, "00503604": 411, "00514768": 411, "0051719": 411, "00517637": 411, "00526697": 411, "00535855": 411, "00542595": 411, "00548478": 411, "0054935": 411, "00555918": 411, "00561026": 411, "00565044": 411, "00570175": 411, "00570293": 411, "00578904": 411, "00579899": 411, "0058452": 411, "00584761": 411, "00593063": 411, "00609695": 411, "00633179": 411, "00643591": 411, "00651108": 411, "00653312": 411, "0065352": 411, "00655363": 411, "00655654": 411, "00656544": 411, "00657187": 411, "00659512": 411, "00667871": 411, "00672351": 411, "00677631": 411, "00693265": 411, "00698123": 411, "00701343": 411, "00716987": 411, "00727645": 411, "00731429": 411, "00741956": 411, "00744553": 411, "0074474": 411, "00745008": 411, "00749636": 411, "00755406": 411, "00759056": 411, "00760217": 411, "00761117": 411, "00764146": 411, "00781277": 411, "00785878": 411, "00794258": 411, "00811779": 411, "00821985": 411, "00826017": 411, "00828943": 411, "00835933": 411, "00850778": 411, "00860835": 411, "00869796": 411, "00879332": 411, "00893304": 411, "00896329": 411, "00897444": 411, "0090376": 411, "00908195": 411, "00910648": 411, "00914975": 411, "00920252": 411, "00921101": 411, "00923343": 411, "00925277": 411, "0092883": 411, "009382": 411, "00940157": 411, "00940534": 411, "00947462": 411, "00959948": 411, "00978": 432, "00979113": 411, "00980134": 411, "00992419": 411, "00e": 425, "00x": 425, "01": [250, 288, 306, 361, 371, 411, 416, 423, 425], "0101487": 411, "010269": 411, "0103377": 411, "0103961": 411, "0104209": 411, "0105324": 411, "010552": 411, "0105865": 411, "0106293": 411, "0107115": 411, "0107712": 411, "0109527": 411, "0109669": 411, "0109927": 411, "0110537": 411, "0111132": 411, "0112255": 411, "0114194": 411, "011443": 411, "0116008": 411, "0116365": 411, "0116466": 411, "0116589": 411, "011705": 411, "0117535": 411, "011932": 411, "0119455": 411, "0120042": 411, "0120525": 411, "012078": 411, "0120946": 411, "0123966": 411, "0125696": 411, "0126225": 411, "0127448": 411, "0127799": 411, "0128144": 411, "0129116": 411, "0129936": 411, "013": 397, "0130778": 411, "0131335": 411, "0131446": 411, "0132428": 411, "0132869": 411, "0134367": 411, "013504": 411, "0135348": 411, "0135801": 411, "0137027": 411, "0137122": 411, "013742": 411, "0137691": 411, "0139037": 411, "0140129": 411, "0142343": 411, "0142667": 411, "0143274": 411, "0144483": 411, "0145757": 411, "0147718": 411, "0147951": 411, "0148329": 411, "0149058": 411, "015": 397, "0150624": 411, "0150693": 411, "0152068": 411, "0152199": 411, "0152997": 411, "0154121": 411, "0158702": 411, "0158714": 411, "0158773": 411, "0158951": 411, "016": [397, 411], "0161277": 411, "0161691": 411, "0161696": 411, "016186": 411, "0164591": 411, "0164699": 411, "0166254": 411, "0166666": 411, "0167419": 411, "0168147": 411, "0168219": 411, "0168348": 411, "016901": 411, "0169214": 411, "0170105": 411, "0170807": 411, "0170987": 411, "0171018": 411, "0176505": 411, "0177431": 411, "0177477": 411, "0177873": 411, "0179766": 411, "0180933": 411, "018228": 411, "0183481": 411, "0183895": 411, "0184267": 411, "0184384": 411, "018464": 411, "0187415": 411, "0192313": 411, "0192409": 411, "0192593": 411, "0192628": 411, "0193516": 411, "0193761": 411, "01_quickstart_neuralchat": 322, "01x": 425, "02": [288, 376, 411, 416, 425], "02002": 260, "0200457": 411, "0203923": 411, "0204832": 411, "0206321": 411, "0207462": 411, "0207504": 411, "0207815": 411, "0207876": 411, "0208901": 411, "021": 397, "0210726": 411, "0211151": 411, "0211298": 411, "0213786": 411, "0215163": 411, "0217062": 411, "0217468": 411, "0217822": 411, "0218703": 411, "0218969": 411, "02197": 411, "0220014": 411, "0221319": 411, "0222103": 411, "0222947": 411, "0223472": 411, "0224431": 411, "0231199": 411, "0231282": 411, "023182": 411, "0231979": 411, "0232584": 411, "0234498": 411, "0240415": 411, "024706": 411, "0247063": 411, "0248571": 411, "0249397": 411, "025032": 411, "0250395": 411, "0252901": 411, "0256871": 411, "0257188": 411, "0257262": 411, "0258341": 411, "0258802": 411, "0260486": 411, "0261888": 411, "0262706": 411, "0263137": 411, "0265272": 411, "0266731": 411, "0266886": 411, "0267483": 411, "0268136": 411, "0269904": 411, "0270028": 411, "027025": 411, "0270492": 411, "0274874": 411, "0275282": 411, "027535": 411, "0275467": 411, "0275881": 411, "0276086": 411, "028": [389, 411], "028166": 411, "028483": 411, "028568": 411, "0289719": 411, "0291396": 411, "0292454": 411, "0295362": 411, "0296385": 411, "02x": 425, "03": [288, 314, 349, 361, 411, 425], "0302293": 411, "0302746": 411, "0309886": 411, "0310083": 411, "031279": 411, "0314895": 411, "0317559": 411, "0317602": 411, "0318745": 411, "0319455": 411, "0321109": 411, "0321377": 411, "0323642": 411, "0325741": 411, "0326952": 411, "0329699": 411, "033": 397, "0333436": 411, "0336342": 411, "0340362": 411, "0341169": 411, "0341912": 411, "0342908": 411, "0345669": 411, "0346142": 411, "03474": 411, "0348388": 411, "0354192": 411, "0357023": 411, "0358603": 411, "0358752": 411, "03588": 411, "0363329": 411, "0364227": 411, "0365834": 411, "0366748": 411, "0367258": 411, "036978": 411, "036992": 411, "037": 397, "037334": 411, "0373579": 411, "0373802": 411, "0373823": 411, "0374397": 411, "0375": 389, "0375093": 411, "0375683": 411, "0376119": 411, "03762": 36, "0376949": 411, "0381385": 411, "03849": 411, "0387886": 411, "0389357": 411, "039": 397, "03923": 411, "0394101": 411, "039411": 411, "0395342": 411, "0397992": 411, "04": [288, 302, 304, 313, 314, 315, 316, 317, 318, 349, 366, 397, 411, 425], "0401657": 411, "0402931": 411, "0404778": 411, "0407051": 411, "0411331": 411, "0414047": 411, "0414834": 411, "0416614": 411, "0417964": 411, "0421644": 411, "042188": 411, "0423267": 411, "0426942": 411, "0427839": 411, "0428737": 411, "0429436": 411, "0429916": 411, "043787": 411, "044": 389, "044154": 411, "044202": 411, "0444861": 411, "0445693": 411, "0447282": 411, "0447548": 411, "044m": 389, "0451228": 411, "0454416": 411, "0454583": 411, "0455066": 411, "0458481": 411, "0459135": 411, "046": 411, "0460811": 411, "046201": 411, "0465882": 411, "0467291": 411, "0467462": 411, "0467998": 411, "0473412": 411, "0475549": 411, "0476463": 411, "0483781": 411, "0484067": 411, "0487342": 411, "04874": 411, "0487727": 411, "0489938": 411, "0490096": 411, "0496581": 411, "0497077": 411, "05": [266, 288, 346, 347, 397, 411, 425], "050021": 411, "0510217": 411, "0514668": 411, "0516788": 411, "0521326": 411, "0521595": 411, "0521945": 411, "0524509": 411, "0526609": 411, "053": 397, "0530097": 411, "0532543": 411, "0533513": 411, "053639": 411, "0537321": 411, "0537768": 411, "0538146": 411, "0538395": 411, "0539197": 411, "0543977": 411, "0549107": 411, "05516": 432, "0553082": 411, "0556653": 411, "0558945": 411, "0560297": 411, "0574189": 411, "0580473": 411, "0588583": 411, "0589148": 411, "0591283": 411, "0592912": 411, "0595001": 411, "0596004": 411, "059613": 411, "0596185": 411, "0597882": 411, "06": [288, 411, 425], "0600772": 411, "0603517": 411, "0603789": 411, "0604759": 411, "0609618": 411, "0609701": 411, "0610684": 411, "0612457": 411, "061272": 411, "0613803": 411, "0614806": 411, "0616695": 411, "0616923": 411, "062": 411, "0620034": 411, "0622484": 411, "0624729": 411, "0625579": 411, "0626013": 411, "063": 377, "0633017": 411, "0637226": 411, "0640577": 411, "0642402": 411, "0651551": 411, "0656322": 411, "066": 397, "0660571": 411, "06648": 411, "0665519": 411, "0668515": 411, "0677547": 411, "0677766": 411, "068": 411, "0687866": 411, "068835": 411, "069": 411, "0692752": 411, "0698868": 411, "06x": 425, "07": [288, 346, 397, 411, 425], "0700283": 411, "07006": 411, "0701429": 411, "0710327": 411, "0712915": 411, "0713578": 411, "0713821": 411, "0714324": 411, "0716356": 411, "0717247": 411, "0721208": 411, "0723144": 411, "0725632": 411, "0728843": 411, "0736189": 411, "0739962": 411, "074": 411, "0740655": 411, "0747271": 411, "075": 389, "0759107": 411, "076": [397, 411], "0760123": 411, "0765083": 411, "0765841": 411, "0771592": 411, "0780751": 411, "078109": 411, "0781101": 411, "0784417": 411, "0796627": 411, "08": [288, 361, 411, 425], "080936": 411, "0811198": 411, "0813271": 411, "0819725": 411, "0822007": 411, "0825026": 411, "0825665": 411, "0832193": 411, "0835321": 411, "0836219": 411, "0840322": 411, "0843776": 411, "0845544": 411, "0849766": 411, "085": 411, "0852": 411, "0854403": 411, "0854876": 411, "0855686": 411, "0870121": 411, "0873881": 411, "0876727": 411, "0879386": 411, "08794": 411, "0881114": 411, "0893092": 411, "0893345": 411, "08991": 411, "0899513": 411, "09": [288, 346, 411, 425, 426], "091": 397, "0922471": 411, "0923655": 411, "0933483": 411, "0933565": 411, "0938959": 411, "0943305": 411, "0946983": 411, "0948318": 411, "09557": 432, "0955952": 411, "0958787": 411, "096": 397, "0961662": 411, "09719": 411, "097692": 411, "0977256": 411, "0994565": 411, "0995304": 411, "0999998": 411, "0a0": [361, 432], "0e": [314, 349], "0f": 402, "0m": 389, "0x10": 410, "0x100": 410, "0x14": 410, "0x140": 410, "0x18": 410, "0x180": 410, "0x1c": 410, "0x1c0": 410, "0x20": 410, "0x200": 410, "0x24": 410, "0x240": 410, "0x28": 410, "0x280": 410, "0x2b0001b0": [425, 426], "0x2c": 410, "0x2c0": 410, "0x30": 410, "0x34": 410, "0x38": 410, "0x3c": 410, "0x4": 410, "0x40": 410, "0x400": 410, "0x8": 410, "0x80": 410, "0xc": 410, "0xc0": 410, "0xd000331": [397, 411], "1": [9, 14, 25, 27, 28, 32, 33, 36, 38, 39, 40, 44, 57, 246, 247, 252, 256, 257, 258, 260, 264, 266, 269, 281, 289, 298, 300, 303, 304, 305, 306, 308, 309, 313, 316, 319, 320, 321, 324, 326, 327, 328, 329, 330, 332, 334, 336, 337, 338, 340, 343, 344, 345, 350, 359, 363, 364, 365, 366, 369, 371, 372, 373, 375, 383, 384, 386, 387, 390, 391, 392, 395, 396, 397, 399, 401, 402, 403, 404, 405, 406, 408, 409, 410, 411, 413, 416, 418, 419, 421, 422, 423, 426, 428, 429, 432], "10": [288, 302, 308, 309, 314, 322, 346, 347, 349, 354, 361, 376, 388, 389, 397, 403, 411, 413, 425, 426], "100": [25, 33, 36, 44, 246, 247, 302, 314, 346, 347, 348, 349, 352, 354, 361, 376, 413, 422, 423, 425, 428, 429, 432], "1000": [346, 347], "10000": 259, "10004": [304, 305, 432], "1001": 411, "1002": 425, "1004": 411, "100424": 411, "10045": 425, "10049": 411, "1006": 411, "1007": 425, "10072": 397, "1008": 411, "101": [17, 255, 410], "101071": 411, "10117": 411, "1012": 411, "101206": 411, "10127": 411, "101434": 411, "1015": 411, "10159": 411, "1018": 411, "101844": 411, "1019": 411, "102": 20, "1020": 411, "1021": [397, 411], "102244": 411, "10231": 425, "1024": [17, 25, 346, 347, 372, 388, 389, 390, 411, 413, 425], "1024x256": 389, "1025": 411, "10259": 411, "1027": 411, "10270": 411, "10272": 411, "103": [309, 361, 366, 425], "103035": 411, "103083": 411, "103125": 411, "103126": 411, "1032": 411, "103379": 411, "103385": 411, "10370": 425, "10372": 411, "103927": 411, "104": [304, 425], "104267837": 319, "10428": 411, "104294": 411, "1043": 411, "1046": 411, "1047": 411, "10474": 411, "1048": 411, "10488": 425, "105": 425, "1050": [411, 425], "1051": 411, "105192": 411, "1053": 411, "1056": 411, "105656": 411, "10566": 425, "1057": 411, "1058": 411, "105849": 411, "106": [397, 411, 425], "1060": 425, "106089": 411, "1062": 425, "10621": 425, "10672": 411, "107": [410, 425], "1070": 411, "10703": 411, "10713": 425, "1072": 411, "10742": 411, "107514": 411, "1076": 411, "10763": 411, "108": 425, "1081": 397, "1082": 411, "1083": 411, "1085": 411, "1086": 397, "10860": 411, "1087": 411, "108718": 411, "1088": 411, "108899": 411, "109": 425, "1091": 397, "10917": 411, "1092": 411, "109308": 411, "1094": 411, "10940": 425, "10944": 432, "10947": 411, "1095": 411, "1096": 411, "10962": 411, "1097": 411, "1098": 411, "1099": 425, "10999": 411, "10e": 410, "10k": [247, 288, 425, 428], "10m": 397, "10x": 425, "11": [302, 304, 308, 340, 347, 361, 364, 365, 393, 403, 411, 425, 426, 427], "110": 425, "1100": 411, "11009": 425, "1102": [397, 411], "1103": 411, "11059": 411, "1106": [397, 411], "11064": 411, "1108": 411, "111": 425, "11116": 411, "111186": 411, "111211": 411, "1113": 411, "1114": 411, "1115": 411, "11180": 411, "112": [397, 411, 425], "1120": 411, "1123": 411, "1124": 411, "1125": 411, "1126": 397, "1128": 411, "112882": 411, "113": [397, 425], "1130": 397, "113174": 411, "1132": [411, 425], "11320": 411, "11322": 411, "11323": 397, "11327": 425, "1136": 425, "11368": 411, "1137": 397, "1138": 411, "11386": 425, "114": 410, "1140": 411, "11401": 411, "1142": 411, "1143": 411, "1144": 411, "11444": 411, "1145": 411, "11458": 411, "1147": 411, "11476": 411, "11484": 411, "115": [304, 425], "11503": 411, "1154": 411, "1156": 411, "1159": 411, "116": [389, 411, 425], "1160": 411, "116019": 411, "1162": 411, "11624": 411, "1163": 411, "11660": 425, "116701": 411, "11684": 411, "1169": 411, "117": [397, 425], "11707": 411, "11737": 411, "11741": 425, "1176": 411, "11793": 411, "118": [410, 425], "1184": 411, "118402": 411, "118429": 411, "1185": 411, "11860": 411, "11868": 425, "1188": 411, "119": [397, 411, 425], "11914": 425, "1192": [411, 425], "11943": 425, "11950": 411, "1196": 411, "119678": 411, "11970": 425, "1199": [411, 425], "119951": 411, "11a": 410, "12": [9, 30, 288, 308, 314, 332, 337, 349, 361, 386, 389, 397, 403, 407, 410, 411, 413, 425], "120": [410, 425], "1202": 425, "1203": 411, "12058": 425, "1207": 411, "12086": 411, "121": 425, "1210": 411, "12102": 397, "12104": 425, "1213": 411, "12147": 425, "1215": 411, "1218": 411, "1219": 411, "12190": 425, "122": 411, "1220": 397, "1224": 425, "122421": 411, "1226": 411, "12261": 411, "1228": 411, "1230": 397, "1232": 425, "1234": 371, "123429": 411, "12345": 252, "1235": 411, "123554": 411, "123585": 411, "1236": 411, "124": 397, "124072": 411, "1242": [411, 425], "124238": 411, "1244": 411, "1247": 411, "124749": 411, "124m": 428, "1250": 411, "125018112": 388, "1251": 425, "12526": 411, "1253": 411, "125344": 411, "12535": 397, "12537": 425, "12541": 425, "12548": 411, "12567": 411, "1257": 411, "125772": 411, "125m": [304, 428], "126545": 411, "126819": 411, "1269": 411, "127": [252, 309, 313, 314, 315, 316, 324, 326, 327, 328, 329, 334, 336, 337, 338, 340, 343, 344, 353, 361, 375, 389, 410, 411, 423, 425], "12702": 425, "1271": 411, "1273": 411, "1278": 397, "12788": 425, "128": [247, 302, 352, 388, 389, 393, 396, 397, 411, 413, 423, 425], "1280": [411, 413], "1281": 411, "1286": 411, "1287": 411, "1288": 411, "129": [411, 425], "1291": 411, "1292": 411, "1293": 397, "129767": 411, "1298": 411, "129806": 411, "12d": 410, "12k": [346, 347, 352], "12xlarg": [397, 411], "13": [288, 308, 349, 351, 361, 372, 397, 403, 411, 425, 426], "13001": 425, "1302": 411, "13031": 425, "1304": 397, "13064": 425, "1307": 411, "130834": 411, "130863": 411, "131": 397, "1310": 425, "13129": 397, "1313": 411, "13142": 425, "1315": [411, 425], "13154": 397, "1316": 425, "1319": 397, "132": 411, "1320": 411, "132552": 411, "1328969a": 319, "1329": 411, "133": 410, "1330": 425, "133295": 411, "1334": 411, "133647": 411, "1337": 411, "13381": 397, "134": 425, "1342": [397, 411], "134442": 411, "1345": 425, "1346": 411, "13466": 411, "1347": [397, 411], "134716": 411, "135054": 411, "13524": 425, "13529": 425, "135495": 411, "135532": 411, "13582": 425, "135839": 411, "13586": 425, "135864": 411, "1359": 425, "136": [279, 385], "13616": 425, "13621": 425, "13638": 425, "13639": 425, "13650": 425, "13674": 425, "13675": 425, "13686": 425, "137": 425, "13703": 425, "1371": 411, "13717": 425, "137361": 411, "138": 389, "1381": 411, "1382": 411, "13825": 425, "1383": 425, "1384": 411, "1385": 411, "1386": 411, "1387": 411, "13871": 425, "1388": 411, "139": 410, "139021": 411, "1392": 411, "139298": 411, "1393": 425, "1394": 411, "1397": 397, "13990": 397, "13b": [288, 323, 332, 346, 347, 351, 352, 428], "13k": 425, "14": [246, 288, 305, 350, 397, 403, 410, 411, 425], "140": [410, 425], "1403": 425, "1407": 411, "1408": 425, "1409": 397, "141": 397, "141097": 411, "1412": 411, "14124194128933833351": 390, "1413": 411, "141333": 411, "1414": 411, "1415": 411, "1417": 411, "141966": 411, "142": [304, 411, 425], "1422": 411, "1425": 411, "1426": [411, 425], "14263": 425, "1427": 411, "142778": 411, "143": 397, "1430": 411, "1435": 411, "1436": 411, "1437": 425, "144": 425, "1440": 411, "1441": [411, 425], "144231": 411, "1443": 411, "1444": 411, "1446": 411, "1449": 411, "1450": [411, 425], "145322": 411, "1456": 411, "1457": 411, "145836": 411, "1459": 411, "146": [410, 425], "1461": 411, "1464": 411, "146452": 411, "1465": 411, "146781": 411, "146935": 411, "147": 425, "1470": 411, "14737": 425, "1474": 397, "147474": 411, "1476": 411, "1478": 411, "148115": 411, "148369": 411, "1484": 397, "148512": 411, "1487": [397, 411], "14896": 425, "14905": 411, "1492": 411, "1495": 411, "1498": 411, "14993": 425, "14c": 410, "15": [38, 288, 369, 397, 403, 404, 409, 411, 425], "1501": 411, "150549": 411, "1506": 411, "1508": 411, "150k": 350, "1513": 411, "151649": 411, "15180": 425, "152": [17, 410, 425], "1523": 411, "1526": 411, "1527": 411, "15278": 425, "152848": 411, "152925": 411, "153086": 411, "1531": 411, "1534": 411, "1536": 347, "1539": 411, "154": 425, "1540": 411, "1544268": 361, "1545": 411, "15460": 397, "15462": 425, "1547": 411, "1549": 411, "155": 411, "15506": 425, "15525": 411, "1559": 411, "156168": 411, "156368": 411, "1565": 411, "157": 425, "157349": 411, "15748": 411, "157518": 411, "1578": 411, "1579": 411, "158": 425, "1581": 397, "158162": 411, "15834": 425, "1585": 411, "158502": 411, "158668": 411, "1589": 411, "159": [304, 410], "1594": 397, "159566": 411, "159911": 411, "16": [281, 288, 289, 304, 305, 314, 346, 347, 348, 349, 361, 388, 397, 403, 404, 405, 406, 409, 410, 411, 413, 423, 425], "160": [397, 410], "16004": 397, "1601": 411, "1602": 411, "160705": 411, "1609": 411, "161251": 411, "161443": 411, "1617": 411, "162": 411, "1622": 425, "1624": 411, "1627": 397, "163": 411, "163369": 411, "1637": 411, "1650": 425, "165192": 411, "165648": 411, "1658": 425, "1659": 397, "16591": 425, "166": [397, 425], "166153": 411, "1662": 411, "167": [410, 411, 425], "1671": [397, 411], "167473": 411, "167575": 411, "16771": 397, "168": [330, 332], "1680": 425, "16901": 425, "169119": 411, "1696": 411, "1698": 411, "169874": 411, "1699": 425, "16e": 410, "16gb": 325, "16x1": [403, 407], "16x16": 407, "16x16gb": [425, 426], "16x32": 403, "16x32x16": 407, "16x4": 409, "16xn": 405, "16xpad_n": 405, "17": [288, 317, 346, 347, 363, 389, 397, 403, 411, 425], "170": 425, "1702": 425, "1703": 425, "1706": [36, 397, 411], "1708": 260, "1710750809": 361, "1712": 411, "171434": 411, "17178": 425, "1719": 397, "172": 425, "172356": 411, "17245": 425, "17281": 425, "173": 425, "17323": 432, "17364": 411, "174": 410, "174091": 411, "174101": 411, "174215": 411, "1743": 411, "17436": 411, "17454": 425, "17468": 397, "1747": 411, "17496": 425, "175": [314, 349, 425], "1758": 425, "17585": 425, "17598": 425, "1760": 425, "176031": 411, "1762": 411, "176292": 411, "1763": 425, "176b": [272, 302], "177": 411, "17764": 411, "1777": 397, "178": 425, "178324": 411, "1786": 411, "1787": 411, "179": 397, "1792": 411, "1793": 425, "1795": 411, "179525": 411, "179593": 411, "179695": 411, "1797": 411, "17a": 410, "18": [17, 255, 288, 361, 397, 403, 411, 425], "180": 410, "1801": 411, "1804": 411, "1805": 411, "180921": 411, "181": 397, "18119": 425, "1813": 411, "1816": 411, "181783": 411, "182": 411, "1823": 411, "1825": 411, "1826": 411, "1828": 411, "1829": 411, "183": 425, "183003": 411, "183193": 411, "18324": 411, "18336": 425, "184": 411, "184256": 411, "184412": 411, "185": 389, "1850": 411, "1851": 411, "1857": 411, "18575": 397, "1858": 411, "186": 425, "18672": 425, "1869": 411, "187": [397, 410], "18708": 425, "1872": 397, "187933": 411, "188": 397, "1881": 411, "18824": 425, "1885": 411, "18868": 425, "188745": 411, "18876": 425, "1889": [377, 411], "18939": 425, "1895": 411, "1899": 411, "18d": 410, "19": [288, 397, 403, 411, 425, 426], "1904": 411, "190508": 411, "191": 361, "1910": 397, "1913": 411, "191564": 411, "1918": 411, "1919": 411, "192": [330, 332, 425], "1920": 411, "1924": 411, "193": [410, 425], "1930": 411, "193579": 411, "1936": 411, "193713": 411, "1938": 397, "1942": 411, "19463": 411, "195": [289, 425], "1952": 411, "195271": 411, "19536": 411, "1956": 411, "1964": 411, "197": [397, 411], "1971": 411, "1972": 411, "1979": 411, "198": 411, "1983": 25, "198303": 411, "1987": 411, "198987": 411, "199": 410, "1993": 397, "1994": 411, "19_": 369, "19x": 425, "1_1": 369, "1a": 410, "1a0": 410, "1a6": 410, "1ac": 410, "1b2": 410, "1b7": 428, "1b9": 410, "1bf": 410, "1c5": 410, "1cb": 410, "1d2": 410, "1d9": 410, "1e": [281, 314, 348, 349, 352, 376, 422], "1e0": 410, "1e7": 410, "1ed": 410, "1f": 369, "1f3": 410, "1f9": 410, "1m": 397, "1ubuntu2": 369, "1x": [304, 425, 426], "1x1": [17, 303, 390], "1x16": [403, 409], "1x4": [304, 409], "1\u6a21\u578b\u63d0\u4f9b\u52a0\u901f": 420, "2": [17, 20, 21, 25, 28, 29, 32, 36, 57, 256, 257, 260, 266, 269, 281, 289, 300, 303, 304, 305, 306, 308, 309, 320, 323, 327, 328, 329, 330, 332, 337, 340, 347, 350, 358, 366, 372, 373, 384, 386, 387, 389, 390, 391, 392, 395, 396, 397, 402, 403, 404, 409, 410, 411, 413, 415, 416, 418, 419, 420, 421, 422, 425, 426, 428, 432], "20": [28, 264, 288, 289, 302, 324, 361, 388, 393, 397, 403, 410, 411, 425], "200": [247, 361, 364, 365, 366, 410, 425, 432], "2000": [314, 349, 352], "20013": 425, "2003": 410, "2005": 411, "2007": 425, "2009": 411, "200k": 349, "2010": [397, 411], "2012": 425, "2013": 407, "2016": [25, 411], "2017": 361, "2019": 377, "202": 304, "2021": [266, 272, 302], "20210514": [425, 426], "2022": [272, 302, 358, 372, 397, 411, 432], "2023": [272, 316, 347, 415, 425, 426, 432], "202306": 432, "2024": [319, 361, 362], "2025": 411, "2031": 411, "2038": 411, "203901": 411, "2044": 411, "2048": [17, 247, 288, 413, 432], "204966": 411, "204973": 411, "205": [397, 411], "20505": 397, "2055": [411, 425], "206": 410, "2060": 411, "206049": 411, "207": 397, "2071": 411, "20787": 425, "20824": 411, "2085": 425, "208555": 411, "2086": 411, "2089": 411, "209526": 411, "20b": [314, 315, 349], "20c": 410, "20k": 349, "20m": 397, "21": [9, 288, 403, 411, 425], "210": 425, "211": [397, 411, 425], "2110": 411, "2116": 397, "2118": 425, "211893": 411, "2119": 411, "212": 410, "2120": 425, "2121": 425, "212152": 411, "21269": 411, "2129": 425, "2131": [397, 411], "2134": 411, "21341": 425, "213454": 411, "214": 425, "214208": 411, "21431": 411, "2146": 411, "2148": 411, "215": 411, "2150": 397, "2156": 411, "21568": 425, "2160": 411, "2163": 411, "216338": 411, "2165": 411, "217": 397, "2174": 425, "2181": 397, "218765": 411, "219": [410, 411, 425], "219777": 411, "21f": 410, "21x": 304, "22": [288, 314, 315, 349, 361, 372, 403, 411, 425], "2201": 411, "220585": 411, "2206": 432, "220947": 411, "220994": 411, "221": [397, 425], "2210": [411, 432], "2211": 411, "222": 425, "2220": 411, "22241": 411, "222661": 411, "2229": 411, "2232": 411, "223615": 411, "22389": 425, "2239": 411, "224": [20, 410, 411], "224925": 411, "22499": 425, "225": 410, "225023": 411, "2251": 411, "2263": 397, "2266": 411, "2267": 411, "227": 425, "2271": 411, "2274": 425, "22776": 425, "227976": 411, "228043": 411, "2284": 385, "2285": 411, "228752": 411, "2290": 425, "22951": 425, "229837": 411, "22b": 410, "23": [288, 309, 337, 351, 361, 364, 365, 372, 403, 411, 425, 426], "2301": 411, "2306": 432, "2308": 425, "2309": 432, "230945": 411, "231": 425, "2310": 432, "232": [410, 411], "2320": 411, "2326": 411, "233057": 411, "233231": 411, "234": 425, "2342": 425, "2345": 349, "235": 397, "2351": 397, "2354": 425, "2357": 411, "2359": 425, "236101": 411, "236418": 410, "2365": 425, "2369": 411, "237": [366, 411], "2377": 411, "23772": 425, "238": [410, 425], "238855": 411, "23e": 410, "24": [57, 288, 361, 395, 397, 403, 411, 425], "24038": 411, "2404": 411, "240739": 411, "2409": 397, "241": 397, "2415": 411, "242": [304, 397, 411], "2420": 411, "2421": 425, "242512": 411, "2427": 425, "2429": 425, "243012": 411, "2433": 411, "2435": 411, "2439": 411, "244": [397, 410, 425], "2449": 411, "245": [397, 411], "2463": 411, "2467": 425, "247251": 411, "247491": 411, "2475": 411, "24910": 397, "24b": 410, "25": [260, 288, 346, 350, 372, 403, 411, 413, 425], "2504": 425, "2505": [397, 411], "2507": 411, "250t": 351, "251": [304, 411], "2510": 411, "251221": 411, "2513": [411, 425], "252": [304, 410], "2525": 411, "253": 304, "2537": 411, "254835": 411, "25485": 397, "255": [21, 397, 408, 423], "255199": 411, "255598": 411, "2558": 411, "256": [247, 259, 376, 389, 411, 413], "256619": 411, "256635": 411, "256715": 411, "2568": 411, "256gb": [425, 426], "256px": 348, "256x1024": 389, "256x256": [389, 413], "257138": 411, "2576": 397, "2578": [411, 425], "257989": 411, "25799": 411, "258": 397, "2580": 411, "2582": 411, "259": [397, 410, 411], "259051": 411, "2594": 411, "26": [288, 361, 403, 410, 411, 425], "260": 410, "26056": 411, "2608": 411, "261": 397, "261028": 411, "2612": 411, "261265": 411, "2615": 411, "262": [397, 425], "2624": 411, "263": 397, "2633": 411, "263316": 411, "264": 411, "2642": 411, "2643": [411, 425], "2652": 397, "2653": [411, 425], "26552": 425, "266": 397, "2663": 411, "2665": 425, "2669": 425, "266945": 411, "267": 410, "267289": 411, "2673": 411, "267367": 411, "2677": 425, "2678": 411, "2683386": 25, "2686": 425, "2689": 425, "269": 304, "2693": 425, "2694": 411, "269504": 411, "2697": 411, "26974": 411, "2698": [411, 425], "2699": 411, "26e": 410, "27": [288, 361, 403, 411, 425], "2701": 425, "2703": 425, "2704": 411, "2706": 425, "2709": 425, "271": 411, "271587": 411, "2718": 411, "2720": 425, "2721": 425, "2725": 425, "27264": 425, "2728": 425, "2729": 425, "2730": 425, "273363": 411, "2735": 411, "2737": 425, "274": 397, "2741": 397, "27412": 385, "2742": 425, "2743": 411, "274441": 411, "2746": 411, "275": [397, 410, 425], "2751": [411, 425], "2753": 425, "27579": 425, "2758": 425, "2763": 425, "2768": 425, "2774": 411, "277815": 411, "2783": 411, "2784": 411, "2795": 411, "2796": 353, "27c": 410, "28": [288, 304, 337, 361, 397, 403, 411, 425], "28032": 425, "2804": 397, "280686": 411, "2807": 411, "281": 425, "2813": 411, "2815": 411, "282": 397, "2821": 411, "2822": 411, "282241": 411, "2824": 411, "2825": 425, "2828": 411, "283": 410, "283046": 411, "2831": 411, "28321": 411, "2834": 411, "283445": 411, "2835": 411, "2836": 411, "28399": 397, "284": 411, "2842": [411, 425], "2844": 411, "2846": 411, "28479": 425, "2850": 411, "2854": 411, "2856": 411, "2858": 411, "28593": 411, "286141": 411, "286461": 411, "2866": 411, "2867": 425, "2868": 425, "2869": 411, "286973": 411, "287": 411, "2870": 411, "2871": 411, "2876": [411, 425], "2879": 411, "2882": 411, "288236": 411, "2889": 411, "289": 411, "2896": [411, 425], "2898": 411, "28a": 410, "29": [288, 403, 411, 425, 426, 427], "2901": 411, "2902": 411, "2906": 411, "291": [410, 425], "2918": 397, "2919": 411, "2921": 397, "29220": 410, "2923": 411, "2928": 397, "293": 425, "2930": 411, "2931": 411, "2935": 411, "2944": 411, "29501": 347, "2953": 411, "2954": 411, "2958": 411, "296": 397, "2962": 411, "2965": [397, 411], "2969": 411, "297": 411, "2970": 411, "2974": 411, "2975": 411, "298": [410, 411], "2980": 411, "2983": 411, "298489": 411, "2988": 411, "298907": 411, "2994": 411, "2995": 411, "299561": 411, "29a": 410, "29c": 410, "29e": 410, "29gvlhfosjhehtgql4hgxp": 361, "2a0": 410, "2a1": 410, "2a2": 410, "2a5": 410, "2b": 349, "2b_peft_finetuned_model": 349, "2c": 410, "2d": [260, 399, 413], "2e": [314, 376], "2nd": [25, 405, 408], "2x1": 409, "2xk": 402, "3": [20, 21, 25, 57, 256, 257, 281, 289, 300, 303, 304, 305, 308, 309, 320, 322, 323, 324, 327, 328, 329, 330, 331, 332, 337, 340, 345, 347, 350, 351, 362, 371, 383, 384, 385, 386, 387, 389, 390, 391, 392, 393, 394, 395, 396, 397, 401, 403, 404, 409, 411, 413, 414, 416, 420, 421, 422, 425, 426, 432], "30": [28, 288, 351, 369, 403, 425], "300": [376, 429, 432], "3008": 411, "300k": 349, "301": 397, "3010": 411, "3011": 411, "3018": [411, 425], "302": 411, "3025159985633461085": 390, "3026": 425, "303": 397, "3030": 411, "303455": 411, "3035": 411, "30458": 411, "3046": 425, "3049": 411, "3050": 411, "30522": 388, "3053": 411, "3058": 411, "3060": 425, "3064": 425, "307141": 411, "3072": [25, 411], "3077": 411, "307908": 411, "308": 425, "3080": 425, "3085": 411, "309195": 411, "3093": 411, "30b": [288, 428], "31": [288, 332, 346, 385, 397, 403, 404, 411, 425], "310": 425, "3113": 411, "311348": 411, "3113761e": 354, "311691": 411, "3117": 411, "3121": 411, "31211": 397, "3125": 425, "313": 425, "3130": 411, "3132": 411, "313656": 411, "31382": 425, "3147": 411, "3148": 411, "315": 304, "31592": 411, "316": 397, "317": 411, "317204": 411, "317837": 411, "318": [397, 411, 425], "318094": 411, "3185": 411, "3191": 425, "31929": 411, "319865": 411, "31x": 425, "32": [247, 288, 305, 371, 385, 388, 396, 397, 403, 404, 406, 407, 408, 409, 410, 411, 413, 425, 426, 427, 428, 432], "320": [397, 425], "3219": 411, "3226": 397, "3227": 411, "3230": 411, "323476": 411, "3235": 411, "3237": 411, "324": [377, 411, 425], "3240": 411, "3241": 411, "3245": 411, "3255": 397, "3264": 397, "326917": 411, "3276": 411, "328": 425, "3284": 411, "3288": 411, "3290": 425, "32966": 425, "32x16": 403, "32x4d": 17, "32x8d": 17, "33": [288, 304, 346, 350, 361, 396, 411, 425], "330": 425, "3300": [397, 411], "3306": 334, "3307": 425, "3314": 411, "332": [397, 411], "332153": 411, "3322": 411, "33246": 411, "3325": 411, "333": 411, "33386": 411, "3341": 411, "3348": 425, "3353": 411, "336": 350, "336519": 411, "3368": 397, "3369": 411, "336px": 350, "337": 411, "337529": 411, "3377": 411, "338": 425, "3382": 411, "3389": 425, "339": 397, "3391": 411, "3393": 411, "3394": 411, "3399": [411, 425], "33x": 304, "34": [17, 255, 288, 304, 324, 326, 343, 346, 348, 349, 352, 372, 384, 411, 425], "3405": 425, "3408": 425, "340939": 411, "3412": 425, "342843": 411, "3433": [411, 425], "3436": 411, "3441": 411, "34423": 411, "3448": 411, "345": 411, "3453": 411, "3462": 411, "346369": 411, "3467": 411, "3479": 425, "348": [411, 425], "3487": 411, "3489": 411, "349": 397, "3494": [411, 425], "34b": [309, 326, 330, 332], "35": [288, 304, 323, 327, 328, 329, 330, 366, 397, 411, 425], "350": 385, "350147": 411, "350m": [323, 428], "351": 411, "3519": 411, "3522": [411, 425], "353": 397, "3532": 411, "3538": 411, "354": 428, "3542": 428, "3543": 411, "355651": 411, "3557": 411, "3563": 425, "3572": 411, "357348": 411, "3576": 411, "358": 411, "3583": 411, "3584": 411, "3585": 425, "35873": 425, "358769": 411, "358932": 411, "3590": 411, "359791": 411, "36": [288, 345, 351, 361, 383, 385, 397, 411, 425], "360": [397, 420], "3601": 411, "3604": 411, "3606": 411, "3616": 411, "3617": 425, "3626": 411, "363": 397, "36322": 411, "3634": [397, 411], "364": 397, "3641": 411, "3642": 411, "3646": 411, "3647": [397, 425], "3650": 425, "3651": [411, 425], "3659": 411, "366": 389, "366328": 411, "3678": 397, "368": [411, 425], "3681": 397, "3684": 397, "3694": 411, "369429": 411, "369466": 411, "3698": 411, "37": [288, 346, 397, 411, 425], "3701": 411, "3712": 411, "3725": 425, "3730": 411, "3732": 411, "37333": 411, "3736": 411, "3739": 425, "3741": 411, "375": 411, "3752": 425, "375284": 411, "37537": 411, "3755": 411, "3757": 428, "3758": 411, "3761": 425, "376539": 411, "377": 411, "379": 428, "379699": 411, "3797": 425, "3798": 425, "379899": 411, "3799": 425, "37m": 386, "38": [288, 346, 361, 397, 410, 411, 425], "3800": 411, "3804": 428, "380582": 411, "3813": 425, "3822": 411, "382208": 411, "3823": 411, "3829": 411, "3833": 411, "384": [30, 390, 397, 411], "3848": 411, "3849": 411, "3850": 411, "3855": 411, "386": 411, "3868": 411, "387": 411, "3882": 411, "3887": 428, "3889": 411, "389": 397, "3894": 411, "3898": 411, "3899": 425, "39": [288, 346, 372, 397, 411, 425], "390": 425, "39024": 411, "391055": 411, "3912": 411, "391387": 411, "3919": 411, "392": 397, "39218": 397, "3927": 411, "393": 425, "3930": 428, "3933a071": 25, "3934": 411, "3940": 411, "3943": 411, "3947": [411, 428], "3952": 411, "3956": 411, "396634": 411, "397": 425, "3979": 425, "398": [361, 425], "3983": 411, "398509": 411, "3986": 411, "3991": 411, "39914": 425, "3993": 411, "3999": 425, "3a14": [425, 426], "3b": 428, "3b3f03e3f12": 319, "3c89": 319, "3d": [16, 19, 406, 413, 437], "3e": [376, 410], "4": [28, 36, 44, 57, 246, 247, 256, 257, 258, 263, 281, 289, 298, 300, 303, 304, 308, 313, 316, 319, 320, 323, 324, 326, 327, 328, 329, 330, 332, 337, 343, 347, 348, 349, 350, 354, 363, 366, 372, 374, 387, 389, 390, 391, 394, 395, 396, 397, 403, 404, 405, 406, 409, 410, 413, 420, 421, 422, 425, 426, 427, 428, 429, 432], "40": [288, 346, 390, 397, 420, 425], "4018": 425, "402406": 411, "4036": [411, 425], "4041": 411, "4047": [411, 425], "4049": 425, "405": 397, "4050": 411, "4057": 411, "4061": 411, "407388": 411, "408": 411, "408357": 411, "4084": 411, "409": 397, "4090": 411, "4096": [304, 411, 425], "41": [288, 304, 321, 346, 397, 411, 425], "410": 425, "4101": 425, "4107": 411, "412174": 411, "412912": 411, "4132": 411, "4133": 425, "4142": 411, "4147": 411, "4149": 428, "415": [304, 397, 425], "4154": 410, "4155": 410, "4156": 410, "4157": 410, "41598": 397, "415c": 410, "415d": 410, "415e": 410, "415f": 410, "416": 425, "4161": 411, "4164": 411, "416571": 411, "4167": 411, "4172": 428, "4176": 411, "418491": 411, "419": 425, "4191": 425, "41x": 425, "42": [288, 346, 397, 411, 425], "4200": 411, "4201": 411, "420619": 411, "4208": 411, "42134": 411, "42145": 411, "421781": 411, "422": 361, "4221": 411, "4224": 411, "4225": 411, "422517": 411, "4226": 411, "4228": 425, "423052": 411, "4248": 411, "4253": 411, "4262": 411, "4269": 425, "4275": 425, "4285": 411, "42874": 425, "429": [397, 410], "429166": 411, "4294": 411, "43": [288, 346, 397, 411, 425], "430": 425, "430288": 411, "432": 425, "4321": 411, "4334": 411, "433492": 411, "4339": 397, "4347": 411, "4352": 411, "435488": 366, "4356": 411, "4361": 425, "4366": 411, "437": 411, "4370": 411, "4373": 411, "4374": 425, "4383": 411, "4384": 425, "4389": 425, "439": 425, "4395": 425, "4398": 411, "44": [288, 304, 361, 372, 389, 397, 410, 411, 425], "4402": 411, "4409": 425, "4418": 411, "442": 397, "4430": 411, "44309": 411, "4433": 411, "4435": 425, "444133": 411, "4445": 411, "4448": 425, "4457": 411, "446": [397, 425], "4460": 397, "446442": 411, "4466": 411, "447": 425, "4481": 411, "4483": [411, 425], "4485": 411, "45": [288, 346, 351, 397, 411, 425], "4500": 411, "4501": 411, "451": 411, "4516": 428, "4517": 411, "4520": 411, "4521": 411, "4523": 425, "4526": 411, "4533": 428, "454": 361, "45434": 425, "4551": 411, "4553": [411, 425], "4559": 411, "456": [397, 411], "4561": 425, "4568": 411, "4579": 411, "458": 425, "4582": 411, "4586": 411, "459915": 411, "46": [288, 346, 397, 411, 425], "461b": 319, "462": 411, "4627": 411, "462737": 411, "4628": 411, "4632": 411, "4634": 428, "4636": 411, "4638": 411, "465": 411, "4650": 411, "4654": 411, "4658": [410, 411], "467": 397, "4683": 411, "46x": 425, "47": [288, 304, 331, 346, 397, 411, 425], "4701": 411, "4707": 425, "4714": 411, "4723": 425, "472466": 411, "4727": 411, "473": 397, "4737": 425, "4746": 425, "475": 385, "4750": 397, "475444": 411, "4769": 411, "47752": 411, "4784": 411, "4786": 411, "48": [28, 288, 331, 397, 411, 425], "4800": [425, 426], "4802": 411, "480308": 411, "4806": 411, "4807": 425, "4808": 397, "4822": 425, "4828": 428, "4829": 411, "483": 411, "483053": 411, "4834": 425, "4838": 411, "484": 397, "4858": 411, "48699": 425, "487": 397, "4873": 411, "4876": 411, "488558": 411, "489b": 319, "48b9": 319, "48x": 425, "49": [288, 304, 346, 395, 397, 411, 425], "4904": 411, "4906": 428, "49120": 397, "4913": 425, "4914": 425, "4920": 397, "4936": 428, "4940": 425, "4948": 411, "4951": 425, "4971": 425, "497127": 411, "4972": 411, "4980": [425, 428], "499": 397, "4990": [411, 425], "4993": [411, 425], "4997": 425, "4_bit_llama2": 432, "4a": 410, "4bbb": 332, "4c8b3f": 410, "4c8b6f10": 410, "4c8b7708": 410, "4d": 413, "4ddp": [314, 349], "4e": 352, "4g": 366, "4th": [272, 302, 349, 354], "4x": 409, "4x1": [28, 389, 399, 409], "4x16": [408, 409], "4x4": 409, "5": [9, 25, 28, 57, 133, 134, 135, 136, 216, 217, 218, 221, 222, 223, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 247, 266, 281, 288, 289, 303, 314, 320, 321, 324, 337, 346, 347, 349, 350, 351, 354, 369, 372, 387, 388, 389, 391, 394, 395, 397, 403, 411, 413, 425, 426, 428, 432], "50": [17, 25, 255, 266, 288, 302, 346, 410, 425], "500": [354, 422], "5000": 354, "5005": 425, "500698": 411, "501": 397, "5011": 425, "5018": 428, "5019": 425, "501ff80d3f56": 332, "5024": 425, "5025": 425, "50257": 354, "5031": 411, "503341": 411, "5045": 425, "5046": 411, "5048": 428, "505": 411, "5050": 411, "5057": 428, "506": 411, "5071": 289, "5076": 411, "508": 411, "5084": 425, "5085": 425, "5087": 411, "509": 397, "5094": 411, "50m": 397, "51": [288, 346, 397, 411, 425], "510039": 411, "51009": 411, "5108": 411, "5119": 425, "512": [17, 247, 288, 332, 346, 372, 375, 389, 404, 406, 409, 411, 413, 425], "513": 397, "5137": 425, "514108": 411, "5147": 425, "5149": 411, "515": 397, "5151": 425, "5153": 411, "5156": 411, "515k": 350, "5164": 425, "517278": 411, "5173": 425, "5176": 425, "518": 428, "518276": 411, "5184": 425, "5185": 428, "518614": 411, "5190": 425, "5192": 425, "5193": 425, "5197": 425, "5199": 425, "52": [288, 346, 371, 411, 425], "5202": 411, "5206": 425, "5207": 425, "5210": 425, "5212": 425, "5213": 425, "5222": 425, "5223": 425, "5225": 425, "5227": 425, "5228": 425, "5230": 425, "5231": 425, "5232": 425, "5234": 425, "5241": 425, "5242": 425, "5243": 425, "5244": 425, "5245": 425, "525": 411, "52509": 411, "5252": 425, "5252279507a7": 319, "5253": 425, "52532": 361, "5257": 425, "526": 411, "5262": 411, "527": [397, 411], "5271": 411, "5274": 425, "5280": 397, "5282": 425, "5284": 411, "529": 411, "52k": [314, 349], "53": [288, 353, 410, 411, 425], "5303": 411, "53204": 411, "533329": 411, "5337": 411, "5346": 425, "534615": 411, "5361": 411, "5366": 411, "537405": 411, "5376": 411, "538": 332, "53x": 425, "54": [288, 346, 354, 410, 411, 425], "5408": 411, "5418": 411, "54288": 411, "5432": 425, "5436": 428, "543634": 411, "5439": 411, "5440": 397, "544194": 411, "5443": 428, "545": 397, "5457": 411, "5478": 411, "5482": 397, "5488": 425, "549": 397, "5498": 411, "54x": 425, "55": [288, 304, 346, 372, 410, 411, 425], "5503": 411, "5507": 425, "5513": 411, "5518": 411, "5521": 411, "5535": 425, "5541": 411, "5544": 425, "5552": 428, "5555": [411, 425], "556249": 411, "55628": 397, "5566": 425, "557": 397, "5578": 411, "558061": 411, "558473": 411, "558k": 350, "5593": 428, "5594": 425, "55it": 361, "56": [288, 304, 310, 314, 346, 349, 361, 385, 410, 411, 425, 426], "5600": 411, "5604": 411, "560m": 428, "561317": 411, "5615": 411, "5617": 411, "561805": 411, "562": 397, "5624": 411, "5633": 411, "564": 411, "5644": 411, "564787": 411, "5652": 411, "5662": 411, "5672": 411, "569": 411, "5692": 411, "5695": 411, "56982": 410, "57": [288, 351, 411, 425], "5703": 411, "5713": 411, "573": 411, "5733": 425, "5742": 428, "5748": 411, "5764": 428, "5770": 411, "5772": 411, "578": [397, 411], "5781": [411, 425], "5789": [411, 428], "57x": 425, "58": [288, 304, 366, 411, 425], "5805": 411, "581": 411, "5810": 411, "5811": 425, "5820": 411, "5822": 411, "582871": 411, "583": 411, "5843": 425, "586": 411, "5861": 411, "587": [397, 411], "5876": 411, "588": 411, "5884": 411, "589": 397, "589803": 411, "59": [288, 372, 411, 425], "5912": 411, "592": 411, "592043": 411, "5923": 411, "5933": 411, "5953": 411, "5956": 411, "5959": 411, "59625": 411, "596568": 411, "5968": 411, "5969": 411, "5970": 411, "5977": [411, 428], "598": 411, "5980": 411, "598168": 411, "5986": 411, "599": 397, "59902": 361, "5993": 425, "5_13b": 351, "5_13b_val": 351, "5_adam": 352, "5_finetun": 376, "5b": [410, 428], "5c": 410, "5d": 410, "5e": [346, 347], "5ghz": [397, 411], "5x": [272, 420], "6": [57, 281, 303, 304, 313, 320, 330, 331, 347, 361, 369, 386, 387, 391, 395, 397, 401, 403, 410, 411, 423, 425, 426, 427, 428], "60": [288, 346, 376, 425], "600": [330, 332, 411, 423], "601": 411, "602": 397, "6023": 411, "6026": 397, "6034": 425, "6055": 425, "606477": 411, "608": 411, "6080": 397, "6081": 397, "60813": 411, "609": [397, 411], "61": [288, 346, 372, 411, 425], "6100": 411, "611059": 411, "6114": 425, "611718": 411, "613": 397, "6133": 411, "614": 397, "614109": 411, "6146": 411, "615338": 411, "616": 411, "6161": [411, 425], "6162": 411, "618": 397, "619": 397, "62": [288, 372, 410, 411, 425], "620": 411, "6201": 425, "62123": 411, "62126d40b8d7": 410, "62126d40b8f7": 410, "62127540b8cf": 410, "62127540b8ef": 410, "62127d40b8c7": 410, "62127d40b8e7": 410, "6221": 397, "62241": 411, "62409": 411, "62427d48183f": 410, "62427d48187f01": 410, "62427d48187f02": 410, "62427d48187f03": 410, "62427d48187f04": 410, "62427d48187f05": 410, "62427d48187f06": 410, "62427d48187f07": 410, "62427d48187f08": 410, "62427d48187f09": 410, "62427d48187f0a": 410, "62427d48187f0b": 410, "62427d48187f0c": 410, "62427d48187f0d": 410, "62427d48187f0e": 410, "62427d48187f0f": 410, "6246": 397, "6247": 428, "625089": 411, "62510d48eff6": 410, "62511548efe": 410, "62511d48efe4": 410, "62512d48efd2": 410, "62513548efc9": 410, "62513d48efc0": 410, "62517c48114506": 410, "62517c48114d07": 410, "62517c48115508": 410, "62517c48116509": 410, "62517c48116d0a": 410, "62517c4811750b": 410, "626": 366, "6263": [397, 411], "627": 397, "628": 397, "6289": 425, "629": 411, "6290": 397, "62926d40b8d7": 410, "62926d40b8f7": 410, "62927540b8cf": 410, "62927540b8ef": 410, "62927d40b8c7": 410, "62927d40b8e7": 410, "6297": 428, "62c17c481006": 410, "62c17c48104603": 410, "62c17c48104606": 410, "62c17c48104609": 410, "62c17c48104e01": 410, "62c17c48104e04": 410, "62c17c48104e07": 410, "62c17c48104e0a": 410, "62c17c48105602": 410, "62c17c48105605": 410, "62c17c48105608": 410, "62c17c4810560b": 410, "62d17c48114500": 410, "62d17c48114d01": 410, "62d17c48115502": 410, "62d17c48116503": 410, "62d17c48116d04": 410, "62d17c48117505": 410, "62f14d48eff6": 410, "62f15548efe": 410, "62f15d48efe4": 410, "62f16d48efd2": 410, "62f17548efc9": 410, "62f17d48efc0": 410, "63": [288, 372, 396, 397, 411, 425], "6313": 411, "6316": 411, "632": 411, "6322": 397, "63282": 411, "633": 397, "634": 361, "6341": 411, "6342": 425, "635554": 411, "635729": 411, "6362": [411, 425], "6365": 428, "6374": 397, "6378": 425, "638": 411, "6392": 428, "63x": 425, "64": [25, 259, 281, 288, 321, 327, 328, 329, 349, 372, 376, 389, 396, 397, 405, 407, 408, 410, 411, 413, 425, 426], "6404": 428, "641": 389, "641585": 411, "64247": 411, "64253": 411, "642672": 411, "6432": 411, "6437": 428, "644": 411, "6449": 397, "645": 411, "6462": 411, "6477": 397, "6487": 411, "64963": 411, "6499": 428, "64byte": 399, "65": [288, 411, 425], "65059": 411, "6509": 411, "6510": 385, "6518": 411, "652": 411, "6542": 428, "6543": 411, "655": 428, "6569": 428, "658": 411, "65b": [288, 428], "65k": 348, "65x": 425, "66": [288, 411, 425], "661b400b8983": 319, "6621": 428, "6633": 411, "6637": 425, "664": 411, "6659": 397, "668": 411, "66b": 428, "67": [288, 372, 411, 425], "6702": 411, "6718": 428, "6735": 428, "6737": 397, "6740": 428, "675": 411, "6757": 425, "6759": 411, "6760": 411, "6769": 428, "679": 411, "6796": 411, "6798": 425, "67x": 425, "68": [20, 21, 288, 304, 410, 411, 425], "680": 397, "6804": 428, "6814": 428, "682": 411, "6821": 428, "6831": 428, "68383": 411, "684": 397, "6847": 425, "685": 411, "685382": 411, "686": 411, "6860": 425, "6866": 428, "687": 397, "6872": 428, "6895": 428, "69": [288, 346, 411, 425], "690": [397, 411], "6917": 411, "69186": 411, "6923": 411, "693": 389, "694533": 411, "6947": 425, "695": 411, "6953": 428, "6968": 425, "697": 411, "697876": 411, "698": [397, 411], "699579": 411, "6b": [272, 302, 304, 309, 354, 428, 432], "6f": 410, "7": [55, 57, 281, 288, 304, 320, 347, 351, 361, 386, 387, 391, 393, 395, 397, 403, 411, 413, 416, 423, 425, 428], "70": [288, 389, 397, 425], "700": [330, 332], "7009": 411, "701": [397, 411], "701639": 411, "7019": [411, 425], "702": 411, "7021": 411, "703": [411, 425], "703207": 411, "704": 411, "70404": 411, "705": 411, "708": 411, "7081": 425, "70b": [288, 309, 326, 330, 332, 349], "71": [288, 397, 410, 411, 425], "711": 411, "711146": 411, "712": 411, "7121": 411, "7128": 428, "7143": 428, "7149": 428, "718776": 411, "718893": 411, "719": 397, "7192": 397, "72": [288, 411, 425], "720963": 411, "7213": 411, "722": 411, "7221": 428, "7225": 411, "724": [397, 411], "7256": 411, "726": 389, "7261": 411, "7262": 428, "7265": 411, "727": 411, "7282": 411, "729": [397, 425], "73": [288, 411, 425], "730678": 411, "7307": 411, "73162": 411, "7324": 411, "7326": 428, "7330": 428, "7334": 411, "7336": 376, "734": 411, "7341": 411, "735": 411, "7354": 411, "7357": 428, "7361": 428, "7369": [411, 428], "737": 411, "737943": 411, "738": 411, "7385": 376, "738939": 411, "7398": 428, "73x": 425, "74": [288, 397, 411, 425], "741": 411, "742": [411, 425], "743": 411, "7442": 411, "7445": 411, "745": [361, 411], "745357": 411, "7466": 411, "747": [397, 411], "74845": 411, "7488": 411, "749f02a5": 332, "75": [288, 346, 351, 385, 397, 411, 425], "750": 397, "75007": 377, "7502": 411, "7512": 411, "7516": 425, "7518": 411, "752": 397, "7520": 411, "753": 411, "75328": 411, "753487": 411, "75384": 397, "754": 411, "755": [411, 425], "756": [397, 411], "75786": 411, "758": 411, "759": 411, "7590": 428, "7599": 425, "75x": 425, "76": [288, 304, 346, 410, 411, 425], "760": 425, "7600": 425, "7608": 411, "761": 411, "7627": 428, "7637": 411, "764": 411, "76407": 411, "7643": 411, "7647": 411, "765": 411, "7651": 411, "767569": 411, "768": [372, 397, 411], "769": 397, "7690": 411, "77": [288, 346, 411, 425], "77082": 411, "771439": 411, "77317": 411, "77444": 411, "774m": 428, "775294": 411, "7759": 428, "7770": 411, "7774": 411, "7777": 336, "778244": 411, "7794": 411, "7799": 411, "77it": 361, "78": [288, 411, 425], "7803": 411, "781": 411, "7815": 411, "7833": 411, "784": 411, "7840": 428, "7850": 411, "786": [397, 411, 425], "7860": [345, 383, 384], "787": 411, "788": [397, 411], "789777": 411, "79": [288, 346, 372, 411, 425], "7901": 397, "7908": 428, "7924": 425, "7929": 425, "793": 425, "793822": 411, "7941": 411, "7957": 428, "7965": 411, "797": 411, "7978": 411, "798": 397, "799": 411, "7b": [288, 309, 313, 314, 315, 316, 319, 321, 324, 327, 328, 329, 331, 334, 338, 340, 343, 344, 345, 346, 350, 352, 361, 363, 364, 366, 371, 375, 377, 383, 384, 396, 420, 422, 427, 428, 429, 432], "7b1": 428, "7b86016aa1d2107440c1928694a7bba926509887": 427, "7c": 410, "7th": 377, "8": [57, 246, 247, 281, 302, 304, 305, 314, 320, 326, 330, 346, 347, 348, 349, 352, 354, 366, 387, 389, 391, 393, 395, 397, 401, 402, 403, 409, 410, 411, 413, 422, 423, 425, 426, 428, 432], "80": [288, 304, 321, 345, 383, 384, 389, 397, 411, 425], "8000": [309, 313, 316, 318, 323, 324, 326, 327, 328, 329, 330, 331, 332, 337, 338, 343, 344, 361, 363, 367, 375], "8008": 361, "801": 389, "8011": 425, "802": 411, "8021": [364, 365, 366, 425], "8029": 411, "8033": 411, "8034": 411, "804": 411, "8045": 425, "805": 411, "806": 411, "8061": 425, "807": 397, "81": [288, 361, 411, 425], "8101": 425, "8102": 411, "8105": 425, "811": 411, "8127": 425, "813": 411, "8148": 411, "816": 397, "817": [397, 411], "818": 411, "819": [397, 411], "82": [288, 397, 410, 411, 425], "821": [397, 411], "822": 411, "823": [411, 425], "8237": 425, "8240": 411, "8253": 425, "8255": 411, "826": [411, 425], "8262": 411, "8275": 376, "8280": 304, "8282": 397, "8297": 376, "82x": 304, "83": [288, 411, 425], "8300": 411, "83024": 411, "831": 411, "832": 411, "8325": 411, "832701": 411, "833": 397, "834": [411, 425], "835": 411, "8363": 425, "836616": 411, "8375c": [397, 411], "8381": 411, "8395": 425, "8399": 411, "84": [288, 304, 411, 425], "840": 411, "841": 411, "8412": 411, "84121": 411, "842": 425, "8426": [397, 411], "842936": 411, "843": 397, "844": 411, "8441": 425, "8447": 425, "845": 411, "8456": 319, "8466": 411, "848": 397, "8480": [310, 425, 426], "8481": 411, "8482": 425, "84983": 411, "85": [260, 288, 304, 397, 411, 425], "8507": 425, "8515": 425, "853": [411, 425], "853916": 411, "855": 411, "857": 411, "858": 411, "859": 411, "8598": 397, "86": [288, 304, 411, 425], "861": [411, 425], "862": [397, 411, 425], "863": [397, 411], "865": 397, "8652": 411, "867": 397, "868": [397, 411], "8689": 397, "87": [288, 304, 411, 425], "870": 397, "8711": 411, "8715": 425, "8728": 411, "87335": 411, "8736": 397, "874": 411, "87429": 411, "875": 411, "876": 411, "8768": 425, "87685": 411, "877": [397, 411], "8775": 425, "878": 411, "8798": 411, "87x": 425, "88": [288, 304, 397, 410, 411, 425], "880": 411, "880179": 411, "880185": 411, "881": 411, "8818": 411, "8823": 411, "883258": 411, "884": 411, "8841": 411, "885": 411, "887": 425, "8888": [322, 340], "889": 411, "88x": 425, "89": [288, 304, 411, 425], "890": [397, 411], "891": 411, "8916": 425, "892": 411, "8923": 425, "893": 425, "893959": 411, "894": [397, 411], "8940": 411, "895": 411, "8972": 397, "898": [397, 411], "8989": 425, "8b": 432, "8e": 410, "8ghz": [425, 426], "8x7b": [309, 349], "9": [28, 57, 302, 306, 308, 320, 323, 327, 328, 329, 330, 331, 332, 337, 348, 349, 361, 362, 377, 387, 395, 397, 403, 411, 413, 419, 425, 426, 427, 428, 432], "90": [288, 302, 304, 389, 397, 419], "900": [411, 425], "9000": 334, "9018": 411, "902": 411, "902588": 411, "9026": 425, "9031": 425, "904": [397, 411], "905": [397, 411], "906": 397, "9060": 411, "907": 411, "908": 397, "9088": 411, "909": 411, "909941": 411, "90ghz": [397, 411], "91": [288, 397, 411, 425], "910": [397, 411], "911": 411, "9110": 411, "912": 411, "913": [397, 411], "913626": 411, "914": [397, 411], "915": 411, "916": 411, "917": 397, "9176": 411, "9183": 411, "91db": 332, "92": [288, 411, 425], "9206": 425, "92067": 411, "921": [397, 411], "9213": 425, "922": 425, "923": 411, "924": 397, "926": 397, "926038": 411, "927": 411, "928": 397, "9283": 425, "929398": 411, "93": [288, 411, 425], "930": 411, "931": [397, 411], "933": [289, 397], "935": [397, 411], "936": [397, 411], "937": 411, "937824": 411, "938": [397, 411], "939": 411, "94": [288, 411, 425], "940": 411, "94057": 411, "941": 411, "9418": 425, "943": 411, "945": [397, 411], "946": 411, "947": [397, 411], "94733": 411, "949": 411, "94x": 304, "95": [288, 346, 351, 410, 411, 425], "951": 411, "9513": 319, "952": 397, "955251": 411, "956": 411, "957": [397, 411], "958": [397, 425], "959": 411, "96": [288, 372, 397, 411, 425], "9609": 425, "961": 397, "965": [397, 411], "966": 397, "967": 397, "968": 411, "96945": 411, "97": [288, 411, 425], "971": [397, 411], "972": [397, 411], "973": 397, "974": [397, 411], "975": [397, 411], "9761": 411, "977": 411, "978": 397, "979": [397, 411], "98": [28, 288, 385, 411, 425], "980": 425, "9817": 425, "982": [397, 411], "983": 397, "985": [397, 411], "9857": 425, "987": 397, "9876": 363, "988": 397, "989": 397, "9890": 411, "99": [288, 302, 304, 411, 425], "9919": 411, "993": [397, 411], "994": 411, "994935": 411, "995": 397, "996": [397, 411], "996979": 411, "997": 411, "998": [397, 425], "999": 411, "9998425245285034": 418, "9998886585235596": 418, "99x": 425, "9b": 410, "9ghz": [397, 411], "A": [0, 1, 4, 9, 24, 25, 29, 36, 45, 47, 50, 57, 246, 258, 260, 266, 268, 309, 348, 349, 351, 375, 387, 388, 395, 397, 399, 402, 403, 409, 411, 413, 420, 428], "And": [158, 319, 347, 361, 389, 390, 391, 392, 395, 400], "As": [57, 303, 316, 342, 361, 371, 387, 389, 391, 392, 403, 407, 409, 432], "At": [25, 300, 361, 405, 406, 408], "Be": [62, 243], "Being": 298, "But": [57, 388, 399, 418], "By": [24, 246, 272, 351, 366, 379, 420], "FOR": [403, 404, 409], "For": [32, 52, 53, 57, 62, 181, 256, 257, 258, 266, 270, 271, 288, 298, 302, 306, 307, 308, 309, 314, 319, 325, 330, 332, 345, 346, 347, 348, 350, 352, 355, 357, 359, 361, 369, 372, 376, 378, 383, 384, 387, 390, 391, 395, 396, 397, 398, 400, 403, 407, 408, 409, 410, 411, 418, 425, 426, 427, 428], "If": [0, 17, 24, 25, 29, 33, 36, 44, 57, 246, 247, 260, 266, 269, 300, 303, 305, 308, 309, 313, 314, 315, 316, 317, 318, 330, 332, 335, 340, 349, 350, 351, 361, 363, 369, 372, 373, 375, 376, 380, 387, 389, 390, 391, 392, 395, 400, 406, 413, 415, 416, 419, 421, 423, 426, 427, 432], "In": [24, 36, 57, 258, 298, 302, 309, 314, 316, 321, 322, 330, 332, 338, 347, 349, 351, 356, 366, 368, 369, 372, 376, 387, 388, 389, 390, 391, 392, 395, 396, 399, 400, 401, 402, 403, 404, 405, 406, 407, 408, 409, 416, 417, 423, 425, 426, 429, 432], "It": [25, 35, 57, 147, 256, 257, 289, 303, 307, 319, 350, 354, 367, 369, 372, 376, 377, 378, 387, 389, 390, 391, 394, 395, 396, 404, 405, 407, 408, 413, 432], "Its": 391, "No": [269, 304, 361, 372], "Not": 404, "OF": 322, "ON": [398, 413], "OR": 351, "Of": [363, 389, 395, 402], "On": [325, 376, 420, 425, 426], "One": [319, 361, 365, 377, 379], "Or": [313, 316, 395, 420], "Such": 323, "TO": [403, 427], "That": [57, 408, 409], "The": [0, 2, 3, 4, 6, 10, 14, 17, 25, 28, 29, 30, 32, 35, 36, 39, 40, 41, 42, 44, 47, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, 246, 250, 251, 256, 257, 260, 263, 266, 267, 272, 274, 277, 279, 280, 282, 286, 289, 298, 302, 303, 308, 309, 311, 314, 315, 316, 319, 321, 334, 338, 340, 345, 346, 347, 348, 349, 353, 354, 355, 356, 358, 361, 366, 369, 370, 371, 372, 373, 375, 376, 377, 378, 379, 382, 383, 384, 385, 386, 387, 388, 389, 390, 391, 392, 393, 394, 395, 396, 399, 400, 401, 403, 404, 405, 406, 407, 408, 409, 413, 416, 418, 419, 420, 423, 429, 432], "Then": [32, 49, 57, 303, 309, 321, 330, 332, 351, 355, 364, 365, 366, 372, 375, 391, 392, 408, 409, 413, 419, 423], "There": [49, 303, 373, 376, 387, 388, 389, 406, 410, 413, 417, 419, 426], "These": [25, 256, 257, 270, 309, 327, 328, 329, 357, 361, 372, 387, 391, 395, 402, 408], "To": [24, 25, 36, 44, 270, 300, 307, 314, 319, 321, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 337, 338, 340, 342, 343, 344, 345, 347, 348, 349, 351, 358, 361, 363, 364, 365, 369, 370, 372, 373, 383, 384, 387, 402, 405, 407, 409, 413, 414, 422, 426, 427, 429, 432], "Will": [25, 420], "With": [272, 314, 349, 371, 408, 420, 423], "_": [24, 44, 270, 303, 307, 309, 314, 317, 318, 321, 322, 323, 324, 327, 328, 329, 331, 332, 336, 345, 346, 347, 348, 349, 350, 351, 352, 354, 361, 363, 366, 372, 376, 383, 384, 385, 386, 387, 388, 389, 390, 391, 392, 394, 395, 398, 399, 400, 401, 405, 406, 407, 408, 413, 416, 417, 418, 419, 423, 426, 428, 432], "__call__": 387, "__file__": [314, 332, 349], "__global": 402, "__init__": 387, "__kernel": 402, "__local": 402, "__m256i": 403, "__m512i": 403, "__str__": 247, "__version__": 421, "_attr": 387, "_create_out_pattern": 391, "_datatyp": 28, "_devic": 25, "_get_pattern_info": 391, "_mm256_loadu_epi": 403, "_mm512_castsi256_si512": 403, "_mm512_inserti32x8": 403, "_mm512_permutexvar_epi16": 403, "_mm512_set_epi16": 403, "_mm512_storeu_epi32": 403, "_n": 57, "_replace_pattern": 391, "_set_attr": 387, "a1": 410, "a100": [320, 379], "a1ef": 319, "a32543254": 299, "a7": 410, "a_node_name_1": 395, "a_node_name_2": 395, "a_node_name_n": 395, "a_scal": 62, "ab": [36, 260, 423], "abi": 361, "abil": [358, 372, 382, 428], "abl": [266, 330, 332, 335, 336, 363, 380, 407, 423], "about": [4, 25, 57, 298, 302, 309, 311, 316, 318, 319, 321, 337, 338, 345, 358, 361, 364, 365, 366, 367, 370, 372, 375, 383, 384, 387, 391, 394, 397, 401, 409, 411, 420, 425, 426, 429], "abov": [36, 44, 49, 57, 256, 257, 266, 309, 322, 364, 377, 386, 387, 390, 391, 395, 402, 403, 405, 406, 407, 412], "absolut": [256, 257, 259, 314, 315, 416, 423], "absorb": 372, "absorb_to_lay": 247, "abspath": [314, 332, 349], "abstract": [25, 33, 36, 44], "abus": 298, "academ": 350, "acb8": 332, "acc": [402, 413, 414], "acc91": 304, "acceler": [9, 38, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 151, 152, 153, 154, 155, 156, 157, 159, 160, 161, 162, 163, 164, 165, 166, 167, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 182, 183, 185, 186, 187, 188, 189, 190, 191, 193, 194, 196, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 293, 302, 304, 305, 314, 315, 327, 328, 329, 349, 352, 357, 366, 385, 392, 400, 401, 405, 407, 420, 421, 423, 428, 432, 436], "accent": 369, "accept": [24, 246, 298, 361, 372, 376, 418, 428, 432], "access": [264, 266, 270, 313, 314, 316, 330, 332, 335, 345, 348, 349, 361, 370, 372, 373, 380, 383, 384, 399, 404, 405], "accommod": [306, 370, 372], "accompani": 415, "accomplish": 371, "accord": [25, 55, 313, 316, 317, 318, 330, 335, 349, 371, 376, 379, 380, 387, 390, 409], "accordingli": [331, 332], "account": [36, 44, 298, 351], "accumul": [25, 307, 402, 407, 409], "accur": [25, 316, 372, 432], "accuraci": [25, 246, 264, 268, 304, 306, 307, 313, 319, 338, 354, 357, 358, 371, 372, 373, 376, 405, 413, 416, 417, 425, 428, 429, 432], "accuracy_criterion": 423, "accuracycriterion": 423, "achiev": [246, 302, 321, 354, 402, 417, 425, 428], "acoust": 369, "acquir": [348, 349], "across": [312, 319, 320, 323, 325, 326, 330, 331, 332, 376, 379, 428], "act": [288, 298, 404], "action": [269, 298], "activ": [256, 257, 261, 309, 323, 330, 331, 332, 337, 354, 361, 362, 372, 388, 392, 393, 396, 399, 403, 404, 405, 408, 413, 421, 423, 428, 432], "activation_dag": 396, "activation_mem_compress": 396, "actual": [345, 377, 383, 384, 387, 388, 406], "ad": [25, 36, 44, 255, 376, 378, 387, 391, 395, 400, 409, 413, 418], "adafactor": 352, "adapt": [4, 28, 32, 38, 43, 272, 298, 302, 309, 314, 331, 349, 352, 353, 354, 357, 359, 360, 372, 374, 420, 422, 430], "adapter_model_nam": 352, "add": [25, 30, 39, 40, 41, 47, 55, 57, 73, 269, 303, 314, 315, 317, 318, 347, 364, 366, 369, 371, 376, 388, 389, 390, 391, 395, 398, 400, 401, 408, 413, 414, 421, 422, 439], "add_1": 395, "add_284": 389, "add_37": 389, "add_bia": [413, 421], "add_cls_token": 150, "add_config_item": 55, "add_cross_attent": [36, 44], "add_embed": 150, "add_execut": 388, "add_gen": 30, "add_pooling_lay": [36, 44], "addclstoken": [130, 138], "addembed": 131, "addit": [50, 246, 247, 302, 304, 309, 359, 372, 378, 389, 402, 406, 414, 424, 432], "addition": [25, 319, 325, 357, 370, 373, 432], "additional_cmd": 27, "addr": 400, "addr_dst": 401, "addr_ptr": 400, "addr_src": 401, "addr_typ": 400, "address": [272, 288, 298, 314, 319, 322, 325, 337, 338, 345, 349, 358, 370, 372, 373, 375, 377, 383, 384, 400, 405, 406, 420, 429], "addv2": [57, 73, 395], "adher": [270, 300, 307, 319, 372], "aditya": 301, "adjac": 409, "adjust": [266, 288, 289, 309, 314, 331, 332, 345, 349, 357, 369, 372, 383, 384, 394, 423, 432], "adopt": [25, 364, 365, 366, 370, 399, 404, 409, 432], "advanc": [272, 298, 302, 309, 335, 357, 361, 372, 378, 380, 398, 410, 420, 432], "advantag": 370, "adventur": 361, "advis": 349, "ae": 410, "affect": [350, 390, 405, 408, 414], "affin": [255, 423], "aforement": 355, "after": [0, 24, 25, 36, 47, 57, 147, 181, 195, 220, 247, 256, 257, 260, 264, 305, 309, 313, 322, 347, 348, 349, 350, 355, 357, 363, 366, 372, 376, 382, 386, 389, 390, 391, 392, 394, 395, 399, 401, 406, 408, 409, 412, 413, 414, 423, 427, 428], "after_lay": 24, "afterward": [332, 395], "ag": 298, "again": [363, 406], "against": 409, "agent": [361, 372, 420], "agent_qa": 372, "aggreg": [17, 25], "agnost": 303, "agreement": [335, 380], "ahouzi": 301, "ai": [272, 302, 309, 315, 319, 325, 333, 334, 346, 361, 372, 420, 425, 426], "ai_photo": 334, "aid": [309, 321], "aidan": [36, 44], "aim": [306, 377, 387, 389, 391, 428], "aipc": 309, "airmeng": 299, "akarx23": 301, "akdlm": 332, "al": 432, "alapaca": 314, "alg": 400, "algo": 401, "algorithm": [25, 57, 95, 184, 281, 346, 347, 369, 370, 372, 376, 377, 390, 391, 394, 395, 399, 400, 406, 413, 419, 423, 427, 429], "algorithm_": 394, "alia": 394, "alibaba": [272, 420], "alibi": 429, "align": [266, 298, 346, 347, 350, 352, 372, 399, 401, 409, 420], "align_column": 266, "align_corn": 264, "align_head": 266, "align_img": 20, "align_row": 266, "align_supercel": 266, "all": [0, 1, 9, 24, 25, 32, 33, 36, 38, 44, 47, 52, 53, 54, 55, 57, 83, 95, 181, 184, 195, 246, 247, 256, 257, 259, 261, 264, 266, 268, 270, 272, 288, 298, 300, 301, 302, 303, 307, 314, 315, 316, 318, 322, 334, 335, 347, 349, 350, 351, 355, 359, 365, 366, 372, 376, 380, 386, 387, 388, 389, 391, 395, 397, 400, 401, 402, 403, 405, 408, 411, 416, 419, 420, 423, 429, 430], "all_choic": [267, 268, 351], "all_gath": 264, "alloc": [332, 396, 401, 402], "allow": [24, 35, 266, 289, 319, 322, 349, 357, 361, 369, 370, 372], "along": [25, 32, 247, 314, 349, 361, 373, 390, 404, 407, 409], "alpaca": [314, 349, 425], "alpaca_data": [314, 349, 422], "alpha": [111, 247, 260, 281, 309, 406, 413, 425, 428], "alpha_max": 247, "alpha_min": 247, "alpha_step": 247, "alreadi": [314, 315, 330, 332, 345, 355, 364, 372, 383, 384, 395, 421], "also": [24, 25, 57, 247, 256, 257, 266, 270, 300, 302, 309, 314, 319, 324, 336, 337, 345, 347, 349, 350, 351, 354, 358, 363, 364, 365, 366, 370, 372, 373, 376, 382, 383, 384, 387, 388, 389, 391, 392, 394, 395, 396, 399, 400, 401, 402, 405, 408, 409, 410, 417, 421, 423, 427, 428, 432], "alter": 377, "altern": [345, 370, 372, 373, 383, 384, 403, 409, 420], "although": 340, "alwai": [372, 373, 376, 395, 405, 414], "always_keep_cls_token": [36, 44], "amaz": [345, 383, 384], "amazon": [359, 397, 411], "ammbashankar": 301, "among": [361, 372, 405, 408], "amount": [25, 304, 402], "amx": [398, 405, 408, 413, 437], "amx_bf16": 421, "amx_bf16_params_t": 281, "amx_bf16_x16": 413, "amx_bf16bf16_inputs_t": 281, "amx_bf16f32_inputs_t": 281, "amx_inputs_t": 281, "amx_int8": 421, "amx_int8_params_t": 281, "amx_params_t": 281, "an": [0, 8, 24, 25, 33, 36, 44, 50, 57, 62, 243, 246, 258, 268, 270, 272, 298, 302, 304, 305, 306, 307, 309, 313, 314, 316, 317, 318, 319, 320, 323, 325, 335, 336, 338, 348, 349, 351, 354, 355, 358, 363, 365, 366, 369, 370, 372, 373, 376, 377, 378, 380, 387, 388, 389, 390, 391, 394, 395, 396, 399, 400, 401, 405, 406, 409, 414, 416, 418, 420, 421, 422, 428, 432, 439], "ana": 301, "anaconda": [323, 324, 326, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 357, 363, 368, 383, 384], "anaconda3": 361, "analys": 410, "analysi": [272, 302, 420], "analyz": [316, 389], "anatol": 377, "ani": [0, 17, 25, 32, 35, 36, 44, 57, 246, 247, 255, 256, 257, 264, 266, 270, 298, 309, 316, 335, 351, 363, 375, 377, 380, 387, 395, 418], "anim": 321, "annot": [350, 372], "annual": [316, 358, 372], "annual_report": [358, 372], "anomali": [319, 325], "anonym": 281, "anoth": [319, 321, 325, 387, 391, 396, 410], "answer": [36, 44, 298, 304, 316, 319, 324, 335, 338, 351, 361, 370, 372, 380, 430], "anyth": [266, 279, 377], "anywher": 419, "apach": [372, 415], "api": [4, 22, 32, 52, 53, 55, 256, 257, 260, 270, 272, 300, 302, 311, 315, 316, 319, 320, 334, 335, 336, 337, 338, 354, 356, 357, 358, 363, 369, 370, 372, 375, 380, 388, 390, 391, 392, 394, 395, 400, 401, 418, 420, 435], "api_kei": [309, 321], "api_open": 361, "apierrorcod": 22, "apolog": 377, "app": [6, 309, 345, 353, 361, 383, 384], "appear": [298, 373], "append": [0, 39, 40, 309, 321, 324, 338, 372, 406, 413, 414], "append_loop_len": 400, "append_messag": 0, "append_op": 389, "append_sum": 281, "append_vec": 400, "appl": 420, "appli": [5, 32, 57, 256, 257, 266, 298, 304, 306, 319, 346, 347, 354, 400, 401, 405, 406, 407, 409, 413, 419, 423, 432], "applic": [6, 309, 313, 316, 317, 318, 319, 321, 324, 325, 338, 340, 358, 361, 363, 367, 369, 370, 372, 420, 432], "apply_class_threshold": 266, "apply_lora": 347, "apply_postop_list": 401, "apply_postops_list": [280, 401], "apply_postops_list_": [280, 401], "apply_rotary_pos_emb": 32, "apply_threshold": 266, "appoint": 298, "approach": [260, 288, 303, 304, 306, 307, 311, 314, 349, 370, 372, 373, 375, 378, 401, 402, 420, 422, 429], "appropri": [298, 372, 376, 387, 400, 408], "approv": [270, 299, 300, 348, 349], "approx_ratio": 30, "approxim": [30, 408], "apr": 420, "april": [272, 420], "apt": [308, 309, 323, 324, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 357, 359, 363, 368, 369], "ar": [5, 9, 23, 24, 25, 29, 30, 32, 33, 36, 39, 40, 41, 44, 47, 57, 195, 247, 255, 256, 257, 258, 261, 263, 265, 266, 269, 270, 272, 298, 300, 302, 303, 307, 308, 309, 314, 315, 316, 319, 321, 324, 327, 328, 329, 330, 332, 335, 336, 346, 347, 352, 354, 355, 356, 357, 361, 363, 369, 371, 372, 373, 377, 378, 380, 384, 385, 386, 387, 388, 389, 390, 391, 392, 395, 396, 399, 400, 401, 402, 403, 404, 405, 406, 407, 408, 409, 413, 414, 415, 416, 417, 419, 420, 421, 422, 423, 426, 432], "arang": 73, "arangewithreciproc": 150, "arbitrari": [25, 260, 264, 355, 369, 387], "arc": [288, 346, 432], "arch": 323, "architectur": [9, 36, 44, 270, 302, 316, 323, 349, 350, 399, 406, 408, 432], "archiv": 369, "arcturus22": 301, "area": 266, "areg": 402, "arg": [24, 25, 32, 35, 39, 40, 41, 47, 61, 128, 246, 279, 307, 313, 314, 315, 316, 317, 318, 319, 324, 336, 338, 340, 347, 349, 358, 371, 372, 375, 389, 394, 421], "arg1": 1, "arg2": 1, "arg3": 1, "arg_t": 279, "argmax": [302, 306], "argument": [1, 2, 5, 24, 25, 32, 36, 44, 47, 246, 247, 303, 314, 355, 361, 371, 372, 376, 389, 414, 416, 417, 419, 422], "argumentpars": 1, "ariel": 301, "aris": 372, "arithmet": 400, "around": [263, 350, 361], "arrai": [20, 21, 24, 25, 62, 388], "arrondiss": 377, "art": [302, 372], "articl": [316, 349, 402, 423], "artifact": 35, "artifici": [272, 420], "arxiv": [36, 260, 420, 432], "aryaman": 301, "ashimin": 301, "ashish": [36, 44], "askdoc": 324, "aspect": [316, 372], "asr": [309, 319, 322, 334, 336, 340, 355, 375], "assembli": [293, 398, 402, 404, 409, 410, 436], "assert": [36, 83, 421], "asset": [311, 319, 375], "assign": [256, 257, 258, 314, 349, 405], "assign_reg": 401, "assist": [309, 319, 323, 324, 325, 350, 361, 377, 424], "assistant_model": 323, "associ": [266, 399], "assum": [266, 387, 395, 402], "assur": 319, "astep": 281, "astunpars": [323, 324, 331, 334, 336, 337, 338, 340, 343, 344, 357, 363, 368], "asub": 402, "asym": 421, "asymmetr": [421, 423], "aten": 111, "atom": 415, "atributt": 247, "ats": 432, "atsm": 361, "attach": [24, 349, 387, 395], "attack": 298, "attempt": [35, 266], "attenion": 44, "attent": [32, 36, 44, 57, 259, 260, 279, 298, 307, 367, 389, 395, 407, 429], "attention_desc": 279, "attention_io": 281, "attention_mask": [33, 36, 37, 39, 40, 41, 44, 396], "attention_mask_length_adaptive_keep_indic": 150, "attention_output": [36, 44], "attention_output_layer_norm_length_adaptive_keep_indic": 150, "attention_reshap": 150, "attention_sink": 38, "attentionblock_attentionmaskaddreshap": 150, "attentionblock_constantofshapewithmul": 150, "attentionblock_qkvprereshap": 150, "attentionblock_qkvreshap": 150, "attentionblock_weightreshapeto4d": 150, "attentionmasklengthadaptiveexpandindic": 138, "attentionoutputlayernormlengthadaptiveexpandindic": 139, "attentionreshap": 140, "attr": [57, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 74, 76, 77, 78, 79, 80, 81, 82, 84, 85, 86, 87, 88, 89, 90, 92, 93, 95, 96, 97, 98, 99, 100, 101, 102, 103, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 122, 123, 124, 125, 126, 127, 280, 387, 388, 400, 401], "attract": [338, 356, 358], "attribut": [50, 52, 53, 54, 57, 95, 247, 278, 279, 389, 391, 394, 401], "attrs_": [280, 401], "attrs_map": 394, "audio": [309, 311, 315, 319, 321, 322, 355, 369], "audio2text": 369, "audio_name_0": 355, "audio_name_1": 355, "audio_name_2": 355, "audio_path": 369, "audio_url": [341, 381], "audiolanguageopt": 5, "audiospeechrecognit": 369, "aug": [272, 420], "augment": [256, 257, 260, 270, 309, 319, 321, 324, 372, 376], "augmented_exampl": 376, "authent": 321, "author": [353, 359, 360, 374, 415], "authorized_kei": [330, 332], "auto": [9, 304, 314, 317, 318, 324, 334, 336, 338, 347, 349, 369, 375, 389, 394, 401, 428, 432], "auto_alpha_arg": 247, "auto_clip": 247, "auto_round": 427, "auto_scal": 247, "autoawq": 432, "autocast_init": 57, "autoconf": 337, "autoconfig": [45, 302, 306, 418, 428, 432], "autodistil": 270, "autoencoderkl": 9, "automat": [246, 289, 309, 316, 338, 346, 347, 352, 358, 372, 382, 389, 390, 391, 400, 413, 428], "automata": 373, "automativ": 246, "automodelforcausallm": [422, 428, 429, 432], "automodelforsequenceclassif": [302, 306], "autoregress": [354, 429], "autoround": 427, "autoroundconfig": [247, 432], "autotoken": [289, 302, 418, 428, 429, 432], "aux_loss": [256, 257], "aux_output": [256, 257], "auxiliari": [256, 257], "avail": [57, 274, 277, 282, 286, 302, 308, 309, 313, 316, 317, 318, 330, 332, 361, 366, 370, 371, 372, 388, 404, 423], "avatar": [341, 381], "avenu": 377, "averag": [23, 25, 36, 264, 289, 302, 346, 372, 376, 385, 389], "avg": 288, "avoid": [25, 36, 38, 44, 335, 354, 364, 372, 373, 380, 395, 399, 401, 405, 407, 408, 413], "avx": 410, "avx2": 421, "avx512": [398, 399, 403, 423], "avx512_data_t": 281, "avx512_fp32_params_t": 281, "avx512evex": 410, "avx512f": [398, 407, 413, 421, 437], "avx512f_p2031_p2013": 413, "aw": [397, 411], "awai": [264, 307], "awar": [272, 302, 309, 335, 380, 432], "awq": [288, 432], "awqconfig": [247, 432], "ax": [246, 387, 407], "axi": [302, 387, 389, 407, 408], "ayaan": 301, "azur": 269, "b": [21, 25, 36, 55, 57, 62, 268, 301, 330, 351, 387, 395, 399, 402, 403, 404, 408, 409, 413], "b1": 399, "b2": 399, "b4": 410, "b5a3f2c4": 319, "b_node_name_1": 395, "b_node_name_2": 395, "b_node_name_n": 395, "b_scale": 62, "ba": [269, 406, 410, 413], "baai": [14, 376], "back": [0, 361, 372, 405, 406, 407, 408], "backbon": [256, 257, 369], "backend": [28, 181, 246, 247, 252, 289, 302, 304, 314, 320, 323, 324, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 337, 338, 339, 340, 342, 343, 344, 345, 348, 349, 370, 383, 384, 386, 392, 423, 425, 426, 432], "backendopt": 5, "background": [345, 383, 384], "backpropag": [422, 432], "backup": 299, "backward": [24, 340, 423], "bad": 409, "badd_dim": 413, "baddbmm": 83, "badg": 435, "baeseong": 432, "balanc": [260, 307, 319, 320, 372, 373, 397, 411, 425, 428, 432], "ban": 298, "bandit": 269, "bandwidth": [406, 408, 423, 432], "bar": [17, 272, 420], "barrier": 402, "base": [2, 3, 4, 9, 14, 25, 35, 36, 40, 44, 45, 50, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 151, 152, 153, 154, 155, 156, 157, 159, 160, 161, 162, 163, 164, 165, 166, 167, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 182, 183, 185, 186, 187, 188, 189, 190, 191, 193, 194, 196, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 216, 217, 218, 221, 222, 223, 224, 225, 226, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 246, 266, 279, 302, 304, 305, 306, 307, 309, 312, 319, 321, 323, 325, 327, 328, 329, 331, 335, 336, 337, 338, 340, 341, 347, 357, 358, 361, 362, 367, 369, 370, 372, 376, 377, 379, 380, 381, 382, 392, 394, 397, 402, 404, 405, 406, 407, 408, 410, 411, 418, 420, 428, 429, 430, 432], "base64": 0, "base_finetuned_model": 348, "base_model": 319, "base_model_path": 315, "base_url": [309, 321, 382], "basefinetuningconfig": 4, "baselin": 350, "basemodeloutputwithpast": 41, "basemodeloutputwithpastandcrossattent": [36, 44], "basemodeloutputwithpoolingandcrossattent": [36, 44], "basetrain": 246, "bash": [313, 314, 315, 322, 323, 324, 326, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 349, 350, 357, 358, 360, 363, 368, 383, 384, 392, 393, 427], "bashrc": [323, 324, 326, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 357, 363, 368, 383, 384], "basi": 372, "basic": [21, 25, 44, 302, 308, 309, 335, 356, 358, 361, 372, 380, 394, 405], "basicmagnitud": [304, 306], "basketbal": 23, "bass": 184, "batch": [25, 27, 37, 39, 40, 246, 255, 256, 257, 258, 260, 264, 268, 281, 302, 367, 376, 388, 404, 405, 407, 414, 425, 426, 432], "batch_decod": [428, 432], "batch_first": 23, "batch_matmul": 83, "batch_matmul_v2": 83, "batch_num": 281, "batch_siz": [25, 27, 28, 32, 33, 36, 44, 246, 247, 256, 257, 258, 289, 302, 352, 388, 389, 393, 407, 413, 427], "batched_data": 27, "batched_gen": 352, "batched_valu": 27, "batchk": [281, 408], "batchmatmul": 66, "batchmatmulv2": 67, "batchnorm": [255, 395], "batchnorm2d": 255, "batchsiz": 397, "bbox": 266, "bbox1": 266, "bbox2": 266, "bd00040000": 410, "beam": [350, 396, 432], "bear": 419, "beat": 420, "beauti": [361, 418], "becaus": [57, 258, 347, 349, 387, 394, 400, 403, 408, 423], "becom": [302, 319, 370, 391, 396, 409, 432], "been": [24, 36, 44, 57, 266, 303, 309, 319, 349, 350, 352, 354, 357, 372, 377, 401, 405, 418, 421], "befor": [24, 25, 28, 36, 47, 57, 246, 255, 256, 257, 260, 289, 300, 303, 305, 313, 314, 316, 317, 318, 330, 332, 336, 338, 345, 349, 358, 361, 370, 371, 372, 376, 383, 384, 387, 390, 391, 392, 395, 400, 401, 402, 403, 405, 406, 408, 413, 423, 426, 432], "begin": [44, 47, 400, 401], "behav": [36, 44, 352], "behavior": [246, 266, 298, 303, 370, 372, 375, 399, 400, 405, 419, 423], "behaviour": 372, "behind": 361, "being": [25, 246, 372], "believ": [377, 418], "belong": [57, 387, 423], "below": [57, 266, 272, 300, 302, 303, 308, 309, 314, 316, 317, 318, 319, 330, 332, 345, 347, 348, 349, 351, 354, 358, 359, 363, 366, 371, 372, 377, 383, 384, 387, 388, 390, 392, 395, 399, 404, 406, 407, 408, 409, 417, 422, 428, 432], "bench_": 413, "benchmark": [28, 246, 248, 269, 270, 302, 390, 397, 398, 411, 414, 420, 425, 426, 432, 434], "benchmark_dir": 413, "benchmark_it": 413, "benchmark_no_refresh": 413, "benchmark_util": 413, "benchmarkconfig": [27, 28, 289], "benedikt": 301, "benefici": 319, "benefit": [0, 370, 405, 409, 423], "bert": [36, 302, 303, 304, 340, 369, 388, 389, 390, 393, 395, 396, 397, 400, 405, 406, 407, 408, 430], "bert_large_model_path": 395, "bert_large_squad": 57, "bert_model": 388, "bertattent": 36, "bertembed": [36, 44], "bertencod": 36, "bertformaskedlm": 36, "bertformultiplechoic": 36, "bertfornextsentencepredict": 36, "bertforpretrain": 36, "bertforpretrainingoutput": 36, "bertforquestionansw": 36, "bertforsequenceclassif": 36, "bertfortokenclassif": 36, "bertintermedi": 36, "bertlay": 36, "bertlmheadmodel": 36, "bertlmpredictionhead": 36, "bertmodel": 36, "bertonlymlmhead": 36, "bertonlynsphead": 36, "bertoutput": 36, "bertpool": 36, "bertpredictionheadtransform": 36, "bertpretrainedmodel": 36, "bertpretraininghead": 36, "bertselfattent": 36, "bertselfoutput": 36, "berttoken": 36, "besid": [266, 303, 376, 401, 432], "best": [246, 258, 298, 304, 335, 340, 354, 372, 378, 380], "best_model": 428, "bestla": 421, "beta": [281, 406, 413], "better": [25, 57, 147, 246, 247, 319, 346, 347, 352, 361, 365, 371, 376, 387, 388, 389, 390, 399, 405, 406, 407, 408, 412, 416, 417, 423, 432], "between": [24, 25, 36, 44, 52, 53, 256, 257, 258, 266, 281, 303, 336, 354, 369, 370, 372, 379, 406, 409, 413, 423], "beyond": [361, 420], "bf16": [28, 158, 246, 302, 304, 314, 315, 320, 332, 336, 337, 340, 346, 347, 348, 349, 352, 354, 374, 375, 376, 392, 398, 401, 403, 405, 408, 413, 421, 422, 425, 426], "bf16_exp": [401, 413], "bf16_exp_attr": 401, "bf16_gelu": 401, "bf16_gelu_attr": 401, "bfloat": 305, "bfloat16": [305, 331, 346, 369, 371, 422], "bfloat16_t": 281, "bge": [14, 372, 376, 420], "bhadresh": 304, "bhargav": 301, "bia": [57, 62, 260, 281, 389, 413, 421], "bia_t": 281, "bianryop": 400, "bianryop_attr_list": 400, "bias_add": [62, 83], "bias_nod": 62, "bias_to_int32": 62, "biasadd": [57, 68, 391, 395], "bibtex": 415, "big": [49, 314, 315, 361, 390, 391, 396], "bigcod": [309, 349], "bigscienc": 428, "bin": [49, 55, 308, 309, 313, 314, 315, 388, 389, 390, 392, 410, 412], "binari": [25, 256, 257, 260, 308, 336, 401, 408, 413, 437], "binary_add": 400, "binary_injector": 400, "binary_injector_init": 400, "binaryadd": [73, 400], "binaryop": 400, "binaryop_addr": 400, "binaryop_alg": 400, "binaryop_attr": [280, 281, 400], "binaryop_injector": 400, "binaryop_list": [280, 400], "binaryop_list_": [280, 400], "bincount": 25, "bind": [314, 349, 388], "bio": [397, 411, 425, 426], "bit": [25, 247, 305, 306, 319, 348, 399, 400, 406, 409, 420, 421, 422, 423, 432], "bitsandbyt": [320, 432], "bitsandbytesconfig": 432, "blank": 413, "blip": 350, "blob": [22, 319], "block": [17, 25, 36, 44, 47, 304, 307, 396, 399, 402, 403, 404, 405, 406, 408, 409, 419, 432], "blocks_per_group": 281, "blocksiz": [247, 281, 421], "blockwise_over_matmul_gemm_conv": 47, "blog": [272, 302, 349, 420], "bloom": [272, 302, 363, 428], "bloom_1b7": 425, "bloom_7b1": 425, "bloomz": 428, "blue": [21, 36], "bm": 281, "bm25": 372, "bn": 281, "bnb_4bit_quant_typ": 432, "bo": 415, "boast": [309, 432], "bodi": [298, 316, 361, 407], "bolder": 407, "bond": 361, "bool": [0, 4, 9, 17, 23, 28, 33, 35, 36, 37, 40, 41, 44, 246, 247, 250, 251, 255, 264, 278, 279, 280, 281, 289, 303, 371, 372, 387, 400, 401, 416, 417, 421], "boolean": 4, "boolq": 288, "boost": [309, 357, 372, 407, 420], "boost_inc_dir": 388, "border": 407, "bordoloi": 301, "bori": 377, "bot": [341, 381], "both": [23, 36, 44, 265, 298, 306, 309, 319, 321, 330, 332, 333, 335, 339, 341, 342, 350, 354, 357, 369, 372, 373, 379, 380, 381, 382, 384, 405, 407, 412, 413, 414, 416, 423, 432], "bottleneck": [17, 404, 406, 432], "bottom": [266, 382], "bound": [29, 256, 257, 263, 266], "boundari": 266, "box": [256, 257, 258, 263, 266, 335, 380, 382, 398], "box_numpy_nul": 25, "boxes1": 263, "boxes2": 263, "brain": [419, 432], "branch": [35, 314, 315, 413], "brand": [24, 302, 415], "breadth": 372, "break": [353, 428], "breg": 402, "brief": [372, 410, 420], "bring": [25, 376, 390, 404, 408, 409, 420, 429], "broadcast": [32, 39, 40, 400, 404, 409, 410, 413], "broaden": 361, "broader": 432, "brought": [361, 406, 423], "brown": 340, "brows": 400, "browser": [322, 361], "bs0": 413, "bs1": 413, "bsc": 404, "bsc_data_t": 281, "bsmock": 359, "bsr": 404, "bstep": 281, "bsub": 402, "bsz": 37, "bubbl": 409, "budget": 306, "buffer": [247, 405, 406, 408], "buffers": 25, "bug": [300, 302], "build": [4, 27, 270, 302, 308, 309, 313, 316, 317, 318, 319, 320, 330, 336, 337, 338, 347, 366, 370, 378, 386, 399, 405, 406, 410, 417, 420, 426, 432], "build_chatbot": [4, 319, 356, 358, 372], "build_ext": 330, "build_with_cpu": 432, "built": [25, 361, 377, 417, 421, 428], "builtin_eval_func": 246, "builtin_train_func": 246, "bundl": [25, 361], "burden": 370, "busi": [314, 349, 376], "bxkxm": 399, "bxm": 399, "byeongwook": 432, "byte": [403, 409], "bytes_or_buff": 247, "c": [25, 38, 57, 245, 266, 268, 282, 288, 302, 308, 314, 319, 323, 324, 327, 328, 329, 331, 332, 334, 336, 337, 338, 340, 343, 344, 349, 351, 354, 357, 358, 359, 361, 363, 368, 385, 386, 387, 388, 390, 395, 397, 402, 404, 411, 413, 426, 427], "c0": 410, "c1": 57, "c2": 57, "c3": [57, 410], "c5f877": 410, "c6i": [397, 411], "c7": 410, "c_node_name_1": 395, "c_node_name_2": 395, "c_node_name_n": 395, "cach": [25, 35, 40, 41, 279, 307, 309, 320, 330, 340, 342, 344, 347, 364, 375, 394, 402, 405, 406, 407, 413, 421, 427, 429], "cache_config": 375, "cache_dir": 35, "cache_load_en": 25, "cache_plugin": 370, "cache_util": [40, 41], "cachefil": 25, "cacheplugin": 370, "cai": 432, "calc": 385, "calc_flop": 413, "calcul": [30, 47, 52, 53, 268, 349, 389, 395, 399, 401, 402, 405, 406, 409, 413, 423, 428], "calculate_ins_level_acc": 268, "calculate_scale_on_tmp_buf": 405, "calib": 428, "calib_dataload": [246, 247, 428], "calib_dataset": 247, "calib_func": [247, 428], "calib_it": [247, 432], "calib_len": [247, 432], "calib_pad": 247, "calib_pad_v": 247, "calib_shuffl": 247, "calibr": [246, 288, 423, 428], "calibrate_method": 246, "call": [9, 24, 25, 32, 57, 147, 177, 256, 257, 305, 319, 352, 354, 363, 370, 387, 390, 391, 396, 399, 400, 401, 408, 409, 420, 423, 432], "callabl": [25, 246], "calle": 24, "caller": 24, "can": [0, 9, 24, 25, 32, 33, 35, 36, 39, 40, 44, 47, 49, 57, 158, 181, 246, 247, 264, 266, 270, 302, 303, 305, 307, 309, 313, 314, 315, 316, 317, 318, 319, 321, 322, 323, 324, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 335, 336, 337, 338, 339, 340, 342, 343, 344, 345, 346, 347, 348, 349, 350, 351, 352, 354, 355, 356, 358, 361, 363, 364, 365, 366, 369, 371, 372, 373, 375, 376, 377, 380, 382, 383, 384, 385, 387, 388, 390, 391, 392, 393, 394, 395, 396, 399, 400, 401, 402, 403, 404, 405, 406, 407, 408, 410, 413, 417, 418, 419, 420, 421, 423, 425, 426, 428, 432], "candid": [372, 376], "candidate_context": 376, "cannot": [24, 25, 44, 281, 319, 324, 330, 332, 338, 373, 399, 405, 409, 414], "cao": 301, "cap": [314, 315, 347, 366], "capabl": [289, 309, 319, 325, 342, 357, 361, 369, 372, 373, 406, 409], "capac": 372, "caption": [267, 348, 350], "captur": 316, "carbon": 420, "card": [9, 320, 326, 346, 347, 352], "cardin": [256, 257], "carefulli": [309, 357, 373], "cascad": 302, "case": [32, 36, 44, 258, 268, 303, 304, 314, 316, 322, 348, 349, 351, 372, 376, 389, 390, 396, 399, 401, 402, 403, 413, 414], "cast": 83, "cast_to": 150, "castto": 141, "casual": [57, 372], "cat": [314, 330, 332, 349], "catalog": 302, "catch": 24, "categori": [351, 371, 387, 389], "category_nam": 351, "cater": [361, 369, 372], "caus": [24, 266], "causal": 44, "causal_lm": 422, "causallmoutputwithcrossattent": [33, 36, 44], "cbatchstep": 281, "cc": [350, 366, 388], "ccl": [314, 330, 348, 349], "ccl_torch2": 330, "ccl_worker_count": [314, 349], "ccontain": 57, "cd": [302, 308, 309, 314, 315, 322, 327, 328, 329, 330, 331, 332, 334, 335, 341, 347, 354, 361, 362, 364, 365, 366, 374, 379, 380, 381, 382, 386, 387, 388, 393, 398, 410, 413, 426, 427, 432], "ce": [303, 319, 420], "ceil": 25, "celebr": 361, "cell": [266, 322, 407, 409], "center": [25, 271, 309, 323, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 336, 338, 339, 340, 342, 343, 344, 361, 363], "center_i": [256, 257], "center_x": [256, 257], "cento": [302, 369, 425, 426], "central": 361, "centric": 309, "certain": [266, 319, 371, 372, 377, 387, 395, 432], "cffi": [323, 324, 331, 334, 336, 337, 338, 340, 343, 344, 357, 363, 368], "cfg": 281, "chain": [57, 309, 372, 400, 401], "challeng": [338, 358, 361, 372], "champ": 377, "chan": 25, "chang": [0, 32, 300, 309, 349, 355, 366, 369, 372, 377, 387, 399, 400, 409, 414, 415], "change_node_input_tensor": 55, "change_node_output_tensor": 55, "change_num_nam": 62, "changeabl": 400, "channel": [17, 25, 37, 361, 400, 404, 409, 413, 419, 428], "channel_num": 413, "chapter": 320, "charact": [15, 341, 381], "characterist": [298, 314, 349], "chart": 405, "chat": [0, 4, 5, 6, 7, 8, 10, 309, 313, 314, 315, 317, 318, 319, 321, 324, 331, 332, 334, 335, 338, 340, 341, 343, 344, 345, 346, 349, 352, 356, 358, 363, 364, 366, 367, 369, 371, 372, 377, 378, 379, 380, 381, 382, 383, 384, 420, 427, 428, 429, 432], "chat_a100_url": 379, "chat_gaudi2_url": 379, "chatbat": 361, "chatbot": [0, 8, 272, 302, 309, 312, 314, 316, 320, 324, 333, 338, 340, 341, 342, 343, 344, 345, 349, 357, 363, 370, 373, 375, 378, 381, 383, 384, 420], "chatbot_finetun": 347, "chatbot_serv": 361, "chatcmpl": 361, "chatglm2": 309, "chatglm3": 309, "chatgpt": [335, 338, 352, 358, 380], "chatqna": [309, 316, 382], "check": [9, 11, 15, 28, 39, 40, 57, 62, 158, 246, 268, 269, 295, 300, 302, 319, 330, 331, 332, 337, 340, 351, 353, 355, 361, 364, 366, 372, 373, 387, 390, 391, 395, 401, 421, 427, 438], "check_is_numb": 268, "check_result_": 413, "check_torch_compat": 421, "check_valu": 28, "checker": [9, 247, 309, 373], "checkout": [332, 372], "checkpoint": [36, 246, 348, 349, 360, 361, 369], "chen": 301, "cheng": 432, "chi": 359, "chian": 401, "child": [2, 372], "child_docu": [309, 372], "child_document_stor": 14, "child_par": 372, "childparentretriev": [2, 309, 357], "chines": [353, 359, 369], "chitchat": 372, "chmod": [330, 332], "choic": [36, 44, 267, 268, 304, 309, 321, 351, 361, 379, 396, 430], "choos": [313, 316, 335, 349, 369, 371, 372, 378, 380, 384, 395, 426], "chosen": [346, 347, 352, 372], "chroma": [14, 309, 357], "chrome": 389, "chuck": 372, "chunk": [36, 372], "ci": [315, 414], "circumst": 298, "citi": 377, "cjangcjengh": 353, "cl": [95, 184, 309, 361], "claim": [302, 415], "clamp": [36, 44], "clangformat": 269, "clarifi": [12, 298], "class": [34, 45, 62, 243, 261, 266, 281, 282, 289, 303, 361, 369, 373, 387, 394, 400, 401, 432], "class_error": 265, "class_filt": 25, "class_map": 266, "class_nam": 266, "class_subset": 25, "class_threshold": 266, "classif": [9, 33, 36, 44, 256, 257, 258, 260, 272, 302, 303, 304, 393, 418, 430], "classifi": [266, 371], "classmethod": 35, "claud": 352, "cleaner": 262, "cleaner_nam": 262, "clear": [316, 335, 380, 382, 408], "cli": 311, "cli_command": 311, "click": [322, 327, 328, 329, 382, 410], "client": [319, 323, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 336, 338, 339, 340, 342, 343, 344, 363], "clip": [9, 350], "clipfeatureextractor": 9, "cliptextmodel": 9, "cliptoken": 9, "clk_local_mem_f": 402, "clm": 304, "cloc": 269, "clone": [24, 302, 308, 322, 327, 328, 329, 330, 331, 332, 334, 335, 338, 340, 341, 345, 347, 348, 349, 354, 359, 361, 362, 363, 366, 369, 379, 380, 381, 382, 383, 384, 386, 387, 432], "cloud": [269, 319, 325, 361, 385], "cluster": [314, 349], "clx": 408, "cmake": [323, 324, 331, 334, 336, 337, 338, 340, 343, 344, 357, 363, 368, 386, 388, 398, 410, 413], "cmake_thread_libs_init": 388, "cmakelist": 388, "cn": 353, "cnn": [304, 349], "cnn_dailymail": 304, "co": [9, 25, 32, 35, 83, 127, 302, 334, 338, 345, 348, 349, 351, 359, 363, 383, 384], "coco": [256, 257, 260, 350], "code": [7, 20, 22, 25, 265, 270, 278, 279, 280, 281, 302, 306, 309, 321, 325, 336, 337, 338, 345, 347, 349, 350, 353, 356, 370, 371, 372, 377, 383, 384, 385, 387, 390, 401, 402, 403, 404, 405, 410, 413, 415, 426, 432], "code_chat": 309, "code_gen": 313, "code_gener": [309, 312, 313], "codealpaca": 349, "codegen": [309, 312, 313, 321, 323], "codegen2": 309, "codegen25": 350, "codellama": [309, 326, 327, 328, 329, 330, 332], "codellama_peft_finetuned_model": 349, "codenam": 272, "coeffici": 413, "coher": [272, 372, 406, 420], "col": [402, 403, 406, 408], "col_num": 281, "cola": 304, "colidx": 281, "collabor": [300, 420], "collect": [0, 18, 28, 266, 289, 372, 387, 389, 423], "collect_quant_info": 150, "collectquantinfo": 142, "color": [21, 265, 407, 409], "colsb": 403, "column": [265, 266, 348, 399, 404, 406, 409], "com": [22, 38, 43, 262, 302, 308, 314, 315, 319, 321, 322, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 335, 336, 337, 338, 340, 343, 344, 345, 347, 348, 349, 352, 354, 357, 361, 362, 363, 366, 368, 369, 380, 383, 384, 386, 388, 397, 411, 415, 420, 424, 425, 426, 432], "combin": [9, 246, 319, 325, 354, 369, 372, 378, 390, 395, 400, 401], "combinatori": 307, "combinedstat": 25, "come": [260, 361, 372, 387], "command": [1, 308, 309, 313, 314, 315, 316, 317, 318, 319, 321, 323, 324, 325, 326, 327, 328, 329, 330, 331, 332, 334, 335, 336, 337, 338, 340, 341, 343, 344, 345, 348, 349, 351, 354, 355, 358, 359, 363, 364, 366, 369, 376, 377, 379, 380, 381, 382, 383, 384, 385, 387, 388, 392, 414, 426], "comment": [298, 319, 325, 432], "commit": [35, 269, 298, 373, 414, 420], "common": [25, 298, 302, 319, 372, 423, 428], "commonli": [370, 372, 373], "commun": [0, 272, 298, 330, 332, 345, 378, 383, 384, 420], "compact": [303, 354], "compar": [266, 304, 319, 342, 347, 361, 370, 372, 378, 379, 399, 402, 413, 423, 432], "comparison": [288, 378, 389, 409], "compassion": 361, "compat": [300, 316, 319, 372, 421, 432], "competitor": 420, "compil": [245, 274, 306, 322, 324, 327, 328, 329, 338, 386, 387, 388, 390, 391, 393, 395, 396, 432, 439], "compiler_vers": 426, "complaint": 298, "complet": [0, 25, 57, 309, 316, 318, 321, 324, 346, 347, 349, 352, 354, 361, 367, 387, 397, 402, 405, 408, 411, 425, 426], "completion_token": 361, "complex": [369, 370], "compli": [335, 380], "complianc": 300, "complic": [57, 387, 395], "compon": [25, 269, 270, 299, 300, 319, 333, 339, 342, 369, 371, 378, 400, 415], "compos": [52, 53, 54, 369, 387, 392, 408], "comprehens": [270, 309, 319, 357, 372, 378], "compress": [272, 303, 306, 399, 403, 405, 409, 412, 423, 428, 432], "compression_manag": 246, "compressionmanag": 246, "compressor": [35, 272, 289, 302, 319, 366, 417, 419, 423, 425, 428, 432], "compris": 32, "comput": [23, 24, 25, 33, 36, 44, 49, 57, 243, 246, 256, 257, 258, 260, 263, 264, 266, 293, 302, 306, 312, 320, 361, 371, 372, 395, 398, 399, 400, 401, 402, 405, 407, 408, 412, 418, 421, 423, 425, 429, 432, 436], "comput_vector": 400, "computation": [346, 347], "compute_dtyp": [247, 319, 327, 328, 329, 371, 375, 429, 432], "compute_loss": 246, "compute_metr": 302, "compute_perform": 302, "compute_typ": 421, "compute_vector": 400, "concat": [83, 387], "concaten": [25, 314, 349, 403, 409, 413, 425], "concentr": [314, 349, 372], "concept": [372, 407, 409], "conceptu": 407, "concern": [338, 358, 372, 373], "concis": [316, 372, 395], "conclud": 372, "conclus": 409, "concurr": [354, 379], "concurrency_count": 361, "conda": [308, 354, 358, 360, 361, 362, 374, 393, 426], "conda_prefix": [354, 426, 427], "condens": 316, "condit": [9, 266, 390, 391, 415], "conduct": [270, 314, 347, 349, 376], "conf": [55, 319, 388, 389, 390, 394, 412], "conf_dict": 57, "confid": 266, "confidenti": 298, "config": [4, 27, 29, 32, 33, 36, 44, 45, 47, 49, 55, 57, 129, 130, 131, 132, 133, 134, 135, 136, 137, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 151, 152, 153, 154, 155, 156, 157, 159, 160, 161, 162, 163, 164, 165, 166, 167, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 182, 183, 185, 186, 187, 188, 189, 190, 191, 193, 194, 196, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 216, 217, 218, 221, 222, 223, 224, 225, 226, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 246, 249, 286, 289, 302, 303, 304, 306, 309, 319, 330, 332, 345, 348, 349, 351, 354, 361, 364, 365, 366, 375, 377, 383, 384, 388, 389, 391, 394, 400, 413, 418, 419, 423, 428], "config_dir": 375, "config_fil": [309, 360, 367, 375], "config_file_path": 47, "config_hpu": 366, "config_list": 246, "config_nam": 354, "config_path": 351, "configur": [4, 6, 28, 29, 35, 36, 44, 47, 49, 138, 246, 247, 281, 304, 305, 309, 314, 317, 318, 337, 348, 349, 357, 361, 365, 366, 368, 370, 371, 372, 375, 390, 399, 400, 404, 424, 425, 426, 432], "configuration_llama": 32, "configure_log": 6, "conflict": [266, 281], "confus": 377, "conjunct": 416, "conll03": 304, "conll2003": 304, "connect": [57, 330, 332, 350, 419, 420], "connector": 350, "consecut": [57, 403, 409], "consequ": 429, "conserv": 377, "consid": [9, 25, 298, 319, 377, 390, 399, 401, 403, 414], "consider": 373, "consist": [49, 256, 257, 266, 300, 319, 350, 354, 370, 372], "consol": [355, 361], "const": [57, 62, 243, 278, 279, 280, 281, 394, 398, 400, 401, 402, 403], "const_cast": 394, "const_rat": 28, "constant": [7, 73, 192, 246], "constantofshap": 73, "constexpr": 281, "constitut": 350, "constraint": [28, 30, 288], "construct": [0, 36, 57, 95, 267, 298, 314, 338, 347, 349, 371, 372, 400, 401], "construct_default_prompt": 371, "construct_nod": 57, "constructor": [361, 394], "consult": 270, "consum": [396, 403], "consumpt": [361, 385], "contact": [269, 298, 300, 324, 338, 424], "contain": [20, 21, 23, 24, 25, 32, 36, 44, 47, 49, 57, 62, 243, 244, 246, 247, 256, 257, 258, 265, 266, 267, 270, 293, 303, 313, 316, 319, 347, 348, 349, 350, 351, 355, 361, 364, 365, 372, 373, 375, 387, 388, 390, 391, 395, 398, 400, 412, 413, 414, 419, 423, 436], "container_object": 266, "content": [309, 313, 314, 316, 317, 318, 319, 321, 324, 336, 338, 340, 358, 361, 363, 367, 370, 372, 382, 389, 432], "context": [23, 24, 25, 316, 319, 335, 372, 373, 376, 380, 382], "context_dim": 260, "context_templ": 23, "contextu": 372, "contigu": 407, "conting": 404, "continu": [0, 36, 354, 361, 367, 399, 402, 406, 407], "contradict": 354, "contrast": 315, "contribut": [0, 298, 299, 378], "contributor": 301, "control": [24, 27, 40, 57, 369, 376, 387], "conv": [28, 83, 389, 390, 401], "conv_reshap": 150, "conveni": [25, 349, 356, 372], "convent": [24, 25, 278, 279, 280, 281], "convers": [300, 319, 335, 340, 341, 349, 353, 369, 372, 378, 379, 380, 381, 382, 423, 428, 432], "convert": [0, 15, 25, 27, 49, 52, 53, 57, 62, 243, 256, 257, 260, 262, 266, 305, 340, 369, 370, 374, 393, 396, 408, 413, 423, 428], "convert_fullwidth_to_halfwidth": 15, "convert_image_to_base64": 0, "convex": 266, "convex_hul": 30, "convolut": [17, 73, 260, 303, 390], "convreshap": 143, "cooper": [302, 405], "coordin": [256, 257, 258], "copi": [0, 24, 255, 261, 264, 300, 330, 332, 345, 379, 383, 384, 391, 407], "copilot": [309, 319, 325, 420], "copyright": [266, 269, 415], "core": [28, 269, 289, 308, 310, 331, 371, 388, 397, 399, 405, 406, 411, 414, 415, 420, 425, 426], "cores_per_inst": [28, 246, 289], "corner": 322, "corpor": [266, 415], "corpu": [324, 372], "correct": [247, 298, 314, 372, 391, 395, 407, 426, 427], "correct_answ": 319, "correctli": [32, 316, 340, 387], "correl": 25, "correspond": [49, 52, 53, 57, 247, 258, 260, 262, 330, 335, 341, 346, 347, 352, 355, 372, 373, 379, 380, 381, 382, 384, 387, 391, 395, 398, 405, 409, 412, 423, 426], "correspondingli": 402, "cosin": [32, 346, 347], "cosmo": 353, "cost": [258, 314, 315, 342, 361, 409, 420], "cost_bbox": 258, "cost_class": 258, "cost_giou": 258, "costom": 371, "could": [9, 25, 47, 57, 279, 298, 314, 315, 336, 337, 338, 348, 349, 352, 358, 361, 363, 371, 372, 375, 385, 387, 388, 389, 391, 392, 395, 403, 412, 413, 419, 423, 428, 432], "count": [25, 314, 349, 365, 366, 389, 394, 396], "countri": 385, "courag": 361, "cours": [363, 389, 395, 402], "covari": 25, "cover": [269, 270], "coverag": 269, "cozi": 361, "cp": [364, 365, 366, 388], "cpp": [269, 270, 413, 420, 432], "cpplint": 269, "cpu": [9, 25, 28, 272, 280, 289, 302, 306, 308, 309, 310, 312, 313, 314, 315, 316, 317, 320, 323, 327, 328, 329, 330, 331, 332, 336, 340, 343, 344, 348, 349, 354, 360, 363, 367, 369, 374, 375, 376, 385, 388, 394, 397, 399, 401, 410, 411, 418, 420, 421, 425, 427], "cpu_": 25, "cpu_engine_t": 278, "cpu_inst": 278, "cpython": 386, "craft": [309, 314, 349, 357, 372], "crash": 25, "creat": [21, 24, 25, 62, 243, 247, 272, 298, 309, 316, 319, 320, 321, 323, 327, 328, 329, 330, 331, 332, 336, 337, 338, 340, 348, 349, 351, 356, 358, 361, 362, 372, 387, 393, 394, 404, 413, 416, 420, 421], "create_embed": 340, "create_kernel": 278, "create_memory_storag": 278, "create_position_ids_from_input_id": 44, "create_position_ids_from_inputs_emb": 44, "create_proxy_object": 279, "create_stream": 278, "create_tf_nod": 243, "creation": [309, 319], "creativ": 378, "criteria": [269, 417], "criterion": [250, 256, 257, 306, 416, 423], "criterion_reduce_typ": 28, "critic": [298, 307], "crop": 350, "cross": [33, 36, 44], "crosscovari": 25, "crossiou": 25, "crossov": [28, 30], "crossover_s": 28, "crucial": [372, 429], "cstep": 281, "csv": [266, 319, 372], "ctrl": [304, 361], "ctx": 279, "ctx_size": 429, "cuda": [25, 309, 318, 374, 375, 376, 422], "cuda_": 25, "cuda_visible_devic": 351, "cudatoolkit": 340, "cultur": 361, "cumsum": 73, "curat": 309, "curios": 361, "curl": [313, 316, 317, 318, 324, 340, 361, 363, 364, 365, 366, 367], "current": [47, 314, 332, 335, 340, 341, 349, 350, 353, 364, 365, 366, 367, 369, 372, 377, 379, 380, 381, 382, 389, 400, 401, 402, 404, 405, 407, 412, 413, 421, 422, 432], "current_working_directori": 322, "custom": [57, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 151, 152, 153, 154, 155, 156, 157, 159, 160, 161, 162, 163, 164, 165, 166, 167, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 182, 183, 185, 186, 187, 188, 189, 190, 191, 193, 194, 196, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 246, 272, 302, 316, 319, 320, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 335, 336, 338, 341, 343, 344, 348, 355, 361, 363, 366, 369, 371, 372, 373, 375, 378, 380, 381, 382, 389, 391, 420, 428, 439], "custom_port": 349, "customiz": [266, 309, 420], "cut": 369, "cute": [36, 44], "cutoff": 266, "cwd": [314, 349], "cxx11": 361, "cybersecur": 420, "cycl": [396, 410], "d": [25, 57, 268, 303, 309, 313, 316, 317, 318, 321, 324, 331, 340, 351, 361, 363, 364, 365, 367, 399, 407], "d0": 413, "d0xd1x": 413, "d1": [57, 413], "d12c0123": 319, "d2": 57, "d3": [57, 410], "d37": 304, "d9": 410, "d_conf": [303, 306], "da": 301, "daemon": [314, 315], "dag": 396, "dai": 361, "daili": [349, 372], "dailymail": 349, "dalvishruti14": 301, "damag": 419, "damp_perc": 247, "daniel": 301, "dash": 265, "data": [5, 25, 27, 38, 57, 121, 247, 256, 257, 260, 264, 266, 267, 270, 281, 288, 302, 304, 309, 314, 323, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 336, 338, 339, 340, 342, 343, 344, 346, 347, 348, 349, 354, 361, 363, 369, 372, 377, 387, 388, 390, 392, 393, 394, 396, 399, 400, 401, 402, 405, 406, 409, 413, 421, 422, 423, 425, 426, 438], "data0": 398, "data0_desc": 401, "data1": 398, "data1_desc": 401, "data2": 398, "data3": 398, "data4": 398, "data_dir": 393, "data_handle_": 279, "data_param": 400, "data_ptr": 394, "data_sourc": 25, "data_typ": [280, 281, 394, 400, 401, 413], "dataargu": 5, "databas": [14, 338, 372], "databrick": 428, "dataclass": [323, 324, 331, 334, 336, 337, 338, 340, 343, 344, 357, 363, 368], "datalearn": 420, "dataload": [25, 27, 246, 289, 302], "dataloader_drop_last": 376, "dataset": [25, 246, 247, 270, 289, 304, 315, 347, 350, 351, 355, 361, 366, 369, 372, 416, 423, 425, 426, 428, 432], "dataset_concaten": [314, 349, 352, 354], "dataset_config_nam": 354, "dataset_fil": 314, "dataset_nam": [289, 346, 347, 348, 349, 352, 354, 422], "datatyp": [28, 304, 305, 425, 426, 432], "dataxf": 410, "date": [347, 361, 372], "davinci": [314, 349], "day2": 420, "dcmake_vtune_hom": 410, "dco": 269, "ddatabas": 358, "ddimschedul": 9, "ddp": [314, 349], "ddp_backend": [314, 349], "ddp_find_unused_paramet": 352, "ddr4": 361, "ddr5": [425, 426], "de": [57, 369, 377], "deal": [32, 243, 389], "deb": 369, "debug": [61, 388, 396], "dec": [272, 302, 420], "decapoda": 422, "deci": 309, "decid": [361, 372, 404, 405], "decilm": 309, "decis": 372, "decod": [9, 36, 44, 247, 256, 257, 261, 354, 396, 410, 429], "decoder_attn_reshap": 150, "decoder_input_id": [36, 44], "decoderattnreshap": 144, "decompos": [57, 354, 387], "decor": [52, 53, 54, 62, 95, 184, 243, 244], "decreas": [370, 371], "deem": 298, "deep": [17, 272, 302, 317, 363, 392, 401, 417, 420, 423], "deep3dfacerecon_pytorch": [16, 17, 19, 20, 21], "deeper": [319, 325], "deeplearn": 399, "deepspe": [330, 347, 349, 350, 361], "deepspeed_hpu_zero3_sync_mark_step_requir": 349, "def": [28, 289, 302, 313, 387, 428], "default": [6, 14, 24, 25, 27, 28, 32, 35, 36, 44, 45, 246, 247, 260, 265, 266, 289, 302, 303, 309, 319, 334, 336, 340, 345, 347, 349, 355, 361, 364, 366, 369, 371, 372, 375, 376, 383, 384, 385, 387, 390, 396, 397, 401, 405, 409, 410, 411, 413, 416, 417, 419, 423], "defaultli": 373, "defin": [5, 16, 17, 47, 57, 246, 247, 270, 278, 298, 302, 303, 314, 349, 369, 371, 372, 387, 388, 394, 395, 409, 414, 416, 419], "definit": [55, 121, 128, 260, 288, 401, 428, 432], "defog": 309, "degrad": 423, "degre": [372, 404], "del_environ_var": 57, "del_keys_to_ignor": 44, "delet": [35, 57, 419], "delimit": 413, "deliv": 372, "delv": [387, 430], "demand": [361, 371, 432], "demeanor": 270, "demo": [319, 331, 332, 334, 338, 361, 378, 420], "demonstr": [302, 304, 306, 319, 323, 326, 327, 328, 329, 330, 331, 332, 348, 349, 350, 354, 356, 358, 368, 370, 377, 407, 409, 426], "demystifi": 420, "denois": 9, "denomin": 387, "dens": [14, 260, 281, 372, 375, 389, 390, 395, 398, 409, 413], "dense_x_spars": 281, "densiti": 409, "dep": 426, "depend": [29, 38, 256, 257, 300, 309, 335, 341, 356, 358, 372, 379, 380, 381, 382, 386], "depict": 406, "deploi": [269, 303, 319, 320, 323, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 336, 337, 338, 339, 340, 342, 343, 344, 358, 363, 364, 365, 366, 420, 432, 439], "deploy": [302, 306, 309, 319, 325, 372, 390, 393, 420], "deprec": [32, 36, 44, 246, 361], "depth": 24, "dequ": 264, "dequant": [73, 400, 401, 405, 413], "dequantize_tile_on_tmp_buf": 405, "dequantizelinear": [73, 392], "derefer": 25, "deriv": [5, 279, 346, 347, 406, 407], "derogatori": 298, "desc": [394, 400, 401], "desc_act": 247, "descend": [24, 25], "descent": [24, 432], "describ": [36, 44, 270, 280, 402, 404, 407, 413, 416, 417], "descript": [270, 303, 309, 319, 320, 371, 372, 389, 409, 416, 417, 419], "descriptor": [280, 394, 413], "design": [305, 316, 319, 327, 328, 329, 340, 342, 354, 361, 369, 371, 372, 373, 378, 387, 400, 401, 412, 429, 432], "desir": [25, 314, 349, 369, 370, 377, 389], "despit": 372, "dest": [52, 53, 54, 57, 62, 243], "dest_op": 121, "dest_op_nam": 243, "destin": [399, 404, 407, 413], "destructor": 394, "detach": 24, "detail": [9, 23, 25, 50, 52, 53, 57, 269, 270, 293, 295, 298, 300, 302, 303, 304, 306, 307, 309, 312, 317, 319, 320, 345, 346, 347, 349, 351, 352, 361, 365, 372, 378, 383, 384, 385, 386, 387, 389, 390, 391, 394, 395, 397, 398, 403, 410, 411, 413, 419, 420, 421, 423, 429, 432, 436, 438], "detect": [57, 256, 257, 260, 266, 269, 300, 369, 373, 420], "detection_model_path": 359, "determin": [266, 298, 370, 372, 390, 410], "determinist": [25, 373], "detr": [257, 261], "detrmulti": 257, "dev": [308, 314, 315, 332, 335, 341, 379, 380, 381, 382], "dev0": [426, 427], "devel": 308, "develop": [302, 309, 312, 313, 314, 316, 319, 320, 348, 349, 369, 372, 373, 394, 420, 422, 432], "devic": [9, 25, 30, 303, 309, 313, 316, 317, 318, 319, 323, 324, 325, 326, 327, 328, 329, 330, 331, 332, 334, 336, 338, 340, 343, 344, 347, 349, 352, 363, 365, 369, 374, 375, 385, 388, 390, 418, 422], "device_map": 432, "deviceopt": 5, "df": [57, 395], "diagon": 25, "dialog": 382, "dialogu": [341, 381, 382], "dic": 25, "dice": [256, 257, 260], "dice_loss": 260, "dict": [2, 9, 25, 36, 44, 45, 47, 57, 62, 243, 244, 246, 247, 256, 257, 258, 260, 264, 266, 267, 268, 351, 372, 373, 376, 387, 388, 419, 428, 432], "dict_path": 373, "dictionari": [25, 246, 247, 256, 257, 264, 267, 372, 373, 428], "did": 361, "didn": [21, 372], "differ": [25, 29, 39, 40, 41, 42, 49, 55, 60, 129, 130, 131, 132, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 151, 152, 153, 154, 155, 156, 157, 159, 160, 161, 162, 163, 164, 165, 166, 167, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 185, 186, 187, 188, 189, 190, 191, 193, 194, 196, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 224, 225, 226, 228, 229, 230, 241, 242, 265, 270, 298, 302, 303, 306, 309, 314, 316, 319, 325, 335, 346, 349, 351, 361, 369, 371, 372, 377, 378, 380, 387, 388, 390, 391, 392, 395, 399, 406, 409, 416, 417, 419, 423, 428], "difficult": 428, "difficulti": [376, 409, 428], "diffus": [9, 272, 302, 304, 405, 420], "diffusionpipelin": 9, "diffusionv1": [133, 134, 135, 136, 217, 218, 221, 222, 223, 231, 232, 235, 236, 237], "digest": 316, "digit": 420, "dilat": 255, "dim": [25, 36, 44, 256, 257, 258, 260, 388, 394, 399, 405, 409, 413], "dim_t": [280, 281], "dimens": [25, 32, 36, 44, 256, 257, 263, 303, 372, 378, 390, 404, 405, 407, 409, 413], "dimension": 25, "diminish": 429, "dino": 301, "dir": [265, 340, 348, 388], "direct": [20, 24, 281, 315, 319], "direct_process_row": 281, "directli": [24, 44, 260, 321, 335, 361, 369, 372, 380, 400, 403, 406, 428, 432], "directori": [21, 35, 247, 309, 314, 315, 321, 322, 345, 353, 355, 372, 375, 383, 384, 392, 412, 427], "dirnam": [314, 332, 349], "disabl": [24, 25, 264, 298, 314, 330, 362, 397, 411, 413], "disable_quanted_input": 247, "disast": 403, "discard": 414, "disclaim": [335, 380], "discontinu": 399, "discov": [312, 320], "discoveri": 361, "discrep": 372, "discret": 406, "discuss": [270, 409], "disjoint": 266, "disk": [390, 392, 396], "dispatch": 55, "dispatch_table_file_root": 390, "displai": [17, 382], "disregard": [256, 257, 270], "distanc": [303, 372], "distil": [246, 270, 302, 393, 430, 434], "distil_bert_bas": 387, "distilbert": [272, 302, 304, 306, 392, 418, 420, 430], "distilbert_bas": 387, "distilbert_base_uncas": 393, "distilgpt2": 304, "distillation_config": [246, 303, 306], "distillationconfig": [246, 306], "distillbert": 397, "distilledtextattack": 304, "distilroberta": 304, "distinct": 372, "distinguish": [369, 413], "distribut": [1, 25, 28, 252, 264, 303, 307, 314, 348, 354, 376, 406, 423, 424, 428], "distributed_init": 252, "distributed_world_s": 28, "div": [387, 391], "div2": 402, "dive": [312, 320], "diverg": 303, "divers": [314, 349, 369, 372, 378], "divid": [347, 372, 387, 395, 399, 404, 405, 408, 414], "divis": [372, 405], "dl": [400, 425, 426], "dlsa": [272, 302], "dm": 304, "dne_with_sparselib": [386, 413], "dne_with_sparselib_benchmark": [398, 413], "dne_with_sparselib_onli": [398, 413], "dne_with_sparselib_vtun": 410, "dne_with_test": 398, "dnnl": [279, 390, 394], "dnnl_arg_dst": 394, "dnnl_arg_src": 394, "do": [30, 47, 50, 246, 258, 264, 269, 298, 305, 316, 349, 350, 365, 369, 370, 387, 388, 390, 391, 395, 396, 400, 402, 405, 419, 420, 423, 428, 429], "do_blockwis": 247, "do_constant_fold": [246, 305], "do_ev": [349, 354], "do_lm_ev": 349, "do_sampl": [317, 363, 428, 432], "do_train": [314, 348, 349, 352, 354, 422], "doc": [9, 14, 22, 256, 257, 260, 270, 309, 321, 322, 324, 338, 372, 375, 387, 391, 400, 409], "docker": [312, 347, 364, 365], "docker_build_arg": [313, 316], "docker_cach": 366, "docker_run_env": [313, 314, 316], "dockerfil": [313, 314, 315, 316, 317, 318, 347, 349, 366], "dockerfile_tgi": 317, "dockerfile_vllm": 318, "dockerignor": [314, 315], "docstr": [36, 44], "document": [2, 9, 13, 14, 246, 272, 273, 302, 303, 306, 309, 319, 338, 348, 357, 358, 361, 371, 372, 376, 405, 407, 408, 409, 419, 423], "document_stor": 14, "docx": 372, "doe": [25, 247, 264, 350, 371, 387, 400, 401, 402, 403, 404, 407, 413], "doesn": [57, 256, 257, 349, 400, 409, 413], "dog": [36, 44, 340], "dolli": 428, "domain": [272, 293, 302, 309, 314, 335, 349, 372, 380, 398, 436], "don": [21, 36, 44, 57, 258, 266, 350, 361, 394, 396, 400], "done": [260, 303, 390, 405, 413, 423], "dong": 415, "dongsoo": 432, "dot": [24, 25, 407, 423], "doubl": [25, 327, 328, 329], "double_quant_bit": 247, "double_quant_dtyp": 247, "double_quant_group_s": 247, "double_quant_scale_dtyp": 247, "double_quant_use_sym": 247, "down": 57, "downgrad": 332, "download": [9, 17, 33, 35, 36, 44, 314, 315, 319, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 337, 340, 343, 344, 345, 350, 355, 357, 358, 364, 368, 369, 371, 372, 375, 383, 384, 424, 426, 427], "download_model": 360, "doxygenfil": 281, "dp_tile_n": 405, "dpc": [324, 338], "dpcpp": 432, "dpkg": 369, "dpo": 352, "dpo_clm": [346, 347], "dpo_pipelin": 347, "dpython_execut": 386, "draw_landmark": 21, "drawn": 407, "dream": 361, "driven": [309, 420], "driven_audio": 374, "driver": [361, 362, 425], "drop": [28, 29, 269, 304, 307, 309, 321, 406, 409, 416, 428, 429], "drop_and_restore_util": 31, "dropout": [28, 29, 260], "ds_build_cpu_adam": 330, "ds_build_util": 330, "dslim": 304, "dsparse_lib_use_amx": 413, "dst": [281, 394, 400, 401, 403, 404, 405, 408, 409, 413], "dst0": 281, "dst2": 281, "dst_data": 394, "dst_dt": 413, "dst_m1": 281, "dst_m2": 281, "dst_m_": 394, "dst_shape": [387, 388, 394], "dst_stride": 394, "dst_t": 281, "dst_tensor_ptr": 394, "dst_type": 413, "dstep": 281, "dstptr": 281, "dststride": 281, "dt": [400, 401], "dt1op1": 401, "dt2op2": 401, "dtype": [42, 57, 62, 121, 243, 244, 246, 302, 305, 371, 388, 389, 390, 394, 421, 432], "dtypes_dict": 57, "du": 405, "dual": 361, "due": [36, 57, 288, 314, 349, 351, 370, 372, 373, 391, 395, 399, 423, 426, 427, 432], "dummydataload": 302, "dump": [55, 269, 423], "dump_activation_dag": 396, "dump_tensor": 55, "duplic": 57, "durat": 389, "dure": [24, 39, 40, 246, 288, 307, 349, 350, 372, 388, 396, 405, 409, 414, 417, 419, 423], "dword": [409, 410], "dynam": [28, 38, 40, 246, 300, 304, 305, 307, 378, 394, 396, 398, 400, 404, 406, 413, 430, 437], "dynamic_config": [246, 306], "dynamic_length_config": 306, "dynamic_qu": 279, "dynamic_quant_desc": 279, "dynamic_quant_matmul": 279, "dynamic_quant_matmul_desc": 279, "dynamic_train": 28, "dynamiclengthconfig": [28, 246, 306], "dynamicquantconfig": 247, "e": [17, 23, 25, 33, 57, 267, 268, 269, 288, 298, 302, 303, 309, 313, 314, 315, 316, 318, 327, 328, 329, 331, 332, 347, 350, 351, 364, 365, 366, 369, 376, 390, 400, 401, 406, 407, 413, 414], "e0": 410, "e5": [372, 376], "e60": 319, "e7": 410, "each": [23, 25, 28, 36, 44, 57, 256, 257, 258, 260, 264, 265, 266, 299, 309, 314, 316, 330, 332, 349, 351, 355, 371, 372, 376, 379, 389, 390, 391, 399, 402, 404, 405, 406, 407, 408, 409, 412, 413, 414], "eager": 423, "earli": [369, 423, 430], "earlier": 319, "early_stop": 352, "eas": [370, 378], "easi": [260, 302, 316, 319, 338, 357, 358, 372, 390, 392], "easier": 382, "easiest": 319, "easili": [246, 316, 321, 369, 370, 372, 373, 399, 400, 432], "ebp": 410, "ec2": [397, 411], "echarlaix": 304, "econom": [372, 396], "edg": 49, "edit": [9, 298, 332, 361, 365], "edit_output": 24, "editor": 377, "edu": 263, "educ": 298, "ee": 410, "ee32e42": 425, "eea": 420, "effect": [272, 302, 342, 363, 369, 370, 372, 387, 413, 420, 432], "effici": [258, 272, 302, 306, 309, 314, 319, 349, 357, 361, 367, 370, 372, 396, 420, 422, 432], "effort": [372, 432], "effortlessli": 309, "ehdwns1516": 304, "einsum": 73, "einsumwitharang": 150, "either": [266, 309, 348, 349, 350, 372, 423], "elast": 304, "elec": 351, "electr": 385, "electra_base_chinese_discrimin": 425, "electra_base_chinese_gener": 425, "electron": [298, 351], "element": [47, 57, 246, 256, 257, 258, 260, 387, 395, 398, 402, 404, 407, 409, 413, 437], "element_num": [281, 401], "element_num_each_th": 281, "elementwise_over_al": 47, "elementwise_over_matmul_gemm_conv": 47, "eleuth": 346, "eleutherai": [304, 309, 314, 315, 349, 354, 428], "elia": 432, "elig": 395, "elimin": 266, "ellipsi": 407, "els": [289, 374, 377, 387, 394], "eltociear": 301, "eltwis": 401, "eltwise_forward": 394, "eltwise_gelu_erf": 394, "eltwise_gelu_tanh": 394, "eltwise_injector": [400, 401], "eltwise_injector_init": 401, "eltwiseop": [279, 400, 401], "eltwiseop_data_t": 281, "eltwiseop_desc": 279, "eltwiseop_kd_t": 401, "eltwiseop_param_t": [281, 401], "elucid": [319, 325], "embed": [2, 32, 36, 37, 44, 57, 259, 309, 336, 340, 357, 369, 370, 372, 375, 388, 391, 395, 400, 420], "embed_size_per_head": [36, 44], "embedding_dim": 37, "embedding_model": [372, 375], "embedding_model_dir": 375, "embeddingbag": [73, 150], "embeddings_reshap": 391, "embeddings_to_2d_before_inner_product": 150, "embeddingsto2dbeforeinnerproduct": 147, "embrac": 361, "emiss": 385, "emit": [57, 391], "emitt": 392, "emot": 304, "empathi": 298, "emphas": [354, 376], "emphasi": 372, "emphasized_weight": [256, 257], "emploi": [314, 319, 325, 347, 349, 370], "empow": [309, 378, 420], "empti": [21, 25, 57, 73, 256, 257, 264, 266, 309, 321, 391, 395, 401, 414, 421], "empty_list": 278, "empty_op": [83, 387], "emul": 423, "en": [9, 304, 353, 369, 372, 376], "en_core_web_lg": [334, 371, 375], "enabl": [25, 28, 40, 288, 289, 305, 306, 309, 314, 315, 319, 320, 324, 330, 332, 334, 336, 338, 340, 342, 344, 349, 350, 358, 361, 363, 369, 372, 373, 375, 376, 378, 397, 399, 405, 406, 410, 411], "enable_bf16": 305, "enable_executor": [302, 305], "enable_mask": 400, "enable_op_tun": 390, "enable_rerank": [14, 372], "enable_sequential_cpu_offload": 9, "encapsul": 406, "encod": [0, 9, 36, 44, 247, 259, 261, 350, 389, 395, 429], "encoder_attention_mask": [33, 36, 44], "encoder_hidden_st": [33, 36, 44], "encount": [319, 338, 358, 361, 370, 429], "encourag": 372, "end": [25, 36, 44, 47, 57, 261, 272, 302, 320, 354, 369, 389, 392, 394, 395, 401, 410, 432], "end_posit": [36, 44], "end_step": [28, 419], "endfor": [403, 404, 409], "endpoint": [309, 363], "energi": 361, "eng_": 394, "eng_kind": [280, 398, 401], "eng_kind_": 401, "engag": [319, 372, 378], "engin": [49, 50, 51, 52, 53, 54, 55, 56, 57, 59, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 150, 181, 215, 243, 244, 245, 269, 273, 282, 297, 299, 302, 337, 367, 372, 377, 387, 389, 391, 394, 396, 400, 405, 408, 412, 432, 433, 440], "engine_dispatch_t": 390, "engine_graph": [62, 94, 95, 243, 244], "engine_init": 55, "engine_integr": 388, "engine_kind": [278, 280, 401], "engine_kind_": [278, 280], "engine_t": 278, "english": [302, 304, 306, 349, 359, 418], "enhanc": [306, 309, 319, 325, 338, 342, 349, 357, 358, 361, 369, 370, 371, 372, 374, 378, 404, 432], "enlarg": 390, "enough": [396, 405, 423, 432], "enrich": [361, 372], "ensur": [47, 266, 307, 309, 316, 319, 322, 327, 328, 329, 351, 357, 370, 372, 373, 405], "entail": [266, 354], "enter": [313, 335, 347, 355, 364, 365, 366, 380, 387], "enterpris": 420, "entir": [25, 389, 399, 405, 406, 419, 428], "entiti": [309, 361, 371], "entranc": 413, "entri": [36, 44, 245, 246, 258, 415], "entropi": [33, 36, 44], "enum": [281, 400, 401], "enumer": [2, 5, 25, 281, 428], "env": [57, 314, 332, 335, 341, 349, 351, 361, 379, 380, 381, 382, 393], "env_setup": 322, "environ": [57, 252, 298, 323, 324, 326, 327, 328, 329, 330, 331, 332, 335, 340, 341, 343, 344, 345, 379, 380, 381, 382, 383, 384, 388, 413, 414, 425], "environ_info_init": 57, "environment": 361, "environment_vari": 413, "enviton": 361, "eos_coef": [256, 257], "ep": [255, 281], "epoch": [47, 314, 348, 349, 353, 354, 419, 422, 425], "epsilon": 387, "equal": [57, 246, 369, 391, 399, 400], "equat": [407, 423], "equival": [264, 340, 387, 409, 428, 432], "eras": 401, "erf": [73, 394], "error": [7, 22, 61, 247, 256, 257, 308, 309, 361, 372, 394, 410, 426, 427, 432], "escap": [400, 401], "escape_eras": 401, "escape_reg": 401, "especi": [319, 394, 409, 421], "essenti": [316, 319, 327, 328, 329, 372, 408], "establish": [345, 372, 383, 384], "estim": [9, 25, 385, 389, 409], "et": 432, "etc": [9, 25, 246, 266, 303, 308, 335, 371, 380, 389], "ethnic": 298, "euclidean": 303, "eval": [5, 349, 351, 416], "eval_accuraci": [302, 303, 306, 419], "eval_dataset": [302, 306], "eval_f1": [28, 30, 306, 416], "eval_func": 246, "eval_loss": 425, "eval_metr": 30, "eval_multi_choic": 268, "eval_open": 268, "evalpredict": 302, "evalu": [30, 47, 246, 253, 254, 256, 257, 268, 303, 354, 389, 407, 416, 417, 423, 426, 427], "evaluation_strategi": [314, 349, 352], "even": [24, 309, 319, 357, 372, 387, 399], "evenli": 399, "event": [298, 372], "eventu": [264, 361], "ever": 420, "everi": [17, 25, 47, 57, 266, 355, 364, 365, 387, 399, 413], "everydai": [338, 358], "everyon": [298, 319, 373], "everywher": 420, "evict": 307, "evo_eval_metr": 28, "evo_it": 28, "evoc": 418, "evol": 349, "evolust": 30, "evolustionari": 30, "evolut": [28, 31, 304], "evolutionari": [30, 246], "ewm_col": 265, "ex": [327, 328, 329, 349, 423], "exact": [268, 370, 377], "exactli": 395, "examin": 373, "exampl": [4, 25, 28, 32, 36, 44, 55, 57, 181, 260, 266, 269, 270, 272, 273, 292, 298, 302, 303, 306, 307, 309, 313, 314, 317, 318, 319, 321, 322, 323, 324, 330, 332, 333, 336, 337, 338, 339, 342, 346, 347, 348, 349, 350, 351, 352, 353, 354, 355, 356, 358, 359, 361, 363, 364, 365, 366, 368, 370, 371, 372, 377, 378, 387, 388, 390, 391, 393, 394, 395, 396, 398, 400, 402, 416, 419, 423, 426, 427, 435], "example_input": [27, 247, 289], "example_output": 351, "example_persist": [324, 338], "exc": 361, "exce": 429, "except": [17, 314, 349, 372, 390, 400], "excerpt": [319, 325], "excess": [372, 429], "excit": 378, "excluded_op_nam": 28, "excluded_precis": 247, "exclus": [24, 341, 381], "exec": [313, 314, 315, 410], "exec_context_t": 279, "execut": [24, 279, 302, 309, 313, 316, 317, 318, 330, 332, 335, 341, 347, 351, 358, 365, 379, 380, 381, 382, 390, 394, 398, 400, 401, 405, 406, 408, 410, 413, 414, 423, 426, 432], "execution_mod": 396, "execution_opt": [390, 396], "executor": [55, 375, 387, 388, 389], "executorbenchmark": 27, "exhaust": 25, "exhibit": [319, 370], "exist": [21, 35, 57, 247, 306, 307, 345, 355, 363, 372, 383, 384, 387, 418, 426, 427, 432], "exit": [385, 430], "exp": [28, 401, 408, 413], "expand": [36, 39, 40, 44, 73, 309, 357, 370, 372], "expand_dim": 83, "expand_gath": [36, 44], "expanddim": 74, "expandindic": 73, "expans": [39, 40], "expect": [36, 37, 44, 246, 256, 257, 260, 298, 300, 352, 372, 409, 417], "expens": [303, 319, 370], "experi": [272, 298, 302, 309, 319, 347, 357, 361, 369, 372, 378, 403, 408, 409], "experiment": [401, 432], "expert": 40, "explain": [270, 316], "explan": 407, "explicit": [279, 298, 394, 401], "explicitli": [314, 319, 349], "explicitnhwctransposeforconv": 206, "explicitnhwctransposeforconvqat": 207, "exploit": [49, 395], "explor": [270, 306, 319, 320, 361, 432], "explos": 420, "expon": 260, "exponenti": 265, "export": [246, 266, 270, 302, 313, 314, 316, 322, 324, 331, 332, 334, 337, 349, 353, 354, 366, 389, 392, 418, 426, 427, 432, 434], "export_model": 337, "export_to_bf16_onnx": 246, "export_to_fp32_onnx": 246, "export_to_int8_onnx": 246, "export_to_jit": 246, "export_to_onnx": [246, 302, 305], "expos": [247, 400, 401], "exposur": 372, "express": [298, 371, 372], "expsum": 281, "exteion": 331, "extend": [272, 302, 376, 402, 421, 428, 432], "extens": [247, 269, 270, 299, 300, 306, 313, 314, 315, 316, 319, 320, 325, 330, 331, 332, 337, 347, 349, 352, 354, 355, 364, 365, 366, 374, 386, 387, 388, 415, 417, 418, 419, 420, 424, 425, 426, 427, 428, 429, 432], "extern": [320, 372], "extra": [24, 261, 332, 372, 396, 405], "extract": [9, 23, 49, 50, 52, 53, 63, 64, 66, 67, 68, 69, 70, 71, 72, 74, 76, 77, 78, 79, 80, 81, 82, 84, 85, 86, 87, 88, 89, 90, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 122, 123, 124, 125, 126, 127, 266, 371, 372, 387, 390, 392], "extract_numb": 268, "extract_t": 359, "extract_text_from_span": 266, "extract_text_inside_bbox": 266, "extractor": [57, 58, 390, 392, 395], "extrem": [302, 306, 427], "f": [57, 281, 303, 313, 314, 315, 316, 317, 318, 340, 347, 348, 349, 361, 394, 410, 423, 432], "f1": [304, 416], "f32": [389, 394], "f5": 410, "f7e0": 319, "fac": 304, "face": [16, 19, 35, 247, 272, 273, 298, 302, 309, 321, 345, 348, 349, 354, 361, 372, 383, 384, 392, 420, 429], "face_anim": [309, 360, 374], "faceanim": [309, 321], "facebook": [323, 428], "facet": 372, "facilit": [319, 325, 405, 408], "fact": [307, 408, 423], "factor": [260, 372, 397, 411, 425, 426, 428], "fail": [351, 361], "failur": 269, "fair": [288, 298, 377], "fairseq": 44, "faiss": 376, "faith": 298, "fake": 423, "falcon": [363, 428], "falcon_peft_finetuned_model": 349, "fall": 266, "fallback": 372, "fals": [0, 4, 14, 17, 24, 25, 28, 30, 35, 36, 37, 40, 41, 44, 55, 246, 247, 256, 257, 259, 266, 281, 289, 303, 307, 314, 316, 336, 340, 346, 347, 349, 352, 354, 361, 363, 371, 372, 375, 387, 390, 400, 401, 413, 416, 422, 428, 432], "famili": 361, "familiar": 361, "faq": 298, "far": 408, "fascin": 361, "fast": [272, 302, 306, 319, 420, 432], "fastapi": [309, 366], "fastchat": [0, 22], "fastedit": 377, "faster": [361, 396, 420], "fastrag": 309, "fatal": 61, "father": 387, "fault": 269, "fb": 410, "feasibl": 372, "featur": [9, 25, 44, 272, 297, 300, 302, 303, 309, 319, 324, 327, 328, 329, 335, 341, 350, 361, 369, 372, 378, 379, 380, 381, 382, 392, 395, 399, 406, 410, 418, 421, 424, 426, 430, 440], "feature_extractor": 9, "feature_mxfp4_poc": 332, "fed": 246, "feed": [25, 36, 44, 303, 324, 388], "feed_forward_chunk": [36, 44], "feedback": [319, 346, 347], "feel": [341, 378, 381, 405, 413, 430], "feet": 377, "feng": 301, "fetch": [305, 370, 372, 399, 402], "few": [57, 336, 337, 338, 340, 358, 369, 404], "fewer": [266, 342, 406], "fewest": 266, "ffmpeg": [308, 360, 369, 374], "ffn": [256, 257], "fictiti": 372, "fid": 425, "field": [57, 264, 265, 266, 361, 365, 375, 400, 401], "figur": [372, 402, 405, 406, 407, 409, 412], "file": [0, 6, 13, 25, 28, 30, 35, 47, 49, 50, 52, 53, 54, 55, 60, 61, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 184, 215, 246, 247, 248, 260, 265, 267, 278, 279, 280, 281, 302, 309, 314, 315, 319, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 338, 340, 343, 344, 345, 348, 349, 350, 355, 363, 366, 369, 370, 372, 375, 376, 378, 379, 382, 383, 384, 387, 388, 389, 390, 392, 394, 400, 401, 412, 413, 414, 415], "file_nam": 428, "file_root": 390, "filenam": [25, 267], "fill": [73, 361, 391, 407], "filter": [25, 266, 319, 372, 373, 419], "final": [49, 260, 266, 350, 351, 390, 391, 392, 394, 395, 402, 405, 406, 408, 416, 426], "find": [24, 270, 281, 314, 324, 327, 328, 329, 338, 345, 349, 364, 365, 366, 372, 383, 384, 387, 390, 394, 395, 404, 419, 426, 427], "fine": [5, 272, 302, 309, 320, 357, 372, 376, 420, 422, 423, 432], "finetun": [4, 5, 246, 302, 304, 306, 309, 312, 319, 320, 321, 340, 346, 347, 350, 352, 354, 369, 372, 375, 418, 422], "finetune_cfg": 319, "finetune_clm": [314, 348, 349, 352, 354, 422], "finetune_lora": 350, "finetune_model": [4, 319], "finetune_neuralchat_v3": 347, "finetune_seq2seq": [314, 349], "finetuned_model": [347, 355], "finetuned_model_lora": 347, "finetuned_model_lora_plus_dpo": [346, 347], "finetuning_data": 350, "finetuningargu": 5, "finish": [387, 391, 393], "finish_reason": 361, "finit": [25, 373], "first": [24, 25, 57, 177, 246, 266, 299, 309, 321, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 335, 336, 337, 338, 340, 343, 344, 345, 347, 351, 357, 363, 366, 368, 370, 372, 375, 380, 383, 384, 385, 386, 387, 390, 391, 393, 395, 396, 401, 402, 403, 404, 405, 406, 407, 408, 409, 410, 414, 423, 425, 428, 432], "first_lay": 24, "firstli": 49, "fist": 57, "fit": [25, 402, 432], "five": 356, "fix": [25, 37, 255, 390, 403, 420, 428], "fixedrandomsubsetsampl": 25, "fixedsubsetsampl": 25, "fl": 385, "flag": [305, 315, 321, 364], "flan": [272, 302, 314], "flan_t5_larg": 425, "flash": 32, "flat": 25, "flatmapdataset": 73, "flatten": [25, 73], "flexibl": [309, 319, 325, 372, 405, 429], "float": [25, 28, 47, 246, 247, 250, 251, 258, 260, 264, 268, 281, 303, 305, 371, 387, 393, 400, 401, 402, 416, 417, 419, 423, 428], "float16": 432, "float32": [371, 407, 423], "float4": 422, "float8_e4m3_t": 281, "float8_e5m2_t": 281, "floattensor": [36, 40, 41, 44], "floor_divid": 73, "flow": [38, 49, 57, 387, 391], "fluent": 372, "fly": 403, "fma": 409, "fmt": 264, "fn": [24, 25], "foc": 25, "focal": [256, 257], "focu": [378, 393, 407, 416, 432], "focus": [298, 314, 349, 369, 372, 378], "fold": [246, 413, 428], "folder": [270, 316, 335, 341, 348, 353, 355, 364, 365, 366, 372, 373, 379, 380, 381, 382, 386, 387, 388, 389, 390, 392, 432], "follow": [9, 24, 25, 36, 44, 57, 129, 130, 131, 132, 133, 134, 135, 136, 137, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 151, 152, 153, 154, 155, 156, 157, 159, 160, 161, 162, 163, 164, 165, 166, 167, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 182, 183, 185, 186, 187, 188, 189, 190, 191, 193, 194, 196, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 216, 217, 218, 221, 222, 223, 224, 225, 226, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 256, 257, 274, 277, 282, 286, 289, 298, 300, 302, 303, 308, 309, 313, 314, 315, 316, 319, 321, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 335, 336, 337, 338, 340, 343, 344, 345, 346, 348, 349, 350, 351, 352, 354, 355, 356, 357, 358, 361, 363, 364, 365, 366, 368, 369, 370, 372, 373, 376, 380, 383, 384, 385, 387, 389, 390, 391, 392, 394, 400, 401, 403, 404, 405, 406, 407, 408, 409, 410, 413, 414, 415, 423, 426, 432], "followt": 366, "footprint": [307, 420], "forc": [25, 35, 315, 353], "force_download": 35, "forced_assign": 266, "forg": [323, 324, 331, 334, 336, 337, 338, 340, 343, 344, 354, 357, 358, 363, 368, 426], "forget": [387, 391, 394], "fork": 315, "form": [49, 57, 246, 266, 268, 303, 314, 316, 340, 349, 351, 361, 372, 378, 389, 395, 399, 404, 408, 413], "formal": 36, "format": [0, 23, 61, 128, 246, 256, 257, 260, 263, 266, 269, 319, 351, 355, 363, 372, 389, 407, 408, 411, 412, 421, 423, 428], "format_typ": 280, "formatt": 6, "former": [57, 355, 395], "formerli": [293, 302, 398, 436], "formul": [346, 347, 372], "formula": [385, 405], "forth": 361, "forward": [9, 32, 33, 36, 37, 40, 44, 256, 257, 258, 260, 389, 394, 423], "forward_infer": [280, 394, 401], "forward_tim": 389, "foster": 298, "found": [270, 302, 308, 309, 321, 372, 386, 387, 409, 426, 427], "four": [409, 426, 429], "fourth": 420, "fox": 340, "fp16": [288, 423, 432], "fp32": [28, 158, 246, 269, 302, 304, 337, 375, 388, 389, 390, 392, 394, 400, 401, 403, 404, 406, 408, 413, 418, 421, 423, 425, 428, 432], "fp32_bia": 62, "fp32_exp": 401, "fp32_exp_attr": 401, "fp32_gelu": [401, 413], "fp32_gelu_attr": 401, "fp32_relu": 413, "fp32_weight": 421, "fp4": 422, "fp4_e2m1": 421, "fp8": [425, 426], "fpn": 260, "fpn_dim": 260, "fr": 369, "frac": [399, 407], "fraction": [25, 266], "fragment": 354, "framework": [45, 49, 57, 60, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 74, 76, 77, 78, 79, 80, 81, 82, 84, 85, 86, 87, 88, 89, 90, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 122, 123, 124, 125, 126, 127, 181, 246, 252, 293, 304, 308, 309, 313, 319, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 346, 350, 372, 387, 388, 392, 396, 398, 418, 420, 423, 425, 426, 436], "framework_model": [62, 94, 95, 243], "framework_modeling_config": 55, "franc": 377, "frantar": 432, "fraud": 420, "free": [298, 316, 341, 372, 378, 381, 400, 413, 428, 430], "freedom": 378, "frequenc": [397, 411, 419], "frequencygovern": [397, 411], "frequent": [370, 399], "friend": 361, "friendli": [309, 319, 357, 372, 389, 405, 406], "from": [0, 2, 4, 5, 9, 17, 22, 23, 24, 25, 27, 30, 32, 35, 36, 38, 39, 40, 43, 44, 47, 55, 57, 62, 63, 64, 66, 67, 68, 69, 70, 71, 72, 74, 76, 77, 78, 79, 80, 81, 82, 84, 85, 86, 87, 88, 89, 90, 92, 93, 95, 96, 97, 99, 100, 101, 102, 103, 105, 106, 107, 108, 109, 110, 111, 112, 114, 115, 117, 118, 119, 120, 122, 123, 124, 125, 126, 127, 243, 244, 246, 247, 255, 260, 261, 262, 263, 264, 265, 266, 268, 270, 288, 289, 298, 299, 300, 303, 304, 306, 307, 309, 311, 316, 319, 327, 328, 329, 330, 331, 332, 334, 335, 337, 345, 346, 347, 350, 353, 354, 355, 356, 358, 360, 361, 367, 369, 370, 371, 372, 373, 374, 375, 376, 380, 383, 384, 387, 388, 389, 390, 391, 392, 394, 395, 396, 400, 401, 404, 406, 407, 408, 409, 416, 417, 418, 419, 420, 422, 423, 425, 426, 427, 428, 429, 430, 432], "from_llm": [309, 372], "from_output": 55, "from_pretrain": [35, 36, 44, 247, 289, 302, 306, 307, 418, 422, 428, 429, 432], "front": 400, "frontend": [60, 320, 333, 339, 342, 378, 404, 432], "frozen": [9, 57, 255, 350, 388, 392, 422], "frozenbatchnorm2d": 255, "fschat": 366, "ftl": 385, "fulfil": 390, "full": [15, 25, 327, 328, 329, 372, 402, 415, 423], "full_finetun": 348, "fulli": [25, 372, 399, 408, 409, 426], "func": [57, 390], "function": [2, 10, 18, 35, 47, 246, 278, 279, 280, 303, 309, 319, 335, 342, 357, 360, 361, 369, 370, 371, 372, 373, 374, 380, 387, 390, 391, 394, 395, 396, 400, 401, 413, 416, 419, 420, 421, 423, 428, 432], "further": [298, 306, 313, 372, 390, 404, 432], "fuse": [39, 40, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 151, 152, 153, 154, 155, 156, 157, 159, 160, 161, 162, 163, 164, 165, 166, 167, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 182, 183, 185, 186, 187, 188, 189, 190, 191, 193, 194, 196, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 392, 400, 401], "fused_batch_matmul_v2": 83, "fused_batch_norm_v3": 83, "fused_gemm": 83, "fused_matmul": 83, "fusedbatchnormv3": 76, "fusedgemm": 77, "fusedmatmul": 78, "fusion": [57, 138, 181, 195, 394, 395, 400, 401, 406, 439], "futur": [323, 324, 331, 334, 336, 337, 338, 340, 343, 344, 357, 361, 363, 367, 368, 392, 400, 401, 410], "futurewarn": 361, "fwk": 57, "fx": 423, "g": [17, 23, 25, 267, 268, 303, 309, 327, 328, 329, 331, 332, 350, 351, 369, 376, 400, 401, 406, 414, 425], "g_idx": 421, "gadget": 420, "gain": [372, 406, 423, 425], "gamma": 260, "gaodrew": 348, "gap": 406, "gather": [36, 44, 83, 246, 264, 279, 387, 400], "gather_desc": 279, "gather_el": [83, 387], "gather_typ": 281, "gatherel": [80, 387], "gatherv2": [79, 387], "gatherwithadd": 147, "gaudi": [309, 313, 316, 323, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 336, 338, 339, 340, 342, 343, 344, 363, 366, 420], "gaudi1": 320, "gaudi2": [308, 319, 320, 349, 379, 420, 425], "gaudi_bartattention_forward": 37, "gaudi_bartlearnedpositionalembed": 37, "gaudi_mistral_repeat_kv": 39, "gaudi_mistral_rmsnorm_forward": 39, "gaudi_mixtral_attention_forward": 40, "gaudi_mixtral_block_sparse_moe_forward": 40, "gaudi_mixtral_decoder_layer_forward": 40, "gaudi_mixtral_model_forward": 40, "gaudi_mixtral_repeat_kv": 40, "gaudi_mixtral_rmsnorm_forward": 40, "gaudi_phi_attention_forward": 41, "gaudi_phi_decoder_layer_forward": 41, "gaudi_phi_model_forward": 41, "gaudi_spawn": [346, 349, 352], "gaudi_swin_get_attn_mask": 42, "gaudimixtralforcausallm": 40, "gb": 372, "gcc": [308, 337, 386, 425, 426, 427], "gchhablani": 304, "geeki": 420, "gelu": [83, 150, 387, 394, 398, 401, 413], "gelu_algorithm": 394, "gelu_d": 394, "gelu_erf": 394, "gelu_p_": 394, "gelu_pd": 394, "gelu_tanh": [389, 394], "geluoper": 394, "gemm": [83, 305, 399, 402, 405, 408, 413, 421, 437], "gemma": 349, "gen": [302, 323, 349, 354, 401], "gen_cas": 401, "gen_case_": 413, "gen_id": [428, 432], "gen_text": [428, 432], "genai": 420, "gender": 298, "gene": 30, "gener": [0, 5, 9, 27, 28, 36, 44, 47, 55, 57, 246, 258, 259, 260, 263, 268, 269, 270, 289, 302, 303, 309, 314, 315, 317, 321, 324, 325, 335, 338, 340, 346, 347, 348, 349, 350, 351, 352, 354, 355, 358, 361, 363, 368, 369, 371, 372, 373, 380, 384, 387, 391, 395, 400, 401, 404, 405, 408, 410, 412, 413, 416, 417, 420, 423, 426, 427, 428, 429, 432], "generalized_box_i": 263, "generate_kwarg": [428, 432], "generate_sequ": 150, "generatesequ": 149, "genv": [314, 349], "get": [0, 25, 29, 30, 49, 55, 57, 61, 62, 243, 244, 246, 251, 270, 299, 305, 308, 319, 321, 327, 328, 329, 334, 335, 337, 341, 348, 349, 356, 359, 361, 365, 369, 370, 379, 380, 381, 382, 387, 390, 391, 392, 394, 395, 400, 407, 409, 414, 418, 426, 435], "get_addr": 400, "get_autocast_info": 57, "get_bbox_span_subset": 266, "get_binaryop_list": [280, 400], "get_children": 62, "get_conv_templ": 0, "get_data_dtyp": 57, "get_data_s": 400, "get_engine_kind": 278, "get_environ_info": 57, "get_example_input": [27, 289], "get_export_arg": 246, "get_global_id": 402, "get_group_id": 402, "get_implementation_list": 278, "get_initializer_children_nam": 62, "get_input_embed": [36, 44], "get_last_word_idx_in_templ": 23, "get_local_id": 402, "get_logg": 61, "get_lut_exp_attr": 281, "get_model_fwk_nam": 57, "get_modul": 24, "get_multi_choice_info": [267, 351], "get_next_node_nam": 55, "get_node_by_nam": 55, "get_node_children_nam": 62, "get_node_id": [55, 387], "get_output_embed": [36, 44], "get_paramet": 24, "get_peft_model": 422, "get_pre_node_nam": 55, "get_prompt": 0, "get_quant_info": 57, "get_refresh_data_idx": 413, "get_relevant_docu": [309, 372], "get_reprs_at_idx": 23, "get_reprs_at_word_token": 23, "get_runtime_kind": 278, "get_sp": 279, "get_sparse_nodes_nam": 55, "get_sparsity_ratio": 47, "get_stor": 30, "get_tensor_dest_op": 243, "get_tensor_idx": 55, "get_throughput": 249, "get_true_data": 401, "get_true_data_": 413, "get_words_idxs_in_templ": 23, "get_workspace_s": 279, "getdefaultencod": 247, "getidx": 401, "getmemori": 396, "getstrid": 394, "getter": [36, 44], "gflag": 388, "gflop": [304, 411, 414], "gfpgan": [360, 374], "ggml": 426, "gha": 269, "gidx": 421, "gigant": 428, "giou": [256, 257, 263], "girl": [361, 426, 427, 428, 429, 432], "git": [35, 269, 302, 308, 314, 315, 322, 327, 328, 329, 330, 331, 332, 334, 338, 347, 349, 354, 357, 359, 361, 362, 363, 366, 368, 386, 388, 432], "github": [22, 38, 43, 262, 269, 300, 302, 308, 314, 315, 319, 322, 325, 327, 328, 329, 330, 331, 332, 345, 347, 348, 349, 352, 354, 361, 362, 366, 383, 384, 386, 388, 394, 415, 424, 432], "give": [24, 57, 316, 349, 387, 391, 399], "given": [0, 4, 23, 24, 25, 28, 35, 36, 44, 247, 267, 268, 316, 322, 351, 354, 376, 385, 395, 401, 404, 407, 409], "glibcxx_3": [426, 427], "global": [28, 264, 330, 369, 425], "globalcol": 402, "globalrow": 402, "glog": 388, "glog_minloglevel": [302, 388, 393], "gloo": 252, "glue": [302, 354], "glx": [308, 309, 334], "gmt": 361, "gnr": 332, "gnu": 386, "go": [5, 39, 40, 57, 264, 300, 351, 361, 363, 402], "goal": [246, 316, 372, 419], "goe": 372, "gold_i": 268, "golub": 25, "gomez": [36, 44], "good": [25, 298, 321, 369, 373, 402, 403, 425], "googl": [314, 334, 349, 372], "google_api_kei": 334, "got": 361, "govindh": 420, "gp": 334, "gperftool": [323, 324, 331, 334, 336, 337, 338, 340, 343, 344, 354, 357, 358, 363, 368], "gpt": [272, 302, 304, 309, 314, 315, 324, 346, 347, 349, 350, 352, 363, 420, 428, 432], "gpt_j_6b": 425, "gpt_j_6b_clm": 425, "gpt_j_6b_url": [335, 380], "gpt_neox_clm": 425, "gptbigcod": 33, "gptbigcodeforcausallm": 33, "gptbigcodeforsequenceclassif": 33, "gptbigcodefortokenclassif": 33, "gptbigcodemodel": 33, "gptbigcodepretrainedmodel": 33, "gptcach": [319, 370], "gptj": 354, "gptj_ft_env": 354, "gptj_peft_finetuned_model": 354, "gptneotoken": 349, "gptneoxtoken": 349, "gptneoxtokenizerfast": 349, "gptq": [288, 319, 421, 432], "gptqconfig": [247, 432], "gpu": [9, 25, 309, 312, 320, 323, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 336, 338, 339, 340, 342, 343, 344, 347, 350, 363, 365, 367, 376, 385, 402, 420], "gpu_id": 9, "gpu_ocl_engine_t": 278, "gqa": 350, "gracefulli": 298, "grad": 24, "gradient": [24, 256, 257, 422, 432], "gradient_accumulation_step": [314, 346, 347, 348, 349, 352, 354, 422], "gradient_checkpoint": [346, 347, 349], "gradient_checkpointing_en": 422, "gradio": [0, 345, 361, 378, 383, 384], "gradio_cli": 361, "gradio_web_serv": [345, 361, 383, 384], "gradiodeprecationwarn": 361, "granular": 354, "graph": [4, 24, 38, 49, 50, 52, 53, 54, 57, 58, 59, 62, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 182, 183, 185, 186, 187, 188, 189, 190, 191, 193, 194, 196, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 269, 274, 302, 303, 320, 387, 392, 395, 396, 404, 407, 409, 439], "graph_def": [52, 53, 54, 243], "graph_dispatch": [55, 390], "graph_init": [55, 388, 390], "graph_node_names_detail": [62, 243, 244], "graph_nodes_dict": [62, 243], "graph_util": [58, 387, 391, 395], "great": [353, 359, 360, 374, 418, 420], "greater": [391, 416, 417], "greater_is_bett": [250, 251, 416, 417, 423], "greatli": [403, 408], "grep": [302, 345, 383, 426, 427], "grew": 361, "grid": 266, "ground": [256, 257, 258, 376], "group": [260, 304, 349, 361, 365, 366, 376, 387, 395, 402, 407, 409, 425, 426, 432], "group_by_length": [314, 349], "group_by_modality_length": 350, "group_dim": 247, "group_rowptr": 281, "group_siz": [247, 426, 432], "grouplasso": 419, "groupnorm": 279, "groupnorm_desc": 279, "grow": [25, 395, 432], "gt": [401, 428], "gte": [372, 376], "gtest": 269, "guarante": 373, "guard": 361, "gui": [320, 409, 410], "guid": [22, 270, 292, 302, 303, 312, 314, 320, 323, 324, 326, 330, 331, 332, 333, 334, 336, 337, 338, 339, 340, 342, 343, 344, 348, 349, 356, 363, 387, 401, 403, 420, 435], "guidanc": [270, 314, 335, 346, 349, 352, 378, 380], "guidelin": [270, 271, 319, 372], "guimar\u00e3": 301, "gunho": 432, "guskin": 301, "gxx": 426, "gxx_linux": 426, "h": [21, 256, 257, 263, 309, 313, 316, 317, 318, 321, 324, 340, 361, 363, 367, 385, 390, 432], "h100": 420, "h2": 307, "h2o_config": 307, "h2o_min_seqlen": 307, "h2oconfig": 307, "h384": 304, "ha": [9, 17, 52, 53, 54, 57, 266, 302, 309, 310, 319, 323, 330, 332, 338, 350, 351, 354, 355, 357, 358, 369, 372, 377, 387, 390, 391, 393, 394, 395, 399, 401, 405, 413, 421, 423, 429], "habana": [39, 40, 313, 316, 320, 323, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 336, 338, 339, 340, 342, 343, 344, 347, 363, 366], "habana_visible_devic": [314, 315, 347, 363, 366], "habanaai": 366, "had": [57, 361, 387], "haihao": 415, "half": [15, 402, 408, 429], "hallucin": [338, 350, 358, 372, 420], "hammer": 301, "han": 43, "hand": 420, "handcraft": 432, "handl": [24, 25, 33, 36, 44, 246, 271, 279, 309, 338, 354, 357, 358, 359, 372, 394, 403, 404], "handler": [6, 247], "hanwen": 415, "happen": [57, 256, 257, 389, 409], "happi": 378, "har": 346, "harass": 298, "hard": 260, "hardik": 301, "hardwar": [303, 309, 325, 357, 410, 412, 420], "harm": [9, 298, 319, 373, 409], "has_append_sum": [281, 413], "has_bia": 281, "has_binary_add": 413, "has_scale0": 281, "hash": [390, 400, 401], "hat": [425, 426], "have": [0, 9, 24, 32, 36, 44, 49, 57, 243, 264, 266, 269, 288, 298, 300, 302, 308, 309, 314, 315, 319, 323, 325, 330, 332, 335, 338, 345, 348, 349, 352, 355, 357, 358, 361, 364, 371, 372, 373, 376, 378, 380, 383, 384, 387, 388, 389, 390, 391, 392, 395, 396, 401, 405, 406, 407, 408, 412, 413, 415, 416, 417, 418, 419, 422, 423, 428, 432], "haven": 351, "haystack": [319, 372], "hbm": 302, "he": [377, 432], "head": [32, 36, 44, 57, 260, 395, 401, 407, 408, 426, 427], "head_dim": [32, 39, 40], "head_mask": [33, 36, 44], "head_num": [281, 407, 413], "head_nun": 407, "head_siz": [281, 407, 413], "header": [266, 309, 372], "header_supercell_tre": 266, "health": [364, 365, 366], "heart": 361, "heavy_ratio": 307, "height": [42, 256, 257, 266, 377], "hella": 288, "hellaswag": 346, "hello": [36, 44, 369], "helloswag": 426, "help": [24, 302, 309, 316, 319, 321, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 337, 338, 340, 342, 343, 344, 345, 351, 358, 361, 363, 372, 377, 383, 384, 385, 395, 400, 412], "helper": [1, 18, 24, 264], "helsinki": [304, 353], "henc": [319, 370], "hengyu": 415, "her": 361, "here": [57, 246, 295, 299, 302, 313, 317, 318, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 335, 336, 338, 340, 341, 343, 344, 349, 355, 356, 358, 361, 363, 366, 367, 369, 370, 371, 372, 379, 380, 381, 382, 386, 387, 390, 391, 392, 394, 395, 401, 409, 421, 423, 424, 426, 427, 429, 432, 438], "hessian": 288, "hf": [309, 314, 327, 328, 329, 332, 346, 347, 352, 377, 422, 432], "hf_access_token": 352, "hf_home": 427, "hidden": [29, 36, 44, 402, 425], "hidden_dim": [256, 257, 260], "hidden_s": [36, 44], "hidden_st": [36, 37, 39, 40, 41, 44], "hide": [36, 44, 246], "hierarchi": 25, "hierarchical_subsequ": 24, "high": [266, 293, 302, 319, 337, 361, 363, 367, 369, 372, 388, 396, 398, 405, 406, 409, 426, 427, 436], "higher": [25, 266, 319, 337, 346, 348, 349, 352, 370, 376, 377, 390, 407, 409, 413, 423, 426, 429], "higher_is_bett": 423, "highli": [305, 372, 421], "highlight": [316, 409], "hill": 361, "hint": [335, 380], "histogram": 25, "histor": 382, "histori": [0, 10, 25, 335, 380, 382], "hit": [370, 376], "hkunlp": 375, "hold": [258, 266], "home": [334, 361, 393, 427], "hook": 24, "hope": 405, "horizon": 361, "horizont": 400, "host": [35, 313, 314, 315, 316, 317, 318, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 338, 340, 343, 344, 347, 361, 363, 364, 365, 366, 372, 375], "host_dir": 315, "hostfil": [1, 314], "hostnam": 314, "hotmap": 412, "hour": 425, "hover": [335, 380], "how": [29, 246, 256, 257, 266, 269, 270, 271, 300, 308, 312, 319, 320, 323, 324, 326, 327, 328, 329, 330, 331, 332, 333, 335, 339, 342, 343, 348, 349, 350, 352, 353, 354, 355, 356, 358, 367, 368, 369, 376, 377, 380, 387, 388, 389, 392, 393, 395, 401, 402, 403, 413, 416, 419, 426], "howev": [47, 57, 319, 335, 338, 358, 370, 372, 373, 377, 380, 390, 391, 395, 396, 399, 403, 406, 409, 428, 432], "howpublish": 415, "hpp": [279, 280, 281, 390, 398, 413], "hpu": [1, 38, 42, 309, 312, 313, 314, 315, 316, 323, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 336, 338, 339, 340, 342, 343, 344, 347, 349, 352, 363, 375], "ht": [425, 426], "html": [270, 309, 316, 317, 319, 364, 365, 366, 370, 372, 389, 392, 394, 419], "html64": 369, "htmlon": 351, "http": [9, 22, 25, 36, 38, 43, 260, 262, 263, 302, 308, 309, 313, 314, 315, 316, 317, 318, 319, 321, 322, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 335, 336, 337, 338, 340, 343, 344, 345, 347, 348, 349, 352, 354, 357, 359, 361, 362, 363, 364, 365, 366, 367, 368, 369, 372, 380, 383, 384, 386, 388, 394, 415, 420, 424, 432], "http_proxi": [313, 314, 315, 316, 317, 318, 363], "https_proxi": [313, 314, 315, 316, 317, 318], "hub": [35, 247, 270, 348, 364, 418, 426, 427], "hug": [35, 247, 272, 302, 309, 345, 348, 349, 354, 383, 384, 392, 420, 429], "huge": 25, "hugginfac": [314, 337, 349], "huggingfac": [9, 35, 302, 309, 314, 334, 338, 348, 349, 351, 359, 363, 364, 369, 371, 372, 393, 416, 418, 420, 426, 427, 432], "huggingface_pipelin": [309, 372], "huggingfaceh4": [347, 349], "huggingfacepipelin": [309, 372], "huiyan": 301, "hull": 266, "human": [319, 346, 347, 349, 369], "hungarian": [256, 257], "hungarianmatch": 258, "hw": [306, 308, 309, 312], "hybrid": [304, 319, 325, 420], "hypeparamet": 432, "hyperparamet": [246, 428, 432], "hypothesi": [354, 419], "i": [0, 17, 19, 20, 23, 24, 25, 28, 32, 33, 36, 37, 42, 44, 47, 49, 50, 52, 53, 54, 57, 62, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 151, 152, 153, 154, 155, 156, 157, 159, 160, 161, 162, 163, 164, 165, 166, 167, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 182, 183, 185, 186, 187, 188, 189, 190, 191, 193, 194, 196, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 216, 217, 218, 220, 221, 222, 223, 224, 225, 226, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 246, 247, 256, 257, 258, 259, 260, 261, 263, 266, 267, 272, 274, 277, 278, 279, 280, 281, 282, 286, 288, 289, 293, 298, 299, 300, 302, 303, 304, 305, 306, 307, 309, 313, 314, 315, 316, 317, 318, 319, 321, 322, 324, 325, 326, 327, 328, 329, 330, 332, 334, 335, 336, 337, 338, 340, 342, 343, 344, 345, 346, 347, 348, 349, 350, 351, 353, 354, 355, 357, 358, 359, 360, 362, 363, 364, 366, 367, 369, 371, 372, 373, 374, 375, 376, 377, 378, 380, 383, 384, 385, 386, 387, 388, 389, 390, 391, 392, 394, 395, 396, 398, 399, 400, 401, 402, 403, 404, 405, 406, 407, 408, 409, 410, 412, 413, 414, 415, 416, 417, 418, 419, 420, 421, 422, 423, 426, 427, 428, 429, 432, 436], "i0103": 366, "ic": 302, "icelak": 308, "icon": [377, 382], "icx": [320, 345, 383, 384], "icx02": 361, "id": [35, 44, 55, 57, 262, 327, 328, 329, 330, 332, 351, 361, 372, 375, 401, 402], "id2label": [302, 306], "id_rsa": [330, 332], "idea": [401, 409, 419], "ideal": 372, "ident": [73, 298, 303], "identif": 372, "identifi": [35, 57, 307, 319, 325, 370, 372, 402, 429], "idm": [358, 372], "idx": [23, 38, 57, 401], "ie": [256, 257, 260], "ieee": 25, "igeni": 301, "ignor": [33, 36, 44, 246, 321, 361, 387], "ignore_keys_for_ev": 246, "ikko": 301, "illia": [36, 44], "illustr": 407, "im_end": 281, "im_start": 281, "imag": [0, 9, 17, 20, 256, 257, 259, 260, 267, 302, 304, 313, 317, 318, 319, 321, 335, 337, 347, 350, 361, 378, 380, 392, 395, 403, 409], "image2imag": [309, 321], "image2text": 348, "image_aspect_ratio": 350, "image_nam": [349, 366], "image_root_path": 334, "image_server_ip": 334, "image_tag": 349, "imagenet": 17, "imageri": 298, "imbal": 372, "imbusch": 301, "img": [20, 21], "img_mask": 42, "img_new": 20, "immedi": 24, "impact": [319, 354, 361, 372, 403], "impl_list_": 279, "impl_list_item_t": [278, 279], "impl_nthr": 280, "impl_nthr_": [280, 401], "implement": [9, 25, 47, 293, 302, 307, 326, 327, 328, 329, 330, 332, 345, 354, 367, 372, 377, 383, 384, 387, 390, 391, 395, 398, 399, 400, 402, 404, 405, 406, 407, 408, 410, 413, 432, 436], "implicit": 404, "import": [0, 4, 36, 44, 45, 55, 57, 289, 300, 302, 303, 306, 307, 309, 311, 314, 316, 319, 321, 332, 347, 349, 361, 369, 370, 371, 373, 374, 375, 376, 387, 388, 390, 392, 395, 396, 400, 401, 405, 416, 417, 418, 419, 421, 422, 423, 426, 427, 428, 429, 432], "importerror": [426, 427], "impos": 432, "impract": 372, "imprecis": 390, "impress": 432, "improv": [300, 302, 319, 347, 350, 361, 369, 370, 372, 376, 389, 400, 402, 404, 405, 409, 423, 429, 432], "in8": 158, "in_dt": 281, "in_end": 281, "in_pattern": 57, "in_start": 281, "inaccuraci": 372, "inappropri": 298, "inc": [28, 35, 246, 393], "incid": 298, "incit": 428, "includ": [18, 24, 25, 256, 257, 258, 264, 270, 279, 280, 281, 298, 301, 302, 304, 309, 314, 315, 320, 323, 324, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 336, 337, 338, 339, 340, 342, 343, 344, 347, 354, 357, 361, 363, 368, 369, 372, 373, 375, 376, 388, 389, 390, 398, 401, 409, 413, 415, 421, 423], "inclus": [24, 298], "incom": [25, 367, 408], "incomplet": [35, 372], "inconsist": 406, "incorpor": [25, 429], "incorrect": [338, 358, 361, 395], "increas": [307, 370, 402, 432], "independ": 372, "indetermin": 243, "index": [13, 23, 25, 36, 44, 55, 268, 281, 319, 332, 361, 372, 391, 394, 395, 421, 428], "index2an": [267, 268, 351], "index_file_jsonl_path": 376, "index_i": 258, "index_j": 258, "indexa": 402, "indexb": 402, "indic": [4, 23, 25, 33, 36, 44, 256, 257, 258, 281, 288, 314, 349, 395, 400, 401, 407, 409, 413, 416], "individu": [256, 257, 265, 298, 372], "indptr": 281, "ineffici": 429, "inevit": 405, "infer": [27, 38, 44, 49, 55, 60, 246, 272, 304, 306, 309, 312, 317, 318, 321, 325, 327, 328, 329, 337, 347, 358, 363, 367, 369, 385, 386, 387, 389, 390, 391, 392, 396, 403, 405, 406, 408, 413, 417, 420, 423, 428, 432, 437], "infer_framework_load_model": 45, "infer_task": 246, "inferen": 421, "inferenc": [158, 371], "inference_asr": 353, "inference_transl": 353, "inference_translation_revers": 353, "inference_tt": 353, "influenc": [372, 376, 387, 391], "influenti": 307, "info": [6, 57, 61, 62, 243, 244, 302, 307, 314, 345, 348, 349, 352, 354, 361, 383, 384, 387, 410, 422], "inform": [47, 256, 257, 270, 271, 274, 277, 282, 286, 298, 300, 302, 303, 316, 334, 338, 345, 358, 361, 371, 372, 378, 383, 384, 388, 389, 397, 401, 404, 411, 412, 413, 419, 420, 423, 424, 425, 426, 435], "inher": [319, 372, 373, 377], "inherit": [9, 32, 289, 303, 387, 394, 418, 419, 423], "init": [46, 252, 330, 332, 386, 388, 401, 432], "init_alpha": 247, "init_db_ai_photo": 334, "init_method": 252, "init_quant": 400, "init_similar_cache_from_config": 370, "initi": [33, 36, 44, 55, 57, 62, 65, 95, 281, 309, 321, 347, 357, 365, 369, 372, 373, 382, 400, 401, 405, 418, 419, 429], "initialis": 402, "inject": [246, 401], "injector": 437, "inlin": [278, 279, 280, 400], "inner": 407, "innerproduct": [55, 73, 158, 389, 390, 398], "innerproductreshapefus": [145, 150], "innerproductwithbiasgelu": 150, "innerproductwithslic": 150, "innerproductwithswish": 150, "innov": [272, 302, 378, 420, 421], "inp": 24, "inplac": 432, "input": [5, 9, 11, 12, 15, 23, 24, 25, 27, 32, 33, 36, 37, 44, 45, 52, 53, 55, 57, 62, 73, 181, 243, 244, 246, 260, 264, 265, 281, 289, 302, 303, 305, 306, 317, 319, 335, 336, 349, 350, 355, 356, 361, 363, 372, 373, 375, 380, 382, 388, 389, 390, 391, 394, 396, 404, 406, 407, 409, 413, 418, 421, 425, 428, 429, 432], "input_0": [55, 388, 390], "input_1": [55, 388, 390], "input_2": [55, 388, 390], "input_data": [55, 57, 150, 388], "input_dict": 264, "input_dim": [256, 257], "input_dt": [281, 400, 413], "input_fil": [150, 376], "input_id": [33, 36, 37, 40, 41, 44, 57, 306, 388, 396, 428, 429, 432], "input_mask": [57, 306, 388], "input_model": [302, 337, 389, 392, 393], "input_name_to_nod": 62, "input_path": [316, 319, 324, 338, 358, 372, 375], "input_pattern": [57, 395], "input_shap": [128, 389, 390, 413], "input_tensor": [36, 44, 52, 53, 54, 57, 62, 95, 243, 244, 387, 391], "input_tensor_nam": 389, "input_typ": 389, "inputdata": [154, 387], "inputfil": 155, "inputs_emb": [33, 36, 40, 41, 44], "inputs_shap": [55, 390], "inquire_config_item": 55, "insert": [55, 57, 392, 394, 395, 400, 401, 423], "insert_bf16_nod": 150, "insert_environ_info": 57, "insert_nod": 55, "insert_pattern": 57, "insert_quant_info": 57, "insert_quant_nod": 150, "insertbf16nod": 156, "insertquantnod": 157, "insid": [33, 57, 266, 314, 315, 349, 366, 391, 394, 404, 406], "insight": [319, 325, 378], "inspir": [314, 319, 345, 349, 361, 372, 383, 384], "inst": 389, "instal": [313, 314, 315, 316, 317, 318, 321, 335, 341, 346, 348, 349, 350, 351, 352, 358, 359, 360, 366, 367, 372, 374, 376, 377, 379, 380, 381, 382, 387, 412, 420, 426, 427, 432, 435], "install_chatbot_cpu": 361, "install_chatbot_gpu": 361, "install_rag_gpu": 362, "instanc": [6, 28, 246, 247, 248, 268, 289, 298, 309, 315, 322, 332, 348, 351, 365, 366, 372, 388, 389, 397, 411, 414, 416, 417, 418, 425, 426], "instance_group": [365, 366], "instanti": 35, "instead": [0, 25, 36, 44, 314, 349, 350, 361, 372, 402], "instruct": [9, 268, 295, 303, 309, 314, 315, 316, 319, 327, 328, 329, 330, 332, 335, 336, 337, 338, 342, 346, 351, 352, 358, 376, 380, 391, 400, 403, 405, 408, 409, 410, 413, 420, 421, 423, 428, 432, 438], "instruction_tuning_pipelin": 314, "instructor": [372, 375], "instrument": 24, "insult": 298, "int": [6, 23, 27, 28, 32, 36, 37, 39, 40, 57, 246, 247, 264, 281, 289, 371, 372, 387, 400, 401, 402, 405, 421], "int32": [62, 302, 388, 421], "int32_bia": 62, "int32_t": 281, "int4": [319, 320, 345, 375, 383, 384, 422, 425, 429, 432], "int4_clip": [421, 432], "int4_fullrang": [319, 421], "int4_gptq": 427, "int64_t": 281, "int8": [28, 45, 62, 246, 269, 270, 302, 304, 319, 375, 389, 390, 392, 398, 401, 406, 407, 413, 420, 421, 422, 423, 425, 426, 428, 429, 432, 439], "int8_bf16_mixed_precision_check": 150, "int8_bia": 62, "int8_bias_scal": 62, "int8_bias_zero_point": 62, "int8_lut": 401, "int8_lut_acc_test": 394, "int8_lut_optim": 394, "int8_model_path": 390, "int8_t": 281, "int8bf16mixedprecisioncheck": 158, "intact": 361, "integ": [25, 262, 305, 314, 349, 375, 390, 391, 409, 413, 419, 423, 428, 432], "integr": [309, 319, 335, 338, 357, 358, 360, 361, 363, 369, 372, 374, 380, 429, 439], "intel": [4, 8, 35, 247, 269, 270, 271, 289, 299, 300, 306, 309, 310, 311, 313, 314, 315, 316, 317, 318, 319, 320, 321, 323, 324, 325, 326, 330, 331, 332, 333, 334, 336, 337, 338, 339, 340, 342, 343, 344, 345, 346, 348, 350, 352, 354, 355, 360, 363, 364, 365, 366, 367, 369, 370, 372, 374, 375, 383, 384, 386, 387, 388, 397, 399, 410, 411, 415, 417, 418, 419, 420, 423, 424, 425, 426, 427, 428, 429], "intel_domain": [314, 349], "intel_extension_for_pytorch": 432, "intel_extension_for_transform": [289, 302, 303, 306, 307, 308, 309, 315, 317, 319, 322, 340, 347, 349, 354, 356, 358, 361, 362, 364, 365, 366, 369, 370, 371, 372, 373, 374, 387, 388, 390, 392, 395, 396, 398, 413, 416, 417, 418, 419, 421, 422, 423, 428, 429, 432], "intellig": [272, 420], "intend": [256, 257, 300, 334, 336, 337, 338, 340, 343, 344, 363], "intens": [372, 385], "intent": [11, 15, 372], "intentdetector": 372, "interact": [309, 316, 321, 378], "interact_featur": 150, "interactfeatur": 159, "interconnect": 372, "interest": [25, 298, 316], "interfac": [28, 33, 36, 44, 49, 245, 279, 305, 355, 369, 378, 386, 398], "intermedi": [36, 44, 387, 392, 395, 409, 423], "intermediatelayersknowledgedistillationlossconfig": 303, "intermediatelayersloss": 303, "intern": [25, 57, 376, 391, 405], "internation": 376, "internet": [313, 314, 316, 355, 372], "interpol": 264, "interpret": 316, "intersect": [25, 266], "interv": [24, 413], "intrins": 403, "introduc": [295, 309, 319, 333, 336, 337, 338, 339, 342, 353, 357, 358, 361, 372, 399, 400, 401, 402, 403, 405, 407, 408, 409, 423, 428, 438], "introduct": 413, "intuit": [395, 405, 432], "invalid": 413, "invers": [30, 408], "investig": [298, 409], "invit": 270, "invok": 24, "invoke_with_optional_arg": 24, "involv": [316, 319, 321, 325, 372], "io": [281, 361, 364, 365, 394], "iob": 266, "iou": [25, 260, 263, 266], "ip": [313, 314, 316, 317, 318, 322, 330, 332, 334, 337, 345, 349, 363, 375, 383, 384], "ipc": [313, 314, 315, 316, 317, 318, 347, 366], "ipex": [269, 308, 314, 348, 349, 369, 423, 428, 432], "ipex_opt_llm": 247, "ipykernel": 322, "ipynb": 322, "ir": [55, 337, 387, 388, 389, 390, 396, 410, 412, 439], "ir_path": 392, "irc_na": 332, "irq": [397, 411], "is_avail": 374, "is_decod": [36, 44], "is_mast": 264, "is_null_numpy_valu": 25, "is_rel": [250, 306, 416], "is_supported_onnx_graph": 62, "is_supported_onnx_nod": 62, "is_thing_map": 260, "isa": [390, 398, 400, 405, 408, 409, 410], "ise": [309, 313, 331], "issu": [271, 288, 295, 298, 300, 302, 314, 332, 335, 349, 361, 370, 372, 380, 406, 423, 429, 438], "it_per_cor": 414, "itai": 301, "itali": [36, 377], "item": [25, 57, 289, 302, 306, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 338, 340, 343, 344, 363, 372, 376], "iter": [25, 28, 247, 266, 289, 302, 388, 390, 394, 396, 400, 404, 407, 408, 413, 414, 428], "iteration4": 389, "iterator_get_next": 83, "iterator_v2": [83, 387], "iteratorgetnext": 84, "iteratorv2": [85, 387], "itrex": [302, 308, 320, 322, 348, 349, 361, 362, 366, 386, 388, 421], "itrex_v": [314, 315, 347], "itrexquantizationconfigmixin": 247, "its": [9, 24, 57, 181, 266, 298, 300, 303, 309, 313, 321, 335, 346, 347, 352, 355, 360, 369, 372, 374, 377, 380, 387, 388, 391, 392, 395, 404, 405, 406, 409, 412, 413, 415, 429, 432], "itself": [369, 372, 410], "itt": 410, "j": [272, 302, 304, 309, 386, 387, 388, 398, 404, 409, 410, 413, 428, 432], "j8": 330, "jakob": [36, 44], "jan": 420, "japanes": 369, "jax": 354, "jd": [278, 279, 280, 281, 401, 413], "jemalloc": [323, 324, 331, 334, 336, 337, 338, 340, 343, 344, 354, 357, 358, 363, 368], "ji": 432, "jiafu": 301, "jianyu": 301, "jiqe": 301, "jit": [27, 28, 246, 278, 279, 280, 281, 289, 293, 315, 398, 400, 401, 404, 408, 413, 436], "jit_binary_injector": 400, "jit_domain": 281, "jit_eltwiseop_t": 401, "jit_gener": [400, 401], "job": 313, "jogesh": 301, "johnson": 377, "join_with_spac": 266, "jonatasgrosman": 353, "jonathan": 301, "jone": [36, 44], "journalist": 349, "journei": 361, "jpg": [350, 412], "json": [247, 267, 309, 313, 314, 316, 317, 318, 319, 321, 324, 340, 349, 350, 351, 354, 361, 363, 367, 372, 375, 376, 377, 422], "json_file_path": 247, "jsonl": [319, 372, 376], "jstor": 25, "juli": [377, 420], "jump": 340, "jun": 361, "june": 420, "jung": 432, "jupyt": [320, 322, 347], "just": [24, 25, 44, 57, 302, 314, 315, 316, 349, 355, 356, 366, 372, 387, 388, 389, 390, 391, 392, 395, 401, 409, 412, 416, 420, 422, 430, 432], "k": [25, 32, 264, 281, 372, 376, 390, 400, 402, 403, 404, 405, 407, 408, 409, 411, 413, 432], "k_bia": 281, "k_dim_dp": 405, "k_proj": [346, 347, 349, 354], "k_scale": 281, "k_weight": 281, "kaiser": [36, 44], "kamboj": 301, "karnin": 25, "karrasdiffusionschedul": 9, "kd": [279, 303], "kdim": 281, "kdp": 279, "keep": [0, 25, 266, 340, 345, 367, 383, 384, 391, 427], "keep_dim": 387, "keep_high": 266, "kei": [25, 32, 36, 39, 40, 44, 47, 55, 57, 62, 243, 246, 247, 256, 257, 267, 272, 302, 316, 321, 330, 332, 334, 335, 361, 367, 369, 372, 380, 389, 390, 391, 400, 401, 403, 429], "keithito": 262, "kept": 29, "ker_kind": [280, 398, 401], "ker_kind_": [280, 401], "ker_per_batch": 281, "ker_prop": [280, 398, 401], "ker_prop_": [280, 401], "kernel": [269, 273, 281, 297, 299, 319, 322, 388, 389, 394, 397, 399, 400, 401, 403, 404, 406, 407, 408, 409, 410, 411, 412, 440], "kernel_config": [390, 413], "kernel_desc_proxi": 279, "kernel_desc_t": 279, "kernel_kind": [279, 280, 401], "kernel_nam": [390, 413], "kernel_prop": [280, 401], "kernel_proxi": 279, "kernel_t": [278, 279], "kernel_typ": [413, 414], "kevin": 301, "kevinintel": 299, "key_stat": [39, 40], "key_value_st": 37, "keyboard": 355, "keygen": [330, 332], "keynot": 420, "keyword": [2, 25, 246, 372], "kgco2e": 385, "kim": 432, "kind": [57, 138, 280, 314, 349, 361, 365, 366, 394, 406, 413], "kind_cpu": 366, "kind_gpu": 365, "kindli": [0, 270], "kll": 25, "km": 390, "kmp_affin": 354, "kmp_blocktim": 354, "kmp_set": 354, "kn": 390, "know": [390, 396, 403], "knowledg": [246, 272, 302, 316, 319, 320, 335, 341, 361, 372, 379, 380, 381, 432], "knowledge_a100_url": 379, "knowledge_gaudi2_url": 379, "knowledge_url": [335, 380], "knowledgebas": 372, "knowledgedistillationloss": 303, "knowledgedistillationlossconfig": 303, "knowledgeloss": 303, "known": [293, 302, 338, 358, 361, 398, 436], "korat": 301, "kpo": 281, "krishna2020": 304, "kullback": 303, "kv": [39, 40, 57, 307, 429], "kv_cache_compress": 307, "kv_cache_inc_s": 332, "kwarg": [14, 17, 24, 25, 28, 32, 33, 35, 36, 40, 41, 44, 47, 61, 128, 246, 247, 372, 432], "kwon": 432, "kxn": [413, 421], "l": [30, 303, 331, 350], "l1": [256, 257, 399], "l2": [399, 405, 413], "l6": 304, "l_mpi_oneapi_p_2021": 332, "la": [304, 402], "lab": [43, 349, 420], "label": [33, 36, 44, 246, 256, 257, 258, 260, 266, 372, 414, 418], "label2id": [302, 306], "label_id": 302, "labor": 372, "lack": 372, "laion": 350, "lake": 302, "lamabda": 372, "lamb": 288, "lambada": [289, 426, 428], "lambada_openai": [426, 427], "lambda": [302, 425], "lambdalab": 304, "lamini": 428, "landmark": [21, 377], "lang": 25, "langchain": [14, 319, 320, 357], "langchain_commun": [309, 372], "langchain_cor": [309, 372], "languag": [4, 33, 36, 44, 272, 298, 302, 304, 319, 325, 340, 342, 346, 349, 350, 352, 354, 359, 363, 368, 371, 372, 373, 418, 420, 421, 422, 428, 429, 430, 432], "laptop": [327, 328, 329, 420], "larg": [4, 9, 14, 25, 303, 304, 319, 325, 346, 349, 350, 353, 354, 363, 371, 372, 373, 375, 376, 395, 396, 397, 399, 402, 405, 406, 407, 413, 420, 421, 422, 428, 430, 432], "large_wei_threshold": 405, "large_weight_threshold": 413, "larger": [17, 25, 246, 372, 376, 428], "largest": [25, 266], "lasso": 304, "last": [0, 17, 23, 24, 25, 36, 44, 57, 246, 361, 391, 395, 396, 399, 404, 405, 407, 423, 428], "last_lay": 24, "last_layer_shap": 150, "lastlayershap": 160, "latanc": 389, "latenc": [28, 289, 302, 304, 373, 382, 385, 389, 397, 402, 420, 423, 425, 429], "latency_constraint": 28, "latent": [9, 369], "later": [24, 25, 57, 266, 349, 387, 395], "latest": [315, 317, 318, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 347, 357, 363, 366, 368, 372, 383, 384, 410, 424, 426, 427], "latrang": 73, "latter": [355, 408], "launch": [314, 316, 321, 327, 328, 329, 345, 347, 348, 349, 352, 361, 363, 383, 384, 408], "launcher": 1, "law": 307, "layer": [23, 24, 28, 29, 36, 44, 47, 57, 256, 257, 261, 350, 389, 395, 400, 404, 407, 419, 420, 421, 428, 437], "layer1": 24, "layer2": 24, "layer_0": 395, "layer_1": 395, "layer_2": 395, "layer_config": [29, 36, 44], "layer_dropout": 29, "layer_dropout_bound": [28, 29], "layer_dropout_prob": [28, 29], "layer_head_mask": 37, "layer_idx": 32, "layer_norm": [83, 150, 387], "layer_norm_with_reduce_mean": [150, 387], "layer_norm_with_transpos": 150, "layer_wis": 247, "layernam": 24, "layernorm": [49, 57, 86, 161, 387, 391, 395, 398, 413], "layernorm_ba": 279, "layernorm_ba_data_t": 281, "layernorm_ba_desc": [279, 400], "layernorm_ba_param_t": 281, "layernormalized_spmm": 279, "layernormalized_spmm_desc": 279, "layernormwithreducemean": [162, 387], "layernormwithtranspos": 163, "layernrom": 406, "layout": [403, 406, 407, 408], "layternorm": 406, "lazi": [57, 340], "lazyimport": 57, "ld_preload": [354, 426, 427], "le": 401, "lead": [309, 372, 377, 396, 407, 409, 432], "leaderboard": [309, 346, 347, 372, 420], "leadership": 298, "learn": [17, 37, 259, 272, 317, 319, 320, 346, 347, 361, 363, 372, 392, 401, 417, 420, 423, 425], "learning_r": [314, 346, 347, 348, 349, 352, 354, 376, 422], "least": [25, 47, 57, 258, 266, 300, 406], "leav": [25, 351, 361, 391, 407, 409, 413], "lecun": 419, "lee": 432, "left": [23, 24, 36, 44, 57, 260, 266, 335, 361, 380, 403, 407, 409], "legaci": [36, 44], "legal": 435, "legend": 407, "leibler": 303, "len": [256, 257, 258, 263, 387, 388, 395, 407], "length": [28, 29, 36, 44, 57, 272, 288, 302, 314, 349, 350, 361, 372, 376, 391, 395, 400, 413, 420, 423, 425, 429, 430], "length_config": [28, 36, 44, 306], "length_drop_prob": 29, "length_drop_ratio": 29, "length_drop_ratio_bound": [28, 29], "lengthi": 316, "lengthier": 372, "less": [29, 266, 289, 303, 370, 390, 405, 409, 419], "lesson": 361, "let": [355, 369, 389, 394, 402, 403, 428], "level": [6, 9, 44, 61, 268, 270, 272, 298, 302, 319, 326, 330, 332, 370, 378, 390, 401, 404, 412, 432], "levequ": 25, "leverag": [272, 302, 303, 306, 309, 319, 337, 359, 370, 372, 420, 421, 426, 429], "lf": [334, 338, 348, 349, 357, 363, 368], "lh": 407, "li": 403, "liangliang": 301, "lib": [300, 354, 361, 386, 388], "lib64": [426, 427], "liberti": 25, "libgl": [308, 334], "libgl1": [308, 309, 334], "libiomp": 354, "libiomp5": 354, "libjpeg": 361, "libkernellib": 386, "libneural_engin": 386, "libpng": 361, "librari": [8, 9, 293, 308, 321, 323, 324, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 348, 349, 354, 357, 359, 363, 368, 370, 400, 401, 421, 426, 427, 436], "libsm": 308, "libsm6": 308, "libssl1": 369, "libstdc": [426, 427], "libstdc_path_": [426, 427], "libstdcxx": 426, "libtcmalloc": 354, "libxext": 308, "libxext6": 308, "libxrend": 308, "libxsmm": 332, "licens": [270, 300, 372], "life": [361, 396], "lifelong": 361, "lifengwang": 301, "lifetim": 361, "light": 36, "lighter": 407, "lightweight": [346, 347], "lihongzhi": 319, "like": [25, 49, 52, 53, 54, 57, 243, 302, 303, 306, 309, 314, 317, 318, 319, 338, 345, 347, 349, 352, 353, 354, 355, 358, 363, 364, 365, 366, 369, 370, 372, 376, 383, 384, 385, 387, 388, 389, 390, 391, 392, 395, 396, 400, 401, 403, 410, 416, 417, 419, 423, 428, 432], "likelihood": [256, 257, 346, 347, 372], "limit": [25, 28, 303, 335, 372, 380, 404, 408, 426, 427, 429, 432], "limitless": 420, "lin": 432, "line": [1, 265, 267, 314, 319, 336, 337, 338, 348, 349, 356, 358, 361, 376, 385, 387, 390, 399, 406, 407, 409, 414, 432], "linear": [28, 303, 401, 404, 407, 421, 432], "lineup": [309, 321], "link": [270, 300, 319, 320, 321, 322, 325, 327, 328, 329, 333, 338, 339, 342, 361, 365, 372, 379, 388, 394, 430, 432], "linux": [308, 323, 324, 326, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 357, 361, 363, 368, 383, 384, 386], "list": [21, 23, 24, 25, 28, 36, 40, 41, 44, 47, 52, 53, 54, 55, 57, 62, 95, 243, 244, 246, 256, 257, 258, 260, 262, 264, 265, 267, 268, 270, 302, 303, 309, 321, 336, 337, 345, 348, 351, 371, 372, 375, 376, 383, 386, 387, 388, 391, 396, 400, 401, 414, 419, 426, 427, 428], "list2str": [57, 387], "listconstruct": 73, "listen": 355, "listunpack": 73, "liter": 23, "littl": [361, 423, 426, 427, 428, 429, 432], "liuhaotian": [350, 351], "live": 361, "livestream": 420, "lkk12014402": 299, "ll": [327, 328, 329, 332, 402], "llama": [32, 309, 314, 319, 323, 332, 346, 347, 350, 352, 363, 372, 377, 396, 420, 422, 428, 432], "llama2": [314, 366], "llama2_7b_rm": 352, "llama2_7b_s": 352, "llama2_ds_zero3_config": 349, "llama2_peft_finetuned_model": [314, 349], "llama_embed": 150, "llama_matmulwithtranspos": 150, "llama_peft_finetuned_model": 422, "llama_postprocess": 150, "llama_rotary_pos_emb": 150, "llamaattent": 32, "llamaconfig": 32, "llamaembed": 164, "llamaflashattention2": 32, "llamaforcausallm": 307, "llamamatmulwithtranspos": 165, "llamapostprocess": 166, "llamaroraryposemb": 167, "llamasdpaattent": 32, "llava": 350, "llava1": 351, "llion": [36, 44], "llm": [4, 8, 11, 12, 15, 43, 309, 314, 320, 321, 323, 324, 325, 326, 327, 328, 329, 330, 332, 336, 337, 338, 340, 346, 347, 349, 350, 354, 358, 361, 363, 364, 366, 367, 370, 371, 372, 373, 375, 420, 421, 422, 427, 428, 432], "llm_carbon_calc": 385, "llm_tt": 340, "llma_url": [335, 380], "lm": [20, 22, 325, 349], "lm3d": 20, "lm_eval_task": 349, "lm_new": 20, "lmsdiscreteschedul": 9, "ln": [261, 308, 309], "ln_node_idx": 387, "ln_pattern": 395, "lo": 403, "load": [9, 10, 19, 25, 30, 33, 36, 44, 45, 60, 246, 247, 267, 302, 309, 320, 361, 372, 387, 388, 389, 390, 392, 396, 399, 401, 402, 403, 404, 409, 428, 432], "load_cached_st": 25, "load_dataset": [289, 302], "load_graph": 302, "load_in_4bit": [422, 432], "load_in_8bit": 432, "load_mat": 18, "load_metr": 302, "load_param": 401, "load_state_dict": 25, "load_stor": 30, "load_store_fil": 28, "load_tf_weights_in_bert": 36, "load_weight": 55, "loaded_model": 432, "loader": [25, 49, 58, 390, 392, 395], "loading_config": 319, "loadingmodelconfig": 319, "loc": 361, "local": [35, 246, 269, 314, 315, 316, 319, 321, 325, 330, 332, 334, 338, 345, 348, 349, 351, 358, 359, 361, 363, 366, 370, 371, 372, 375, 379, 383, 384, 387, 399, 402, 405, 419, 427, 429], "local_step": 47, "localhost": [313, 314, 315, 316, 318, 321, 322, 324, 330, 332, 340, 347, 353, 361, 364, 365, 366, 367, 372], "localmemori": 402, "locat": [57, 121, 309, 312, 327, 328, 329, 332, 371, 372, 373, 377, 387, 388, 391, 395, 409, 413, 424], "lock": 419, "log": [6, 61, 256, 257, 265, 302, 309, 330, 345, 349, 361, 375, 383, 384, 388, 394], "log_fil": [6, 309, 375], "log_level": [6, 314, 348, 349, 352, 354, 422], "log_nam": 265, "log_softmax": 83, "log_with": 352, "logger": [6, 58, 410], "logging_step": [314, 346, 347, 348, 349, 352, 354, 376, 422], "logic": [39, 40, 351, 408, 410], "login": 334, "logit": [36, 44, 256, 257, 258, 302, 303, 306, 388], "logo": 415, "logsoftmax": [87, 279], "logsoftmax_desc": 279, "long": [24, 314, 315, 372, 395], "longer": [314, 349, 395], "longest": [57, 395], "longform": 304, "longtensor": [36, 40, 41, 44], "look": [314, 349, 372, 387, 389, 401, 402], "loop": [25, 73, 387, 400, 402, 407], "lora": [314, 349, 350, 352, 354, 422, 425], "lora_all_linear": [346, 347], "lora_alpha": [346, 347, 349, 354], "lora_dropout": [346, 347], "lora_rank": [346, 347, 349], "lora_target_modul": [346, 347, 349, 354], "loraconfig": 422, "loss": [33, 36, 44, 246, 256, 257, 260, 303, 314, 349, 423, 432], "loss_bbox_unsc": 265, "loss_box": [256, 257], "loss_cardin": [256, 257], "loss_label": [256, 257], "loss_mask": [256, 257], "lossi": 423, "lot": 403, "louie": 301, "low": [246, 266, 302, 306, 354, 370, 399, 406, 408, 417, 420, 421, 422, 423, 432, 439], "lower": [30, 266, 268, 354, 372, 409, 417, 423, 432], "lower_all_tupl": 150, "lower_bound": 413, "lower_constraint": 30, "loweralltupl": 168, "lpta": 402, "lr": [247, 432], "lr_scheduler_typ": [346, 347], "lsap": 258, "lscpu": 331, "lt": [397, 411], "luca": 301, "lukasz": [36, 44], "luoyu": 299, "lut": [281, 398, 400, 401, 413], "lv": 432, "lvliang": 299, "lvwerra": 304, "m": [25, 57, 263, 281, 302, 303, 304, 331, 332, 334, 348, 349, 355, 371, 385, 389, 390, 397, 399, 402, 403, 404, 405, 406, 408, 409, 411, 413, 425, 426, 427, 432], "m150": 432, "m_tile": 281, "ma": 301, "mac": 30, "machin": [361, 365, 366, 379, 413], "made": [25, 314, 349, 361, 372, 423], "magicod": [309, 313, 331], "magnitud": 419, "mai": [49, 57, 278, 279, 280, 281, 298, 300, 302, 322, 335, 351, 367, 372, 373, 380, 387, 390, 395, 396, 402, 403, 404, 406, 407, 408, 409, 413, 415, 420, 423, 429, 432], "mail": [298, 349], "main": [22, 36, 43, 44, 47, 57, 246, 316, 319, 347, 352, 354, 356, 357, 360, 368, 369, 391, 406, 413], "main_eval_onli": 351, "main_parse_and_ev": 351, "mainli": [347, 372, 390, 405, 406], "maintain": [0, 269, 270, 288, 298, 300, 302, 307, 372, 373, 391, 396, 424, 432], "major": [399, 405, 406, 408, 409, 423], "majotr": 406, "make": [13, 24, 32, 38, 57, 95, 181, 246, 247, 266, 268, 289, 298, 308, 309, 314, 315, 316, 317, 318, 321, 323, 324, 326, 330, 332, 335, 337, 340, 343, 345, 349, 350, 355, 361, 362, 366, 369, 372, 377, 380, 382, 383, 384, 386, 387, 388, 398, 399, 400, 401, 402, 404, 405, 406, 407, 410, 413, 428], "make_load": 25, "make_posit": 44, "makeiter": 73, "maktukmak": 301, "malloc": [354, 396], "mamou": 301, "manag": [0, 24, 25, 345, 367, 372, 383, 384, 394, 396], "mandarin": 353, "mandatori": [57, 314, 349, 371], "mani": [29, 338, 358, 361, 372, 387, 389, 391, 400, 402, 403, 406, 408, 413, 428], "manipul": [263, 369], "manual": [314, 347, 349, 355, 410], "manufactur": [397, 411], "map": [52, 53, 57, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 74, 75, 76, 77, 78, 79, 80, 81, 82, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 122, 123, 124, 125, 126, 127, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 151, 152, 153, 154, 155, 156, 157, 159, 160, 161, 162, 163, 164, 165, 166, 167, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 182, 183, 185, 186, 187, 188, 189, 190, 191, 193, 194, 196, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 216, 217, 218, 221, 222, 223, 224, 225, 226, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 246, 265, 302, 303, 365, 373, 396, 399, 423], "map_and_batch_dataset": [83, 387], "mapandbatchdataset": [88, 387], "mapping_config": 57, "mapping_dict": 57, "mar": 377, "march": 420, "margin": 2, "mark": [24, 361], "markdown": [319, 372], "marktechpost": 420, "marvel": 377, "mask": [20, 33, 36, 44, 256, 257, 260, 263, 281, 304, 373, 400, 401, 403, 405, 408], "mask_mock1": 401, "mask_new": 20, "masked_fil": 73, "maskedlmoutput": [36, 44], "maskheadsmallconv": 260, "maskinun": 304, "masks_to_box": 263, "master": [264, 314, 349, 419], "master_addr": [252, 314, 349], "master_address": [314, 349], "master_port": [252, 347], "mata": 281, "matb": 281, "matc": 281, "match": [24, 25, 57, 247, 256, 257, 258, 303, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 338, 340, 341, 343, 344, 363, 370, 372, 381, 390, 391, 395, 404, 407], "match_criteria": 266, "match_result": 57, "match_threshold": 266, "matcher": [215, 256, 257, 392], "matchtyp": 373, "matd": 281, "mate": 281, "math": 423, "mathemat": [401, 428], "matmul": [39, 40, 62, 73, 83, 158, 305, 389, 391, 392, 395, 398, 408, 413, 432, 437], "matmul_346": 389, "matmul_357": 389, "matmul_358": 389, "matmul_data_t": 281, "matmul_fp8_data_t": 281, "matmul_fp8_param_t": 281, "matmul_input": 281, "matmul_io": 281, "matmul_io_max": 281, "matmul_output": 281, "matmul_param_t": 281, "matmul_u8_data_t": 281, "matmul_with_bia": 150, "matmul_with_bias_add": 150, "matmul_with_bias_gelu": 150, "matmul_with_bias_relu": 150, "matmul_with_bias_sigmoid": 150, "matmul_with_bias_tanh": 150, "matmul_with_bias_unsqueez": 150, "matmul_with_transpos": 150, "matmul_with_transpose_scale_add": 150, "matmulwithbia": [73, 169], "matmulwithbiasadd": [73, 170], "matmulwithbiasgelu": [73, 171], "matmulwithbiasrelu": [73, 172], "matmulwithbiassigmoid": [73, 173], "matmulwithbiastanh": [73, 174], "matmulwithbiasunsqueez": 175, "matmulwithtranspos": [176, 177], "matmulwithtransposescaleadd": 177, "matplotlib": 265, "matric": [62, 354, 402, 407, 408, 432], "matrix": [25, 263, 288, 302, 306, 399, 402, 403, 404, 406, 407, 408, 409, 413, 419, 433], "matter": 430, "max": [25, 29, 73, 246, 247, 302, 308, 309, 324, 349, 371, 372, 376, 396, 397, 400, 402, 404, 409, 411, 423, 432], "max_chuck_s": 372, "max_eval_sampl": 354, "max_input_length": 432, "max_input_shapes_list": 396, "max_length": [28, 289, 302, 346, 347, 375], "max_new_token": [317, 363, 371, 426, 427, 428, 429, 432], "max_prompt_length": [346, 347], "max_seq_length": [29, 30], "max_sparsity_ratio_per_op": 28, "max_step": [346, 347], "max_thread": 361, "max_tile_k": 405, "max_token": 321, "max_train_sampl": [354, 422], "max_trial": 423, "maxim": 2, "maxima": 266, "maximum": [28, 37, 346, 347, 349, 372, 396, 397, 411, 423], "mayb": [57, 347, 390, 409, 420], "mb": [304, 385, 425], "mbzuai": 428, "mc": 346, "mc1": 425, "mc2": 425, "me": [4, 309, 311, 316, 318, 319, 321, 361, 364, 365, 366, 367, 370, 371, 375], "mean": [25, 33, 36, 44, 57, 83, 266, 281, 316, 365, 376, 387, 388, 389, 390, 391, 395, 396, 399, 400, 402, 406, 409, 413, 416, 419, 425], "mean_in": 281, "mean_out": 281, "mean_var_reduce_data_t": 281, "mean_var_reduce_param_t": 281, "meanwhil": [304, 372, 399, 405], "measur": [266, 270, 289, 303, 385, 398, 416, 417, 419, 423], "mechan": 376, "media": 298, "median": 25, "medic": 371, "medium": [272, 302, 414, 420], "medium_n": 414, "meet": [266, 278, 279, 280, 281, 308, 309, 361, 371, 372, 387, 403, 405, 409, 421, 429, 432], "mem": 385, "member": [279, 280, 281, 298, 394, 400, 401], "memori": [9, 25, 288, 307, 325, 361, 367, 372, 385, 394, 396, 400, 401, 402, 403, 404, 406, 407, 408, 409, 417, 422, 423, 425, 426, 428, 429, 432], "memory_args_": 394, "memory_storage_t": 278, "meng": 415, "mention": [270, 371], "merg": [340, 352, 390, 395], "merge_dst": 281, "merge_peft_adapt": 352, "merge_src": 281, "merged_embeddingbag": 150, "mergedembeddingbag": [73, 178], "mesa": [308, 309, 334], "mesh": 354, "messag": [0, 309, 316, 321, 324, 361, 385], "met": [295, 361, 438], "meta": [9, 309, 314, 332, 346, 347, 352, 377, 420, 422, 432], "metadata": [348, 372], "meter": 377, "method": [9, 25, 30, 57, 246, 247, 270, 288, 304, 307, 314, 316, 319, 349, 352, 361, 372, 373, 376, 400, 403, 405, 408, 410, 423, 428, 432], "meticul": 319, "metric": [28, 246, 249, 266, 270, 302, 306, 346, 376, 423, 434], "mha": [398, 437], "mha_dens": [279, 413], "mha_dense_desc": 279, "mhattent": 261, "mhattentionmap": 260, "micro": [390, 399, 404, 409], "micro_b": 413, "micro_oc": 413, "microarchitectur": 361, "microcod": [397, 411, 425, 426], "microkernel": 404, "microsoft": [266, 309, 327, 328, 329, 330], "midst": 372, "might": [24, 295, 319, 361, 370, 391, 438], "migrat": [309, 428], "miko\u0142aj": 301, "mimic": 423, "min": [25, 29, 246, 258, 372, 423, 432], "min_chuck_s": 372, "min_length": 29, "min_sparsity_ratio_per_op": 28, "mind": [0, 319, 420], "mine_hard_neg": 376, "mini": [304, 385, 389, 393, 397, 430], "mini_batch_s": 352, "miniconda": [323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 354, 357, 363, 368, 383, 384], "miniconda3": [323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 357, 363, 368, 383, 384], "minilm": [272, 302, 304, 306, 420, 430], "minilmv2": 304, "minim": [303, 361, 370, 371, 409], "minimum": [325, 372], "minmax": [246, 432], "minmax_lr": [247, 432], "minor": [314, 349], "minut": [309, 430], "misc": [255, 256, 257, 415], "miscellan": 18, "misinterpret": 372, "miss": [349, 361, 399, 409], "mistral": [309, 347, 350, 420], "mistral_peft_finetuned_model": 349, "mistralai": [309, 347, 349], "mit": 43, "mitig": [372, 432], "mix": [158, 314, 320, 341, 349, 361, 381, 390], "mix665k": 350, "mixedprecisionconfig": 319, "mixin": 247, "mixtral": [309, 349], "mixtral_peft_finetuned_model": 349, "mixtur": 350, "mk": 390, "mkdir": [21, 334, 364, 365, 366, 386, 388, 398, 410, 413], "mkl": [323, 324, 331, 334, 336, 337, 338, 340, 343, 344, 354, 357, 363, 368], "mkl_layer_norm": 83, "ml": [345, 383, 384], "mleffici": [272, 302, 420], "mlm": 304, "mlp": [256, 257, 350], "mlp2x_gelu": 350, "mlperf": [272, 420], "mm_projector_typ": 350, "mmkmb": 390, "mmlu": [288, 346], "mmmu_ev": 350, "mmr": [2, 372], "mmxmb": 390, "mnli": [304, 354], "moat": 420, "mobil": [303, 378], "mobilebert": 303, "mod2": 402, "modal": [319, 420], "mode": [27, 48, 55, 264, 307, 331, 332, 335, 380, 389, 393, 406, 408, 413, 414, 423], "model": [0, 4, 5, 9, 19, 23, 24, 27, 28, 30, 45, 47, 49, 52, 53, 54, 55, 57, 60, 62, 129, 130, 131, 132, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 182, 183, 185, 186, 187, 188, 189, 190, 191, 193, 194, 196, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 224, 225, 226, 228, 229, 230, 241, 242, 244, 246, 247, 266, 268, 269, 270, 286, 303, 304, 306, 313, 314, 315, 316, 319, 320, 321, 323, 324, 325, 326, 330, 331, 332, 335, 340, 342, 345, 346, 347, 351, 354, 356, 357, 358, 361, 364, 365, 366, 369, 371, 372, 373, 375, 380, 383, 384, 386, 387, 390, 391, 395, 396, 397, 400, 405, 406, 407, 408, 411, 415, 416, 417, 419, 420, 421, 422, 423, 426, 427, 429, 430, 432, 439], "model_and_token": [389, 392, 393], "model_class": 45, "model_dataset": 83, "model_dir": 314, "model_doc": 9, "model_format": [426, 427], "model_id": 429, "model_infer": 389, "model_input": 40, "model_kwarg": [36, 44, 45, 418], "model_max_length": 350, "model_nam": [289, 351, 352, 376, 418, 426, 427, 429, 432], "model_name_or_path": [27, 35, 246, 289, 309, 314, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 338, 340, 343, 344, 346, 347, 348, 349, 352, 354, 363, 371, 375, 376, 422, 428, 432], "model_path": [351, 375, 390, 396, 426, 427, 432], "model_pix2pix": 337, "modelargu": 5, "modeldataset": 92, "modeling_bert_dynam": 34, "modeling_output": [33, 36, 40, 41, 44], "modeling_roberta_dynam": 34, "models": [251, 304, 417], "moder": 309, "modern": [399, 432], "modif": [9, 261, 309, 314, 349, 377, 389], "modifi": [24, 44, 47, 50, 55, 57, 158, 181, 300, 309, 335, 337, 341, 349, 371, 372, 373, 375, 377, 379, 380, 381, 382, 388, 389, 392], "modify_node_connect": 55, "modul": [51, 56, 58, 59, 83, 150, 245, 303, 337, 361, 369, 392, 393, 421, 432], "module_nam": 57, "module_templ": 23, "moemodeloutputwithpast": 40, "moment": [25, 395], "momentum": 419, "monetari": 371, "more": [23, 25, 50, 52, 53, 57, 258, 259, 266, 271, 300, 303, 306, 307, 309, 314, 316, 319, 325, 345, 346, 349, 357, 363, 369, 372, 373, 376, 383, 384, 385, 386, 387, 389, 391, 392, 394, 395, 397, 398, 399, 400, 403, 405, 406, 407, 409, 411, 412, 413, 420, 421, 425, 426, 428, 429, 432], "mosaicml": [309, 314, 315, 346, 349, 428], "moshew": [304, 393], "most": [246, 266, 302, 316, 347, 363, 372, 376, 377, 391, 395, 396, 400, 401, 402, 405, 407, 418, 420, 429], "mostli": [57, 264, 359, 360, 374, 395], "motiv": 432, "mount": [314, 315, 316, 366], "mount_dir": 315, "mov": [400, 410], "move": [9, 25, 42, 400], "mp3": [340, 355, 369], "mp4": 374, "mpi": [314, 332, 349], "mpirun": [314, 349], "mpnet": 376, "mpt": [309, 314, 315, 428], "mpt_7b": 346, "mpt_peft_finetuned_model": [314, 349], "mrpc": [304, 392, 393], "mrr": 376, "mse_rang": 247, "msft": 359, "msg": [61, 361], "mt": [304, 353, 412, 425, 426], "much": [25, 266, 372, 392, 402], "mul": [57, 387, 391, 395, 400], "mul_1": 395, "mul_2": 395, "mult": [350, 372], "multi": [1, 32, 256, 257, 319, 326, 340, 351, 352, 354, 359, 361, 388, 389, 390, 420], "multi_gpu": 352, "multiheadattenion": 73, "multilang": 336, "multilangtexttospeech": 369, "multimod": [321, 350, 418], "multipart": 340, "multipl": [2, 24, 25, 36, 44, 243, 248, 260, 266, 267, 268, 289, 304, 319, 320, 351, 361, 369, 372, 379, 387, 389, 401, 402, 404, 405, 406, 407, 408, 409, 413, 416, 417, 430], "multiplechoicemodeloutput": [36, 44], "multipli": [399, 405, 409, 423], "multius": 361, "must": [57, 247, 256, 257, 266, 289, 299, 308, 319, 356, 371, 372, 391, 395, 399, 400, 402, 409, 421], "mutable_data": 394, "mutat": [28, 30], "mutation_prob": [28, 30], "mutation_s": 28, "mutual": 372, "mxfp4": 332, "mxk": [399, 413], "mxkxn": 409, "mxn": [402, 408, 413, 421], "my": [36, 44, 349], "mydataset": 25, "myenv": [327, 328, 329], "mymean": 25, "mysql_db": 334, "mysql_host": 334, "mysql_password": 334, "mysql_port": 334, "mysql_us": 334, "n": [25, 30, 36, 44, 57, 255, 263, 281, 303, 314, 322, 323, 327, 328, 329, 330, 331, 332, 337, 349, 354, 361, 362, 385, 390, 391, 393, 397, 399, 402, 403, 404, 405, 408, 409, 411, 413, 421, 426, 427], "n1": 361, "n2": 361, "n3": 361, "n4": 361, "n5": 361, "n_discard": 429, "n_keep": 429, "n_layer": [36, 44], "n_rep": [39, 40], "n_sampl": 247, "n_tile": 281, "na": [57, 319, 398], "naiv": 406, "naive_gemm": 402, "nalamati": 301, "name": [0, 6, 24, 28, 35, 45, 52, 53, 54, 55, 57, 62, 95, 121, 184, 243, 246, 247, 250, 251, 255, 262, 265, 269, 302, 303, 304, 306, 307, 309, 313, 314, 315, 316, 331, 332, 348, 349, 351, 353, 355, 366, 371, 372, 373, 375, 376, 387, 388, 389, 390, 391, 393, 395, 397, 401, 411, 412, 415, 416, 417, 418, 419, 423, 427, 432], "name1": 24, "name2": 24, "namedentityrecognit": 371, "namedentityrecognitionint": 371, "namedtupl": 387, "names_from_input": 57, "namespac": [247, 278, 279, 280, 281], "nan": [25, 255], "nation": 298, "nativ": [264, 367, 372, 407], "natur": [302, 304, 354, 368, 369, 372, 406, 420], "navig": [309, 322, 372], "nb_target_box": [256, 257], "nbsp": [304, 397, 411], "nd": 391, "ne": 388, "ne_root": 388, "nearest": [264, 432], "necessari": [266, 298, 314, 336, 338, 345, 349, 358, 366, 372, 383, 384, 394, 409, 413, 422, 428], "necessarili": 264, "necessit": 432, "need": [0, 24, 25, 32, 36, 39, 40, 44, 57, 158, 259, 266, 269, 302, 303, 309, 313, 314, 315, 316, 317, 318, 322, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 346, 347, 348, 349, 350, 355, 357, 359, 361, 363, 366, 368, 369, 370, 371, 372, 373, 383, 384, 387, 389, 390, 391, 392, 398, 399, 400, 401, 402, 403, 406, 407, 408, 409, 413, 421, 423, 424, 432], "needn": 405, "neelnanda": [247, 425, 428], "neg": [23, 25, 256, 257, 260, 302, 306, 413], "negative_numb": 376, "negatives_cross_devic": 376, "neo": [301, 304], "neox": [272, 302, 314, 315, 349, 363], "neox_reorder_chang": 150, "neox_rotary_pos_emb": 150, "neoxreorderchang": 179, "neoxroraryposemb": 180, "ner": [304, 309, 334, 371, 375], "ner_int": [371, 375], "ner_obj": 371, "nerual": [337, 388], "nest": 24, "nestedtensor": [256, 257], "nesterov": 301, "nestl": 361, "net": [9, 24, 313, 314, 315, 316, 317, 318, 347, 364, 365, 366], "net_info": 55, "netron": 392, "network": [24, 258, 303, 314, 316, 330, 332, 349, 361, 372, 387, 388, 389, 391, 404, 419, 423], "neualspe": 427, "neural": [4, 5, 6, 7, 8, 17, 35, 49, 50, 51, 52, 53, 54, 55, 56, 57, 59, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 150, 181, 215, 243, 244, 245, 269, 272, 289, 297, 302, 303, 309, 313, 314, 315, 316, 317, 318, 319, 321, 324, 331, 334, 335, 337, 338, 340, 343, 344, 345, 346, 352, 356, 358, 363, 364, 366, 369, 371, 372, 375, 378, 380, 383, 384, 387, 389, 390, 391, 392, 396, 404, 412, 417, 419, 420, 423, 425, 426, 427, 428, 429, 432, 433, 440], "neural_chat": [309, 315, 317, 319, 322, 340, 347, 349, 354, 356, 358, 361, 362, 364, 365, 366, 369, 370, 371, 372, 373, 374], "neural_compressor": [246, 302, 303, 306, 419, 423], "neural_engin": [302, 388, 389], "neural_engine_bin": [245, 386], "neural_engine_exampl": 388, "neural_engine_pi": 386, "neural_spe": [366, 426, 427], "neural_speed_verbos": 427, "neuralchat": [272, 299, 302, 308, 312, 319, 321, 323, 324, 325, 326, 327, 328, 329, 330, 334, 336, 337, 338, 340, 343, 344, 345, 360, 368, 372, 374, 378, 383, 384, 420], "neuralchat_cli": 375, "neuralchat_infer": 315, "neuralchat_serv": [309, 360, 367, 375], "neuralchat_tgi": 317, "neuralchat_vllm": 318, "neuralchatserverexecutor": [309, 375], "neurip": [272, 302, 420, 426], "neutral": [316, 354], "never": 361, "nevertheless": 319, "new": [0, 5, 24, 39, 40, 41, 49, 52, 53, 54, 57, 62, 247, 269, 300, 301, 313, 320, 327, 328, 329, 335, 349, 350, 361, 369, 370, 372, 380, 395, 396, 400, 401, 414, 420, 424, 432], "new_embed": [36, 44], "new_input_fil": 414, "new_modul": 24, "new_nam": 55, "new_nod": 57, "new_node_nam": 387, "newer": 24, "newgraph": 55, "newli": 413, "newsroom": 420, "next": [36, 44, 49, 55, 323, 324, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 354, 357, 363, 368, 385, 391, 392, 400, 402, 404, 406, 407, 408, 409, 425], "next_input_id": 40, "next_position_id": 40, "next_sent": 36, "next_sentence_label": 36, "nextsentencepredictoroutput": 36, "nf4": [421, 422, 432], "nfs_imag": 334, "ng": 426, "nhwc": 390, "nightli": 314, "niki": [36, 44], "ninja": [323, 324, 331, 334, 336, 337, 338, 340, 343, 344, 357, 363, 368], "niroop": 301, "nk": 390, "nl": 385, "nli": 354, "nlp": [246, 270, 272, 302, 304, 306, 353, 354, 371, 388, 420, 423], "nlp_executor": 388, "nlpseq2seqtrain": 246, "nlptrainer": [246, 302, 303, 306, 419, 423], "nm": 266, "nms_by_contain": 266, "nms_supercel": 266, "nn": [27, 32, 246, 261, 264, 303, 404], "nncf": 28, "nnode": 352, "nnz_group": 281, "no_cuda": [314, 349, 422], "no_object": 258, "no_proxi": [313, 314, 315, 316, 353, 361], "noam": [36, 44], "nod": 391, "node": [1, 49, 52, 53, 54, 55, 57, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 74, 76, 77, 78, 79, 80, 81, 82, 84, 85, 86, 87, 88, 89, 90, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 122, 123, 124, 125, 126, 127, 220, 243, 244, 305, 320, 395, 397, 411, 425, 426, 427], "node0": [314, 349], "node1": [314, 349], "node2": [314, 349], "node3": [314, 349], "node_nam": [55, 57, 62, 243, 387, 391], "node_name_list": 55, "node_names_detail": [62, 243], "node_rank": [314, 349], "nodedef": [57, 243], "nodefil": [314, 349], "nodeproto": [62, 244], "nodes_dict": [62, 94, 95, 243, 244], "nohup": [323, 324, 326, 327, 328, 329, 330, 334, 336, 337, 338, 340, 343, 344, 345, 351, 353, 363, 383, 384], "nois": 369, "non": [24, 25, 44, 256, 257, 258, 266, 288, 350, 404, 407, 409, 413, 414, 432], "non_kdim": 281, "none": [0, 4, 14, 20, 23, 24, 25, 27, 28, 29, 30, 32, 33, 36, 37, 40, 41, 44, 45, 49, 55, 57, 62, 94, 95, 121, 128, 243, 244, 246, 247, 250, 251, 252, 259, 260, 264, 281, 303, 304, 305, 314, 315, 346, 347, 366, 372, 374, 389, 416, 417, 423], "nonetyp": 247, "nonexist": [338, 358], "nonneg": 25, "nonzero": 403, "noperm": [407, 413], "norm": [25, 260], "normal": [25, 256, 257, 259, 268, 281, 319, 375, 376, 400, 408, 422, 432, 437], "normalfloat": 422, "normalize_str": 268, "normmean": 25, "not_quant": [426, 427], "notat": 288, "note": [32, 33, 38, 47, 57, 270, 289, 304, 308, 309, 314, 315, 319, 324, 326, 331, 332, 335, 337, 343, 345, 346, 347, 348, 349, 350, 352, 359, 372, 375, 376, 377, 380, 383, 384, 387, 388, 389, 390, 391, 393, 394, 395, 400, 401, 407, 408, 409, 413, 423, 425, 426, 427, 428, 432], "notebook": [309, 322, 347, 430], "noth": [376, 387, 395], "notic": [52, 53, 392, 400, 407, 408, 415], "nov": [272, 302, 309, 420], "novel": 307, "novemb": [272, 420], "noveral": 361, "now": [57, 266, 314, 316, 330, 332, 349, 355, 361, 363, 366, 386, 387, 388, 390, 391, 392, 400, 401, 408, 413, 418, 432], "np": 302, "npm": [335, 341, 379, 380, 381, 382], "nproc_per_nod": 352, "npz": 25, "nrowptr": 281, "nsampl": 432, "nsome": 361, "nthe": 361, "nthr": 410, "ntl": 385, "null": 332, "null_inst": 278, "null_numpy_valu": 25, "nullptr": [279, 281, 400], "num": [28, 314, 349, 366, 389, 399, 401, 407, 432], "num_beam": [247, 428, 432], "num_box": [256, 257, 260], "num_cards_you_hav": 1, "num_choic": [36, 44], "num_class": [256, 257, 258], "num_cpu": 28, "num_embed": 37, "num_head": [36, 39, 40, 44, 260], "num_hidden_lay": 29, "num_iter": 390, "num_key_value_head": [39, 40], "num_label": [33, 36, 44, 302, 306], "num_lay": [256, 257], "num_machin": 352, "num_nod": [314, 349], "num_of_inst": [28, 246, 289], "num_pos_feat": 259, "num_process": 352, "num_processes_per_nod": [314, 349], "num_queri": [256, 257, 258], "num_sandwich": 28, "num_shard": 363, "num_target_box": 258, "num_tilem": 281, "num_train_epoch": [314, 346, 348, 349, 352, 354, 376, 422], "num_work": 25, "numa": [332, 397, 411], "numactl": [388, 426, 427], "number": [17, 23, 25, 28, 29, 30, 44, 62, 256, 257, 258, 263, 266, 268, 289, 314, 331, 332, 337, 346, 349, 354, 370, 371, 372, 375, 376, 385, 390, 391, 395, 399, 402, 408, 409, 413, 414, 423], "numer": [25, 387, 423], "numpi": [20, 21, 25, 57, 62, 302, 388], "numtil": 402, "nuqmm": 432, "nv": 420, "nvcr": [364, 365], "nvgpu": [314, 315], "nvidia": [309, 320, 323, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 336, 338, 339, 340, 342, 343, 344, 347, 363, 364, 365, 366], "nxhxw": 390, "nxm": 399, "nz2": 369, "o": [57, 247, 302, 308, 314, 332, 349, 369, 397, 401, 406, 411], "o_proj": 349, "obj": [25, 266], "object": [24, 25, 28, 45, 47, 52, 53, 54, 243, 246, 247, 249, 256, 257, 258, 264, 265, 266, 270, 289, 302, 303, 306, 346, 347, 361, 387, 394, 434], "object2_overlap": 266, "objects_in_t": 266, "objects_to_cel": 266, "objects_to_table_structur": 266, "oblig": 298, "observ": [25, 309, 321, 350], "obtain": [302, 304, 335, 349, 369, 372, 380, 389, 408, 427, 428], "obvious": [401, 402, 406], "oc": [390, 413], "occasion": [372, 432], "occupi": 266, "occur": [372, 395, 399, 406, 432], "occurr": 25, "ocr": [350, 359], "ocr_vqa": 350, "oct": 420, "off": [25, 269, 319, 390, 432], "offens": [9, 298], "offer": [270, 309, 313, 316, 317, 318, 321, 338, 340, 342, 357, 358, 361, 363, 369, 370, 372, 373, 378, 432], "offic": 377, "offici": [298, 319, 348, 351, 371, 372], "offlin": [298, 403, 409, 423, 428], "offload": 9, "offset": [402, 406, 407], "offset_exp": 400, "offsetm": 402, "offsetn": 402, "often": [303, 370, 372], "ok": [364, 365, 366], "old": [57, 309], "old_batch_s": 27, "old_nam": 55, "old_node_index": 391, "older": 361, "omp": [314, 349, 390], "omp_get_max_thread": 401, "omp_get_num_proc": 401, "omp_num_thread": [314, 331, 349, 388], "ompi_mca_btl_vader_single_copy_mechan": [314, 315, 347, 366], "on_after_ev": 47, "on_after_optimizer_step": 47, "on_before_ev": 47, "on_before_optimizer_step": 47, "on_epoch_begin": 47, "on_epoch_end": 47, "on_step_begin": 47, "on_step_end": 47, "on_train_begin": 47, "on_train_end": 47, "onc": [24, 272, 302, 309, 321, 327, 328, 329, 335, 345, 348, 349, 361, 370, 372, 380, 383, 384, 389, 408, 420, 426, 427, 428, 429, 430, 432], "one": [9, 23, 24, 25, 36, 47, 49, 52, 53, 57, 259, 266, 281, 302, 303, 306, 314, 315, 340, 348, 351, 355, 371, 376, 377, 385, 386, 387, 389, 390, 391, 395, 396, 400, 402, 403, 408, 412, 413, 418, 426], "one_hot": 83, "oneapi": [279, 324, 338, 361, 362, 394, 410, 432], "oneccl": [314, 330, 349], "oneccl_bind_pt": [332, 348, 349], "oneccl_bindings_for_pytorch": [314, 332, 349], "oneccl_bindings_for_pytorch_path": [314, 349], "onednn": [279, 394], "onehot": [73, 93], "ones": [395, 432], "onli": [9, 24, 25, 32, 33, 36, 39, 40, 41, 42, 44, 57, 256, 257, 260, 270, 272, 289, 308, 309, 314, 316, 320, 336, 337, 349, 350, 354, 363, 367, 369, 372, 388, 390, 391, 392, 394, 396, 398, 400, 401, 402, 405, 407, 408, 409, 413, 416, 418, 420, 421, 422, 425, 428], "onlin": [298, 302, 370, 372, 406], "onnx": [50, 52, 62, 244, 246, 270, 302, 306, 337, 387, 389, 390, 407, 418, 425, 432, 434, 439], "onnx_extract_oper": 62, "onnx_extractor": [50, 51], "onnx_input": 83, "onnx_util": 58, "onnxextractor": 52, "onnxinput": 94, "onnxmodel": [52, 62], "onnxrt": [425, 426], "onnxruntim": [70, 71, 72, 78, 80, 101, 102, 107, 108, 110, 111, 112, 114, 118, 122, 123, 125, 126, 302, 305, 308, 387, 393], "op": [49, 52, 53, 54, 57, 58, 62, 158, 181, 192, 243, 244, 246, 255, 281, 389, 394, 395, 396, 400, 401, 413, 414, 423, 432], "op_alg": [400, 401], "op_attr": [398, 400, 401, 407], "op_desc": [278, 279, 398, 401], "op_desc_": 401, "op_dt": 400, "op_idx": 57, "op_nam": 28, "op_name_dict": 247, "op_typ": [57, 62, 95, 243, 244, 387, 390, 391, 401], "op_type1": 395, "op_type2": 395, "op_type_dict": 247, "opani": 73, "open": [8, 268, 288, 298, 309, 322, 346, 347, 351, 352, 354, 361, 363, 372, 388, 392, 420], "openai": [0, 9, 22, 316, 335, 336, 350, 355, 369, 375, 380, 425, 426, 428], "openai_api_kei": 324, "openai_api_protocol": 22, "openai_org": 324, "opencl": 402, "opencv": 334, "openmp": 404, "openorca": [346, 347, 352], "openssf": 435, "openssl": 369, "oper": [50, 52, 53, 57, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 122, 123, 124, 125, 126, 127, 181, 195, 243, 244, 266, 280, 282, 293, 304, 309, 327, 328, 329, 347, 357, 361, 369, 372, 386, 387, 388, 390, 392, 398, 400, 401, 402, 404, 405, 406, 407, 408, 409, 413, 421, 423, 428, 436, 439], "operand": [400, 404], "operator_adaptor": 150, "operator_conf_": 394, "operator_desc": [278, 279, 282, 398], "operator_registri": [95, 387], "operator_typ": [95, 387], "operatoradaptor": [147, 181], "operatorconfig": 394, "opinion": 316, "opmask": [400, 401], "opportun": [306, 307], "opposit": 20, "opset_vers": [246, 305], "opt": [323, 361, 362, 364, 365, 366, 410, 428, 432], "opt_1": 425, "opt_2": 425, "opt_6": 425, "optim": [4, 25, 28, 40, 47, 58, 246, 249, 250, 251, 272, 302, 304, 305, 306, 309, 314, 320, 325, 327, 328, 329, 331, 349, 354, 357, 360, 361, 367, 369, 372, 374, 388, 391, 392, 393, 396, 400, 401, 402, 404, 416, 417, 419, 420, 421, 423, 428, 432], "optimization_config": 319, "optimization_typ": [327, 328, 329], "optimizationconfig": 4, "optimize_dataset": [83, 387], "optimize_model": 4, "optimize_transform": 432, "optimizedataset": [96, 387], "optimizedmodel": 35, "optimum": [314, 346, 349, 352, 366], "option": [1, 6, 24, 25, 27, 28, 32, 33, 35, 36, 44, 45, 57, 246, 247, 256, 257, 260, 265, 267, 289, 319, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 350, 351, 363, 372, 373, 383, 384, 385, 389, 395, 400, 409, 413, 419], "option1": 421, "option2": 421, "optuna": 246, "opu": [304, 353], "orac": 307, "orca": [346, 347, 352], "orca_dpo_pair": [346, 347, 352], "orchestr": 246, "orchestrate_optim": 246, "order": [21, 24, 52, 53, 55, 57, 258, 266, 288, 330, 332, 347, 366, 387, 389, 395, 399, 405, 406, 408, 409, 432], "ordereddict": [95, 387], "ordinari": 24, "org": [25, 36, 260, 332], "organ": [350, 361, 371, 399], "orient": [298, 350], "origin": [24, 25, 27, 47, 52, 53, 57, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 151, 152, 153, 154, 155, 156, 157, 159, 160, 161, 162, 163, 164, 165, 166, 167, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 182, 183, 185, 186, 187, 188, 189, 190, 191, 193, 194, 196, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 256, 257, 288, 303, 309, 350, 357, 372, 377, 387, 392, 406, 407, 420, 423], "other": [1, 24, 25, 35, 57, 111, 158, 255, 258, 266, 281, 288, 298, 300, 302, 316, 317, 319, 324, 330, 332, 334, 336, 337, 338, 340, 343, 344, 345, 349, 357, 359, 361, 363, 368, 369, 372, 375, 383, 384, 387, 388, 389, 390, 391, 395, 396, 397, 405, 408, 409, 411, 413, 415, 420, 423, 425, 426, 432], "otherwis": [24, 57, 247, 266, 298, 361, 369, 387, 390, 405, 413], "our": [5, 49, 288, 305, 312, 320, 325, 336, 337, 338, 340, 346, 350, 351, 355, 358, 369, 372, 373, 378, 395, 400, 402, 403, 405, 407, 408, 409, 418, 429, 432], "out": [23, 25, 55, 57, 266, 300, 302, 330, 361, 372, 387, 388, 391, 398, 407, 423], "out_dt": 281, "out_pattern": 57, "out_proj": 354, "outcom": [319, 350, 395], "outdat": 377, "outer": 17, "outlier": 428, "outlin": 270, "output": [0, 24, 25, 36, 44, 55, 57, 62, 73, 243, 244, 246, 256, 257, 258, 260, 264, 265, 281, 303, 306, 319, 321, 347, 361, 369, 372, 373, 375, 379, 385, 387, 388, 389, 390, 391, 392, 393, 394, 395, 396, 404, 405, 406, 407, 409, 413, 418, 421, 425, 427, 429, 432], "output2_dt": 281, "output_attent": [33, 36, 37, 40, 41, 44], "output_audio": [336, 340, 375], "output_audio_path": [311, 319, 336, 340, 369, 375], "output_bf16": 413, "output_data": [150, 388], "output_dim": [256, 257], "output_dir": [55, 246, 314, 346, 347, 348, 349, 352, 354, 376, 392, 393, 422, 428], "output_dt": [281, 413], "output_fil": 376, "output_fp32": 413, "output_hidden_st": [33, 36, 40, 41, 44], "output_length": [36, 44], "output_max_length": 352, "output_nam": [62, 352, 388], "output_path": [337, 351], "output_router_logit": 40, "output_shap": 389, "output_tensor": [52, 53, 54, 57, 62, 95, 243, 244, 387, 391], "output_tensor_nam": 389, "output_typ": [281, 389], "output_video_path": 374, "outputdata": [182, 387], "outsid": [36, 44, 57, 391, 395], "over": [24, 25, 264, 266, 340, 349, 402, 404, 407], "overal": [309, 357, 370, 372, 406], "overflow": 423, "overhead": [400, 406, 407, 408, 409], "overlap": 266, "overlap_threshold": 266, "overlook": 377, "overrid": [0, 35, 39, 40, 246, 278, 279, 394], "overview": [369, 370], "overwrit": 414, "overwrite_output_dir": [314, 348, 349, 352, 354, 422], "ow": 319, "own": [57, 272, 309, 324, 335, 338, 341, 345, 351, 355, 372, 373, 380, 381, 383, 384, 387, 391, 392, 400, 406, 417, 420], "owner": [270, 300], "p": [25, 270, 302, 314, 349, 364, 365, 366], "p1302": [407, 413], "p2013": [407, 413], "p2031": [407, 413], "p50": 302, "p90": [302, 425], "p99": [302, 425], "p_conf": 306, "p_num": 374, "p_t": 260, "pack": [49, 83, 409], "pack_weight": 421, "packag": [18, 31, 272, 302, 327, 328, 329, 349, 351, 361, 371], "package_object": 266, "packagepositionembed": 100, "pad": [23, 32, 36, 44, 247, 256, 257, 289, 302, 350, 389, 405, 409, 413], "pad_max": [346, 347, 350], "padding_idx": 44, "padding_mask": 413, "padding_sequ": [83, 150, 388], "paddingsequ": [57, 98, 183, 388], "page": [266, 298, 300, 302, 306, 319, 327, 328, 329, 359], "page_span": 266, "pagedattent": 367, "pain": 423, "pair": [36, 55, 247, 256, 257, 346, 347, 348, 352, 354, 372, 388, 401, 409], "pairwis": 263, "palm2": 372, "panda": 266, "panopt": 260, "paper": [25, 32, 259, 272, 302, 306, 307, 420, 426, 428], "parallel": [314, 326, 330, 332, 349, 354, 361, 404, 405, 406, 409, 413, 421], "param": [21, 25, 62, 243, 246, 256, 257, 258, 260, 264, 363, 400, 401], "param_": [400, 401], "paramet": [4, 6, 9, 17, 20, 21, 24, 25, 27, 28, 29, 30, 32, 35, 36, 44, 45, 52, 53, 54, 57, 62, 95, 184, 243, 244, 246, 247, 255, 256, 257, 260, 262, 264, 267, 272, 281, 289, 302, 303, 309, 316, 317, 319, 335, 354, 363, 369, 375, 380, 385, 389, 395, 416, 419, 428, 432], "parameter": [346, 347], "parametr": 16, "params_": 401, "parent": [2, 30, 266, 372], "parent_docu": [309, 372], "parentstor": [309, 372], "pareto_fronti": 30, "pari": 377, "park": [301, 432], "parm": 373, "parmar": [36, 44], "pars": [1, 13, 62, 63, 64, 66, 67, 68, 69, 70, 71, 72, 74, 76, 77, 78, 79, 80, 81, 82, 84, 85, 86, 87, 88, 89, 90, 92, 93, 94, 96, 97, 99, 100, 102, 103, 105, 106, 107, 108, 109, 110, 112, 114, 115, 117, 118, 119, 120, 122, 123, 124, 125, 127, 243, 253, 254, 268, 372, 388, 394], "parse_arg": 1, "parse_multi_choice_respons": 268, "parse_open_respons": 268, "parsed_output": 351, "parser": 359, "part": [32, 57, 349, 361, 391, 394, 395, 396, 408, 409], "part1": [350, 389, 394], "part2": [350, 394], "parti": [300, 377, 415], "partial": [369, 372], "particip": [298, 402], "particular": [9, 272, 372, 432], "particularli": [302, 372], "pass": [2, 24, 25, 32, 36, 44, 47, 247, 260, 261, 269, 281, 300, 314, 348, 349, 355, 365, 369, 371, 372, 396, 400, 401, 418, 423], "passag": 376, "passage_max_len": 376, "passion": 361, "password": 334, "passwordless": [314, 349], "past": [36, 44, 255, 261, 264], "past_k_v_0": 396, "past_k_v_1": 396, "past_key_valu": [33, 36, 37, 40, 41, 44], "past_key_values_length": [36, 37, 44], "pat": 369, "patch": 350, "patch14": [9, 350], "path": [21, 28, 35, 47, 57, 246, 247, 265, 267, 302, 306, 316, 319, 331, 332, 334, 347, 348, 349, 351, 359, 364, 365, 366, 369, 371, 372, 375, 376, 388, 389, 390, 392, 396, 410, 412, 413, 422, 426, 427], "path_to_hostfil": 1, "pathlik": 247, "patient": 303, "pattern": [25, 28, 57, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 279, 392, 399, 402, 412, 419, 439], "pattern_dict": 387, "pattern_list": 57, "pattern_map": [57, 387, 391], "pattern_mapping_conf_valid": 57, "pattern_mapping_config": 387, "pattern_nam": 57, "pattern_registri": [184, 387], "pattern_typ": [184, 387], "patternlock": 304, "payload": 372, "pb": [57, 306], "pbtxt": [364, 365, 366], "pc": [319, 320, 325, 327, 328, 329, 420], "pdf": [25, 319, 358, 372], "pdf_file": 359, "peak": 325, "peer": 432, "peft": [272, 302, 314, 349, 352, 354, 375, 422], "peft_config": 422, "peft_model_path": 375, "pegasu": 304, "penalti": 371, "penghuicheng": 299, "pentium": 415, "peopl": [361, 423], "per": [309, 314, 349, 389, 397, 400, 403, 411, 413, 414, 428], "per_channel_dequ": 400, "per_channel_qu": 400, "per_device_eval_batch_s": [314, 348, 349, 352, 354, 422], "per_device_train_batch_s": [314, 346, 347, 348, 349, 352, 354, 376, 422], "percentag": [348, 349, 371], "perceptron": [256, 257], "perf": [389, 409, 413, 414], "perform": [2, 36, 42, 44, 57, 246, 251, 256, 257, 258, 269, 270, 289, 293, 302, 303, 305, 306, 309, 314, 315, 319, 325, 347, 349, 354, 355, 357, 361, 363, 369, 370, 371, 372, 376, 378, 379, 388, 389, 390, 393, 399, 402, 403, 404, 405, 406, 407, 408, 409, 413, 416, 417, 419, 420, 423, 432, 436], "perhap": 399, "peripher": 404, "perm": [389, 407], "perm1302": 407, "perm2013": 407, "perm2031": 407, "perman": 298, "permiss": [269, 298], "permit": 372, "permut": [389, 403, 407, 413], "perplex": 349, "persist": 372, "persist_dir": [324, 338, 375], "person": [298, 316, 355, 371], "perspect": [314, 349, 361], "pertain": 5, "pertin": 372, "phase": [47, 303, 429], "phenomenon": 391, "phi": [309, 349, 415], "philschmid": 304, "phind": [309, 326, 330, 332], "photo": [333, 334, 371], "photoai": [338, 375], "phrase": 389, "physic": [289, 298, 407], "pick": [348, 349, 409], "picklabl": 264, "pictur": [390, 399, 412], "piec": [354, 407], "pil": 20, "pile": [247, 288, 354, 425, 428], "pin_memori": 25, "ping": [330, 332, 402], "pip": [302, 308, 309, 315, 321, 322, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 346, 348, 349, 350, 351, 352, 353, 354, 357, 358, 359, 360, 361, 363, 366, 367, 368, 369, 370, 371, 374, 376, 377, 383, 384, 387, 393, 412, 426, 427, 432], "pipel": 357, "pipelin": [4, 49, 270, 309, 315, 319, 332, 351, 354, 358, 369, 370, 371, 372, 374, 375, 434], "pipeline_cfg": 319, "pipeline_config": 319, "pipelineconfig": [4, 319, 358, 372], "piqa": [288, 426], "pitch": 369, "pix2pix": 337, "pixel": [9, 256, 257], "pizza": 36, "place": [0, 24, 38, 372, 377, 396, 401, 406, 419], "placehold": 83, "plai": [23, 372], "plain": 25, "plan": [335, 361, 380], "platform": [22, 302, 312, 319, 320, 321, 323, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 336, 338, 339, 340, 342, 343, 344, 354, 363, 372, 378, 412, 420, 421, 423], "platinum": [304, 310, 397, 411, 425, 426], "pleas": [0, 9, 47, 50, 52, 53, 269, 270, 271, 289, 300, 303, 304, 306, 307, 308, 309, 316, 317, 319, 324, 326, 331, 335, 336, 337, 338, 340, 342, 343, 345, 347, 348, 349, 350, 351, 353, 355, 358, 361, 365, 372, 376, 378, 380, 383, 384, 387, 391, 394, 398, 399, 400, 401, 405, 408, 413, 419, 421, 423, 426, 427, 428, 429, 432], "plm": 304, "plot": 265, "plot_log": 265, "plu": [25, 36, 350], "plugin": [315, 320, 321, 322, 324, 332, 334, 342, 344, 357, 358, 360, 363, 369, 372, 373, 374, 375], "plugin_audio": 336, "pndmschedul": 9, "po": [259, 376], "podcast": 420, "point": [36, 44, 245, 246, 265, 316, 372, 378, 391, 400, 401, 405, 408, 421, 423, 428], "pointer": 396, "pokemon": 304, "polici": [298, 307, 335, 346, 347, 380, 435], "polish": [12, 372], "polit": 298, "polosukhin": [36, 44], "polynomi": 408, "pong": 402, "pool": 369, "pooled_output": 36, "pooler": [36, 44], "poor": 406, "pop": 410, "popul": [24, 28, 30], "popular": [302, 309, 319, 325, 338, 358, 363, 370, 420, 432], "population_fil": 30, "population_s": 28, "port": [313, 316, 317, 318, 322, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 361, 363, 364, 365, 366, 372, 375], "portion": [9, 24], "pos_emb": 83, "pose": 372, "posit": [32, 36, 37, 44, 259, 260, 261, 270, 288, 298, 302, 306, 376, 390, 395, 413, 418, 429, 432], "position_embed": 150, "position_embedding_typ": [36, 44], "position_embeddings_v1": 150, "position_id": [32, 33, 36, 40, 41, 44], "positionembed": 185, "positionembeddinglearn": 259, "positionembeddingsin": 259, "positionembeddingsv1": 186, "positionid": 73, "possibl": [256, 257, 268, 375], "possibli": 25, "post": [23, 298, 309, 313, 316, 317, 318, 334, 361, 363, 367, 372, 389, 413, 428, 430, 432], "post_init_cpu": 247, "post_init_gptq": 247, "post_init_runtim": 247, "post_init_xpu": 247, "postambl": 401, "postman": 363, "postop": [400, 401, 413], "postop_alg": 401, "postop_attr": [280, 281, 401], "postop_idx": 401, "postop_list": 401, "postop_typ": 401, "postprocess": [256, 257], "postprocesspanopt": 260, "posttrainingquantconfig": [246, 302, 306, 423], "potenti": [281, 361, 406, 420], "pow": [83, 387, 391], "power": [303, 304, 307, 309, 319, 352, 361, 420], "ppn": [314, 349, 425], "ppo_epoch": 352, "pr": [300, 413], "practic": [270, 340], "pragma": 402, "pre": [17, 272, 302, 354, 371, 372, 391, 402, 412, 420, 432], "preambl": 401, "precis": [25, 158, 246, 264, 305, 314, 320, 336, 337, 349, 371, 372, 378, 392, 417, 421, 423, 425, 432, 439], "precomput": [36, 44], "pred": 302, "pred_box": [256, 257, 258], "pred_i": 268, "pred_logit": [256, 257, 258], "predecessor": 361, "predefin": [319, 372], "predict": [4, 36, 44, 246, 256, 257, 258, 260, 268, 302, 303, 311, 319, 351, 354, 356, 358, 370, 372], "prediction_logit": [36, 44], "predominantli": 372, "pref": 389, "prefer": [316, 319, 372, 395, 407], "prefix": [25, 314, 349, 372, 413], "premis": [319, 325, 354], "prepar": [36, 44, 389, 391, 394, 400, 401, 409, 423], "prepare_dataset": 393, "prepare_inputs_for_gener": [36, 44], "prepare_model": [337, 392, 393], "prepare_model_for_kbit_train": 422, "prepare_t": 401, "preprint": 432, "preprocess": [18, 27, 288, 302, 408], "preprocess_model": 27, "prerequisit": 366, "present": [36, 246, 319, 408], "preserv": 372, "press": [355, 361], "pretrain": [17, 33, 36, 44, 309, 319, 347, 348, 349, 369, 387, 429], "pretrainedconfig": 247, "pretrainedmodel": 23, "pretrainedtoken": 23, "pretraining_data": 350, "preval": 432, "prevent": [25, 372, 390], "previou": [55, 246, 302, 349, 388, 405, 424], "previous": 332, "price": [335, 380], "primari": [316, 370, 378], "primarili": [369, 372, 429], "primconst": 115, "primit": [281, 332, 394], "primitive_desc": 394, "print": [24, 25, 264, 309, 314, 319, 321, 332, 349, 369, 371, 387, 395, 428, 432], "print_hello_world": 313, "print_result": 351, "prior": [408, 429], "priorit": [319, 325], "prioriti": 24, "privat": [279, 280, 298, 349, 388, 394, 399, 400, 401, 405, 406, 420], "privileg": 314, "proactiv": [314, 349], "probabl": [25, 28, 29, 319, 346, 347, 370, 406, 432], "problem": [307, 338, 358, 372, 409, 413, 429], "problemat": 372, "proce": 347, "procedur": [316, 385, 413], "procedurein": 385, "process": [27, 28, 47, 246, 256, 257, 264, 266, 267, 270, 302, 304, 309, 314, 319, 321, 322, 327, 328, 329, 332, 334, 338, 340, 347, 349, 351, 354, 355, 358, 361, 369, 370, 371, 372, 373, 375, 379, 387, 388, 390, 391, 395, 396, 399, 400, 402, 405, 406, 409, 419, 420, 423, 432], "process_batch_per_k": 281, "process_col": [281, 400], "process_row": 281, "process_vec_num": 281, "processed_s": 260, "processed_text": 373, "processor": [4, 272, 302, 304, 308, 309, 310, 311, 316, 318, 319, 321, 323, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 336, 337, 338, 339, 340, 342, 343, 344, 347, 349, 358, 361, 363, 364, 365, 366, 367, 370, 375, 397, 411, 420], "produc": [25, 255, 266, 303, 316, 372, 409], "product": [302, 372, 397, 407, 411, 417, 423], "profession": 298, "profil": [295, 398, 438, 439], "proflil": 389, "program": [399, 415, 432], "progress": 17, "project": [270, 298, 300, 309, 327, 328, 329, 335, 345, 369, 380, 383, 384], "project_source_dir": 388, "promis": [404, 405], "prompt": [0, 36, 267, 309, 313, 314, 318, 321, 335, 349, 354, 355, 364, 365, 366, 367, 370, 371, 372, 375, 380, 426, 427, 428, 429, 432], "prompt_token": 361, "prop_kind": 394, "propag": [256, 257], "proper": [321, 371], "properli": [32, 335, 340, 380], "properti": [302, 388, 415], "proport": [416, 417], "propos": [303, 350, 372, 399, 420, 428], "prot": 361, "protect": [278, 279, 361], "protobuf": [308, 393], "protocol": [22, 406], "prove": 369, "provid": [4, 24, 27, 28, 30, 35, 36, 44, 246, 260, 263, 264, 270, 272, 289, 302, 304, 305, 306, 307, 309, 314, 316, 319, 321, 325, 327, 328, 329, 335, 336, 337, 342, 345, 349, 351, 356, 357, 365, 367, 369, 370, 371, 372, 373, 375, 376, 378, 380, 383, 384, 385, 387, 396, 398, 401, 406, 408, 416, 421, 423, 427, 428, 432], "proxi": [279, 293, 313, 314, 315, 316, 317, 318, 361, 398, 436], "proxy_bas": 279, "prune": [36, 44, 46, 246, 270, 272, 302, 420, 425, 430, 434], "prune_config": 307, "prune_head": [36, 44], "prune_typ": 306, "pruneofa": 304, "pruner": [303, 419], "pruner_config": 306, "pruner_info": 47, "prunerconfig": 306, "prunerv2": 28, "pruning_conf": 419, "pruning_config": [28, 246, 306, 419], "pruning_frequ": 28, "pruning_op_typ": 28, "pruning_scop": [28, 419], "pruning_typ": [28, 419], "pruningconf": 246, "pruningconfig": 306, "psedorandom": 25, "pseudo": 405, "pseudorandom": 25, "pt": [36, 44, 289, 302, 306, 355, 418, 428, 429, 432], "pt_hpu_max_compound_op_s": 349, "pth": [353, 359], "ptq": [392, 428], "ptr": [400, 401, 410], "ptr_bia": 281, "ptr_dens": 281, "ptr_dst": 281, "ptr_dst_m1": 281, "ptr_dst_m2": 281, "ptr_scale": 281, "ptun": [314, 349], "pub": [270, 330, 332], "public": [32, 270, 278, 279, 280, 281, 298, 319, 325, 349, 361, 394, 400, 401], "publish": [272, 298, 302, 415, 420], "pubtables1m": 359, "pubtables1m_detection_detr_r18": 359, "pull": [313, 316], "pull_key_prefix": 25, "pure": [319, 386, 401], "purif": [272, 420], "purpos": [256, 257, 378, 391, 395, 400, 405], "push": [247, 300, 410], "push_back": 401, "push_key_prefix": 25, "push_to_hub": 247, "pushtohubmixin": 247, "put": [266, 353, 370, 387, 388, 391], "pvc": 432, "pwd": [364, 365], "py": [1, 22, 50, 314, 315, 326, 327, 328, 329, 330, 331, 332, 337, 340, 345, 346, 347, 348, 349, 351, 352, 353, 354, 355, 356, 357, 358, 359, 360, 361, 362, 364, 365, 366, 368, 371, 372, 376, 377, 383, 384, 385, 387, 389, 393, 412, 419, 422, 426, 427, 432], "py3": [364, 365], "pyakurel": 301, "pybind": 55, "pydant": 361, "pyg": 340, "pylint": [269, 300], "pypi": [330, 337], "pytest": 269, "python": [1, 6, 8, 24, 36, 44, 52, 53, 57, 247, 277, 286, 300, 302, 308, 311, 314, 315, 316, 319, 321, 322, 346, 347, 348, 349, 351, 352, 353, 354, 355, 356, 358, 359, 360, 361, 362, 364, 365, 366, 371, 374, 375, 376, 377, 385, 386, 387, 388, 390, 392, 393, 412, 426, 427, 432], "python3": [308, 309, 314, 349, 352, 361, 386], "pythonpath": [364, 365, 366], "pytorch": [24, 25, 30, 32, 33, 35, 36, 37, 39, 40, 41, 42, 44, 246, 252, 264, 269, 270, 299, 302, 305, 308, 309, 314, 331, 332, 340, 349, 361, 393, 412, 418, 420, 423, 426, 427, 428, 432], "pytorchbenchmark": 27, "pyyaml": [323, 324, 331, 334, 336, 337, 338, 340, 343, 344, 357, 363, 368], "q": [25, 32, 340, 407, 408], "q_bia": 281, "q_config": [302, 306, 423], "q_k_scale": 281, "q_k_src2": 281, "q_model": 35, "q_proj": [346, 347, 349, 354], "q_scale": 281, "q_weight": 281, "qa": 372, "qat": [304, 305, 423], "qdq": [246, 305, 392], "qk": 407, "qk_v_output_scal": 281, "qk_v_output_zero_point": 281, "qkv": [220, 390, 392], "qkv_merg": 150, "qkv_reshap": 150, "qkvmerg": 187, "qkvreshap": 188, "qlinear": [305, 392], "qlinearadd": 73, "qlinearmatmul": [73, 392], "qlinearmul": 73, "qmodel": 432, "qnli": 304, "qqp": 304, "quaint": 361, "quala": [272, 302, 304, 306, 420], "qualiti": [337, 355, 369, 372, 376, 379], "quanstion": 44, "quant": [57, 158, 398, 413, 423, 432, 437], "quant_config": [246, 302, 306, 423], "quant_format": [246, 305], "quant_gather_to_bf16": 150, "quant_info_init": 57, "quant_lm_head": 247, "quant_tile_n": 405, "quantawaretrainingconfig": 247, "quantgathertobf16": [147, 189], "quantif": [428, 432], "quantil": 25, "quantiti": 371, "quantiz": [28, 35, 102, 246, 247, 270, 272, 302, 305, 309, 357, 400, 401, 405, 406, 408, 413, 416, 420, 421, 422, 426, 428, 430, 434, 439], "quantization_config": [306, 428, 429, 432], "quantizationawaretrainingconfig": [246, 423], "quantizationconfig": 246, "quantizationmethod": 247, "quantize_dim_elt_num": 413, "quantize_fus": 150, "quantize_linear": [83, 387], "quantize_on_tmp_buf": 405, "quantize_to_packed_weight": 421, "quantize_v2": 83, "quantized_fused_matmul_and_dequant": 83, "quantized_graph_dtype_refactor": 150, "quantized_matmul_with_bias_and_dequant": 83, "quantized_weight": 421, "quantizedgraphdtypecheck": 191, "quantizedgraphdtyperefactor": [158, 191], "quantizedmatmulwithbiasanddequant": 105, "quantizefus": 190, "quantizelinear": [102, 387, 392], "quantizev2": 103, "quarter": [25, 406], "queri": [4, 11, 12, 15, 32, 39, 40, 256, 257, 309, 311, 316, 319, 324, 325, 338, 342, 370, 371, 372, 373, 376], "query_dim": 260, "query_emb": 257, "query_file_jsonl_path": 376, "query_instruction_for_retriev": 376, "query_max_len": 376, "query_st": [39, 40], "question": [36, 267, 268, 298, 300, 302, 304, 316, 319, 335, 351, 370, 372, 380, 403, 430], "question_typ": 351, "questionansweringmodeloutput": [36, 44], "queue": 361, "quick": [306, 356, 372, 387], "quick_start": [361, 362], "quickli": 316, "quiet": 25, "quit": [361, 385, 396, 400, 432], "qweight": 421, "qwen": [309, 432], "qwen2": 420, "qword": 410, "r": [21, 25, 266, 302, 308, 309, 315, 322, 323, 324, 327, 328, 329, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 346, 348, 349, 351, 352, 353, 354, 357, 358, 359, 360, 361, 363, 368, 370, 371, 374, 376, 377, 383, 384, 387, 393, 397, 411, 412, 422, 423, 425, 426, 427, 432], "r10": 401, "r12": 410, "r13": 410, "r14": [401, 410], "r15": [401, 410], "race": 298, "rag": [309, 316, 320, 321, 324, 357, 358, 361, 420], "rag_doc": 316, "rai": 30, "rais": [24, 272, 338, 358, 420, 426, 427], "ramakrishna": 301, "random": [25, 36, 288, 371, 376, 406], "random_sampl": 25, "rang": [21, 73, 246, 260, 302, 309, 312, 319, 320, 369, 372, 387, 390, 396, 413, 423], "range_for_sampl": 376, "rank": [252, 264, 309, 314, 347, 349, 354, 372, 376, 377, 422, 425], "ransform": 432, "rapid": [272, 302, 308, 349], "rate": [370, 406, 425], "rather": [25, 391, 400], "ratio": [28, 29, 30, 47, 303, 376, 411, 413, 416, 417], "raw": [256, 257], "raw_cmd": 27, "raw_dataset": [302, 306], "raw_h": 20, "raw_w": 20, "rbp": 410, "rbx": 410, "rcx": 401, "rdi": [401, 410], "rdx": 401, "re": [35, 247, 319, 335, 341, 378, 379, 380, 381, 382, 403], "reach": [300, 302], "read": [25, 47, 302, 313, 317, 318, 388, 420], "readabl": 247, "readi": [335, 348, 349, 364, 365, 366, 372, 380], "readm": [269, 316, 317, 319, 323, 326, 327, 328, 329, 330, 331, 332, 333, 334, 336, 337, 338, 339, 340, 342, 343, 344, 356, 358, 363, 364, 365, 366, 370, 378, 389, 392, 432], "real": [279, 307, 317, 318, 319, 325, 363, 390, 405, 406, 407, 410], "real_drop": 307, "realdiv": 73, "realiz": 391, "realli": [256, 257], "reason": [258, 298, 372, 373, 391, 394, 406], "receiv": [35, 181, 364, 365, 366, 391, 395, 396], "recent": [307, 372, 429, 432], "recent_ratio": 307, "recip": [246, 390, 417], "reciproc": [73, 376], "recogn": [372, 387, 439], "recognit": [17, 266, 309, 355, 371, 391, 395], "recommend": [308, 313, 314, 325, 345, 348, 349, 354, 375, 376, 377, 383, 384, 387, 390, 393, 395, 396, 406, 410, 426, 427, 429], "recomput": 25, "record": [52, 53, 54, 382, 389, 390], "recruit": 395, "rectifi": [319, 325], "recurs": [24, 25, 330, 386, 388, 395, 432], "recursive_copi": 24, "red": [21, 425, 426], "redevelop": 372, "redpajama": 428, "reduc": [9, 264, 266, 302, 307, 314, 315, 342, 350, 354, 361, 370, 372, 376, 394, 399, 400, 402, 404, 405, 406, 408, 409, 420, 422, 423, 428, 432], "reduce_dict": 264, "reduce_mean": [83, 387], "reduce_sum": 83, "reducemean": [106, 387, 391], "reducesum": 107, "reduct": [264, 306, 369, 404, 407], "redund": [402, 419], "refactor": [278, 279, 280, 281, 319, 325, 372], "refactor_batch_s": 27, "refer": [9, 24, 47, 50, 52, 53, 264, 269, 270, 302, 303, 306, 307, 308, 309, 317, 319, 321, 337, 338, 342, 345, 347, 348, 349, 350, 351, 358, 372, 376, 378, 383, 384, 385, 391, 394, 403, 405, 408, 413, 415, 419, 421, 423, 429], "refin": [266, 372], "refine_column": 266, "refine_row": 266, "refine_table_structur": 266, "reflect": 25, "refresh": [382, 413], "refresh_model": 55, "reg": [400, 401], "reg64": [400, 401], "reg64_mock1": 401, "reg_idx": 401, "reg_param": 401, "reg_src": 401, "reg_typ": [28, 401], "regard": [298, 346, 347, 352], "regardless": 298, "regener": [335, 341, 380, 381], "regex": 268, "regexp": 400, "region": 428, "regist": [0, 65, 73, 86, 95, 98, 101, 102, 111, 113, 116, 126, 184, 399, 400, 401, 402, 404, 405, 406, 407, 409, 439], "register_conv_templ": 0, "register_operator_class": 394, "registr": [95, 184, 387], "registrationcent": 332, "regress": [33, 36, 44, 256, 257, 269], "regul": [335, 380], "reinforc": [319, 346, 347, 420], "reinstal": [315, 387], "reinterpret_cast": 394, "reject": [298, 346, 347, 352], "rel": [256, 257, 340, 369, 416, 423, 425, 429], "relat": [52, 53, 246, 256, 257, 270, 288, 295, 303, 314, 319, 325, 335, 354, 359, 360, 369, 370, 371, 372, 374, 375, 380, 387, 391, 395, 396, 403, 408, 419, 423, 438], "relationship": 57, "releas": [270, 272, 302, 309, 319, 332, 349, 372, 420, 425, 435], "relev": [2, 302, 335, 372, 380], "reli": 372, "reliabl": 372, "relianc": [372, 432], "relief": 404, "religion": 298, "reload": 25, "relu": [73, 401, 413], "remain": [24, 372, 420, 432], "remain_el": 281, "remain_element_num": 401, "remain_task_mask": 401, "remark": [372, 390, 429], "rememb": [25, 313, 316, 317, 318, 345, 363, 366, 378, 383, 384], "remot": [349, 379], "remov": [25, 40, 44, 55, 57, 192, 195, 247, 261, 266, 298, 361, 401, 419], "remove_constant_op": 150, "remove_environ_info_item": 57, "remove_integer_superscript": 266, "remove_last_view": 150, "remove_nod": 55, "remove_objects_without_cont": 266, "remove_rang": 150, "remove_supercell_overlap": 266, "remove_unused_oper": 150, "remove_zero": 150, "removeconstantop": 192, "removelastview": 193, "removerang": 194, "removeslic": 150, "removeunusedoper": 195, "removezero": 196, "rename_nod": 55, "rencetli": 429, "reorder": [55, 83, 405, 406], "repack": 421, "repack_quantized_weight": 421, "repeat": [25, 73, 402, 414], "repeatedli": 25, "repercuss": 298, "repetit": 371, "repetition_penalti": 371, "replac": [24, 25, 44, 57, 247, 302, 303, 306, 309, 321, 345, 346, 347, 355, 357, 366, 369, 372, 375, 383, 384, 387, 391, 419, 420, 421, 423, 432], "replace_modul": 24, "replacechar": 373, "replc": 315, "repo": [292, 300, 322, 323, 324, 326, 330, 331, 332, 334, 335, 336, 337, 338, 340, 341, 343, 344, 345, 347, 351, 353, 354, 357, 363, 366, 368, 379, 380, 381, 382, 383, 384, 387, 413, 435], "repo_id": 247, "repo_path": [314, 315], "report": [298, 300, 302, 316, 358, 372], "report_to": 346, "repositori": [35, 247, 302, 314, 315, 345, 348, 349, 364, 365, 366, 383, 384], "repr": 247, "repres": [25, 47, 57, 256, 257, 298, 319, 325, 345, 375, 383, 384, 389, 391, 395, 399, 401, 402, 405, 423], "represent": [9, 23, 24, 57, 266, 298, 306, 387, 391, 392], "representtaion": 57, "reproduc": [351, 426], "request": [0, 24, 260, 295, 302, 316, 323, 324, 331, 334, 336, 337, 338, 340, 342, 343, 344, 348, 349, 357, 363, 366, 367, 368, 370, 379, 426, 427, 438], "requir": [4, 24, 32, 57, 181, 266, 289, 306, 313, 314, 315, 316, 322, 323, 324, 326, 327, 328, 329, 330, 332, 334, 335, 336, 337, 338, 340, 341, 343, 344, 345, 346, 347, 348, 349, 352, 354, 356, 357, 358, 359, 360, 361, 362, 363, 366, 368, 369, 370, 371, 372, 374, 377, 378, 379, 380, 381, 382, 383, 384, 387, 391, 393, 395, 397, 399, 402, 403, 405, 412, 413, 421, 423, 426, 427, 432], "requirements_cpu": [309, 331, 332, 376], "requirements_cuda": 376, "requirements_hpu": [309, 326], "requirements_win": [309, 327, 328, 329], "requirements_xpu": 309, "requires_grad": 24, "requires_safety_check": 9, "rerank": [2, 14, 372], "reranker_model": [14, 372], "rerun": 432, "rescale_factor": 20, "research": [314, 349, 415, 420, 422], "reset_sp": 279, "reshap": [39, 40, 57, 83, 100, 220, 387, 388, 389, 394], "reshape_0": [57, 391], "reshape_after_restore_hidden_st": 150, "reshape_before_and_after_attention_out_layer_norm_gather_el": 150, "reshape_before_restore_hidden_st": 150, "reshape_fus": 150, "reshape_input": 281, "reshape_tim": 389, "reshapeafterrestorehiddenst": 198, "reshapebeforeandafterattentionoutlayernormgatherel": 199, "reshapebeforerestorehiddenst": 200, "reshapefus": 201, "residu": [17, 398], "resili": 361, "resiz": 83, "resnet": [17, 255], "resnet101": 17, "resnet152": 17, "resnet18": 17, "resnet34": 17, "resnet50": 17, "resnext": 17, "resnext101_32x8d": 17, "resnext50_32x4d": 17, "resolut": 25, "resolv": [24, 25, 266, 271], "resolve_state_dict": 25, "resourc": [302, 303, 361, 372, 402], "respect": [298, 351, 384, 388, 391, 392], "respectfulli": 407, "respond": [349, 356], "respons": [0, 4, 268, 309, 311, 316, 319, 335, 346, 347, 349, 351, 352, 356, 358, 359, 361, 370, 372, 374, 375, 379, 380, 399, 405, 406, 408, 420], "response_templ": [324, 338, 372], "responsibli": 373, "rest": [24, 316, 340, 363, 390, 391, 395, 407, 409], "restart": [335, 380], "restaur": 36, "restor": [29, 304, 427], "restore_hidden_states_in_length_adaptive_update_indic": 150, "restorehiddenstatesinlengthadapt": 202, "result": [24, 57, 246, 247, 260, 264, 265, 268, 272, 289, 298, 302, 304, 338, 342, 358, 369, 370, 371, 372, 373, 374, 376, 387, 390, 391, 397, 400, 401, 402, 405, 406, 407, 408, 409, 411, 415, 420, 423, 425, 426, 429], "result_dir": 374, "result_ref": 279, "resum": [35, 246], "resume_download": 35, "resume_from_checkpoint": 246, "resume_from_pruned_checkpoint": 28, "ret": [24, 57, 395, 410], "ret_old_nod": 387, "retain": [24, 25, 307, 382, 429], "retain_grad": 24, "retain_input": 24, "retain_output": 24, "retinanet": 260, "retriev": [3, 23, 256, 257, 316, 319, 321, 322, 324, 332, 334, 338, 357, 376, 387], "retrieval_chat": 358, "retrieval_file_path": 334, "retrieval_typ": [14, 372, 375], "retrievalqa": [309, 372], "retrievaltypeopt": 5, "retrieveradapt": 14, "return": [0, 4, 6, 17, 20, 21, 23, 24, 25, 27, 29, 32, 35, 36, 44, 45, 47, 52, 53, 54, 57, 62, 95, 184, 243, 244, 246, 247, 256, 257, 258, 260, 261, 262, 263, 264, 266, 267, 268, 289, 302, 336, 361, 370, 372, 373, 387, 391, 395, 400, 401, 416, 421], "return_dict": [33, 36, 40, 41, 44], "return_interm_lay": 255, "return_output": 246, "return_tensor": [36, 44, 289, 302, 306, 428, 429, 432], "retval": 1, "reus": [388, 396, 405, 421], "revamp": 372, "revers": [25, 266], "review": [298, 300, 319, 432], "revis": [24, 35], "reward": [346, 347], "reward_model": 352, "reward_model_nam": 352, "rewrit": 387, "rf": 330, "rf_data": 401, "rgb": 21, "rh": [280, 407], "rhel": 308, "rich": [302, 321], "richer": 372, "right": [23, 36, 44, 57, 266, 298, 316, 322, 403, 407, 409, 418], "rishi": 377, "river": 377, "rl": 352, "rl_train": 352, "rlhf": [346, 347], "rm": [330, 364, 365, 407], "rms_norm": 150, "rmsnorm": [39, 40, 203], "ro": 304, "roberata": 44, "roberta": [44, 304, 430], "robertaattent": 44, "robertaclassificationhead": 44, "robertaconfig": 44, "robertaembed": 44, "robertaencod": 44, "robertaforcausallm": 44, "robertaformaskedlm": 44, "robertaformultiplechoic": 44, "robertaforquestionansw": 44, "robertaforsequenceclassif": 44, "robertafortokenclassif": 44, "robertaintermedi": 44, "robertalay": 44, "robertalmhead": 44, "robertamodel": 44, "robertaoutput": 44, "robertapool": 44, "robertapretrainedmodel": 44, "robertaselfattent": 44, "robertaselfoutput": 44, "robertatoken": 44, "robot": [341, 378, 381], "robust": [319, 325, 372], "rocketknight1": 304, "roco": 348, "rohan": 301, "role": [0, 309, 316, 321, 324, 361, 372], "roll": [361, 407, 429], "rome": 377, "root": [322, 334, 361, 364, 388], "rope": 429, "roraryposemb": [167, 180, 204], "rotari": 32, "rotary_pos_emb": 150, "rotat": 32, "rotten": 315, "roug": 349, "rougelsum": 425, "rough": 385, "roughli": [387, 405], "round": [401, 423, 432], "row": [266, 390, 402, 403, 405, 409], "row_num": 281, "rqsrt": 255, "rsi": 401, "rsp": 401, "rsqrt": [57, 73, 395], "rsub": 83, "rt_data": [279, 398], "rte": 288, "rtn": [288, 421, 432], "rtn_config": 429, "rtnconfig": [247, 319, 429, 432], "rubric": 25, "rule": [24, 25, 373, 395], "run": [9, 23, 24, 25, 246, 262, 264, 269, 289, 307, 308, 309, 313, 314, 316, 317, 318, 319, 321, 335, 341, 347, 348, 349, 350, 352, 354, 355, 364, 365, 369, 379, 380, 381, 382, 413, 414, 423, 432], "run_accuraci": [426, 427], "run_autoround": 427, "run_bench_": 413, "run_ci": 414, "run_code_gen": [326, 327, 328, 329, 330, 331], "run_evolutionary_search": 246, "run_executor": [389, 393], "run_generation_gpu_woq": 432, "run_infer": [426, 427], "run_llava": 351, "run_retrieval_on_cpu": 358, "runscript": 354, "runtim": [4, 272, 281, 302, 306, 314, 315, 318, 327, 328, 329, 347, 366, 386, 387, 388, 392, 395, 396, 398, 410, 413, 423, 426, 432], "runtime_kind": [278, 280], "runtime_kind_": [278, 280], "runtime_output_directori": 388, "runwayml": 9, "s8": [158, 392, 400, 401, 405, 413], "s8s8": [246, 305, 405], "s8s8bf16": 405, "sadhu": 301, "sadtalk": [360, 374], "safe": [300, 319, 349, 373], "safeti": [9, 247, 309, 373], "safety_check": [9, 319, 373, 375], "safetycheck": 373, "sahil2801": 349, "sai": 409, "said": 391, "salesforc": [309, 350], "salient": 419, "samanwai": 301, "same": [2, 17, 24, 25, 44, 57, 260, 264, 266, 303, 305, 314, 316, 323, 330, 332, 346, 350, 351, 355, 370, 372, 376, 387, 388, 389, 391, 392, 395, 399, 402, 405, 406, 409, 412, 413, 414], "same_src_dtyp": 281, "sampl": [25, 29, 246, 256, 257, 268, 288, 289, 302, 306, 311, 314, 319, 349, 350, 355, 369, 376, 397, 407, 423, 425], "sample_1": 340, "sample_layer_configur": 29, "sample_length_configur": 29, "sample_port": 25, "sample_s": [25, 246], "sample_zh_cn": 340, "sampler": [25, 350], "samsum": [304, 425], "sandesh": 301, "sandeshpyakurel": 301, "sandwich": 28, "sangjun": 301, "sapphir": [272, 302, 308, 349], "satisfact": 372, "satisfactori": 405, "satisfi": [308, 395, 405], "satur": 401, "savani": 304, "save": [9, 10, 25, 30, 55, 246, 247, 267, 302, 316, 350, 372, 376, 387, 388, 389, 392, 396, 403, 407, 409, 423, 427, 428, 432], "save_cached_st": 25, "save_directori": 247, "save_freq": 352, "save_jsonl": 267, "save_model": 302, "save_path": [246, 305], "save_popul": 30, "save_pretrain": [247, 428, 432], "save_step": [314, 346, 347, 349, 352], "save_stor": 30, "save_strategi": [314, 348, 349, 352, 354, 422], "save_total_limit": [314, 348, 349, 352, 354, 422], "saved_dir": 432, "saved_result": [302, 428], "say_hello": [311, 375], "sbu": 350, "scabl": 370, "scalabl": [4, 272, 302, 304, 308, 309, 311, 316, 319, 321, 323, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 336, 337, 338, 339, 340, 342, 343, 344, 354, 361, 363, 364, 365, 366, 370, 400], "scalar": [25, 111, 400], "scalar_num": 281, "scale": [20, 25, 62, 246, 259, 281, 370, 376, 400, 405, 408, 420, 421, 423, 428, 432], "scale0": 281, "scale_dst": 281, "scale_dtyp": [247, 432], "scale_factor": 264, "scale_k": 281, "scale_map": [246, 302], "scale_q": 281, "scale_reduce_quant": 405, "scale_shar": 247, "scale_typ": 421, "scale_v": 281, "scaleab": 281, "scalec": 281, "scaled_dot_product_attent": 32, "scan": 269, "scatter_el": 83, "scatterel": 112, "scenario": [281, 372, 405], "scene": [406, 429], "schedul": [9, 246, 293, 398, 436], "schedulermixin": 9, "scheme": [266, 354, 372], "scope": 269, "score": [30, 36, 44, 266, 307, 372, 376, 418], "score_threshold": 266, "scour": 372, "scr2": 413, "scratch": 5, "scratch_": 401, "screen": 395, "screenshot": [335, 380], "script": [1, 16, 17, 19, 20, 21, 269, 300, 315, 331, 332, 347, 349, 350, 351, 358, 376, 385, 390, 392, 412, 427, 432], "scriptmodul": 27, "scroll": 382, "sd": 304, "sdk": [345, 364, 365, 383, 384], "sdpa": 32, "se": [352, 432], "seamless": [272, 302, 319], "seamlessli": [309, 319, 372, 378, 429], "search": [2, 28, 30, 57, 129, 130, 131, 132, 133, 134, 135, 136, 137, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 151, 152, 153, 154, 155, 156, 157, 159, 160, 161, 162, 163, 164, 165, 166, 167, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 182, 183, 185, 186, 187, 188, 189, 190, 191, 193, 194, 196, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 216, 217, 218, 221, 222, 223, 224, 225, 226, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 246, 304, 307, 350, 370, 372, 376, 391], "search_kwarg": [2, 309, 372], "search_mod": [57, 387, 391], "search_pattern": [57, 395], "search_straight_pattern": [57, 395], "search_typ": [2, 309, 372], "searchtyp": 2, "sec": [302, 397, 425], "second": [36, 44, 57, 289, 299, 337, 347, 385, 386, 391, 393, 394, 395, 403, 404, 407, 409, 410, 413, 432], "secondmo": 25, "secretli": 346, "section": [302, 320, 325, 333, 339, 342, 348, 349, 361, 398, 409, 410], "secur": [349, 361, 373, 435], "see": [23, 24, 36, 44, 49, 57, 256, 257, 260, 271, 298, 300, 302, 309, 314, 340, 349, 355, 366, 372, 376, 387, 389, 390, 391, 392, 395, 397, 399, 404, 408, 410, 411, 412, 413, 415, 421, 428, 432], "seed": [25, 288, 314, 349, 352, 371], "seek": [338, 358, 361, 371, 372], "seen": 418, "segment": [269, 319, 325, 372], "segment_id": [57, 306, 388], "sein": 377, "select": [25, 33, 36, 44, 246, 258, 314, 322, 335, 346, 347, 349, 351, 352, 372, 376, 380, 401, 413, 426], "self": [25, 28, 36, 37, 39, 40, 41, 42, 44, 111, 369, 387, 389], "semant": [319, 370, 372], "semi": 389, "semidefinit": 432, "send": [300, 366], "sensit": [319, 361, 373], "sensitive_check": 373, "sensitive_filt": 373, "sent": [342, 361, 370, 391], "sentenc": [36, 44, 289, 302, 314, 315, 335, 349, 354, 355, 372, 376, 380], "sentiment": [272, 302], "sep": 420, "separ": [0, 24, 298, 395, 409, 415], "separatorstyl": 0, "sepc_typ": 281, "seq": [349, 388, 407, 425], "seq2seq": [36, 44, 246], "seq_len": [32, 247, 281, 302, 388, 389, 393, 407, 413], "seq_relationship_logit": 36, "seq_vnni_copy_param": 281, "seqenti": 288, "seqlen": [37, 39, 40], "sequenc": [25, 29, 33, 36, 44, 57, 262, 288, 302, 306, 349, 350, 372, 387, 391, 395, 404, 413, 429], "sequence_length": [33, 36, 44], "sequence_output": 36, "sequenceclassifieroutput": [36, 44], "sequenceclassifieroutputwithpast": 33, "sequencelength": [73, 397], "sequenti": [24, 44, 391, 400, 401, 404], "sergei": 301, "seri": [264, 302, 308, 309, 349, 350, 361, 400, 403, 413], "serial": 247, "serv": [36, 309, 312, 319, 323, 326, 330, 331, 332, 340, 363, 372], "server": [309, 316, 319, 325, 347, 370, 371], "server_executor": [309, 375], "server_ip": 375, "server_nam": 361, "server_port": 361, "servic": [312, 319, 325, 335, 340, 342, 345, 366, 369, 370, 372, 380, 383, 384], "session": [348, 388, 396], "set": [0, 2, 24, 25, 29, 32, 33, 36, 44, 57, 95, 246, 266, 289, 298, 302, 307, 309, 313, 314, 315, 316, 317, 318, 323, 324, 326, 327, 328, 329, 330, 331, 332, 333, 334, 336, 337, 338, 339, 340, 342, 343, 344, 345, 347, 348, 349, 350, 351, 354, 357, 362, 363, 366, 371, 373, 375, 376, 383, 384, 388, 390, 391, 392, 394, 395, 396, 399, 400, 401, 404, 413, 428, 432], "set_attr": [63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 74, 76, 77, 78, 79, 80, 81, 82, 84, 85, 86, 87, 88, 89, 90, 92, 93, 95, 96, 97, 98, 99, 100, 101, 102, 103, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 122, 123, 124, 125, 126, 127, 387], "set_autocast": 57, "set_binaryop_list": [280, 400], "set_data_handl": 394, "set_dtyp": 394, "set_dynamic_config": [246, 306], "set_environ_var": 57, "set_input_embed": [36, 44], "set_length_config": [36, 44], "set_log_fil": 302, "set_lower_constraint": 30, "set_mask": 400, "set_output_attent": [36, 44], "set_output_embed": [36, 44], "set_requires_grad": 24, "set_scal": 400, "set_shap": 394, "set_system_messag": 0, "set_target_properti": 388, "set_upper_constraint": 30, "set_zp": 400, "setcriterion": [256, 257], "setfit": [272, 302, 430], "setp": 396, "settabl": 389, "setter": [30, 36, 44], "setup": [325, 333, 339, 342, 348, 349, 372, 432], "setup_and_instal": 366, "setup_for_distribut": 264, "setuptool": [323, 324, 331, 334, 336, 337, 338, 340, 343, 344, 357, 363, 368], "setvar": [314, 332, 349, 361, 362, 432], "sever": [57, 309, 314, 321, 349, 387, 392, 395, 396, 399, 413, 423], "sex": 298, "sexual": 298, "sf": [308, 309], "sgx": 361, "sh": [314, 322, 323, 324, 326, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 349, 350, 354, 357, 358, 360, 361, 362, 363, 368, 383, 384, 392, 393, 414, 427, 432], "shanghai": [356, 371], "shanghai_": 353, "shape": [32, 33, 36, 37, 38, 40, 44, 57, 83, 121, 256, 257, 260, 281, 302, 350, 388, 389, 390, 394, 396, 399, 405, 407, 413, 421], "shape_0": 390, "shape_1": 390, "shape_2": 390, "shape_256_256_128": 410, "shard": [361, 363], "share": [266, 316, 347, 361, 372, 376, 402], "share_weight": 24, "shared_criterion": 247, "shared_ptr": [278, 279, 394], "sharegpt": 350, "sharma": 301, "shazeer": [36, 44], "she": 361, "shell": [309, 393], "shen": [415, 432], "shift": [33, 369], "shira": 301, "shirin": 301, "shm": [314, 315, 366], "short": [351, 372], "shorter": 36, "shot": [340, 369], "should": [24, 25, 33, 35, 36, 44, 57, 246, 256, 257, 263, 266, 267, 314, 316, 347, 349, 350, 355, 364, 365, 366, 372, 376, 387, 388, 390, 391, 394, 395, 399, 400, 401, 406, 413, 414, 416, 417, 423, 432], "show": [298, 302, 315, 324, 343, 355, 361, 367, 371, 376, 385, 387, 388, 391, 392, 395, 403, 405, 407], "showcas": 432, "shown": [303, 332, 351, 372, 390, 404, 408, 409, 428], "shrestha": 301, "shrink": 266, "shrunk": 266, "shuffl": [247, 421], "sid": 369, "siddhi": 301, "side": [325, 378, 379], "sidebar": [335, 380], "sight": 361, "sigmoid": 73, "sigmoid_focal_loss": 260, "sign": [269, 409, 423, 432], "signal": 25, "signextend16": 409, "signific": [303, 319, 325, 361, 372, 428], "significantli": [9, 302, 307, 354, 372, 406, 408], "silu": 73, "sim": 359, "simd": [399, 400, 404], "similar": [2, 28, 259, 260, 279, 319, 325, 347, 361, 370, 372, 375, 376, 391, 400, 403, 404, 406, 407, 419], "similarli": 32, "simpl": [1, 33, 36, 44, 256, 257, 260, 302, 356, 372, 376, 378, 385, 388, 400, 408, 418, 432], "simplest": [311, 375], "simpli": [340, 345, 346, 351, 352, 383, 384], "simplic": [355, 432], "simplifi": [270, 309, 319, 338, 358, 359, 387, 391, 420], "simul": [307, 390, 409, 410], "sin": [32, 83], "sinc": [35, 303, 349, 377, 384, 405, 406, 408, 432], "sine": 32, "singl": [1, 21, 266, 320, 336, 337, 350, 352, 361, 372, 402, 407], "single_lay": 24, "singlenod": 354, "sink": 429, "site": [361, 424], "situat": [316, 391, 406], "six": [323, 324, 331, 334, 336, 337, 338, 340, 343, 344, 357, 363, 368], "size": [25, 27, 28, 36, 37, 44, 83, 246, 256, 257, 258, 260, 264, 298, 302, 306, 314, 315, 354, 366, 372, 376, 385, 388, 390, 396, 399, 402, 404, 406, 407, 408, 413, 423, 425, 426, 429, 432], "size_t": [279, 281, 390, 401], "sizeof": 281, "skill": 316, "skip": [28, 289, 314, 315, 361, 402, 414, 432], "skip_special_token": [428, 432], "sky": 36, "skylak": [302, 361], "skylin": 377, "sl_pad": 281, "slice": [24, 40, 279, 396], "slice_desc": 279, "slice_position_id": 83, "slicemask": 150, "slicepositionid": 116, "slide": 382, "slight": 369, "slightli": [350, 351, 371, 388], "slimorca": 347, "slot": [266, 330, 332], "slot_into_contain": 266, "slow": [319, 370], "small": [25, 304, 307, 314, 336, 340, 349, 369, 372, 375, 376, 390, 405, 407, 420, 430], "smaller": [246, 303, 354, 372, 420], "smallest": 25, "smart": 409, "smooth": [264, 265, 327, 328, 329, 372], "smoothedvalu": 264, "smoothieewastaken": 301, "smoothquant": [270, 428], "smoothquantconfig": [247, 428], "snapshot": 427, "snip": 419, "snip_momentum": 28, "snippet": [319, 370, 402], "so": [0, 23, 25, 32, 35, 39, 40, 44, 57, 247, 264, 266, 303, 309, 314, 316, 321, 322, 332, 334, 349, 354, 355, 364, 371, 376, 386, 387, 390, 391, 394, 395, 400, 402, 403, 404, 405, 406, 408, 409, 410, 413, 416, 417, 419, 423, 426, 427, 428, 432], "social": 298, "socioeconom": 298, "sock": [317, 318], "socket": [314, 332, 349, 361, 397, 411, 425, 426], "softmax": [36, 83, 260, 279, 303, 398, 407, 408], "softmax_data_t": 281, "softmax_desc": 279, "softmax_param_t": 281, "softwar": [272, 302, 319, 361, 364, 365, 366, 369, 371, 415, 420], "solar": 309, "solid": 265, "solut": [319, 336, 337, 338, 358, 359, 361, 372, 406, 409, 420, 426, 427, 428], "solv": [258, 300, 354, 405, 406, 423], "some": [44, 57, 181, 195, 247, 266, 270, 302, 322, 332, 335, 338, 341, 345, 351, 358, 370, 372, 376, 379, 380, 381, 382, 383, 384, 385, 387, 388, 389, 390, 391, 394, 395, 396, 400, 401, 405, 409, 423], "someth": [243, 361], "sometim": [57, 391, 423], "soni": 301, "soon": 425, "sort": [25, 426, 427], "sort_objects_by_scor": 266, "sort_objects_left_to_right": 266, "sort_objects_top_to_bottom": 266, "sota": 354, "sound": 369, "soundfil": 369, "sourc": [0, 1, 2, 4, 5, 6, 8, 9, 14, 15, 17, 20, 21, 22, 23, 24, 25, 27, 28, 29, 30, 32, 33, 35, 36, 37, 39, 40, 41, 42, 44, 45, 47, 49, 50, 52, 53, 54, 55, 57, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 76, 77, 78, 79, 80, 81, 82, 84, 85, 86, 87, 88, 89, 90, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, 246, 247, 250, 251, 252, 255, 256, 257, 258, 259, 260, 262, 263, 264, 265, 266, 267, 268, 314, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 347, 349, 354, 357, 361, 362, 363, 368, 372, 383, 384, 385, 388, 400, 407, 413, 415, 420, 426, 432], "source_imag": 374, "source_op": 121, "sp": 279, "space": [298, 300, 390, 399, 402, 432], "spaci": [334, 371, 375], "spacy_model": [371, 375], "span": [36, 44, 266], "sparelib": 407, "spars": [55, 270, 272, 281, 302, 390, 398, 399, 408, 413, 420, 437], "sparse_lib_dump": 410, "sparse_lib_verbos": 410, "sparse_lib_vtun": 410, "sparse_matmul": [279, 410], "sparse_matmul_desc": [279, 398], "sparse_matmul_desc_t": 279, "sparse_matmul_t": 279, "sparse_ptr": 281, "sparse_ratio": 413, "sparse_schem": 281, "sparse_x_dens": 281, "sparse_x_spars": 281, "sparselib": [293, 390, 398, 436], "sparselib_verbos": 410, "sparsiti": [47, 55, 309, 397, 413, 419], "sparsity_al": 412, "sparsity_decay_typ": 28, "spatial": [263, 399, 405], "speak": 355, "speaker": [340, 369, 375], "spec_softmax_typ": 281, "spec_translnorm_typ": 281, "spec_typ": [281, 413], "special": [57, 248, 369, 372, 401, 407], "specif": [9, 35, 57, 256, 257, 265, 269, 270, 280, 282, 298, 299, 303, 309, 314, 316, 319, 326, 327, 328, 329, 330, 331, 332, 340, 349, 357, 361, 369, 371, 372, 385, 387, 390, 391, 399, 404, 405, 406, 412, 413, 416, 417, 418, 423, 429, 432], "specifi": [6, 23, 25, 29, 32, 48, 57, 246, 247, 264, 269, 270, 313, 316, 317, 318, 319, 330, 332, 334, 351, 365, 366, 369, 371, 372, 375, 391, 392, 396, 401, 405, 407, 413, 423, 430], "speech": [309, 319, 321, 340, 418, 420], "speechbrain": 369, "speecht5": [340, 355, 369], "speecht5_tt": 309, "speed": [36, 44, 315, 325, 337, 350, 361, 369, 387, 391, 425, 426, 427, 432], "speedup": [304, 314, 340, 349], "spell": 269, "spk_id": 375, "splice": 57, "split": [24, 57, 83, 289, 348, 349, 369, 372, 390, 399, 403, 405, 406, 428], "split_batch": 25, "split_output": 281, "spmm": [399, 407, 413], "spmm_desc": 398, "spmm_kern": 398, "spmm_type": 281, "spmm_vnni": 281, "spoken": 369, "spot": 428, "spr": [320, 337, 408], "spycsh": [364, 365], "sq": 428, "sq_config": 428, "sq_model": 428, "sql": [309, 334, 368], "sqlcoder": [309, 368], "sqlcoder2": 309, "sqrt": [73, 387, 391, 407], "squad": 304, "squadv1": 304, "squar": [33, 36, 44, 73, 350], "squareddiffer": [57, 73, 395], "squeez": 83, "src": [28, 281, 388, 401, 409, 413], "src0": [281, 413], "src1": [281, 389, 400, 413], "src1_perm": 389, "src2": [281, 400, 413], "src_data": 394, "src_data_typ": 413, "src_dt": 413, "src_k": 281, "src_m_": 394, "src_perm": 57, "src_q": 281, "src_shape": 394, "src_str": 57, "src_stride": 394, "src_t": 281, "src_v": 281, "srcptr": 281, "srcstride": 281, "srikanth": 301, "ssd": [281, 401, 413], "ssh": [314, 322, 410], "sshd_port": 349, "sshleifer": 304, "sst": [302, 304, 306, 418], "sst2": [289, 302, 304, 389, 393], "st": 391, "stabil": [387, 429], "stabilityai": 428, "stabl": [9, 25, 133, 134, 135, 136, 216, 217, 218, 221, 222, 223, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 272, 302, 308, 332, 346, 347, 348, 349, 396, 405, 420, 432], "stable_diffus": 9, "stable_diffusion_v1_4": 425, "stable_diffusion_v1_5": 425, "stable_diffusion_v2_1": 425, "stablediffusion_bf16convert": 150, "stablediffusion_collectqdqinfo": 150, "stablediffusion_collectquantinfo": 212, "stablediffusion_explicitnhwctranspos": 150, "stablediffusion_explicitnhwctransposeqat": 150, "stablediffusion_insertquantnod": 150, "stablediffusion_mhareshap": 150, "stablediffusion_quantizefus": 150, "stablediffusion_reshapefus": 150, "stablediffusioninstructpix2pixpipelin": 9, "stablediffusionsafetycheck": 9, "stablelm": 428, "stack": [73, 261, 408], "stage": [347, 350, 369], "stai": [32, 432], "stand": [57, 372, 377, 387], "standard": [35, 259, 342, 370], "stanford": [263, 314, 349], "stanford_alpaca": [314, 349], "star": 304, "starcod": [309, 363], "starcoder_peft_finetuned_model": 349, "start": [25, 36, 44, 57, 270, 314, 319, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 335, 336, 337, 338, 340, 341, 343, 344, 347, 349, 355, 356, 360, 363, 366, 367, 370, 378, 379, 380, 381, 382, 384, 389, 395, 414, 435], "start_end_logit": 150, "start_pipelin": 49, "start_posit": [36, 44], "start_step": [28, 419], "startendlogit": 214, "startup": [361, 372], "stat": [25, 410], "state": [9, 25, 36, 39, 40, 44, 246, 302, 372, 397, 411, 429], "state_dict": 25, "static": [38, 57, 251, 278, 281, 300, 304, 305, 306, 350, 389, 392, 400, 403, 405, 418, 430], "static_addr": 400, "static_group": 247, "staticquantconfig": 247, "statist": [25, 255], "statsit": 25, "statu": [270, 298, 334, 366, 417, 423], "status_update_r": 361, "std": [278, 279, 280, 281, 398, 400, 401], "stderr": [17, 361], "stdev": 25, "stdout": [345, 361, 383, 384], "steadili": 340, "stella": [372, 376], "step": [21, 25, 47, 57, 246, 256, 257, 307, 314, 315, 321, 345, 347, 349, 352, 372, 383, 384, 386, 387, 389, 391, 392, 393, 394, 395, 396, 400, 405, 407, 408, 413, 420, 425, 432], "step0": 401, "step1": [400, 401, 408], "step2": [40, 400, 401, 408], "step3": [401, 408], "still": [47, 57, 269, 272, 340, 382, 395, 402, 420, 423, 427], "stop": [24, 361], "stopforward": 24, "stopgradi": 73, "storag": [25, 278, 370, 372], "store": [25, 28, 30, 35, 243, 260, 319, 334, 338, 357, 358, 370, 387, 391, 392, 395, 396, 399, 400, 401, 402, 403, 405, 406, 407, 409, 429], "store2str": 30, "store_fil": 30, "stori": 361, "str": [0, 6, 21, 23, 27, 28, 35, 36, 44, 45, 57, 95, 184, 246, 247, 250, 251, 255, 264, 267, 268, 289, 371, 372, 376, 421], "str2list": 57, "straight": 57, "straightforward": [309, 319, 357, 378], "strategi": [129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 151, 152, 153, 154, 155, 156, 157, 159, 160, 161, 162, 163, 164, 165, 166, 167, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 182, 183, 185, 186, 187, 188, 189, 190, 191, 193, 194, 196, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 216, 217, 218, 221, 222, 223, 224, 225, 226, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 330, 332, 370, 372, 402], "stream": [25, 43, 316, 340, 363, 394, 425, 426], "stream_mod": [336, 340, 375], "stream_t": 278, "streamer": [429, 432], "streamlin": [347, 372], "strict": [247, 373], "strictli": [351, 394], "stride": [394, 399], "strided_slic": 83, "stridedslic": 120, "string": [0, 23, 30, 47, 57, 62, 243, 244, 247, 262, 266, 268, 280, 303, 336, 351, 371, 387, 390, 391, 394, 401, 416, 417, 419], "strong": 372, "stronger": 376, "strongli": [345, 383, 384], "struct": [279, 281, 400, 401], "structur": [57, 266, 304, 319, 364, 365, 366, 372, 387, 388, 390, 404, 408, 412, 419], "structure_model_path": 359, "student": [303, 304], "studio": [308, 327, 328, 329], "style": [0, 25, 300, 345, 346, 347, 349, 352, 361, 383, 384], "sub": [49, 57, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 151, 152, 153, 154, 155, 156, 157, 159, 160, 161, 162, 163, 164, 165, 166, 167, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 182, 183, 185, 186, 187, 188, 189, 190, 191, 193, 194, 196, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 266, 337, 375, 387, 390, 391, 400, 407, 408], "sub_func": 281, "sub_func_level": 413, "sub_graph": [58, 387, 390], "subclass": [25, 95, 184, 246, 278, 279], "subdir": 413, "subdirectori": 386, "subfold": 351, "subfunc_level": [281, 413], "subfunc_level_max": [281, 413], "subgraph": [49, 57, 215, 390, 392], "subgraph_match": [150, 390], "subgraphmatch": [215, 390], "subject": [268, 309, 351, 415], "submit": [300, 302, 347], "submodul": [9, 24, 330, 332, 386, 388, 432], "suboptim": 407, "subsampl": 25, "subsequ": [24, 390, 405, 408], "subset": [25, 350], "subsidiari": 415, "substanti": [319, 370, 372, 429], "substitut": [23, 313, 316, 317, 318, 363], "subtask": 371, "subtoken": 23, "succeed": 302, "success": [308, 378], "successfulli": [313, 316, 317, 318, 363, 366, 387, 395], "successor": 403, "sudhanshu": 301, "sudo": [323, 324, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 357, 363, 368, 369], "suggest": [305, 307, 309, 346, 348, 349, 352, 400], "suit": [309, 357, 369, 372, 378], "suitabl": [324, 327, 328, 329, 338, 361, 372, 405], "sum": [36, 264, 289, 303, 389, 406, 408, 409, 413], "summar": [303, 304, 309, 316, 319, 349, 430], "summari": [289, 425], "summit": 432, "sumptr": 281, "sumstep": 281, "sunak": 377, "super": [50, 387, 390], "supercel": 266, "supercell1": 266, "supercell2": 266, "supercharg": 420, "superclass": 9, "superior": [342, 370], "supervis": [256, 257, 420], "suppli": [24, 25, 391, 395, 396], "support": [8, 24, 25, 28, 30, 45, 48, 57, 62, 184, 264, 266, 281, 306, 308, 314, 319, 321, 325, 336, 337, 345, 349, 350, 354, 361, 364, 365, 366, 367, 369, 371, 372, 375, 378, 383, 384, 386, 387, 388, 389, 390, 394, 395, 401, 405, 406, 408, 410, 412, 413, 418, 419, 421, 422, 429, 433], "supported_pattern": 387, "supported_typ": 28, "supported_valu": 28, "suppress": 266, "surav": 301, "sure": [57, 181, 266, 289, 308, 309, 314, 315, 316, 317, 318, 321, 323, 324, 326, 330, 332, 335, 337, 343, 349, 355, 361, 362, 366, 372, 380, 387, 402, 413], "surfac": 319, "surgeon": 432, "surround": 361, "sw": 306, "swag": 304, "sweep": 181, "sweet": [428, 430], "sweetnotebook": 304, "switch": [25, 314, 331, 332, 335, 349, 380], "swizzl": 409, "sy": [22, 247, 361], "sym": 247, "symbol": [44, 262, 373], "symmetr": [57, 395, 405, 413, 423], "synaps": 425, "sync": [330, 405], "synchron": 264, "synchronis": 402, "synchronize_between_process": 264, "synthes": 369, "synthesi": 369, "sys_nic": [314, 315, 347, 366], "sysroot_linux": 426, "system": [0, 35, 302, 313, 314, 316, 324, 327, 328, 329, 333, 339, 342, 361, 370, 372, 378, 386, 425], "system_messag": 0, "systemat": 428, "systemctl": 334, "t": [21, 25, 36, 44, 57, 256, 257, 258, 266, 279, 281, 303, 313, 314, 315, 316, 317, 318, 347, 349, 350, 351, 361, 372, 385, 394, 396, 399, 400, 402, 405, 407, 408, 409, 413, 426], "t5": [272, 302, 304, 314, 363, 430], "ta": 301, "tab": [335, 380], "tabl": [266, 323, 324, 326, 327, 328, 329, 330, 331, 332, 336, 338, 340, 343, 344, 363, 366, 372, 390, 401, 409], "table_bbox": 266, "table_object": 266, "table_span": 266, "table_structur": 266, "table_structure_to_cel": 266, "tabul": 351, "tacotron": 262, "tag": 35, "tail": [57, 395, 410], "tailor": [309, 319, 357, 361, 372], "take": [24, 25, 298, 303, 314, 319, 335, 337, 349, 370, 380, 389, 391, 394, 400, 408, 409], "taken": [25, 36, 44, 371, 379], "talent": 409, "talk": [341, 381, 387, 420], "talker": 355, "talkingbot": [309, 319, 320, 339, 340, 369], "talli": 25, "tangobert": 430, "tangobertnotebook": 304, "tanh": [73, 394, 401, 413], "tanspos": 413, "target": [27, 246, 256, 257, 258, 260, 264, 314, 315, 347, 349, 389, 409, 419], "target_include_directori": 388, "target_link_librari": 388, "target_node_nam": 57, "target_s": [20, 256, 257, 260], "target_spars": [28, 419], "target_sparsity_ratio": 306, "task": [44, 45, 246, 270, 302, 303, 304, 309, 314, 316, 319, 321, 340, 346, 348, 349, 350, 354, 368, 369, 371, 372, 375, 393, 401, 407, 410, 418, 426, 427, 428, 430], "task_nam": [392, 393], "task_typ": 422, "tasks_list": [309, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 338, 340, 343, 344, 375], "tasktyp": 422, "tatr": 359, "tatsu": 349, "taught": 361, "tbd": 322, "tce": 361, "tcmalloc": 354, "tdp": 385, "tdpbf16p": 403, "tdpbssd": 405, "teach": 350, "teacher": [303, 304], "teacher_model": [246, 303, 306], "team": 298, "tech": [272, 319, 420], "techcrunch": 420, "technic": 420, "techniqu": [270, 302, 304, 306, 354, 423], "technologi": [319, 325, 405], "tee": 330, "tel2p1": [425, 426], "tell": [4, 309, 311, 316, 318, 319, 321, 361, 364, 365, 366, 367, 370, 375, 389, 391, 400, 401], "temperatur": [259, 303, 321, 371, 376, 428, 432], "templat": [0, 23, 279, 281, 314, 341, 349, 381], "temporari": [298, 407], "temporarili": 298, "ten": [369, 420], "tendenc": 372, "tendorflow": 243, "tensor": [23, 24, 25, 27, 32, 33, 36, 37, 38, 39, 40, 41, 44, 49, 52, 53, 54, 55, 57, 62, 83, 111, 181, 243, 244, 246, 256, 257, 258, 260, 263, 264, 281, 332, 387, 388, 389, 391, 392, 394, 396, 407, 412, 413, 421, 423], "tensor_desc": [280, 401], "tensor_dtyp": 280, "tensor_ftyp": 280, "tensor_list": 55, "tensor_nam": [55, 62, 243], "tensor_shap": 280, "tensorflow": [50, 53, 63, 64, 66, 67, 68, 69, 74, 76, 81, 84, 85, 88, 89, 90, 92, 93, 96, 97, 99, 100, 103, 105, 106, 109, 119, 120, 124, 243, 269, 299, 303, 306, 308, 361, 388, 395, 423], "tensorflowextractor": 53, "tensorflowmodel": [53, 243], "tensorslicedataset": 73, "tent": 247, "teq": 432, "teqconfig": [247, 432], "term": [303, 347, 372, 404, 407, 409, 415, 416, 417, 421, 423], "termin": 313, "tesseract": 359, "test": [265, 266, 269, 299, 302, 304, 319, 325, 330, 332, 346, 358, 372, 397, 411, 413, 414, 425, 426], "test_": 413, "test_doc": 338, "test_finetuning_data": 314, "test_infer": 315, "test_spmm_vnni_kernel": 398, "text": [9, 266, 272, 289, 302, 304, 309, 314, 319, 320, 321, 324, 331, 332, 333, 336, 340, 342, 343, 344, 348, 349, 350, 354, 363, 370, 371, 372, 376, 378, 379, 382, 393, 401, 410, 415, 418, 426, 427, 428, 430, 432], "text2imag": [309, 321, 375], "text2speech": 369, "text_classifi": 418, "text_encod": 9, "text_gen": 316, "text_gen_qa": 316, "text_gen_summari": 316, "text_gener": [312, 316, 354, 364, 365, 366], "text_to_sequ": 262, "text_to_speak": 369, "textattack": [304, 392], "textbot": [340, 345, 383, 384], "textbot_vllm": 367, "textchat": [309, 311, 321, 323, 326, 327, 328, 329, 330, 338, 375], "textchatclientexecutor": 375, "textencdoer_word_embed": 150, "textencoder_attentionmaskaddreshap": 150, "textencoder_attentionreshap": 150, "textencoder_casualattentionmask": 223, "textencoder_causal_attention_mask": 150, "textencoder_kvreshap": 150, "textencoder_mulreshap": 150, "textencoder_qreshap": 150, "textencoder_softmaxreshap": 150, "textencoder_wordembed": 216, "textencoderv1": [216, 233, 234, 238, 239, 240], "textgen": [309, 312], "textgenerationfinetuningconfig": 319, "textract": 359, "textstream": [429, 432], "texttospeech": 369, "textual": [349, 379], "textunderscor": 407, "textvoicechatexecutor": 311, "textvqa": 350, "tf": [36, 57], "tf_checkpoint_path": 36, "tf_dtype": [62, 243, 244], "tf_dtype_id": 243, "tf_extract_oper": 243, "tf_extractor": [50, 51], "tf_util": 58, "tgi": [309, 312], "tgi_engine_param": 363, "tgi_serv": 312, "th": [57, 391], "than": [25, 29, 255, 258, 265, 266, 289, 337, 346, 348, 349, 352, 359, 372, 376, 389, 390, 391, 400, 405, 407, 412, 413, 423, 426, 432], "thank": [301, 353, 359, 360, 374], "thch": 369, "theblackcat102": 349, "thei": [21, 32, 35, 57, 247, 266, 298, 303, 316, 355, 361, 372, 386, 395, 396, 399, 400, 401, 403, 407, 413], "them": [24, 25, 32, 39, 40, 49, 52, 53, 57, 266, 268, 270, 335, 347, 350, 352, 361, 372, 380, 387, 388, 391, 400, 403, 405, 408, 409, 423], "therebi": [370, 372], "therefor": [57, 305, 319, 372, 399, 404, 407, 409, 423], "thi": [0, 5, 9, 16, 17, 18, 19, 20, 21, 24, 25, 32, 36, 37, 44, 47, 50, 57, 220, 246, 247, 256, 257, 258, 259, 260, 264, 266, 270, 278, 279, 280, 281, 298, 300, 302, 303, 305, 307, 309, 316, 319, 322, 323, 324, 326, 327, 328, 329, 330, 331, 332, 333, 334, 335, 336, 337, 338, 339, 340, 342, 343, 344, 345, 347, 348, 349, 350, 353, 354, 355, 356, 357, 358, 359, 360, 361, 363, 364, 365, 368, 369, 371, 372, 373, 374, 375, 376, 377, 378, 380, 383, 384, 385, 387, 388, 389, 390, 391, 394, 395, 396, 399, 400, 401, 402, 403, 404, 405, 406, 407, 408, 409, 412, 413, 415, 416, 418, 420, 423, 425, 426, 427, 428, 429, 432], "thing": 390, "think": 377, "thinner": 407, "third": [57, 300, 391, 404, 409, 415], "those": [24, 25, 36, 44, 266, 348, 349, 355, 361, 407, 423], "though": [336, 337, 338, 358], "thought": 391, "thread": [314, 349, 371, 397, 421], "thread_elt_offset": [281, 400], "thread_num": 281, "threat": 361, "threaten": 298, "three": [47, 302, 337, 340, 352, 356, 372, 387, 391, 395, 407, 408, 423], "threshold": [55, 260, 266, 372, 413, 429], "through": [23, 262, 270, 272, 289, 302, 309, 314, 324, 327, 328, 329, 333, 334, 336, 337, 338, 339, 340, 342, 343, 344, 349, 352, 361, 363, 365, 372, 377, 387, 400, 403, 404, 405, 410, 422], "throughout": 361, "throughput": [289, 302, 307, 342, 365, 367, 370, 397, 405, 425], "througput": 319, "throw": [24, 25], "thu": [258, 354, 372, 404, 423], "thudm": 309, "tid": 402, "tidm": 402, "tidn": 402, "tight": 402, "tiiuae": [349, 428], "tile": [65, 73, 111, 399, 403, 405, 407, 408, 409, 413], "tile_gemm": 402, "tile_k": 405, "tile_m": [405, 413], "tile_n": 413, "tile_w": 281, "tiledcol": 402, "tiledindex": 402, "tiledrow": 402, "tileloadd": 403, "tilem": 281, "tilen": 281, "tilestor": 405, "till": 57, "timbrook": 337, "time": [24, 25, 37, 314, 315, 316, 319, 321, 325, 354, 361, 364, 370, 371, 372, 379, 382, 385, 389, 396, 399, 400, 402, 403, 404, 405, 406, 407, 408, 409, 411, 413, 414, 420, 423, 425, 426, 427, 428, 429, 432], "tini": [44, 304, 355, 430], "tinybert_general_4l_312d": 304, "titl": [389, 415], "titsworth": 301, "tmp": [281, 401, 403, 405, 408], "tmp1": 409, "tmp2": 409, "tmp2m": 281, "tmp3": 409, "tmp4": 409, "tmp_trainer": 302, "to_": 25, "to_diff_dict": 247, "to_gradio_chatbot": 0, "to_json_fil": 247, "to_openai_api_messag": 0, "todai": 371, "todo": [281, 431], "togeth": [24, 25, 302, 361, 369], "togethercomput": 428, "toi": 372, "token": [5, 9, 23, 32, 36, 44, 247, 266, 289, 302, 304, 306, 307, 314, 324, 342, 348, 349, 354, 370, 371, 372, 375, 385, 392, 396, 418, 425, 428, 429, 430, 432], "token_idx": [37, 40, 41], "token_typ": 36, "token_type_embed": [150, 387], "token_type_embeddings_v1": [150, 387], "token_type_id": [33, 36, 44, 396], "tokenclassifieroutput": [33, 36, 44], "tokenizer_class": 349, "tokenizer_config": 349, "tokenizer_dir": 393, "tokenizer_nam": [314, 315, 349, 352], "tokenizer_name_or_path": 375, "tokens_in_t": 266, "tokentypeembed": [224, 387], "tokentypeembeddingsv1": [225, 387], "tokentypeid": 73, "toler": 416, "tolerable_loss": 423, "tomaarsen": 38, "tone": 369, "too": [57, 314, 315, 372, 387, 399, 400, 405], "tool": [302, 304, 314, 315, 321, 349, 369, 372, 373, 389, 396, 398, 413, 420], "toolkit": [272, 302, 304, 361, 362, 363, 420], "top": [25, 57, 266, 272, 302, 356, 369, 376, 382, 404, 420, 421, 425], "top1": 347, "top2": 376, "top200": 376, "top60": 376, "top_k": [83, 371], "top_n": [14, 372], "topic": [270, 335, 372, 380], "topk": [25, 122, 264], "topologi": 50, "tor": 246, "torch": [9, 23, 24, 25, 27, 28, 32, 33, 36, 37, 39, 40, 41, 44, 54, 127, 244, 246, 260, 261, 264, 289, 302, 303, 314, 315, 330, 332, 340, 349, 353, 374, 418, 421, 422, 426, 427, 432], "torch_ccl": [314, 349], "torch_ccl_path": [314, 332, 349], "torch_cuda_arch_list": 330, "torch_dtyp": [346, 422, 432], "torch_embed": 150, "torch_extract_oper": 244, "torch_extractor": 51, "torch_ip_insert_bia": 150, "torch_unpack_baddbmm": 150, "torch_util": 58, "torchaudio": [332, 340], "torchembed": 226, "torchextractor": 54, "torchinnerproductinsertbia": 227, "torchinsertbf16nod": [150, 189], "torchpaddingsequ": 230, "torchpaddingsqu": 150, "torchprofil": 304, "torchrun": 352, "torchscript": [27, 28, 54, 115, 244, 246, 289], "torchunpackbaddbmm": 228, "torchvis": [255, 264, 332, 361], "total": [28, 29, 36, 57, 289, 310, 314, 349, 385, 391, 395, 402, 409, 410, 425, 426], "total_token": 361, "total_val_output": 351, "toward": 298, "tpp": 332, "tpp_cache_remapped_weight": 332, "tr": 24, "trace": [24, 27, 28, 289, 315, 345, 372, 383, 384, 389], "tracedict": 24, "track": [23, 25, 264, 319], "trade": [319, 432], "trademark": 302, "tradeoff": [306, 403], "tradit": [361, 370, 401, 429], "traffic": [319, 349, 370], "train": [1, 5, 17, 25, 28, 47, 246, 265, 272, 302, 303, 306, 314, 319, 348, 354, 372, 412, 419, 420, 428, 430, 432], "train2017": 350, "train_asr": 353, "train_backbon": 255, "train_batch_s": 247, "train_data": 376, "train_dataload": 247, "train_dataset": [247, 302, 306], "train_dir": 348, "train_fil": [314, 349], "train_func": [246, 247], "train_group_s": 376, "train_imag": 350, "train_it": 247, "train_len": 247, "train_pad": 247, "train_pad_v": 247, "train_shuffl": 247, "train_transl": 353, "train_translation_revers": 353, "trainabl": [354, 432], "trainer": [286, 302, 304, 305, 306], "training_step": 246, "training_step_length_adapt": 246, "trainingargu": 376, "tranform": [330, 332], "transcrib": 369, "transcript": [309, 321, 340, 369], "transfer": [246, 303, 336, 361], "transform": [9, 17, 23, 256, 257, 259, 269, 270, 289, 293, 299, 300, 303, 307, 309, 313, 314, 315, 316, 319, 320, 323, 324, 326, 330, 331, 332, 337, 343, 346, 347, 348, 349, 350, 352, 354, 355, 359, 364, 365, 366, 369, 372, 374, 376, 386, 387, 388, 390, 392, 395, 396, 400, 401, 406, 407, 408, 409, 413, 415, 416, 417, 418, 419, 420, 422, 423, 424, 425, 426, 427, 428, 429, 432, 436], "transformer2dmodel_attentionmaskaddreshap": 150, "transformer2dmodel_constantofshapewithmul": 150, "transformer2dmodel_encoderhiddenstatesreshap": 150, "transformer2dmodel_ffninputslic": 233, "transformer2dmodel_ffninputslice_1": 234, "transformer2dmodel_ffnslic": 150, "transformer2dmodel_ffnslice_1": 150, "transformer2dmodel_getsamplebatch": 150, "transformer2dmodel_qkvprereshap": 150, "transformer2dmodel_qkvreshap": 150, "transformer2dmodel_qkvreshape4d": 150, "transformer2dmodel_qkvreshapeto4d": 237, "transformer2dmodel_sampleslic": 150, "translat": [25, 304, 309, 321, 340, 430], "transparam": 20, "transpos": [36, 44, 55, 83, 108, 389, 390, 398, 399, 403, 405, 406, 409, 413, 421, 437], "transpose_4b_8x8": 407, "transpose_batch_matmul": [150, 387], "transpose_copy_param": 281, "transpose_for_scor": [36, 44], "transpose_matmul": 279, "transpose_matmul_desc": 279, "transpose_mha": 279, "transpose_mha_desc": 279, "transpose_mha_io": 281, "transpose_mha_io_max": 281, "transpose_mha_step1_param": 281, "transpose_mha_step2_param": 281, "transpose_mha_step3_param": 281, "transpose_mode_int8": 55, "transposebatchmatmul": [73, 241, 387], "transposit": 409, "travel": 361, "treat": [258, 387], "tree": [43, 266, 315, 352, 354, 373], "trend": 370, "trial": 246, "trie": 373, "trigger": [248, 372], "tripathi": 301, "triton": 309, "triton_backend": 366, "triton_cli": 366, "triton_inference_serv": 366, "triton_neuralchat": 364, "triton_neuralchat_gpu": 365, "tritoncli": 366, "tritonserv": [364, 365, 366], "triumph": 361, "troll": 298, "true": [9, 17, 23, 24, 25, 28, 36, 44, 55, 246, 247, 250, 251, 256, 257, 260, 264, 266, 288, 302, 305, 306, 307, 314, 317, 319, 324, 326, 330, 332, 334, 336, 338, 340, 344, 346, 347, 348, 349, 350, 352, 354, 358, 361, 363, 366, 371, 372, 373, 374, 375, 376, 386, 387, 389, 390, 396, 400, 401, 407, 410, 413, 416, 417, 422, 423, 428, 429, 432], "true_sequenti": 247, "truncat": [29, 302, 349], "trust": [314, 349, 420], "trust_remote_cod": [307, 349, 429, 432], "truth": [256, 257, 258, 288, 376], "truthfulqa": [346, 425], "truthfulqa_mc": 349, "try": [309, 361, 363, 372, 375, 390, 423], "ts_desc": [280, 398, 401], "ts_descs_": [280, 401], "tsai": 301, "tsmodelforcausallm": 428, "tt": [309, 319, 322, 334, 336, 340, 375, 402], "tts_finetun": 355, "tts_multilang": [336, 340, 369, 375], "ttsdatasetargu": 355, "ttsmodelargu": 355, "tune": [5, 55, 246, 270, 272, 302, 303, 309, 320, 357, 372, 376, 415, 416, 417, 419, 420, 422, 428, 432, 439], "tune_metr": [302, 419], "tuning_criterion": 423, "tuningcriterion": 423, "tunnel": 322, "tupl": [29, 32, 33, 36, 37, 40, 41, 44, 45, 57, 258, 260], "turbo": [324, 397, 411, 425, 426], "turn": [370, 407], "tutori": [270, 302, 347, 388, 426, 427], "tweak": 44, "twice": [17, 402, 408], "two": [24, 25, 57, 256, 257, 266, 270, 300, 303, 314, 316, 330, 332, 340, 347, 349, 350, 351, 355, 369, 371, 372, 373, 376, 384, 387, 390, 391, 393, 394, 396, 400, 401, 403, 406, 407, 408, 409, 417, 418, 419, 423], "twofold": 407, "tx": 20, "txt": [265, 302, 308, 309, 315, 319, 322, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 346, 348, 349, 352, 353, 354, 355, 357, 358, 359, 360, 361, 363, 368, 370, 371, 372, 373, 374, 376, 377, 383, 384, 387, 388, 390, 393, 410, 412, 426, 427, 432], "txt2img": [335, 380], "ty": 20, "tyler": 301, "type": [2, 4, 21, 25, 27, 28, 29, 35, 36, 45, 52, 53, 54, 57, 62, 95, 184, 243, 244, 246, 247, 258, 264, 282, 302, 303, 304, 305, 309, 313, 316, 317, 318, 319, 321, 324, 340, 361, 363, 367, 372, 375, 388, 389, 390, 392, 395, 398, 400, 401, 406, 412, 413, 416, 417, 419, 421, 422, 423, 425], "type1": 395, "type2": 395, "typedef": 281, "typeerror": 24, "typenam": [279, 281], "typic": [0, 319, 369, 370, 372, 409, 432], "typing_extens": [323, 324, 331, 334, 336, 337, 338, 340, 343, 344, 357, 363, 368], "u": [9, 332, 340, 351, 353, 355, 412, 428], "u8": [158, 392, 394, 401, 408, 413], "u8s8": 305, "u8u8": 305, "ubuntu": [302, 369, 397, 411], "ubuntu22": [313, 314, 315, 316, 317, 318], "ubuntu_v": [314, 315, 349], "ui": [378, 382], "uint64_t": 280, "uint8": [28, 407, 423], "uint8_t": [281, 400, 401], "uiuc": [309, 313, 331], "uk": 377, "ultim": [309, 357, 423], "ultra": 420, "ultrachat": 349, "un": [258, 412], "unabl": 363, "unaccept": 298, "unawar": 372, "unbox_numpy_nul": 25, "unbreak": 361, "uncas": [36, 302, 304, 306, 392, 418], "uncased_swag": 304, "uncertain": 300, "unchang": 25, "undef": [280, 281, 400, 401], "under": [158, 246, 302, 309, 340, 351, 353, 355, 364, 365, 366, 372, 373, 387, 388, 389, 392, 406, 413, 415], "understand": [316, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 338, 340, 343, 344, 363, 372, 402, 405], "unencumb": 349, "unet": 9, "unet2dconditionmodel": 9, "unexpect": 373, "unfix": 57, "unfortun": 409, "unifi": [346, 407], "uniform": 432, "uniformli": 25, "uninstal": 332, "unintellig": 402, "union": [25, 246, 266, 281], "uniqu": [266, 272, 302, 349, 372, 378], "unique_assign": 266, "unit": [269, 319, 325, 361, 398, 405], "unit_test_util": 401, "unittest": [314, 315], "univers": [266, 314, 349], "unknown": 361, "unleash": 420, "unlik": [400, 429], "unlock": [420, 430], "unnorm": [256, 257], "unordered_map": [280, 401], "unpack": [83, 246, 387], "unprocess": 361, "unquant": 432, "unreach": 375, "unref": 394, "unref_tensor": 394, "unrefernc": 394, "unrel": 354, "unrol": [390, 402, 404], "unseen": 423, "unset": 363, "unsign": [409, 413], "unslic": 36, "unsqueez": [32, 83, 387], "unsqueeze_dim": 32, "unstructur": [304, 371, 372, 419], "until": 405, "untouch": 32, "unus": [32, 247, 401], "unwelcom": 298, "up": [24, 25, 36, 37, 44, 247, 315, 323, 324, 326, 327, 328, 329, 330, 331, 332, 333, 334, 336, 337, 338, 339, 340, 342, 343, 344, 350, 351, 363, 364, 366, 372, 387, 389, 391, 396, 401, 413], "updat": [0, 25, 38, 39, 40, 247, 308, 309, 330, 332, 334, 345, 372, 377, 383, 384, 386, 388, 406, 419, 426, 432], "update_config": 47, "update_keys_to_ignor": 44, "update_last_messag": 0, "upgrad": [309, 315, 321, 426], "upload": [13, 321, 334, 336, 341, 372, 378, 379, 381, 382], "upload_link": 309, "upon": [321, 361, 426, 427, 428, 429, 432], "upper": [30, 322, 401], "upper_bound": 413, "upper_constraint": 30, "upsampl": 260, "upstag": 309, "upto": 24, "upto_lay": 24, "url": [309, 319, 332, 345, 359, 361, 364, 365, 366, 372, 383, 384, 415], "url_of_pdf": 359, "us": [0, 4, 9, 14, 18, 23, 24, 25, 27, 28, 29, 32, 35, 36, 38, 44, 49, 57, 62, 95, 184, 195, 220, 243, 246, 247, 259, 260, 265, 269, 270, 288, 289, 298, 300, 302, 303, 307, 308, 311, 312, 313, 316, 317, 318, 319, 320, 322, 323, 324, 325, 326, 327, 328, 329, 330, 331, 334, 335, 336, 337, 338, 340, 343, 344, 345, 346, 347, 348, 349, 350, 351, 352, 354, 355, 356, 357, 358, 361, 363, 366, 368, 369, 370, 371, 372, 373, 375, 376, 377, 379, 380, 383, 384, 385, 386, 387, 389, 390, 391, 392, 393, 394, 395, 396, 397, 399, 400, 401, 402, 403, 404, 406, 407, 408, 409, 410, 411, 413, 415, 416, 417, 418, 419, 420, 421, 422, 423, 425, 426, 427, 428, 429, 432], "usag": [9, 269, 300, 302, 308, 335, 380, 416, 417, 421, 422, 429, 432], "use_aot_devlist": 432, "use_auth_token": [346, 352], "use_cach": [33, 36, 40, 41, 44], "use_cpu": [346, 354], "use_deepspe": [1, 326, 330, 349], "use_diff": 247, "use_double_qu": 247, "use_fast_token": [314, 349, 352, 354, 422], "use_full_rang": 247, "use_ggml": 247, "use_gptq": [426, 427], "use_gpu_for_search": 376, "use_gradient_checkpoint": 422, "use_habana": [314, 346, 347, 349, 350, 352], "use_hpu_graph": 315, "use_hpu_graphs_for_train": 346, "use_inbatch_neg": 376, "use_kv_cach": 315, "use_lazy_mod": [314, 346, 347, 349, 350, 352], "use_mpi": [1, 346, 349, 352], "use_mse_search": 247, "use_mxfp4": 332, "use_neural_spe": [4, 247, 319, 327, 328, 329, 422], "use_qu": 247, "use_tpp": 332, "useless": [314, 315, 401], "user": [11, 12, 13, 15, 36, 44, 47, 57, 272, 273, 289, 292, 295, 302, 305, 309, 316, 319, 321, 324, 325, 330, 332, 334, 336, 337, 338, 347, 348, 349, 355, 356, 357, 358, 361, 369, 370, 372, 373, 375, 377, 378, 382, 385, 387, 389, 391, 393, 396, 405, 407, 410, 413, 417, 418, 421, 430, 435, 438], "user_model": 307, "userwarn": 361, "usr": [308, 309], "usual": [303, 314, 349, 391, 399, 409, 423], "uszkoreit": [36, 44], "ut": [398, 401], "util": [23, 29, 44, 57, 62, 243, 244, 256, 257, 270, 288, 309, 319, 321, 323, 326, 330, 331, 332, 334, 338, 345, 357, 358, 361, 370, 371, 372, 373, 383, 384, 387, 395, 399, 406, 409, 413, 432], "uvicorn": [361, 366], "v": [20, 25, 260, 302, 308, 313, 314, 315, 316, 317, 318, 322, 330, 337, 351, 364, 365, 366, 387, 407, 408, 420], "v0": [309, 347, 349, 427, 428], "v1": [9, 38, 288, 304, 309, 313, 316, 317, 318, 321, 324, 340, 346, 349, 350, 351, 359, 361, 363, 364, 367, 372, 376, 421, 425, 427, 428, 429], "v2": [269, 288, 309, 326, 330, 332, 364, 365, 366, 372, 376, 428], "v3": [309, 316, 319, 321, 324, 331, 334, 338, 340, 343, 344, 345, 361, 363, 364, 366, 371, 375, 383, 384, 420, 427, 432], "v4": 9, "v5": 269, "v_bia": 281, "v_proj": [346, 347, 349, 354], "v_scale": 281, "v_weight": 281, "vaddp": 400, "vae": 9, "val": [55, 57, 350], "valhalla": 304, "valid": [57, 246, 289, 303, 306, 309, 321, 348, 354, 375, 395, 415, 424, 432, 438], "validation_accounting_1": 351, "validation_architecture_and_engineering_14": 351, "validation_dir": 348, "validation_electronics_28": 351, "validation_electronics_29": 351, "validation_split_percentag": 349, "valu": [7, 24, 25, 27, 28, 36, 39, 40, 44, 47, 57, 62, 243, 244, 246, 247, 256, 257, 260, 264, 267, 281, 302, 303, 321, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 338, 340, 343, 344, 345, 363, 367, 371, 372, 375, 376, 383, 384, 387, 388, 389, 390, 391, 400, 401, 402, 403, 405, 407, 408, 413, 416, 417, 419, 423, 428, 429, 432], "valuabl": [0, 361], "value_error": 361, "value_st": [39, 40], "var": [57, 281, 317, 318], "var_in": 281, "var_out": 281, "vari": [371, 397, 411, 425, 426, 428], "variabl": [281, 314, 322, 324, 335, 341, 349, 361, 370, 379, 380, 381, 382, 388, 391, 394, 413, 414], "varianc": [25, 395, 406], "variant": [9, 404], "variat": 9, "varieti": [325, 372], "variou": [259, 268, 270, 309, 312, 319, 320, 323, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 336, 338, 339, 340, 342, 343, 344, 361, 363, 369, 372, 378], "vast": 372, "vastli": [314, 349], "vaswani": [36, 44], "vbroadcastss": 410, "ve": [319, 372, 418], "vec": [25, 403], "vec_align_len": 281, "vec_num_per_thr": 281, "vec_num_tail_thr": 281, "vec_tail_len": 281, "veca": 402, "vecb": 402, "vectara": 420, "vector": [23, 25, 278, 279, 280, 281, 357, 370, 394, 398, 400, 401, 402, 404, 407, 409], "vector_comput": [400, 401], "vector_databas": 372, "vectorstor": [309, 372], "vectorstoreretriev": [309, 357], "velankar": 301, "ventur": 361, "verbos": [246, 305], "veri": [25, 256, 257, 259, 316, 351, 396, 399, 401, 402, 405], "verifi": [346, 354, 410], "versatil": [361, 369, 372], "version": [24, 25, 35, 246, 259, 266, 269, 298, 302, 308, 314, 324, 326, 330, 337, 343, 345, 348, 349, 361, 362, 366, 372, 377, 383, 384, 390, 397, 411, 415, 425, 426, 427, 428], "versu": 266, "vfma": 410, "vfmadd": 404, "vfmadd231p": [404, 410], "vg": 350, "vg_100k": 350, "vg_100k_2": 350, "via": [25, 298, 330, 332, 348, 349, 351, 363, 372, 373, 400, 403, 410, 421, 432], "viath": 351, "video": [319, 321, 360, 369, 374, 420], "view": [38, 83, 300, 335, 380, 382, 389, 399, 424], "viewpoint": 298, "villag": 361, "vim": [330, 332], "vincyzhang": 299, "violat": [266, 335, 380], "virtual": [278, 279, 280, 323, 330, 361, 394, 400, 401], "vision": [350, 418, 420], "vision_tow": 350, "visit": [270, 302, 327, 328, 329, 345, 356, 383, 384, 397, 411, 425, 426], "visual": [256, 257, 265, 308], "visualgenom": 350, "vit": [9, 350, 369], "vits2": [340, 369], "vllm": [309, 312], "vllm_serv": 312, "vmovdqu32": 403, "vmovup": [401, 410], "vmware": 420, "vnni": [398, 399, 403, 407, 408, 411, 413, 421, 423, 437], "vnni_data_t": 281, "vnni_noperm_p2013_p1302": 407, "vnni_noperm_p2031_p1302": 413, "vnni_param_t": 281, "vocab": 354, "vocab_s": [33, 36, 44], "vocabulari": 36, "vocod": 369, "voic": [321, 336, 340, 341, 369, 378, 381], "voicechat": [309, 311, 321, 334, 375], "voicechat_api": 340, "void": [279, 280, 281, 394, 398, 400, 401, 402], "volatil": 395, "volum": 370, "vpaddb": 400, "vpxord": 410, "vqa": 350, "vtune": 415, "vv": 361, "vzeroupp": 410, "w": [21, 256, 257, 263, 385, 388, 389, 390, 399, 402, 408, 428, 432], "w4g32": 432, "w8": 432, "w8a8": [319, 432], "wa": [36, 266, 372, 377], "wai": [302, 309, 314, 319, 338, 349, 356, 358, 361, 363, 371, 372, 389, 390, 391, 395, 399, 401, 407, 410], "wait": 361, "walk": [327, 328, 329], "wall": 389, "wandb": 352, "wang": 301, "want": [24, 25, 28, 247, 289, 295, 309, 314, 340, 347, 350, 351, 369, 387, 389, 390, 392, 395, 396, 399, 400, 401, 413, 416, 421, 438], "warm": 396, "warm_up": 302, "warmup": [28, 289, 390, 396], "warmup_it": 390, "warmup_ratio": [314, 349], "warmup_step": [346, 347], "warn": [61, 264, 361], "wast": [396, 405, 406], "watt": [351, 385], "wav": [311, 319, 336, 340, 369, 375], "wav2vec2": 353, "wavelength": 36, "we": [0, 5, 23, 35, 44, 256, 257, 258, 266, 269, 270, 288, 295, 298, 302, 305, 309, 314, 315, 319, 324, 325, 326, 327, 328, 329, 330, 332, 335, 337, 338, 340, 345, 346, 347, 349, 350, 351, 352, 353, 354, 355, 357, 359, 360, 361, 364, 365, 366, 367, 368, 369, 370, 372, 373, 374, 376, 377, 380, 383, 384, 387, 388, 389, 390, 391, 392, 393, 394, 395, 396, 398, 399, 400, 401, 402, 403, 404, 405, 406, 407, 408, 409, 410, 413, 416, 417, 418, 422, 423, 425, 426, 427, 428, 429, 432, 438], "weak": 340, "web": [322, 372, 378], "webpag": 319, "websit": [345, 383, 384], "wed": 361, "week": 420, "wei": [281, 413], "weight": [32, 33, 36, 44, 128, 260, 265, 270, 281, 303, 305, 320, 354, 360, 369, 371, 377, 389, 390, 392, 399, 402, 403, 404, 408, 409, 413, 416, 417, 420, 421, 423, 428], "weight_8bit": 281, "weight_bf16": 281, "weight_data": 55, "weight_decai": [314, 349], "weight_dict": [256, 257], "weight_dtyp": [247, 319, 327, 328, 329, 371, 375, 429, 432], "weight_f8_e4m3": 281, "weight_f8_e5m2": 281, "weight_int8": 281, "weight_optim": 128, "weight_ratio": [250, 251, 416, 417, 423], "weight_typ": [281, 421], "weightonlyqu": 270, "weightpruningconfig": [28, 246], "welcom": [298, 300, 312, 320, 333, 339, 342, 369, 420], "welford": [281, 406], "well": [25, 36, 44, 260, 266, 305, 350, 372, 423, 424, 432], "wenxin": 415, "were": [247, 260, 361, 376], "wget": [319, 323, 324, 326, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 357, 363, 368, 369, 383, 384], "wgt_t": 281, "what": [5, 298, 316, 317, 324, 356, 358, 363, 372, 400, 401, 407, 419], "when": [0, 9, 23, 24, 25, 28, 36, 40, 243, 246, 256, 257, 264, 289, 298, 304, 309, 314, 315, 316, 317, 318, 319, 332, 349, 350, 363, 366, 370, 372, 385, 387, 388, 390, 391, 394, 395, 396, 399, 401, 402, 404, 405, 406, 408, 409, 413, 416, 417, 423, 425, 426, 427, 429], "where": [25, 32, 36, 44, 57, 83, 247, 255, 258, 263, 267, 303, 314, 322, 334, 338, 349, 355, 358, 376, 377, 382, 391, 399, 401, 404, 405, 407, 409, 413, 414, 422], "whether": [4, 9, 28, 35, 247, 264, 295, 316, 330, 332, 340, 349, 364, 371, 372, 373, 376, 378, 387, 389, 395, 413, 421, 438], "which": [5, 17, 24, 25, 29, 32, 35, 36, 44, 49, 52, 53, 54, 57, 195, 243, 246, 247, 255, 256, 257, 260, 265, 266, 270, 298, 303, 307, 308, 309, 314, 316, 319, 325, 330, 331, 332, 334, 335, 336, 340, 346, 347, 349, 350, 354, 361, 370, 372, 375, 376, 377, 380, 386, 387, 388, 390, 391, 395, 396, 399, 400, 401, 402, 403, 404, 405, 406, 408, 409, 412, 413, 416, 419, 421, 423], "while": [25, 57, 258, 288, 307, 335, 338, 358, 371, 372, 378, 380, 388, 391, 395, 402, 405, 408, 413, 423, 427, 432], "whisper": [309, 336, 340, 355, 369, 375], "whisper_larg": 425, "white": 402, "whitespac": 414, "whl": [332, 348, 349, 432], "who": [295, 298, 319, 361, 377, 401, 438], "whole": [25, 57, 304, 316, 389, 390, 404, 405, 406, 408, 410], "whom": 377, "whose": [266, 314, 349, 395], "wide": [17, 302, 303, 312, 319, 320, 364, 365, 366, 372, 376, 401, 402, 423, 432], "wide_resnet101_2": 17, "wide_resnet50_2": 17, "width": [15, 42, 256, 257, 266, 399, 400, 404, 406, 423], "wiki": 298, "wikitext": [304, 425], "window": [264, 302, 308, 309, 320, 327, 328, 329, 386, 420, 429], "window_s": 264, "wino": 288, "winogrand": 426, "wip": [304, 349, 403], "wisdom": 361, "wise": [361, 398, 413, 420, 428, 432, 437], "wish": [0, 415], "witch": 320, "within": [24, 25, 266, 298, 309, 358, 369, 372, 404, 428, 432], "without": [24, 25, 50, 255, 298, 303, 314, 319, 327, 328, 329, 335, 349, 354, 372, 374, 380, 382, 387, 388, 406, 409, 410, 413, 420], "wm": 402, "wmt16": 304, "wn": 402, "woman": 361, "won": [314, 349], "wondrou": 361, "woq": 421, "woq_config": 432, "woq_linear": 421, "woq_model": 432, "word": [23, 36, 44, 247, 266, 304, 319, 369, 372, 373, 409], "word_embed": [150, 388], "wordembed": 242, "work": [25, 259, 271, 317, 318, 347, 353, 359, 360, 363, 374, 377, 396, 401, 418, 425, 427], "work_spac": 281, "workaround": 409, "workdir": 398, "worker": [314, 349], "workflow": [272, 299, 302, 303, 390, 392, 407], "workload": [361, 402, 407], "workshop": 322, "workspac": [322, 364, 365, 390], "workstat": 361, "world": [28, 361, 372, 377, 385], "world_siz": [1, 252, 326, 330, 346, 349, 352], "wors": 432, "worst": 399, "worth": 408, "would": [24, 32, 57, 266, 347, 352, 361, 365, 372, 387, 391, 392, 395, 396, 410, 428], "wrap": 25, "wrapper": [2, 3, 13, 14, 400], "write": [25, 57, 355, 387, 395, 405, 406, 408], "write_back_scal": 405, "write_row_and_zero": 403, "write_tile_to_dst": 405, "write_tile_to_tmp_buf": 405, "written": [24, 313, 316, 349, 369, 382, 391], "wrong": [361, 387, 395], "ws2": 361, "www": [25, 319, 397, 411, 420, 425, 426], "x": [24, 25, 30, 36, 37, 44, 256, 257, 309, 313, 316, 317, 318, 361, 363, 367, 390, 401, 404, 405, 407, 408, 413, 423, 432], "x0": 263, "x1": 263, "x16": 413, "x86": [327, 328, 329, 421], "x86_64": [323, 324, 326, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 357, 363, 368, 383, 384, 386], "xbyak": 401, "xdi": 410, "xed3": 410, "xed64": 410, "xeon": [4, 272, 302, 304, 308, 309, 310, 311, 316, 318, 319, 320, 321, 323, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 336, 337, 338, 339, 340, 342, 343, 344, 354, 360, 361, 363, 364, 365, 366, 367, 369, 370, 374, 375, 399, 408, 411, 415, 420, 423, 425, 426], "xeonplatinum": 397, "xigui": 301, "xiguiwang": 361, "xin3h": 299, "xk": 399, "xl": [314, 349, 372], "xl_peft_finetuned_model": [314, 349], "xlnet": 304, "xlsr": 353, "xlsx": [319, 372, 389], "xpu": [309, 320, 375, 432], "xsum": 304, "xuehaosun": 299, "xxx": [302, 309, 314, 319, 324, 334, 349, 371, 372], "xxxxx_sampl": 355, "xxxxxx": 352, "xyxi": 263, "y": [20, 57, 308, 309, 322, 323, 324, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 354, 357, 358, 363, 368, 401, 426, 432], "y0": 263, "y1": 263, "yac": 366, "yaml": [49, 55, 57, 246, 309, 313, 316, 317, 318, 319, 351, 360, 367, 375, 389, 390, 392, 396, 412], "yann": 419, "ye": 425, "year": [415, 420], "yet": 349, "yi": [281, 301], "yield": [319, 423], "ymal": 55, "you": [0, 24, 25, 28, 32, 33, 36, 44, 57, 247, 259, 269, 270, 289, 300, 302, 303, 305, 308, 309, 313, 314, 315, 316, 317, 318, 321, 322, 323, 324, 326, 327, 328, 329, 330, 331, 332, 333, 334, 335, 336, 337, 338, 339, 340, 342, 343, 344, 345, 346, 347, 348, 349, 350, 351, 352, 355, 356, 357, 358, 361, 363, 364, 365, 366, 367, 368, 369, 370, 371, 372, 375, 376, 377, 378, 380, 383, 384, 385, 387, 388, 390, 391, 392, 395, 396, 400, 401, 403, 410, 412, 413, 415, 416, 418, 419, 423, 424, 426, 427, 428, 432], "you_repo_path": [314, 315], "you_work_dir": 387, "young": 361, "youngjoo": 432, "your": [1, 57, 246, 247, 270, 272, 300, 302, 306, 308, 309, 313, 314, 315, 316, 317, 318, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 335, 336, 337, 338, 340, 341, 343, 344, 346, 350, 351, 355, 358, 363, 364, 365, 366, 369, 370, 371, 372, 376, 378, 380, 381, 385, 387, 388, 389, 390, 391, 400, 401, 413, 415, 418, 420, 432], "your_branch": [314, 315], "your_env_nam": [323, 330], "your_ip": [317, 363], "your_kernel_log": 401, "your_port": [317, 363, 366], "your_pytorch_model_path_or_hf_model_nam": 432, "your_saved_model_dir": 432, "your_training_script": 1, "yourself": [387, 395], "youtub": 420, "yum": 308, "z": 303, "zaker": 420, "zero": [47, 73, 340, 349, 350, 401, 402, 404, 405, 409, 421], "zero2": 347, "zero_point": 247, "zero_tileconfig_start": 403, "zero_upper_row": 403, "zeroextend16": 409, "zeropoint": 423, "zeropointc": 281, "zeroth": 25, "zh": [353, 369], "zhang": [301, 415, 432], "zhenwei": 299, "zmm": [400, 401, 404, 406, 409], "zmm0": 410, "zmm1": 410, "zmm10": 410, "zmm12": 410, "zmm13": 410, "zmm14": 410, "zmm16": 410, "zmm17": 410, "zmm18": 410, "zmm2": 410, "zmm31": 410, "zmm4": 410, "zmm5": 410, "zmm6": 410, "zmm8": 410, "zmm9": 410, "zmm_byte_s": 400, "zmm_mock1": 401, "zmm_src": 401, "zmm_src1": 400, "zmmword": 410, "zoom": [335, 380], "zp": [281, 400, 421], "zp0": 281, "zp_dst": 281, "\u017cyczy\u0144ski": 301, "\u03b1x": 401, "\u03b2": 401, "\u3053\u3093\u306b\u3061\u306f": 369, "\u6b22\u8fce\u6765\u5230\u82f1\u7279\u5c14": [340, 369], "\u89e3\u51b3\u65b9\u6848\u4e3a\u6700\u65b0meta": 420}, "titles": ["conversation", "gaudi_spawn", "intel_extension_for_transformers.langchain.langchain_community.retrievers.child_parent_retriever", "intel_extension_for_transformers.langchain.langchain_community.vectorstores.chroma", "intel_extension_for_transformers.neural_chat.chatbot", "intel_extension_for_transformers.neural_chat.config", "intel_extension_for_transformers.neural_chat.config_logging", "intel_extension_for_transformers.neural_chat.errorcode", "intel_extension_for_transformers.neural_chat.pipeline", "intel_extension_for_transformers.neural_chat.pipeline.plugins.image2image.instructpix2pix_pipeline", "intel_extension_for_transformers.neural_chat.pipeline.plugins.memory.memory", "intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.detector.intent_detection", "intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.detector.query_explainer", "intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.parser.parser", "intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.retriever_adapter", "intel_extension_for_transformers.neural_chat.pipeline.plugins.security.safety_checker", "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.bfm", "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks", "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util", "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.load_mats", "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.preprocess", "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.util", "intel_extension_for_transformers.neural_chat.server.restful.openai_protocol", "intel_extension_for_transformers.neural_chat.tools.rome.repr_tools", "intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook", "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats", "intel_extension_for_transformers.tools.utils", "intel_extension_for_transformers.transformers.benchmark", "intel_extension_for_transformers.transformers.config", "intel_extension_for_transformers.transformers.dynamic.drop_and_restore_utils", "intel_extension_for_transformers.transformers.dynamic.evolution", "intel_extension_for_transformers.transformers.dynamic", "intel_extension_for_transformers.transformers.kv_cache_compression.models.modeling_llama", "intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode", "intel_extension_for_transformers.transformers.modeling", "intel_extension_for_transformers.transformers.modeling.model", "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic", "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.bart.modeling_bart", "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.llama.pos_shift_llama", "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mistral.modeling_mistral", "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral.modeling_mixtral", "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.phi.modeling_phi", "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.swin.modeling_swin", "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.streaming_llm", "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic", "intel_extension_for_transformers.transformers.pipeline", "intel_extension_for_transformers.transformers.pruner", "intel_extension_for_transformers.transformers.pruner.pruning", "intel_extension_for_transformers.transformers.quantization", "intel_extension_for_transformers.transformers.runtime.compile.compile", "intel_extension_for_transformers.transformers.runtime.compile.extractors.extractor", "intel_extension_for_transformers.transformers.runtime.compile.extractors", "intel_extension_for_transformers.transformers.runtime.compile.extractors.onnx_extractor", "intel_extension_for_transformers.transformers.runtime.compile.extractors.tf_extractor", "intel_extension_for_transformers.transformers.runtime.compile.extractors.torch_extractor", "intel_extension_for_transformers.transformers.runtime.compile.graph.graph", "intel_extension_for_transformers.transformers.runtime.compile.graph", "intel_extension_for_transformers.transformers.runtime.compile.graph_utils", "intel_extension_for_transformers.transformers.runtime.compile", "intel_extension_for_transformers.transformers.runtime.compile.loaders", "intel_extension_for_transformers.transformers.runtime.compile.loaders.loader", "intel_extension_for_transformers.transformers.runtime.compile.logger", "intel_extension_for_transformers.transformers.runtime.compile.onnx_utils", "intel_extension_for_transformers.transformers.runtime.compile.ops.all", "intel_extension_for_transformers.transformers.runtime.compile.ops.assert", "intel_extension_for_transformers.transformers.runtime.compile.ops.baddbmm", "intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul", "intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul_v2", "intel_extension_for_transformers.transformers.runtime.compile.ops.bias_add", "intel_extension_for_transformers.transformers.runtime.compile.ops.cast", "intel_extension_for_transformers.transformers.runtime.compile.ops.concat", "intel_extension_for_transformers.transformers.runtime.compile.ops.conv", "intel_extension_for_transformers.transformers.runtime.compile.ops.cos", "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops", "intel_extension_for_transformers.transformers.runtime.compile.ops.expand_dims", "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_batch_matmul_v2", "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_batch_norm_v3", "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_gemm", "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_matmul", "intel_extension_for_transformers.transformers.runtime.compile.ops.gather", "intel_extension_for_transformers.transformers.runtime.compile.ops.gather_elements", "intel_extension_for_transformers.transformers.runtime.compile.ops.gelu", "intel_extension_for_transformers.transformers.runtime.compile.ops.gemm", "intel_extension_for_transformers.transformers.runtime.compile.ops", "intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_get_next", "intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_v2", "intel_extension_for_transformers.transformers.runtime.compile.ops.layer_normalization", "intel_extension_for_transformers.transformers.runtime.compile.ops.log_softmax", "intel_extension_for_transformers.transformers.runtime.compile.ops.map_and_batch_dataset", "intel_extension_for_transformers.transformers.runtime.compile.ops.matmul", "intel_extension_for_transformers.transformers.runtime.compile.ops.mean", "intel_extension_for_transformers.transformers.runtime.compile.ops.mkl_layer_norm", "intel_extension_for_transformers.transformers.runtime.compile.ops.model_dataset", "intel_extension_for_transformers.transformers.runtime.compile.ops.one_hot", "intel_extension_for_transformers.transformers.runtime.compile.ops.onnx_input", "intel_extension_for_transformers.transformers.runtime.compile.ops.op", "intel_extension_for_transformers.transformers.runtime.compile.ops.optimize_dataset", "intel_extension_for_transformers.transformers.runtime.compile.ops.pack", "intel_extension_for_transformers.transformers.runtime.compile.ops.padding_sequence", "intel_extension_for_transformers.transformers.runtime.compile.ops.placeholder", "intel_extension_for_transformers.transformers.runtime.compile.ops.pos_embed", "intel_extension_for_transformers.transformers.runtime.compile.ops.pow", "intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_linear", "intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_v2", "intel_extension_for_transformers.transformers.runtime.compile.ops.quantized_fused_matmul_and_dequantize", "intel_extension_for_transformers.transformers.runtime.compile.ops.quantized_matmul_with_bias_and_dequantize", "intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_mean", "intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_sum", "intel_extension_for_transformers.transformers.runtime.compile.ops.reorder", "intel_extension_for_transformers.transformers.runtime.compile.ops.reshape", "intel_extension_for_transformers.transformers.runtime.compile.ops.resize", "intel_extension_for_transformers.transformers.runtime.compile.ops.rsub", "intel_extension_for_transformers.transformers.runtime.compile.ops.scatter_elements", "intel_extension_for_transformers.transformers.runtime.compile.ops.shape", "intel_extension_for_transformers.transformers.runtime.compile.ops.sin", "intel_extension_for_transformers.transformers.runtime.compile.ops.size", "intel_extension_for_transformers.transformers.runtime.compile.ops.slice_position_ids", "intel_extension_for_transformers.transformers.runtime.compile.ops.softmax", "intel_extension_for_transformers.transformers.runtime.compile.ops.split", "intel_extension_for_transformers.transformers.runtime.compile.ops.squeeze", "intel_extension_for_transformers.transformers.runtime.compile.ops.strided_slice", "intel_extension_for_transformers.transformers.runtime.compile.ops.tensor", "intel_extension_for_transformers.transformers.runtime.compile.ops.top_k", "intel_extension_for_transformers.transformers.runtime.compile.ops.transpose", "intel_extension_for_transformers.transformers.runtime.compile.ops.unpack", "intel_extension_for_transformers.transformers.runtime.compile.ops.unsqueeze", "intel_extension_for_transformers.transformers.runtime.compile.ops.view", "intel_extension_for_transformers.transformers.runtime.compile.ops.where", "intel_extension_for_transformers.transformers.runtime.compile.optimizer", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.InnerproductReshapeFusion", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.add_cls_token", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.add_embeddings", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.arangewithreciprocal", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_AttentionMaskAddReshape", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_ConstantOfShapeWithMul", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_QKVPreReshape", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_QKVReshape", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_WeightReshapeTo4D", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_mask_length_adaptive_keep_indices", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_output_layer_norm_length_adaptive_keep_indices", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_reshape", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.cast_to", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.collect_quant_info", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.conv_reshape", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.decoder_attn_reshape", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.einsumwitharange", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.embeddingbag", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.embeddings_to_2d_before_inner_product", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.gelu", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.generate_sequence", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithbiasgelu", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithslice", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithswish", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.input_data", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.input_file", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.insert_bf16_node", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.insert_quant_node", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.int8_bf16_mixed_precision_checker", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.interact_features", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.last_layer_shape", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm_with_reduce_mean", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm_with_transpose", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_embeding", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_matmulwithtranspose", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_postprocess", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_rotary_pos_emb", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.lower_all_tuples", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_add", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_gelu", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_relu", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_sigmoid", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_tanh", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_unsqueeze", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_transpose", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_transpose_scale_add", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.merged_embeddingbag", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.neox_reorder_change", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.neox_rotary_pos_emb", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.operator_adaptor", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.output_data", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.padding_sequence", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.pattern", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.position_embeddings", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.position_embeddings_v1", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.qkv_merge", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.qkv_reshape", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quant_gather_to_bf16", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quantize_fusion", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quantized_graph_dtype_refactor", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_constant_op", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_last_view", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_range", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_unused_operator", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_zeros", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.removeslice", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_after_restore_hidden_states", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_before_and_after_attention_out_layer_norm_gather_elements", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_before_restore_hidden_states", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_fusion", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.restore_hidden_states_in_length_adaptive_update_indices", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.rms_norm", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.rotary_pos_emb", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.slicemask", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_ExplicitNHWCTranspose", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_ExplicitNHWCTransposeQAT", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_MHAReshape", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_QuantizeFusion", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_ReshapeFusion", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_bf16Convert", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_collectQDQInfo", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_insertQuantNode", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.start_end_logits", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.subgraph_matcher", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncdoer_word_embedding", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_AttentionMaskAddReshape", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_AttentionReshape", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_KVReshape", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_MulReshape", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_QReshape", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_SoftmaxReshape", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_causal_attention_mask", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.token_type_embeddings", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.token_type_embeddings_v1", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_embedding", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_ip_insert_bias", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_unpack_baddbmm", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torchinsertbf16node", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torchpaddingsquence", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_AttentionMaskAddReshape", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_ConstantOfShapeWithMul", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_FFNSlice", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_FFNSlice_1", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_QKVPreReshape", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_QKVReshape", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_QKVReshape4D", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_encoderHiddenStatesReshape", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_getSampleBatch", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_sampleSlice", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transpose_batch_matmul", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.word_embeddings", "intel_extension_for_transformers.transformers.runtime.compile.tf_utils", "intel_extension_for_transformers.transformers.runtime.compile.torch_utils", "intel_extension_for_transformers.transformers.runtime", "intel_extension_for_transformers.transformers.trainer", "intel_extension_for_transformers.transformers.utils.config", "intel_extension_for_transformers.transformers.utils.get_throughput", "intel_extension_for_transformers.transformers.utils", "intel_extension_for_transformers.transformers.utils.metrics", "intel_extension_for_transformers.transformers.utils.objectives", "intel_extension_for_transformers.transformers.utils.utility", "main_eval_only", "main_parse_and_eval", "models.backbone", "models.detr", "models.detr_multi", "models.matcher", "models.position_encoding", "models.segmentation", "models.transformer", "text", "util.box_ops", "util.misc", "util.plot_utils", "util.postprocess", "utils.data_utils", "utils.eval_utils", "CI Introduction", "Documentation Overview and Installation", "OpenSSF Badge", "Intel\u00ae Extension for Transformers: Accelerating Transformer-based Models on Intel Platforms", "API", "Python APIs", "Compile", "Graph", "Engine API", "Class engine", "Class Kernel", "Class operator_desc", "Operator Specific Types", "Kernel APIs", "Config", "Model", "Trainer", "User-facing API", "Architecture of Intel\u00ae Extension for Transformers", "1. Accuracies $\\uparrow$ across 11 tasks(0-shot) of LLaMA and Mistral models at W4G-1.", "Benchmark", "Example", "Features", "Welcome to Intel\u00ae Extension for Transformers\u2019 documentation!", "Kernels", "Implementation Details", "Performance", "Neural Engine", "User Guide", "Contributor Covenant Code of Conduct", "Module Owner Matrix", "Contribution Guidelines", "<no title>", "Intel\u00ae Extension for Transformers", "Distillation", "Examples", "Export to ONNX", "Getting Started", "H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models", "Installation", "NeuralChat", "<no title>", "NeuralChat Command Line", "Intel Neural Chat Dockerfile", "Start NeuralChat and Code Generation Service with Docker", "Prerequisite\u200b", "Do chatbot inference with Docker", "Start NeuralChat Text Generation Service with Docker", "Start NeuralChat and TGI serving with Docker", "Start NeuralChat and vLLM serving with Docker", "Plugins", "NeuralChat Notebooks", "Building RESTful API Server", "QuickStart: Intel\u00ae Extension For Transformers*: NeuralChat on 4th Generation Intel\u00ae Xeon\u00ae Scalable Processors", "Setup Conda", "Setup Conda", "<no title>", "Setup Conda", "Setup Conda", "Setup Conda", "Setup Conda", "Setup Conda", "Setup Conda", "Setup Conda", "<no title>", "Setup Environment", "<no title>", "Introduction", "Introduction", "Introduction", "<no title>", "Setup Conda", "\ud83d\udcf8 Project Screenshots", "<no title>", "Setup Conda", "Setup Conda", "Deploy on Huggingface Space", "Direct Preference Optimization (DPO)", "How to train Intel/neural-chat-7b-v3-1 on Intel Gaudi2", "NeuralChat Fine-tuning", "NeuralChat Fine-tuning", "Multi-Modal", "Evaluation Guidelines", "Reinforcement Learning from Human Feedback (RLHF)", "Shanghainese ASR (Audio-Speech-Recognition) and TTS (Text-To-Speech) finetuning/inference", "GPT-J fine-tuning and inference", "Voice Cloning by finetuning a Text-To-Speech (TTS) model", "Installation", "Introduction", "Introduction", "Extract Tables From PDF File", "Face Animation", "Build Your Chatbot with Intel\u00ae Extension for Transformers neural-chat", "Build RAG (retriveval augment generation) example with Intel\u00ae Extension for Transformers neural-chat on Intel GPU", "Introduction", "Serving NeuralChat Text Generation with Triton Inference Server", "Serving NeuralChat Text Generation with Triton Inference Server (CUDA)", "Serving NeuralChat Text Generation with Triton Inference Server on HPU", "vllm serving for NeuralChat", "Setup Environment", "Install System Dependency", "\ud83d\ude80 What is caching plugin?", "\ud83c\udfe0Introduction", "Introduction", "Introduction", "Face Animation", "NeuralChat Server Command Line", "Finetune Embedding Model on Task-Specific Datasets", "Prerequisite\u200b", "\ud83d\udd21 TextBot", "\ud83d\udcf8 Project Screenshots", "<no title>", "\ud83d\udcf8 Project Screenshots", "\ud83d\udcf8 Project Screenshots", "Deploy on Huggingface Space", "Deploy on Huggingface Space", "LLM Carbon Calculator", "Installation", "Add Customized Pattern", "Deploy and Integration", "Profiling", "Engine Tuning", "Graph Fusion", "Compile an ONNX model to Engine IR", "Quantize a ONNX model to engine low precision/int8 IR", "Customized Operators Register", "Pattern Recognize", "Static Compressed Buffer", "Neural Engine Support Matrix", "Transformers-Accelerated Libraries", "3D Inference", "Binary Injectors", "Element-wise Injector", "Introduction", "Sparse GEMM AMX", "Sparse GEMM AVX512F", "Dynamic Quant Matmul", "Sparse GEMM with Layer-Normalize", "Transposed MatMul", "Transposed MHA", "Sparse GEMM VNNI", "Performance and Profiling", "Validated Performance Data", "How to visualize weights distribution of sparse model", "Benchmark for Kernels", "Inputs format", "Legal Information", "Metrics", "Objective", "Pipeline", "Pruning", "Full Publications/Events (50)", "QBits", "QLoRA on CPU", "Quantization", "Release", "Validated Model Performance", "Efficient LLM Inference on CPUs", "Step-by-Step", "Smooth Quant", "Streaming LLM", "Tutorials", "User Guide", "Weight Only Quantization (WOQ)", "Example", "Features", "Welcome to Intel\u00ae Extension for Transformers\u2019 documentation!", "Kernels", "Implementation Details", "Performance", "Neural Engine", "User Guide"], "titleterms": {"": [377, 400, 401], "0": 288, "04": 308, "1": [288, 302, 314, 315, 346, 347, 348, 349, 352, 353, 354, 361, 362, 376, 377, 388, 389, 393, 394, 412, 420, 425, 427], "11": 288, "14": 420, "2": [288, 302, 314, 315, 346, 348, 349, 352, 353, 354, 361, 362, 376, 377, 388, 393, 394, 412, 427], "20": 308, "2021": 420, "2022": 420, "2023": 420, "2024": 420, "20b": 425, "22": 308, "3": [288, 302, 314, 315, 346, 348, 349, 352, 353, 354, 361, 376, 377, 388, 412, 427], "34": 420, "3b": 425, "3d": 399, "4": [288, 302, 314, 346, 352, 376, 377, 388, 411], "4th": 322, "5": [352, 376, 420], "50": 420, "6": 376, "6b": 425, "7b": [347, 349, 425], "8": 308, "A": 316, "AND": 432, "For": [305, 322, 349, 413, 432], "On": [314, 315, 409], "To": [302, 353, 355, 393], "acceler": [272, 306, 358, 398, 402], "accept": 300, "access": [309, 321, 375], "accuraci": [288, 393, 423, 426, 427], "acknowledg": [353, 359, 360, 374], "across": 288, "activ": [322, 409], "adapt": [304, 306], "add": [349, 387, 394], "add_cls_token": 130, "add_embed": 131, "addit": 321, "advanc": 319, "after": [377, 387], "ai": 378, "algorithm": 432, "all": 63, "alpha": 401, "amp": 319, "amx": 403, "an": [303, 392, 419, 423], "analysi": 412, "anim": [360, 374], "api": [273, 274, 277, 282, 286, 289, 305, 309, 321, 389, 398, 422], "applic": [345, 383, 384], "approach": 423, "arangewithreciproc": 132, "arc": 349, "architectur": [287, 346, 388], "argument": [348, 349], "askdoc": 338, "asr": [353, 369], "assert": 64, "assisted_gen": 323, "attent": 413, "attention_mask_length_adaptive_keep_indic": 138, "attention_output_layer_norm_length_adaptive_keep_indic": 139, "attention_reshap": 140, "attentionblock_attentionmaskaddreshap": 133, "attentionblock_constantofshapewithmul": 134, "attentionblock_qkvprereshap": 135, "attentionblock_qkvreshap": 136, "attentionblock_weightreshapeto4d": 137, "attribut": [243, 298, 387], "audio": [336, 340, 353], "augment": 362, "automat": [319, 369], "autoround": 432, "avx512f": 404, "aw": 349, "awar": 423, "backbon": 255, "backend": [305, 366, 388, 418], "baddbmm": 65, "badg": 271, "bare": [348, 349, 386], "baremet": 322, "bart": 37, "base": [272, 348, 425], "baselin": 426, "batch_matmul": 66, "batch_matmul_v2": 67, "befor": [377, 389], "beforehand": 407, "below": 412, "benchmark": [27, 289, 393, 413], "best": 390, "beta": 401, "between": [330, 332], "bf16": [305, 371], "bfm": 16, "bias_add": 68, "binari": [386, 388, 400], "bot": 378, "box_op": 263, "brief": 403, "buffer": 396, "build": [314, 315, 321, 327, 328, 329, 348, 349, 356, 358, 361, 362, 372, 388, 398, 413], "c": 389, "cach": [319, 370, 399], "calcul": [385, 408], "call": [336, 337], "can": [370, 389], "candid": 409, "carbon": 385, "card": [349, 365], "cast": 69, "cast_to": 141, "cento": 308, "chain": 395, "chat": [311, 312, 316, 347, 361, 362, 375, 422], "chatbot": [4, 315, 319, 323, 326, 327, 328, 329, 330, 331, 332, 356, 358, 361, 372], "check": [345, 365, 383, 384], "checker": 319, "checklist": 300, "child_parent_retriev": 2, "childparentretriev": 372, "chroma": [3, 372], "ci": [269, 300], "citat": 415, "class": [0, 2, 5, 9, 14, 22, 24, 25, 28, 30, 32, 33, 35, 36, 37, 40, 44, 47, 50, 52, 53, 54, 55, 57, 60, 61, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 76, 77, 78, 79, 80, 81, 82, 84, 85, 86, 87, 88, 89, 90, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 246, 247, 250, 251, 255, 256, 257, 258, 259, 260, 264, 278, 279, 280, 416], "client": [309, 361, 364, 365, 366, 375], "clone": [314, 315, 355], "co": 72, "code": [298, 300, 313, 319, 323, 326, 327, 328, 329, 330, 331, 332, 358], "codegen": [326, 327, 328, 329, 330, 331, 332], "codellama": 349, "collect_quant_info": 142, "command": [311, 361, 362, 375, 412], "compat": [309, 321, 340, 425], "compil": [49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 275, 337, 392], "complet": 358, "compress": [302, 396], "comput": 406, "concat": 70, "conda": [302, 322, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 357, 363, 368, 383, 384], "conduct": [298, 300], "config": [5, 28, 247, 283, 358, 372, 387, 390], "config_log": 6, "configur": [313, 316, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 338, 340, 343, 344, 345, 363, 383, 384, 397, 411], "connect": [361, 391], "constrain": 421, "construct": [376, 391], "consum": [313, 316, 317, 318, 363], "contain": [314, 315, 366], "content": [0, 1, 2, 4, 5, 6, 9, 14, 15, 17, 20, 21, 22, 23, 24, 25, 27, 28, 29, 30, 32, 33, 35, 36, 37, 39, 40, 41, 42, 44, 45, 47, 49, 50, 52, 53, 54, 55, 57, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 76, 77, 78, 79, 80, 81, 82, 84, 85, 86, 87, 88, 89, 90, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, 246, 247, 250, 251, 252, 255, 256, 257, 258, 259, 260, 262, 263, 264, 265, 266, 267, 268, 434, 439], "contribut": [270, 300], "contributor": [298, 300], "conv": 71, "conv_reshap": 143, "convers": 0, "coven": [298, 300], "cpp": [327, 328, 329, 394], "cpu": [346, 361, 422, 426, 432], "creat": [303, 314, 315, 322, 334, 345, 354, 366, 383, 384, 391, 419, 423], "criteria": 300, "criterion": 303, "csv": 389, "cuda": [352, 365, 432], "curl": [309, 321], "custom": [309, 340, 349, 387, 388, 394], "data": [350, 353, 355, 370, 376, 404, 411], "data_util": 267, "databas": 334, "dataset": [302, 314, 346, 348, 349, 352, 354, 376, 393], "decoder_attn_reshap": 144, "demo": 353, "dens": [304, 402], "depend": [323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 357, 361, 362, 363, 368, 369, 371, 383, 384, 432], "deploi": [309, 345, 360, 367, 383, 384, 386, 388], "deploy": 304, "descript": [405, 406, 408], "design": 393, "detail": [294, 405, 408, 437], "detector": [11, 12], "detr": 256, "detr_multi": 257, "develop": [400, 401, 413], "devic": 432, "dict": 391, "differ": 405, "diffus": [337, 425], "direct": [346, 347, 406], "dispatch": 390, "distil": [303, 304, 306], "distillationconfig": 303, "distribut": [349, 412], "dl1": [314, 349], "do": [315, 353], "docker": [313, 314, 315, 316, 317, 318, 319, 322, 348, 349, 366], "dockerfil": 312, "document": [270, 292, 313, 316, 435], "doe": 370, "dolli": 425, "download": [338, 353, 354, 363], "dpo": [346, 347], "drop_and_restore_util": 29, "duplic": 395, "dynam": [29, 30, 31, 405, 423], "dynamic_qu": 413, "dynamic_quant_matmul": 413, "each": 395, "earli": 304, "edit": 377, "effici": [307, 402, 426], "eiffel": 377, "einsumwitharang": 145, "electra": 425, "element": 401, "eltwiseop": 413, "embed": 376, "embeddingbag": 146, "embeddings_to_2d_before_inner_product": 147, "empty_op": 73, "endpoint": 340, "enforc": 298, "engin": [277, 278, 296, 304, 306, 386, 388, 390, 392, 393, 397, 439], "engine_profil": 389, "english": 369, "environ": [302, 308, 313, 314, 315, 316, 317, 318, 322, 334, 336, 337, 338, 346, 347, 348, 349, 352, 353, 354, 357, 359, 360, 361, 362, 363, 367, 368, 374, 377, 393, 426], "errorcod": 7, "establish": 391, "eval_util": 268, "evalu": [346, 349, 350, 351, 376], "event": [272, 420], "evolut": 30, "exampl": [289, 290, 304, 305, 316, 362, 376, 385, 389, 392, 413, 417, 418, 421, 422, 428, 429, 432, 433], "except": 24, "executor": [305, 394, 418], "exist": [348, 349], "exit": 304, "expand_dim": 74, "expect": 302, "export": 305, "extens": [272, 287, 292, 302, 304, 308, 309, 322, 327, 328, 329, 357, 361, 362, 368, 372, 435], "extract": 359, "extractor": [50, 51, 52, 53, 54], "face": [286, 360, 374], "face3d": [16, 17, 18, 19, 20, 21], "face_anim": [16, 17, 18, 19, 20, 21], "falcon": [349, 425], "faq": [269, 300], "featur": [291, 400, 401, 423, 434], "feedback": 352, "file": [313, 316, 351, 359], "fine": [314, 319, 347, 348, 349, 352, 354], "finetun": [314, 348, 349, 353, 355, 376, 425], "flan": 349, "fly": 409, "folder": 351, "format": [392, 404, 414], "fp32": [305, 371, 426, 427], "framework": [363, 400, 401, 428], "from": [302, 308, 314, 315, 322, 348, 349, 352, 359, 386], "frontend": [345, 383, 384], "full": 420, "function": [0, 1, 4, 6, 15, 17, 20, 21, 23, 24, 25, 27, 28, 29, 30, 32, 36, 37, 39, 40, 41, 42, 44, 45, 49, 57, 61, 62, 95, 184, 243, 244, 245, 252, 260, 262, 263, 264, 265, 266, 267, 268, 270], "fundament": 423, "fuse": 387, "fused_batch_matmul_v2": 75, "fused_batch_norm_v3": 76, "fused_gemm": 77, "fused_matmul": 78, "fusion": [387, 391], "gather": 79, "gather_el": 80, "gaudi": [314, 315], "gaudi2": [347, 350], "gaudi_spawn": 1, "gelu": [81, 148], "gemm": [82, 403, 404, 406, 409], "gener": [300, 307, 313, 316, 319, 322, 323, 326, 327, 328, 329, 330, 331, 332, 362, 364, 365, 366, 388], "generate_sequ": 149, "get": [289, 302, 306, 309, 322, 354, 367, 389, 393, 416, 423], "get_throughput": 248, "ggml": 425, "git": 348, "gpt": [354, 425], "gpt_bigcod": 33, "gpu": [314, 315, 318, 346, 349, 361, 362, 432], "graph": [55, 56, 276, 388, 390, 391], "graph_util": 57, "guid": [297, 431, 440], "guidelin": [300, 351], "h": 394, "h2o": 307, "habana": [314, 315, 346, 349, 352], "hard": 376, "hardwar": [302, 308], "heavi": 307, "help": [311, 370, 375], "hf": 349, "hitter": 307, "hostfil": [330, 332], "how": [302, 347, 370, 390, 396, 412], "hpp": [400, 401], "hpu": 366, "hub": [314, 315], "huggingfac": [345, 383, 384], "human": 352, "i": [361, 365, 370], "imag": [314, 315, 316, 334, 348, 349, 366], "image2imag": [9, 337], "implement": [294, 437], "import": [356, 358, 372], "inbound": 349, "includ": 394, "infer": [270, 302, 307, 315, 319, 353, 354, 355, 364, 365, 366, 371, 388, 399, 418, 425, 426, 427], "inform": [391, 415], "initi": 370, "injector": [400, 401], "innerproductreshapefus": 129, "innerproductwithbiasgelu": 151, "innerproductwithslic": 152, "innerproductwithswish": 153, "input": [340, 414], "input_data": 154, "input_fil": 155, "insert": 391, "insert_bf16_nod": 156, "insert_quant_nod": 157, "instal": [270, 302, 308, 309, 322, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 353, 354, 356, 357, 361, 362, 363, 368, 369, 370, 371, 383, 384, 386, 393, 398], "instanc": [303, 349, 356, 419, 423], "instruct": [349, 350, 404], "instructpix2pix_pipelin": 9, "int4": [371, 426, 427], "int8": [305, 371, 393, 418], "int8_bf16_mixed_precision_check": 158, "integr": 388, "intel": [272, 287, 292, 302, 304, 308, 312, 322, 327, 328, 329, 347, 349, 357, 358, 361, 362, 368, 432, 435], "intel_extension_for_transform": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, 246, 247, 248, 249, 250, 251, 252, 386], "intent_detect": 11, "interact": [356, 358, 372], "interact_featur": 159, "intermedi": 303, "introduct": [269, 289, 300, 303, 305, 307, 309, 336, 337, 338, 354, 357, 358, 363, 371, 372, 373, 376, 387, 389, 390, 391, 392, 395, 396, 398, 400, 401, 402, 403, 407, 412, 416, 417, 418, 419, 421, 422, 423, 428, 429, 432], "ipex": [289, 304], "ir": [392, 393], "isa": 403, "issu": 399, "iter": 389, "iterator_get_next": 84, "iterator_v2": 85, "itrex": [314, 315, 331, 332], "its": 393, "j": [354, 425], "jit": 405, "jit_binaryop_injector": 400, "jit_eltwise_injector": 401, "json": 389, "kei": [324, 404], "kernel": [279, 282, 293, 390, 398, 402, 405, 413, 436], "kingdom": 377, "knowledg": [303, 304, 377], "kv_cache_compress": 32, "langchain": [2, 3, 309, 372], "langchain_commun": [2, 3], "languag": [307, 369], "larg": 307, "last_layer_shap": 160, "launch": [309, 366], "layer": [303, 406], "layer_norm": [86, 161], "layer_norm_with_reduce_mean": 162, "layer_norm_with_transpos": 163, "layernorm": 406, "layernorm_ba": [406, 413], "layout": 399, "learn": [302, 352], "legal": [270, 415], "length": [304, 306], "level": 389, "librari": [309, 398], "licens": 415, "line": [311, 375], "list": [349, 350, 395], "llama": [38, 288, 349], "llama2": 349, "llama3": 432, "llama_embed": 164, "llama_matmulwithtranspos": 165, "llama_postprocess": 166, "llama_rotary_pos_emb": 167, "llava": 351, "llm": [319, 377, 385, 425, 426, 429], "load_mat": 19, "loader": [59, 60], "log_softmax": 87, "logger": 61, "loop": 404, "lora": 347, "low": 393, "lower_all_tupl": 168, "m7i": 349, "main": 395, "main_eval_onli": 253, "main_parse_and_ev": 254, "mandarian": 353, "manual": 388, "map": [387, 391], "map_and_batch_dataset": 88, "matcher": 258, "matmul": [89, 405, 406, 407], "matmul_avx512f_p2031_p2013": [407, 413], "matmul_noperm_p2031_p1302": 407, "matmul_p2031_2013": 407, "matmul_vnni_noperm_p2013_p1302": 407, "matmul_vnni_noperm_p2031_p1302": 413, "matmul_with_bia": 169, "matmul_with_bias_add": 170, "matmul_with_bias_gelu": 171, "matmul_with_bias_relu": 172, "matmul_with_bias_sigmoid": 173, "matmul_with_bias_tanh": 174, "matmul_with_bias_unsqueez": 175, "matmul_with_transpos": 176, "matmul_with_transpose_scale_add": 177, "matrix": [299, 305, 397, 398, 405, 417, 423, 428], "mean": [90, 401], "mechan": 390, "memori": [10, 399], "merg": 347, "merged_embeddingbag": 178, "meta": 349, "metal": [348, 349, 386], "metric": [250, 303, 349, 416, 419], "mha": [408, 413], "microsoft": 348, "mine": 376, "minist": 377, "misc": 264, "mistral": [39, 288, 349], "mix": 319, "mixtral": 40, "mkl_layer_norm": 91, "mmmu": 350, "modal": 350, "mode": [361, 362, 372, 425], "model": [16, 17, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 255, 256, 257, 258, 259, 260, 261, 272, 284, 288, 289, 302, 305, 307, 309, 334, 337, 338, 348, 349, 350, 352, 353, 355, 359, 360, 363, 376, 377, 388, 389, 392, 393, 412, 418, 425, 428], "model_dataset": 92, "modeling_bart": 37, "modeling_bert_dynam": 36, "modeling_gaudi": [37, 38, 39, 40, 41, 42, 43], "modeling_gpt_bigcod": 33, "modeling_llama": 32, "modeling_mistr": 39, "modeling_mixtr": 40, "modeling_phi": 41, "modeling_roberta_dynam": 44, "modeling_swin": 42, "modifi": [330, 332], "modul": [0, 1, 2, 4, 5, 6, 9, 14, 15, 17, 20, 21, 22, 23, 24, 25, 27, 28, 29, 30, 32, 33, 35, 36, 37, 39, 40, 41, 42, 44, 45, 47, 49, 50, 52, 53, 54, 55, 57, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 76, 77, 78, 79, 80, 81, 82, 84, 85, 86, 87, 88, 89, 90, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 246, 247, 250, 251, 252, 255, 256, 257, 258, 259, 260, 263, 264, 265, 266, 267, 268, 299, 356, 358, 372], "more": [302, 390, 396, 402], "mpt": [346, 349, 425], "mtl": 432, "multi": [314, 330, 332, 349, 350, 365, 369, 411], "multimod": [309, 319], "mysql": 334, "naiv": 402, "necessari": 391, "neg": 376, "neox": 425, "neox_reorder_chang": 179, "neox_rotary_pos_emb": 180, "nethook": 24, "network": 17, "neural": [296, 304, 306, 312, 347, 361, 362, 386, 388, 397, 422, 439], "neural_chat": [4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25], "neuralchat": [309, 311, 313, 316, 317, 318, 320, 322, 331, 332, 348, 349, 363, 364, 365, 366, 367, 375], "new": [345, 383, 384, 387, 391], "next": 302, "node": [314, 330, 332, 349, 354, 387, 391], "normal": 406, "note": 424, "notebook": 320, "numactl": [323, 324, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 357, 363, 368], "numanod": 332, "nvidia": [314, 315, 318], "object": [251, 417, 423], "obtain": 391, "offici": 321, "ok": 361, "old": 391, "one": [349, 405], "one_hot": 93, "onli": [319, 351, 389, 432], "onnx": [305, 388, 392, 393], "onnx_extractor": 52, "onnx_input": 94, "onnx_util": 62, "op": [63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 387, 390], "openai": [309, 321, 324, 340], "openai_protocol": 22, "openssf": 271, "oper": [281, 389, 394], "operator_adaptor": 181, "operator_desc": [280, 400, 401], "opt": 425, "optim": [128, 270, 319, 346, 347], "optimize_dataset": 96, "option": [303, 315, 348, 349, 365, 390, 396, 423], "oracl": 307, "orchestr": 304, "other": 270, "our": 298, "output": [289, 302, 340, 351, 377], "output_data": 182, "overview": [270, 302], "owner": 299, "pack": 97, "packag": [245, 262, 354, 432], "padding_sequ": [98, 183], "param_typ": [400, 401], "paramet": [371, 372], "pars": [351, 395], "parser": 13, "part": 389, "path": [314, 315, 405], "pattern": [184, 387, 390, 391, 395, 403, 404, 409], "pdf": 359, "per": 402, "perform": [295, 358, 397, 398, 410, 411, 425, 426, 438], "perspect": [400, 401], "phi": 41, "photo": 378, "photoai": 334, "pip": 386, "pipelin": [8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 45, 340, 418], "placehold": 99, "platform": [272, 361, 397, 411], "pleas": [314, 315], "pledg": 298, "plot_util": 265, "plugin": [9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 309, 319, 336, 337, 370, 371], "polici": 271, "port": 349, "pos_emb": 100, "pos_shift_llama": 38, "position_embed": 185, "position_embeddings_v1": 186, "position_encod": 259, "post": 423, "postprocess": 266, "pow": 101, "pre": [353, 406], "precis": [319, 393], "prefer": [346, 347, 352], "prefetch": 402, "prepar": [302, 313, 314, 316, 322, 337, 346, 347, 348, 349, 350, 352, 353, 354, 355, 359, 360, 364, 365, 366, 367, 374, 392, 393, 412, 426, 432], "preprocess": [20, 405], "prerequisit": [302, 308, 314, 348, 349, 361, 362, 377, 386, 393, 405, 427], "pretrain": 350, "prime": 377, "print": 351, "problem": [405, 406, 407, 408], "processor": 322, "profil": [389, 410], "project": [341, 378, 379, 381, 382], "prune": [47, 304, 306, 419], "pruner": [46, 47], "public": [272, 420], "pull": [300, 314, 315, 348, 349], "pypi": [302, 308], "python": [274, 309, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 357, 363, 368, 383, 384, 389, 422], "pytorch": [289, 303, 304, 421, 425], "q": 316, "qbit": 421, "qdrant": 372, "qkv_merg": 187, "qkv_reshap": 188, "qlora": 422, "quant": [405, 428], "quant_gather_to_bf16": 189, "quantiz": [48, 304, 306, 319, 393, 423, 425, 427, 432], "quantizationconfig": 423, "quantize_fus": 190, "quantize_linear": 102, "quantize_v2": 103, "quantized_fused_matmul_and_dequant": 104, "quantized_graph_dtype_refactor": 191, "quantized_matmul_with_bias_and_dequant": 105, "query_explain": 12, "quick": [340, 365], "quickstart": 322, "rag": [319, 362, 372], "ratio": 389, "recogn": 395, "recognit": [353, 369], "recommend": 302, "reduce_mean": 106, "reduce_sum": 107, "refer": [304, 346, 352, 398, 432], "regard": 377, "regist": [387, 394], "reinforc": 352, "relat": [348, 349, 353, 390], "releas": 424, "remov": [391, 395], "remove_constant_op": 192, "remove_last_view": 193, "remove_rang": 194, "remove_unused_oper": 195, "remove_zero": 196, "removeslic": 197, "reorder": [108, 403, 407, 408, 409], "repo": [314, 315], "report": 271, "repositori": 354, "repr_tool": 23, "represent": 395, "request": [300, 309, 361, 364, 365], "requir": [302, 308, 309, 353, 376], "reshap": 109, "reshape_after_restore_hidden_st": 198, "reshape_before_and_after_attention_out_layer_norm_gather_el": 199, "reshape_before_restore_hidden_st": 200, "reshape_fus": 201, "resiz": 110, "respons": 298, "rest": [22, 309, 321], "restore_hidden_states_in_length_adaptive_update_indic": 202, "result": [351, 367, 377, 395, 412], "retriev": [2, 11, 12, 13, 14, 309, 358, 362, 370, 372, 375], "retriever_adapt": 14, "retrivev": 362, "reward": 352, "rich": 309, "rlhf": 352, "rm": 352, "rms_norm": 203, "rome": [23, 24, 25], "rotary_pos_emb": 204, "rsub": 111, "rule": 349, "run": [302, 315, 322, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 351, 358, 359, 360, 361, 362, 363, 366, 383, 384, 388, 389, 393, 412, 426, 427], "runningstat": 25, "runtim": [49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, 319, 390, 425], "safeti": 319, "safety_check": 15, "same": 349, "sampl": 377, "scalabl": 322, "scale": 401, "scatter_el": 112, "scope": 298, "scratch": [348, 349], "screenshot": [341, 378, 379, 381, 382], "script": [303, 322, 359, 360, 364, 365, 366, 419, 423], "sde": 410, "sdk": 321, "search": 395, "section": [292, 435], "secur": [15, 271], "segment": 260, "select": 272, "send": [364, 365], "sentenc": 377, "serv": [317, 318, 364, 365, 366, 367], "server": [22, 321, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 349, 360, 361, 363, 364, 365, 366, 375, 383, 384], "servic": [309, 313, 316, 317, 318, 336, 337, 361, 363, 375], "session": 349, "set": [322, 358, 361, 372, 387, 389], "setup": [313, 315, 316, 317, 318, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 357, 361, 362, 363, 368, 383, 384, 412], "sft": [347, 352], "shanghaines": 353, "shape": 113, "shot": 288, "side": 361, "sidebysid": 378, "simpl": [313, 314, 315, 316], "simpli": 360, "sin": 114, "singl": [314, 332, 349, 354, 411], "size": [115, 405], "slice_position_id": 116, "slicemask": 205, "smooth": 428, "softmax": [117, 413], "softwar": [308, 354], "sourc": [302, 308, 322], "space": [345, 383, 384], "spars": [304, 389, 402, 403, 404, 406, 409, 412], "sparse_matmul": [398, 413], "specif": [281, 376], "speech": [353, 355, 369], "splice": 395, "split": 118, "spmm": 406, "spmm_amx_bf16_x16": 413, "spmm_avx512f": 413, "spmm_vnni": [399, 413], "spr": [313, 314, 315, 317, 346, 349, 358], "squeez": 119, "src": [16, 17, 18, 19, 20, 21, 394], "ssh": [330, 332, 349], "stabl": [337, 386, 425], "stablediffusion_bf16convert": 211, "stablediffusion_collectqdqinfo": 212, "stablediffusion_explicitnhwctranspos": 206, "stablediffusion_explicitnhwctransposeqat": 207, "stablediffusion_insertquantnod": 213, "stablediffusion_mhareshap": 208, "stablediffusion_quantizefus": 209, "stablediffusion_reshapefus": 210, "stage": 405, "standard": 298, "starcod": [349, 425], "start": [289, 302, 306, 309, 313, 316, 317, 318, 322, 350, 354, 361, 364, 365, 375, 416, 423], "start_end_logit": 214, "statement": 407, "static": [396, 413, 423], "step": [302, 426, 427], "stock": [289, 304], "store": [309, 372], "straight": 395, "stream": 429, "streaming_llm": 43, "strided_slic": 120, "structur": 351, "sub": 395, "sub_graph": [129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242], "subgraph_match": 215, "submodul": [18, 31, 34, 46, 51, 56, 58, 59, 83, 150, 249], "subpackag": [58, 245], "summari": [302, 316, 416, 426], "supervis": [347, 352], "support": [300, 302, 305, 309, 392, 397, 398, 416, 417, 423, 428, 432], "swin": 42, "system": [308, 309, 369, 426], "t5": 349, "tabl": [334, 359], "talk": 378, "task": [288, 376], "templat": 300, "tensor": 121, "tensorflow": 304, "test": [313, 314, 315, 316, 324, 340, 356, 357, 360, 361, 368, 398], "text": [262, 311, 316, 353, 355, 364, 365, 366, 369, 375], "textbot": [343, 344, 367, 378], "textchat": [324, 343, 344], "textencdoer_word_embed": 216, "textencoder_attentionmaskaddreshap": 217, "textencoder_attentionreshap": 218, "textencoder_causal_attention_mask": 223, "textencoder_kvreshap": 219, "textencoder_mulreshap": 220, "textencoder_qreshap": 221, "textencoder_softmaxreshap": 222, "tf": 388, "tf_extractor": 53, "tf_util": 243, "tgi": [317, 363], "thi": [314, 315, 370], "thread": [402, 411], "through": 388, "tile": 402, "token_type_embed": 224, "token_type_embeddings_v1": 225, "tool": [23, 24, 25, 26, 327, 328, 329], "top_k": 122, "topic": 319, "torch_embed": 226, "torch_extractor": 54, "torch_ip_insert_bia": 227, "torch_unpack_baddbmm": 228, "torch_util": 244, "torchinsertbf16nod": 229, "torchpaddingsqu": 230, "total": 389, "tower": 377, "trademark": 415, "train": [346, 347, 349, 350, 352, 376, 423], "trainer": [246, 285, 303, 419, 423], "transform": [27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, 246, 247, 248, 249, 250, 251, 252, 261, 272, 287, 292, 302, 304, 306, 308, 322, 327, 328, 329, 357, 361, 362, 368, 398, 435], "transformer2dmodel_attentionmaskaddreshap": 231, "transformer2dmodel_constantofshapewithmul": 232, "transformer2dmodel_encoderhiddenstatesreshap": 238, "transformer2dmodel_ffnslic": 233, "transformer2dmodel_ffnslice_1": 234, "transformer2dmodel_getsamplebatch": 239, "transformer2dmodel_qkvprereshap": 235, "transformer2dmodel_qkvreshap": 236, "transformer2dmodel_qkvreshape4d": 237, "transformer2dmodel_sampleslic": 240, "translat": 353, "transpos": [123, 407, 408], "transpose_batch_matmul": 241, "transpose_matmul": 413, "triton": [364, 365, 366], "tt": [353, 355, 369], "tune": [314, 319, 347, 348, 349, 350, 352, 354, 390, 393, 423], "turn": [390, 396], "tutori": 430, "two": 405, "type": [281, 387], "ubuntu": 308, "ui": 361, "unit": 377, "unpack": 124, "unsqueez": 125, "up": [322, 361, 365], "uparrow": 288, "us": [309, 314, 315, 321, 332, 364, 365, 388, 405], "usag": [303, 305, 307, 356, 358, 359, 360, 361, 362, 369, 370, 371, 372, 373, 374, 377, 385, 400, 401, 413, 419], "user": [286, 297, 398, 400, 401, 431, 440], "util": [18, 19, 20, 21, 24, 25, 26, 247, 248, 249, 250, 251, 252, 263, 264, 265, 266, 267, 268], "v2": 425, "v3": 347, "valid": [302, 308, 349, 350, 411, 425, 428], "variabl": 334, "vector": [309, 372], "vectorstor": 3, "vectorstoreretriev": 372, "verbos": 410, "verifi": [361, 376], "version": [386, 421], "video": [16, 17, 18, 19, 20, 21], "view": 126, "visual": [327, 328, 329, 350, 412], "vit": 353, "vllm": [318, 367], "vnni": 409, "voic": [311, 355, 375], "voicebot": 340, "voicechat": 340, "vtune": 410, "vulner": 271, "w2g128": 288, "w3g128": 288, "w4g": 288, "w4g128": 288, "web": 361, "weight": [319, 347, 388, 405, 412, 432], "weightpruningconfig": 419, "welcom": [292, 435], "what": 370, "where": 127, "whether": 365, "wise": 401, "woq": 432, "word_embed": 242, "work": [302, 370, 402], "workflow": 354, "xeon": [313, 314, 315, 317, 322, 349, 358], "yaml": [323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 338, 340, 343, 344, 363, 388], "you": 389, "your": [345, 361, 383, 384]}}) \ No newline at end of file +Search.setIndex({"alltitles": {"1 Setup Environment": [[361, "setup-environment"]], "1. Accuracies $\\uparrow$ across 11 tasks(0-shot) of LLaMA and Mistral models at W4G-1.": [[288, "accuracies-uparrow-across-11-tasks-0-shot-of-llama-and-mistral-models-at-w4g-1"]], "1. Add *.h of the customized operator to executor/include/operators": [[394, "add-h-of-the-customized-operator-to-executor-include-operators"]], "1. Architecture": [[388, "architecture"]], "1. Docker Image Setup": [[315, "docker-image-setup"]], "1. Download the Workflow Repository": [[354, "download-the-workflow-repository"]], "1. Download the pre-finetuned VITS model": [[353, "download-the-pre-finetuned-vits-model"]], "1. Environment": [[346, "environment"], [352, "environment"]], "1. Environment\u200b": [[348, "environment"], [349, "environment"]], "1. Install requirements": [[353, "install-requirements"]], "1. Introduction": [[376, "introduction"]], "1. Prepare Dataset": [[314, "prepare-dataset"]], "1. Prepare the data": [[353, "prepare-the-data"]], "1. Prepare the sparse model": [[412, "prepare-the-sparse-model"]], "1. Prerequisites": [[302, "prerequisites"]], "1. Quantization": [[427, "quantization"]], "1. Setup Environment": [[362, "setup-environment"]], "1. Single Card Fine-tuning": [[349, "single-card-fine-tuning"]], "1. Single Card Fine-tuning in Habana DL1": [[349, "single-card-fine-tuning-in-habana-dl1"]], "1. Single Node Fine-tuning in Xeon SPR": [[314, "single-node-fine-tuning-in-xeon-spr"]], "1. Single Node Fine-tuning in Xeon SPR": [[349, "single-node-fine-tuning-in-xeon-spr"]], "1. To get the tuned model and its accuracy:": [[393, "to-get-the-tuned-model-and-its-accuracy"]], "1.1 Install intel-extension-for-transformers": [[361, "install-intel-extension-for-transformers"], [362, "install-intel-extension-for-transformers"]], "1.2 Install neural-chat and retrieval dependency": [[362, "install-neural-chat-and-retrieval-dependency"]], "1.2 Install neural-chat dependency": [[361, "install-neural-chat-dependency"]], "1.2.1 CPU Platform": [[361, "cpu-platform"]], "1.2.2 GPU Platform": [[361, "gpu-platform"]], "2 Run the chatbot in command mode": [[361, "run-the-chatbot-in-command-mode"]], "2 samples regarding Eiffel Tower": [[377, "samples-regarding-eiffel-tower"], [377, "id1"]], "2 samples regarding prime minister of the United Kingdom": [[377, "samples-regarding-prime-minister-of-the-united-kingdom"], [377, "id2"]], "2. Accuracies $\\uparrow$ across 11 tasks(0-shot) of LLaMA and Mistral models at W4G128.": [[288, "accuracies-uparrow-across-11-tasks-0-shot-of-llama-and-mistral-models-at-w4g128"]], "2. Add *.cpp of the customized operator to executor/src/operators": [[394, "add-cpp-of-the-customized-operator-to-executor-src-operators"]], "2. Create Docker Container": [[315, "create-docker-container"]], "2. Create environment and install software packages": [[354, "create-environment-and-install-software-packages"]], "2. Deploy a TF/ONNX model using Engine inference": [[388, "deploy-a-tf-onnx-model-using-engine-inference"]], "2. Do finetuning of the Shanghainese Audio -> Shanghainese text ASR model": [[353, "do-finetuning-of-the-shanghainese-audio-shanghainese-text-asr-model"]], "2. Inference": [[427, "inference"]], "2. Installation": [[302, "installation"]], "2. Multi Card Fine-tuning in Habana DL1": [[349, "multi-card-fine-tuning-in-habana-dl1"]], "2. Multi-node Fine-tuning in Xeon SPR": [[314, "multi-node-fine-tuning-in-xeon-spr"], [349, "multi-node-fine-tuning-in-xeon-spr"]], "2. Prepare Docker Image": [[314, "prepare-docker-image"]], "2. Prepare reference dataset": [[346, "prepare-reference-dataset"], [352, "prepare-reference-dataset"]], "2. Prepare the Model": [[348, "prepare-the-model"], [349, "prepare-the-model"]], "2. Requirements": [[376, "requirements"]], "2. Run below commands": [[412, "run-below-commands"]], "2. Run the RAG in command mode": [[362, "run-the-rag-in-command-mode"]], "2. To get the benchmark of tuned model:": [[393, "to-get-the-benchmark-of-tuned-model"]], "2.1 Build Docker Image": [[314, "build-docker-image"]], "2.1 Install from PyPi": [[302, "install-from-pypi"]], "2.2 Docker Pull from Docker Hub": [[314, "docker-pull-from-docker-hub"]], "2.2 Install from Conda": [[302, "install-from-conda"]], "2.3 Install from Source": [[302, "install-from-source"]], "2021 (1)": [[420, "id4"]], "2022 (5)": [[420, "id3"]], "2023 (34)": [[420, "id2"]], "2024 (15)": [[420, "id1"]], "3. Accuracies $\\uparrow$ across 11 tasks(0-shot) of LLaMA and Mistral models at W3G128.": [[288, "accuracies-uparrow-across-11-tasks-0-shot-of-llama-and-mistral-models-at-w3g128"]], "3. Accuracy": [[427, "accuracy"]], "3. Analysis results": [[412, "analysis-results"]], "3. Create Docker Container": [[314, "create-docker-container"]], "3. Do finetuning of the Shanghainese text -> Mandarian text translation model": [[353, "do-finetuning-of-the-shanghainese-text-mandarian-text-translation-model"]], "3. Finetune the Mandarian text -> Shanghainese text translation model": [[353, "finetune-the-mandarian-text-shanghainese-text-translation-model"]], "3. How To Run": [[302, "how-to-run"]], "3. Manual customized yaml and weight binary to use Engine inference": [[388, "manual-customized-yaml-and-weight-binary-to-use-engine-inference"]], "3. Multi-node Fine-tuning in AWS m7i SPR instances": [[349, "multi-node-fine-tuning-in-aws-m7i-spr-instances"]], "3. Prepare Dataset": [[348, "prepare-dataset"], [349, "prepare-dataset"]], "3. Prepare dataset": [[354, "prepare-dataset"]], "3. Run chatbot in server mode with UI": [[361, "run-chatbot-in-server-mode-with-ui"]], "3. Simple Test using Docker Container": [[315, "simple-test-using-docker-container"]], "3. Single Node Fine-tuning in Habana DL1": [[314, "single-node-fine-tuning-in-habana-dl1"]], "3. Supervised Fine-tuning (SFT)": [[352, "supervised-fine-tuning-sft"]], "3. Training": [[346, "training"]], "3. Training Data Construction": [[376, "training-data-construction"]], "3.1 Install Requirements": [[302, "install-requirements"]], "3.1 Start the service": [[361, "start-the-service"]], "3.1.1 Verify the client connection to server is OK.": [[361, "verify-the-client-connection-to-server-is-ok"]], "3.1.2 Test request command at client side": [[361, "test-request-command-at-client-side"]], "3.2 Prepare Datasets": [[302, "prepare-datasets"]], "3.2 Set up Server mode UI": [[361, "set-up-server-mode-ui"]], "3.3 Model Compression": [[302, "model-compression"]], "3.3 Start the web service": [[361, "start-the-web-service"]], "3.4 Model Inference": [[302, "model-inference"]], "3D Inference": [[399, "d-inference"]], "4. Accuracies $\\uparrow$ across 11 tasks(0-shot) of LLaMA and Mistral models at W2G128.": [[288, "accuracies-uparrow-across-11-tasks-0-shot-of-llama-and-mistral-models-at-w2g128"]], "4. Evaluation": [[346, "evaluation"]], "4. Integrate Neural Engine as Backend": [[388, "integrate-neural-engine-as-backend"]], "4. Reward / preference modeling (RM) Fine-tuning": [[352, "reward-preference-modeling-rm-fine-tuning"]], "4. Simple Test using Docker Container": [[314, "simple-test-using-docker-container"]], "4. Training Example": [[376, "training-example"]], "5. Evaluation": [[376, "evaluation"]], "5. Reinforcement Fine-tuning": [[352, "reinforcement-fine-tuning"]], "6. Verified Models": [[376, "verified-models"]], "API": [[273, "api"]], "API reference for users": [[398, "api-reference-for-users"]], "API usage": [[305, "api-usage"]], "ASR": [[353, "asr"], [353, "id1"], [353, "id3"]], "Access retrieval service": [[375, "access-retrieval-service"]], "Access text chat service": [[375, "access-text-chat-service"]], "Access the Server using the RESTful API": [[321, "access-the-server-using-the-restful-api"]], "Access the Service": [[309, "access-the-service"]], "Access voice chat service": [[375, "access-voice-chat-service"]], "Accuracy Aware Tuning": [[423, "accuracy-aware-tuning"]], "Acknowledgements": [[353, "acknowledgements"], [359, "acknowledgements"], [360, "acknowledgements"], [374, "acknowledgements"]], "Add Customized Pattern": [[387, "add-customized-pattern"]], "Add one AWS inbound rule for distributed training": [[349, "add-one-aws-inbound-rule-for-distributed-training"]], "Additional useful RESTful APIs": [[321, "additional-useful-restful-apis"]], "Advanced Topics": [[319, "advanced-topics"]], "After knowledge editing": [[377, "after-knowledge-editing"]], "Architecture of Intel\u00ae Extension for Transformers": [[287, "architecture-of-intel-extension-for-transformers"]], "Attributes": [[243, "attributes"]], "Attribution": [[298, "attribution"]], "Automatic Mixed Precision (AMP)": [[319, "automatic-mixed-precision-amp"]], "Bare Metal": [[348, "bare-metal"], [349, "bare-metal"]], "Baremetal": [[322, "baremetal"]], "Before knowledge editing": [[377, "before-knowledge-editing"]], "Benchmark": [[289, "benchmark"]], "Benchmark Output": [[289, "benchmark-output"]], "Benchmark for Kernels": [[413, "benchmark-for-kernels"]], "Binary Injectors": [[400, "binary-injectors"]], "Brief introduction for ISAs": [[403, "brief-introduction-for-isas"]], "Build": [[398, "build"], [413, "build"]], "Build Docker image with customized SSH server port from scratch": [[349, "build-docker-image-with-customized-ssh-server-port-from-scratch"]], "Build RAG (retriveval augment generation) example with Intel\u00ae Extension for Transformers neural-chat on Intel GPU": [[362, "build-rag-retriveval-augment-generation-example-with-intel-extension-for-transformers-neural-chat-on-intel-gpu"]], "Build Your Chatbot with Intel\u00ae Extension for Transformers neural-chat": [[361, "build-your-chatbot-with-intel-extension-for-transformers-neural-chat"]], "Build the chatbot and interact with the chatbot:": [[358, "build-the-chatbot-and-interact-with-the-chatbot"], [372, "build-the-chatbot-and-interact-with-the-chatbot"]], "Build the yaml and weight binary": [[388, "build-the-yaml-and-weight-binary"]], "Building RESTful API Server": [[321, "building-restful-api-server"]], "CI Introduction": [[269, "ci-introduction"], [300, "ci-introduction"]], "CPU Usage": [[361, "cpu-usage"]], "Cache Issues": [[399, "cache-issues"]], "Caching": [[319, "caching"]], "Caching Data": [[370, "caching-data"]], "Calculation": [[408, "calculation"]], "Call the audio plugin service": [[336, "call-the-audio-plugin-service"]], "Call the image2image plugin service": [[337, "call-the-image2image-plugin-service"]], "Candidate patterns": [[409, "candidate-patterns"]], "Centos 8": [[308, "centos-8"]], "Chatbot with Multimodal": [[319, "chatbot-with-multimodal"]], "Chatbot with RAG": [[319, "chatbot-with-rag"]], "ChildParentRetriever": [[372, "childparentretriever"]], "Chroma": [[372, "chroma"]], "Citation": [[415, "citation"]], "Class Kernel": [[279, "class-kernel"]], "Class engine": [[278, "class-engine"]], "Class operator_desc": [[280, "class-operator-desc"]], "Classes": [[0, "classes"], [2, "classes"], [5, "classes"], [9, "classes"], [14, "classes"], [22, "classes"], [24, "classes"], [25, "classes"], [28, "classes"], [30, "classes"], [32, "classes"], [33, "classes"], [35, "classes"], [36, "classes"], [37, "classes"], [40, "classes"], [44, "classes"], [47, "classes"], [50, "classes"], [52, "classes"], [53, "classes"], [54, "classes"], [55, "classes"], [57, "classes"], [60, "classes"], [61, "classes"], [63, "classes"], [64, "classes"], [65, "classes"], [66, "classes"], [67, "classes"], [68, "classes"], [69, "classes"], [70, "classes"], [71, "classes"], [72, "classes"], [73, "classes"], [74, "classes"], [76, "classes"], [77, "classes"], [78, "classes"], [79, "classes"], [80, "classes"], [81, "classes"], [82, "classes"], [84, "classes"], [85, "classes"], [86, "classes"], [87, "classes"], [88, "classes"], [89, "classes"], [90, "classes"], [92, "classes"], [93, "classes"], [94, "classes"], [95, "classes"], [96, "classes"], [97, "classes"], [98, "classes"], [99, "classes"], [100, "classes"], [101, "classes"], [102, "classes"], [103, "classes"], [105, "classes"], [106, "classes"], [107, "classes"], [108, "classes"], [109, "classes"], [110, "classes"], [111, "classes"], [112, "classes"], [113, "classes"], [114, "classes"], [115, "classes"], [116, "classes"], [117, "classes"], [118, "classes"], [119, "classes"], [120, "classes"], [121, "classes"], [122, "classes"], [123, "classes"], [124, "classes"], [125, "classes"], [126, "classes"], [127, "classes"], [128, "classes"], [129, "classes"], [130, "classes"], [131, "classes"], [132, "classes"], [133, "classes"], [134, "classes"], [135, "classes"], [136, "classes"], [137, "classes"], [138, "classes"], [139, "classes"], [140, "classes"], [141, "classes"], [142, "classes"], [143, "classes"], [144, "classes"], [145, "classes"], [146, "classes"], [147, "classes"], [148, "classes"], [149, "classes"], [151, "classes"], [152, "classes"], [153, "classes"], [154, "classes"], [155, "classes"], [156, "classes"], [157, "classes"], [158, "classes"], [159, "classes"], [160, "classes"], [161, "classes"], [162, "classes"], [163, "classes"], [164, "classes"], [165, "classes"], [166, "classes"], [167, "classes"], [168, "classes"], [169, "classes"], [170, "classes"], [171, "classes"], [172, "classes"], [173, "classes"], [174, "classes"], [175, "classes"], [176, "classes"], [177, "classes"], [178, "classes"], [179, "classes"], [180, "classes"], [181, "classes"], [182, "classes"], [183, "classes"], [184, "classes"], [185, "classes"], [186, "classes"], [187, "classes"], [188, "classes"], [189, "classes"], [190, "classes"], [191, "classes"], [192, "classes"], [193, "classes"], [194, "classes"], [195, "classes"], [196, "classes"], [197, "classes"], [198, "classes"], [199, "classes"], [200, "classes"], [201, "classes"], [202, "classes"], [203, "classes"], [204, "classes"], [205, "classes"], [206, "classes"], [207, "classes"], [208, "classes"], [209, "classes"], [210, "classes"], [211, "classes"], [212, "classes"], [213, "classes"], [214, "classes"], [215, "classes"], [216, "classes"], [217, "classes"], [218, "classes"], [219, "classes"], [220, "classes"], [221, "classes"], [222, "classes"], [223, "classes"], [224, "classes"], [225, "classes"], [226, "classes"], [227, "classes"], [228, "classes"], [229, "classes"], [230, "classes"], [231, "classes"], [232, "classes"], [233, "classes"], [234, "classes"], [235, "classes"], [236, "classes"], [237, "classes"], [238, "classes"], [239, "classes"], [240, "classes"], [241, "classes"], [242, "classes"], [246, "classes"], [247, "classes"], [250, "classes"], [251, "classes"], [255, "classes"], [256, "classes"], [257, "classes"], [258, "classes"], [259, "classes"], [260, "classes"], [264, "classes"]], "Code Generation": [[319, "code-generation"]], "CodeLlama": [[349, "codellama"]], "Compile": [[275, "compile"]], "Compile Examples": [[392, "compile-examples"]], "Compile Models": [[337, "compile-models"]], "Compile an ONNX model to Engine IR": [[392, "compile-an-onnx-model-to-engine-ir"]], "Compile to IR": [[392, "compile-to-ir"]], "Config": [[283, "config"]], "Configure Environment Variables": [[334, "configure-environment-variables"]], "Configure Multi-Nodes": [[332, "configure-multi-nodes"]], "Configure Multi-NumaNodes": [[332, "configure-multi-numanodes"]], "Configure Multi-node": [[330, "configure-multi-node"]], "Configure OpenAI keys": [[324, "configure-openai-keys"]], "Configure SSH between Servers": [[330, "configure-ssh-between-servers"], [332, "configure-ssh-between-servers"]], "Configure YAML": [[324, "configure-yaml"], [336, "configure-yaml"], [338, "configure-yaml"], [363, "configure-yaml"]], "Configure photoai.yaml": [[334, "configure-photoai-yaml"]], "Configure the assisted_gen.yaml": [[323, "configure-the-assisted-gen-yaml"]], "Configure the codegen.yaml": [[326, "configure-the-codegen-yaml"], [327, "configure-the-codegen-yaml"], [328, "configure-the-codegen-yaml"], [329, "configure-the-codegen-yaml"], [330, "configure-the-codegen-yaml"], [331, "configure-the-codegen-yaml"], [332, "configure-the-codegen-yaml"]], "Configure the textbot.yaml": [[343, "configure-the-textbot-yaml"], [344, "configure-the-textbot-yaml"]], "Configure the voicebot.yaml": [[340, "configure-the-voicebot-yaml"]], "Consume Chat Q&A Service": [[316, "consume-chat-q-a-service"]], "Consume Chat Service": [[316, "consume-chat-service"]], "Consume Summary Service": [[316, "consume-summary-service"]], "Consume the Service": [[317, "consume-the-service"], [318, "consume-the-service"]], "Consume the Service with Simple Test": [[313, "consume-the-service-with-simple-test"], [316, "consume-the-service-with-simple-test"]], "Consume the Services": [[363, "consume-the-services"]], "Contents:": [[434, null], [439, null]], "Contribution Guidelines": [[300, "contribution-guidelines"]], "Contribution and Legal Documentation": [[270, "contribution-and-legal-documentation"]], "Contributor Covenant Code of Conduct": [[298, "contributor-covenant-code-of-conduct"], [300, "contributor-covenant-code-of-conduct"]], "Create Docker Image for HPU": [[366, "create-docker-image-for-hpu"]], "Create Image Database": [[334, "create-image-database"]], "Create Nodes and Establish Connections": [[391, "create-nodes-and-establish-connections"]], "Create Tables": [[334, "create-tables"]], "Create an Instance of Criterion(Optional)": [[303, "create-an-instance-of-criterion-optional"]], "Create an Instance of DistillationConfig": [[303, "create-an-instance-of-distillationconfig"]], "Create an Instance of Metric": [[303, "create-an-instance-of-metric"]], "Create an Instance of Objective(Optional)": [[423, "create-an-instance-of-objective-optional"]], "Create an Instance of QuantizationConfig": [[423, "create-an-instance-of-quantizationconfig"]], "Create an instance of Metric": [[419, "create-an-instance-of-metric"]], "Create an instance of WeightPruningConfig": [[419, "create-an-instance-of-weightpruningconfig"]], "Create and activate conda environment": [[322, "create-and-activate-conda-environment"]], "Customized Operators Register": [[394, "customized-operators-register"]], "Customized endpoints of a audio-input-audio-output pipeline": [[340, "customized-endpoints-of-a-audio-input-audio-output-pipeline"]], "Customizing the NeuralChat Service": [[309, "customizing-the-neuralchat-service"]], "Dataset": [[354, "dataset"]], "Dataset related arguments": [[348, "dataset-related-arguments"], [349, "dataset-related-arguments"]], "Demo": [[353, "demo"]], "Dense Reference Deployment on Neural Engine": [[304, "dense-reference-deployment-on-neural-engine"]], "Dense and Sparse": [[402, "dense-and-sparse"]], "Dependencies Installation": [[369, "dependencies-installation"]], "Deploy NeuralChat Service": [[309, "deploy-neuralchat-service"]], "Deploy a textbot with vllm": [[367, "deploy-a-textbot-with-vllm"]], "Deploy and Integration": [[388, "deploy-and-integration"]], "Deploy it as a server": [[360, "deploy-it-as-a-server"]], "Deploy on Huggingface Space": [[345, "deploy-on-huggingface-space"], [383, "deploy-on-huggingface-space"], [384, "deploy-on-huggingface-space"]], "Deploy on your server": [[345, "deploy-on-your-server"], [383, "deploy-on-your-server"], [384, "deploy-on-your-server"]], "Design": [[393, "design"]], "Details": [[408, "details"]], "Developer\u2019s Perspective": [[400, "developer-s-perspective"]], "Developer\u2019s Perspective.": [[401, "developer-s-perspective"]], "Direct Layernorm_ba": [[406, "direct-layernorm-ba"]], "Direct Preference Optimization (DPO)": [[346, "direct-preference-optimization-dpo"], [347, "direct-preference-optimization-dpo"]], "Distill with Trainer": [[303, "distill-with-trainer"]], "Distillation": [[303, "distillation"], [303, "id1"], [304, "distillation"], [306, "distillation"]], "Do chatbot inference with Docker": [[315, "do-chatbot-inference-with-docker"]], "Do inference of the Mandarian text -> Shanghainese text translation model": [[353, "do-inference-of-the-mandarian-text-shanghainese-text-translation-model"]], "Do inference of the Shanghainese Audio -> Shanghainese text ASR model": [[353, "do-inference-of-the-shanghainese-audio-shanghainese-text-asr-model"]], "Do inference of the Shanghainese text -> Mandarian text translation model": [[353, "do-inference-of-the-shanghainese-text-mandarian-text-translation-model"]], "Do inference of the Shanghainese text -> Shanghainese audio TTS model": [[353, "do-inference-of-the-shanghainese-text-shanghainese-audio-tts-model"]], "Docker": [[322, "docker"], [348, "docker"], [349, "docker"]], "Documentation Overview and Installation": [[270, "documentation-overview-and-installation"]], "Dolly-V2-3B": [[425, "dolly-v2-3b"]], "Download Models": [[338, "download-models"], [363, "download-models"]], "Dynamic Quant Matmul": [[405, "dynamic-quant-matmul"]], "Early-Exit": [[304, "early-exit"]], "Edit Knowledge of LLMs": [[377, "edit-knowledge-of-llms"]], "Editing knowledge with 2 samples": [[377, "editing-knowledge-with-2-samples"]], "Efficient LLM Inference on CPUs": [[426, "efficient-llm-inference-on-cpus"]], "Efficient kernel": [[402, "efficient-kernel"]], "Electra": [[425, "electra"]], "Element-wise Injector": [[401, "element-wise-injector"]], "Enforcement": [[298, "enforcement"]], "Engine API": [[277, "engine-api"]], "Engine Tuning": [[390, "engine-tuning"]], "English Text-to-Speech (TTS)": [[369, "english-text-to-speech-tts"]], "Environment Setup": [[313, "environment-setup"], [316, "environment-setup"], [317, "environment-setup"], [318, "environment-setup"]], "Environment\u200b": [[377, "environment"]], "Evaluation": [[351, "evaluation"]], "Evaluation Guidelines": [[351, "evaluation-guidelines"]], "Evaluation Metrics": [[349, "evaluation-metrics"]], "Evaluation Only": [[351, "evaluation-only"]], "Example": [[290, "example"], [421, "example"], [428, "example"], [429, "example"], [433, "example"]], "Example for CPU device": [[432, "example-for-cpu-device"]], "Example for CUDA GPU device": [[432, "example-for-cuda-gpu-device"]], "Example of AutoRound on Intel GPU": [[432, "example-of-autoround-on-intel-gpu"]], "Example of Chat Q&A Service.": [[316, "example-of-chat-q-a-service"]], "Example of Chat Service.": [[316, "example-of-chat-service"]], "Example of Summary Service.": [[316, "example-of-summary-service"]], "Examples": [[289, "examples"], [304, "examples"], [305, "examples"], [385, "examples"], [413, "examples"], [413, "id1"], [413, "id2"], [413, "id3"], [413, "id4"], [413, "id5"], [413, "id6"], [413, "id7"], [413, "id8"], [413, "id9"], [413, "id10"], [413, "id11"], [418, "examples"], [422, "examples"]], "Examples For CPU AND CUDA": [[432, "examples-for-cpu-and-cuda"]], "Examples For Intel GPU": [[432, "examples-for-intel-gpu"]], "Examples:": [[417, "examples"]], "Exceptions": [[24, "exceptions"]], "Expected Output": [[302, "expected-output"]], "Export to BF16 ONNX Model": [[305, "export-to-bf16-onnx-model"]], "Export to FP32 ONNX Model": [[305, "export-to-fp32-onnx-model"]], "Export to INT8 ONNX Model": [[305, "export-to-int8-onnx-model"]], "Export to ONNX": [[305, "export-to-onnx"]], "Extract Tables From PDF File": [[359, "extract-tables-from-pdf-file"]], "FAQ": [[269, "faq"], [300, "faq"]], "FLAN-T5": [[349, "flan-t5"]], "FP32 Accuracy": [[427, "fp32-accuracy"]], "FP32 Accuracy (Baseline)": [[426, "fp32-accuracy-baseline"]], "FP32 Inference": [[427, "fp32-inference"]], "FP32 Inference (Baseline)": [[426, "fp32-inference-baseline"]], "FP32/BF16 Inference": [[371, "fp32-bf16-inference"]], "Face Animation": [[360, "face-animation"], [374, "face-animation"]], "Falcon": [[349, "falcon"]], "Falcon-7B": [[425, "falcon-7b"]], "Features": [[291, "features"], [434, "features"]], "Fine-tuning": [[319, "fine-tuning"], [354, "fine-tuning"]], "Fine-tuning and Inference": [[354, "fine-tuning-and-inference"]], "Fine-tuning on Intel Arc GPUs": [[349, "fine-tuning-on-intel-arc-gpus"]], "Finetune": [[314, "finetune"], [348, "finetune"], [349, "finetune"]], "Finetune Embedding Model on Task-Specific Datasets": [[376, "finetune-embedding-model-on-task-specific-datasets"]], "Finetuning": [[353, "finetuning"], [355, "finetuning"]], "For LLaMA2": [[349, "for-llama2"]], "For developers": [[413, "for-developers"]], "For executor backend": [[305, "for-executor-backend"]], "Framework Features": [[400, "framework-features"], [401, "framework-features"]], "Full Publications/Events (51)": [[420, "full-publications-events-51"]], "Functions": [[0, "functions"], [1, "functions"], [4, "functions"], [6, "functions"], [15, "functions"], [17, "functions"], [20, "functions"], [21, "functions"], [23, "functions"], [24, "functions"], [25, "functions"], [27, "functions"], [28, "functions"], [29, "functions"], [30, "functions"], [32, "functions"], [36, "functions"], [37, "functions"], [39, "functions"], [40, "functions"], [41, "functions"], [42, "functions"], [44, "functions"], [45, "functions"], [49, "functions"], [57, "functions"], [61, "functions"], [62, "functions"], [95, "functions"], [184, "functions"], [243, "functions"], [244, "functions"], [245, "functions"], [252, "functions"], [260, "functions"], [262, "functions"], [263, "functions"], [264, "functions"], [265, "functions"], [266, "functions"], [267, "functions"], [268, "functions"]], "Fuse Pattern and Set Attributes of New Pattern after Fusion": [[387, "fuse-pattern-and-set-attributes-of-new-pattern-after-fusion"]], "GPT-J fine-tuning and inference": [[354, "gpt-j-fine-tuning-and-inference"]], "GPT-NEOX-20B": [[425, "gpt-neox-20b"]], "GPT-j-6B": [[425, "gpt-j-6b"]], "GPU Usage": [[361, "gpu-usage"]], "General": [[300, "general"]], "Generate the Engine Graph through TF/ONNX model": [[388, "generate-the-engine-graph-through-tf-onnx-model"]], "Get Start with Metrics": [[416, "get-start-with-metrics"]], "Get Started": [[302, "get-started"], [354, "get-started"], [423, "get-started"]], "Get Started with Benchmark API": [[289, "get-started-with-benchmark-api"]], "Get Started with NeuralChat": [[322, "get-started-with-neuralchat"]], "Get the result": [[367, "get-the-result"]], "Getting Started": [[306, "getting-started"], [309, "getting-started"]], "Graph": [[276, "graph"]], "Graph Fusion": [[391, "graph-fusion"]], "Graph Tuning for Dispatching Best Graph": [[390, "graph-tuning-for-dispatching-best-graph"]], "H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models": [[307, "h2o-heavy-hitter-oracle-for-efficient-generative-inference-of-large-language-models"]], "Help": [[311, "help"], [375, "help"], [375, "id1"]], "How it Works": [[302, "how-it-works"]], "How to Turn on Op Tuning Mechanism": [[390, "how-to-turn-on-op-tuning-mechanism"]], "How to Turn on Static Compressed Buffer": [[396, "how-to-turn-on-static-compressed-buffer"]], "How to train Intel/neural-chat-7b-v3-1 on Intel Gaudi2": [[347, "how-to-train-intel-neural-chat-7b-v3-1-on-intel-gaudi2"]], "How to visualize weights distribution of sparse model": [[412, "how-to-visualize-weights-distribution-of-sparse-model"]], "INT4 Accuracy": [[426, "int4-accuracy"], [427, "int4-accuracy"]], "INT4 Inference": [[426, "int4-inference"]], "INT4 inference": [[427, "int4-inference"]], "INT8/INT4 Inference": [[371, "int8-int4-inference"]], "IPEX Model": [[289, "ipex-model"]], "Implementation Details": [[294, "implementation-details"], [437, "implementation-details"]], "Import the module and build the chatbot instance:": [[356, "import-the-module-and-build-the-chatbot-instance"]], "Import the module and set the retrieval config:": [[358, "import-the-module-and-set-the-retrieval-config"], [372, "import-the-module-and-set-the-retrieval-config"]], "Inference": [[353, "inference"], [355, "inference"]], "Inference Parameters": [[371, "inference-parameters"]], "Inference with Docker": [[319, "inference-with-docker"]], "Inference with FP32/BF16": [[371, "inference-with-fp32-bf16"]], "Inference with INT8/INT4": [[371, "inference-with-int8-int4"]], "Initializing": [[370, "initializing"]], "Inputs format": [[414, "inputs-format"]], "Install": [[386, "install"]], "Install ITREX": [[331, "install-itrex"], [332, "install-itrex"]], "Install Intel Extension for Transformers": [[308, "install-intel-extension-for-transformers"], [357, "install-intel-extension-for-transformers"], [368, "install-intel-extension-for-transformers"]], "Install Intel\u00ae Extension for Transformers* from source": [[322, "install-intel-extension-for-transformers-from-source"]], "Install Models": [[334, "install-models"]], "Install MySQL": [[334, "install-mysql"]], "Install Neural Engine binary to deploy bare metal engine": [[386, "install-neural-engine-binary-to-deploy-bare-metal-engine"]], "Install NeuralChat Python Dependencies": [[331, "install-neuralchat-python-dependencies"], [332, "install-neuralchat-python-dependencies"]], "Install Python Dependencies": [[323, "install-python-dependencies"], [330, "install-python-dependencies"]], "Install Python dependencies": [[324, "install-python-dependencies"], [326, "install-python-dependencies"], [327, "install-python-dependencies"], [328, "install-python-dependencies"], [329, "install-python-dependencies"], [331, "install-python-dependencies"], [334, "install-python-dependencies"], [336, "install-python-dependencies"], [337, "install-python-dependencies"], [338, "install-python-dependencies"], [340, "install-python-dependencies"], [343, "install-python-dependencies"], [344, "install-python-dependencies"], [345, "install-python-dependencies"], [357, "install-python-dependencies"], [363, "install-python-dependencies"], [368, "install-python-dependencies"], [383, "install-python-dependencies"], [384, "install-python-dependencies"]], "Install System Dependency": [[369, "install-system-dependency"]], "Install environment": [[393, "install-environment"]], "Install from Pypi": [[308, "install-from-pypi"]], "Install from Source": [[308, "install-from-source"]], "Install intel extension for transformers": [[327, "install-intel-extension-for-transformers"], [328, "install-intel-extension-for-transformers"], [329, "install-intel-extension-for-transformers"]], "Install numactl": [[323, "install-numactl"], [324, "install-numactl"], [330, "install-numactl"], [331, "install-numactl"], [332, "install-numactl"], [334, "install-numactl"], [336, "install-numactl"], [337, "install-numactl"], [338, "install-numactl"], [340, "install-numactl"], [343, "install-numactl"], [344, "install-numactl"], [357, "install-numactl"], [363, "install-numactl"], [368, "install-numactl"]], "Install stable version intel_extension_for_transformers from pip": [[386, "install-stable-version-intel-extension-for-transformers-from-pip"]], "Install visual cpp build tools": [[327, "install-visual-cpp-build-tools"], [328, "install-visual-cpp-build-tools"], [329, "install-visual-cpp-build-tools"]], "Installation": [[308, "installation"], [309, "installation"], [356, "installation"], [370, "installation"], [386, "installation"], [398, "installation"]], "Intel Extension for Pytorch (IPEX) examples": [[304, "intel-extension-for-pytorch-ipex-examples"]], "Intel Neural Chat Dockerfile": [[312, "intel-neural-chat-dockerfile"]], "Intel TensorFlow Examples": [[304, "intel-tensorflow-examples"]], "Intel\u00ae Extension for Transformers": [[302, "intel-extension-for-transformers"]], "Intel\u00ae Extension for Transformers: Accelerating Transformer-based Models on Intel Platforms": [[272, "intel-extension-for-transformers-accelerating-transformer-based-models-on-intel-platforms"]], "Interact with the chatbot:": [[356, "interact-with-the-chatbot"]], "Intermediate Layer Knowledge Distillation": [[303, "intermediate-layer-knowledge-distillation"]], "Introduction": [[289, "introduction"], [303, "introduction"], [305, "introduction"], [307, "introduction"], [309, "introduction"], [336, "introduction"], [337, "introduction"], [338, "introduction"], [354, "introduction"], [357, "introduction"], [358, "introduction"], [363, "introduction"], [372, "introduction"], [373, "introduction"], [387, "introduction"], [389, "introduction"], [390, "introduction"], [391, "introduction"], [392, "introduction"], [395, "introduction"], [396, "introduction"], [398, "introduction"], [400, "introduction"], [401, "introduction"], [402, "introduction"], [407, "introduction"], [412, "introduction"], [416, "introduction"], [417, "introduction"], [418, "introduction"], [419, "introduction"], [421, "introduction"], [422, "introduction"], [423, "introduction"], [428, "introduction"], [429, "introduction"], [432, "introduction"]], "Iteration Level": [[389, "iteration-level"]], "Kernel APIs": [[282, "kernel-apis"]], "Kernel details": [[405, "kernel-details"]], "Kernels": [[293, "kernels"], [436, "kernels"]], "Key Instruction": [[404, "key-instruction"]], "Knowledge Distillation": [[303, "knowledge-distillation"], [304, "knowledge-distillation"]], "LLM Carbon Calculator": [[385, "llm-carbon-calculator"]], "LLM Finetuning": [[425, "llm-finetuning"]], "LLM Quantization": [[425, "llm-quantization"]], "LLM Runtime (GGML-Compatible)": [[425, "llm-runtime-ggml-compatible"]], "LLM Runtime Inference based on Pytorch Mode": [[425, "llm-runtime-inference-based-on-pytorch-mode"]], "LLMs": [[425, "llms"]], "Langchain Extension": [[372, "langchain-extension"]], "Langchain Extension APIs": [[309, "langchain-extension-apis"]], "Launch OpenAI-compatible Service": [[309, "launch-openai-compatible-service"]], "Launch and Run the Client": [[366, "launch-and-run-the-client"]], "Launch the Triton Server": [[366, "launch-the-triton-server"]], "Learn More": [[302, "learn-more"]], "Legal Information": [[415, "legal-information"]], "Length Adaptive Transformers": [[304, "length-adaptive-transformers"]], "Levels of JSON Profiling": [[389, "levels-of-json-profiling"]], "License": [[415, "license"]], "Llama3 on MTL": [[432, "llama3-on-mtl"]], "Loops": [[404, "loops"]], "MMMU Evaluation on Gaudi2": [[350, "mmmu-evaluation-on-gaudi2"]], "MPT": [[349, "mpt"]], "MPT-7B": [[425, "mpt-7b"]], "Matmul_avx512f_p2031_p2013": [[407, "matmul-avx512f-p2031-p2013"]], "Matmul_noperm_p2031_p1302": [[407, "matmul-noperm-p2031-p1302"]], "Matmul_p2031_2013": [[407, "matmul-p2031-2013"]], "Matmul_vnni_noperm_p2013_p1302": [[407, "matmul-vnni-noperm-p2013-p1302"]], "Memory Layout in SPMM_VNNI": [[399, "memory-layout-in-spmm-vnni"]], "Merge the lora weights": [[347, "merge-the-lora-weights"]], "Metric Class Summary": [[416, "metric-class-summary"]], "Metrics": [[416, "metrics"]], "Mine Hard Negatives": [[376, "mine-hard-negatives"]], "Mistral": [[349, "mistral"]], "Model": [[284, "model"]], "Model Level": [[389, "model-level"]], "Model\u2019s output 1 after editing the knowledge": [[377, "model-s-output-1-after-editing-the-knowledge"]], "Model\u2019s output 1 before editing the knowledge": [[377, "model-s-output-1-before-editing-the-knowledge"]], "Model\u2019s output 2 after editing the knowledge": [[377, "model-s-output-2-after-editing-the-knowledge"]], "Model\u2019s output 2 before editing the knowledge": [[377, "model-s-output-2-before-editing-the-knowledge"]], "Model\u2019s output 3 after editing the knowledge": [[377, "model-s-output-3-after-editing-the-knowledge"]], "Model\u2019s output 3 before editing the knowledge": [[377, "model-s-output-3-before-editing-the-knowledge"]], "Model\u2019s output 4 after editing the knowledge": [[377, "model-s-output-4-after-editing-the-knowledge"]], "Model\u2019s output 4 before editing the knowledge": [[377, "model-s-output-4-before-editing-the-knowledge"]], "Modify hostfile": [[330, "modify-hostfile"], [332, "modify-hostfile"], [332, "id1"], [332, "id2"]], "Module Contents": [[0, "module-contents"], [1, "module-contents"], [2, "module-contents"], [4, "module-contents"], [5, "module-contents"], [6, "module-contents"], [9, "module-contents"], [14, "module-contents"], [15, "module-contents"], [17, "module-contents"], [20, "module-contents"], [21, "module-contents"], [22, "module-contents"], [23, "module-contents"], [24, "module-contents"], [25, "module-contents"], [27, "module-contents"], [28, "module-contents"], [29, "module-contents"], [30, "module-contents"], [32, "module-contents"], [33, "module-contents"], [35, "module-contents"], [36, "module-contents"], [37, "module-contents"], [39, "module-contents"], [40, "module-contents"], [41, "module-contents"], [42, "module-contents"], [44, "module-contents"], [45, "module-contents"], [47, "module-contents"], [49, "module-contents"], [50, "module-contents"], [52, "module-contents"], [53, "module-contents"], [54, "module-contents"], [55, "module-contents"], [57, "module-contents"], [60, "module-contents"], [61, "module-contents"], [62, "module-contents"], [63, "module-contents"], [64, "module-contents"], [65, "module-contents"], [66, "module-contents"], [67, "module-contents"], [68, "module-contents"], [69, "module-contents"], [70, "module-contents"], [71, "module-contents"], [72, "module-contents"], [73, "module-contents"], [74, "module-contents"], [76, "module-contents"], [77, "module-contents"], [78, "module-contents"], [79, "module-contents"], [80, "module-contents"], [81, "module-contents"], [82, "module-contents"], [84, "module-contents"], [85, "module-contents"], [86, "module-contents"], [87, "module-contents"], [88, "module-contents"], [89, "module-contents"], [90, "module-contents"], [92, "module-contents"], [93, "module-contents"], [94, "module-contents"], [95, "module-contents"], [96, "module-contents"], [97, "module-contents"], [98, "module-contents"], [99, "module-contents"], [100, "module-contents"], [101, "module-contents"], [102, "module-contents"], [103, "module-contents"], [105, "module-contents"], [106, "module-contents"], [107, "module-contents"], [108, "module-contents"], [109, "module-contents"], [110, "module-contents"], [111, "module-contents"], [112, "module-contents"], [113, "module-contents"], [114, "module-contents"], [115, "module-contents"], [116, "module-contents"], [117, "module-contents"], [118, "module-contents"], [119, "module-contents"], [120, "module-contents"], [121, "module-contents"], [122, "module-contents"], [123, "module-contents"], [124, "module-contents"], [125, "module-contents"], [126, "module-contents"], [127, "module-contents"], [128, "module-contents"], [129, "module-contents"], [130, "module-contents"], [131, "module-contents"], [132, "module-contents"], [133, "module-contents"], [134, "module-contents"], [135, "module-contents"], [136, "module-contents"], [137, "module-contents"], [138, "module-contents"], [139, "module-contents"], [140, "module-contents"], [141, "module-contents"], [142, "module-contents"], [143, "module-contents"], [144, "module-contents"], [145, "module-contents"], [146, "module-contents"], [147, "module-contents"], [148, "module-contents"], [149, "module-contents"], [151, "module-contents"], [152, "module-contents"], [153, "module-contents"], [154, "module-contents"], [155, "module-contents"], [156, "module-contents"], [157, "module-contents"], [158, "module-contents"], [159, "module-contents"], [160, "module-contents"], [161, "module-contents"], [162, "module-contents"], [163, "module-contents"], [164, "module-contents"], [165, "module-contents"], [166, "module-contents"], [167, "module-contents"], [168, "module-contents"], [169, "module-contents"], [170, "module-contents"], [171, "module-contents"], [172, "module-contents"], [173, "module-contents"], [174, "module-contents"], [175, "module-contents"], [176, "module-contents"], [177, "module-contents"], [178, "module-contents"], [179, "module-contents"], [180, "module-contents"], [181, "module-contents"], [182, "module-contents"], [183, "module-contents"], [184, "module-contents"], [185, "module-contents"], [186, "module-contents"], [187, "module-contents"], [188, "module-contents"], [189, "module-contents"], [190, "module-contents"], [191, "module-contents"], [192, "module-contents"], [193, "module-contents"], [194, "module-contents"], [195, "module-contents"], [196, "module-contents"], [197, "module-contents"], [198, "module-contents"], [199, "module-contents"], [200, "module-contents"], [201, "module-contents"], [202, "module-contents"], [203, "module-contents"], [204, "module-contents"], [205, "module-contents"], [206, "module-contents"], [207, "module-contents"], [208, "module-contents"], [209, "module-contents"], [210, "module-contents"], [211, "module-contents"], [212, "module-contents"], [213, "module-contents"], [214, "module-contents"], [215, "module-contents"], [216, "module-contents"], [217, "module-contents"], [218, "module-contents"], [219, "module-contents"], [220, "module-contents"], [221, "module-contents"], [222, "module-contents"], [223, "module-contents"], [224, "module-contents"], [225, "module-contents"], [226, "module-contents"], [227, "module-contents"], [228, "module-contents"], [229, "module-contents"], [230, "module-contents"], [231, "module-contents"], [232, "module-contents"], [233, "module-contents"], [234, "module-contents"], [235, "module-contents"], [236, "module-contents"], [237, "module-contents"], [238, "module-contents"], [239, "module-contents"], [240, "module-contents"], [241, "module-contents"], [242, "module-contents"], [243, "module-contents"], [244, "module-contents"], [246, "module-contents"], [247, "module-contents"], [250, "module-contents"], [251, "module-contents"], [252, "module-contents"], [255, "module-contents"], [256, "module-contents"], [257, "module-contents"], [258, "module-contents"], [259, "module-contents"], [260, "module-contents"], [263, "module-contents"], [264, "module-contents"], [265, "module-contents"], [266, "module-contents"], [267, "module-contents"], [268, "module-contents"]], "Module Owner Matrix": [[299, "module-owner-matrix"]], "More Options": [[396, "more-options"]], "More Tuning Options": [[390, "more-tuning-options"]], "More work per thread": [[402, "more-work-per-thread"]], "Multi Language Automatic Speech Recognition (ASR)": [[369, "multi-language-automatic-speech-recognition-asr"]], "Multi Language Text-to-Speech (TTS)": [[369, "multi-language-text-to-speech-tts"]], "Multi Thread (Thread = 4)": [[411, "multi-thread-thread-4"]], "Multi-Modal": [[350, "multi-modal"]], "Multi-card serving (optional)": [[365, "multi-card-serving-optional"]], "Multimodal APIs": [[309, "multimodal-apis"]], "Naive": [[402, "naive"]], "Neural Chat Example": [[422, "neural-chat-example"]], "Neural Engine": [[296, "neural-engine"], [439, "neural-engine"]], "Neural Engine Support Matrix": [[397, "neural-engine-support-matrix"]], "NeuralChat": [[309, "neuralchat"], [311, "neuralchat"]], "NeuralChat Client": [[375, "neuralchat-client"]], "NeuralChat Command Line": [[311, "neuralchat-command-line"]], "NeuralChat Fine-tuning": [[348, "neuralchat-fine-tuning"], [349, "neuralchat-fine-tuning"]], "NeuralChat Notebooks": [[320, "neuralchat-notebooks"]], "NeuralChat Server": [[375, "neuralchat-server"]], "NeuralChat Server Command Line": [[375, "neuralchat-server-command-line"]], "OP Tuning for Dispatching Best Kernel and Related Runtime Config": [[390, "op-tuning-for-dispatching-best-kernel-and-related-runtime-config"]], "OPT-1.3B": [[425, "opt-1-3b"]], "Objective": [[417, "objective"]], "Obtain the Necessary Information for New Pattern Construction": [[391, "obtain-the-necessary-information-for-new-pattern-construction"]], "On Habana Gaudi Environment": [[314, "on-habana-gaudi-environment"], [314, "id2"], [315, "on-habana-gaudi-environment"], [315, "id2"]], "On Nvidia GPU Environment": [[314, "on-nvidia-gpu-environment"], [314, "id3"], [315, "on-nvidia-gpu-environment"]], "On Xeon SPR Environment": [[314, "on-xeon-spr-environment"], [314, "id1"], [315, "on-xeon-spr-environment"], [315, "id1"]], "On the fly activation reordering": [[409, "on-the-fly-activation-reordering"]], "OpenAI Official SDK": [[321, "openai-official-sdk"]], "OpenAI-Compatible RESTful APIs": [[309, "openai-compatible-restful-apis"], [321, "openai-compatible-restful-apis"]], "OpenSSF Badge": [[271, "openssf-badge"], [271, "id1"]], "Operator Level": [[389, "operator-level"]], "Operator Profiling Part": [[389, "operator-profiling-part"]], "Operator Specific Types": [[281, "operator-specific-types"]], "Optimization": [[319, "optimization"]], "Optimization and Inference Documentation": [[270, "optimization-and-inference-documentation"]], "Option 1 : Build Docker image from scratch": [[348, "option-1-build-docker-image-from-scratch"], [349, "option-1-build-docker-image-from-scratch"]], "Option 1: Build Docker Image": [[315, "option-1-build-docker-image"]], "Option 2: Docker Pull from Docker Hub": [[315, "option-2-docker-pull-from-docker-hub"]], "Option 2: Pull existing Docker image": [[348, "option-2-pull-existing-docker-image"], [349, "option-2-pull-existing-docker-image"]], "Orchestrate": [[304, "orchestrate"]], "Other Functionalities Documentation": [[270, "other-functionalities-documentation"]], "Our Pledge": [[298, "our-pledge"]], "Our Responsibilities": [[298, "our-responsibilities"]], "Our Standards": [[298, "our-standards"]], "Output file": [[351, "output-file"]], "Output folder structure": [[351, "output-folder-structure"]], "Overview": [[302, "overview"]], "Package Contents": [[245, "package-contents"], [262, "package-contents"]], "Parameters": [[372, "parameters"]], "Parse Pattern Representation List": [[395, "parse-pattern-representation-list"]], "Parse and Evaluation": [[351, "parse-and-evaluation"]], "Parts of CSV Profiling": [[389, "parts-of-csv-profiling"]], "Pattern": [[403, "pattern"]], "Pattern Mapping Dict": [[391, "pattern-mapping-dict"]], "Pattern Recognize": [[395, "pattern-recognize"]], "Pattern Representation": [[395, "pattern-representation"]], "Pattern Tuning for Dispatching Best Pattern": [[390, "pattern-tuning-for-dispatching-best-pattern"]], "Performance": [[295, "performance"], [397, "performance"], [398, "performance"], [438, "performance"]], "Performance acceleration on Intel\u00ae Xeon SPR": [[358, "performance-acceleration-on-intel-xeon-spr"]], "Performance and Profiling": [[410, "performance-and-profiling"]], "Pipeline": [[418, "pipeline"]], "Pipeline Inference for Executor Backend": [[418, "pipeline-inference-for-executor-backend"]], "Pipeline Inference for INT8 Model": [[418, "pipeline-inference-for-int8-model"]], "Platform Configuration": [[411, "platform-configuration"]], "Please clone a ITREX repo to this path.": [[314, "please-clone-a-itrex-repo-to-this-path"], [315, "please-clone-a-itrex-repo-to-this-path"]], "Plugin Parameters": [[371, "plugin-parameters"]], "Plugins": [[319, "plugins"]], "Post Training Dynamic Quantization": [[423, "post-training-dynamic-quantization"]], "Post Training Static Quantization": [[423, "post-training-static-quantization"]], "Pre-compute SPMM": [[406, "pre-compute-spmm"]], "Prefetch": [[402, "prefetch"]], "Prepare Configuration File and Documents": [[313, "prepare-configuration-file-and-documents"], [316, "prepare-configuration-file-and-documents"]], "Prepare Dataset": [[393, "prepare-dataset"]], "Prepare Dependency Packages": [[432, "prepare-dependency-packages"]], "Prepare Docker Image": [[316, "prepare-docker-image"]], "Prepare Environment": [[322, "prepare-environment"], [347, "prepare-environment"], [353, "prepare-environment"], [359, "prepare-environment"], [360, "prepare-environment"], [374, "prepare-environment"], [426, "id1"]], "Prepare Models": [[359, "prepare-models"], [360, "prepare-models"]], "Prepare ONNX Model": [[392, "prepare-onnx-model"]], "Prepare ONNX model": [[393, "prepare-onnx-model"]], "Prepare Python Environment": [[337, "prepare-python-environment"]], "Prepare Stable Diffusion Models": [[337, "prepare-stable-diffusion-models"]], "Prepare data": [[350, "prepare-data"], [350, "id1"], [355, "prepare-data"]], "Prepare environment": [[426, "prepare-environment"]], "Prepare serving scripts": [[364, "prepare-serving-scripts"], [365, "prepare-serving-scripts"], [366, "prepare-serving-scripts"]], "Prepare the environment": [[367, "prepare-the-environment"]], "Preprocessing of weight matrix": [[405, "preprocessing-of-weight-matrix"]], "Prerequisite": [[393, "prerequisite"]], "Prerequisites": [[308, "prerequisites"], [386, "prerequisites"]], "Prerequisites for using dynamic quant matmul": [[405, "prerequisites-for-using-dynamic-quant-matmul"]], "Prerequisite\u200b": [[314, "prerequisite"], [348, "prerequisite"], [349, "prerequisite"], [377, "prerequisite"], [427, "prerequisite"]], "Pretraining": [[350, "pretraining"]], "Print Results": [[351, "print-results"]], "Problem Description": [[406, "problem-description"]], "Problem Statements": [[407, "problem-statements"]], "Problem description": [[408, "problem-description"]], "Profiling": [[389, "profiling"]], "Profiling API": [[389, "profiling-api"]], "Profiling Examples": [[389, "profiling-examples"]], "Prune with Trainer": [[419, "prune-with-trainer"]], "Pruning": [[304, "pruning"], [306, "pruning"], [419, "pruning"]], "Pull Request Acceptance Criteria": [[300, "pull-request-acceptance-criteria"]], "Pull Request Checklist": [[300, "pull-request-checklist"]], "Pull Request Template": [[300, "pull-request-template"]], "Python API": [[422, "python-api"]], "Python APIs": [[274, "python-apis"]], "Pytorch Script:": [[303, "pytorch-script"]], "Pytorch version constrain": [[421, "pytorch-version-constrain"]], "QBits": [[421, "qbits"]], "QLoRA on CPU": [[422, "qlora-on-cpu"]], "Qdrant": [[372, "qdrant"]], "Quantization": [[304, "quantization"], [306, "quantization"], [423, "quantization"]], "Quantization Approach": [[423, "quantization-approach"]], "Quantization Aware Training": [[423, "quantization-aware-training"]], "Quantization Fundamentals": [[423, "quantization-fundamentals"]], "Quantization with Trainer": [[423, "quantization-with-trainer"]], "Quantize a ONNX model to engine low precision/int8 IR": [[393, "quantize-a-onnx-model-to-engine-low-precision-int8-ir"]], "Quantized Length Adaptive Transformer": [[306, "quantized-length-adaptive-transformer"]], "Quick check whether the server is up": [[365, "quick-check-whether-the-server-is-up"]], "Quick test with OpenAI compatible endpoints (audio)": [[340, "quick-test-with-openai-compatible-endpoints-audio"]], "QuickStart: Intel\u00ae Extension For Transformers*: NeuralChat on 4th Generation Intel\u00ae Xeon\u00ae Scalable Processors": [[322, "quickstart-intel-extension-for-transformers-neuralchat-on-4th-generation-intel-xeon-scalable-processors"]], "RAG Mode": [[372, "rag-mode"]], "Recommended Hardware": [[302, "recommended-hardware"]], "Reference Deployment on Neural Engine": [[304, "reference-deployment-on-neural-engine"]], "Register the Nodes\u2019 Op Types": [[387, "register-the-nodes-op-types"]], "Reinforcement Learning from Human Feedback (RLHF)": [[352, "reinforcement-learning-from-human-feedback-rlhf"]], "Related models": [[353, "related-models"]], "Release": [[424, "release"]], "Release Notes": [[424, "release-notes"]], "Remove the Old Pattern and Insert the New Pattern": [[391, "remove-the-old-pattern-and-insert-the-new-pattern"]], "Reorder": [[403, "reorder"]], "Reorder beforehand": [[407, "reorder-beforehand"]], "Reordering": [[408, "reordering"]], "Report a Vulnerability": [[271, "report-a-vulnerability"]], "Result": [[377, "result"]], "Retrievers": [[309, "retrievers"], [372, "retrievers"]], "Retrieving Cached Data": [[370, "retrieving-cached-data"]], "Rich Plugins": [[309, "rich-plugins"]], "Run": [[427, "run"]], "Run Accuracy Step by Step": [[426, "run-accuracy-step-by-step"]], "Run Llava": [[351, "run-llava"]], "Run Performance Step by Step": [[426, "run-performance-step-by-step"]], "Run the AskDoc server": [[338, "run-the-askdoc-server"]], "Run the Backend Container": [[366, "run-the-backend-container"]], "Run the Code Generation Chatbot Server": [[323, "run-the-code-generation-chatbot-server"], [330, "run-the-code-generation-chatbot-server"], [331, "run-the-code-generation-chatbot-server"], [332, "run-the-code-generation-chatbot-server"], [332, "id3"]], "Run the Code Generation Chatbot server": [[326, "run-the-code-generation-chatbot-server"], [327, "run-the-code-generation-chatbot-server"], [328, "run-the-code-generation-chatbot-server"], [329, "run-the-code-generation-chatbot-server"]], "Run the Inference": [[315, "run-the-inference"]], "Run the Inference on Habana Gaudi": [[315, "run-the-inference-on-habana-gaudi"]], "Run the Inference on Xeon SPR": [[315, "run-the-inference-on-xeon-spr"]], "Run the NeuralChat server with TGI framework": [[363, "run-the-neuralchat-server-with-tgi-framework"]], "Run the PhotoAI server": [[334, "run-the-photoai-server"]], "Run the TextChat server": [[324, "run-the-textchat-server"], [343, "run-the-textchat-server"], [344, "run-the-textchat-server"]], "Run the VoiceChat server": [[340, "run-the-voicechat-server"]], "Run the audio service server": [[336, "run-the-audio-service-server"]], "Run the complete code": [[358, "run-the-complete-code"]], "Run the frontend": [[345, "run-the-frontend"], [383, "run-the-frontend"], [384, "run-the-frontend"]], "Run the image2image service server": [[337, "run-the-image2image-service-server"]], "Run the inference by Engine": [[388, "run-the-inference-by-engine"], [388, "id1"]], "Run the script to set up the environment": [[322, "run-the-script-to-set-up-the-environment"]], "Run the table extraction script": [[359, "run-the-table-extraction-script"]], "Run tuning and benchmark": [[393, "run-tuning-and-benchmark"]], "SDE": [[410, "sde"]], "SPMM_VNNI 3D Inference": [[399, "spmm-vnni-3d-inference"]], "Safety Checker": [[319, "safety-checker"]], "Same Instructions as Multi-node Fine-tuning in Xeon SPR session": [[349, "same-instructions-as-multi-node-fine-tuning-in-xeon-spr-session"]], "Scope": [[298, "scope"]], "Script:": [[419, "script"], [423, "script"]], "Search Each Straight Chain Pattern": [[395, "search-each-straight-chain-pattern"]], "Sections": [[292, "sections"], [435, "sections"]], "Security Policy": [[271, "security-policy"]], "Selected Publications/Events": [[272, "selected-publications-events"]], "Sentence 1": [[377, "sentence-1"]], "Sentence 2": [[377, "sentence-2"]], "Serving NeuralChat Text Generation with Triton Inference Server": [[364, "serving-neuralchat-text-generation-with-triton-inference-server"]], "Serving NeuralChat Text Generation with Triton Inference Server (CUDA)": [[365, "serving-neuralchat-text-generation-with-triton-inference-server-cuda"]], "Serving NeuralChat Text Generation with Triton Inference Server on HPU": [[366, "serving-neuralchat-text-generation-with-triton-inference-server-on-hpu"]], "Set the Pattern Mapping Config and Register the Pattern": [[387, "set-the-pattern-mapping-config-and-register-the-pattern"]], "Setup Conda": [[323, "setup-conda"], [324, "setup-conda"], [326, "setup-conda"], [327, "setup-conda"], [328, "setup-conda"], [329, "setup-conda"], [330, "setup-conda"], [331, "setup-conda"], [332, "setup-conda"], [334, "setup-conda"], [336, "setup-conda"], [337, "setup-conda"], [338, "setup-conda"], [340, "setup-conda"], [343, "setup-conda"], [344, "setup-conda"], [345, "setup-conda"], [357, "setup-conda"], [363, "setup-conda"], [368, "setup-conda"], [383, "setup-conda"], [384, "setup-conda"]], "Setup Database": [[334, "setup-database"]], "Setup Environment": [[334, "setup-environment"], [336, "setup-environment"], [337, "setup-environment"], [338, "setup-environment"], [357, "setup-environment"], [363, "setup-environment"], [368, "setup-environment"]], "Setup NVIDIA GPU environment": [[318, "setup-nvidia-gpu-environment"]], "Setup Xeon SPR Environment": [[313, "setup-xeon-spr-environment"], [317, "setup-xeon-spr-environment"]], "Setups": [[412, "setups"]], "Shanghainese ASR (Audio-Speech-Recognition) and TTS (Text-To-Speech) finetuning/inference": [[353, "shanghainese-asr-audio-speech-recognition-and-tts-text-to-speech-finetuning-inference"]], "Simply run the test script": [[360, "simply-run-the-test-script"]], "Single Thread": [[411, "single-thread"]], "Single-node fine-tuning": [[354, "single-node-fine-tuning"]], "Smooth Quant": [[428, "smooth-quant"]], "Sparse GEMM AMX": [[403, "sparse-gemm-amx"]], "Sparse GEMM AVX512F": [[404, "sparse-gemm-avx512f"]], "Sparse GEMM VNNI": [[409, "sparse-gemm-vnni"]], "Sparse GEMM with Layer-Normalize": [[406, "sparse-gemm-with-layer-normalize"]], "Sparse Pattern & Data Format": [[404, "sparse-pattern-data-format"]], "Sparse Ratio Setting Part": [[389, "sparse-ratio-setting-part"]], "Sparse Reference Deployment on Neural Engine": [[304, "sparse-reference-deployment-on-neural-engine"]], "Sparse acceleration": [[402, "sparse-acceleration"]], "Splice Sub-chains with the Main Chain and Remove Duplicate Results": [[395, "splice-sub-chains-with-the-main-chain-and-remove-duplicate-results"]], "Stable Diffusion": [[425, "stable-diffusion"]], "StarCoder": [[349, "starcoder"]], "StarCoder-3B": [[425, "starcoder-3b"]], "Start NeuralChat Service": [[313, "start-neuralchat-service"], [316, "start-neuralchat-service"], [317, "start-neuralchat-service"], [318, "start-neuralchat-service"]], "Start NeuralChat Text Generation Service with Docker": [[316, "start-neuralchat-text-generation-service-with-docker"]], "Start NeuralChat and Code Generation Service with Docker": [[313, "start-neuralchat-and-code-generation-service-with-docker"]], "Start NeuralChat and TGI serving with Docker": [[317, "start-neuralchat-and-tgi-serving-with-docker"]], "Start NeuralChat and vLLM serving with Docker": [[318, "start-neuralchat-and-vllm-serving-with-docker"]], "Start Triton Inference Server": [[364, "start-triton-inference-server"], [365, "start-triton-inference-server"]], "Start the server": [[375, "start-the-server"]], "Start training!": [[350, "start-training"]], "Static Compressed Buffer": [[396, "static-compressed-buffer"]], "Static MHA": [[413, "static-mha"]], "Step-by-Step": [[427, "step-by-step"]], "Stock PyTorch Examples": [[304, "stock-pytorch-examples"]], "Stock Pytorch Model": [[289, "stock-pytorch-model"]], "Streaming LLM": [[429, "streaming-llm"]], "Submodules": [[18, "submodules"], [31, "submodules"], [34, "submodules"], [46, "submodules"], [51, "submodules"], [56, "submodules"], [58, "submodules"], [59, "submodules"], [83, "submodules"], [150, "submodules"], [249, "submodules"]], "Subpackages": [[58, "subpackages"], [245, "subpackages"]], "Summary and Next Steps": [[302, "summary-and-next-steps"]], "Supervised Fine-Tuning (SFT)": [[347, "supervised-fine-tuning-sft"]], "Support": [[300, "support"], [302, "support"]], "Supported Algorithms": [[432, "supported-algorithms"]], "Supported Feature Matrix": [[423, "supported-feature-matrix"]], "Supported Framework Matrix": [[428, "supported-framework-matrix"]], "Supported Matrix": [[398, "supported-matrix"]], "Supported Metric": [[416, "supported-metric"]], "Supported Model Export Matrix": [[305, "supported-model-export-matrix"]], "Supported Models": [[309, "supported-models"]], "Supported ONNX Format": [[392, "supported-onnx-format"]], "Supported Objectives Matrix:": [[417, "supported-objectives-matrix"]], "System Requirements": [[308, "system-requirements"], [309, "system-requirements"]], "System Summary": [[426, "system-summary"]], "TTS": [[353, "tts"], [353, "id2"], [353, "id4"]], "Test": [[356, "test"], [357, "test"], [368, "test"], [398, "test"]], "Test the TextChat server": [[324, "test-the-textchat-server"]], "Text Chat": [[311, "text-chat"]], "Tile": [[402, "tile"]], "Total Profiling Part": [[389, "total-profiling-part"]], "Trademarks": [[415, "trademarks"]], "Train": [[350, "train"]], "Trainer": [[285, "trainer"]], "Training": [[350, "training"]], "Training on CPU (SPR)": [[346, "training-on-cpu-spr"]], "Training on CUDA": [[352, "training-on-cuda"], [352, "id1"], [352, "id3"]], "Training on GPU": [[346, "training-on-gpu"]], "Training on Habana": [[346, "training-on-habana"], [352, "training-on-habana"], [352, "id2"], [352, "id4"]], "Transformers-Accelerated Libraries": [[398, "transformers-accelerated-libraries"]], "Transformers-accelerated Neural Engine": [[306, "transformers-accelerated-neural-engine"]], "Transposed MHA": [[408, "transposed-mha"]], "Transposed MatMul": [[407, "transposed-matmul"]], "Tutorials": [[430, "tutorials"]], "Ubuntu 20.04/22.04": [[308, "ubuntu-20-04-22-04"]], "Usage": [[307, "usage"], [356, "usage"], [358, "usage"], [359, "usage"], [360, "usage"], [362, "usage"], [369, "usage"], [369, "id1"], [369, "id2"], [370, "usage"], [372, "usage"], [373, "usage"], [374, "usage"], [377, "usage"], [400, "usage"], [401, "usage"], [413, "usage"], [419, "usage"]], "Usages": [[385, "usages"]], "Use Triton client to send inference request": [[364, "use-triton-client-to-send-inference-request"], [365, "use-triton-client-to-send-inference-request"]], "User Guide": [[297, "user-guide"], [431, "user-guide"], [440, "user-guide"]], "User-facing API": [[286, "user-facing-api"]], "User\u2019s Perspective": [[400, "user-s-perspective"], [401, "user-s-perspective"]], "Using Curl": [[309, "using-curl"]], "Using OpenAI Client Library": [[309, "using-openai-client-library"]], "Using Python Requests Library": [[309, "using-python-requests-library"]], "Using Single NumaNode": [[332, "using-single-numanode"]], "VTune": [[410, "vtune"]], "Validated Environment": [[308, "validated-environment"]], "Validated Hardware Environment": [[302, "validated-hardware-environment"], [308, "validated-hardware-environment"]], "Validated Model List": [[349, "validated-model-list"], [350, "validated-model-list"]], "Validated Model Performance": [[425, "validated-model-performance"]], "Validated Models": [[428, "validated-models"]], "Validated Performance Data": [[411, "validated-performance-data"]], "Validated Software Environment": [[308, "validated-software-environment"]], "Vector Stores": [[309, "vector-stores"], [372, "vector-stores"]], "VectorStoreRetriever": [[372, "vectorstoreretriever"]], "Verbose": [[410, "verbose"]], "Visual Instruction Tuning": [[350, "visual-instruction-tuning"]], "Voice Chat": [[311, "voice-chat"]], "Voice Cloning by finetuning a Text-To-Speech (TTS) model": [[355, "voice-cloning-by-finetuning-a-text-to-speech-tts-model"]], "Weight Only Quantization": [[319, "weight-only-quantization"]], "Weight Only Quantization (WOQ)": [[432, "weight-only-quantization-woq"]], "Weight Only Quantization with LLM Runtime": [[319, "weight-only-quantization-with-llm-runtime"]], "Welcome to Intel\u00ae Extension for Transformers\u2019 documentation!": [[292, "welcome-to-intel-extension-for-transformers-documentation"], [435, "welcome-to-intel-extension-for-transformers-documentation"]], "You can get profile only with ENGINE_PROFILING=1 before running model by python/c++ API.": [[389, "you-can-get-profile-only-with-engine-profiling-1-before-running-model-by-python-c-api"]], "alpha,beta,scale meaning": [[401, "alpha-beta-scale-meaning"]], "attention": [[413, "attention"]], "cURL": [[321, "curl"]], "conversation": [[0, "module-conversation"]], "different jit-paths for different weight size": [[405, "different-jit-paths-for-different-weight-size"]], "dynamic_quant": [[413, "dynamic-quant"]], "dynamic_quant_matmul": [[413, "dynamic-quant-matmul"]], "eltwiseop": [[413, "eltwiseop"]], "gaudi_spawn": [[1, "module-gaudi_spawn"]], "intel_extension_for_transformers.langchain.langchain_community.retrievers.child_parent_retriever": [[2, "module-intel_extension_for_transformers.langchain.langchain_community.retrievers.child_parent_retriever"]], "intel_extension_for_transformers.langchain.langchain_community.vectorstores.chroma": [[3, "module-intel_extension_for_transformers.langchain.langchain_community.vectorstores.chroma"]], "intel_extension_for_transformers.neural_chat.chatbot": [[4, "module-intel_extension_for_transformers.neural_chat.chatbot"]], "intel_extension_for_transformers.neural_chat.config": [[5, "module-intel_extension_for_transformers.neural_chat.config"]], "intel_extension_for_transformers.neural_chat.config_logging": [[6, "module-intel_extension_for_transformers.neural_chat.config_logging"]], "intel_extension_for_transformers.neural_chat.errorcode": [[7, "module-intel_extension_for_transformers.neural_chat.errorcode"]], "intel_extension_for_transformers.neural_chat.pipeline": [[8, "module-intel_extension_for_transformers.neural_chat.pipeline"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.image2image.instructpix2pix_pipeline": [[9, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.image2image.instructpix2pix_pipeline"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.memory.memory": [[10, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.memory.memory"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.detector.intent_detection": [[11, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.detector.intent_detection"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.detector.query_explainer": [[12, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.detector.query_explainer"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.parser.parser": [[13, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.parser.parser"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.retriever_adapter": [[14, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.retriever_adapter"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.security.safety_checker": [[15, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.security.safety_checker"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.bfm": [[16, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.bfm"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks": [[17, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util": [[18, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.load_mats": [[19, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.load_mats"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.preprocess": [[20, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.preprocess"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.util": [[21, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.util"]], "intel_extension_for_transformers.neural_chat.server.restful.openai_protocol": [[22, "module-intel_extension_for_transformers.neural_chat.server.restful.openai_protocol"]], "intel_extension_for_transformers.neural_chat.tools.rome.repr_tools": [[23, "module-intel_extension_for_transformers.neural_chat.tools.rome.repr_tools"]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook": [[24, "module-intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook"]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats": [[25, "module-intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats"]], "intel_extension_for_transformers.tools.utils": [[26, "module-intel_extension_for_transformers.tools.utils"]], "intel_extension_for_transformers.transformers.benchmark": [[27, "module-intel_extension_for_transformers.transformers.benchmark"]], "intel_extension_for_transformers.transformers.config": [[28, "module-intel_extension_for_transformers.transformers.config"]], "intel_extension_for_transformers.transformers.dynamic": [[31, "module-intel_extension_for_transformers.transformers.dynamic"]], "intel_extension_for_transformers.transformers.dynamic.drop_and_restore_utils": [[29, "module-intel_extension_for_transformers.transformers.dynamic.drop_and_restore_utils"]], "intel_extension_for_transformers.transformers.dynamic.evolution": [[30, "module-intel_extension_for_transformers.transformers.dynamic.evolution"]], "intel_extension_for_transformers.transformers.kv_cache_compression.models.modeling_llama": [[32, "module-intel_extension_for_transformers.transformers.kv_cache_compression.models.modeling_llama"]], "intel_extension_for_transformers.transformers.modeling": [[34, "module-intel_extension_for_transformers.transformers.modeling"]], "intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode": [[33, "module-intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode"]], "intel_extension_for_transformers.transformers.modeling.model": [[35, "module-intel_extension_for_transformers.transformers.modeling.model"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic": [[36, "module-intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic"]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.bart.modeling_bart": [[37, "module-intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.bart.modeling_bart"]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.llama.pos_shift_llama": [[38, "module-intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.llama.pos_shift_llama"]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mistral.modeling_mistral": [[39, "module-intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mistral.modeling_mistral"]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral.modeling_mixtral": [[40, "module-intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral.modeling_mixtral"]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.phi.modeling_phi": [[41, "module-intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.phi.modeling_phi"]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.swin.modeling_swin": [[42, "module-intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.swin.modeling_swin"]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.streaming_llm": [[43, "module-intel_extension_for_transformers.transformers.modeling.modeling_gaudi.streaming_llm"]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic": [[44, "module-intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic"]], "intel_extension_for_transformers.transformers.pipeline": [[45, "module-intel_extension_for_transformers.transformers.pipeline"]], "intel_extension_for_transformers.transformers.pruner": [[46, "module-intel_extension_for_transformers.transformers.pruner"]], "intel_extension_for_transformers.transformers.pruner.pruning": [[47, "module-intel_extension_for_transformers.transformers.pruner.pruning"]], "intel_extension_for_transformers.transformers.quantization": [[48, "module-intel_extension_for_transformers.transformers.quantization"]], "intel_extension_for_transformers.transformers.runtime": [[245, "module-intel_extension_for_transformers.transformers.runtime"]], "intel_extension_for_transformers.transformers.runtime.compile": [[58, "module-intel_extension_for_transformers.transformers.runtime.compile"]], "intel_extension_for_transformers.transformers.runtime.compile.compile": [[49, "module-intel_extension_for_transformers.transformers.runtime.compile.compile"]], "intel_extension_for_transformers.transformers.runtime.compile.extractors": [[51, "module-intel_extension_for_transformers.transformers.runtime.compile.extractors"]], "intel_extension_for_transformers.transformers.runtime.compile.extractors.extractor": [[50, "module-intel_extension_for_transformers.transformers.runtime.compile.extractors.extractor"]], "intel_extension_for_transformers.transformers.runtime.compile.extractors.onnx_extractor": [[52, "module-intel_extension_for_transformers.transformers.runtime.compile.extractors.onnx_extractor"]], "intel_extension_for_transformers.transformers.runtime.compile.extractors.tf_extractor": [[53, "module-intel_extension_for_transformers.transformers.runtime.compile.extractors.tf_extractor"]], "intel_extension_for_transformers.transformers.runtime.compile.extractors.torch_extractor": [[54, "module-intel_extension_for_transformers.transformers.runtime.compile.extractors.torch_extractor"]], "intel_extension_for_transformers.transformers.runtime.compile.graph": [[56, "module-intel_extension_for_transformers.transformers.runtime.compile.graph"]], "intel_extension_for_transformers.transformers.runtime.compile.graph.graph": [[55, "module-intel_extension_for_transformers.transformers.runtime.compile.graph.graph"]], "intel_extension_for_transformers.transformers.runtime.compile.graph_utils": [[57, "module-intel_extension_for_transformers.transformers.runtime.compile.graph_utils"]], "intel_extension_for_transformers.transformers.runtime.compile.loaders": [[59, "module-intel_extension_for_transformers.transformers.runtime.compile.loaders"]], "intel_extension_for_transformers.transformers.runtime.compile.loaders.loader": [[60, "module-intel_extension_for_transformers.transformers.runtime.compile.loaders.loader"]], "intel_extension_for_transformers.transformers.runtime.compile.logger": [[61, "module-intel_extension_for_transformers.transformers.runtime.compile.logger"]], "intel_extension_for_transformers.transformers.runtime.compile.onnx_utils": [[62, "module-intel_extension_for_transformers.transformers.runtime.compile.onnx_utils"]], "intel_extension_for_transformers.transformers.runtime.compile.ops": [[83, "module-intel_extension_for_transformers.transformers.runtime.compile.ops"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.all": [[63, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.all"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.assert": [[64, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.assert"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.baddbmm": [[65, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.baddbmm"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul": [[66, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul_v2": [[67, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul_v2"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.bias_add": [[68, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.bias_add"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.cast": [[69, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.cast"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.concat": [[70, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.concat"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.conv": [[71, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.conv"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.cos": [[72, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.cos"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops": [[73, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.expand_dims": [[74, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.expand_dims"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_batch_matmul_v2": [[75, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.fused_batch_matmul_v2"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_batch_norm_v3": [[76, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.fused_batch_norm_v3"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_gemm": [[77, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.fused_gemm"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_matmul": [[78, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.fused_matmul"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.gather": [[79, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.gather"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.gather_elements": [[80, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.gather_elements"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.gelu": [[81, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.gelu"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.gemm": [[82, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.gemm"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_get_next": [[84, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_get_next"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_v2": [[85, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_v2"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.layer_normalization": [[86, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.layer_normalization"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.log_softmax": [[87, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.log_softmax"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.map_and_batch_dataset": [[88, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.map_and_batch_dataset"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.matmul": [[89, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.matmul"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.mean": [[90, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.mean"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.mkl_layer_norm": [[91, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.mkl_layer_norm"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.model_dataset": [[92, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.model_dataset"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.one_hot": [[93, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.one_hot"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.onnx_input": [[94, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.onnx_input"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.op": [[95, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.op"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.optimize_dataset": [[96, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.optimize_dataset"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.pack": [[97, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.pack"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.padding_sequence": [[98, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.padding_sequence"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.placeholder": [[99, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.placeholder"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.pos_embed": [[100, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.pos_embed"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.pow": [[101, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.pow"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_linear": [[102, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_linear"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_v2": [[103, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_v2"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.quantized_fused_matmul_and_dequantize": [[104, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.quantized_fused_matmul_and_dequantize"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.quantized_matmul_with_bias_and_dequantize": [[105, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.quantized_matmul_with_bias_and_dequantize"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_mean": [[106, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_mean"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_sum": [[107, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_sum"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.reorder": [[108, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.reorder"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.reshape": [[109, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.reshape"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.resize": [[110, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.resize"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.rsub": [[111, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.rsub"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.scatter_elements": [[112, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.scatter_elements"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.shape": [[113, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.shape"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.sin": [[114, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.sin"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.size": [[115, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.size"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.slice_position_ids": [[116, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.slice_position_ids"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.softmax": [[117, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.softmax"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.split": [[118, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.split"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.squeeze": [[119, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.squeeze"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.strided_slice": [[120, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.strided_slice"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.tensor": [[121, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.tensor"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.top_k": [[122, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.top_k"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.transpose": [[123, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.transpose"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.unpack": [[124, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.unpack"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.unsqueeze": [[125, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.unsqueeze"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.view": [[126, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.view"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.where": [[127, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.where"]], "intel_extension_for_transformers.transformers.runtime.compile.optimizer": [[128, "module-intel_extension_for_transformers.transformers.runtime.compile.optimizer"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph": [[150, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.InnerproductReshapeFusion": [[129, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.InnerproductReshapeFusion"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.add_cls_token": [[130, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.add_cls_token"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.add_embeddings": [[131, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.add_embeddings"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.arangewithreciprocal": [[132, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.arangewithreciprocal"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_AttentionMaskAddReshape": [[133, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_AttentionMaskAddReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_ConstantOfShapeWithMul": [[134, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_ConstantOfShapeWithMul"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_QKVPreReshape": [[135, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_QKVPreReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_QKVReshape": [[136, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_QKVReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_WeightReshapeTo4D": [[137, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_WeightReshapeTo4D"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_mask_length_adaptive_keep_indices": [[138, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_mask_length_adaptive_keep_indices"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_output_layer_norm_length_adaptive_keep_indices": [[139, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_output_layer_norm_length_adaptive_keep_indices"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_reshape": [[140, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_reshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.cast_to": [[141, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.cast_to"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.collect_quant_info": [[142, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.collect_quant_info"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.conv_reshape": [[143, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.conv_reshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.decoder_attn_reshape": [[144, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.decoder_attn_reshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.einsumwitharange": [[145, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.einsumwitharange"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.embeddingbag": [[146, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.embeddingbag"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.embeddings_to_2d_before_inner_product": [[147, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.embeddings_to_2d_before_inner_product"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.gelu": [[148, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.gelu"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.generate_sequence": [[149, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.generate_sequence"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithbiasgelu": [[151, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithbiasgelu"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithslice": [[152, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithslice"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithswish": [[153, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithswish"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.input_data": [[154, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.input_data"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.input_file": [[155, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.input_file"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.insert_bf16_node": [[156, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.insert_bf16_node"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.insert_quant_node": [[157, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.insert_quant_node"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.int8_bf16_mixed_precision_checker": [[158, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.int8_bf16_mixed_precision_checker"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.interact_features": [[159, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.interact_features"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.last_layer_shape": [[160, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.last_layer_shape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm": [[161, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm_with_reduce_mean": [[162, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm_with_reduce_mean"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm_with_transpose": [[163, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm_with_transpose"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_embeding": [[164, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_embeding"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_matmulwithtranspose": [[165, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_matmulwithtranspose"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_postprocess": [[166, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_postprocess"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_rotary_pos_emb": [[167, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_rotary_pos_emb"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.lower_all_tuples": [[168, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.lower_all_tuples"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias": [[169, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_add": [[170, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_add"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_gelu": [[171, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_gelu"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_relu": [[172, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_relu"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_sigmoid": [[173, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_sigmoid"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_tanh": [[174, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_tanh"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_unsqueeze": [[175, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_unsqueeze"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_transpose": [[176, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_transpose"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_transpose_scale_add": [[177, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_transpose_scale_add"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.merged_embeddingbag": [[178, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.merged_embeddingbag"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.neox_reorder_change": [[179, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.neox_reorder_change"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.neox_rotary_pos_emb": [[180, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.neox_rotary_pos_emb"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.operator_adaptor": [[181, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.operator_adaptor"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.output_data": [[182, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.output_data"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.padding_sequence": [[183, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.padding_sequence"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.pattern": [[184, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.pattern"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.position_embeddings": [[185, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.position_embeddings"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.position_embeddings_v1": [[186, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.position_embeddings_v1"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.qkv_merge": [[187, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.qkv_merge"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.qkv_reshape": [[188, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.qkv_reshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quant_gather_to_bf16": [[189, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quant_gather_to_bf16"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quantize_fusion": [[190, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quantize_fusion"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quantized_graph_dtype_refactor": [[191, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quantized_graph_dtype_refactor"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_constant_op": [[192, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_constant_op"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_last_view": [[193, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_last_view"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_range": [[194, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_range"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_unused_operator": [[195, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_unused_operator"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_zeros": [[196, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_zeros"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.removeslice": [[197, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.removeslice"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_after_restore_hidden_states": [[198, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_after_restore_hidden_states"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_before_and_after_attention_out_layer_norm_gather_elements": [[199, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_before_and_after_attention_out_layer_norm_gather_elements"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_before_restore_hidden_states": [[200, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_before_restore_hidden_states"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_fusion": [[201, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_fusion"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.restore_hidden_states_in_length_adaptive_update_indices": [[202, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.restore_hidden_states_in_length_adaptive_update_indices"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.rms_norm": [[203, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.rms_norm"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.rotary_pos_emb": [[204, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.rotary_pos_emb"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.slicemask": [[205, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.slicemask"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_ExplicitNHWCTranspose": [[206, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_ExplicitNHWCTranspose"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_ExplicitNHWCTransposeQAT": [[207, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_ExplicitNHWCTransposeQAT"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_MHAReshape": [[208, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_MHAReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_QuantizeFusion": [[209, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_QuantizeFusion"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_ReshapeFusion": [[210, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_ReshapeFusion"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_bf16Convert": [[211, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_bf16Convert"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_collectQDQInfo": [[212, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_collectQDQInfo"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_insertQuantNode": [[213, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_insertQuantNode"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.start_end_logits": [[214, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.start_end_logits"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.subgraph_matcher": [[215, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.subgraph_matcher"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncdoer_word_embedding": [[216, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncdoer_word_embedding"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_AttentionMaskAddReshape": [[217, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_AttentionMaskAddReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_AttentionReshape": [[218, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_AttentionReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_KVReshape": [[219, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_KVReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_MulReshape": [[220, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_MulReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_QReshape": [[221, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_QReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_SoftmaxReshape": [[222, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_SoftmaxReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_causal_attention_mask": [[223, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_causal_attention_mask"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.token_type_embeddings": [[224, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.token_type_embeddings"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.token_type_embeddings_v1": [[225, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.token_type_embeddings_v1"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_embedding": [[226, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_embedding"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_ip_insert_bias": [[227, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_ip_insert_bias"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_unpack_baddbmm": [[228, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_unpack_baddbmm"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torchinsertbf16node": [[229, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torchinsertbf16node"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torchpaddingsquence": [[230, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torchpaddingsquence"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_AttentionMaskAddReshape": [[231, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_AttentionMaskAddReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_ConstantOfShapeWithMul": [[232, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_ConstantOfShapeWithMul"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_FFNSlice": [[233, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_FFNSlice"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_FFNSlice_1": [[234, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_FFNSlice_1"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_QKVPreReshape": [[235, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_QKVPreReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_QKVReshape": [[236, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_QKVReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_QKVReshape4D": [[237, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_QKVReshape4D"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_encoderHiddenStatesReshape": [[238, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_encoderHiddenStatesReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_getSampleBatch": [[239, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_getSampleBatch"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_sampleSlice": [[240, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_sampleSlice"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transpose_batch_matmul": [[241, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transpose_batch_matmul"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.word_embeddings": [[242, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.word_embeddings"]], "intel_extension_for_transformers.transformers.runtime.compile.tf_utils": [[243, "module-intel_extension_for_transformers.transformers.runtime.compile.tf_utils"]], "intel_extension_for_transformers.transformers.runtime.compile.torch_utils": [[244, "module-intel_extension_for_transformers.transformers.runtime.compile.torch_utils"]], "intel_extension_for_transformers.transformers.trainer": [[246, "module-intel_extension_for_transformers.transformers.trainer"]], "intel_extension_for_transformers.transformers.utils": [[249, "module-intel_extension_for_transformers.transformers.utils"]], "intel_extension_for_transformers.transformers.utils.config": [[247, "module-intel_extension_for_transformers.transformers.utils.config"]], "intel_extension_for_transformers.transformers.utils.get_throughput": [[248, "module-intel_extension_for_transformers.transformers.utils.get_throughput"]], "intel_extension_for_transformers.transformers.utils.metrics": [[250, "module-intel_extension_for_transformers.transformers.utils.metrics"]], "intel_extension_for_transformers.transformers.utils.objectives": [[251, "module-intel_extension_for_transformers.transformers.utils.objectives"]], "intel_extension_for_transformers.transformers.utils.utility": [[252, "module-intel_extension_for_transformers.transformers.utils.utility"]], "jit_binaryop_injector.hpp": [[400, "jit-binaryop-injector-hpp"]], "jit_eltwise_injector.hpp": [[401, "jit-eltwise-injector-hpp"]], "layernorm_ba": [[413, "layernorm-ba"]], "layernormalized sparse matmul": [[406, "layernormalized-sparse-matmul"]], "main_eval_only": [[253, "module-main_eval_only"]], "main_parse_and_eval": [[254, "module-main_parse_and_eval"]], "matmul_avx512f_p2031_p2013": [[413, "matmul-avx512f-p2031-p2013"]], "matmul_vnni_noperm_p2031_p1302": [[413, "matmul-vnni-noperm-p2031-p1302"]], "meta-llama/Llama-2-7b-hf": [[349, "meta-llama-llama-2-7b-hf"]], "microsoft/git-base": [[348, "microsoft-git-base"]], "models.backbone": [[255, "module-models.backbone"]], "models.detr": [[256, "module-models.detr"]], "models.detr_multi": [[257, "module-models.detr_multi"]], "models.matcher": [[258, "module-models.matcher"]], "models.position_encoding": [[259, "module-models.position_encoding"]], "models.segmentation": [[260, "module-models.segmentation"]], "models.transformer": [[261, "module-models.transformer"]], "mpt architecture": [[346, "mpt-architecture"]], "one-stage jit-path": [[405, "one-stage-jit-path"]], "operator_desc.hpp": [[400, "operator-desc-hpp"], [401, "operator-desc-hpp"]], "param_type.hpp": [[400, "param-type-hpp"]], "param_types.hpp": [[401, "param-types-hpp"]], "platform configuration": [[397, "platform-configuration"]], "prerequisite": [[361, "prerequisite"], [362, "prerequisite"]], "problem description": [[405, "problem-description"]], "references": [[432, "references"]], "softmax": [[413, "softmax"]], "sparse_matmul": [[413, "sparse-matmul"]], "sparse_matmul kernel:": [[398, "sparse-matmul-kernel"]], "spmm_amx_bf16_x16": [[413, "spmm-amx-bf16-x16"]], "spmm_avx512f": [[413, "spmm-avx512f"]], "spmm_vnni": [[413, "spmm-vnni"]], "text": [[262, "module-text"]], "transpose_matmul": [[413, "transpose-matmul"]], "two-stage jit-path": [[405, "two-stage-jit-path"]], "usage": [[303, "usage"]], "util.box_ops": [[263, "module-util.box_ops"]], "util.misc": [[264, "module-util.misc"]], "util.plot_utils": [[265, "module-util.plot_utils"]], "util.postprocess": [[266, "module-util.postprocess"]], "utils.data_utils": [[267, "module-utils.data_utils"]], "utils.eval_utils": [[268, "module-utils.eval_utils"]], "vllm serving for NeuralChat": [[367, "vllm-serving-for-neuralchat"]], "\ud83c\udf99\ufe0f Talking Bot": [[378, "talking-bot"]], "\ud83c\udfe0Introduction": [[371, "introduction"]], "\ud83d\udcf8 Project Screenshots": [[341, "project-screenshots"], [378, "project-screenshots"], [378, "id1"], [378, "id2"], [378, "id3"], [379, "project-screenshots"], [381, "project-screenshots"], [382, "project-screenshots"]], "\ud83d\udd21 TextBot": [[378, "textbot"]], "\ud83d\udd27Install dependencies": [[371, "install-dependencies"]], "\ud83d\ude0e What can this help with?": [[370, "what-can-this-help-with"]], "\ud83d\ude4c SideBySide": [[378, "sidebyside"]], "\ud83d\ude80 Check configuration": [[345, "check-configuration"], [383, "check-configuration"], [384, "check-configuration"]], "\ud83d\ude80 Create a new space on Huggingface": [[345, "create-a-new-space-on-huggingface"], [383, "create-a-new-space-on-huggingface"], [384, "create-a-new-space-on-huggingface"]], "\ud83d\ude80 Setup application": [[345, "setup-application"], [383, "setup-application"], [384, "setup-application"]], "\ud83d\ude80 What is caching plugin?": [[370, "what-is-caching-plugin"]], "\ud83d\ude80Usage": [[371, "usage"]], "\ud83d\ude97Parameters": [[371, "parameters"]], "\ud83e\udd14 How does it work?": [[370, "how-does-it-work"]], "\ud83e\udd16 AI Talking Photo": [[378, "ai-talking-photo"]]}, "docnames": ["autoapi/conversation/index", "autoapi/gaudi_spawn/index", "autoapi/intel_extension_for_transformers/langchain/langchain_community/retrievers/child_parent_retriever/index", "autoapi/intel_extension_for_transformers/langchain/langchain_community/vectorstores/chroma/index", "autoapi/intel_extension_for_transformers/neural_chat/chatbot/index", "autoapi/intel_extension_for_transformers/neural_chat/config/index", "autoapi/intel_extension_for_transformers/neural_chat/config_logging/index", "autoapi/intel_extension_for_transformers/neural_chat/errorcode/index", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/index", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/image2image/instructpix2pix_pipeline/index", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/memory/memory/index", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/retrieval/detector/intent_detection/index", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/retrieval/detector/query_explainer/index", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/retrieval/parser/parser/index", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/retrieval/retriever_adapter/index", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/security/safety_checker/index", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/video/face_animation/src/face3d/models/bfm/index", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/video/face_animation/src/face3d/models/networks/index", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/video/face_animation/src/face3d/util/index", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/video/face_animation/src/face3d/util/load_mats/index", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/video/face_animation/src/face3d/util/preprocess/index", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/video/face_animation/src/face3d/util/util/index", "autoapi/intel_extension_for_transformers/neural_chat/server/restful/openai_protocol/index", "autoapi/intel_extension_for_transformers/neural_chat/tools/rome/repr_tools/index", "autoapi/intel_extension_for_transformers/neural_chat/tools/rome/utils/nethook/index", "autoapi/intel_extension_for_transformers/neural_chat/tools/rome/utils/runningstats/index", "autoapi/intel_extension_for_transformers/tools/utils/index", "autoapi/intel_extension_for_transformers/transformers/benchmark/index", "autoapi/intel_extension_for_transformers/transformers/config/index", "autoapi/intel_extension_for_transformers/transformers/dynamic/drop_and_restore_utils/index", "autoapi/intel_extension_for_transformers/transformers/dynamic/evolution/index", "autoapi/intel_extension_for_transformers/transformers/dynamic/index", "autoapi/intel_extension_for_transformers/transformers/kv_cache_compression/models/modeling_llama/index", "autoapi/intel_extension_for_transformers/transformers/modeling/gpt_bigcode/modeling_gpt_bigcode/index", "autoapi/intel_extension_for_transformers/transformers/modeling/index", "autoapi/intel_extension_for_transformers/transformers/modeling/model/index", "autoapi/intel_extension_for_transformers/transformers/modeling/modeling_bert_dynamic/index", "autoapi/intel_extension_for_transformers/transformers/modeling/modeling_gaudi/models/bart/modeling_bart/index", "autoapi/intel_extension_for_transformers/transformers/modeling/modeling_gaudi/models/llama/pos_shift_llama/index", "autoapi/intel_extension_for_transformers/transformers/modeling/modeling_gaudi/models/mistral/modeling_mistral/index", "autoapi/intel_extension_for_transformers/transformers/modeling/modeling_gaudi/models/mixtral/modeling_mixtral/index", "autoapi/intel_extension_for_transformers/transformers/modeling/modeling_gaudi/models/phi/modeling_phi/index", "autoapi/intel_extension_for_transformers/transformers/modeling/modeling_gaudi/models/swin/modeling_swin/index", "autoapi/intel_extension_for_transformers/transformers/modeling/modeling_gaudi/streaming_llm/index", "autoapi/intel_extension_for_transformers/transformers/modeling/modeling_roberta_dynamic/index", "autoapi/intel_extension_for_transformers/transformers/pipeline/index", "autoapi/intel_extension_for_transformers/transformers/pruner/index", "autoapi/intel_extension_for_transformers/transformers/pruner/pruning/index", "autoapi/intel_extension_for_transformers/transformers/quantization/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/compile/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/extractors/extractor/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/extractors/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/extractors/onnx_extractor/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/extractors/tf_extractor/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/extractors/torch_extractor/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/graph/graph/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/graph/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/graph_utils/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/loaders/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/loaders/loader/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/logger/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/onnx_utils/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/all/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/assert/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/baddbmm/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/batch_matmul/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/batch_matmul_v2/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/bias_add/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/cast/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/concat/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/conv/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/cos/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/empty_ops/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/expand_dims/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/fused_batch_matmul_v2/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/fused_batch_norm_v3/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/fused_gemm/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/fused_matmul/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/gather/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/gather_elements/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/gelu/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/gemm/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/iterator_get_next/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/iterator_v2/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/layer_normalization/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/log_softmax/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/map_and_batch_dataset/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/matmul/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/mean/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/mkl_layer_norm/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/model_dataset/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/one_hot/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/onnx_input/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/op/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/optimize_dataset/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/pack/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/padding_sequence/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/placeholder/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/pos_embed/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/pow/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/quantize_linear/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/quantize_v2/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/quantized_fused_matmul_and_dequantize/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/quantized_matmul_with_bias_and_dequantize/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/reduce_mean/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/reduce_sum/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/reorder/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/reshape/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/resize/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/rsub/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/scatter_elements/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/shape/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/sin/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/size/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/slice_position_ids/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/softmax/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/split/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/squeeze/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/strided_slice/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/tensor/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/top_k/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/transpose/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/unpack/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/unsqueeze/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/view/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/where/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/optimizer/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/InnerproductReshapeFusion/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/add_cls_token/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/add_embeddings/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/arangewithreciprocal/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/attentionBlock_AttentionMaskAddReshape/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/attentionBlock_ConstantOfShapeWithMul/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/attentionBlock_QKVPreReshape/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/attentionBlock_QKVReshape/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/attentionBlock_WeightReshapeTo4D/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/attention_mask_length_adaptive_keep_indices/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/attention_output_layer_norm_length_adaptive_keep_indices/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/attention_reshape/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/cast_to/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/collect_quant_info/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/conv_reshape/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/decoder_attn_reshape/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/einsumwitharange/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/embeddingbag/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/embeddings_to_2d_before_inner_product/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/gelu/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/generate_sequence/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/innerproductwithbiasgelu/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/innerproductwithslice/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/innerproductwithswish/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/input_data/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/input_file/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/insert_bf16_node/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/insert_quant_node/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/int8_bf16_mixed_precision_checker/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/interact_features/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/last_layer_shape/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/layer_norm/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/layer_norm_with_reduce_mean/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/layer_norm_with_transpose/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/llama_embeding/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/llama_matmulwithtranspose/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/llama_postprocess/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/llama_rotary_pos_emb/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/lower_all_tuples/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/matmul_with_bias/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/matmul_with_bias_add/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/matmul_with_bias_gelu/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/matmul_with_bias_relu/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/matmul_with_bias_sigmoid/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/matmul_with_bias_tanh/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/matmul_with_bias_unsqueeze/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/matmul_with_transpose/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/matmul_with_transpose_scale_add/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/merged_embeddingbag/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/neox_reorder_change/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/neox_rotary_pos_emb/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/operator_adaptor/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/output_data/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/padding_sequence/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/pattern/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/position_embeddings/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/position_embeddings_v1/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/qkv_merge/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/qkv_reshape/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/quant_gather_to_bf16/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/quantize_fusion/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/quantized_graph_dtype_refactor/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/remove_constant_op/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/remove_last_view/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/remove_range/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/remove_unused_operator/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/remove_zeros/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/removeslice/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/reshape_after_restore_hidden_states/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/reshape_before_and_after_attention_out_layer_norm_gather_elements/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/reshape_before_restore_hidden_states/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/reshape_fusion/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/restore_hidden_states_in_length_adaptive_update_indices/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/rms_norm/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/rotary_pos_emb/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/slicemask/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/stableDiffusion_ExplicitNHWCTranspose/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/stableDiffusion_ExplicitNHWCTransposeQAT/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/stableDiffusion_MHAReshape/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/stableDiffusion_QuantizeFusion/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/stableDiffusion_ReshapeFusion/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/stableDiffusion_bf16Convert/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/stableDiffusion_collectQDQInfo/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/stableDiffusion_insertQuantNode/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/start_end_logits/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/subgraph_matcher/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/textEncdoer_word_embedding/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/textEncoder_AttentionMaskAddReshape/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/textEncoder_AttentionReshape/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/textEncoder_KVReshape/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/textEncoder_MulReshape/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/textEncoder_QReshape/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/textEncoder_SoftmaxReshape/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/textEncoder_causal_attention_mask/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/token_type_embeddings/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/token_type_embeddings_v1/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/torch_embedding/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/torch_ip_insert_bias/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/torch_unpack_baddbmm/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/torchinsertbf16node/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/torchpaddingsquence/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transformer2Dmodel_AttentionMaskAddReshape/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transformer2Dmodel_ConstantOfShapeWithMul/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transformer2Dmodel_FFNSlice/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transformer2Dmodel_FFNSlice_1/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transformer2Dmodel_QKVPreReshape/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transformer2Dmodel_QKVReshape/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transformer2Dmodel_QKVReshape4D/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transformer2Dmodel_encoderHiddenStatesReshape/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transformer2Dmodel_getSampleBatch/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transformer2Dmodel_sampleSlice/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transpose_batch_matmul/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/word_embeddings/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/tf_utils/index", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/torch_utils/index", "autoapi/intel_extension_for_transformers/transformers/runtime/index", "autoapi/intel_extension_for_transformers/transformers/trainer/index", "autoapi/intel_extension_for_transformers/transformers/utils/config/index", "autoapi/intel_extension_for_transformers/transformers/utils/get_throughput/index", "autoapi/intel_extension_for_transformers/transformers/utils/index", "autoapi/intel_extension_for_transformers/transformers/utils/metrics/index", "autoapi/intel_extension_for_transformers/transformers/utils/objectives/index", "autoapi/intel_extension_for_transformers/transformers/utils/utility/index", "autoapi/main_eval_only/index", "autoapi/main_parse_and_eval/index", "autoapi/models/backbone/index", "autoapi/models/detr/index", "autoapi/models/detr_multi/index", "autoapi/models/matcher/index", "autoapi/models/position_encoding/index", "autoapi/models/segmentation/index", "autoapi/models/transformer/index", "autoapi/text/index", "autoapi/util/box_ops/index", "autoapi/util/misc/index", "autoapi/util/plot_utils/index", "autoapi/util/postprocess/index", "autoapi/utils/data_utils/index", "autoapi/utils/eval_utils/index", "docs/CI_introduction", "docs/README", "docs/SECURITY", "docs/Welcome", "docs/api_doc/api", "docs/api_doc/engine/api_py_engine", "docs/api_doc/engine/compile", "docs/api_doc/engine/graph", "docs/api_doc/engine_api", "docs/api_doc/kernel/engine", "docs/api_doc/kernel/interface", "docs/api_doc/kernel/operator_desc", "docs/api_doc/kernel/types", "docs/api_doc/kernel_api", "docs/api_doc/optimization/config", "docs/api_doc/optimization/model", "docs/api_doc/optimization/trainer", "docs/api_doc/user_api", "docs/architecture", "docs/autoround_comparative_analysis", "docs/benchmark", "docs/build_docs/source/example", "docs/build_docs/source/feature", "docs/build_docs/source/index", "docs/build_docs/source/kernel", "docs/build_docs/source/kernel_desc", "docs/build_docs/source/kernel_perf", "docs/build_docs/source/neural_engine", "docs/build_docs/source/user_guide", "docs/code_of_conduct", "docs/component_owner", "docs/contributions", "docs/contributors", "docs/devcatalog", "docs/distillation", "docs/examples", "docs/export", "docs/get_started", "docs/h2o", "docs/installation", "docs/intel_extension_for_transformers/neural_chat/README", "docs/intel_extension_for_transformers/neural_chat/assets/docs/sample", "docs/intel_extension_for_transformers/neural_chat/cli/README", "docs/intel_extension_for_transformers/neural_chat/docker/README", "docs/intel_extension_for_transformers/neural_chat/docker/code_generation/README", "docs/intel_extension_for_transformers/neural_chat/docker/finetuning/README", "docs/intel_extension_for_transformers/neural_chat/docker/inference/README", "docs/intel_extension_for_transformers/neural_chat/docker/text_generation/README", "docs/intel_extension_for_transformers/neural_chat/docker/tgi_serving/README", "docs/intel_extension_for_transformers/neural_chat/docker/vllm_serving/README", "docs/intel_extension_for_transformers/neural_chat/docs/advanced_features", "docs/intel_extension_for_transformers/neural_chat/docs/full_notebooks", "docs/intel_extension_for_transformers/neural_chat/docs/neuralchat_api", "docs/intel_extension_for_transformers/neural_chat/docs/notebooks/workshop/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/assisted_generation/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/chatgpt_rag/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/codegen/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/codegen/backend/gaudi/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/codegen/backend/pc/gguf/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/codegen/backend/pc/gptq/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/codegen/backend/pc/woq/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/codegen/backend/xeon/deepspeed/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/codegen/backend/xeon/ipex/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/codegen/backend/xeon/tpp/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/photo_ai/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/photo_ai/backend/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/photo_ai/frontend/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/plugin/audio/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/plugin/image2image/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/rag/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/talkingbot/server/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/talkingbot/server/backend/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/talkingbot/server/frontend/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/textbot/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/textbot/backend/xeon/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/textbot/backend_with_cache/README", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/textbot/frontend/README", "docs/intel_extension_for_transformers/neural_chat/examples/finetuning/dpo_pipeline/README", "docs/intel_extension_for_transformers/neural_chat/examples/finetuning/finetune_neuralchat_v3/README", "docs/intel_extension_for_transformers/neural_chat/examples/finetuning/image_to_text/README", "docs/intel_extension_for_transformers/neural_chat/examples/finetuning/instruction/README", "docs/intel_extension_for_transformers/neural_chat/examples/finetuning/multi_modal/README", "docs/intel_extension_for_transformers/neural_chat/examples/finetuning/multi_modal/eval/mmmu_eval/README", "docs/intel_extension_for_transformers/neural_chat/examples/finetuning/ppo_pipeline/README", "docs/intel_extension_for_transformers/neural_chat/examples/finetuning/shanghainese_asr_tts/README", "docs/intel_extension_for_transformers/neural_chat/examples/finetuning/text_generation/README", "docs/intel_extension_for_transformers/neural_chat/examples/finetuning/tts/README", "docs/intel_extension_for_transformers/neural_chat/examples/helloworld/README", "docs/intel_extension_for_transformers/neural_chat/examples/langchain_extension/README", "docs/intel_extension_for_transformers/neural_chat/examples/plugins/retrieval/README", "docs/intel_extension_for_transformers/neural_chat/examples/plugins/table_extraction/README", "docs/intel_extension_for_transformers/neural_chat/examples/plugins/video/README", "docs/intel_extension_for_transformers/neural_chat/examples/quick_start/chatbot/README", "docs/intel_extension_for_transformers/neural_chat/examples/quick_start/rag/README", "docs/intel_extension_for_transformers/neural_chat/examples/serving/TGI/README", "docs/intel_extension_for_transformers/neural_chat/examples/serving/triton_inference_sever/cpu/README", "docs/intel_extension_for_transformers/neural_chat/examples/serving/triton_inference_sever/cuda/README", "docs/intel_extension_for_transformers/neural_chat/examples/serving/triton_inference_sever/hpu/README", "docs/intel_extension_for_transformers/neural_chat/examples/serving/vllm/README", "docs/intel_extension_for_transformers/neural_chat/examples/sql_generation/README", "docs/intel_extension_for_transformers/neural_chat/pipeline/plugins/audio/README", "docs/intel_extension_for_transformers/neural_chat/pipeline/plugins/caching/README", "docs/intel_extension_for_transformers/neural_chat/pipeline/plugins/ner/README", "docs/intel_extension_for_transformers/neural_chat/pipeline/plugins/retrieval/README", "docs/intel_extension_for_transformers/neural_chat/pipeline/plugins/security/README", "docs/intel_extension_for_transformers/neural_chat/pipeline/plugins/video/face_animation/README", "docs/intel_extension_for_transformers/neural_chat/server/README", "docs/intel_extension_for_transformers/neural_chat/tools/embedding_finetune/README", "docs/intel_extension_for_transformers/neural_chat/tools/rome/examples/README", "docs/intel_extension_for_transformers/neural_chat/ui/README", "docs/intel_extension_for_transformers/neural_chat/ui/customized/side_by_side/README", "docs/intel_extension_for_transformers/neural_chat/ui/customized/talking_photo/README", "docs/intel_extension_for_transformers/neural_chat/ui/customized/talkingbot/README", "docs/intel_extension_for_transformers/neural_chat/ui/customized/vision_demo/README", "docs/intel_extension_for_transformers/neural_chat/ui/gradio/basic/README", "docs/intel_extension_for_transformers/neural_chat/ui/gradio/side_by_side/README", "docs/intel_extension_for_transformers/tools/llm_carbon_calc_readme", "docs/intel_extension_for_transformers/transformers/runtime/docs/Installation", "docs/intel_extension_for_transformers/transformers/runtime/docs/add_customized_pattern", "docs/intel_extension_for_transformers/transformers/runtime/docs/deploy_and_integration", "docs/intel_extension_for_transformers/transformers/runtime/docs/engine_profiling", "docs/intel_extension_for_transformers/transformers/runtime/docs/engine_tuning", "docs/intel_extension_for_transformers/transformers/runtime/docs/graph_fusion", "docs/intel_extension_for_transformers/transformers/runtime/docs/onnx_compile", "docs/intel_extension_for_transformers/transformers/runtime/docs/onnx_quantize", "docs/intel_extension_for_transformers/transformers/runtime/docs/operator_register", "docs/intel_extension_for_transformers/transformers/runtime/docs/pattern_recognize", "docs/intel_extension_for_transformers/transformers/runtime/docs/static_compressed_buffer", "docs/intel_extension_for_transformers/transformers/runtime/docs/validated_model", "docs/intel_extension_for_transformers/transformers/runtime/kernels/README", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/3D_inference", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/binaryop_injector", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/eltwise_injector", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/gpu/sparse_gemm_gpu", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/kernel_amx", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/kernel_avx512f", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/kernel_dynamic_quant_matmul", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/kernel_layernormalized_spmm", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/kernel_transpose_matmul", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/kernel_transpose_mha", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/kernel_vnni", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/profiling", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/validated_data", "docs/intel_extension_for_transformers/transformers/runtime/kernels/scripts/README", "docs/intel_extension_for_transformers/transformers/runtime/test/kernels/benchmark/benchmark", "docs/intel_extension_for_transformers/transformers/runtime/test/kernels/benchmark/ci/inputs/README", "docs/legal", "docs/metrics", "docs/objectives", "docs/pipeline", "docs/pruning", "docs/publication", "docs/qbits", "docs/qloracpu", "docs/quantization", "docs/release", "docs/release_data", "docs/reproduce/efficient_LLM_inference_on_cpus", "docs/reproduce/neural_chat_v3-3_workflow", "docs/smoothquant", "docs/streamingllm", "docs/tutorials/README", "docs/user_guide", "docs/weightonlyquant", "example", "feature", "index", "kernel", "kernel_desc", "kernel_perf", "neural_engine", "user_guide"], "envversion": {"sphinx": 61, "sphinx.domains.c": 3, "sphinx.domains.changeset": 1, "sphinx.domains.citation": 1, "sphinx.domains.cpp": 9, "sphinx.domains.index": 1, "sphinx.domains.javascript": 3, "sphinx.domains.math": 2, "sphinx.domains.python": 4, "sphinx.domains.rst": 2, "sphinx.domains.std": 2}, "filenames": ["autoapi/conversation/index.rst", "autoapi/gaudi_spawn/index.rst", "autoapi/intel_extension_for_transformers/langchain/langchain_community/retrievers/child_parent_retriever/index.rst", "autoapi/intel_extension_for_transformers/langchain/langchain_community/vectorstores/chroma/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/chatbot/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/config/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/config_logging/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/errorcode/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/image2image/instructpix2pix_pipeline/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/memory/memory/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/retrieval/detector/intent_detection/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/retrieval/detector/query_explainer/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/retrieval/parser/parser/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/retrieval/retriever_adapter/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/security/safety_checker/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/video/face_animation/src/face3d/models/bfm/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/video/face_animation/src/face3d/models/networks/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/video/face_animation/src/face3d/util/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/video/face_animation/src/face3d/util/load_mats/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/video/face_animation/src/face3d/util/preprocess/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/pipeline/plugins/video/face_animation/src/face3d/util/util/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/server/restful/openai_protocol/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/tools/rome/repr_tools/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/tools/rome/utils/nethook/index.rst", "autoapi/intel_extension_for_transformers/neural_chat/tools/rome/utils/runningstats/index.rst", "autoapi/intel_extension_for_transformers/tools/utils/index.rst", "autoapi/intel_extension_for_transformers/transformers/benchmark/index.rst", "autoapi/intel_extension_for_transformers/transformers/config/index.rst", "autoapi/intel_extension_for_transformers/transformers/dynamic/drop_and_restore_utils/index.rst", "autoapi/intel_extension_for_transformers/transformers/dynamic/evolution/index.rst", "autoapi/intel_extension_for_transformers/transformers/dynamic/index.rst", "autoapi/intel_extension_for_transformers/transformers/kv_cache_compression/models/modeling_llama/index.rst", "autoapi/intel_extension_for_transformers/transformers/modeling/gpt_bigcode/modeling_gpt_bigcode/index.rst", "autoapi/intel_extension_for_transformers/transformers/modeling/index.rst", "autoapi/intel_extension_for_transformers/transformers/modeling/model/index.rst", "autoapi/intel_extension_for_transformers/transformers/modeling/modeling_bert_dynamic/index.rst", "autoapi/intel_extension_for_transformers/transformers/modeling/modeling_gaudi/models/bart/modeling_bart/index.rst", "autoapi/intel_extension_for_transformers/transformers/modeling/modeling_gaudi/models/llama/pos_shift_llama/index.rst", "autoapi/intel_extension_for_transformers/transformers/modeling/modeling_gaudi/models/mistral/modeling_mistral/index.rst", "autoapi/intel_extension_for_transformers/transformers/modeling/modeling_gaudi/models/mixtral/modeling_mixtral/index.rst", "autoapi/intel_extension_for_transformers/transformers/modeling/modeling_gaudi/models/phi/modeling_phi/index.rst", "autoapi/intel_extension_for_transformers/transformers/modeling/modeling_gaudi/models/swin/modeling_swin/index.rst", "autoapi/intel_extension_for_transformers/transformers/modeling/modeling_gaudi/streaming_llm/index.rst", "autoapi/intel_extension_for_transformers/transformers/modeling/modeling_roberta_dynamic/index.rst", "autoapi/intel_extension_for_transformers/transformers/pipeline/index.rst", "autoapi/intel_extension_for_transformers/transformers/pruner/index.rst", "autoapi/intel_extension_for_transformers/transformers/pruner/pruning/index.rst", "autoapi/intel_extension_for_transformers/transformers/quantization/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/compile/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/extractors/extractor/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/extractors/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/extractors/onnx_extractor/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/extractors/tf_extractor/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/extractors/torch_extractor/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/graph/graph/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/graph/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/graph_utils/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/loaders/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/loaders/loader/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/logger/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/onnx_utils/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/all/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/assert/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/baddbmm/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/batch_matmul/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/batch_matmul_v2/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/bias_add/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/cast/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/concat/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/conv/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/cos/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/empty_ops/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/expand_dims/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/fused_batch_matmul_v2/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/fused_batch_norm_v3/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/fused_gemm/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/fused_matmul/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/gather/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/gather_elements/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/gelu/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/gemm/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/iterator_get_next/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/iterator_v2/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/layer_normalization/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/log_softmax/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/map_and_batch_dataset/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/matmul/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/mean/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/mkl_layer_norm/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/model_dataset/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/one_hot/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/onnx_input/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/op/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/optimize_dataset/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/pack/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/padding_sequence/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/placeholder/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/pos_embed/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/pow/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/quantize_linear/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/quantize_v2/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/quantized_fused_matmul_and_dequantize/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/quantized_matmul_with_bias_and_dequantize/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/reduce_mean/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/reduce_sum/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/reorder/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/reshape/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/resize/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/rsub/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/scatter_elements/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/shape/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/sin/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/size/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/slice_position_ids/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/softmax/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/split/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/squeeze/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/strided_slice/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/tensor/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/top_k/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/transpose/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/unpack/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/unsqueeze/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/view/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/ops/where/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/optimizer/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/InnerproductReshapeFusion/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/add_cls_token/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/add_embeddings/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/arangewithreciprocal/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/attentionBlock_AttentionMaskAddReshape/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/attentionBlock_ConstantOfShapeWithMul/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/attentionBlock_QKVPreReshape/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/attentionBlock_QKVReshape/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/attentionBlock_WeightReshapeTo4D/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/attention_mask_length_adaptive_keep_indices/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/attention_output_layer_norm_length_adaptive_keep_indices/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/attention_reshape/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/cast_to/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/collect_quant_info/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/conv_reshape/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/decoder_attn_reshape/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/einsumwitharange/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/embeddingbag/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/embeddings_to_2d_before_inner_product/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/gelu/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/generate_sequence/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/innerproductwithbiasgelu/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/innerproductwithslice/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/innerproductwithswish/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/input_data/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/input_file/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/insert_bf16_node/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/insert_quant_node/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/int8_bf16_mixed_precision_checker/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/interact_features/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/last_layer_shape/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/layer_norm/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/layer_norm_with_reduce_mean/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/layer_norm_with_transpose/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/llama_embeding/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/llama_matmulwithtranspose/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/llama_postprocess/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/llama_rotary_pos_emb/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/lower_all_tuples/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/matmul_with_bias/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/matmul_with_bias_add/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/matmul_with_bias_gelu/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/matmul_with_bias_relu/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/matmul_with_bias_sigmoid/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/matmul_with_bias_tanh/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/matmul_with_bias_unsqueeze/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/matmul_with_transpose/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/matmul_with_transpose_scale_add/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/merged_embeddingbag/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/neox_reorder_change/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/neox_rotary_pos_emb/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/operator_adaptor/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/output_data/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/padding_sequence/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/pattern/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/position_embeddings/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/position_embeddings_v1/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/qkv_merge/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/qkv_reshape/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/quant_gather_to_bf16/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/quantize_fusion/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/quantized_graph_dtype_refactor/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/remove_constant_op/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/remove_last_view/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/remove_range/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/remove_unused_operator/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/remove_zeros/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/removeslice/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/reshape_after_restore_hidden_states/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/reshape_before_and_after_attention_out_layer_norm_gather_elements/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/reshape_before_restore_hidden_states/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/reshape_fusion/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/restore_hidden_states_in_length_adaptive_update_indices/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/rms_norm/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/rotary_pos_emb/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/slicemask/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/stableDiffusion_ExplicitNHWCTranspose/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/stableDiffusion_ExplicitNHWCTransposeQAT/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/stableDiffusion_MHAReshape/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/stableDiffusion_QuantizeFusion/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/stableDiffusion_ReshapeFusion/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/stableDiffusion_bf16Convert/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/stableDiffusion_collectQDQInfo/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/stableDiffusion_insertQuantNode/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/start_end_logits/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/subgraph_matcher/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/textEncdoer_word_embedding/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/textEncoder_AttentionMaskAddReshape/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/textEncoder_AttentionReshape/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/textEncoder_KVReshape/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/textEncoder_MulReshape/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/textEncoder_QReshape/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/textEncoder_SoftmaxReshape/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/textEncoder_causal_attention_mask/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/token_type_embeddings/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/token_type_embeddings_v1/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/torch_embedding/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/torch_ip_insert_bias/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/torch_unpack_baddbmm/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/torchinsertbf16node/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/torchpaddingsquence/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transformer2Dmodel_AttentionMaskAddReshape/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transformer2Dmodel_ConstantOfShapeWithMul/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transformer2Dmodel_FFNSlice/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transformer2Dmodel_FFNSlice_1/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transformer2Dmodel_QKVPreReshape/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transformer2Dmodel_QKVReshape/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transformer2Dmodel_QKVReshape4D/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transformer2Dmodel_encoderHiddenStatesReshape/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transformer2Dmodel_getSampleBatch/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transformer2Dmodel_sampleSlice/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/transpose_batch_matmul/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/sub_graph/word_embeddings/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/tf_utils/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/compile/torch_utils/index.rst", "autoapi/intel_extension_for_transformers/transformers/runtime/index.rst", "autoapi/intel_extension_for_transformers/transformers/trainer/index.rst", "autoapi/intel_extension_for_transformers/transformers/utils/config/index.rst", "autoapi/intel_extension_for_transformers/transformers/utils/get_throughput/index.rst", "autoapi/intel_extension_for_transformers/transformers/utils/index.rst", "autoapi/intel_extension_for_transformers/transformers/utils/metrics/index.rst", "autoapi/intel_extension_for_transformers/transformers/utils/objectives/index.rst", "autoapi/intel_extension_for_transformers/transformers/utils/utility/index.rst", "autoapi/main_eval_only/index.rst", "autoapi/main_parse_and_eval/index.rst", "autoapi/models/backbone/index.rst", "autoapi/models/detr/index.rst", "autoapi/models/detr_multi/index.rst", "autoapi/models/matcher/index.rst", "autoapi/models/position_encoding/index.rst", "autoapi/models/segmentation/index.rst", "autoapi/models/transformer/index.rst", "autoapi/text/index.rst", "autoapi/util/box_ops/index.rst", "autoapi/util/misc/index.rst", "autoapi/util/plot_utils/index.rst", "autoapi/util/postprocess/index.rst", "autoapi/utils/data_utils/index.rst", "autoapi/utils/eval_utils/index.rst", "docs/CI_introduction.md", "docs/README.md", "docs/SECURITY.md", "docs/Welcome.md", "docs/api_doc/api.rst", "docs/api_doc/engine/api_py_engine.rst", "docs/api_doc/engine/compile.rst", "docs/api_doc/engine/graph.rst", "docs/api_doc/engine_api.rst", "docs/api_doc/kernel/engine.rst", "docs/api_doc/kernel/interface.rst", "docs/api_doc/kernel/operator_desc.rst", "docs/api_doc/kernel/types.rst", "docs/api_doc/kernel_api.rst", "docs/api_doc/optimization/config.rst", "docs/api_doc/optimization/model.rst", "docs/api_doc/optimization/trainer.rst", "docs/api_doc/user_api.rst", "docs/architecture.md", "docs/autoround_comparative_analysis.md", "docs/benchmark.md", "docs/build_docs/source/example.rst", "docs/build_docs/source/feature.rst", "docs/build_docs/source/index.rst", "docs/build_docs/source/kernel.rst", "docs/build_docs/source/kernel_desc.rst", "docs/build_docs/source/kernel_perf.rst", "docs/build_docs/source/neural_engine.rst", "docs/build_docs/source/user_guide.rst", "docs/code_of_conduct.md", "docs/component_owner.md", "docs/contributions.md", "docs/contributors.md", "docs/devcatalog.md", "docs/distillation.md", "docs/examples.md", "docs/export.md", "docs/get_started.md", "docs/h2o.md", "docs/installation.md", "docs/intel_extension_for_transformers/neural_chat/README.md", "docs/intel_extension_for_transformers/neural_chat/assets/docs/sample.md", "docs/intel_extension_for_transformers/neural_chat/cli/README.md", "docs/intel_extension_for_transformers/neural_chat/docker/README.md", "docs/intel_extension_for_transformers/neural_chat/docker/code_generation/README.md", "docs/intel_extension_for_transformers/neural_chat/docker/finetuning/README.md", "docs/intel_extension_for_transformers/neural_chat/docker/inference/README.md", "docs/intel_extension_for_transformers/neural_chat/docker/text_generation/README.md", "docs/intel_extension_for_transformers/neural_chat/docker/tgi_serving/README.md", "docs/intel_extension_for_transformers/neural_chat/docker/vllm_serving/README.md", "docs/intel_extension_for_transformers/neural_chat/docs/advanced_features.md", "docs/intel_extension_for_transformers/neural_chat/docs/full_notebooks.md", "docs/intel_extension_for_transformers/neural_chat/docs/neuralchat_api.md", "docs/intel_extension_for_transformers/neural_chat/docs/notebooks/workshop/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/assisted_generation/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/chatgpt_rag/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/codegen/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/codegen/backend/gaudi/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/codegen/backend/pc/gguf/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/codegen/backend/pc/gptq/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/codegen/backend/pc/woq/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/codegen/backend/xeon/deepspeed/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/codegen/backend/xeon/ipex/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/codegen/backend/xeon/tpp/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/photo_ai/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/photo_ai/backend/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/photo_ai/frontend/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/plugin/audio/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/plugin/image2image/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/rag/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/talkingbot/server/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/talkingbot/server/backend/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/talkingbot/server/frontend/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/textbot/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/textbot/backend/xeon/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/textbot/backend_with_cache/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/deployment/textbot/frontend/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/finetuning/dpo_pipeline/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/finetuning/finetune_neuralchat_v3/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/finetuning/image_to_text/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/finetuning/instruction/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/finetuning/multi_modal/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/finetuning/multi_modal/eval/mmmu_eval/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/finetuning/ppo_pipeline/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/finetuning/shanghainese_asr_tts/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/finetuning/text_generation/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/finetuning/tts/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/helloworld/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/langchain_extension/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/plugins/retrieval/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/plugins/table_extraction/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/plugins/video/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/quick_start/chatbot/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/quick_start/rag/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/serving/TGI/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/serving/triton_inference_sever/cpu/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/serving/triton_inference_sever/cuda/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/serving/triton_inference_sever/hpu/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/serving/vllm/README.md", "docs/intel_extension_for_transformers/neural_chat/examples/sql_generation/README.md", "docs/intel_extension_for_transformers/neural_chat/pipeline/plugins/audio/README.md", "docs/intel_extension_for_transformers/neural_chat/pipeline/plugins/caching/README.md", "docs/intel_extension_for_transformers/neural_chat/pipeline/plugins/ner/README.md", "docs/intel_extension_for_transformers/neural_chat/pipeline/plugins/retrieval/README.md", "docs/intel_extension_for_transformers/neural_chat/pipeline/plugins/security/README.md", "docs/intel_extension_for_transformers/neural_chat/pipeline/plugins/video/face_animation/README.md", "docs/intel_extension_for_transformers/neural_chat/server/README.md", "docs/intel_extension_for_transformers/neural_chat/tools/embedding_finetune/README.md", "docs/intel_extension_for_transformers/neural_chat/tools/rome/examples/README.md", "docs/intel_extension_for_transformers/neural_chat/ui/README.md", "docs/intel_extension_for_transformers/neural_chat/ui/customized/side_by_side/README.md", "docs/intel_extension_for_transformers/neural_chat/ui/customized/talking_photo/README.md", "docs/intel_extension_for_transformers/neural_chat/ui/customized/talkingbot/README.md", "docs/intel_extension_for_transformers/neural_chat/ui/customized/vision_demo/README.md", "docs/intel_extension_for_transformers/neural_chat/ui/gradio/basic/README.md", "docs/intel_extension_for_transformers/neural_chat/ui/gradio/side_by_side/README.md", "docs/intel_extension_for_transformers/tools/llm_carbon_calc_readme.md", "docs/intel_extension_for_transformers/transformers/runtime/docs/Installation.md", "docs/intel_extension_for_transformers/transformers/runtime/docs/add_customized_pattern.md", "docs/intel_extension_for_transformers/transformers/runtime/docs/deploy_and_integration.md", "docs/intel_extension_for_transformers/transformers/runtime/docs/engine_profiling.md", "docs/intel_extension_for_transformers/transformers/runtime/docs/engine_tuning.md", "docs/intel_extension_for_transformers/transformers/runtime/docs/graph_fusion.md", "docs/intel_extension_for_transformers/transformers/runtime/docs/onnx_compile.md", "docs/intel_extension_for_transformers/transformers/runtime/docs/onnx_quantize.md", "docs/intel_extension_for_transformers/transformers/runtime/docs/operator_register.md", "docs/intel_extension_for_transformers/transformers/runtime/docs/pattern_recognize.md", "docs/intel_extension_for_transformers/transformers/runtime/docs/static_compressed_buffer.md", "docs/intel_extension_for_transformers/transformers/runtime/docs/validated_model.md", "docs/intel_extension_for_transformers/transformers/runtime/kernels/README.md", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/3D_inference.md", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/binaryop_injector.md", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/eltwise_injector.md", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/gpu/sparse_gemm_gpu.md", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/kernel_amx.md", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/kernel_avx512f.md", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/kernel_dynamic_quant_matmul.md", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/kernel_layernormalized_spmm.md", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/kernel_transpose_matmul.md", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/kernel_transpose_mha.md", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/kernel_desc/kernel_vnni.md", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/profiling.md", "docs/intel_extension_for_transformers/transformers/runtime/kernels/docs/validated_data.md", "docs/intel_extension_for_transformers/transformers/runtime/kernels/scripts/README.md", "docs/intel_extension_for_transformers/transformers/runtime/test/kernels/benchmark/benchmark.md", "docs/intel_extension_for_transformers/transformers/runtime/test/kernels/benchmark/ci/inputs/README.md", "docs/legal.md", "docs/metrics.md", "docs/objectives.md", "docs/pipeline.md", "docs/pruning.md", "docs/publication.md", "docs/qbits.md", "docs/qloracpu.md", "docs/quantization.md", "docs/release.md", "docs/release_data.md", "docs/reproduce/efficient_LLM_inference_on_cpus.md", "docs/reproduce/neural_chat_v3-3_workflow.md", "docs/smoothquant.md", "docs/streamingllm.md", "docs/tutorials/README.md", "docs/user_guide.md", "docs/weightonlyquant.md", "example.rst", "feature.rst", "index.rst", "kernel.rst", "kernel_desc.rst", "kernel_perf.rst", "neural_engine.rst", "user_guide.rst"], "indexentries": {"accuracy() (in module util.misc)": [[264, "util.misc.accuracy", false]], "add (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Add", false]], "add() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.bincount method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Bincount.add", false]], "add() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.combinedstat method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.CombinedStat.add", false]], "add() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.covariance method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Covariance.add", false]], "add() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.crosscovariance method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.CrossCovariance.add", false]], "add() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.crossiou method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.CrossIoU.add", false]], "add() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.history method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.History.add", false]], "add() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.iou method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.IoU.add", false]], "add() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.mean method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Mean.add", false]], "add() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.normmean method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.NormMean.add", false]], "add() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.quantile method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Quantile.add", false]], "add() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.secondmoment method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.SecondMoment.add", false]], "add() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.stat method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Stat.add", false]], "add() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.topk method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.TopK.add", false]], "add() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.variance method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Variance.add", false]], "add_config_item() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.add_config_item", false]], "add_gene() (intel_extension_for_transformers.transformers.dynamic.evolution.evolution method)": [[30, "intel_extension_for_transformers.transformers.dynamic.evolution.Evolution.add_gene", false]], "addclstoken (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.add_cls_token)": [[130, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.add_cls_token.AddClsToken", false]], "addembeddings (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.add_embeddings)": [[131, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.add_embeddings.AddEmbeddings", false]], "addv2 (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.AddV2", false]], "align_columns() (in module util.postprocess)": [[266, "util.postprocess.align_columns", false]], "align_headers() (in module util.postprocess)": [[266, "util.postprocess.align_headers", false]], "align_img() (in module intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.preprocess)": [[20, "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.preprocess.align_img", false]], "align_rows() (in module util.postprocess)": [[266, "util.postprocess.align_rows", false]], "align_supercells() (in module util.postprocess)": [[266, "util.postprocess.align_supercells", false]], "all (class in intel_extension_for_transformers.transformers.runtime.compile.ops.all)": [[63, "intel_extension_for_transformers.transformers.runtime.compile.ops.all.All", false]], "all_gather() (in module util.misc)": [[264, "util.misc.all_gather", false]], "apierrorcode (class in intel_extension_for_transformers.neural_chat.server.restful.openai_protocol)": [[22, "intel_extension_for_transformers.neural_chat.server.restful.openai_protocol.ApiErrorCode", false]], "append_message() (conversation.conversation method)": [[0, "conversation.Conversation.append_message", false]], "apply_class_thresholds() (in module util.postprocess)": [[266, "util.postprocess.apply_class_thresholds", false]], "apply_rotary_pos_emb() (in module intel_extension_for_transformers.transformers.kv_cache_compression.models.modeling_llama)": [[32, "intel_extension_for_transformers.transformers.kv_cache_compression.models.modeling_llama.apply_rotary_pos_emb", false]], "apply_threshold() (in module util.postprocess)": [[266, "util.postprocess.apply_threshold", false]], "approx_ratio() (in module intel_extension_for_transformers.transformers.dynamic.evolution)": [[30, "intel_extension_for_transformers.transformers.dynamic.evolution.approx_ratio", false]], "arange (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Arange", false]], "arangewithreciprocal (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.arangewithreciprocal)": [[132, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.arangewithreciprocal.ArangewithReciprocal", false]], "assert (class in intel_extension_for_transformers.transformers.runtime.compile.ops.assert)": [[64, "intel_extension_for_transformers.transformers.runtime.compile.ops.assert.Assert", false]], "attentionblock_attentionmaskaddreshape (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionblock_attentionmaskaddreshape)": [[133, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_AttentionMaskAddReshape.AttentionBlock_AttentionMaskAddReshape", false]], "attentionblock_constantofshapewithmul (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionblock_constantofshapewithmul)": [[134, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_ConstantOfShapeWithMul.AttentionBlock_ConstantOfShapeWithMul", false]], "attentionblock_qkvprereshape (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionblock_qkvprereshape)": [[135, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_QKVPreReshape.AttentionBlock_QKVPreReshape", false]], "attentionblock_qkvreshape (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionblock_qkvreshape)": [[136, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_QKVReshape.AttentionBlock_QKVReshape", false]], "attentionblock_weightreshapeto4d (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionblock_weightreshapeto4d)": [[137, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_WeightReshapeTo4D.AttentionBlock_WeightReshapeTo4D", false]], "attentionmasklengthadaptiveexpandindices (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_mask_length_adaptive_keep_indices)": [[138, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_mask_length_adaptive_keep_indices.AttentionMaskLengthAdaptiveExpandIndices", false]], "attentionoutputlayernormlengthadaptiveexpandindices (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_output_layer_norm_length_adaptive_keep_indices)": [[139, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_output_layer_norm_length_adaptive_keep_indices.AttentionOutputLayerNormLengthAdaptiveExpandIndices", false]], "attentionreshape (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_reshape)": [[140, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_reshape.AttentionReshape", false]], "audiolanguageoptions (class in intel_extension_for_transformers.neural_chat.config)": [[5, "intel_extension_for_transformers.neural_chat.config.AudioLanguageOptions", false]], "autocast_init() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.autocast_init", false]], "autoroundconfig (class in intel_extension_for_transformers.transformers.utils.config)": [[247, "intel_extension_for_transformers.transformers.utils.config.AutoRoundConfig", false]], "awqconfig (class in intel_extension_for_transformers.transformers.utils.config)": [[247, "intel_extension_for_transformers.transformers.utils.config.AwqConfig", false]], "backbone (class in models.backbone)": [[255, "models.backbone.Backbone", false]], "backendoptions (class in intel_extension_for_transformers.neural_chat.config)": [[5, "intel_extension_for_transformers.neural_chat.config.BackendOptions", false]], "baddbmm (class in intel_extension_for_transformers.transformers.runtime.compile.ops.baddbmm)": [[65, "intel_extension_for_transformers.transformers.runtime.compile.ops.baddbmm.Baddbmm", false]], "basetrainer (class in intel_extension_for_transformers.transformers.trainer)": [[246, "intel_extension_for_transformers.transformers.trainer.BaseTrainer", false]], "batchmatmul (class in intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul)": [[66, "intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul.BatchMatMul", false]], "batchmatmulv2 (class in intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul_v2)": [[67, "intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul_v2.BatchMatMulV2", false]], "benchmark() (in module intel_extension_for_transformers.transformers.benchmark)": [[27, "intel_extension_for_transformers.transformers.benchmark.benchmark", false]], "benchmark() (intel_extension_for_transformers.transformers.trainer.basetrainer method)": [[246, "intel_extension_for_transformers.transformers.trainer.BaseTrainer.benchmark", false]], "benchmarkconfig (class in intel_extension_for_transformers.transformers.config)": [[28, "intel_extension_for_transformers.transformers.config.BenchmarkConfig", false]], "bertattention (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertAttention", false]], "bertembeddings (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertEmbeddings", false]], "bertencoder (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertEncoder", false]], "bertformaskedlm (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForMaskedLM", false]], "bertformultiplechoice (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForMultipleChoice", false]], "bertfornextsentenceprediction (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForNextSentencePrediction", false]], "bertforpretraining (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForPreTraining", false]], "bertforpretrainingoutput (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForPreTrainingOutput", false]], "bertforquestionanswering (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForQuestionAnswering", false]], "bertforsequenceclassification (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForSequenceClassification", false]], "bertfortokenclassification (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForTokenClassification", false]], "bertintermediate (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertIntermediate", false]], "bertlayer (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertLayer", false]], "bertlmheadmodel (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertLMHeadModel", false]], "bertlmpredictionhead (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertLMPredictionHead", false]], "bertmodel (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertModel", false]], "bertonlymlmhead (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertOnlyMLMHead", false]], "bertonlynsphead (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertOnlyNSPHead", false]], "bertoutput (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertOutput", false]], "bertpooler (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertPooler", false]], "bertpredictionheadtransform (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertPredictionHeadTransform", false]], "bertpretrainedmodel (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertPreTrainedModel", false]], "bertpretrainingheads (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertPreTrainingHeads", false]], "bertselfattention (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertSelfAttention", false]], "bertselfoutput (class in intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertSelfOutput", false]], "bias_to_int32() (in module intel_extension_for_transformers.transformers.runtime.compile.onnx_utils)": [[62, "intel_extension_for_transformers.transformers.runtime.compile.onnx_utils.bias_to_int32", false]], "biasadd (class in intel_extension_for_transformers.transformers.runtime.compile.ops.bias_add)": [[68, "intel_extension_for_transformers.transformers.runtime.compile.ops.bias_add.BiasAdd", false]], "binaryadd (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.BinaryAdd", false]], "bincount (class in intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Bincount", false]], "box_numpy_null() (in module intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.box_numpy_null", false]], "build_chatbot() (in module intel_extension_for_transformers.neural_chat.chatbot)": [[4, "intel_extension_for_transformers.neural_chat.chatbot.build_chatbot", false]], "builtin_eval_func() (intel_extension_for_transformers.transformers.trainer.basetrainer method)": [[246, "intel_extension_for_transformers.transformers.trainer.BaseTrainer.builtin_eval_func", false]], "builtin_eval_func() (intel_extension_for_transformers.transformers.trainer.nlpseq2seqtrainer method)": [[246, "intel_extension_for_transformers.transformers.trainer.NLPSeq2SeqTrainer.builtin_eval_func", false]], "builtin_train_func() (intel_extension_for_transformers.transformers.trainer.basetrainer method)": [[246, "intel_extension_for_transformers.transformers.trainer.BaseTrainer.builtin_train_func", false]], "cache_load_enabled (class in intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.cache_load_enabled", false]], "calculate_ins_level_acc() (in module utils.eval_utils)": [[268, "utils.eval_utils.calculate_ins_level_acc", false]], "cast (class in intel_extension_for_transformers.transformers.runtime.compile.ops.cast)": [[69, "intel_extension_for_transformers.transformers.runtime.compile.ops.cast.Cast", false]], "castto (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.cast_to)": [[141, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.cast_to.CastTo", false]], "change_node_input_tensors() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.change_node_input_tensors", false]], "change_node_output_tensors() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.change_node_output_tensors", false]], "change_num_name() (in module intel_extension_for_transformers.transformers.runtime.compile.onnx_utils)": [[62, "intel_extension_for_transformers.transformers.runtime.compile.onnx_utils.change_num_name", false]], "check_is_number() (in module utils.eval_utils)": [[268, "utils.eval_utils.check_is_number", false]], "check_value() (in module intel_extension_for_transformers.transformers.config)": [[28, "intel_extension_for_transformers.transformers.config.check_value", false]], "childparentretriever (class in intel_extension_for_transformers.langchain.langchain_community.retrievers.child_parent_retriever)": [[2, "intel_extension_for_transformers.langchain.langchain_community.retrievers.child_parent_retriever.ChildParentRetriever", false]], "class_subset() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.fixedrandomsubsetsampler method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.FixedRandomSubsetSampler.class_subset", false]], "collectquantinfo (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.collect_quant_info)": [[142, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.collect_quant_info.CollectQuantInfo", false]], "combinedstat (class in intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.CombinedStat", false]], "compile() (in module intel_extension_for_transformers.transformers.runtime.compile.compile)": [[49, "intel_extension_for_transformers.transformers.runtime.compile.compile.compile", false]], "compute_loss() (intel_extension_for_transformers.transformers.trainer.basetrainer method)": [[246, "intel_extension_for_transformers.transformers.trainer.BaseTrainer.compute_loss", false]], "concat (class in intel_extension_for_transformers.transformers.runtime.compile.ops.concat)": [[70, "intel_extension_for_transformers.transformers.runtime.compile.ops.concat.Concat", false]], "config_file_path (intel_extension_for_transformers.transformers.pruner.pruning.pruning attribute)": [[47, "intel_extension_for_transformers.transformers.pruner.pruning.Pruning.config_file_path", false]], "configure_logging() (in module intel_extension_for_transformers.neural_chat.config_logging)": [[6, "intel_extension_for_transformers.neural_chat.config_logging.configure_logging", false]], "constant (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Constant", false]], "constantofshape (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.ConstantOfShape", false]], "construct() (intel_extension_for_transformers.transformers.runtime.compile.ops.op.operator method)": [[95, "intel_extension_for_transformers.transformers.runtime.compile.ops.op.Operator.construct", false]], "construct_node() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.construct_node", false]], "conv (class in intel_extension_for_transformers.transformers.runtime.compile.ops.conv)": [[71, "intel_extension_for_transformers.transformers.runtime.compile.ops.conv.Conv", false]], "conversation": [[0, "module-conversation", false]], "conversation (class in conversation)": [[0, "conversation.Conversation", false]], "convert_fullwidth_to_halfwidth() (in module intel_extension_for_transformers.neural_chat.pipeline.plugins.security.safety_checker)": [[15, "intel_extension_for_transformers.neural_chat.pipeline.plugins.security.safety_checker.convert_fullwidth_to_halfwidth", false]], "convert_image_to_base64() (conversation.conversation method)": [[0, "conversation.Conversation.convert_image_to_base64", false]], "convex_hull() (intel_extension_for_transformers.transformers.dynamic.evolution.evolution method)": [[30, "intel_extension_for_transformers.transformers.dynamic.evolution.Evolution.convex_hull", false]], "convolution (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Convolution", false]], "convreshape (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.conv_reshape)": [[143, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.conv_reshape.ConvReshape", false]], "cos (class in intel_extension_for_transformers.transformers.runtime.compile.ops.cos)": [[72, "intel_extension_for_transformers.transformers.runtime.compile.ops.cos.Cos", false]], "covariance (class in intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Covariance", false]], "cpu_() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.stat method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Stat.cpu_", false]], "cpu_instance (c macro)": [[278, "c.CPU_INSTANCE", false]], "create_position_ids_from_input_ids() (in module intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.create_position_ids_from_input_ids", false]], "create_position_ids_from_inputs_embeds() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertaembeddings method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaEmbeddings.create_position_ids_from_inputs_embeds", false]], "create_tf_node() (in module intel_extension_for_transformers.transformers.runtime.compile.tf_utils)": [[243, "intel_extension_for_transformers.transformers.runtime.compile.tf_utils.create_tf_node", false]], "crosscovariance (class in intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.CrossCovariance", false]], "crossiou (class in intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.CrossIoU", false]], "crossover() (intel_extension_for_transformers.transformers.dynamic.evolution.evolution method)": [[30, "intel_extension_for_transformers.transformers.dynamic.evolution.Evolution.crossover", false]], "cuda_() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.stat method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Stat.cuda_", false]], "cumsum (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.CumSum", false]], "dataarguments (class in intel_extension_for_transformers.neural_chat.config)": [[5, "intel_extension_for_transformers.neural_chat.config.DataArguments", false]], "debug() (in module intel_extension_for_transformers.transformers.runtime.compile.logger)": [[61, "intel_extension_for_transformers.transformers.runtime.compile.logger.debug", false]], "decoderattnreshape (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.decoder_attn_reshape)": [[144, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.decoder_attn_reshape.DecoderAttnReshape", false]], "del_environ_var() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.del_environ_var", false]], "del_environ_vars() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.del_environ_vars", false]], "dequantize (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Dequantize", false]], "dequantizelinear (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.DequantizeLinear", false]], "dereference() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.fixedsubsetsampler method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.FixedSubsetSampler.dereference", false]], "detr (class in models.detr)": [[256, "models.detr.DETR", false]], "detrmulti (class in models.detr_multi)": [[257, "models.detr_multi.DETRMulti", false]], "deviceoptions (class in intel_extension_for_transformers.neural_chat.config)": [[5, "intel_extension_for_transformers.neural_chat.config.DeviceOptions", false]], "dice_loss() (in module models.segmentation)": [[260, "models.segmentation.dice_loss", false]], "distill() (intel_extension_for_transformers.transformers.trainer.basetrainer method)": [[246, "intel_extension_for_transformers.transformers.trainer.BaseTrainer.distill", false]], "distributed_init() (in module intel_extension_for_transformers.transformers.utils.utility)": [[252, "intel_extension_for_transformers.transformers.utils.utility.distributed_init", false]], "draw_landmarks() (in module intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.util)": [[21, "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.util.draw_landmarks", false]], "dump_tensor() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.dump_tensor", false]], "dynamiclengthconfig (class in intel_extension_for_transformers.transformers.config)": [[28, "intel_extension_for_transformers.transformers.config.DynamicLengthConfig", false]], "dynamicquantconfig (class in intel_extension_for_transformers.transformers.utils.config)": [[247, "intel_extension_for_transformers.transformers.utils.config.DynamicQuantConfig", false]], "einsum (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Einsum", false]], "einsumwitharange (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.einsumwitharange)": [[145, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.einsumwitharange.EinsumwithArange", false]], "embeddingbag (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.EmbeddingBag", false]], "embeddingbag (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.embeddingbag)": [[146, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.embeddingbag.EmbeddingBag", false]], "embeddingsto2dbeforeinnerproduct (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.embeddings_to_2d_before_inner_product)": [[147, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.embeddings_to_2d_before_inner_product.EmbeddingsTo2DBeforeInnerProduct", false]], "enable_sequential_cpu_offload() (intel_extension_for_transformers.neural_chat.pipeline.plugins.image2image.instructpix2pix_pipeline.stablediffusioninstructpix2pixpipeline method)": [[9, "intel_extension_for_transformers.neural_chat.pipeline.plugins.image2image.instructpix2pix_pipeline.StableDiffusionInstructPix2PixPipeline.enable_sequential_cpu_offload", false]], "engine_init() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.engine_init", false]], "environ_info_init() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.environ_info_init", false]], "erf (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Erf", false]], "error() (in module intel_extension_for_transformers.transformers.runtime.compile.logger)": [[61, "intel_extension_for_transformers.transformers.runtime.compile.logger.error", false]], "eval_multi_choice() (in module utils.eval_utils)": [[268, "utils.eval_utils.eval_multi_choice", false]], "eval_open() (in module utils.eval_utils)": [[268, "utils.eval_utils.eval_open", false]], "evaluate() (in module utils.eval_utils)": [[268, "utils.eval_utils.evaluate", false]], "evolution (class in intel_extension_for_transformers.transformers.dynamic.evolution)": [[30, "intel_extension_for_transformers.transformers.dynamic.evolution.Evolution", false]], "expand (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Expand", false]], "expand_gather() (in module intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.expand_gather", false]], "expand_gather() (in module intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.expand_gather", false]], "expanddims (class in intel_extension_for_transformers.transformers.runtime.compile.ops.expand_dims)": [[74, "intel_extension_for_transformers.transformers.runtime.compile.ops.expand_dims.ExpandDims", false]], "expandindices (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.ExpandIndices", false]], "explicitnhwctransposeforconv (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stablediffusion_explicitnhwctranspose)": [[206, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_ExplicitNHWCTranspose.ExplicitNHWCTransposeForConv", false]], "explicitnhwctransposeforconvqat (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stablediffusion_explicitnhwctransposeqat)": [[207, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_ExplicitNHWCTransposeQAT.ExplicitNHWCTransposeForConvQAT", false]], "export_to_bf16_onnx() (intel_extension_for_transformers.transformers.trainer.basetrainer method)": [[246, "intel_extension_for_transformers.transformers.trainer.BaseTrainer.export_to_bf16_onnx", false]], "export_to_fp32_onnx() (intel_extension_for_transformers.transformers.trainer.basetrainer method)": [[246, "intel_extension_for_transformers.transformers.trainer.BaseTrainer.export_to_fp32_onnx", false]], "export_to_int8_onnx() (intel_extension_for_transformers.transformers.trainer.basetrainer method)": [[246, "intel_extension_for_transformers.transformers.trainer.BaseTrainer.export_to_int8_onnx", false]], "export_to_jit() (intel_extension_for_transformers.transformers.trainer.basetrainer method)": [[246, "intel_extension_for_transformers.transformers.trainer.BaseTrainer.export_to_jit", false]], "export_to_onnx() (intel_extension_for_transformers.transformers.trainer.basetrainer method)": [[246, "intel_extension_for_transformers.transformers.trainer.BaseTrainer.export_to_onnx", false]], "extract() (intel_extension_for_transformers.transformers.runtime.compile.ops.onnx_input.onnxinput method)": [[94, "intel_extension_for_transformers.transformers.runtime.compile.ops.onnx_input.ONNXINPUT.extract", false]], "extract() (intel_extension_for_transformers.transformers.runtime.compile.ops.op.operator method)": [[95, "intel_extension_for_transformers.transformers.runtime.compile.ops.op.Operator.extract", false]], "extract_numbers() (in module utils.eval_utils)": [[268, "utils.eval_utils.extract_numbers", false]], "extract_text_from_spans() (in module util.postprocess)": [[266, "util.postprocess.extract_text_from_spans", false]], "extract_text_inside_bbox() (in module util.postprocess)": [[266, "util.postprocess.extract_text_inside_bbox", false]], "extractor (class in intel_extension_for_transformers.transformers.runtime.compile.extractors.extractor)": [[50, "intel_extension_for_transformers.transformers.runtime.compile.extractors.extractor.Extractor", false]], "fatal() (in module intel_extension_for_transformers.transformers.runtime.compile.logger)": [[61, "intel_extension_for_transformers.transformers.runtime.compile.logger.fatal", false]], "feed_forward_chunk() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertlayer method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertLayer.feed_forward_chunk", false]], "feed_forward_chunk() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertalayer method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaLayer.feed_forward_chunk", false]], "fill (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Fill", false]], "finetune_model() (in module intel_extension_for_transformers.neural_chat.chatbot)": [[4, "intel_extension_for_transformers.neural_chat.chatbot.finetune_model", false]], "finetuningarguments (class in intel_extension_for_transformers.neural_chat.config)": [[5, "intel_extension_for_transformers.neural_chat.config.FinetuningArguments", false]], "fixedrandomsubsetsampler (class in intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.FixedRandomSubsetSampler", false]], "fixedsubsetsampler (class in intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.FixedSubsetSampler", false]], "flatmapdataset (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.FlatMapDataset", false]], "flatten (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Flatten", false]], "floor_divide (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Floor_divide", false]], "forward() (intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode.gptbigcodeforcausallm method)": [[33, "intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode.GPTBigCodeForCausalLM.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode.gptbigcodeforsequenceclassification method)": [[33, "intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode.GPTBigCodeForSequenceClassification.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode.gptbigcodefortokenclassification method)": [[33, "intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode.GPTBigCodeForTokenClassification.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertattention method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertAttention.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertembeddings method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertEmbeddings.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertencoder method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertEncoder.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertformaskedlm method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForMaskedLM.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertformultiplechoice method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForMultipleChoice.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertfornextsentenceprediction method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForNextSentencePrediction.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertforpretraining method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForPreTraining.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertforquestionanswering method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForQuestionAnswering.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertforsequenceclassification method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForSequenceClassification.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertfortokenclassification method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForTokenClassification.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertintermediate method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertIntermediate.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertlayer method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertLayer.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertlmheadmodel method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertLMHeadModel.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertlmpredictionhead method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertLMPredictionHead.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertmodel method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertModel.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertonlymlmhead method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertOnlyMLMHead.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertonlynsphead method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertOnlyNSPHead.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertoutput method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertOutput.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertpooler method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertPooler.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertpredictionheadtransform method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertPredictionHeadTransform.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertpretrainingheads method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertPreTrainingHeads.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertselfattention method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertSelfAttention.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertselfoutput method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertSelfOutput.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.bart.modeling_bart.gaudi_bartlearnedpositionalembedding method)": [[37, "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.bart.modeling_bart.gaudi_BartLearnedPositionalEmbedding.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertaattention method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaAttention.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertaclassificationhead method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaClassificationHead.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertaembeddings method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaEmbeddings.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertaencoder method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaEncoder.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertaforcausallm method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForCausalLM.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertaformaskedlm method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForMaskedLM.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertaformultiplechoice method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForMultipleChoice.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertaforquestionanswering method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForQuestionAnswering.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertaforsequenceclassification method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForSequenceClassification.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertafortokenclassification method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForTokenClassification.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertaintermediate method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaIntermediate.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertalayer method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaLayer.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertalmhead method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaLMHead.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertamodel method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaModel.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertaoutput method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaOutput.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertapooler method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaPooler.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertaselfattention method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaSelfAttention.forward", false]], "forward() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertaselfoutput method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaSelfOutput.forward", false]], "forward() (models.detr.detr method)": [[256, "models.detr.DETR.forward", false]], "forward() (models.detr.postprocess method)": [[256, "models.detr.PostProcess.forward", false]], "forward() (models.detr.setcriterion method)": [[256, "models.detr.SetCriterion.forward", false]], "forward() (models.detr_multi.detrmulti method)": [[257, "models.detr_multi.DETRMulti.forward", false]], "forward() (models.detr_multi.postprocess method)": [[257, "models.detr_multi.PostProcess.forward", false]], "forward() (models.detr_multi.setcriterion method)": [[257, "models.detr_multi.SetCriterion.forward", false]], "forward() (models.matcher.hungarianmatcher method)": [[258, "models.matcher.HungarianMatcher.forward", false]], "forward() (models.segmentation.postprocesspanoptic method)": [[260, "models.segmentation.PostProcessPanoptic.forward", false]], "from_pretrained() (intel_extension_for_transformers.transformers.modeling.model.optimizedmodel class method)": [[35, "intel_extension_for_transformers.transformers.modeling.model.OptimizedModel.from_pretrained", false]], "frozenbatchnorm2d (class in models.backbone)": [[255, "models.backbone.FrozenBatchNorm2d", false]], "fusedbatchnormv3 (class in intel_extension_for_transformers.transformers.runtime.compile.ops.fused_batch_norm_v3)": [[76, "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_batch_norm_v3.FusedBatchNormV3", false]], "fusedgemm (class in intel_extension_for_transformers.transformers.runtime.compile.ops.fused_gemm)": [[77, "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_gemm.FusedGemm", false]], "fusedmatmul (class in intel_extension_for_transformers.transformers.runtime.compile.ops.fused_matmul)": [[78, "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_matmul.FusedMatMul", false]], "gather (class in intel_extension_for_transformers.transformers.runtime.compile.ops.gather)": [[79, "intel_extension_for_transformers.transformers.runtime.compile.ops.gather.Gather", false]], "gatherelements (class in intel_extension_for_transformers.transformers.runtime.compile.ops.gather_elements)": [[80, "intel_extension_for_transformers.transformers.runtime.compile.ops.gather_elements.GatherElements", false]], "gatherv2 (class in intel_extension_for_transformers.transformers.runtime.compile.ops.gather)": [[79, "intel_extension_for_transformers.transformers.runtime.compile.ops.gather.GatherV2", false]], "gaudi_bartattention_forward() (in module intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.bart.modeling_bart)": [[37, "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.bart.modeling_bart.gaudi_BartAttention_forward", false]], "gaudi_bartlearnedpositionalembedding (class in intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.bart.modeling_bart)": [[37, "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.bart.modeling_bart.gaudi_BartLearnedPositionalEmbedding", false]], "gaudi_mistral_repeat_kv() (in module intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mistral.modeling_mistral)": [[39, "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mistral.modeling_mistral.gaudi_mistral_repeat_kv", false]], "gaudi_mistral_rmsnorm_forward() (in module intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mistral.modeling_mistral)": [[39, "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mistral.modeling_mistral.gaudi_mistral_rmsnorm_forward", false]], "gaudi_mixtral_attention_forward() (in module intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral.modeling_mixtral)": [[40, "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral.modeling_mixtral.gaudi_mixtral_attention_forward", false]], "gaudi_mixtral_block_sparse_moe_forward() (in module intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral.modeling_mixtral)": [[40, "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral.modeling_mixtral.gaudi_mixtral_block_sparse_moe_forward", false]], "gaudi_mixtral_decoder_layer_forward() (in module intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral.modeling_mixtral)": [[40, "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral.modeling_mixtral.gaudi_mixtral_decoder_layer_forward", false]], "gaudi_mixtral_model_forward() (in module intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral.modeling_mixtral)": [[40, "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral.modeling_mixtral.gaudi_mixtral_model_forward", false]], "gaudi_mixtral_repeat_kv() (in module intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral.modeling_mixtral)": [[40, "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral.modeling_mixtral.gaudi_mixtral_repeat_kv", false]], "gaudi_mixtral_rmsnorm_forward() (in module intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral.modeling_mixtral)": [[40, "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral.modeling_mixtral.gaudi_mixtral_rmsnorm_forward", false]], "gaudi_phi_attention_forward() (in module intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.phi.modeling_phi)": [[41, "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.phi.modeling_phi.gaudi_phi_attention_forward", false]], "gaudi_phi_decoder_layer_forward() (in module intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.phi.modeling_phi)": [[41, "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.phi.modeling_phi.gaudi_phi_decoder_layer_forward", false]], "gaudi_phi_model_forward() (in module intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.phi.modeling_phi)": [[41, "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.phi.modeling_phi.gaudi_phi_model_forward", false]], "gaudi_spawn": [[1, "module-gaudi_spawn", false]], "gaudi_swin_get_attn_mask() (in module intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.swin.modeling_swin)": [[42, "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.swin.modeling_swin.gaudi_swin_get_attn_mask", false]], "gaudimixtralforcausallm (class in intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral.modeling_mixtral)": [[40, "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral.modeling_mixtral.GaudiMixtralForCausalLM", false]], "gelu (class in intel_extension_for_transformers.transformers.runtime.compile.ops.gelu)": [[81, "intel_extension_for_transformers.transformers.runtime.compile.ops.gelu.Gelu", false]], "gelu (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.gelu)": [[148, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.gelu.Gelu", false]], "gemm (class in intel_extension_for_transformers.transformers.runtime.compile.ops.gemm)": [[82, "intel_extension_for_transformers.transformers.runtime.compile.ops.gemm.Gemm", false]], "generalized_box_iou() (in module util.box_ops)": [[263, "util.box_ops.generalized_box_iou", false]], "generate() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.generate", false]], "generatesequence (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.generate_sequence)": [[149, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.generate_sequence.GenerateSequence", false]], "get_autocast_info() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.get_autocast_info", false]], "get_bbox_span_subset() (in module util.postprocess)": [[266, "util.postprocess.get_bbox_span_subset", false]], "get_children() (in module intel_extension_for_transformers.transformers.runtime.compile.onnx_utils)": [[62, "intel_extension_for_transformers.transformers.runtime.compile.onnx_utils.get_children", false]], "get_conv_template() (in module conversation)": [[0, "conversation.get_conv_template", false]], "get_data_dtype() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.get_data_dtype", false]], "get_environ_info() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.get_environ_info", false]], "get_example_inputs() (in module intel_extension_for_transformers.transformers.benchmark)": [[27, "intel_extension_for_transformers.transformers.benchmark.get_example_inputs", false]], "get_export_args() (intel_extension_for_transformers.transformers.trainer.basetrainer method)": [[246, "intel_extension_for_transformers.transformers.trainer.BaseTrainer.get_export_args", false]], "get_initializer_children_names() (in module intel_extension_for_transformers.transformers.runtime.compile.onnx_utils)": [[62, "intel_extension_for_transformers.transformers.runtime.compile.onnx_utils.get_initializer_children_names", false]], "get_input_embeddings() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertmodel method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertModel.get_input_embeddings", false]], "get_input_embeddings() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertamodel method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaModel.get_input_embeddings", false]], "get_logger() (intel_extension_for_transformers.transformers.runtime.compile.logger.logger method)": [[61, "intel_extension_for_transformers.transformers.runtime.compile.logger.Logger.get_logger", false]], "get_model_fwk_name() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.get_model_fwk_name", false]], "get_module() (in module intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook)": [[24, "intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook.get_module", false]], "get_multi_choice_info() (in module utils.data_utils)": [[267, "utils.data_utils.get_multi_choice_info", false]], "get_next_node_names() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.get_next_node_names", false]], "get_node_by_name() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.get_node_by_name", false]], "get_node_children_names() (in module intel_extension_for_transformers.transformers.runtime.compile.onnx_utils)": [[62, "intel_extension_for_transformers.transformers.runtime.compile.onnx_utils.get_node_children_names", false]], "get_node_id() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.get_node_id", false]], "get_output_embeddings() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertformaskedlm method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForMaskedLM.get_output_embeddings", false]], "get_output_embeddings() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertforpretraining method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForPreTraining.get_output_embeddings", false]], "get_output_embeddings() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertlmheadmodel method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertLMHeadModel.get_output_embeddings", false]], "get_output_embeddings() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertaforcausallm method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForCausalLM.get_output_embeddings", false]], "get_output_embeddings() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertaformaskedlm method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForMaskedLM.get_output_embeddings", false]], "get_parameter() (in module intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook)": [[24, "intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook.get_parameter", false]], "get_pre_node_names() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.get_pre_node_names", false]], "get_prompt() (conversation.conversation method)": [[0, "conversation.Conversation.get_prompt", false]], "get_quant_info() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.get_quant_info", false]], "get_reprs_at_idxs() (in module intel_extension_for_transformers.neural_chat.tools.rome.repr_tools)": [[23, "intel_extension_for_transformers.neural_chat.tools.rome.repr_tools.get_reprs_at_idxs", false]], "get_reprs_at_word_tokens() (in module intel_extension_for_transformers.neural_chat.tools.rome.repr_tools)": [[23, "intel_extension_for_transformers.neural_chat.tools.rome.repr_tools.get_reprs_at_word_tokens", false]], "get_sparse_nodes_name() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.get_sparse_nodes_name", false]], "get_sparsity_ratio() (intel_extension_for_transformers.transformers.pruner.pruning.pruning method)": [[47, "intel_extension_for_transformers.transformers.pruner.pruning.Pruning.get_sparsity_ratio", false]], "get_store() (intel_extension_for_transformers.transformers.dynamic.evolution.evolution method)": [[30, "intel_extension_for_transformers.transformers.dynamic.evolution.Evolution.get_store", false]], "get_tensor_dest_op() (in module intel_extension_for_transformers.transformers.runtime.compile.tf_utils)": [[243, "intel_extension_for_transformers.transformers.runtime.compile.tf_utils.get_tensor_dest_op", false]], "get_tensor_idx() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.get_tensor_idx", false]], "get_words_idxs_in_templates() (in module intel_extension_for_transformers.neural_chat.tools.rome.repr_tools)": [[23, "intel_extension_for_transformers.neural_chat.tools.rome.repr_tools.get_words_idxs_in_templates", false]], "gptbigcodeforcausallm (class in intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode)": [[33, "intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode.GPTBigCodeForCausalLM", false]], "gptbigcodeforsequenceclassification (class in intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode)": [[33, "intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode.GPTBigCodeForSequenceClassification", false]], "gptbigcodefortokenclassification (class in intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode)": [[33, "intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode.GPTBigCodeForTokenClassification", false]], "gptbigcodemodel (class in intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode)": [[33, "intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode.GPTBigCodeModel", false]], "gptbigcodepretrainedmodel (class in intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode)": [[33, "intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode.GPTBigCodePreTrainedModel", false]], "gptqconfig (class in intel_extension_for_transformers.transformers.utils.config)": [[247, "intel_extension_for_transformers.transformers.utils.config.GPTQConfig", false]], "graph (class in intel_extension_for_transformers.transformers.runtime.compile.graph.graph)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph", false]], "graph_dispatch() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.graph_dispatch", false]], "graph_init() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.graph_init", false]], "graph_node_names_details() (in module intel_extension_for_transformers.transformers.runtime.compile.onnx_utils)": [[62, "intel_extension_for_transformers.transformers.runtime.compile.onnx_utils.graph_node_names_details", false]], "graph_node_names_details() (in module intel_extension_for_transformers.transformers.runtime.compile.tf_utils)": [[243, "intel_extension_for_transformers.transformers.runtime.compile.tf_utils.graph_node_names_details", false]], "header_supercell_tree() (in module util.postprocess)": [[266, "util.postprocess.header_supercell_tree", false]], "hierarchical_subsequence() (in module intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook)": [[24, "intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook.hierarchical_subsequence", false]], "history (class in intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.History", false]], "hungarianmatcher (class in models.matcher)": [[258, "models.matcher.HungarianMatcher", false]], "identity (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Identity", false]], "infer_framework_load_model() (in module intel_extension_for_transformers.transformers.pipeline)": [[45, "intel_extension_for_transformers.transformers.pipeline.infer_framework_load_model", false]], "infer_task() (intel_extension_for_transformers.transformers.trainer.basetrainer method)": [[246, "intel_extension_for_transformers.transformers.trainer.BaseTrainer.infer_task", false]], "inference() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.inference", false]], "info() (in module intel_extension_for_transformers.transformers.runtime.compile.logger)": [[61, "intel_extension_for_transformers.transformers.runtime.compile.logger.info", false]], "innerproduct (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.InnerProduct", false]], "innerproductreshapefusion (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductreshapefusion)": [[129, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.InnerproductReshapeFusion.InnerproductReshapeFusion", false]], "innerproductwithbiasgelu (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithbiasgelu)": [[151, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithbiasgelu.InnerproductWithBiasGelu", false]], "innerproductwithslice (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithslice)": [[152, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithslice.InnerproductwithSlice", false]], "innerproductwithswish (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithswish)": [[153, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithswish.InnerproductWithSwish", false]], "input (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Input", false]], "inputdata (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.input_data)": [[154, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.input_data.InputData", false]], "inputfile (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.input_file)": [[155, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.input_file.InputFile", false]], "inquire_config_item() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.inquire_config_item", false]], "insert_environ_info() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.insert_environ_info", false]], "insert_nodes() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.insert_nodes", false]], "insert_pattern() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.insert_pattern", false]], "insert_quant_info() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.insert_quant_info", false]], "insertbf16node (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.insert_bf16_node)": [[156, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.insert_bf16_node.InsertBF16Node", false]], "insertquantnode (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.insert_quant_node)": [[157, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.insert_quant_node.InsertQuantNode", false]], "int8bf16mixedprecisionchecker (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.int8_bf16_mixed_precision_checker)": [[158, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.int8_bf16_mixed_precision_checker.Int8BF16MixedPrecisionChecker", false]], "intel_extension_for_transformers.langchain.langchain_community.retrievers.child_parent_retriever": [[2, "module-intel_extension_for_transformers.langchain.langchain_community.retrievers.child_parent_retriever", false]], "intel_extension_for_transformers.langchain.langchain_community.vectorstores.chroma": [[3, "module-intel_extension_for_transformers.langchain.langchain_community.vectorstores.chroma", false]], "intel_extension_for_transformers.neural_chat.chatbot": [[4, "module-intel_extension_for_transformers.neural_chat.chatbot", false]], "intel_extension_for_transformers.neural_chat.config": [[5, "module-intel_extension_for_transformers.neural_chat.config", false]], "intel_extension_for_transformers.neural_chat.config_logging": [[6, "module-intel_extension_for_transformers.neural_chat.config_logging", false]], "intel_extension_for_transformers.neural_chat.errorcode": [[7, "module-intel_extension_for_transformers.neural_chat.errorcode", false]], "intel_extension_for_transformers.neural_chat.pipeline": [[8, "module-intel_extension_for_transformers.neural_chat.pipeline", false]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.image2image.instructpix2pix_pipeline": [[9, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.image2image.instructpix2pix_pipeline", false]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.memory.memory": [[10, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.memory.memory", false]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.detector.intent_detection": [[11, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.detector.intent_detection", false]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.detector.query_explainer": [[12, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.detector.query_explainer", false]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.parser.parser": [[13, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.parser.parser", false]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.retriever_adapter": [[14, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.retriever_adapter", false]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.security.safety_checker": [[15, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.security.safety_checker", false]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.bfm": [[16, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.bfm", false]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks": [[17, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks", false]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util": [[18, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util", false]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.load_mats": [[19, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.load_mats", false]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.preprocess": [[20, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.preprocess", false]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.util": [[21, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.util", false]], "intel_extension_for_transformers.neural_chat.server.restful.openai_protocol": [[22, "module-intel_extension_for_transformers.neural_chat.server.restful.openai_protocol", false]], "intel_extension_for_transformers.neural_chat.tools.rome.repr_tools": [[23, "module-intel_extension_for_transformers.neural_chat.tools.rome.repr_tools", false]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook": [[24, "module-intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook", false]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats": [[25, "module-intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats", false]], "intel_extension_for_transformers.tools.utils": [[26, "module-intel_extension_for_transformers.tools.utils", false]], "intel_extension_for_transformers.transformers.benchmark": [[27, "module-intel_extension_for_transformers.transformers.benchmark", false]], "intel_extension_for_transformers.transformers.config": [[28, "module-intel_extension_for_transformers.transformers.config", false]], "intel_extension_for_transformers.transformers.dynamic": [[31, "module-intel_extension_for_transformers.transformers.dynamic", false]], "intel_extension_for_transformers.transformers.dynamic.drop_and_restore_utils": [[29, "module-intel_extension_for_transformers.transformers.dynamic.drop_and_restore_utils", false]], "intel_extension_for_transformers.transformers.dynamic.evolution": [[30, "module-intel_extension_for_transformers.transformers.dynamic.evolution", false]], "intel_extension_for_transformers.transformers.kv_cache_compression.models.modeling_llama": [[32, "module-intel_extension_for_transformers.transformers.kv_cache_compression.models.modeling_llama", false]], "intel_extension_for_transformers.transformers.modeling": [[34, "module-intel_extension_for_transformers.transformers.modeling", false]], "intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode": [[33, "module-intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode", false]], "intel_extension_for_transformers.transformers.modeling.model": [[35, "module-intel_extension_for_transformers.transformers.modeling.model", false]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic": [[36, "module-intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic", false]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.bart.modeling_bart": [[37, "module-intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.bart.modeling_bart", false]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.llama.pos_shift_llama": [[38, "module-intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.llama.pos_shift_llama", false]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mistral.modeling_mistral": [[39, "module-intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mistral.modeling_mistral", false]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral.modeling_mixtral": [[40, "module-intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral.modeling_mixtral", false]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.phi.modeling_phi": [[41, "module-intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.phi.modeling_phi", false]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.swin.modeling_swin": [[42, "module-intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.swin.modeling_swin", false]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.streaming_llm": [[43, "module-intel_extension_for_transformers.transformers.modeling.modeling_gaudi.streaming_llm", false]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic": [[44, "module-intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic", false]], "intel_extension_for_transformers.transformers.pipeline": [[45, "module-intel_extension_for_transformers.transformers.pipeline", false]], "intel_extension_for_transformers.transformers.pruner": [[46, "module-intel_extension_for_transformers.transformers.pruner", false]], "intel_extension_for_transformers.transformers.pruner.pruning": [[47, "module-intel_extension_for_transformers.transformers.pruner.pruning", false]], "intel_extension_for_transformers.transformers.quantization": [[48, "module-intel_extension_for_transformers.transformers.quantization", false]], "intel_extension_for_transformers.transformers.runtime": [[245, "module-intel_extension_for_transformers.transformers.runtime", false]], "intel_extension_for_transformers.transformers.runtime.compile": [[58, "module-intel_extension_for_transformers.transformers.runtime.compile", false]], "intel_extension_for_transformers.transformers.runtime.compile.compile": [[49, "module-intel_extension_for_transformers.transformers.runtime.compile.compile", false]], "intel_extension_for_transformers.transformers.runtime.compile.extractors": [[51, "module-intel_extension_for_transformers.transformers.runtime.compile.extractors", false]], "intel_extension_for_transformers.transformers.runtime.compile.extractors.extractor": [[50, "module-intel_extension_for_transformers.transformers.runtime.compile.extractors.extractor", false]], "intel_extension_for_transformers.transformers.runtime.compile.extractors.onnx_extractor": [[52, "module-intel_extension_for_transformers.transformers.runtime.compile.extractors.onnx_extractor", false]], "intel_extension_for_transformers.transformers.runtime.compile.extractors.tf_extractor": [[53, "module-intel_extension_for_transformers.transformers.runtime.compile.extractors.tf_extractor", false]], "intel_extension_for_transformers.transformers.runtime.compile.extractors.torch_extractor": [[54, "module-intel_extension_for_transformers.transformers.runtime.compile.extractors.torch_extractor", false]], "intel_extension_for_transformers.transformers.runtime.compile.graph": [[56, "module-intel_extension_for_transformers.transformers.runtime.compile.graph", false]], "intel_extension_for_transformers.transformers.runtime.compile.graph.graph": [[55, "module-intel_extension_for_transformers.transformers.runtime.compile.graph.graph", false]], "intel_extension_for_transformers.transformers.runtime.compile.graph_utils": [[57, "module-intel_extension_for_transformers.transformers.runtime.compile.graph_utils", false]], "intel_extension_for_transformers.transformers.runtime.compile.loaders": [[59, "module-intel_extension_for_transformers.transformers.runtime.compile.loaders", false]], "intel_extension_for_transformers.transformers.runtime.compile.loaders.loader": [[60, "module-intel_extension_for_transformers.transformers.runtime.compile.loaders.loader", false]], "intel_extension_for_transformers.transformers.runtime.compile.logger": [[61, "module-intel_extension_for_transformers.transformers.runtime.compile.logger", false]], "intel_extension_for_transformers.transformers.runtime.compile.onnx_utils": [[62, "module-intel_extension_for_transformers.transformers.runtime.compile.onnx_utils", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops": [[83, "module-intel_extension_for_transformers.transformers.runtime.compile.ops", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.all": [[63, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.all", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.assert": [[64, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.assert", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.baddbmm": [[65, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.baddbmm", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul": [[66, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul_v2": [[67, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul_v2", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.bias_add": [[68, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.bias_add", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.cast": [[69, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.cast", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.concat": [[70, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.concat", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.conv": [[71, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.conv", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.cos": [[72, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.cos", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops": [[73, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.expand_dims": [[74, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.expand_dims", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_batch_matmul_v2": [[75, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.fused_batch_matmul_v2", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_batch_norm_v3": [[76, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.fused_batch_norm_v3", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_gemm": [[77, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.fused_gemm", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_matmul": [[78, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.fused_matmul", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.gather": [[79, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.gather", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.gather_elements": [[80, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.gather_elements", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.gelu": [[81, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.gelu", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.gemm": [[82, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.gemm", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_get_next": [[84, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_get_next", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_v2": [[85, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_v2", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.layer_normalization": [[86, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.layer_normalization", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.log_softmax": [[87, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.log_softmax", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.map_and_batch_dataset": [[88, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.map_and_batch_dataset", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.matmul": [[89, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.matmul", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.mean": [[90, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.mean", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.mkl_layer_norm": [[91, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.mkl_layer_norm", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.model_dataset": [[92, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.model_dataset", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.one_hot": [[93, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.one_hot", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.onnx_input": [[94, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.onnx_input", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.op": [[95, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.op", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.optimize_dataset": [[96, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.optimize_dataset", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.pack": [[97, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.pack", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.padding_sequence": [[98, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.padding_sequence", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.placeholder": [[99, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.placeholder", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.pos_embed": [[100, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.pos_embed", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.pow": [[101, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.pow", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_linear": [[102, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_linear", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_v2": [[103, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_v2", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.quantized_fused_matmul_and_dequantize": [[104, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.quantized_fused_matmul_and_dequantize", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.quantized_matmul_with_bias_and_dequantize": [[105, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.quantized_matmul_with_bias_and_dequantize", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_mean": [[106, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_mean", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_sum": [[107, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_sum", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.reorder": [[108, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.reorder", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.reshape": [[109, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.reshape", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.resize": [[110, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.resize", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.rsub": [[111, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.rsub", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.scatter_elements": [[112, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.scatter_elements", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.shape": [[113, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.shape", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.sin": [[114, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.sin", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.size": [[115, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.size", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.slice_position_ids": [[116, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.slice_position_ids", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.softmax": [[117, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.softmax", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.split": [[118, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.split", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.squeeze": [[119, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.squeeze", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.strided_slice": [[120, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.strided_slice", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.tensor": [[121, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.tensor", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.top_k": [[122, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.top_k", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.transpose": [[123, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.transpose", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.unpack": [[124, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.unpack", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.unsqueeze": [[125, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.unsqueeze", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.view": [[126, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.view", false]], "intel_extension_for_transformers.transformers.runtime.compile.ops.where": [[127, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.where", false]], "intel_extension_for_transformers.transformers.runtime.compile.optimizer": [[128, "module-intel_extension_for_transformers.transformers.runtime.compile.optimizer", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph": [[150, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.add_cls_token": [[130, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.add_cls_token", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.add_embeddings": [[131, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.add_embeddings", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.arangewithreciprocal": [[132, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.arangewithreciprocal", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_mask_length_adaptive_keep_indices": [[138, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_mask_length_adaptive_keep_indices", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_output_layer_norm_length_adaptive_keep_indices": [[139, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_output_layer_norm_length_adaptive_keep_indices", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_reshape": [[140, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_reshape", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionblock_attentionmaskaddreshape": [[133, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_AttentionMaskAddReshape", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionblock_constantofshapewithmul": [[134, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_ConstantOfShapeWithMul", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionblock_qkvprereshape": [[135, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_QKVPreReshape", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionblock_qkvreshape": [[136, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_QKVReshape", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionblock_weightreshapeto4d": [[137, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_WeightReshapeTo4D", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.cast_to": [[141, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.cast_to", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.collect_quant_info": [[142, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.collect_quant_info", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.conv_reshape": [[143, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.conv_reshape", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.decoder_attn_reshape": [[144, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.decoder_attn_reshape", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.einsumwitharange": [[145, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.einsumwitharange", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.embeddingbag": [[146, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.embeddingbag", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.embeddings_to_2d_before_inner_product": [[147, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.embeddings_to_2d_before_inner_product", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.gelu": [[148, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.gelu", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.generate_sequence": [[149, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.generate_sequence", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductreshapefusion": [[129, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.InnerproductReshapeFusion", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithbiasgelu": [[151, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithbiasgelu", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithslice": [[152, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithslice", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithswish": [[153, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithswish", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.input_data": [[154, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.input_data", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.input_file": [[155, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.input_file", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.insert_bf16_node": [[156, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.insert_bf16_node", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.insert_quant_node": [[157, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.insert_quant_node", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.int8_bf16_mixed_precision_checker": [[158, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.int8_bf16_mixed_precision_checker", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.interact_features": [[159, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.interact_features", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.last_layer_shape": [[160, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.last_layer_shape", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm": [[161, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm_with_reduce_mean": [[162, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm_with_reduce_mean", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm_with_transpose": [[163, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm_with_transpose", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_embeding": [[164, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_embeding", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_matmulwithtranspose": [[165, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_matmulwithtranspose", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_postprocess": [[166, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_postprocess", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_rotary_pos_emb": [[167, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_rotary_pos_emb", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.lower_all_tuples": [[168, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.lower_all_tuples", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias": [[169, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_add": [[170, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_add", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_gelu": [[171, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_gelu", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_relu": [[172, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_relu", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_sigmoid": [[173, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_sigmoid", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_tanh": [[174, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_tanh", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_unsqueeze": [[175, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_unsqueeze", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_transpose": [[176, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_transpose", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_transpose_scale_add": [[177, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_transpose_scale_add", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.merged_embeddingbag": [[178, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.merged_embeddingbag", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.neox_reorder_change": [[179, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.neox_reorder_change", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.neox_rotary_pos_emb": [[180, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.neox_rotary_pos_emb", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.operator_adaptor": [[181, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.operator_adaptor", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.output_data": [[182, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.output_data", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.padding_sequence": [[183, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.padding_sequence", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.pattern": [[184, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.pattern", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.position_embeddings": [[185, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.position_embeddings", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.position_embeddings_v1": [[186, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.position_embeddings_v1", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.qkv_merge": [[187, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.qkv_merge", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.qkv_reshape": [[188, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.qkv_reshape", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quant_gather_to_bf16": [[189, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quant_gather_to_bf16", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quantize_fusion": [[190, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quantize_fusion", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quantized_graph_dtype_refactor": [[191, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quantized_graph_dtype_refactor", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_constant_op": [[192, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_constant_op", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_last_view": [[193, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_last_view", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_range": [[194, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_range", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_unused_operator": [[195, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_unused_operator", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_zeros": [[196, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_zeros", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.removeslice": [[197, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.removeslice", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_after_restore_hidden_states": [[198, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_after_restore_hidden_states", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_before_and_after_attention_out_layer_norm_gather_elements": [[199, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_before_and_after_attention_out_layer_norm_gather_elements", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_before_restore_hidden_states": [[200, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_before_restore_hidden_states", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_fusion": [[201, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_fusion", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.restore_hidden_states_in_length_adaptive_update_indices": [[202, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.restore_hidden_states_in_length_adaptive_update_indices", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.rms_norm": [[203, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.rms_norm", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.rotary_pos_emb": [[204, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.rotary_pos_emb", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.slicemask": [[205, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.slicemask", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stablediffusion_bf16convert": [[211, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_bf16Convert", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stablediffusion_collectqdqinfo": [[212, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_collectQDQInfo", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stablediffusion_explicitnhwctranspose": [[206, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_ExplicitNHWCTranspose", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stablediffusion_explicitnhwctransposeqat": [[207, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_ExplicitNHWCTransposeQAT", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stablediffusion_insertquantnode": [[213, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_insertQuantNode", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stablediffusion_mhareshape": [[208, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_MHAReshape", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stablediffusion_quantizefusion": [[209, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_QuantizeFusion", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stablediffusion_reshapefusion": [[210, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_ReshapeFusion", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.start_end_logits": [[214, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.start_end_logits", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.subgraph_matcher": [[215, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.subgraph_matcher", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textencdoer_word_embedding": [[216, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncdoer_word_embedding", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textencoder_attentionmaskaddreshape": [[217, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_AttentionMaskAddReshape", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textencoder_attentionreshape": [[218, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_AttentionReshape", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textencoder_causal_attention_mask": [[223, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_causal_attention_mask", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textencoder_kvreshape": [[219, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_KVReshape", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textencoder_mulreshape": [[220, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_MulReshape", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textencoder_qreshape": [[221, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_QReshape", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textencoder_softmaxreshape": [[222, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_SoftmaxReshape", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.token_type_embeddings": [[224, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.token_type_embeddings", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.token_type_embeddings_v1": [[225, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.token_type_embeddings_v1", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_embedding": [[226, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_embedding", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_ip_insert_bias": [[227, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_ip_insert_bias", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_unpack_baddbmm": [[228, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_unpack_baddbmm", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torchinsertbf16node": [[229, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torchinsertbf16node", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torchpaddingsquence": [[230, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torchpaddingsquence", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2dmodel_attentionmaskaddreshape": [[231, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_AttentionMaskAddReshape", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2dmodel_constantofshapewithmul": [[232, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_ConstantOfShapeWithMul", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2dmodel_encoderhiddenstatesreshape": [[238, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_encoderHiddenStatesReshape", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2dmodel_ffnslice": [[233, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_FFNSlice", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2dmodel_ffnslice_1": [[234, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_FFNSlice_1", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2dmodel_getsamplebatch": [[239, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_getSampleBatch", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2dmodel_qkvprereshape": [[235, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_QKVPreReshape", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2dmodel_qkvreshape": [[236, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_QKVReshape", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2dmodel_qkvreshape4d": [[237, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_QKVReshape4D", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2dmodel_sampleslice": [[240, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_sampleSlice", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transpose_batch_matmul": [[241, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transpose_batch_matmul", false]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.word_embeddings": [[242, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.word_embeddings", false]], "intel_extension_for_transformers.transformers.runtime.compile.tf_utils": [[243, "module-intel_extension_for_transformers.transformers.runtime.compile.tf_utils", false]], "intel_extension_for_transformers.transformers.runtime.compile.torch_utils": [[244, "module-intel_extension_for_transformers.transformers.runtime.compile.torch_utils", false]], "intel_extension_for_transformers.transformers.trainer": [[246, "module-intel_extension_for_transformers.transformers.trainer", false]], "intel_extension_for_transformers.transformers.utils": [[249, "module-intel_extension_for_transformers.transformers.utils", false]], "intel_extension_for_transformers.transformers.utils.config": [[247, "module-intel_extension_for_transformers.transformers.utils.config", false]], "intel_extension_for_transformers.transformers.utils.get_throughput": [[248, "module-intel_extension_for_transformers.transformers.utils.get_throughput", false]], "intel_extension_for_transformers.transformers.utils.metrics": [[250, "module-intel_extension_for_transformers.transformers.utils.metrics", false]], "intel_extension_for_transformers.transformers.utils.objectives": [[251, "module-intel_extension_for_transformers.transformers.utils.objectives", false]], "intel_extension_for_transformers.transformers.utils.utility": [[252, "module-intel_extension_for_transformers.transformers.utils.utility", false]], "interactfeatures (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.interact_features)": [[159, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.interact_features.InteractFeatures", false]], "interpolate() (in module util.misc)": [[264, "util.misc.interpolate", false]], "inverse() (in module intel_extension_for_transformers.transformers.dynamic.evolution)": [[30, "intel_extension_for_transformers.transformers.dynamic.evolution.inverse", false]], "invoke_with_optional_args() (in module intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook)": [[24, "intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook.invoke_with_optional_args", false]], "iob() (in module util.postprocess)": [[266, "util.postprocess.iob", false]], "iou (class in intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.IoU", false]], "iou() (in module util.postprocess)": [[266, "util.postprocess.iou", false]], "is_null_numpy_value() (in module intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.is_null_numpy_value", false]], "is_supported_onnx_graph() (in module intel_extension_for_transformers.transformers.runtime.compile.onnx_utils)": [[62, "intel_extension_for_transformers.transformers.runtime.compile.onnx_utils.is_supported_onnx_graph", false]], "is_supported_onnx_node() (in module intel_extension_for_transformers.transformers.runtime.compile.onnx_utils)": [[62, "intel_extension_for_transformers.transformers.runtime.compile.onnx_utils.is_supported_onnx_node", false]], "iteratorgetnext (class in intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_get_next)": [[84, "intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_get_next.IteratorGetNext", false]], "iteratorv2 (class in intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_v2)": [[85, "intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_v2.IteratorV2", false]], "itrexquantizationconfigmixin (class in intel_extension_for_transformers.transformers.utils.config)": [[247, "intel_extension_for_transformers.transformers.utils.config.ITREXQuantizationConfigMixin", false]], "jd (c++ type)": [[278, "_CPPv42jd", false], [279, "_CPPv42jd", false], [280, "_CPPv42jd", false], [281, "_CPPv42jd", false]], "jd::attention (c++ class)": [[279, "_CPPv4N2jd9attentionE", false]], "jd::attention::attention (c++ function)": [[279, "_CPPv4N2jd9attention9attentionERK17kernel_desc_proxy", false], [279, "_CPPv4N2jd9attention9attentionEv", false]], "jd::attention::~attention (c++ function)": [[279, "_CPPv4N2jd9attentionD0Ev", false]], "jd::attention_desc (c++ class)": [[279, "_CPPv4N2jd14attention_descE", false]], "jd::attention_desc::attention_desc (c++ function)": [[279, "_CPPv4N2jd14attention_desc14attention_descERK13operator_desc", false], [279, "_CPPv4N2jd14attention_desc14attention_descEv", false]], "jd::attention_desc::~attention_desc (c++ function)": [[279, "_CPPv4N2jd14attention_descD0Ev", false]], "jd::attention_io (c++ enum)": [[281, "_CPPv4N2jd12attention_ioE", false]], "jd::attention_io::k_bias (c++ enumerator)": [[281, "_CPPv4N2jd12attention_io6K_BIASE", false]], "jd::attention_io::k_scales (c++ enumerator)": [[281, "_CPPv4N2jd12attention_io8K_SCALESE", false]], "jd::attention_io::k_weight (c++ enumerator)": [[281, "_CPPv4N2jd12attention_io8K_WEIGHTE", false]], "jd::attention_io::merge_dst (c++ enumerator)": [[281, "_CPPv4N2jd12attention_io9MERGE_DSTE", false]], "jd::attention_io::merge_src (c++ enumerator)": [[281, "_CPPv4N2jd12attention_io9MERGE_SRCE", false]], "jd::attention_io::q_bias (c++ enumerator)": [[281, "_CPPv4N2jd12attention_io6Q_BIASE", false]], "jd::attention_io::q_k_scales (c++ enumerator)": [[281, "_CPPv4N2jd12attention_io10Q_K_SCALESE", false]], "jd::attention_io::q_k_src2 (c++ enumerator)": [[281, "_CPPv4N2jd12attention_io8Q_K_SRC2E", false]], "jd::attention_io::q_scales (c++ enumerator)": [[281, "_CPPv4N2jd12attention_io8Q_SCALESE", false]], "jd::attention_io::q_weight (c++ enumerator)": [[281, "_CPPv4N2jd12attention_io8Q_WEIGHTE", false]], "jd::attention_io::qk_v_output_scales (c++ enumerator)": [[281, "_CPPv4N2jd12attention_io18QK_V_OUTPUT_SCALESE", false]], "jd::attention_io::qk_v_output_zero_point (c++ enumerator)": [[281, "_CPPv4N2jd12attention_io22QK_V_OUTPUT_ZERO_POINTE", false]], "jd::attention_io::reshape_input (c++ enumerator)": [[281, "_CPPv4N2jd12attention_io13RESHAPE_INPUTE", false]], "jd::attention_io::v_bias (c++ enumerator)": [[281, "_CPPv4N2jd12attention_io6V_BIASE", false]], "jd::attention_io::v_scales (c++ enumerator)": [[281, "_CPPv4N2jd12attention_io8V_SCALESE", false]], "jd::attention_io::v_weight (c++ enumerator)": [[281, "_CPPv4N2jd12attention_io8V_WEIGHTE", false]], "jd::cpu_engine_t (c++ class)": [[278, "_CPPv4N2jd12cpu_engine_tE", false]], "jd::cpu_engine_t::cpu_engine_t (c++ function)": [[278, "_CPPv4N2jd12cpu_engine_t12cpu_engine_tEv", false]], "jd::cpu_engine_t::create_kernel (c++ function)": [[278, "_CPPv4NK2jd12cpu_engine_t13create_kernelERK13operator_descRNSt10shared_ptrI8kernel_tEEPK8stream_t", false]], "jd::cpu_engine_t::create_memory_storage (c++ function)": [[278, "_CPPv4NK2jd12cpu_engine_t21create_memory_storageEPP16memory_storage_t", false]], "jd::cpu_engine_t::create_stream (c++ function)": [[278, "_CPPv4NK2jd12cpu_engine_t13create_streamEPP8stream_t", false]], "jd::cpu_engine_t::empty_list (c++ member)": [[278, "_CPPv4N2jd12cpu_engine_t10empty_listE", false]], "jd::cpu_engine_t::get_implementation_list (c++ function)": [[278, "_CPPv4NK2jd12cpu_engine_t23get_implementation_listERK13operator_desc", false]], "jd::cpu_engine_t::~cpu_engine_t (c++ function)": [[278, "_CPPv4N2jd12cpu_engine_tD0Ev", false]], "jd::dynamic_quant (c++ class)": [[279, "_CPPv4N2jd13dynamic_quantE", false]], "jd::dynamic_quant::dynamic_quant (c++ function)": [[279, "_CPPv4N2jd13dynamic_quant13dynamic_quantERK17kernel_desc_proxy", false], [279, "_CPPv4N2jd13dynamic_quant13dynamic_quantEv", false]], "jd::dynamic_quant::~dynamic_quant (c++ function)": [[279, "_CPPv4N2jd13dynamic_quantD0Ev", false]], "jd::dynamic_quant_desc (c++ class)": [[279, "_CPPv4N2jd18dynamic_quant_descE", false]], "jd::dynamic_quant_desc::dynamic_quant_desc (c++ function)": [[279, "_CPPv4N2jd18dynamic_quant_desc18dynamic_quant_descERK13operator_desc", false], [279, "_CPPv4N2jd18dynamic_quant_desc18dynamic_quant_descEv", false]], "jd::dynamic_quant_desc::~dynamic_quant_desc (c++ function)": [[279, "_CPPv4N2jd18dynamic_quant_descD0Ev", false]], "jd::dynamic_quant_matmul (c++ class)": [[279, "_CPPv4N2jd20dynamic_quant_matmulE", false]], "jd::dynamic_quant_matmul::dynamic_quant_matmul (c++ function)": [[279, "_CPPv4N2jd20dynamic_quant_matmul20dynamic_quant_matmulERK17kernel_desc_proxy", false], [279, "_CPPv4N2jd20dynamic_quant_matmul20dynamic_quant_matmulEv", false]], "jd::dynamic_quant_matmul::~dynamic_quant_matmul (c++ function)": [[279, "_CPPv4N2jd20dynamic_quant_matmulD0Ev", false]], "jd::dynamic_quant_matmul_desc (c++ class)": [[279, "_CPPv4N2jd25dynamic_quant_matmul_descE", false]], "jd::dynamic_quant_matmul_desc::dynamic_quant_matmul_desc (c++ function)": [[279, "_CPPv4N2jd25dynamic_quant_matmul_desc25dynamic_quant_matmul_descERK13operator_desc", false], [279, "_CPPv4N2jd25dynamic_quant_matmul_desc25dynamic_quant_matmul_descEv", false]], "jd::dynamic_quant_matmul_desc::~dynamic_quant_matmul_desc (c++ function)": [[279, "_CPPv4N2jd25dynamic_quant_matmul_descD0Ev", false]], "jd::eltwiseop (c++ class)": [[279, "_CPPv4N2jd9eltwiseopE", false]], "jd::eltwiseop::eltwiseop (c++ function)": [[279, "_CPPv4N2jd9eltwiseop9eltwiseopERK17kernel_desc_proxy", false], [279, "_CPPv4N2jd9eltwiseop9eltwiseopEv", false]], "jd::eltwiseop::~eltwiseop (c++ function)": [[279, "_CPPv4N2jd9eltwiseopD0Ev", false]], "jd::eltwiseop_desc (c++ class)": [[279, "_CPPv4N2jd14eltwiseop_descE", false]], "jd::eltwiseop_desc::eltwiseop_desc (c++ function)": [[279, "_CPPv4N2jd14eltwiseop_desc14eltwiseop_descERK13operator_desc", false], [279, "_CPPv4N2jd14eltwiseop_desc14eltwiseop_descEv", false]], "jd::eltwiseop_desc::~eltwiseop_desc (c++ function)": [[279, "_CPPv4N2jd14eltwiseop_descD0Ev", false]], "jd::engine_t (c++ class)": [[278, "_CPPv4N2jd8engine_tE", false]], "jd::engine_t::create_kernel (c++ function)": [[278, "_CPPv4NK2jd8engine_t13create_kernelERK13operator_descRNSt10shared_ptrI8kernel_tEEPK8stream_t", false]], "jd::engine_t::create_memory_storage (c++ function)": [[278, "_CPPv4NK2jd8engine_t21create_memory_storageEPP16memory_storage_t", false]], "jd::engine_t::create_stream (c++ function)": [[278, "_CPPv4NK2jd8engine_t13create_streamEPP8stream_t", false]], "jd::engine_t::engine_kind_ (c++ member)": [[278, "_CPPv4N2jd8engine_t12engine_kind_E", false]], "jd::engine_t::engine_t (c++ function)": [[278, "_CPPv4N2jd8engine_t8engine_tERK11engine_kindRK12runtime_kind", false]], "jd::engine_t::get_engine_kind (c++ function)": [[278, "_CPPv4NK2jd8engine_t15get_engine_kindEv", false]], "jd::engine_t::get_implementation_list (c++ function)": [[278, "_CPPv4NK2jd8engine_t23get_implementation_listERK13operator_desc", false]], "jd::engine_t::get_runtime_kind (c++ function)": [[278, "_CPPv4NK2jd8engine_t16get_runtime_kindEv", false]], "jd::engine_t::runtime_kind_ (c++ member)": [[278, "_CPPv4N2jd8engine_t13runtime_kind_E", false]], "jd::engine_t::~engine_t (c++ function)": [[278, "_CPPv4N2jd8engine_tD0Ev", false]], "jd::gather (c++ class)": [[279, "_CPPv4N2jd6gatherE", false]], "jd::gather::gather (c++ function)": [[279, "_CPPv4N2jd6gather6gatherERK17kernel_desc_proxy", false], [279, "_CPPv4N2jd6gather6gatherEv", false]], "jd::gather::~gather (c++ function)": [[279, "_CPPv4N2jd6gatherD0Ev", false]], "jd::gather_desc (c++ class)": [[279, "_CPPv4N2jd11gather_descE", false]], "jd::gather_desc::gather_desc (c++ function)": [[279, "_CPPv4N2jd11gather_desc11gather_descERK13operator_desc", false], [279, "_CPPv4N2jd11gather_desc11gather_descEv", false]], "jd::gather_desc::~gather_desc (c++ function)": [[279, "_CPPv4N2jd11gather_descD0Ev", false]], "jd::groupnorm (c++ class)": [[279, "_CPPv4N2jd9groupnormE", false]], "jd::groupnorm::groupnorm (c++ function)": [[279, "_CPPv4N2jd9groupnorm9groupnormERK17kernel_desc_proxy", false], [279, "_CPPv4N2jd9groupnorm9groupnormEv", false]], "jd::groupnorm::~groupnorm (c++ function)": [[279, "_CPPv4N2jd9groupnormD0Ev", false]], "jd::groupnorm_desc (c++ class)": [[279, "_CPPv4N2jd14groupnorm_descE", false]], "jd::groupnorm_desc::groupnorm_desc (c++ function)": [[279, "_CPPv4N2jd14groupnorm_desc14groupnorm_descERK13operator_desc", false], [279, "_CPPv4N2jd14groupnorm_desc14groupnorm_descEv", false]], "jd::groupnorm_desc::~groupnorm_desc (c++ function)": [[279, "_CPPv4N2jd14groupnorm_descD0Ev", false]], "jd::kernel_desc_proxy (c++ class)": [[279, "_CPPv4N2jd17kernel_desc_proxyE", false]], "jd::kernel_desc_proxy::create_proxy_object (c++ function)": [[279, "_CPPv4N2jd17kernel_desc_proxy19create_proxy_objectERNSt10shared_ptrIK13kernel_desc_tEERK13operator_desc", false]], "jd::kernel_desc_proxy::impl_list_ (c++ member)": [[279, "_CPPv4N2jd17kernel_desc_proxy10impl_list_E", false]], "jd::kernel_desc_proxy::kernel_desc_proxy (c++ function)": [[279, "_CPPv4N2jd17kernel_desc_proxy17kernel_desc_proxyERK13operator_desc", false], [279, "_CPPv4N2jd17kernel_desc_proxy17kernel_desc_proxyEv", false]], "jd::kernel_desc_proxy::kernel_kind (c++ function)": [[279, "_CPPv4NK2jd17kernel_desc_proxy11kernel_kindEv", false]], "jd::kernel_desc_proxy::~kernel_desc_proxy (c++ function)": [[279, "_CPPv4N2jd17kernel_desc_proxyD0Ev", false]], "jd::kernel_proxy (c++ class)": [[279, "_CPPv4N2jd12kernel_proxyE", false]], "jd::kernel_proxy::create_proxy_object (c++ function)": [[279, "_CPPv4N2jd12kernel_proxy19create_proxy_objectERNSt10shared_ptrIK8kernel_tEERKNSt10shared_ptrIK13kernel_desc_tEE", false]], "jd::kernel_proxy::execute (c++ function)": [[279, "_CPPv4NK2jd12kernel_proxy7executeERK14exec_context_t", false], [279, "_CPPv4NK2jd12kernel_proxy7executeERKNSt6vectorIPKvEE", false]], "jd::kernel_proxy::get_workspace_size (c++ function)": [[279, "_CPPv4NK2jd12kernel_proxy18get_workspace_sizeEv", false]], "jd::kernel_proxy::kernel_kind (c++ function)": [[279, "_CPPv4NK2jd12kernel_proxy11kernel_kindEv", false]], "jd::kernel_proxy::kernel_proxy (c++ function)": [[279, "_CPPv4N2jd12kernel_proxy12kernel_proxyERK17kernel_desc_proxy", false], [279, "_CPPv4N2jd12kernel_proxy12kernel_proxyEv", false]], "jd::kernel_proxy::~kernel_proxy (c++ function)": [[279, "_CPPv4N2jd12kernel_proxyD0Ev", false]], "jd::layernorm_ba (c++ class)": [[279, "_CPPv4N2jd12layernorm_baE", false]], "jd::layernorm_ba::layernorm_ba (c++ function)": [[279, "_CPPv4N2jd12layernorm_ba12layernorm_baERK17kernel_desc_proxy", false], [279, "_CPPv4N2jd12layernorm_ba12layernorm_baEv", false]], "jd::layernorm_ba::~layernorm_ba (c++ function)": [[279, "_CPPv4N2jd12layernorm_baD0Ev", false]], "jd::layernorm_ba_desc (c++ class)": [[279, "_CPPv4N2jd17layernorm_ba_descE", false]], "jd::layernorm_ba_desc::layernorm_ba_desc (c++ function)": [[279, "_CPPv4N2jd17layernorm_ba_desc17layernorm_ba_descERK13operator_desc", false], [279, "_CPPv4N2jd17layernorm_ba_desc17layernorm_ba_descEv", false]], "jd::layernorm_ba_desc::~layernorm_ba_desc (c++ function)": [[279, "_CPPv4N2jd17layernorm_ba_descD0Ev", false]], "jd::layernormalized_spmm (c++ class)": [[279, "_CPPv4N2jd20layernormalized_spmmE", false]], "jd::layernormalized_spmm::layernormalized_spmm (c++ function)": [[279, "_CPPv4N2jd20layernormalized_spmm20layernormalized_spmmERK17kernel_desc_proxy", false], [279, "_CPPv4N2jd20layernormalized_spmm20layernormalized_spmmEv", false]], "jd::layernormalized_spmm::~layernormalized_spmm (c++ function)": [[279, "_CPPv4N2jd20layernormalized_spmmD0Ev", false]], "jd::layernormalized_spmm_desc (c++ class)": [[279, "_CPPv4N2jd25layernormalized_spmm_descE", false]], "jd::layernormalized_spmm_desc::layernormalized_spmm_desc (c++ function)": [[279, "_CPPv4N2jd25layernormalized_spmm_desc25layernormalized_spmm_descERK13operator_desc", false], [279, "_CPPv4N2jd25layernormalized_spmm_desc25layernormalized_spmm_descEv", false]], "jd::layernormalized_spmm_desc::~layernormalized_spmm_desc (c++ function)": [[279, "_CPPv4N2jd25layernormalized_spmm_descD0Ev", false]], "jd::logsoftmax (c++ class)": [[279, "_CPPv4N2jd10logsoftmaxE", false]], "jd::logsoftmax::logsoftmax (c++ function)": [[279, "_CPPv4N2jd10logsoftmax10logsoftmaxERK17kernel_desc_proxy", false], [279, "_CPPv4N2jd10logsoftmax10logsoftmaxEv", false]], "jd::logsoftmax::~logsoftmax (c++ function)": [[279, "_CPPv4N2jd10logsoftmaxD0Ev", false]], "jd::logsoftmax_desc (c++ class)": [[279, "_CPPv4N2jd15logsoftmax_descE", false]], "jd::logsoftmax_desc::logsoftmax_desc (c++ function)": [[279, "_CPPv4N2jd15logsoftmax_desc15logsoftmax_descERK13operator_desc", false], [279, "_CPPv4N2jd15logsoftmax_desc15logsoftmax_descEv", false]], "jd::logsoftmax_desc::~logsoftmax_desc (c++ function)": [[279, "_CPPv4N2jd15logsoftmax_descD0Ev", false]], "jd::mha_dense (c++ class)": [[279, "_CPPv4N2jd9mha_denseE", false]], "jd::mha_dense::mha_dense (c++ function)": [[279, "_CPPv4N2jd9mha_dense9mha_denseERK17kernel_desc_proxy", false], [279, "_CPPv4N2jd9mha_dense9mha_denseEv", false]], "jd::mha_dense::~mha_dense (c++ function)": [[279, "_CPPv4N2jd9mha_denseD0Ev", false]], "jd::mha_dense_desc (c++ class)": [[279, "_CPPv4N2jd14mha_dense_descE", false]], "jd::mha_dense_desc::mha_dense_desc (c++ function)": [[279, "_CPPv4N2jd14mha_dense_desc14mha_dense_descERK13operator_desc", false], [279, "_CPPv4N2jd14mha_dense_desc14mha_dense_descEv", false]], "jd::mha_dense_desc::~mha_dense_desc (c++ function)": [[279, "_CPPv4N2jd14mha_dense_descD0Ev", false]], "jd::operator_desc (c++ class)": [[280, "_CPPv4N2jd13operator_descE", false]], "jd::operator_desc::apply_postops_list (c++ function)": [[280, "_CPPv4NK2jd13operator_desc18apply_postops_listEv", false]], "jd::operator_desc::apply_postops_list_ (c++ member)": [[280, "_CPPv4N2jd13operator_desc19apply_postops_list_E", false]], "jd::operator_desc::attrs (c++ function)": [[280, "_CPPv4NK2jd13operator_desc5attrsEv", false]], "jd::operator_desc::attrs_ (c++ member)": [[280, "_CPPv4N2jd13operator_desc6attrs_E", false]], "jd::operator_desc::binaryop_list_ (c++ member)": [[280, "_CPPv4N2jd13operator_desc14binaryop_list_E", false]], "jd::operator_desc::engine_kind (c++ function)": [[280, "_CPPv4NK2jd13operator_desc11engine_kindEv", false]], "jd::operator_desc::engine_kind_ (c++ member)": [[280, "_CPPv4N2jd13operator_desc12engine_kind_E", false]], "jd::operator_desc::get_binaryop_list (c++ function)": [[280, "_CPPv4NK2jd13operator_desc17get_binaryop_listEv", false]], "jd::operator_desc::impl_nthr (c++ function)": [[280, "_CPPv4NK2jd13operator_desc9impl_nthrEv", false]], "jd::operator_desc::impl_nthr_ (c++ member)": [[280, "_CPPv4N2jd13operator_desc10impl_nthr_E", false]], "jd::operator_desc::ker_kind_ (c++ member)": [[280, "_CPPv4N2jd13operator_desc9ker_kind_E", false]], "jd::operator_desc::ker_prop_ (c++ member)": [[280, "_CPPv4N2jd13operator_desc9ker_prop_E", false]], "jd::operator_desc::kernel_kind (c++ function)": [[280, "_CPPv4NK2jd13operator_desc11kernel_kindEv", false]], "jd::operator_desc::kernel_prop (c++ function)": [[280, "_CPPv4NK2jd13operator_desc11kernel_propEv", false]], "jd::operator_desc::operator== (c++ function)": [[280, "_CPPv4NK2jd13operator_desceqERK13operator_desc", false]], "jd::operator_desc::operator_desc (c++ function)": [[280, "_CPPv4N2jd13operator_desc13operator_descERK11kernel_kindRK11kernel_propRK11engine_kindRK12runtime_kindRKNSt6vectorI11tensor_descEERKNSt13unordered_mapINSt6stringENSt6stringEEERKNSt6vectorI11postop_attrEE", false], [280, "_CPPv4N2jd13operator_desc13operator_descERK11kernel_kindRK11kernel_propRK11engine_kindRKNSt6vectorI11tensor_descEERKNSt13unordered_mapINSt6stringENSt6stringEEERKNSt6vectorI11postop_attrEE", false], [280, "_CPPv4N2jd13operator_desc13operator_descEv", false]], "jd::operator_desc::runtime_kind (c++ function)": [[280, "_CPPv4NK2jd13operator_desc12runtime_kindEv", false]], "jd::operator_desc::runtime_kind_ (c++ member)": [[280, "_CPPv4N2jd13operator_desc13runtime_kind_E", false]], "jd::operator_desc::set_binaryop_list (c++ function)": [[280, "_CPPv4N2jd13operator_desc17set_binaryop_listERKNSt6vectorI13binaryop_attrEE", false]], "jd::operator_desc::tensor_descs (c++ function)": [[280, "_CPPv4NK2jd13operator_desc12tensor_descsEv", false]], "jd::operator_desc::tensor_dtypes (c++ function)": [[280, "_CPPv4NK2jd13operator_desc13tensor_dtypesEv", false]], "jd::operator_desc::tensor_ftypes (c++ function)": [[280, "_CPPv4NK2jd13operator_desc13tensor_ftypesEv", false]], "jd::operator_desc::tensor_shapes (c++ function)": [[280, "_CPPv4NK2jd13operator_desc13tensor_shapesEv", false]], "jd::operator_desc::ts_descs_ (c++ member)": [[280, "_CPPv4N2jd13operator_desc9ts_descs_E", false]], "jd::operator_desc::~operator_desc (c++ function)": [[280, "_CPPv4N2jd13operator_descD0Ev", false]], "jd::proxy_base (c++ class)": [[279, "_CPPv4I00EN2jd10proxy_baseE", false]], "jd::proxy_base::create_proxy_object (c++ function)": [[279, "_CPPv4N2jd10proxy_base19create_proxy_objectERNSt10shared_ptrIK1TEERK5arg_t", false]], "jd::proxy_base::data_handle_ (c++ member)": [[279, "_CPPv4N2jd10proxy_base12data_handle_E", false]], "jd::proxy_base::get_sp (c++ function)": [[279, "_CPPv4NK2jd10proxy_base6get_spEv", false]], "jd::proxy_base::proxy_base (c++ function)": [[279, "_CPPv4N2jd10proxy_base10proxy_baseEv", false]], "jd::proxy_base::reset_sp (c++ function)": [[279, "_CPPv4N2jd10proxy_base8reset_spERKNSt10shared_ptrIK1TEE", false]], "jd::proxy_base::~proxy_base (c++ function)": [[279, "_CPPv4N2jd10proxy_baseD0Ev", false]], "jd::slice (c++ class)": [[279, "_CPPv4N2jd5sliceE", false]], "jd::slice::slice (c++ function)": [[279, "_CPPv4N2jd5slice5sliceERK17kernel_desc_proxy", false], [279, "_CPPv4N2jd5slice5sliceEv", false]], "jd::slice::~slice (c++ function)": [[279, "_CPPv4N2jd5sliceD0Ev", false]], "jd::slice_desc (c++ class)": [[279, "_CPPv4N2jd10slice_descE", false]], "jd::slice_desc::slice_desc (c++ function)": [[279, "_CPPv4N2jd10slice_desc10slice_descERK13operator_desc", false], [279, "_CPPv4N2jd10slice_desc10slice_descEv", false]], "jd::slice_desc::~slice_desc (c++ function)": [[279, "_CPPv4N2jd10slice_descD0Ev", false]], "jd::softmax (c++ class)": [[279, "_CPPv4N2jd7softmaxE", false]], "jd::softmax::softmax (c++ function)": [[279, "_CPPv4N2jd7softmax7softmaxERK17kernel_desc_proxy", false], [279, "_CPPv4N2jd7softmax7softmaxEv", false]], "jd::softmax::~softmax (c++ function)": [[279, "_CPPv4N2jd7softmaxD0Ev", false]], "jd::softmax_desc (c++ class)": [[279, "_CPPv4N2jd12softmax_descE", false]], "jd::softmax_desc::softmax_desc (c++ function)": [[279, "_CPPv4N2jd12softmax_desc12softmax_descERK13operator_desc", false], [279, "_CPPv4N2jd12softmax_desc12softmax_descEv", false]], "jd::softmax_desc::~softmax_desc (c++ function)": [[279, "_CPPv4N2jd12softmax_descD0Ev", false]], "jd::sparse_matmul (c++ class)": [[279, "_CPPv4N2jd13sparse_matmulE", false]], "jd::sparse_matmul::sparse_matmul (c++ function)": [[279, "_CPPv4N2jd13sparse_matmul13sparse_matmulERK17kernel_desc_proxy", false], [279, "_CPPv4N2jd13sparse_matmul13sparse_matmulEv", false]], "jd::sparse_matmul::~sparse_matmul (c++ function)": [[279, "_CPPv4N2jd13sparse_matmulD0Ev", false]], "jd::sparse_matmul_desc (c++ class)": [[279, "_CPPv4N2jd18sparse_matmul_descE", false]], "jd::sparse_matmul_desc::sparse_matmul_desc (c++ function)": [[279, "_CPPv4N2jd18sparse_matmul_desc18sparse_matmul_descERK13operator_desc", false], [279, "_CPPv4N2jd18sparse_matmul_desc18sparse_matmul_descEv", false]], "jd::sparse_matmul_desc::~sparse_matmul_desc (c++ function)": [[279, "_CPPv4N2jd18sparse_matmul_descD0Ev", false]], "jd::ssd (c++ type)": [[281, "_CPPv4N2jd3ssdE", false]], "jd::ssd::amx_bf16_params_t (c++ type)": [[281, "_CPPv4N2jd3ssd17amx_bf16_params_tE", false]], "jd::ssd::amx_bf16bf16_inputs_t (c++ type)": [[281, "_CPPv4N2jd3ssd21amx_bf16bf16_inputs_tE", false]], "jd::ssd::amx_bf16f32_inputs_t (c++ type)": [[281, "_CPPv4N2jd3ssd20amx_bf16f32_inputs_tE", false]], "jd::ssd::amx_inputs_t (c++ struct)": [[281, "_CPPv4I0000EN2jd3ssd12amx_inputs_tE", false]], "jd::ssd::amx_inputs_t::bias (c++ member)": [[281, "_CPPv4N2jd3ssd12amx_inputs_t4biasE", false]], "jd::ssd::amx_inputs_t::dst (c++ member)": [[281, "_CPPv4N2jd3ssd12amx_inputs_t3dstE", false]], "jd::ssd::amx_inputs_t::src (c++ member)": [[281, "_CPPv4N2jd3ssd12amx_inputs_t3srcE", false]], "jd::ssd::amx_inputs_t::weight (c++ member)": [[281, "_CPPv4N2jd3ssd12amx_inputs_t6weightE", false]], "jd::ssd::amx_int8_params_t (c++ type)": [[281, "_CPPv4N2jd3ssd17amx_int8_params_tE", false]], "jd::ssd::amx_params_t (c++ struct)": [[281, "_CPPv4I0EN2jd3ssd12amx_params_tE", false]], "jd::ssd::amx_params_t::blocks_per_group (c++ member)": [[281, "_CPPv4N2jd3ssd12amx_params_t16blocks_per_groupE", false]], "jd::ssd::amx_params_t::blocksize (c++ member)": [[281, "_CPPv4N2jd3ssd12amx_params_t9blocksizeE", false]], "jd::ssd::amx_params_t::colidxs (c++ member)": [[281, "_CPPv4N2jd3ssd12amx_params_t7colidxsE", false]], "jd::ssd::amx_params_t::group_rowptr (c++ member)": [[281, "_CPPv4N2jd3ssd12amx_params_t12group_rowptrE", false]], "jd::ssd::amx_params_t::has_bias (c++ member)": [[281, "_CPPv4N2jd3ssd12amx_params_t8has_biasE", false]], "jd::ssd::amx_params_t::nnz_group (c++ member)": [[281, "_CPPv4N2jd3ssd12amx_params_t9nnz_groupE", false]], "jd::ssd::amx_params_t::nrowptr (c++ member)": [[281, "_CPPv4N2jd3ssd12amx_params_t7nrowptrE", false]], "jd::ssd::amx_params_t::num_tilem (c++ member)": [[281, "_CPPv4N2jd3ssd12amx_params_t9num_tileME", false]], "jd::ssd::amx_params_t::postop_attrs (c++ member)": [[281, "_CPPv4N2jd3ssd12amx_params_t12postop_attrsE", false]], "jd::ssd::amx_params_t::same_src_dtype (c++ member)": [[281, "_CPPv4N2jd3ssd12amx_params_t14same_src_dtypeE", false]], "jd::ssd::amx_params_t::shape (c++ member)": [[281, "_CPPv4N2jd3ssd12amx_params_t5shapeE", false]], "jd::ssd::amx_params_t::tilem (c++ member)": [[281, "_CPPv4N2jd3ssd12amx_params_t5tileME", false]], "jd::ssd::amx_params_t::tilen (c++ member)": [[281, "_CPPv4N2jd3ssd12amx_params_t5tileNE", false]], "jd::ssd::amx_params_t::weight (c++ member)": [[281, "_CPPv4N2jd3ssd12amx_params_t6weightE", false]], "jd::ssd::avx512_data_t (c++ struct)": [[281, "_CPPv4N2jd3ssd13avx512_data_tE", false]], "jd::ssd::avx512_data_t::bias (c++ member)": [[281, "_CPPv4N2jd3ssd13avx512_data_t4biasE", false]], "jd::ssd::avx512_data_t::dense (c++ member)": [[281, "_CPPv4N2jd3ssd13avx512_data_t5denseE", false]], "jd::ssd::avx512_data_t::dst (c++ member)": [[281, "_CPPv4N2jd3ssd13avx512_data_t3dstE", false]], "jd::ssd::avx512_data_t::sparse (c++ member)": [[281, "_CPPv4N2jd3ssd13avx512_data_t6sparseE", false]], "jd::ssd::avx512_fp32_params_t (c++ struct)": [[281, "_CPPv4N2jd3ssd20avx512_fp32_params_tE", false]], "jd::ssd::avx512_fp32_params_t::has_bias (c++ member)": [[281, "_CPPv4N2jd3ssd20avx512_fp32_params_t8has_biasE", false]], "jd::ssd::avx512_fp32_params_t::im_end (c++ member)": [[281, "_CPPv4N2jd3ssd20avx512_fp32_params_t6im_endE", false]], "jd::ssd::avx512_fp32_params_t::im_start (c++ member)": [[281, "_CPPv4N2jd3ssd20avx512_fp32_params_t8im_startE", false]], "jd::ssd::avx512_fp32_params_t::in_end (c++ member)": [[281, "_CPPv4N2jd3ssd20avx512_fp32_params_t6in_endE", false]], "jd::ssd::avx512_fp32_params_t::in_start (c++ member)": [[281, "_CPPv4N2jd3ssd20avx512_fp32_params_t8in_startE", false]], "jd::ssd::avx512_fp32_params_t::k (c++ member)": [[281, "_CPPv4N2jd3ssd20avx512_fp32_params_t1KE", false]], "jd::ssd::avx512_fp32_params_t::m (c++ member)": [[281, "_CPPv4N2jd3ssd20avx512_fp32_params_t1ME", false]], "jd::ssd::avx512_fp32_params_t::n (c++ member)": [[281, "_CPPv4N2jd3ssd20avx512_fp32_params_t1NE", false]], "jd::ssd::avx512_fp32_params_t::postop_attrs (c++ member)": [[281, "_CPPv4N2jd3ssd20avx512_fp32_params_t12postop_attrsE", false]], "jd::ssd::avx512_fp32_params_t::sparse_ptr (c++ member)": [[281, "_CPPv4N2jd3ssd20avx512_fp32_params_t10sparse_ptrE", false]], "jd::ssd::bias (c++ member)": [[281, "_CPPv4N2jd3ssd4BIASE", false]], "jd::ssd::dst (c++ member)": [[281, "_CPPv4N2jd3ssd3DSTE", false]], "jd::ssd::dst_m1 (c++ member)": [[281, "_CPPv4N2jd3ssd6DST_M1E", false]], "jd::ssd::dst_m2 (c++ member)": [[281, "_CPPv4N2jd3ssd6DST_M2E", false]], "jd::ssd::eltwiseop_data_t (c++ struct)": [[281, "_CPPv4N2jd3ssd16eltwiseop_data_tE", false]], "jd::ssd::eltwiseop_data_t::dst (c++ member)": [[281, "_CPPv4N2jd3ssd16eltwiseop_data_t3dstE", false]], "jd::ssd::eltwiseop_data_t::element_num (c++ member)": [[281, "_CPPv4N2jd3ssd16eltwiseop_data_t11element_numE", false]], "jd::ssd::eltwiseop_data_t::src (c++ member)": [[281, "_CPPv4N2jd3ssd16eltwiseop_data_t3srcE", false]], "jd::ssd::eltwiseop_param_t (c++ struct)": [[281, "_CPPv4N2jd3ssd17eltwiseop_param_tE", false]], "jd::ssd::eltwiseop_param_t::element_num (c++ member)": [[281, "_CPPv4N2jd3ssd17eltwiseop_param_t11element_numE", false]], "jd::ssd::eltwiseop_param_t::element_num_each_th (c++ member)": [[281, "_CPPv4N2jd3ssd17eltwiseop_param_t19element_num_each_thE", false]], "jd::ssd::eltwiseop_param_t::in_dt (c++ member)": [[281, "_CPPv4N2jd3ssd17eltwiseop_param_t5in_dtE", false]], "jd::ssd::eltwiseop_param_t::out_dt (c++ member)": [[281, "_CPPv4N2jd3ssd17eltwiseop_param_t6out_dtE", false]], "jd::ssd::eltwiseop_param_t::postop_attrs (c++ member)": [[281, "_CPPv4N2jd3ssd17eltwiseop_param_t12postop_attrsE", false]], "jd::ssd::eltwiseop_param_t::remain_element (c++ member)": [[281, "_CPPv4N2jd3ssd17eltwiseop_param_t14remain_elementE", false]], "jd::ssd::layernorm_ba_data_t (c++ struct)": [[281, "_CPPv4N2jd3ssd19layernorm_ba_data_tE", false]], "jd::ssd::layernorm_ba_data_t::[anonymous] (c++ member)": [[281, "_CPPv4N2jd3ssd19layernorm_ba_data_tUt1_3E", false]], "jd::ssd::layernorm_ba_data_t::alpha (c++ member)": [[281, "_CPPv4N2jd3ssd19layernorm_ba_data_t5alphaE", false]], "jd::ssd::layernorm_ba_data_t::beta (c++ member)": [[281, "_CPPv4N2jd3ssd19layernorm_ba_data_t4betaE", false]], "jd::ssd::layernorm_ba_data_t::dst (c++ member)": [[281, "_CPPv4N2jd3ssd19layernorm_ba_data_t3dstE", false]], "jd::ssd::layernorm_ba_data_t::dst2 (c++ member)": [[281, "_CPPv4N2jd3ssd19layernorm_ba_data_t4dst2E", false]], "jd::ssd::layernorm_ba_data_t::eps (c++ member)": [[281, "_CPPv4N2jd3ssd19layernorm_ba_data_t3epsE", false]], "jd::ssd::layernorm_ba_data_t::mean (c++ member)": [[281, "_CPPv4N2jd3ssd19layernorm_ba_data_t4meanE", false]], "jd::ssd::layernorm_ba_data_t::n (c++ member)": [[281, "_CPPv4N2jd3ssd19layernorm_ba_data_t1nE", false]], "jd::ssd::layernorm_ba_data_t::one (c++ member)": [[281, "_CPPv4N2jd3ssd19layernorm_ba_data_t3oneE", false]], "jd::ssd::layernorm_ba_data_t::process_row (c++ member)": [[281, "_CPPv4N2jd3ssd19layernorm_ba_data_t11process_rowE", false]], "jd::ssd::layernorm_ba_data_t::src (c++ member)": [[281, "_CPPv4N2jd3ssd19layernorm_ba_data_t3srcE", false]], "jd::ssd::layernorm_ba_data_t::var (c++ member)": [[281, "_CPPv4N2jd3ssd19layernorm_ba_data_t3varE", false]], "jd::ssd::layernorm_ba_param_t (c++ struct)": [[281, "_CPPv4N2jd3ssd20layernorm_ba_param_tE", false]], "jd::ssd::layernorm_ba_param_t::batch_num (c++ member)": [[281, "_CPPv4N2jd3ssd20layernorm_ba_param_t9batch_numE", false]], "jd::ssd::layernorm_ba_param_t::binaryop_attrs (c++ member)": [[281, "_CPPv4N2jd3ssd20layernorm_ba_param_t14binaryop_attrsE", false]], "jd::ssd::layernorm_ba_param_t::col_num (c++ member)": [[281, "_CPPv4N2jd3ssd20layernorm_ba_param_t7col_numE", false]], "jd::ssd::layernorm_ba_param_t::direct_process_row (c++ member)": [[281, "_CPPv4N2jd3ssd20layernorm_ba_param_t18direct_process_rowE", false]], "jd::ssd::layernorm_ba_param_t::input_dt (c++ member)": [[281, "_CPPv4N2jd3ssd20layernorm_ba_param_t8input_dtE", false]], "jd::ssd::layernorm_ba_param_t::ker_per_batch (c++ member)": [[281, "_CPPv4N2jd3ssd20layernorm_ba_param_t13ker_per_batchE", false]], "jd::ssd::layernorm_ba_param_t::output2_dt (c++ member)": [[281, "_CPPv4N2jd3ssd20layernorm_ba_param_t10output2_dtE", false]], "jd::ssd::layernorm_ba_param_t::output_dt (c++ member)": [[281, "_CPPv4N2jd3ssd20layernorm_ba_param_t9output_dtE", false]], "jd::ssd::layernorm_ba_param_t::postop_attrs (c++ member)": [[281, "_CPPv4N2jd3ssd20layernorm_ba_param_t12postop_attrsE", false]], "jd::ssd::layernorm_ba_param_t::process_batch_per_ker (c++ member)": [[281, "_CPPv4N2jd3ssd20layernorm_ba_param_t21process_batch_per_kerE", false]], "jd::ssd::layernorm_ba_param_t::process_col (c++ member)": [[281, "_CPPv4N2jd3ssd20layernorm_ba_param_t11process_colE", false]], "jd::ssd::layernorm_ba_param_t::row_num (c++ member)": [[281, "_CPPv4N2jd3ssd20layernorm_ba_param_t7row_numE", false]], "jd::ssd::layernorm_ba_param_t::spec_type (c++ member)": [[281, "_CPPv4N2jd3ssd20layernorm_ba_param_t9spec_typeE", false]], "jd::ssd::layernorm_ba_param_t::split_output (c++ member)": [[281, "_CPPv4N2jd3ssd20layernorm_ba_param_t12split_outputE", false]], "jd::ssd::layernorm_ba_param_t::thread_elt_offset (c++ member)": [[281, "_CPPv4N2jd3ssd20layernorm_ba_param_t17thread_elt_offsetE", false]], "jd::ssd::matmul_data_t (c++ struct)": [[281, "_CPPv4N2jd3ssd13matmul_data_tE", false]], "jd::ssd::matmul_data_t::dst (c++ member)": [[281, "_CPPv4N2jd3ssd13matmul_data_t3dstE", false]], "jd::ssd::matmul_data_t::src0 (c++ member)": [[281, "_CPPv4N2jd3ssd13matmul_data_t4src0E", false]], "jd::ssd::matmul_data_t::src1 (c++ member)": [[281, "_CPPv4N2jd3ssd13matmul_data_t4src1E", false]], "jd::ssd::matmul_data_t::src2 (c++ member)": [[281, "_CPPv4N2jd3ssd13matmul_data_t4src2E", false]], "jd::ssd::matmul_fp8_data_t (c++ struct)": [[281, "_CPPv4N2jd3ssd17matmul_fp8_data_tE", false]], "jd::ssd::matmul_fp8_data_t::alpha (c++ member)": [[281, "_CPPv4N2jd3ssd17matmul_fp8_data_t5alphaE", false]], "jd::ssd::matmul_fp8_data_t::astep (c++ member)": [[281, "_CPPv4N2jd3ssd17matmul_fp8_data_t5astepE", false]], "jd::ssd::matmul_fp8_data_t::beta (c++ member)": [[281, "_CPPv4N2jd3ssd17matmul_fp8_data_t4betaE", false]], "jd::ssd::matmul_fp8_data_t::bstep (c++ member)": [[281, "_CPPv4N2jd3ssd17matmul_fp8_data_t5bstepE", false]], "jd::ssd::matmul_fp8_data_t::cstep (c++ member)": [[281, "_CPPv4N2jd3ssd17matmul_fp8_data_t5cstepE", false]], "jd::ssd::matmul_fp8_data_t::dstep (c++ member)": [[281, "_CPPv4N2jd3ssd17matmul_fp8_data_t5dstepE", false]], "jd::ssd::matmul_fp8_data_t::k (c++ member)": [[281, "_CPPv4N2jd3ssd17matmul_fp8_data_t1kE", false]], "jd::ssd::matmul_fp8_data_t::kpos (c++ member)": [[281, "_CPPv4N2jd3ssd17matmul_fp8_data_t4kposE", false]], "jd::ssd::matmul_fp8_data_t::mata (c++ member)": [[281, "_CPPv4N2jd3ssd17matmul_fp8_data_t4matAE", false]], "jd::ssd::matmul_fp8_data_t::matb (c++ member)": [[281, "_CPPv4N2jd3ssd17matmul_fp8_data_t4matBE", false]], "jd::ssd::matmul_fp8_data_t::matc (c++ member)": [[281, "_CPPv4N2jd3ssd17matmul_fp8_data_t4matCE", false]], "jd::ssd::matmul_fp8_data_t::matd (c++ member)": [[281, "_CPPv4N2jd3ssd17matmul_fp8_data_t4matDE", false]], "jd::ssd::matmul_fp8_data_t::mate (c++ member)": [[281, "_CPPv4N2jd3ssd17matmul_fp8_data_t4matEE", false]], "jd::ssd::matmul_fp8_data_t::n (c++ member)": [[281, "_CPPv4N2jd3ssd17matmul_fp8_data_t1nE", false]], "jd::ssd::matmul_fp8_data_t::scale (c++ member)": [[281, "_CPPv4N2jd3ssd17matmul_fp8_data_t5scaleE", false]], "jd::ssd::matmul_fp8_param_t (c++ struct)": [[281, "_CPPv4N2jd3ssd18matmul_fp8_param_tE", false]], "jd::ssd::matmul_fp8_param_t::[anonymous] (c++ member)": [[281, "_CPPv4N2jd3ssd18matmul_fp8_param_tUt1_5E", false]], "jd::ssd::matmul_fp8_param_t::alpha (c++ member)": [[281, "_CPPv4N2jd3ssd18matmul_fp8_param_t5alphaE", false]], "jd::ssd::matmul_fp8_param_t::beta (c++ member)": [[281, "_CPPv4N2jd3ssd18matmul_fp8_param_t4betaE", false]], "jd::ssd::matmul_fp8_param_t::has_append_sum (c++ member)": [[281, "_CPPv4N2jd3ssd18matmul_fp8_param_t14has_append_sumE", false]], "jd::ssd::matmul_fp8_param_t::has_scale0 (c++ member)": [[281, "_CPPv4N2jd3ssd18matmul_fp8_param_t10has_scale0E", false]], "jd::ssd::matmul_fp8_param_t::k (c++ member)": [[281, "_CPPv4N2jd3ssd18matmul_fp8_param_t1KE", false]], "jd::ssd::matmul_fp8_param_t::m (c++ member)": [[281, "_CPPv4N2jd3ssd18matmul_fp8_param_t1ME", false]], "jd::ssd::matmul_fp8_param_t::n (c++ member)": [[281, "_CPPv4N2jd3ssd18matmul_fp8_param_t1NE", false]], "jd::ssd::matmul_fp8_param_t::postop_attrs (c++ member)": [[281, "_CPPv4N2jd3ssd18matmul_fp8_param_t12postop_attrsE", false]], "jd::ssd::matmul_fp8_param_t::thread_num (c++ member)": [[281, "_CPPv4N2jd3ssd18matmul_fp8_param_t10thread_numE", false]], "jd::ssd::matmul_fp8_param_t::weight_8bit (c++ member)": [[281, "_CPPv4N2jd3ssd18matmul_fp8_param_t11weight_8bitE", false]], "jd::ssd::matmul_fp8_param_t::weight_bf16 (c++ member)": [[281, "_CPPv4N2jd3ssd18matmul_fp8_param_t11weight_bf16E", false]], "jd::ssd::matmul_fp8_param_t::weight_f8_e4m3 (c++ member)": [[281, "_CPPv4N2jd3ssd18matmul_fp8_param_t14weight_f8_e4m3E", false]], "jd::ssd::matmul_fp8_param_t::weight_f8_e5m2 (c++ member)": [[281, "_CPPv4N2jd3ssd18matmul_fp8_param_t14weight_f8_e5m2E", false]], "jd::ssd::matmul_fp8_param_t::weight_int8 (c++ member)": [[281, "_CPPv4N2jd3ssd18matmul_fp8_param_t11weight_int8E", false]], "jd::ssd::matmul_fp8_param_t::weight_type (c++ member)": [[281, "_CPPv4N2jd3ssd18matmul_fp8_param_t11weight_typeE", false]], "jd::ssd::matmul_input (c++ type)": [[281, "_CPPv4N2jd3ssd12matmul_inputE", false]], "jd::ssd::matmul_input::input (c++ enum)": [[281, "_CPPv4N2jd3ssd12matmul_input5inputE", false]], "jd::ssd::matmul_input::input::append_sum (c++ enumerator)": [[281, "_CPPv4N2jd3ssd12matmul_input5input10APPEND_SUME", false]], "jd::ssd::matmul_input::input::matmul_io_max (c++ enumerator)": [[281, "_CPPv4N2jd3ssd12matmul_input5input13matmul_io_MAXE", false]], "jd::ssd::matmul_input::input::scale0 (c++ enumerator)": [[281, "_CPPv4N2jd3ssd12matmul_input5input6SCALE0E", false]], "jd::ssd::matmul_input::input::src0 (c++ enumerator)": [[281, "_CPPv4N2jd3ssd12matmul_input5input4SRC0E", false]], "jd::ssd::matmul_input::input::src1 (c++ enumerator)": [[281, "_CPPv4N2jd3ssd12matmul_input5input4SRC1E", false]], "jd::ssd::matmul_input::input::src2 (c++ enumerator)": [[281, "_CPPv4N2jd3ssd12matmul_input5input4SRC2E", false]], "jd::ssd::matmul_input::input::zp0 (c++ enumerator)": [[281, "_CPPv4N2jd3ssd12matmul_input5input3ZP0E", false]], "jd::ssd::matmul_io (c++ type)": [[281, "_CPPv4N2jd3ssd9matmul_ioE", false]], "jd::ssd::matmul_io::io (c++ enum)": [[281, "_CPPv4N2jd3ssd9matmul_io2ioE", false]], "jd::ssd::matmul_io::io::append_sum (c++ enumerator)": [[281, "_CPPv4N2jd3ssd9matmul_io2io10APPEND_SUME", false]], "jd::ssd::matmul_io::io::dst0 (c++ enumerator)": [[281, "_CPPv4N2jd3ssd9matmul_io2io4DST0E", false]], "jd::ssd::matmul_io::io::matmul_io_max (c++ enumerator)": [[281, "_CPPv4N2jd3ssd9matmul_io2io13matmul_io_MAXE", false]], "jd::ssd::matmul_io::io::scale0 (c++ enumerator)": [[281, "_CPPv4N2jd3ssd9matmul_io2io6SCALE0E", false]], "jd::ssd::matmul_io::io::src0 (c++ enumerator)": [[281, "_CPPv4N2jd3ssd9matmul_io2io4SRC0E", false]], "jd::ssd::matmul_io::io::src1 (c++ enumerator)": [[281, "_CPPv4N2jd3ssd9matmul_io2io4SRC1E", false]], "jd::ssd::matmul_io::io::src2 (c++ enumerator)": [[281, "_CPPv4N2jd3ssd9matmul_io2io4SRC2E", false]], "jd::ssd::matmul_io::io::zp0 (c++ enumerator)": [[281, "_CPPv4N2jd3ssd9matmul_io2io3ZP0E", false]], "jd::ssd::matmul_output (c++ type)": [[281, "_CPPv4N2jd3ssd13matmul_outputE", false]], "jd::ssd::matmul_output::output (c++ enum)": [[281, "_CPPv4N2jd3ssd13matmul_output6outputE", false]], "jd::ssd::matmul_output::output::dst0 (c++ enumerator)": [[281, "_CPPv4N2jd3ssd13matmul_output6output4DST0E", false]], "jd::ssd::matmul_param_t (c++ struct)": [[281, "_CPPv4N2jd3ssd14matmul_param_tE", false]], "jd::ssd::matmul_param_t::alpha (c++ member)": [[281, "_CPPv4N2jd3ssd14matmul_param_t5alphaE", false]], "jd::ssd::matmul_param_t::batch (c++ member)": [[281, "_CPPv4N2jd3ssd14matmul_param_t5batchE", false]], "jd::ssd::matmul_param_t::beta (c++ member)": [[281, "_CPPv4N2jd3ssd14matmul_param_t4betaE", false]], "jd::ssd::matmul_param_t::k (c++ member)": [[281, "_CPPv4N2jd3ssd14matmul_param_t1KE", false]], "jd::ssd::matmul_param_t::m (c++ member)": [[281, "_CPPv4N2jd3ssd14matmul_param_t1ME", false]], "jd::ssd::matmul_param_t::m_tile (c++ member)": [[281, "_CPPv4N2jd3ssd14matmul_param_t6m_tileE", false]], "jd::ssd::matmul_param_t::n (c++ member)": [[281, "_CPPv4N2jd3ssd14matmul_param_t1NE", false]], "jd::ssd::matmul_param_t::n_tile (c++ member)": [[281, "_CPPv4N2jd3ssd14matmul_param_t6n_tileE", false]], "jd::ssd::matmul_u8_data_t (c++ struct)": [[281, "_CPPv4N2jd3ssd16matmul_u8_data_tE", false]], "jd::ssd::matmul_u8_data_t::dst (c++ member)": [[281, "_CPPv4N2jd3ssd16matmul_u8_data_t3dstE", false]], "jd::ssd::matmul_u8_data_t::scale (c++ member)": [[281, "_CPPv4N2jd3ssd16matmul_u8_data_t5scaleE", false]], "jd::ssd::matmul_u8_data_t::src0 (c++ member)": [[281, "_CPPv4N2jd3ssd16matmul_u8_data_t4src0E", false]], "jd::ssd::matmul_u8_data_t::src1 (c++ member)": [[281, "_CPPv4N2jd3ssd16matmul_u8_data_t4src1E", false]], "jd::ssd::matmul_u8_data_t::zp (c++ member)": [[281, "_CPPv4N2jd3ssd16matmul_u8_data_t2zpE", false]], "jd::ssd::mean_var_reduce_data_t (c++ struct)": [[281, "_CPPv4N2jd3ssd22mean_var_reduce_data_tE", false]], "jd::ssd::mean_var_reduce_data_t::mean_in (c++ member)": [[281, "_CPPv4N2jd3ssd22mean_var_reduce_data_t7mean_inE", false]], "jd::ssd::mean_var_reduce_data_t::mean_out (c++ member)": [[281, "_CPPv4N2jd3ssd22mean_var_reduce_data_t8mean_outE", false]], "jd::ssd::mean_var_reduce_data_t::var_in (c++ member)": [[281, "_CPPv4N2jd3ssd22mean_var_reduce_data_t6var_inE", false]], "jd::ssd::mean_var_reduce_data_t::var_out (c++ member)": [[281, "_CPPv4N2jd3ssd22mean_var_reduce_data_t7var_outE", false]], "jd::ssd::mean_var_reduce_param_t (c++ struct)": [[281, "_CPPv4N2jd3ssd23mean_var_reduce_param_tE", false]], "jd::ssd::mean_var_reduce_param_t::bm (c++ member)": [[281, "_CPPv4N2jd3ssd23mean_var_reduce_param_t2BME", false]], "jd::ssd::mean_var_reduce_param_t::bn (c++ member)": [[281, "_CPPv4N2jd3ssd23mean_var_reduce_param_t2BNE", false]], "jd::ssd::mean_var_reduce_param_t::element_num (c++ member)": [[281, "_CPPv4N2jd3ssd23mean_var_reduce_param_t11element_numE", false]], "jd::ssd::mean_var_reduce_param_t::m (c++ member)": [[281, "_CPPv4N2jd3ssd23mean_var_reduce_param_t1ME", false]], "jd::ssd::mean_var_reduce_param_t::n (c++ member)": [[281, "_CPPv4N2jd3ssd23mean_var_reduce_param_t1NE", false]], "jd::ssd::scales (c++ member)": [[281, "_CPPv4N2jd3ssd6SCALESE", false]], "jd::ssd::seq_vnni_copy_params (c++ struct)": [[281, "_CPPv4N2jd3ssd20seq_vnni_copy_paramsE", false]], "jd::ssd::seq_vnni_copy_params::dstptr (c++ member)": [[281, "_CPPv4N2jd3ssd20seq_vnni_copy_params6dstptrE", false]], "jd::ssd::seq_vnni_copy_params::dststride (c++ member)": [[281, "_CPPv4N2jd3ssd20seq_vnni_copy_params9dststrideE", false]], "jd::ssd::seq_vnni_copy_params::k (c++ member)": [[281, "_CPPv4N2jd3ssd20seq_vnni_copy_params1kE", false]], "jd::ssd::seq_vnni_copy_params::srcptr (c++ member)": [[281, "_CPPv4N2jd3ssd20seq_vnni_copy_params6srcptrE", false]], "jd::ssd::seq_vnni_copy_params::srcstride (c++ member)": [[281, "_CPPv4N2jd3ssd20seq_vnni_copy_params9srcstrideE", false]], "jd::ssd::softmax_data_t (c++ struct)": [[281, "_CPPv4N2jd3ssd14softmax_data_tE", false]], "jd::ssd::softmax_data_t::dst (c++ member)": [[281, "_CPPv4N2jd3ssd14softmax_data_t3dstE", false]], "jd::ssd::softmax_data_t::one (c++ member)": [[281, "_CPPv4N2jd3ssd14softmax_data_t3oneE", false]], "jd::ssd::softmax_data_t::process_vec_num (c++ member)": [[281, "_CPPv4N2jd3ssd14softmax_data_t15process_vec_numE", false]], "jd::ssd::softmax_data_t::src (c++ member)": [[281, "_CPPv4N2jd3ssd14softmax_data_t3srcE", false]], "jd::ssd::softmax_data_t::tmp (c++ member)": [[281, "_CPPv4N2jd3ssd14softmax_data_t3tmpE", false]], "jd::ssd::softmax_param_t (c++ struct)": [[281, "_CPPv4N2jd3ssd15softmax_param_tE", false]], "jd::ssd::softmax_param_t::get_lut_exp_attrs (c++ member)": [[281, "_CPPv4N2jd3ssd15softmax_param_t17get_lut_exp_attrsE", false]], "jd::ssd::softmax_param_t::input_dt (c++ member)": [[281, "_CPPv4N2jd3ssd15softmax_param_t8input_dtE", false]], "jd::ssd::softmax_param_t::output_dt (c++ member)": [[281, "_CPPv4N2jd3ssd15softmax_param_t9output_dtE", false]], "jd::ssd::softmax_param_t::postop_attrs (c++ member)": [[281, "_CPPv4N2jd3ssd15softmax_param_t12postop_attrsE", false]], "jd::ssd::softmax_param_t::scalar_num (c++ member)": [[281, "_CPPv4N2jd3ssd15softmax_param_t10scalar_numE", false]], "jd::ssd::softmax_param_t::sepc_type (c++ member)": [[281, "_CPPv4N2jd3ssd15softmax_param_t9sepc_typeE", false]], "jd::ssd::softmax_param_t::vec_align_len (c++ member)": [[281, "_CPPv4N2jd3ssd15softmax_param_t13vec_align_lenE", false]], "jd::ssd::softmax_param_t::vec_num_per_thr (c++ member)": [[281, "_CPPv4N2jd3ssd15softmax_param_t15vec_num_per_thrE", false]], "jd::ssd::softmax_param_t::vec_num_tail_thr (c++ member)": [[281, "_CPPv4N2jd3ssd15softmax_param_t16vec_num_tail_thrE", false]], "jd::ssd::softmax_param_t::vec_tail_len (c++ member)": [[281, "_CPPv4N2jd3ssd15softmax_param_t12vec_tail_lenE", false]], "jd::ssd::sparse_scheme (c++ enum)": [[281, "_CPPv4N2jd3ssd13sparse_schemeE", false]], "jd::ssd::sparse_scheme::dense_x_sparse (c++ enumerator)": [[281, "_CPPv4N2jd3ssd13sparse_scheme14dense_x_sparseE", false]], "jd::ssd::sparse_scheme::sparse_x_dense (c++ enumerator)": [[281, "_CPPv4N2jd3ssd13sparse_scheme14sparse_x_denseE", false]], "jd::ssd::sparse_scheme::sparse_x_sparse (c++ enumerator)": [[281, "_CPPv4N2jd3ssd13sparse_scheme15sparse_x_sparseE", false]], "jd::ssd::sparse_scheme::undef (c++ enumerator)": [[281, "_CPPv4N2jd3ssd13sparse_scheme5undefE", false]], "jd::ssd::spec_softmax_type (c++ enum)": [[281, "_CPPv4N2jd3ssd17spec_softmax_typeE", false]], "jd::ssd::spec_softmax_type::lut (c++ enumerator)": [[281, "_CPPv4N2jd3ssd17spec_softmax_type3lutE", false]], "jd::ssd::spec_translnorm_type (c++ enum)": [[281, "_CPPv4N2jd3ssd20spec_translnorm_typeE", false]], "jd::ssd::spec_translnorm_type::direct (c++ enumerator)": [[281, "_CPPv4N2jd3ssd20spec_translnorm_type6directE", false]], "jd::ssd::spec_translnorm_type::normal (c++ enumerator)": [[281, "_CPPv4N2jd3ssd20spec_translnorm_type6normalE", false]], "jd::ssd::src (c++ member)": [[281, "_CPPv4N2jd3ssd3SRCE", false]], "jd::ssd::subfunc_level (c++ enum)": [[281, "_CPPv4N2jd3ssd13subfunc_levelE", false]], "jd::ssd::subfunc_level::kdims (c++ enumerator)": [[281, "_CPPv4N2jd3ssd13subfunc_level5kdimsE", false]], "jd::ssd::subfunc_level::non_kdims (c++ enumerator)": [[281, "_CPPv4N2jd3ssd13subfunc_level9non_kdimsE", false]], "jd::ssd::subfunc_level::none (c++ enumerator)": [[281, "_CPPv4N2jd3ssd13subfunc_level4noneE", false]], "jd::ssd::subfunc_level::subfunc_level_max (c++ enumerator)": [[281, "_CPPv4N2jd3ssd13subfunc_level17subfunc_level_MAXE", false]], "jd::ssd::transpose_copy_params (c++ struct)": [[281, "_CPPv4N2jd3ssd21transpose_copy_paramsE", false]], "jd::ssd::transpose_copy_params::dstptr (c++ member)": [[281, "_CPPv4N2jd3ssd21transpose_copy_params6dstptrE", false]], "jd::ssd::transpose_copy_params::dststride (c++ member)": [[281, "_CPPv4N2jd3ssd21transpose_copy_params9dststrideE", false]], "jd::ssd::transpose_copy_params::k (c++ member)": [[281, "_CPPv4N2jd3ssd21transpose_copy_params1kE", false]], "jd::ssd::transpose_copy_params::srcptr (c++ member)": [[281, "_CPPv4N2jd3ssd21transpose_copy_params6srcptrE", false]], "jd::ssd::transpose_copy_params::srcstride (c++ member)": [[281, "_CPPv4N2jd3ssd21transpose_copy_params9srcstrideE", false]], "jd::ssd::transpose_mha_io (c++ type)": [[281, "_CPPv4N2jd3ssd16transpose_mha_ioE", false]], "jd::ssd::transpose_mha_io::io (c++ enum)": [[281, "_CPPv4N2jd3ssd16transpose_mha_io2ioE", false]], "jd::ssd::transpose_mha_io::io::batch (c++ enumerator)": [[281, "_CPPv4N2jd3ssd16transpose_mha_io2io5BATCHE", false]], "jd::ssd::transpose_mha_io::io::dst (c++ enumerator)": [[281, "_CPPv4N2jd3ssd16transpose_mha_io2io3DSTE", false]], "jd::ssd::transpose_mha_io::io::head_num (c++ enumerator)": [[281, "_CPPv4N2jd3ssd16transpose_mha_io2io8HEAD_NUME", false]], "jd::ssd::transpose_mha_io::io::head_size (c++ enumerator)": [[281, "_CPPv4N2jd3ssd16transpose_mha_io2io9HEAD_SIZEE", false]], "jd::ssd::transpose_mha_io::io::mask (c++ enumerator)": [[281, "_CPPv4N2jd3ssd16transpose_mha_io2io4MASKE", false]], "jd::ssd::transpose_mha_io::io::scale_dst (c++ enumerator)": [[281, "_CPPv4N2jd3ssd16transpose_mha_io2io9SCALE_DSTE", false]], "jd::ssd::transpose_mha_io::io::scale_k (c++ enumerator)": [[281, "_CPPv4N2jd3ssd16transpose_mha_io2io7SCALE_KE", false]], "jd::ssd::transpose_mha_io::io::scale_q (c++ enumerator)": [[281, "_CPPv4N2jd3ssd16transpose_mha_io2io7SCALE_QE", false]], "jd::ssd::transpose_mha_io::io::scale_v (c++ enumerator)": [[281, "_CPPv4N2jd3ssd16transpose_mha_io2io7SCALE_VE", false]], "jd::ssd::transpose_mha_io::io::seq_len (c++ enumerator)": [[281, "_CPPv4N2jd3ssd16transpose_mha_io2io7SEQ_LENE", false]], "jd::ssd::transpose_mha_io::io::sl_pad (c++ enumerator)": [[281, "_CPPv4N2jd3ssd16transpose_mha_io2io6SL_PADE", false]], "jd::ssd::transpose_mha_io::io::src_k (c++ enumerator)": [[281, "_CPPv4N2jd3ssd16transpose_mha_io2io5SRC_KE", false]], "jd::ssd::transpose_mha_io::io::src_q (c++ enumerator)": [[281, "_CPPv4N2jd3ssd16transpose_mha_io2io5SRC_QE", false]], "jd::ssd::transpose_mha_io::io::src_v (c++ enumerator)": [[281, "_CPPv4N2jd3ssd16transpose_mha_io2io5SRC_VE", false]], "jd::ssd::transpose_mha_io::io::tmp2m (c++ enumerator)": [[281, "_CPPv4N2jd3ssd16transpose_mha_io2io5TMP2ME", false]], "jd::ssd::transpose_mha_io::io::transpose_mha_io_max (c++ enumerator)": [[281, "_CPPv4N2jd3ssd16transpose_mha_io2io20transpose_mha_io_MAXE", false]], "jd::ssd::transpose_mha_io::io::zp_dst (c++ enumerator)": [[281, "_CPPv4N2jd3ssd16transpose_mha_io2io6ZP_DSTE", false]], "jd::ssd::transpose_mha_step1_params (c++ struct)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step1_paramsE", false]], "jd::ssd::transpose_mha_step1_params::astep (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step1_params5astepE", false]], "jd::ssd::transpose_mha_step1_params::batchk (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step1_params6batchkE", false]], "jd::ssd::transpose_mha_step1_params::cbatchstep (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step1_params10cbatchstepE", false]], "jd::ssd::transpose_mha_step1_params::cfg (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step1_params3cfgE", false]], "jd::ssd::transpose_mha_step1_params::cstep (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step1_params5cstepE", false]], "jd::ssd::transpose_mha_step1_params::expsum (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step1_params6expsumE", false]], "jd::ssd::transpose_mha_step1_params::k (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step1_params1kE", false]], "jd::ssd::transpose_mha_step1_params::m (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step1_params1mE", false]], "jd::ssd::transpose_mha_step1_params::mata (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step1_params4matAE", false]], "jd::ssd::transpose_mha_step1_params::matb (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step1_params4matBE", false]], "jd::ssd::transpose_mha_step1_params::matc (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step1_params4matCE", false]], "jd::ssd::transpose_mha_step1_params::matd (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step1_params4matDE", false]], "jd::ssd::transpose_mha_step1_params::scaleab (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step1_params7scaleABE", false]], "jd::ssd::transpose_mha_step1_params::sumstep (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step1_params7sumstepE", false]], "jd::ssd::transpose_mha_step2_params (c++ struct)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step2_paramsE", false]], "jd::ssd::transpose_mha_step2_params::dstptr (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step2_params6dstptrE", false]], "jd::ssd::transpose_mha_step2_params::dststride (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step2_params9dststrideE", false]], "jd::ssd::transpose_mha_step2_params::k (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step2_params1kE", false]], "jd::ssd::transpose_mha_step2_params::srcptr (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step2_params6srcptrE", false]], "jd::ssd::transpose_mha_step2_params::srcstride (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step2_params9srcstrideE", false]], "jd::ssd::transpose_mha_step2_params::sumptr (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step2_params6sumptrE", false]], "jd::ssd::transpose_mha_step3_params (c++ struct)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step3_paramsE", false]], "jd::ssd::transpose_mha_step3_params::astep (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step3_params5astepE", false]], "jd::ssd::transpose_mha_step3_params::cfg (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step3_params3cfgE", false]], "jd::ssd::transpose_mha_step3_params::cstep (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step3_params5cstepE", false]], "jd::ssd::transpose_mha_step3_params::k (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step3_params1kE", false]], "jd::ssd::transpose_mha_step3_params::mata (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step3_params4matAE", false]], "jd::ssd::transpose_mha_step3_params::matb (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step3_params4matBE", false]], "jd::ssd::transpose_mha_step3_params::matc (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step3_params4matCE", false]], "jd::ssd::transpose_mha_step3_params::scaleab (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step3_params7scaleABE", false]], "jd::ssd::transpose_mha_step3_params::scalec (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step3_params6scaleCE", false]], "jd::ssd::transpose_mha_step3_params::zeropointc (c++ member)": [[281, "_CPPv4N2jd3ssd26transpose_mha_step3_params10zeropointCE", false]], "jd::ssd::vnni_data_t (c++ struct)": [[281, "_CPPv4I0EN2jd3ssd11vnni_data_tE", false]], "jd::ssd::vnni_data_t::ptr_bias (c++ member)": [[281, "_CPPv4N2jd3ssd11vnni_data_t8ptr_biasE", false]], "jd::ssd::vnni_data_t::ptr_dense (c++ member)": [[281, "_CPPv4N2jd3ssd11vnni_data_t9ptr_denseE", false]], "jd::ssd::vnni_data_t::ptr_dst (c++ member)": [[281, "_CPPv4N2jd3ssd11vnni_data_t7ptr_dstE", false]], "jd::ssd::vnni_data_t::ptr_dst_m1 (c++ member)": [[281, "_CPPv4N2jd3ssd11vnni_data_t10ptr_dst_m1E", false]], "jd::ssd::vnni_data_t::ptr_dst_m2 (c++ member)": [[281, "_CPPv4N2jd3ssd11vnni_data_t10ptr_dst_m2E", false]], "jd::ssd::vnni_data_t::ptr_scales (c++ member)": [[281, "_CPPv4N2jd3ssd11vnni_data_t10ptr_scalesE", false]], "jd::ssd::vnni_param_t (c++ struct)": [[281, "_CPPv4N2jd3ssd12vnni_param_tE", false]], "jd::ssd::vnni_param_t::append_sum (c++ member)": [[281, "_CPPv4N2jd3ssd12vnni_param_t10append_sumE", false]], "jd::ssd::vnni_param_t::blocksize (c++ member)": [[281, "_CPPv4N2jd3ssd12vnni_param_t9blocksizeE", false]], "jd::ssd::vnni_param_t::bm (c++ member)": [[281, "_CPPv4N2jd3ssd12vnni_param_t2BME", false]], "jd::ssd::vnni_param_t::bn (c++ member)": [[281, "_CPPv4N2jd3ssd12vnni_param_t2BNE", false]], "jd::ssd::vnni_param_t::has_bias (c++ member)": [[281, "_CPPv4N2jd3ssd12vnni_param_t8has_biasE", false]], "jd::ssd::vnni_param_t::im_start (c++ member)": [[281, "_CPPv4N2jd3ssd12vnni_param_t8im_startE", false]], "jd::ssd::vnni_param_t::indices (c++ member)": [[281, "_CPPv4N2jd3ssd12vnni_param_t7indicesE", false]], "jd::ssd::vnni_param_t::indptr (c++ member)": [[281, "_CPPv4N2jd3ssd12vnni_param_t6indptrE", false]], "jd::ssd::vnni_param_t::output_type (c++ member)": [[281, "_CPPv4N2jd3ssd12vnni_param_t11output_typeE", false]], "jd::ssd::vnni_param_t::postop_attrs (c++ member)": [[281, "_CPPv4N2jd3ssd12vnni_param_t12postop_attrsE", false]], "jd::ssd::vnni_param_t::sub_func (c++ member)": [[281, "_CPPv4N2jd3ssd12vnni_param_t8sub_funcE", false]], "jd::ssd::vnni_param_t::tile_w (c++ member)": [[281, "_CPPv4N2jd3ssd12vnni_param_t6tile_wE", false]], "jd::ssd::vnni_param_t::weight (c++ member)": [[281, "_CPPv4N2jd3ssd12vnni_param_t6weightE", false]], "jd::ssd::vnni_param_t::welford (c++ member)": [[281, "_CPPv4N2jd3ssd12vnni_param_t7welfordE", false]], "jd::ssd::wei (c++ member)": [[281, "_CPPv4N2jd3ssd3WEIE", false]], "jd::ssd::work_space (c++ member)": [[281, "_CPPv4N2jd3ssd10WORK_SPACEE", false]], "jd::transpose_matmul (c++ class)": [[279, "_CPPv4N2jd16transpose_matmulE", false]], "jd::transpose_matmul::transpose_matmul (c++ function)": [[279, "_CPPv4N2jd16transpose_matmul16transpose_matmulERK17kernel_desc_proxy", false], [279, "_CPPv4N2jd16transpose_matmul16transpose_matmulEv", false]], "jd::transpose_matmul::~transpose_matmul (c++ function)": [[279, "_CPPv4N2jd16transpose_matmulD0Ev", false]], "jd::transpose_matmul_desc (c++ class)": [[279, "_CPPv4N2jd21transpose_matmul_descE", false]], "jd::transpose_matmul_desc::transpose_matmul_desc (c++ function)": [[279, "_CPPv4N2jd21transpose_matmul_desc21transpose_matmul_descERK13operator_desc", false], [279, "_CPPv4N2jd21transpose_matmul_desc21transpose_matmul_descEv", false]], "jd::transpose_matmul_desc::~transpose_matmul_desc (c++ function)": [[279, "_CPPv4N2jd21transpose_matmul_descD0Ev", false]], "jd::transpose_mha (c++ class)": [[279, "_CPPv4N2jd13transpose_mhaE", false]], "jd::transpose_mha::transpose_mha (c++ function)": [[279, "_CPPv4N2jd13transpose_mha13transpose_mhaERK17kernel_desc_proxy", false], [279, "_CPPv4N2jd13transpose_mha13transpose_mhaEv", false]], "jd::transpose_mha::~transpose_mha (c++ function)": [[279, "_CPPv4N2jd13transpose_mhaD0Ev", false]], "jd::transpose_mha_desc (c++ class)": [[279, "_CPPv4N2jd18transpose_mha_descE", false]], "jd::transpose_mha_desc::transpose_mha_desc (c++ function)": [[279, "_CPPv4N2jd18transpose_mha_desc18transpose_mha_descERK13operator_desc", false], [279, "_CPPv4N2jd18transpose_mha_desc18transpose_mha_descEv", false]], "jd::transpose_mha_desc::~transpose_mha_desc (c++ function)": [[279, "_CPPv4N2jd18transpose_mha_descD0Ev", false]], "lastlayershape (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.last_layer_shape)": [[160, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.last_layer_shape.LastLayerShape", false]], "latrange (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.LatRange", false]], "layernorm (class in intel_extension_for_transformers.transformers.runtime.compile.ops.layer_normalization)": [[86, "intel_extension_for_transformers.transformers.runtime.compile.ops.layer_normalization.LayerNorm", false]], "layernorm (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm)": [[161, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm.LayerNorm", false]], "layernormalization (class in intel_extension_for_transformers.transformers.runtime.compile.ops.layer_normalization)": [[86, "intel_extension_for_transformers.transformers.runtime.compile.ops.layer_normalization.LayerNormalization", false]], "layernormwithreducemean (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm_with_reduce_mean)": [[162, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm_with_reduce_mean.LayerNormWithReduceMean", false]], "layernormwithtranspose (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm_with_transpose)": [[163, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm_with_transpose.LayerNormWithTranspose", false]], "lazyimport (class in intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.LazyImport", false]], "list2str() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.list2str", false]], "listconstruct (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.ListConstruct", false]], "listunpack (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.ListUnpack", false]], "llamaattention (class in intel_extension_for_transformers.transformers.kv_cache_compression.models.modeling_llama)": [[32, "intel_extension_for_transformers.transformers.kv_cache_compression.models.modeling_llama.LlamaAttention", false]], "llamaembeddings (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_embeding)": [[164, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_embeding.LlamaEmbeddings", false]], "llamaflashattention2 (class in intel_extension_for_transformers.transformers.kv_cache_compression.models.modeling_llama)": [[32, "intel_extension_for_transformers.transformers.kv_cache_compression.models.modeling_llama.LlamaFlashAttention2", false]], "llamamatmulwithtranspose (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_matmulwithtranspose)": [[165, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_matmulwithtranspose.LlamaMatMulWithTranspose", false]], "llamapostprocess (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_postprocess)": [[166, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_postprocess.LlamaPostprocess", false]], "llamaroraryposemb (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_rotary_pos_emb)": [[167, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_rotary_pos_emb.LlamaRoraryPosEmb", false]], "llamasdpaattention (class in intel_extension_for_transformers.transformers.kv_cache_compression.models.modeling_llama)": [[32, "intel_extension_for_transformers.transformers.kv_cache_compression.models.modeling_llama.LlamaSdpaAttention", false]], "load() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.stat method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Stat.load", false]], "load_cached_state() (in module intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.load_cached_state", false]], "load_state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.bincount method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Bincount.load_state_dict", false]], "load_state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.combinedstat method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.CombinedStat.load_state_dict", false]], "load_state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.covariance method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Covariance.load_state_dict", false]], "load_state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.crosscovariance method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.CrossCovariance.load_state_dict", false]], "load_state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.crossiou method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.CrossIoU.load_state_dict", false]], "load_state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.history method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.History.load_state_dict", false]], "load_state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.iou method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.IoU.load_state_dict", false]], "load_state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.mean method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Mean.load_state_dict", false]], "load_state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.quantile method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Quantile.load_state_dict", false]], "load_state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.secondmoment method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.SecondMoment.load_state_dict", false]], "load_state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.stat method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Stat.load_state_dict", false]], "load_state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.variance method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Variance.load_state_dict", false]], "load_store() (intel_extension_for_transformers.transformers.dynamic.evolution.evolution method)": [[30, "intel_extension_for_transformers.transformers.dynamic.evolution.Evolution.load_store", false]], "load_tf_weights_in_bert() (in module intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.load_tf_weights_in_bert", false]], "loader (class in intel_extension_for_transformers.transformers.runtime.compile.loaders.loader)": [[60, "intel_extension_for_transformers.transformers.runtime.compile.loaders.loader.Loader", false]], "log() (in module intel_extension_for_transformers.transformers.runtime.compile.logger)": [[61, "intel_extension_for_transformers.transformers.runtime.compile.logger.log", false]], "logger (class in intel_extension_for_transformers.transformers.runtime.compile.logger)": [[61, "intel_extension_for_transformers.transformers.runtime.compile.logger.Logger", false]], "logsoftmax (class in intel_extension_for_transformers.transformers.runtime.compile.ops.log_softmax)": [[87, "intel_extension_for_transformers.transformers.runtime.compile.ops.log_softmax.LogSoftmax", false]], "loop (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Loop", false]], "loss_boxes() (models.detr.setcriterion method)": [[256, "models.detr.SetCriterion.loss_boxes", false]], "loss_boxes() (models.detr_multi.setcriterion method)": [[257, "models.detr_multi.SetCriterion.loss_boxes", false]], "loss_cardinality() (models.detr.setcriterion method)": [[256, "models.detr.SetCriterion.loss_cardinality", false]], "loss_cardinality() (models.detr_multi.setcriterion method)": [[257, "models.detr_multi.SetCriterion.loss_cardinality", false]], "loss_labels() (models.detr.setcriterion method)": [[256, "models.detr.SetCriterion.loss_labels", false]], "loss_labels() (models.detr_multi.setcriterion method)": [[257, "models.detr_multi.SetCriterion.loss_labels", false]], "loss_masks() (models.detr.setcriterion method)": [[256, "models.detr.SetCriterion.loss_masks", false]], "loss_masks() (models.detr_multi.setcriterion method)": [[257, "models.detr_multi.SetCriterion.loss_masks", false]], "loweralltuples (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.lower_all_tuples)": [[168, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.lower_all_tuples.LowerAllTuples", false]], "main_eval_only": [[253, "module-main_eval_only", false]], "main_parse_and_eval": [[254, "module-main_parse_and_eval", false]], "make_loader() (in module intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.make_loader", false]], "makeiterator (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.MakeIterator", false]], "mapandbatchdataset (class in intel_extension_for_transformers.transformers.runtime.compile.ops.map_and_batch_dataset)": [[88, "intel_extension_for_transformers.transformers.runtime.compile.ops.map_and_batch_dataset.MapAndBatchDataset", false]], "masked_fill (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Masked_fill", false]], "maskheadsmallconv (class in models.segmentation)": [[260, "models.segmentation.MaskHeadSmallConv", false]], "masks_to_boxes() (in module util.box_ops)": [[263, "util.box_ops.masks_to_boxes", false]], "matmul (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Matmul", false]], "matmul (class in intel_extension_for_transformers.transformers.runtime.compile.ops.matmul)": [[89, "intel_extension_for_transformers.transformers.runtime.compile.ops.matmul.MatMul", false]], "matmulwithbias (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.MatMulWithBias", false]], "matmulwithbias (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias)": [[169, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias.MatMulWithBias", false]], "matmulwithbiasadd (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.MatMulWithBiasAdd", false]], "matmulwithbiasadd (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_add)": [[170, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_add.MatMulWithBiasAdd", false]], "matmulwithbiasgelu (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.MatMulWithBiasGelu", false]], "matmulwithbiasgelu (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_gelu)": [[171, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_gelu.MatMulWithBiasGelu", false]], "matmulwithbiasrelu (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.MatMulWithBiasRelu", false]], "matmulwithbiasrelu (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_relu)": [[172, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_relu.MatMulWithBiasRelu", false]], "matmulwithbiassigmoid (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.MatMulWithBiasSigmoid", false]], "matmulwithbiassigmoid (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_sigmoid)": [[173, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_sigmoid.MatMulWithBiasSigmoid", false]], "matmulwithbiastanh (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.MatMulWithBiasTanh", false]], "matmulwithbiastanh (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_tanh)": [[174, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_tanh.MatmulWithBiasTanh", false]], "matmulwithbiasunsqueeze (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_unsqueeze)": [[175, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_unsqueeze.MatMulWithBiasUnsqueeze", false]], "matmulwithtranspose (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_transpose)": [[176, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_transpose.MatMulWithTranspose", false]], "matmulwithtranspose (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_transpose_scale_add)": [[177, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_transpose_scale_add.MatMulWithTranspose", false]], "max (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Max", false]], "mean (class in intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Mean", false]], "mean (class in intel_extension_for_transformers.transformers.runtime.compile.ops.mean)": [[90, "intel_extension_for_transformers.transformers.runtime.compile.ops.mean.Mean", false]], "mergedembeddingbag (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.MergedEmbeddingbag", false]], "mergedembeddingbag (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.merged_embeddingbag)": [[178, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.merged_embeddingbag.MergedEmbeddingbag", false]], "metric (class in intel_extension_for_transformers.transformers.utils.metrics)": [[250, "intel_extension_for_transformers.transformers.utils.metrics.Metric", false]], "mhattentionmap (class in models.segmentation)": [[260, "models.segmentation.MHAttentionMap", false]], "mkdir() (in module intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.util)": [[21, "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.util.mkdir", false]], "mkdirs() (in module intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.util)": [[21, "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.util.mkdirs", false]], "mlp (class in models.detr)": [[256, "models.detr.MLP", false]], "mlp (class in models.detr_multi)": [[257, "models.detr_multi.MLP", false]], "mmr (intel_extension_for_transformers.langchain.langchain_community.retrievers.child_parent_retriever.searchtype attribute)": [[2, "intel_extension_for_transformers.langchain.langchain_community.retrievers.child_parent_retriever.SearchType.mmr", false]], "model (intel_extension_for_transformers.transformers.pruner.pruning.pruning attribute)": [[47, "intel_extension_for_transformers.transformers.pruner.pruning.Pruning.model", false]], "modelarguments (class in intel_extension_for_transformers.neural_chat.config)": [[5, "intel_extension_for_transformers.neural_chat.config.ModelArguments", false]], "modeldataset (class in intel_extension_for_transformers.transformers.runtime.compile.ops.model_dataset)": [[92, "intel_extension_for_transformers.transformers.runtime.compile.ops.model_dataset.ModelDataset", false]], "models.backbone": [[255, "module-models.backbone", false]], "models.detr": [[256, "module-models.detr", false]], "models.detr_multi": [[257, "module-models.detr_multi", false]], "models.matcher": [[258, "module-models.matcher", false]], "models.position_encoding": [[259, "module-models.position_encoding", false]], "models.segmentation": [[260, "module-models.segmentation", false]], "models.transformer": [[261, "module-models.transformer", false]], "modelsize() (intel_extension_for_transformers.transformers.utils.objectives.objective static method)": [[251, "intel_extension_for_transformers.transformers.utils.objectives.Objective.modelsize", false]], "modify_node_connections() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.modify_node_connections", false]], "module": [[0, "module-conversation", false], [1, "module-gaudi_spawn", false], [2, "module-intel_extension_for_transformers.langchain.langchain_community.retrievers.child_parent_retriever", false], [3, "module-intel_extension_for_transformers.langchain.langchain_community.vectorstores.chroma", false], [4, "module-intel_extension_for_transformers.neural_chat.chatbot", false], [5, "module-intel_extension_for_transformers.neural_chat.config", false], [6, "module-intel_extension_for_transformers.neural_chat.config_logging", false], [7, "module-intel_extension_for_transformers.neural_chat.errorcode", false], [8, "module-intel_extension_for_transformers.neural_chat.pipeline", false], [9, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.image2image.instructpix2pix_pipeline", false], [10, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.memory.memory", false], [11, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.detector.intent_detection", false], [12, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.detector.query_explainer", false], [13, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.parser.parser", false], [14, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.retriever_adapter", false], [15, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.security.safety_checker", false], [16, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.bfm", false], [17, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks", false], [18, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util", false], [19, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.load_mats", false], [20, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.preprocess", false], [21, "module-intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.util", false], [22, "module-intel_extension_for_transformers.neural_chat.server.restful.openai_protocol", false], [23, "module-intel_extension_for_transformers.neural_chat.tools.rome.repr_tools", false], [24, "module-intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook", false], [25, "module-intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats", false], [26, "module-intel_extension_for_transformers.tools.utils", false], [27, "module-intel_extension_for_transformers.transformers.benchmark", false], [28, "module-intel_extension_for_transformers.transformers.config", false], [29, "module-intel_extension_for_transformers.transformers.dynamic.drop_and_restore_utils", false], [30, "module-intel_extension_for_transformers.transformers.dynamic.evolution", false], [31, "module-intel_extension_for_transformers.transformers.dynamic", false], [32, "module-intel_extension_for_transformers.transformers.kv_cache_compression.models.modeling_llama", false], [33, "module-intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode", false], [34, "module-intel_extension_for_transformers.transformers.modeling", false], [35, "module-intel_extension_for_transformers.transformers.modeling.model", false], [36, "module-intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic", false], [37, "module-intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.bart.modeling_bart", false], [38, "module-intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.llama.pos_shift_llama", false], [39, "module-intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mistral.modeling_mistral", false], [40, "module-intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral.modeling_mixtral", false], [41, "module-intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.phi.modeling_phi", false], [42, "module-intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.swin.modeling_swin", false], [43, "module-intel_extension_for_transformers.transformers.modeling.modeling_gaudi.streaming_llm", false], [44, "module-intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic", false], [45, "module-intel_extension_for_transformers.transformers.pipeline", false], [46, "module-intel_extension_for_transformers.transformers.pruner", false], [47, "module-intel_extension_for_transformers.transformers.pruner.pruning", false], [48, "module-intel_extension_for_transformers.transformers.quantization", false], [49, "module-intel_extension_for_transformers.transformers.runtime.compile.compile", false], [50, "module-intel_extension_for_transformers.transformers.runtime.compile.extractors.extractor", false], [51, "module-intel_extension_for_transformers.transformers.runtime.compile.extractors", false], [52, "module-intel_extension_for_transformers.transformers.runtime.compile.extractors.onnx_extractor", false], [53, "module-intel_extension_for_transformers.transformers.runtime.compile.extractors.tf_extractor", false], [54, "module-intel_extension_for_transformers.transformers.runtime.compile.extractors.torch_extractor", false], [55, "module-intel_extension_for_transformers.transformers.runtime.compile.graph.graph", false], [56, "module-intel_extension_for_transformers.transformers.runtime.compile.graph", false], [57, "module-intel_extension_for_transformers.transformers.runtime.compile.graph_utils", false], [58, "module-intel_extension_for_transformers.transformers.runtime.compile", false], [59, "module-intel_extension_for_transformers.transformers.runtime.compile.loaders", false], [60, "module-intel_extension_for_transformers.transformers.runtime.compile.loaders.loader", false], [61, "module-intel_extension_for_transformers.transformers.runtime.compile.logger", false], [62, "module-intel_extension_for_transformers.transformers.runtime.compile.onnx_utils", false], [63, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.all", false], [64, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.assert", false], [65, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.baddbmm", false], [66, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul", false], [67, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul_v2", false], [68, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.bias_add", false], [69, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.cast", false], [70, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.concat", false], [71, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.conv", false], [72, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.cos", false], [73, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops", false], [74, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.expand_dims", false], [75, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.fused_batch_matmul_v2", false], [76, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.fused_batch_norm_v3", false], [77, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.fused_gemm", false], [78, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.fused_matmul", false], [79, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.gather", false], [80, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.gather_elements", false], [81, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.gelu", false], [82, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.gemm", false], [83, "module-intel_extension_for_transformers.transformers.runtime.compile.ops", false], [84, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_get_next", false], [85, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_v2", false], [86, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.layer_normalization", false], [87, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.log_softmax", false], [88, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.map_and_batch_dataset", false], [89, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.matmul", false], [90, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.mean", false], [91, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.mkl_layer_norm", false], [92, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.model_dataset", false], [93, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.one_hot", false], [94, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.onnx_input", false], [95, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.op", false], [96, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.optimize_dataset", false], [97, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.pack", false], [98, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.padding_sequence", false], [99, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.placeholder", false], [100, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.pos_embed", false], [101, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.pow", false], [102, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_linear", false], [103, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_v2", false], [104, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.quantized_fused_matmul_and_dequantize", false], [105, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.quantized_matmul_with_bias_and_dequantize", false], [106, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_mean", false], [107, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_sum", false], [108, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.reorder", false], [109, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.reshape", false], [110, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.resize", false], [111, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.rsub", false], [112, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.scatter_elements", false], [113, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.shape", false], [114, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.sin", false], [115, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.size", false], [116, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.slice_position_ids", false], [117, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.softmax", false], [118, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.split", false], [119, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.squeeze", false], [120, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.strided_slice", false], [121, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.tensor", false], [122, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.top_k", false], [123, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.transpose", false], [124, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.unpack", false], [125, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.unsqueeze", false], [126, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.view", false], [127, "module-intel_extension_for_transformers.transformers.runtime.compile.ops.where", false], [128, "module-intel_extension_for_transformers.transformers.runtime.compile.optimizer", false], [129, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.InnerproductReshapeFusion", false], [130, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.add_cls_token", false], [131, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.add_embeddings", false], [132, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.arangewithreciprocal", false], [133, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_AttentionMaskAddReshape", false], [134, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_ConstantOfShapeWithMul", false], [135, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_QKVPreReshape", false], [136, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_QKVReshape", false], [137, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_WeightReshapeTo4D", false], [138, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_mask_length_adaptive_keep_indices", false], [139, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_output_layer_norm_length_adaptive_keep_indices", false], [140, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_reshape", false], [141, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.cast_to", false], [142, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.collect_quant_info", false], [143, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.conv_reshape", false], [144, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.decoder_attn_reshape", false], [145, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.einsumwitharange", false], [146, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.embeddingbag", false], [147, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.embeddings_to_2d_before_inner_product", false], [148, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.gelu", false], [149, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.generate_sequence", false], [150, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph", false], [151, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithbiasgelu", false], [152, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithslice", false], [153, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithswish", false], [154, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.input_data", false], [155, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.input_file", false], [156, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.insert_bf16_node", false], [157, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.insert_quant_node", false], [158, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.int8_bf16_mixed_precision_checker", false], [159, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.interact_features", false], [160, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.last_layer_shape", false], [161, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm", false], [162, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm_with_reduce_mean", false], [163, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm_with_transpose", false], [164, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_embeding", false], [165, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_matmulwithtranspose", false], [166, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_postprocess", false], [167, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_rotary_pos_emb", false], [168, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.lower_all_tuples", false], [169, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias", false], [170, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_add", false], [171, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_gelu", false], [172, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_relu", false], [173, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_sigmoid", false], [174, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_tanh", false], [175, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_unsqueeze", false], [176, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_transpose", false], [177, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_transpose_scale_add", false], [178, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.merged_embeddingbag", false], [179, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.neox_reorder_change", false], [180, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.neox_rotary_pos_emb", false], [181, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.operator_adaptor", false], [182, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.output_data", false], [183, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.padding_sequence", false], [184, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.pattern", false], [185, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.position_embeddings", false], [186, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.position_embeddings_v1", false], [187, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.qkv_merge", false], [188, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.qkv_reshape", false], [189, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quant_gather_to_bf16", false], [190, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quantize_fusion", false], [191, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quantized_graph_dtype_refactor", false], [192, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_constant_op", false], [193, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_last_view", false], [194, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_range", false], [195, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_unused_operator", false], [196, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_zeros", false], [197, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.removeslice", false], [198, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_after_restore_hidden_states", false], [199, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_before_and_after_attention_out_layer_norm_gather_elements", false], [200, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_before_restore_hidden_states", false], [201, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_fusion", false], [202, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.restore_hidden_states_in_length_adaptive_update_indices", false], [203, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.rms_norm", false], [204, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.rotary_pos_emb", false], [205, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.slicemask", false], [206, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_ExplicitNHWCTranspose", false], [207, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_ExplicitNHWCTransposeQAT", false], [208, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_MHAReshape", false], [209, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_QuantizeFusion", false], [210, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_ReshapeFusion", false], [211, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_bf16Convert", false], [212, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_collectQDQInfo", false], [213, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_insertQuantNode", false], [214, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.start_end_logits", false], [215, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.subgraph_matcher", false], [216, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncdoer_word_embedding", false], [217, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_AttentionMaskAddReshape", false], [218, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_AttentionReshape", false], [219, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_KVReshape", false], [220, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_MulReshape", false], [221, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_QReshape", false], [222, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_SoftmaxReshape", false], [223, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_causal_attention_mask", false], [224, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.token_type_embeddings", false], [225, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.token_type_embeddings_v1", false], [226, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_embedding", false], [227, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_ip_insert_bias", false], [228, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_unpack_baddbmm", false], [229, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torchinsertbf16node", false], [230, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torchpaddingsquence", false], [231, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_AttentionMaskAddReshape", false], [232, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_ConstantOfShapeWithMul", false], [233, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_FFNSlice", false], [234, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_FFNSlice_1", false], [235, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_QKVPreReshape", false], [236, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_QKVReshape", false], [237, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_QKVReshape4D", false], [238, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_encoderHiddenStatesReshape", false], [239, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_getSampleBatch", false], [240, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_sampleSlice", false], [241, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transpose_batch_matmul", false], [242, "module-intel_extension_for_transformers.transformers.runtime.compile.sub_graph.word_embeddings", false], [243, "module-intel_extension_for_transformers.transformers.runtime.compile.tf_utils", false], [244, "module-intel_extension_for_transformers.transformers.runtime.compile.torch_utils", false], [245, "module-intel_extension_for_transformers.transformers.runtime", false], [246, "module-intel_extension_for_transformers.transformers.trainer", false], [247, "module-intel_extension_for_transformers.transformers.utils.config", false], [248, "module-intel_extension_for_transformers.transformers.utils.get_throughput", false], [249, "module-intel_extension_for_transformers.transformers.utils", false], [250, "module-intel_extension_for_transformers.transformers.utils.metrics", false], [251, "module-intel_extension_for_transformers.transformers.utils.objectives", false], [252, "module-intel_extension_for_transformers.transformers.utils.utility", false], [253, "module-main_eval_only", false], [254, "module-main_parse_and_eval", false], [255, "module-models.backbone", false], [256, "module-models.detr", false], [257, "module-models.detr_multi", false], [258, "module-models.matcher", false], [259, "module-models.position_encoding", false], [260, "module-models.segmentation", false], [261, "module-models.transformer", false], [262, "module-text", false], [263, "module-util.box_ops", false], [264, "module-util.misc", false], [265, "module-util.plot_utils", false], [266, "module-util.postprocess", false], [267, "module-utils.data_utils", false], [268, "module-utils.eval_utils", false]], "multiheadattenion (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.MultiHeadAttenion", false]], "mutate() (intel_extension_for_transformers.transformers.dynamic.evolution.evolution method)": [[30, "intel_extension_for_transformers.transformers.dynamic.evolution.Evolution.mutate", false]], "names_from_input() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.names_from_input", false]], "neoxreorderchange (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.neox_reorder_change)": [[179, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.neox_reorder_change.NeoxReorderChange", false]], "neoxroraryposemb (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.neox_rotary_pos_emb)": [[180, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.neox_rotary_pos_emb.NeoxRoraryPosEmb", false]], "neural_engine_bin() (in module intel_extension_for_transformers.transformers.runtime)": [[245, "intel_extension_for_transformers.transformers.runtime.neural_engine_bin", false]], "nlpseq2seqtrainer (class in intel_extension_for_transformers.transformers.trainer)": [[246, "intel_extension_for_transformers.transformers.trainer.NLPSeq2SeqTrainer", false]], "nlptrainer (class in intel_extension_for_transformers.transformers.trainer)": [[246, "intel_extension_for_transformers.transformers.trainer.NLPTrainer", false]], "nms() (in module util.postprocess)": [[266, "util.postprocess.nms", false]], "nms_by_containment() (in module util.postprocess)": [[266, "util.postprocess.nms_by_containment", false]], "nms_supercells() (in module util.postprocess)": [[266, "util.postprocess.nms_supercells", false]], "normalize() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.quantile method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Quantile.normalize", false]], "normalize_str() (in module utils.eval_utils)": [[268, "utils.eval_utils.normalize_str", false]], "normmean (class in intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.NormMean", false]], "null_instance (c macro)": [[278, "c.NULL_INSTANCE", false]], "objective (class in intel_extension_for_transformers.transformers.utils.objectives)": [[251, "intel_extension_for_transformers.transformers.utils.objectives.Objective", false]], "objects_to_cells() (in module util.postprocess)": [[266, "util.postprocess.objects_to_cells", false]], "objects_to_table_structures() (in module util.postprocess)": [[266, "util.postprocess.objects_to_table_structures", false]], "on_after_eval() (intel_extension_for_transformers.transformers.pruner.pruning.pruning method)": [[47, "intel_extension_for_transformers.transformers.pruner.pruning.Pruning.on_after_eval", false]], "on_after_optimizer_step() (intel_extension_for_transformers.transformers.pruner.pruning.pruning method)": [[47, "intel_extension_for_transformers.transformers.pruner.pruning.Pruning.on_after_optimizer_step", false]], "on_before_eval() (intel_extension_for_transformers.transformers.pruner.pruning.pruning method)": [[47, "intel_extension_for_transformers.transformers.pruner.pruning.Pruning.on_before_eval", false]], "on_before_optimizer_step() (intel_extension_for_transformers.transformers.pruner.pruning.pruning method)": [[47, "intel_extension_for_transformers.transformers.pruner.pruning.Pruning.on_before_optimizer_step", false]], "on_epoch_begin() (intel_extension_for_transformers.transformers.pruner.pruning.pruning method)": [[47, "intel_extension_for_transformers.transformers.pruner.pruning.Pruning.on_epoch_begin", false]], "on_epoch_end() (intel_extension_for_transformers.transformers.pruner.pruning.pruning method)": [[47, "intel_extension_for_transformers.transformers.pruner.pruning.Pruning.on_epoch_end", false]], "on_step_begin() (intel_extension_for_transformers.transformers.pruner.pruning.pruning method)": [[47, "intel_extension_for_transformers.transformers.pruner.pruning.Pruning.on_step_begin", false]], "on_step_end() (intel_extension_for_transformers.transformers.pruner.pruning.pruning method)": [[47, "intel_extension_for_transformers.transformers.pruner.pruning.Pruning.on_step_end", false]], "on_train_begin() (intel_extension_for_transformers.transformers.pruner.pruning.pruning method)": [[47, "intel_extension_for_transformers.transformers.pruner.pruning.Pruning.on_train_begin", false]], "on_train_end() (intel_extension_for_transformers.transformers.pruner.pruning.pruning method)": [[47, "intel_extension_for_transformers.transformers.pruner.pruning.Pruning.on_train_end", false]], "onehot (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Onehot", false]], "onehot (class in intel_extension_for_transformers.transformers.runtime.compile.ops.one_hot)": [[93, "intel_extension_for_transformers.transformers.runtime.compile.ops.one_hot.OneHot", false]], "onnx_extract_operator() (in module intel_extension_for_transformers.transformers.runtime.compile.onnx_utils)": [[62, "intel_extension_for_transformers.transformers.runtime.compile.onnx_utils.onnx_extract_operator", false]], "onnxextractor (class in intel_extension_for_transformers.transformers.runtime.compile.extractors.onnx_extractor)": [[52, "intel_extension_for_transformers.transformers.runtime.compile.extractors.onnx_extractor.ONNXExtractor", false]], "onnxinput (class in intel_extension_for_transformers.transformers.runtime.compile.ops.onnx_input)": [[94, "intel_extension_for_transformers.transformers.runtime.compile.ops.onnx_input.ONNXINPUT", false]], "opany (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.OpAny", false]], "operator (class in intel_extension_for_transformers.transformers.runtime.compile.ops.op)": [[95, "intel_extension_for_transformers.transformers.runtime.compile.ops.op.Operator", false]], "operator_registry() (in module intel_extension_for_transformers.transformers.runtime.compile.ops.op)": [[95, "intel_extension_for_transformers.transformers.runtime.compile.ops.op.operator_registry", false]], "operatoradaptor (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.operator_adaptor)": [[181, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.operator_adaptor.OperatorAdaptor", false]], "optimize() (intel_extension_for_transformers.transformers.runtime.compile.optimizer.optimizer method)": [[128, "intel_extension_for_transformers.transformers.runtime.compile.optimizer.Optimizer.optimize", false]], "optimize_model() (in module intel_extension_for_transformers.neural_chat.chatbot)": [[4, "intel_extension_for_transformers.neural_chat.chatbot.optimize_model", false]], "optimizedataset (class in intel_extension_for_transformers.transformers.runtime.compile.ops.optimize_dataset)": [[96, "intel_extension_for_transformers.transformers.runtime.compile.ops.optimize_dataset.OptimizeDataset", false]], "optimizedmodel (class in intel_extension_for_transformers.transformers.modeling.model)": [[35, "intel_extension_for_transformers.transformers.modeling.model.OptimizedModel", false]], "optimizer (class in intel_extension_for_transformers.transformers.runtime.compile.optimizer)": [[128, "intel_extension_for_transformers.transformers.runtime.compile.optimizer.Optimizer", false]], "orchestrate_optimizations() (intel_extension_for_transformers.transformers.trainer.basetrainer method)": [[246, "intel_extension_for_transformers.transformers.trainer.BaseTrainer.orchestrate_optimizations", false]], "output (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Output", false]], "outputdata (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.output_data)": [[182, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.output_data.OutputData", false]], "overlaps() (in module util.postprocess)": [[266, "util.postprocess.overlaps", false]], "pack (class in intel_extension_for_transformers.transformers.runtime.compile.ops.pack)": [[97, "intel_extension_for_transformers.transformers.runtime.compile.ops.pack.Pack", false]], "packagepositionembedding (class in intel_extension_for_transformers.transformers.runtime.compile.ops.pos_embed)": [[100, "intel_extension_for_transformers.transformers.runtime.compile.ops.pos_embed.PackagePositionEmbedding", false]], "paddingsequence (class in intel_extension_for_transformers.transformers.runtime.compile.ops.padding_sequence)": [[98, "intel_extension_for_transformers.transformers.runtime.compile.ops.padding_sequence.PaddingSequence", false]], "paddingsequence (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.padding_sequence)": [[183, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.padding_sequence.PaddingSequence", false]], "pareto_frontier() (intel_extension_for_transformers.transformers.dynamic.evolution.evolution method)": [[30, "intel_extension_for_transformers.transformers.dynamic.evolution.Evolution.pareto_frontier", false]], "parse_args() (in module gaudi_spawn)": [[1, "gaudi_spawn.parse_args", false]], "parse_multi_choice_response() (in module utils.eval_utils)": [[268, "utils.eval_utils.parse_multi_choice_response", false]], "parse_open_response() (in module utils.eval_utils)": [[268, "utils.eval_utils.parse_open_response", false]], "pattern (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.pattern)": [[184, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.pattern.Pattern", false]], "pattern_mapping() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.pattern_mapping", false]], "pattern_mapping_conf_validation() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.pattern_mapping_conf_validation", false]], "pattern_registry() (in module intel_extension_for_transformers.transformers.runtime.compile.sub_graph.pattern)": [[184, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.pattern.pattern_registry", false]], "performance() (intel_extension_for_transformers.transformers.utils.objectives.objective static method)": [[251, "intel_extension_for_transformers.transformers.utils.objectives.Objective.performance", false]], "placeholder (class in intel_extension_for_transformers.transformers.runtime.compile.ops.placeholder)": [[99, "intel_extension_for_transformers.transformers.runtime.compile.ops.placeholder.Placeholder", false]], "plot_logs() (in module util.plot_utils)": [[265, "util.plot_utils.plot_logs", false]], "positionembeddinglearned (class in models.position_encoding)": [[259, "models.position_encoding.PositionEmbeddingLearned", false]], "positionembeddings (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.position_embeddings)": [[185, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.position_embeddings.PositionEmbeddings", false]], "positionembeddingsine (class in models.position_encoding)": [[259, "models.position_encoding.PositionEmbeddingSine", false]], "positionembeddingsv1 (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.position_embeddings_v1)": [[186, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.position_embeddings_v1.PositionEmbeddingsV1", false]], "positionids (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.PositionIds", false]], "post_init_cpu() (intel_extension_for_transformers.transformers.utils.config.itrexquantizationconfigmixin method)": [[247, "intel_extension_for_transformers.transformers.utils.config.ITREXQuantizationConfigMixin.post_init_cpu", false]], "post_init_gptq() (intel_extension_for_transformers.transformers.utils.config.gptqconfig method)": [[247, "intel_extension_for_transformers.transformers.utils.config.GPTQConfig.post_init_gptq", false]], "post_init_runtime() (intel_extension_for_transformers.transformers.utils.config.itrexquantizationconfigmixin method)": [[247, "intel_extension_for_transformers.transformers.utils.config.ITREXQuantizationConfigMixin.post_init_runtime", false]], "post_init_xpu() (intel_extension_for_transformers.transformers.utils.config.itrexquantizationconfigmixin method)": [[247, "intel_extension_for_transformers.transformers.utils.config.ITREXQuantizationConfigMixin.post_init_xpu", false]], "postprocess (class in models.detr)": [[256, "models.detr.PostProcess", false]], "postprocess (class in models.detr_multi)": [[257, "models.detr_multi.PostProcess", false]], "postprocesspanoptic (class in models.segmentation)": [[260, "models.segmentation.PostProcessPanoptic", false]], "pow (class in intel_extension_for_transformers.transformers.runtime.compile.ops.pow)": [[101, "intel_extension_for_transformers.transformers.runtime.compile.ops.pow.Pow", false]], "prepare_inputs_for_generation() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertformaskedlm method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForMaskedLM.prepare_inputs_for_generation", false]], "prepare_inputs_for_generation() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertlmheadmodel method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertLMHeadModel.prepare_inputs_for_generation", false]], "prepare_inputs_for_generation() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertaforcausallm method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForCausalLM.prepare_inputs_for_generation", false]], "preprocess_model() (in module intel_extension_for_transformers.transformers.benchmark)": [[27, "intel_extension_for_transformers.transformers.benchmark.preprocess_model", false]], "provider (class in intel_extension_for_transformers.transformers.config)": [[28, "intel_extension_for_transformers.transformers.config.Provider", false]], "prune() (intel_extension_for_transformers.transformers.trainer.basetrainer method)": [[246, "intel_extension_for_transformers.transformers.trainer.BaseTrainer.prune", false]], "prune_heads() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertattention method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertAttention.prune_heads", false]], "prune_heads() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertaattention method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaAttention.prune_heads", false]], "pruner_info (intel_extension_for_transformers.transformers.pruner.pruning.pruning attribute)": [[47, "intel_extension_for_transformers.transformers.pruner.pruning.Pruning.pruner_info", false]], "pruners (intel_extension_for_transformers.transformers.pruner.pruning.pruning attribute)": [[47, "intel_extension_for_transformers.transformers.pruner.pruning.Pruning.pruners", false]], "prunerv2 (class in intel_extension_for_transformers.transformers.config)": [[28, "intel_extension_for_transformers.transformers.config.PrunerV2", false]], "pruning (class in intel_extension_for_transformers.transformers.pruner.pruning)": [[47, "intel_extension_for_transformers.transformers.pruner.pruning.Pruning", false]], "pull_key_prefix() (in module intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.pull_key_prefix", false]], "push_key_prefix() (in module intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.push_key_prefix", false]], "qkvmerge (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.qkv_merge)": [[187, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.qkv_merge.QKVMerge", false]], "qkvreshape (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.qkv_reshape)": [[188, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.qkv_reshape.QKVReshape", false]], "qlinearadd (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.QLinearAdd", false]], "qlinearmatmul (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.QLinearMatMul", false]], "qlinearmul (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.QLinearMul", false]], "quant_info_init() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.quant_info_init", false]], "quantawaretrainingconfig (class in intel_extension_for_transformers.transformers.utils.config)": [[247, "intel_extension_for_transformers.transformers.utils.config.QuantAwareTrainingConfig", false]], "quantile (class in intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Quantile", false]], "quantizationmethod (class in intel_extension_for_transformers.transformers.utils.config)": [[247, "intel_extension_for_transformers.transformers.utils.config.QuantizationMethod", false]], "quantize (class in intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_linear)": [[102, "intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_linear.Quantize", false]], "quantize() (intel_extension_for_transformers.transformers.trainer.basetrainer method)": [[246, "intel_extension_for_transformers.transformers.trainer.BaseTrainer.quantize", false]], "quantizedgraphdtypecheck (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quantized_graph_dtype_refactor)": [[191, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quantized_graph_dtype_refactor.QuantizedGraphDtypeCheck", false]], "quantizedmatmulwithbiasanddequantize (class in intel_extension_for_transformers.transformers.runtime.compile.ops.quantized_matmul_with_bias_and_dequantize)": [[105, "intel_extension_for_transformers.transformers.runtime.compile.ops.quantized_matmul_with_bias_and_dequantize.QuantizedMatMulWithBiasAndDequantize", false]], "quantizefusion (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quantize_fusion)": [[190, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quantize_fusion.QuantizeFusion", false]], "quantizelinear (class in intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_linear)": [[102, "intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_linear.QuantizeLinear", false]], "quantizev2 (class in intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_v2)": [[103, "intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_v2.QuantizeV2", false]], "range (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Range", false]], "realdiv (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.RealDiv", false]], "reciprocal (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Reciprocal", false]], "recursive_copy() (in module intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook)": [[24, "intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook.recursive_copy", false]], "reduce_dict() (in module util.misc)": [[264, "util.misc.reduce_dict", false]], "reducemean (class in intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_mean)": [[106, "intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_mean.ReduceMean", false]], "reducesum (class in intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_sum)": [[107, "intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_sum.ReduceSum", false]], "refactor_batch_size() (in module intel_extension_for_transformers.transformers.benchmark)": [[27, "intel_extension_for_transformers.transformers.benchmark.refactor_batch_size", false]], "refine_columns() (in module util.postprocess)": [[266, "util.postprocess.refine_columns", false]], "refine_rows() (in module util.postprocess)": [[266, "util.postprocess.refine_rows", false]], "refine_table_structures() (in module util.postprocess)": [[266, "util.postprocess.refine_table_structures", false]], "register_conv_template() (in module conversation)": [[0, "conversation.register_conv_template", false]], "relu (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Relu", false]], "remove_environ_info_item() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.remove_environ_info_item", false]], "remove_environ_info_items() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.remove_environ_info_items", false]], "remove_nodes() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.remove_nodes", false]], "remove_objects_without_content() (in module util.postprocess)": [[266, "util.postprocess.remove_objects_without_content", false]], "remove_supercell_overlap() (in module util.postprocess)": [[266, "util.postprocess.remove_supercell_overlap", false]], "removeconstantop (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_constant_op)": [[192, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_constant_op.RemoveConstantOP", false]], "removelastview (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_last_view)": [[193, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_last_view.RemoveLastView", false]], "removerange (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_range)": [[194, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_range.RemoveRange", false]], "removeslice (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.removeslice)": [[197, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.removeslice.RemoveSlice", false]], "removeunusedoperator (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_unused_operator)": [[195, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_unused_operator.RemoveUnusedOperator", false]], "removezeros (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_zeros)": [[196, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_zeros.RemoveZeros", false]], "rename_node() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.rename_node", false]], "reorder (class in intel_extension_for_transformers.transformers.runtime.compile.ops.reorder)": [[108, "intel_extension_for_transformers.transformers.runtime.compile.ops.reorder.Reorder", false]], "repeat (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Repeat", false]], "replace_module() (in module intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook)": [[24, "intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook.replace_module", false]], "reshape (class in intel_extension_for_transformers.transformers.runtime.compile.ops.reshape)": [[109, "intel_extension_for_transformers.transformers.runtime.compile.ops.reshape.Reshape", false]], "reshapeafterrestorehiddenstates (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_after_restore_hidden_states)": [[198, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_after_restore_hidden_states.ReshapeAfterRestoreHiddenStates", false]], "reshapebeforeandafterattentionoutlayernormgatherelements (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_before_and_after_attention_out_layer_norm_gather_elements)": [[199, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_before_and_after_attention_out_layer_norm_gather_elements.ReshapeBeforeAndAfterAttentionOutLayerNormGatherElements", false]], "reshapebeforerestorehiddenstates (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_before_restore_hidden_states)": [[200, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_before_restore_hidden_states.ReshapeBeforeRestoreHiddenStates", false]], "reshapefusion (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_fusion)": [[201, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_fusion.ReshapeFusion", false]], "resize (class in intel_extension_for_transformers.transformers.runtime.compile.ops.resize)": [[110, "intel_extension_for_transformers.transformers.runtime.compile.ops.resize.Resize", false]], "resnet101() (in module intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks)": [[17, "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks.resnet101", false]], "resnet152() (in module intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks)": [[17, "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks.resnet152", false]], "resnet18() (in module intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks)": [[17, "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks.resnet18", false]], "resnet34() (in module intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks)": [[17, "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks.resnet34", false]], "resnet50() (in module intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks)": [[17, "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks.resnet50", false]], "resnext101_32x8d() (in module intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks)": [[17, "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks.resnext101_32x8d", false]], "resnext50_32x4d() (in module intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks)": [[17, "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks.resnext50_32x4d", false]], "resolve_state_dict() (in module intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.resolve_state_dict", false]], "restorehiddenstatesinlengthadaptive (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.restore_hidden_states_in_length_adaptive_update_indices)": [[202, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.restore_hidden_states_in_length_adaptive_update_indices.RestoreHiddenStatesInLengthAdaptive", false]], "retrievaltypeoptions (class in intel_extension_for_transformers.neural_chat.config)": [[5, "intel_extension_for_transformers.neural_chat.config.RetrievalTypeOptions", false]], "retrieveradapter (class in intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.retriever_adapter)": [[14, "intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.retriever_adapter.RetrieverAdapter", false]], "rmsnorm (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.rms_norm)": [[203, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.rms_norm.RmsNorm", false]], "robertaattention (class in intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaAttention", false]], "robertaclassificationhead (class in intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaClassificationHead", false]], "robertaembeddings (class in intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaEmbeddings", false]], "robertaencoder (class in intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaEncoder", false]], "robertaforcausallm (class in intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForCausalLM", false]], "robertaformaskedlm (class in intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForMaskedLM", false]], "robertaformultiplechoice (class in intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForMultipleChoice", false]], "robertaforquestionanswering (class in intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForQuestionAnswering", false]], "robertaforsequenceclassification (class in intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForSequenceClassification", false]], "robertafortokenclassification (class in intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForTokenClassification", false]], "robertaintermediate (class in intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaIntermediate", false]], "robertalayer (class in intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaLayer", false]], "robertalmhead (class in intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaLMHead", false]], "robertamodel (class in intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaModel", false]], "robertaoutput (class in intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaOutput", false]], "robertapooler (class in intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaPooler", false]], "robertapretrainedmodel (class in intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaPreTrainedModel", false]], "robertaselfattention (class in intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaSelfAttention", false]], "robertaselfoutput (class in intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaSelfOutput", false]], "roraryposemb (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.rotary_pos_emb)": [[204, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.rotary_pos_emb.RoraryPosEmb", false]], "rsqrt (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Rsqrt", false]], "rsub (class in intel_extension_for_transformers.transformers.runtime.compile.ops.rsub)": [[111, "intel_extension_for_transformers.transformers.runtime.compile.ops.rsub.Rsub", false]], "rtnconfig (class in intel_extension_for_transformers.transformers.utils.config)": [[247, "intel_extension_for_transformers.transformers.utils.config.RtnConfig", false]], "run_evolutionary_search() (intel_extension_for_transformers.transformers.trainer.basetrainer method)": [[246, "intel_extension_for_transformers.transformers.trainer.BaseTrainer.run_evolutionary_search", false]], "sample_layer_configuration() (in module intel_extension_for_transformers.transformers.dynamic.drop_and_restore_utils)": [[29, "intel_extension_for_transformers.transformers.dynamic.drop_and_restore_utils.sample_layer_configuration", false]], "sample_length_configuration() (in module intel_extension_for_transformers.transformers.dynamic.drop_and_restore_utils)": [[29, "intel_extension_for_transformers.transformers.dynamic.drop_and_restore_utils.sample_length_configuration", false]], "sample_portion() (in module intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.sample_portion", false]], "save() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.stat method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Stat.save", false]], "save() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.save", false]], "save_cached_state() (in module intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.save_cached_state", false]], "save_jsonl() (in module utils.data_utils)": [[267, "utils.data_utils.save_jsonl", false]], "save_population() (intel_extension_for_transformers.transformers.dynamic.evolution.evolution method)": [[30, "intel_extension_for_transformers.transformers.dynamic.evolution.Evolution.save_population", false]], "save_pretrained() (intel_extension_for_transformers.transformers.utils.config.itrexquantizationconfigmixin method)": [[247, "intel_extension_for_transformers.transformers.utils.config.ITREXQuantizationConfigMixin.save_pretrained", false]], "save_store() (intel_extension_for_transformers.transformers.dynamic.evolution.evolution method)": [[30, "intel_extension_for_transformers.transformers.dynamic.evolution.Evolution.save_store", false]], "scatterelements (class in intel_extension_for_transformers.transformers.runtime.compile.ops.scatter_elements)": [[112, "intel_extension_for_transformers.transformers.runtime.compile.ops.scatter_elements.ScatterElements", false]], "search_kwargs (intel_extension_for_transformers.langchain.langchain_community.retrievers.child_parent_retriever.childparentretriever attribute)": [[2, "intel_extension_for_transformers.langchain.langchain_community.retrievers.child_parent_retriever.ChildParentRetriever.search_kwargs", false]], "search_pattern() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.search_pattern", false]], "search_straight_pattern() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.search_straight_pattern", false]], "search_type (intel_extension_for_transformers.langchain.langchain_community.retrievers.child_parent_retriever.childparentretriever attribute)": [[2, "intel_extension_for_transformers.langchain.langchain_community.retrievers.child_parent_retriever.ChildParentRetriever.search_type", false]], "searchtype (class in intel_extension_for_transformers.langchain.langchain_community.retrievers.child_parent_retriever)": [[2, "intel_extension_for_transformers.langchain.langchain_community.retrievers.child_parent_retriever.SearchType", false]], "secondmoment (class in intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.SecondMoment", false]], "separatorstyle (class in conversation)": [[0, "conversation.SeparatorStyle", false]], "sequencelength (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.SequenceLength", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.all.all method)": [[63, "intel_extension_for_transformers.transformers.runtime.compile.ops.all.All.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.assert.assert method)": [[64, "intel_extension_for_transformers.transformers.runtime.compile.ops.assert.Assert.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul.batchmatmul method)": [[66, "intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul.BatchMatMul.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul_v2.batchmatmulv2 method)": [[67, "intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul_v2.BatchMatMulV2.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.bias_add.biasadd method)": [[68, "intel_extension_for_transformers.transformers.runtime.compile.ops.bias_add.BiasAdd.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.cast.cast method)": [[69, "intel_extension_for_transformers.transformers.runtime.compile.ops.cast.Cast.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.concat.concat method)": [[70, "intel_extension_for_transformers.transformers.runtime.compile.ops.concat.Concat.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.conv.conv method)": [[71, "intel_extension_for_transformers.transformers.runtime.compile.ops.conv.Conv.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.cos.cos method)": [[72, "intel_extension_for_transformers.transformers.runtime.compile.ops.cos.Cos.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.expand_dims.expanddims method)": [[74, "intel_extension_for_transformers.transformers.runtime.compile.ops.expand_dims.ExpandDims.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.fused_batch_norm_v3.fusedbatchnormv3 method)": [[76, "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_batch_norm_v3.FusedBatchNormV3.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.fused_gemm.fusedgemm method)": [[77, "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_gemm.FusedGemm.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.fused_matmul.fusedmatmul method)": [[78, "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_matmul.FusedMatMul.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.gather.gather method)": [[79, "intel_extension_for_transformers.transformers.runtime.compile.ops.gather.Gather.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.gather.gatherv2 method)": [[79, "intel_extension_for_transformers.transformers.runtime.compile.ops.gather.GatherV2.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.gather_elements.gatherelements method)": [[80, "intel_extension_for_transformers.transformers.runtime.compile.ops.gather_elements.GatherElements.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.gelu.gelu method)": [[81, "intel_extension_for_transformers.transformers.runtime.compile.ops.gelu.Gelu.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.gemm.gemm method)": [[82, "intel_extension_for_transformers.transformers.runtime.compile.ops.gemm.Gemm.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_get_next.iteratorgetnext method)": [[84, "intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_get_next.IteratorGetNext.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_v2.iteratorv2 method)": [[85, "intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_v2.IteratorV2.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.layer_normalization.layernorm method)": [[86, "intel_extension_for_transformers.transformers.runtime.compile.ops.layer_normalization.LayerNorm.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.layer_normalization.layernormalization method)": [[86, "intel_extension_for_transformers.transformers.runtime.compile.ops.layer_normalization.LayerNormalization.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.log_softmax.logsoftmax method)": [[87, "intel_extension_for_transformers.transformers.runtime.compile.ops.log_softmax.LogSoftmax.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.map_and_batch_dataset.mapandbatchdataset method)": [[88, "intel_extension_for_transformers.transformers.runtime.compile.ops.map_and_batch_dataset.MapAndBatchDataset.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.matmul.matmul method)": [[89, "intel_extension_for_transformers.transformers.runtime.compile.ops.matmul.MatMul.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.mean.mean method)": [[90, "intel_extension_for_transformers.transformers.runtime.compile.ops.mean.Mean.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.model_dataset.modeldataset method)": [[92, "intel_extension_for_transformers.transformers.runtime.compile.ops.model_dataset.ModelDataset.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.one_hot.onehot method)": [[93, "intel_extension_for_transformers.transformers.runtime.compile.ops.one_hot.OneHot.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.op.operator method)": [[95, "intel_extension_for_transformers.transformers.runtime.compile.ops.op.Operator.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.optimize_dataset.optimizedataset method)": [[96, "intel_extension_for_transformers.transformers.runtime.compile.ops.optimize_dataset.OptimizeDataset.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.pack.pack method)": [[97, "intel_extension_for_transformers.transformers.runtime.compile.ops.pack.Pack.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.padding_sequence.paddingsequence method)": [[98, "intel_extension_for_transformers.transformers.runtime.compile.ops.padding_sequence.PaddingSequence.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.placeholder.placeholder method)": [[99, "intel_extension_for_transformers.transformers.runtime.compile.ops.placeholder.Placeholder.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.pos_embed.packagepositionembedding method)": [[100, "intel_extension_for_transformers.transformers.runtime.compile.ops.pos_embed.PackagePositionEmbedding.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.pow.pow method)": [[101, "intel_extension_for_transformers.transformers.runtime.compile.ops.pow.Pow.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_linear.quantize method)": [[102, "intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_linear.Quantize.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_linear.quantizelinear method)": [[102, "intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_linear.QuantizeLinear.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_v2.quantizev2 method)": [[103, "intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_v2.QuantizeV2.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.quantized_matmul_with_bias_and_dequantize.quantizedmatmulwithbiasanddequantize method)": [[105, "intel_extension_for_transformers.transformers.runtime.compile.ops.quantized_matmul_with_bias_and_dequantize.QuantizedMatMulWithBiasAndDequantize.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_mean.reducemean method)": [[106, "intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_mean.ReduceMean.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_sum.reducesum method)": [[107, "intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_sum.ReduceSum.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.reorder.reorder method)": [[108, "intel_extension_for_transformers.transformers.runtime.compile.ops.reorder.Reorder.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.reshape.reshape method)": [[109, "intel_extension_for_transformers.transformers.runtime.compile.ops.reshape.Reshape.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.resize.resize method)": [[110, "intel_extension_for_transformers.transformers.runtime.compile.ops.resize.Resize.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.rsub.rsub method)": [[111, "intel_extension_for_transformers.transformers.runtime.compile.ops.rsub.Rsub.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.scatter_elements.scatterelements method)": [[112, "intel_extension_for_transformers.transformers.runtime.compile.ops.scatter_elements.ScatterElements.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.shape.shape method)": [[113, "intel_extension_for_transformers.transformers.runtime.compile.ops.shape.Shape.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.sin.sin method)": [[114, "intel_extension_for_transformers.transformers.runtime.compile.ops.sin.Sin.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.size.size method)": [[115, "intel_extension_for_transformers.transformers.runtime.compile.ops.size.Size.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.slice_position_ids.slicepositionids method)": [[116, "intel_extension_for_transformers.transformers.runtime.compile.ops.slice_position_ids.SlicePositionIds.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.softmax.softmax method)": [[117, "intel_extension_for_transformers.transformers.runtime.compile.ops.softmax.Softmax.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.split.split method)": [[118, "intel_extension_for_transformers.transformers.runtime.compile.ops.split.Split.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.squeeze.squeeze method)": [[119, "intel_extension_for_transformers.transformers.runtime.compile.ops.squeeze.Squeeze.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.strided_slice.stridedslice method)": [[120, "intel_extension_for_transformers.transformers.runtime.compile.ops.strided_slice.StridedSlice.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.top_k.topk method)": [[122, "intel_extension_for_transformers.transformers.runtime.compile.ops.top_k.TopK.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.transpose.transpose method)": [[123, "intel_extension_for_transformers.transformers.runtime.compile.ops.transpose.Transpose.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.unpack.unpack method)": [[124, "intel_extension_for_transformers.transformers.runtime.compile.ops.unpack.Unpack.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.unsqueeze.unsqueeze method)": [[125, "intel_extension_for_transformers.transformers.runtime.compile.ops.unsqueeze.Unsqueeze.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.view.view method)": [[126, "intel_extension_for_transformers.transformers.runtime.compile.ops.view.View.set_attr", false]], "set_attr() (intel_extension_for_transformers.transformers.runtime.compile.ops.where.where method)": [[127, "intel_extension_for_transformers.transformers.runtime.compile.ops.where.Where.set_attr", false]], "set_autocast() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.set_autocast", false]], "set_dynamic_config() (intel_extension_for_transformers.transformers.trainer.basetrainer method)": [[246, "intel_extension_for_transformers.transformers.trainer.BaseTrainer.set_dynamic_config", false]], "set_environ_var() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.set_environ_var", false]], "set_environ_vars() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.set_environ_vars", false]], "set_input_embeddings() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertmodel method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertModel.set_input_embeddings", false]], "set_input_embeddings() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertamodel method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaModel.set_input_embeddings", false]], "set_length_config() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertmodel method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertModel.set_length_config", false]], "set_length_config() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertamodel method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaModel.set_length_config", false]], "set_lower_constraint() (intel_extension_for_transformers.transformers.dynamic.evolution.evolution method)": [[30, "intel_extension_for_transformers.transformers.dynamic.evolution.Evolution.set_lower_constraint", false]], "set_output_attentions() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertmodel method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertModel.set_output_attentions", false]], "set_output_attentions() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertamodel method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaModel.set_output_attentions", false]], "set_output_embeddings() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertformaskedlm method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForMaskedLM.set_output_embeddings", false]], "set_output_embeddings() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertforpretraining method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForPreTraining.set_output_embeddings", false]], "set_output_embeddings() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertlmheadmodel method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertLMHeadModel.set_output_embeddings", false]], "set_output_embeddings() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertaforcausallm method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForCausalLM.set_output_embeddings", false]], "set_output_embeddings() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertaformaskedlm method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForMaskedLM.set_output_embeddings", false]], "set_requires_grad() (in module intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook)": [[24, "intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook.set_requires_grad", false]], "set_system_message() (conversation.conversation method)": [[0, "conversation.Conversation.set_system_message", false]], "set_upper_constraint() (intel_extension_for_transformers.transformers.dynamic.evolution.evolution method)": [[30, "intel_extension_for_transformers.transformers.dynamic.evolution.Evolution.set_upper_constraint", false]], "setcriterion (class in models.detr)": [[256, "models.detr.SetCriterion", false]], "setcriterion (class in models.detr_multi)": [[257, "models.detr_multi.SetCriterion", false]], "setup_for_distributed() (in module util.misc)": [[264, "util.misc.setup_for_distributed", false]], "shape (class in intel_extension_for_transformers.transformers.runtime.compile.ops.shape)": [[113, "intel_extension_for_transformers.transformers.runtime.compile.ops.shape.Shape", false]], "sigmoid (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Sigmoid", false]], "sigmoid_focal_loss() (in module models.segmentation)": [[260, "models.segmentation.sigmoid_focal_loss", false]], "silu (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Silu", false]], "similarity (intel_extension_for_transformers.langchain.langchain_community.retrievers.child_parent_retriever.searchtype attribute)": [[2, "intel_extension_for_transformers.langchain.langchain_community.retrievers.child_parent_retriever.SearchType.similarity", false]], "sin (class in intel_extension_for_transformers.transformers.runtime.compile.ops.sin)": [[114, "intel_extension_for_transformers.transformers.runtime.compile.ops.sin.Sin", false]], "size (class in intel_extension_for_transformers.transformers.runtime.compile.ops.size)": [[115, "intel_extension_for_transformers.transformers.runtime.compile.ops.size.Size", false]], "slicemask (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.slicemask)": [[205, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.slicemask.SliceMask", false]], "slicepositionids (class in intel_extension_for_transformers.transformers.runtime.compile.ops.slice_position_ids)": [[116, "intel_extension_for_transformers.transformers.runtime.compile.ops.slice_position_ids.SlicePositionIds", false]], "slot_into_containers() (in module util.postprocess)": [[266, "util.postprocess.slot_into_containers", false]], "smoothedvalue (class in util.misc)": [[264, "util.misc.SmoothedValue", false]], "smoothquantconfig (class in intel_extension_for_transformers.transformers.utils.config)": [[247, "intel_extension_for_transformers.transformers.utils.config.SmoothQuantConfig", false]], "softmax (class in intel_extension_for_transformers.transformers.runtime.compile.ops.softmax)": [[117, "intel_extension_for_transformers.transformers.runtime.compile.ops.softmax.Softmax", false]], "sort_objects_by_score() (in module util.postprocess)": [[266, "util.postprocess.sort_objects_by_score", false]], "sort_objects_left_to_right() (in module util.postprocess)": [[266, "util.postprocess.sort_objects_left_to_right", false]], "sort_objects_top_to_bottom() (in module util.postprocess)": [[266, "util.postprocess.sort_objects_top_to_bottom", false]], "split (class in intel_extension_for_transformers.transformers.runtime.compile.ops.split)": [[118, "intel_extension_for_transformers.transformers.runtime.compile.ops.split.Split", false]], "sqrt (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Sqrt", false]], "square (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Square", false]], "squareddifference (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.SquaredDifference", false]], "squeeze (class in intel_extension_for_transformers.transformers.runtime.compile.ops.squeeze)": [[119, "intel_extension_for_transformers.transformers.runtime.compile.ops.squeeze.Squeeze", false]], "stablediffusion_bf16convert (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stablediffusion_bf16convert)": [[211, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_bf16Convert.StableDiffusion_bf16Convert", false]], "stablediffusion_collectquantinfo (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stablediffusion_collectqdqinfo)": [[212, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_collectQDQInfo.StableDiffusion_CollectQuantInfo", false]], "stablediffusion_insertquantnode (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stablediffusion_insertquantnode)": [[213, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_insertQuantNode.StableDiffusion_InsertQuantNode", false]], "stablediffusion_mhareshape (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stablediffusion_mhareshape)": [[208, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_MHAReshape.StableDiffusion_MHAReshape", false]], "stablediffusion_quantizefusion (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stablediffusion_quantizefusion)": [[209, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_QuantizeFusion.StableDiffusion_QuantizeFusion", false]], "stablediffusion_reshapefusion (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stablediffusion_reshapefusion)": [[210, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_ReshapeFusion.StableDiffusion_ReshapeFusion", false]], "stablediffusioninstructpix2pixpipeline (class in intel_extension_for_transformers.neural_chat.pipeline.plugins.image2image.instructpix2pix_pipeline)": [[9, "intel_extension_for_transformers.neural_chat.pipeline.plugins.image2image.instructpix2pix_pipeline.StableDiffusionInstructPix2PixPipeline", false]], "stack (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Stack", false]], "start_pipeline() (in module intel_extension_for_transformers.transformers.runtime.compile.compile)": [[49, "intel_extension_for_transformers.transformers.runtime.compile.compile.start_pipeline", false]], "startendlogits (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.start_end_logits)": [[214, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.start_end_logits.StartEndLogits", false]], "stat (class in intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Stat", false]], "state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.bincount method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Bincount.state_dict", false]], "state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.combinedstat method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.CombinedStat.state_dict", false]], "state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.covariance method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Covariance.state_dict", false]], "state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.crosscovariance method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.CrossCovariance.state_dict", false]], "state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.crossiou method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.CrossIoU.state_dict", false]], "state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.history method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.History.state_dict", false]], "state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.iou method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.IoU.state_dict", false]], "state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.mean method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Mean.state_dict", false]], "state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.quantile method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Quantile.state_dict", false]], "state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.secondmoment method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.SecondMoment.state_dict", false]], "state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.stat method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Stat.state_dict", false]], "state_dict() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.variance method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Variance.state_dict", false]], "staticquantconfig (class in intel_extension_for_transformers.transformers.utils.config)": [[247, "intel_extension_for_transformers.transformers.utils.config.StaticQuantConfig", false]], "stopforward": [[24, "intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook.StopForward", false]], "stopgradient (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.StopGradient", false]], "store2str() (in module intel_extension_for_transformers.transformers.dynamic.evolution)": [[30, "intel_extension_for_transformers.transformers.dynamic.evolution.store2str", false]], "str2list() (in module intel_extension_for_transformers.transformers.runtime.compile.graph_utils)": [[57, "intel_extension_for_transformers.transformers.runtime.compile.graph_utils.str2list", false]], "stridedslice (class in intel_extension_for_transformers.transformers.runtime.compile.ops.strided_slice)": [[120, "intel_extension_for_transformers.transformers.runtime.compile.ops.strided_slice.StridedSlice", false]], "subgraphmatcher (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.subgraph_matcher)": [[215, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.subgraph_matcher.SubGraphMatcher", false]], "subsequence() (in module intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook)": [[24, "intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook.subsequence", false]], "synchronize_between_processes() (util.misc.smoothedvalue method)": [[264, "util.misc.SmoothedValue.synchronize_between_processes", false]], "table_structure_to_cells() (in module util.postprocess)": [[266, "util.postprocess.table_structure_to_cells", false]], "tally() (in module intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.tally", false]], "tanh (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Tanh", false]], "tensor (class in intel_extension_for_transformers.transformers.runtime.compile.ops.tensor)": [[121, "intel_extension_for_transformers.transformers.runtime.compile.ops.tensor.Tensor", false]], "tensorflowextractor (class in intel_extension_for_transformers.transformers.runtime.compile.extractors.tf_extractor)": [[53, "intel_extension_for_transformers.transformers.runtime.compile.extractors.tf_extractor.TensorflowExtractor", false]], "tensorslicedataset (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.TensorSliceDataset", false]], "teqconfig (class in intel_extension_for_transformers.transformers.utils.config)": [[247, "intel_extension_for_transformers.transformers.utils.config.TeqConfig", false]], "text": [[262, "module-text", false]], "text_to_sequence() (in module text)": [[262, "text.text_to_sequence", false]], "textencoder_attentionmaskaddreshape (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textencoder_attentionmaskaddreshape)": [[217, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_AttentionMaskAddReshape.TextEncoder_AttentionMaskAddReshape", false]], "textencoder_attentionreshape (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textencoder_attentionreshape)": [[218, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_AttentionReshape.TextEncoder_AttentionReshape", false]], "textencoder_casualattentionmask (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textencoder_causal_attention_mask)": [[223, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_causal_attention_mask.TextEncoder_CasualAttentionMask", false]], "textencoder_kvreshape (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textencoder_kvreshape)": [[219, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_KVReshape.TextEncoder_KVReshape", false]], "textencoder_mulreshape (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textencoder_mulreshape)": [[220, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_MulReshape.TextEncoder_MulReshape", false]], "textencoder_qreshape (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textencoder_qreshape)": [[221, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_QReshape.TextEncoder_QReshape", false]], "textencoder_softmaxreshape (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textencoder_softmaxreshape)": [[222, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_SoftmaxReshape.TextEncoder_SoftmaxReshape", false]], "textencoder_wordembedding (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textencdoer_word_embedding)": [[216, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncdoer_word_embedding.TextEncoder_WordEmbedding", false]], "tf_dtype_id (in module intel_extension_for_transformers.transformers.runtime.compile.tf_utils)": [[243, "intel_extension_for_transformers.transformers.runtime.compile.tf_utils.TF_DTYPE_ID", false]], "tf_extract_operator() (in module intel_extension_for_transformers.transformers.runtime.compile.tf_utils)": [[243, "intel_extension_for_transformers.transformers.runtime.compile.tf_utils.tf_extract_operator", false]], "tile (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Tile", false]], "to_() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.bincount method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Bincount.to_", false]], "to_() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.combinedstat method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.CombinedStat.to_", false]], "to_() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.covariance method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Covariance.to_", false]], "to_() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.crosscovariance method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.CrossCovariance.to_", false]], "to_() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.crossiou method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.CrossIoU.to_", false]], "to_() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.history method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.History.to_", false]], "to_() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.iou method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.IoU.to_", false]], "to_() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.mean method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Mean.to_", false]], "to_() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.quantile method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Quantile.to_", false]], "to_() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.secondmoment method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.SecondMoment.to_", false]], "to_() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.stat method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Stat.to_", false]], "to_() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.variance method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Variance.to_", false]], "to_diff_dict() (intel_extension_for_transformers.transformers.utils.config.autoroundconfig method)": [[247, "intel_extension_for_transformers.transformers.utils.config.AutoRoundConfig.to_diff_dict", false]], "to_diff_dict() (intel_extension_for_transformers.transformers.utils.config.awqconfig method)": [[247, "intel_extension_for_transformers.transformers.utils.config.AwqConfig.to_diff_dict", false]], "to_diff_dict() (intel_extension_for_transformers.transformers.utils.config.gptqconfig method)": [[247, "intel_extension_for_transformers.transformers.utils.config.GPTQConfig.to_diff_dict", false]], "to_diff_dict() (intel_extension_for_transformers.transformers.utils.config.rtnconfig method)": [[247, "intel_extension_for_transformers.transformers.utils.config.RtnConfig.to_diff_dict", false]], "to_diff_dict() (intel_extension_for_transformers.transformers.utils.config.teqconfig method)": [[247, "intel_extension_for_transformers.transformers.utils.config.TeqConfig.to_diff_dict", false]], "to_gradio_chatbot() (conversation.conversation method)": [[0, "conversation.Conversation.to_gradio_chatbot", false]], "to_json_file() (intel_extension_for_transformers.transformers.utils.config.itrexquantizationconfigmixin method)": [[247, "intel_extension_for_transformers.transformers.utils.config.ITREXQuantizationConfigMixin.to_json_file", false]], "to_openai_api_messages() (conversation.conversation method)": [[0, "conversation.Conversation.to_openai_api_messages", false]], "tokentypeembeddings (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.token_type_embeddings)": [[224, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.token_type_embeddings.TokenTypeEmbeddings", false]], "tokentypeembeddingsv1 (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.token_type_embeddings_v1)": [[225, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.token_type_embeddings_v1.TokenTypeEmbeddingsV1", false]], "tokentypeids (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.TokenTypeIds", false]], "topk (class in intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.TopK", false]], "topk (class in intel_extension_for_transformers.transformers.runtime.compile.ops.top_k)": [[122, "intel_extension_for_transformers.transformers.runtime.compile.ops.top_k.TopK", false]], "topk() (intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.topk method)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.TopK.topk", false]], "torch_extract_operator() (in module intel_extension_for_transformers.transformers.runtime.compile.torch_utils)": [[244, "intel_extension_for_transformers.transformers.runtime.compile.torch_utils.torch_extract_operator", false]], "torchembedding (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_embedding)": [[226, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_embedding.TorchEmbedding", false]], "torchextractor (class in intel_extension_for_transformers.transformers.runtime.compile.extractors.torch_extractor)": [[54, "intel_extension_for_transformers.transformers.runtime.compile.extractors.torch_extractor.TorchExtractor", false]], "torchinnerproductinsertbias (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_ip_insert_bias)": [[227, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_ip_insert_bias.TorchInnerProductInsertBias", false]], "torchinsertbf16node (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quant_gather_to_bf16)": [[189, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quant_gather_to_bf16.TorchInsertBF16Node", false]], "torchinsertbf16node (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torchinsertbf16node)": [[229, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torchinsertbf16node.TorchInsertBF16Node", false]], "torchpaddingsequence (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torchpaddingsquence)": [[230, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torchpaddingsquence.TorchPaddingSequence", false]], "torchunpackbaddbmm (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_unpack_baddbmm)": [[228, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_unpack_baddbmm.TorchUnpackBaddbmm", false]], "trace (class in intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook)": [[24, "intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook.Trace", false]], "tracedict (class in intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook)": [[24, "intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook.TraceDict", false]], "train() (intel_extension_for_transformers.transformers.trainer.basetrainer method)": [[246, "intel_extension_for_transformers.transformers.trainer.BaseTrainer.train", false]], "training_step() (intel_extension_for_transformers.transformers.trainer.basetrainer method)": [[246, "intel_extension_for_transformers.transformers.trainer.BaseTrainer.training_step", false]], "training_step_length_adaptive() (intel_extension_for_transformers.transformers.trainer.basetrainer method)": [[246, "intel_extension_for_transformers.transformers.trainer.BaseTrainer.training_step_length_adaptive", false]], "transformer2dmodel_attentionmaskaddreshape (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2dmodel_attentionmaskaddreshape)": [[231, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_AttentionMaskAddReshape.Transformer2Dmodel_AttentionMaskAddReshape", false]], "transformer2dmodel_constantofshapewithmul (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2dmodel_constantofshapewithmul)": [[232, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_ConstantOfShapeWithMul.Transformer2Dmodel_ConstantOfShapeWithMul", false]], "transformer2dmodel_encoderhiddenstatesreshape (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2dmodel_encoderhiddenstatesreshape)": [[238, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_encoderHiddenStatesReshape.Transformer2Dmodel_EncoderHiddenStatesReshape", false]], "transformer2dmodel_ffninputslice (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2dmodel_ffnslice)": [[233, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_FFNSlice.Transformer2Dmodel_FFNInputSlice", false]], "transformer2dmodel_ffninputslice_1 (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2dmodel_ffnslice_1)": [[234, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_FFNSlice_1.Transformer2Dmodel_FFNInputSlice_1", false]], "transformer2dmodel_getsamplebatch (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2dmodel_getsamplebatch)": [[239, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_getSampleBatch.Transformer2Dmodel_GetSampleBatch", false]], "transformer2dmodel_qkvprereshape (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2dmodel_qkvprereshape)": [[235, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_QKVPreReshape.Transformer2Dmodel_QKVPreReshape", false]], "transformer2dmodel_qkvreshape (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2dmodel_qkvreshape)": [[236, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_QKVReshape.Transformer2Dmodel_QKVReshape", false]], "transformer2dmodel_qkvreshapeto4d (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2dmodel_qkvreshape4d)": [[237, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_QKVReshape4D.Transformer2Dmodel_QKVReshapeTo4D", false]], "transformer2dmodel_sampleslice (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2dmodel_sampleslice)": [[240, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_sampleSlice.Transformer2Dmodel_SampleSlice", false]], "transpose (class in intel_extension_for_transformers.transformers.runtime.compile.ops.transpose)": [[123, "intel_extension_for_transformers.transformers.runtime.compile.ops.transpose.Transpose", false]], "transpose_for_scores() (intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.bertselfattention method)": [[36, "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertSelfAttention.transpose_for_scores", false]], "transpose_for_scores() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertaselfattention method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaSelfAttention.transpose_for_scores", false]], "transpose_mode_int8() (intel_extension_for_transformers.transformers.runtime.compile.graph.graph.graph method)": [[55, "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph.transpose_mode_int8", false]], "transposebatchmatmul (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.TransposeBatchMatMul", false]], "transposebatchmatmul (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transpose_batch_matmul)": [[241, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transpose_batch_matmul.TransposeBatchMatMul", false]], "unbox_numpy_null() (in module intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.unbox_numpy_null", false]], "unpack (class in intel_extension_for_transformers.transformers.runtime.compile.ops.unpack)": [[124, "intel_extension_for_transformers.transformers.runtime.compile.ops.unpack.Unpack", false]], "unsqueeze (class in intel_extension_for_transformers.transformers.runtime.compile.ops.unsqueeze)": [[125, "intel_extension_for_transformers.transformers.runtime.compile.ops.unsqueeze.Unsqueeze", false]], "update() (intel_extension_for_transformers.transformers.utils.config.itrexquantizationconfigmixin method)": [[247, "intel_extension_for_transformers.transformers.utils.config.ITREXQuantizationConfigMixin.update", false]], "update_config() (intel_extension_for_transformers.transformers.pruner.pruning.pruning method)": [[47, "intel_extension_for_transformers.transformers.pruner.pruning.Pruning.update_config", false]], "update_keys_to_ignore() (intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.robertapretrainedmodel method)": [[44, "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaPreTrainedModel.update_keys_to_ignore", false]], "update_last_message() (conversation.conversation method)": [[0, "conversation.Conversation.update_last_message", false]], "util.box_ops": [[263, "module-util.box_ops", false]], "util.misc": [[264, "module-util.misc", false]], "util.plot_utils": [[265, "module-util.plot_utils", false]], "util.postprocess": [[266, "module-util.postprocess", false]], "utils.data_utils": [[267, "module-utils.data_utils", false]], "utils.eval_utils": [[268, "module-utils.eval_utils", false]], "variance (class in intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats)": [[25, "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Variance", false]], "view (class in intel_extension_for_transformers.transformers.runtime.compile.ops.view)": [[126, "intel_extension_for_transformers.transformers.runtime.compile.ops.view.View", false]], "warn() (in module intel_extension_for_transformers.transformers.runtime.compile.logger)": [[61, "intel_extension_for_transformers.transformers.runtime.compile.logger.warn", false]], "warning() (in module intel_extension_for_transformers.transformers.runtime.compile.logger)": [[61, "intel_extension_for_transformers.transformers.runtime.compile.logger.warning", false]], "weight_optimization() (intel_extension_for_transformers.transformers.runtime.compile.optimizer.optimizer method)": [[128, "intel_extension_for_transformers.transformers.runtime.compile.optimizer.Optimizer.weight_optimization", false]], "weightpruningconfig (class in intel_extension_for_transformers.transformers.config)": [[28, "intel_extension_for_transformers.transformers.config.WeightPruningConfig", false]], "where (class in intel_extension_for_transformers.transformers.runtime.compile.ops.where)": [[127, "intel_extension_for_transformers.transformers.runtime.compile.ops.where.Where", false]], "wide_resnet101_2() (in module intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks)": [[17, "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks.wide_resnet101_2", false]], "wide_resnet50_2() (in module intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks)": [[17, "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks.wide_resnet50_2", false]], "wordembeddings (class in intel_extension_for_transformers.transformers.runtime.compile.sub_graph.word_embeddings)": [[242, "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.word_embeddings.WordEmbeddings", false]], "zeros (class in intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops)": [[73, "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops.Zeros", false]]}, "objects": {"": [[278, 0, 1, "c.CPU_INSTANCE", "CPU_INSTANCE"], [278, 0, 1, "c.NULL_INSTANCE", "NULL_INSTANCE"], [278, 1, 1, "_CPPv42jd", "jd"], [278, 1, 1, "_CPPv42jd", "jd"], [279, 1, 1, "_CPPv42jd", "jd"], [280, 1, 1, "_CPPv42jd", "jd"], [281, 1, 1, "_CPPv42jd", "jd"], [281, 1, 1, "_CPPv42jd", "jd"], [281, 1, 1, "_CPPv42jd", "jd"], [281, 1, 1, "_CPPv42jd", "jd"], [281, 1, 1, "_CPPv42jd", "jd"], [281, 1, 1, "_CPPv42jd", "jd"], [281, 1, 1, "_CPPv42jd", "jd"], [281, 1, 1, "_CPPv42jd", "jd"], [281, 2, 1, "_CPPv4N2jd12attention_io6K_BIASE", "jd::K_BIAS"], [281, 2, 1, "_CPPv4N2jd12attention_io8K_SCALESE", "jd::K_SCALES"], [281, 2, 1, "_CPPv4N2jd12attention_io8K_WEIGHTE", "jd::K_WEIGHT"], [281, 2, 1, "_CPPv4N2jd12attention_io9MERGE_DSTE", "jd::MERGE_DST"], [281, 2, 1, "_CPPv4N2jd12attention_io9MERGE_SRCE", "jd::MERGE_SRC"], [281, 2, 1, "_CPPv4N2jd12attention_io18QK_V_OUTPUT_SCALESE", "jd::QK_V_OUTPUT_SCALES"], [281, 2, 1, "_CPPv4N2jd12attention_io22QK_V_OUTPUT_ZERO_POINTE", "jd::QK_V_OUTPUT_ZERO_POINT"], [281, 2, 1, "_CPPv4N2jd12attention_io6Q_BIASE", "jd::Q_BIAS"], [281, 2, 1, "_CPPv4N2jd12attention_io10Q_K_SCALESE", "jd::Q_K_SCALES"], [281, 2, 1, "_CPPv4N2jd12attention_io8Q_K_SRC2E", "jd::Q_K_SRC2"], [281, 2, 1, "_CPPv4N2jd12attention_io8Q_SCALESE", "jd::Q_SCALES"], [281, 2, 1, "_CPPv4N2jd12attention_io8Q_WEIGHTE", "jd::Q_WEIGHT"], [281, 2, 1, "_CPPv4N2jd12attention_io13RESHAPE_INPUTE", "jd::RESHAPE_INPUT"], [281, 2, 1, "_CPPv4N2jd12attention_io6V_BIASE", "jd::V_BIAS"], [281, 2, 1, "_CPPv4N2jd12attention_io8V_SCALESE", "jd::V_SCALES"], [281, 2, 1, "_CPPv4N2jd12attention_io8V_WEIGHTE", "jd::V_WEIGHT"], [279, 3, 1, "_CPPv4N2jd9attentionE", "jd::attention"], [279, 4, 1, "_CPPv4N2jd9attention9attentionERK17kernel_desc_proxy", "jd::attention::attention"], [279, 4, 1, "_CPPv4N2jd9attention9attentionEv", "jd::attention::attention"], [279, 5, 1, "_CPPv4N2jd9attention9attentionERK17kernel_desc_proxy", "jd::attention::attention::kdp"], [279, 4, 1, "_CPPv4N2jd9attentionD0Ev", "jd::attention::~attention"], [279, 3, 1, "_CPPv4N2jd14attention_descE", "jd::attention_desc"], [279, 4, 1, "_CPPv4N2jd14attention_desc14attention_descERK13operator_desc", "jd::attention_desc::attention_desc"], [279, 4, 1, "_CPPv4N2jd14attention_desc14attention_descEv", "jd::attention_desc::attention_desc"], [279, 5, 1, "_CPPv4N2jd14attention_desc14attention_descERK13operator_desc", "jd::attention_desc::attention_desc::op_desc"], [279, 4, 1, "_CPPv4N2jd14attention_descD0Ev", "jd::attention_desc::~attention_desc"], [281, 6, 1, "_CPPv4N2jd12attention_ioE", "jd::attention_io"], [281, 2, 1, "_CPPv4N2jd12attention_io6K_BIASE", "jd::attention_io::K_BIAS"], [281, 2, 1, "_CPPv4N2jd12attention_io8K_SCALESE", "jd::attention_io::K_SCALES"], [281, 2, 1, "_CPPv4N2jd12attention_io8K_WEIGHTE", "jd::attention_io::K_WEIGHT"], [281, 2, 1, "_CPPv4N2jd12attention_io9MERGE_DSTE", "jd::attention_io::MERGE_DST"], [281, 2, 1, "_CPPv4N2jd12attention_io9MERGE_SRCE", "jd::attention_io::MERGE_SRC"], [281, 2, 1, "_CPPv4N2jd12attention_io18QK_V_OUTPUT_SCALESE", "jd::attention_io::QK_V_OUTPUT_SCALES"], [281, 2, 1, "_CPPv4N2jd12attention_io22QK_V_OUTPUT_ZERO_POINTE", "jd::attention_io::QK_V_OUTPUT_ZERO_POINT"], [281, 2, 1, "_CPPv4N2jd12attention_io6Q_BIASE", "jd::attention_io::Q_BIAS"], [281, 2, 1, "_CPPv4N2jd12attention_io10Q_K_SCALESE", "jd::attention_io::Q_K_SCALES"], [281, 2, 1, "_CPPv4N2jd12attention_io8Q_K_SRC2E", "jd::attention_io::Q_K_SRC2"], [281, 2, 1, "_CPPv4N2jd12attention_io8Q_SCALESE", "jd::attention_io::Q_SCALES"], [281, 2, 1, "_CPPv4N2jd12attention_io8Q_WEIGHTE", "jd::attention_io::Q_WEIGHT"], [281, 2, 1, "_CPPv4N2jd12attention_io13RESHAPE_INPUTE", "jd::attention_io::RESHAPE_INPUT"], [281, 2, 1, "_CPPv4N2jd12attention_io6V_BIASE", "jd::attention_io::V_BIAS"], [281, 2, 1, "_CPPv4N2jd12attention_io8V_SCALESE", "jd::attention_io::V_SCALES"], [281, 2, 1, "_CPPv4N2jd12attention_io8V_WEIGHTE", "jd::attention_io::V_WEIGHT"], [278, 3, 1, "_CPPv4N2jd12cpu_engine_tE", "jd::cpu_engine_t"], [278, 4, 1, "_CPPv4N2jd12cpu_engine_t12cpu_engine_tEv", "jd::cpu_engine_t::cpu_engine_t"], [278, 4, 1, "_CPPv4NK2jd12cpu_engine_t13create_kernelERK13operator_descRNSt10shared_ptrI8kernel_tEEPK8stream_t", "jd::cpu_engine_t::create_kernel"], [278, 4, 1, "_CPPv4NK2jd12cpu_engine_t21create_memory_storageEPP16memory_storage_t", "jd::cpu_engine_t::create_memory_storage"], [278, 5, 1, "_CPPv4NK2jd12cpu_engine_t21create_memory_storageEPP16memory_storage_t", "jd::cpu_engine_t::create_memory_storage::storage"], [278, 4, 1, "_CPPv4NK2jd12cpu_engine_t13create_streamEPP8stream_t", "jd::cpu_engine_t::create_stream"], [278, 7, 1, "_CPPv4N2jd12cpu_engine_t10empty_listE", "jd::cpu_engine_t::empty_list"], [278, 4, 1, "_CPPv4NK2jd12cpu_engine_t23get_implementation_listERK13operator_desc", "jd::cpu_engine_t::get_implementation_list"], [278, 5, 1, "_CPPv4NK2jd12cpu_engine_t23get_implementation_listERK13operator_desc", "jd::cpu_engine_t::get_implementation_list::op_desc"], [278, 4, 1, "_CPPv4N2jd12cpu_engine_tD0Ev", "jd::cpu_engine_t::~cpu_engine_t"], [279, 3, 1, "_CPPv4N2jd13dynamic_quantE", "jd::dynamic_quant"], [279, 4, 1, "_CPPv4N2jd13dynamic_quant13dynamic_quantERK17kernel_desc_proxy", "jd::dynamic_quant::dynamic_quant"], [279, 4, 1, "_CPPv4N2jd13dynamic_quant13dynamic_quantEv", "jd::dynamic_quant::dynamic_quant"], [279, 5, 1, "_CPPv4N2jd13dynamic_quant13dynamic_quantERK17kernel_desc_proxy", "jd::dynamic_quant::dynamic_quant::kdp"], [279, 4, 1, "_CPPv4N2jd13dynamic_quantD0Ev", "jd::dynamic_quant::~dynamic_quant"], [279, 3, 1, "_CPPv4N2jd18dynamic_quant_descE", "jd::dynamic_quant_desc"], [279, 4, 1, "_CPPv4N2jd18dynamic_quant_desc18dynamic_quant_descERK13operator_desc", "jd::dynamic_quant_desc::dynamic_quant_desc"], [279, 4, 1, "_CPPv4N2jd18dynamic_quant_desc18dynamic_quant_descEv", "jd::dynamic_quant_desc::dynamic_quant_desc"], [279, 5, 1, "_CPPv4N2jd18dynamic_quant_desc18dynamic_quant_descERK13operator_desc", "jd::dynamic_quant_desc::dynamic_quant_desc::op_desc"], [279, 4, 1, "_CPPv4N2jd18dynamic_quant_descD0Ev", "jd::dynamic_quant_desc::~dynamic_quant_desc"], [279, 3, 1, "_CPPv4N2jd20dynamic_quant_matmulE", "jd::dynamic_quant_matmul"], [279, 4, 1, "_CPPv4N2jd20dynamic_quant_matmul20dynamic_quant_matmulERK17kernel_desc_proxy", "jd::dynamic_quant_matmul::dynamic_quant_matmul"], [279, 4, 1, "_CPPv4N2jd20dynamic_quant_matmul20dynamic_quant_matmulEv", "jd::dynamic_quant_matmul::dynamic_quant_matmul"], [279, 5, 1, "_CPPv4N2jd20dynamic_quant_matmul20dynamic_quant_matmulERK17kernel_desc_proxy", "jd::dynamic_quant_matmul::dynamic_quant_matmul::kdp"], [279, 4, 1, "_CPPv4N2jd20dynamic_quant_matmulD0Ev", "jd::dynamic_quant_matmul::~dynamic_quant_matmul"], [279, 3, 1, "_CPPv4N2jd25dynamic_quant_matmul_descE", "jd::dynamic_quant_matmul_desc"], [279, 4, 1, "_CPPv4N2jd25dynamic_quant_matmul_desc25dynamic_quant_matmul_descERK13operator_desc", "jd::dynamic_quant_matmul_desc::dynamic_quant_matmul_desc"], [279, 4, 1, "_CPPv4N2jd25dynamic_quant_matmul_desc25dynamic_quant_matmul_descEv", "jd::dynamic_quant_matmul_desc::dynamic_quant_matmul_desc"], [279, 5, 1, "_CPPv4N2jd25dynamic_quant_matmul_desc25dynamic_quant_matmul_descERK13operator_desc", "jd::dynamic_quant_matmul_desc::dynamic_quant_matmul_desc::op_desc"], [279, 4, 1, "_CPPv4N2jd25dynamic_quant_matmul_descD0Ev", "jd::dynamic_quant_matmul_desc::~dynamic_quant_matmul_desc"], [279, 3, 1, "_CPPv4N2jd9eltwiseopE", "jd::eltwiseop"], [279, 4, 1, "_CPPv4N2jd9eltwiseop9eltwiseopERK17kernel_desc_proxy", "jd::eltwiseop::eltwiseop"], [279, 4, 1, "_CPPv4N2jd9eltwiseop9eltwiseopEv", "jd::eltwiseop::eltwiseop"], [279, 5, 1, "_CPPv4N2jd9eltwiseop9eltwiseopERK17kernel_desc_proxy", "jd::eltwiseop::eltwiseop::kdp"], [279, 4, 1, "_CPPv4N2jd9eltwiseopD0Ev", "jd::eltwiseop::~eltwiseop"], [279, 3, 1, "_CPPv4N2jd14eltwiseop_descE", "jd::eltwiseop_desc"], [279, 4, 1, "_CPPv4N2jd14eltwiseop_desc14eltwiseop_descERK13operator_desc", "jd::eltwiseop_desc::eltwiseop_desc"], [279, 4, 1, "_CPPv4N2jd14eltwiseop_desc14eltwiseop_descEv", "jd::eltwiseop_desc::eltwiseop_desc"], [279, 5, 1, "_CPPv4N2jd14eltwiseop_desc14eltwiseop_descERK13operator_desc", "jd::eltwiseop_desc::eltwiseop_desc::op_desc"], [279, 4, 1, "_CPPv4N2jd14eltwiseop_descD0Ev", "jd::eltwiseop_desc::~eltwiseop_desc"], [278, 3, 1, "_CPPv4N2jd8engine_tE", "jd::engine_t"], [278, 4, 1, "_CPPv4NK2jd8engine_t13create_kernelERK13operator_descRNSt10shared_ptrI8kernel_tEEPK8stream_t", "jd::engine_t::create_kernel"], [278, 4, 1, "_CPPv4NK2jd8engine_t21create_memory_storageEPP16memory_storage_t", "jd::engine_t::create_memory_storage"], [278, 4, 1, "_CPPv4NK2jd8engine_t13create_streamEPP8stream_t", "jd::engine_t::create_stream"], [278, 7, 1, "_CPPv4N2jd8engine_t12engine_kind_E", "jd::engine_t::engine_kind_"], [278, 4, 1, "_CPPv4N2jd8engine_t8engine_tERK11engine_kindRK12runtime_kind", "jd::engine_t::engine_t"], [278, 5, 1, "_CPPv4N2jd8engine_t8engine_tERK11engine_kindRK12runtime_kind", "jd::engine_t::engine_t::engine_kind"], [278, 5, 1, "_CPPv4N2jd8engine_t8engine_tERK11engine_kindRK12runtime_kind", "jd::engine_t::engine_t::runtime_kind"], [278, 4, 1, "_CPPv4NK2jd8engine_t15get_engine_kindEv", "jd::engine_t::get_engine_kind"], [278, 4, 1, "_CPPv4NK2jd8engine_t23get_implementation_listERK13operator_desc", "jd::engine_t::get_implementation_list"], [278, 5, 1, "_CPPv4NK2jd8engine_t23get_implementation_listERK13operator_desc", "jd::engine_t::get_implementation_list::op_desc"], [278, 4, 1, "_CPPv4NK2jd8engine_t16get_runtime_kindEv", "jd::engine_t::get_runtime_kind"], [278, 7, 1, "_CPPv4N2jd8engine_t13runtime_kind_E", "jd::engine_t::runtime_kind_"], [278, 4, 1, "_CPPv4N2jd8engine_tD0Ev", "jd::engine_t::~engine_t"], [279, 3, 1, "_CPPv4N2jd6gatherE", "jd::gather"], [279, 4, 1, "_CPPv4N2jd6gather6gatherERK17kernel_desc_proxy", "jd::gather::gather"], [279, 4, 1, "_CPPv4N2jd6gather6gatherEv", "jd::gather::gather"], [279, 5, 1, "_CPPv4N2jd6gather6gatherERK17kernel_desc_proxy", "jd::gather::gather::kdp"], [279, 4, 1, "_CPPv4N2jd6gatherD0Ev", "jd::gather::~gather"], [279, 3, 1, "_CPPv4N2jd11gather_descE", "jd::gather_desc"], [279, 4, 1, "_CPPv4N2jd11gather_desc11gather_descERK13operator_desc", "jd::gather_desc::gather_desc"], [279, 4, 1, "_CPPv4N2jd11gather_desc11gather_descEv", "jd::gather_desc::gather_desc"], [279, 5, 1, "_CPPv4N2jd11gather_desc11gather_descERK13operator_desc", "jd::gather_desc::gather_desc::op_desc"], [279, 4, 1, "_CPPv4N2jd11gather_descD0Ev", "jd::gather_desc::~gather_desc"], [279, 3, 1, "_CPPv4N2jd9groupnormE", "jd::groupnorm"], [279, 4, 1, "_CPPv4N2jd9groupnorm9groupnormERK17kernel_desc_proxy", "jd::groupnorm::groupnorm"], [279, 4, 1, "_CPPv4N2jd9groupnorm9groupnormEv", "jd::groupnorm::groupnorm"], [279, 5, 1, "_CPPv4N2jd9groupnorm9groupnormERK17kernel_desc_proxy", "jd::groupnorm::groupnorm::kdp"], [279, 4, 1, "_CPPv4N2jd9groupnormD0Ev", "jd::groupnorm::~groupnorm"], [279, 3, 1, "_CPPv4N2jd14groupnorm_descE", "jd::groupnorm_desc"], [279, 4, 1, "_CPPv4N2jd14groupnorm_desc14groupnorm_descERK13operator_desc", "jd::groupnorm_desc::groupnorm_desc"], [279, 4, 1, "_CPPv4N2jd14groupnorm_desc14groupnorm_descEv", "jd::groupnorm_desc::groupnorm_desc"], [279, 5, 1, "_CPPv4N2jd14groupnorm_desc14groupnorm_descERK13operator_desc", "jd::groupnorm_desc::groupnorm_desc::op_desc"], [279, 4, 1, "_CPPv4N2jd14groupnorm_descD0Ev", "jd::groupnorm_desc::~groupnorm_desc"], [279, 3, 1, "_CPPv4N2jd17kernel_desc_proxyE", "jd::kernel_desc_proxy"], [279, 4, 1, "_CPPv4N2jd17kernel_desc_proxy19create_proxy_objectERNSt10shared_ptrIK13kernel_desc_tEERK13operator_desc", "jd::kernel_desc_proxy::create_proxy_object"], [279, 5, 1, "_CPPv4N2jd17kernel_desc_proxy19create_proxy_objectERNSt10shared_ptrIK13kernel_desc_tEERK13operator_desc", "jd::kernel_desc_proxy::create_proxy_object::op_desc"], [279, 5, 1, "_CPPv4N2jd17kernel_desc_proxy19create_proxy_objectERNSt10shared_ptrIK13kernel_desc_tEERK13operator_desc", "jd::kernel_desc_proxy::create_proxy_object::result_ref"], [279, 7, 1, "_CPPv4N2jd17kernel_desc_proxy10impl_list_E", "jd::kernel_desc_proxy::impl_list_"], [279, 4, 1, "_CPPv4N2jd17kernel_desc_proxy17kernel_desc_proxyERK13operator_desc", "jd::kernel_desc_proxy::kernel_desc_proxy"], [279, 4, 1, "_CPPv4N2jd17kernel_desc_proxy17kernel_desc_proxyEv", "jd::kernel_desc_proxy::kernel_desc_proxy"], [279, 5, 1, "_CPPv4N2jd17kernel_desc_proxy17kernel_desc_proxyERK13operator_desc", "jd::kernel_desc_proxy::kernel_desc_proxy::op_desc"], [279, 4, 1, "_CPPv4NK2jd17kernel_desc_proxy11kernel_kindEv", "jd::kernel_desc_proxy::kernel_kind"], [279, 4, 1, "_CPPv4N2jd17kernel_desc_proxyD0Ev", "jd::kernel_desc_proxy::~kernel_desc_proxy"], [279, 3, 1, "_CPPv4N2jd12kernel_proxyE", "jd::kernel_proxy"], [279, 4, 1, "_CPPv4N2jd12kernel_proxy19create_proxy_objectERNSt10shared_ptrIK8kernel_tEERKNSt10shared_ptrIK13kernel_desc_tEE", "jd::kernel_proxy::create_proxy_object"], [279, 5, 1, "_CPPv4N2jd12kernel_proxy19create_proxy_objectERNSt10shared_ptrIK8kernel_tEERKNSt10shared_ptrIK13kernel_desc_tEE", "jd::kernel_proxy::create_proxy_object::kd"], [279, 5, 1, "_CPPv4N2jd12kernel_proxy19create_proxy_objectERNSt10shared_ptrIK8kernel_tEERKNSt10shared_ptrIK13kernel_desc_tEE", "jd::kernel_proxy::create_proxy_object::result_ref"], [279, 4, 1, "_CPPv4NK2jd12kernel_proxy7executeERK14exec_context_t", "jd::kernel_proxy::execute"], [279, 4, 1, "_CPPv4NK2jd12kernel_proxy7executeERKNSt6vectorIPKvEE", "jd::kernel_proxy::execute"], [279, 5, 1, "_CPPv4NK2jd12kernel_proxy7executeERK14exec_context_t", "jd::kernel_proxy::execute::ctx"], [279, 5, 1, "_CPPv4NK2jd12kernel_proxy7executeERKNSt6vectorIPKvEE", "jd::kernel_proxy::execute::rt_data"], [279, 4, 1, "_CPPv4NK2jd12kernel_proxy18get_workspace_sizeEv", "jd::kernel_proxy::get_workspace_size"], [279, 4, 1, "_CPPv4NK2jd12kernel_proxy11kernel_kindEv", "jd::kernel_proxy::kernel_kind"], [279, 4, 1, "_CPPv4N2jd12kernel_proxy12kernel_proxyERK17kernel_desc_proxy", "jd::kernel_proxy::kernel_proxy"], [279, 4, 1, "_CPPv4N2jd12kernel_proxy12kernel_proxyEv", "jd::kernel_proxy::kernel_proxy"], [279, 5, 1, "_CPPv4N2jd12kernel_proxy12kernel_proxyERK17kernel_desc_proxy", "jd::kernel_proxy::kernel_proxy::kdp"], [279, 4, 1, "_CPPv4N2jd12kernel_proxyD0Ev", "jd::kernel_proxy::~kernel_proxy"], [279, 3, 1, "_CPPv4N2jd12layernorm_baE", "jd::layernorm_ba"], [279, 4, 1, "_CPPv4N2jd12layernorm_ba12layernorm_baERK17kernel_desc_proxy", "jd::layernorm_ba::layernorm_ba"], [279, 4, 1, "_CPPv4N2jd12layernorm_ba12layernorm_baEv", "jd::layernorm_ba::layernorm_ba"], [279, 5, 1, "_CPPv4N2jd12layernorm_ba12layernorm_baERK17kernel_desc_proxy", "jd::layernorm_ba::layernorm_ba::kdp"], [279, 4, 1, "_CPPv4N2jd12layernorm_baD0Ev", "jd::layernorm_ba::~layernorm_ba"], [279, 3, 1, "_CPPv4N2jd17layernorm_ba_descE", "jd::layernorm_ba_desc"], [279, 4, 1, "_CPPv4N2jd17layernorm_ba_desc17layernorm_ba_descERK13operator_desc", "jd::layernorm_ba_desc::layernorm_ba_desc"], [279, 4, 1, "_CPPv4N2jd17layernorm_ba_desc17layernorm_ba_descEv", "jd::layernorm_ba_desc::layernorm_ba_desc"], [279, 5, 1, "_CPPv4N2jd17layernorm_ba_desc17layernorm_ba_descERK13operator_desc", "jd::layernorm_ba_desc::layernorm_ba_desc::op_desc"], [279, 4, 1, "_CPPv4N2jd17layernorm_ba_descD0Ev", "jd::layernorm_ba_desc::~layernorm_ba_desc"], [279, 3, 1, "_CPPv4N2jd20layernormalized_spmmE", "jd::layernormalized_spmm"], [279, 4, 1, "_CPPv4N2jd20layernormalized_spmm20layernormalized_spmmERK17kernel_desc_proxy", "jd::layernormalized_spmm::layernormalized_spmm"], [279, 4, 1, "_CPPv4N2jd20layernormalized_spmm20layernormalized_spmmEv", "jd::layernormalized_spmm::layernormalized_spmm"], [279, 5, 1, "_CPPv4N2jd20layernormalized_spmm20layernormalized_spmmERK17kernel_desc_proxy", "jd::layernormalized_spmm::layernormalized_spmm::kdp"], [279, 4, 1, "_CPPv4N2jd20layernormalized_spmmD0Ev", "jd::layernormalized_spmm::~layernormalized_spmm"], [279, 3, 1, "_CPPv4N2jd25layernormalized_spmm_descE", "jd::layernormalized_spmm_desc"], [279, 4, 1, "_CPPv4N2jd25layernormalized_spmm_desc25layernormalized_spmm_descERK13operator_desc", "jd::layernormalized_spmm_desc::layernormalized_spmm_desc"], [279, 4, 1, "_CPPv4N2jd25layernormalized_spmm_desc25layernormalized_spmm_descEv", "jd::layernormalized_spmm_desc::layernormalized_spmm_desc"], [279, 5, 1, "_CPPv4N2jd25layernormalized_spmm_desc25layernormalized_spmm_descERK13operator_desc", "jd::layernormalized_spmm_desc::layernormalized_spmm_desc::op_desc"], [279, 4, 1, "_CPPv4N2jd25layernormalized_spmm_descD0Ev", "jd::layernormalized_spmm_desc::~layernormalized_spmm_desc"], [279, 3, 1, "_CPPv4N2jd10logsoftmaxE", "jd::logsoftmax"], [279, 4, 1, "_CPPv4N2jd10logsoftmax10logsoftmaxERK17kernel_desc_proxy", "jd::logsoftmax::logsoftmax"], [279, 4, 1, "_CPPv4N2jd10logsoftmax10logsoftmaxEv", "jd::logsoftmax::logsoftmax"], [279, 5, 1, "_CPPv4N2jd10logsoftmax10logsoftmaxERK17kernel_desc_proxy", "jd::logsoftmax::logsoftmax::kdp"], [279, 4, 1, "_CPPv4N2jd10logsoftmaxD0Ev", "jd::logsoftmax::~logsoftmax"], [279, 3, 1, "_CPPv4N2jd15logsoftmax_descE", "jd::logsoftmax_desc"], [279, 4, 1, "_CPPv4N2jd15logsoftmax_desc15logsoftmax_descERK13operator_desc", "jd::logsoftmax_desc::logsoftmax_desc"], [279, 4, 1, "_CPPv4N2jd15logsoftmax_desc15logsoftmax_descEv", "jd::logsoftmax_desc::logsoftmax_desc"], [279, 5, 1, "_CPPv4N2jd15logsoftmax_desc15logsoftmax_descERK13operator_desc", "jd::logsoftmax_desc::logsoftmax_desc::op_desc"], [279, 4, 1, "_CPPv4N2jd15logsoftmax_descD0Ev", "jd::logsoftmax_desc::~logsoftmax_desc"], [279, 3, 1, "_CPPv4N2jd9mha_denseE", "jd::mha_dense"], [279, 4, 1, "_CPPv4N2jd9mha_dense9mha_denseERK17kernel_desc_proxy", "jd::mha_dense::mha_dense"], [279, 4, 1, "_CPPv4N2jd9mha_dense9mha_denseEv", "jd::mha_dense::mha_dense"], [279, 5, 1, "_CPPv4N2jd9mha_dense9mha_denseERK17kernel_desc_proxy", "jd::mha_dense::mha_dense::kdp"], [279, 4, 1, "_CPPv4N2jd9mha_denseD0Ev", "jd::mha_dense::~mha_dense"], [279, 3, 1, "_CPPv4N2jd14mha_dense_descE", "jd::mha_dense_desc"], [279, 4, 1, "_CPPv4N2jd14mha_dense_desc14mha_dense_descERK13operator_desc", "jd::mha_dense_desc::mha_dense_desc"], [279, 4, 1, "_CPPv4N2jd14mha_dense_desc14mha_dense_descEv", "jd::mha_dense_desc::mha_dense_desc"], [279, 5, 1, "_CPPv4N2jd14mha_dense_desc14mha_dense_descERK13operator_desc", "jd::mha_dense_desc::mha_dense_desc::op_desc"], [279, 4, 1, "_CPPv4N2jd14mha_dense_descD0Ev", "jd::mha_dense_desc::~mha_dense_desc"], [280, 3, 1, "_CPPv4N2jd13operator_descE", "jd::operator_desc"], [280, 4, 1, "_CPPv4NK2jd13operator_desc18apply_postops_listEv", "jd::operator_desc::apply_postops_list"], [280, 7, 1, "_CPPv4N2jd13operator_desc19apply_postops_list_E", "jd::operator_desc::apply_postops_list_"], [280, 4, 1, "_CPPv4NK2jd13operator_desc5attrsEv", "jd::operator_desc::attrs"], [280, 7, 1, "_CPPv4N2jd13operator_desc6attrs_E", "jd::operator_desc::attrs_"], [280, 7, 1, "_CPPv4N2jd13operator_desc14binaryop_list_E", "jd::operator_desc::binaryop_list_"], [280, 4, 1, "_CPPv4NK2jd13operator_desc11engine_kindEv", "jd::operator_desc::engine_kind"], [280, 7, 1, "_CPPv4N2jd13operator_desc12engine_kind_E", "jd::operator_desc::engine_kind_"], [280, 4, 1, "_CPPv4NK2jd13operator_desc17get_binaryop_listEv", "jd::operator_desc::get_binaryop_list"], [280, 4, 1, "_CPPv4NK2jd13operator_desc9impl_nthrEv", "jd::operator_desc::impl_nthr"], [280, 7, 1, "_CPPv4N2jd13operator_desc10impl_nthr_E", "jd::operator_desc::impl_nthr_"], [280, 7, 1, "_CPPv4N2jd13operator_desc9ker_kind_E", "jd::operator_desc::ker_kind_"], [280, 7, 1, "_CPPv4N2jd13operator_desc9ker_prop_E", "jd::operator_desc::ker_prop_"], [280, 4, 1, "_CPPv4NK2jd13operator_desc11kernel_kindEv", "jd::operator_desc::kernel_kind"], [280, 4, 1, "_CPPv4NK2jd13operator_desc11kernel_propEv", "jd::operator_desc::kernel_prop"], [280, 4, 1, "_CPPv4NK2jd13operator_desceqERK13operator_desc", "jd::operator_desc::operator=="], [280, 5, 1, "_CPPv4NK2jd13operator_desceqERK13operator_desc", "jd::operator_desc::operator==::rhs"], [280, 4, 1, "_CPPv4N2jd13operator_desc13operator_descERK11kernel_kindRK11kernel_propRK11engine_kindRK12runtime_kindRKNSt6vectorI11tensor_descEERKNSt13unordered_mapINSt6stringENSt6stringEEERKNSt6vectorI11postop_attrEE", "jd::operator_desc::operator_desc"], [280, 4, 1, "_CPPv4N2jd13operator_desc13operator_descERK11kernel_kindRK11kernel_propRK11engine_kindRKNSt6vectorI11tensor_descEERKNSt13unordered_mapINSt6stringENSt6stringEEERKNSt6vectorI11postop_attrEE", "jd::operator_desc::operator_desc"], [280, 4, 1, "_CPPv4N2jd13operator_desc13operator_descEv", "jd::operator_desc::operator_desc"], [280, 5, 1, "_CPPv4N2jd13operator_desc13operator_descERK11kernel_kindRK11kernel_propRK11engine_kindRK12runtime_kindRKNSt6vectorI11tensor_descEERKNSt13unordered_mapINSt6stringENSt6stringEEERKNSt6vectorI11postop_attrEE", "jd::operator_desc::operator_desc::apply_postops_list"], [280, 5, 1, "_CPPv4N2jd13operator_desc13operator_descERK11kernel_kindRK11kernel_propRK11engine_kindRKNSt6vectorI11tensor_descEERKNSt13unordered_mapINSt6stringENSt6stringEEERKNSt6vectorI11postop_attrEE", "jd::operator_desc::operator_desc::apply_postops_list"], [280, 5, 1, "_CPPv4N2jd13operator_desc13operator_descERK11kernel_kindRK11kernel_propRK11engine_kindRK12runtime_kindRKNSt6vectorI11tensor_descEERKNSt13unordered_mapINSt6stringENSt6stringEEERKNSt6vectorI11postop_attrEE", "jd::operator_desc::operator_desc::attrs"], [280, 5, 1, "_CPPv4N2jd13operator_desc13operator_descERK11kernel_kindRK11kernel_propRK11engine_kindRKNSt6vectorI11tensor_descEERKNSt13unordered_mapINSt6stringENSt6stringEEERKNSt6vectorI11postop_attrEE", "jd::operator_desc::operator_desc::attrs"], [280, 5, 1, "_CPPv4N2jd13operator_desc13operator_descERK11kernel_kindRK11kernel_propRK11engine_kindRK12runtime_kindRKNSt6vectorI11tensor_descEERKNSt13unordered_mapINSt6stringENSt6stringEEERKNSt6vectorI11postop_attrEE", "jd::operator_desc::operator_desc::eng_kind"], [280, 5, 1, "_CPPv4N2jd13operator_desc13operator_descERK11kernel_kindRK11kernel_propRK11engine_kindRKNSt6vectorI11tensor_descEERKNSt13unordered_mapINSt6stringENSt6stringEEERKNSt6vectorI11postop_attrEE", "jd::operator_desc::operator_desc::eng_kind"], [280, 5, 1, "_CPPv4N2jd13operator_desc13operator_descERK11kernel_kindRK11kernel_propRK11engine_kindRK12runtime_kindRKNSt6vectorI11tensor_descEERKNSt13unordered_mapINSt6stringENSt6stringEEERKNSt6vectorI11postop_attrEE", "jd::operator_desc::operator_desc::ker_kind"], [280, 5, 1, "_CPPv4N2jd13operator_desc13operator_descERK11kernel_kindRK11kernel_propRK11engine_kindRKNSt6vectorI11tensor_descEERKNSt13unordered_mapINSt6stringENSt6stringEEERKNSt6vectorI11postop_attrEE", "jd::operator_desc::operator_desc::ker_kind"], [280, 5, 1, "_CPPv4N2jd13operator_desc13operator_descERK11kernel_kindRK11kernel_propRK11engine_kindRK12runtime_kindRKNSt6vectorI11tensor_descEERKNSt13unordered_mapINSt6stringENSt6stringEEERKNSt6vectorI11postop_attrEE", "jd::operator_desc::operator_desc::ker_prop"], [280, 5, 1, "_CPPv4N2jd13operator_desc13operator_descERK11kernel_kindRK11kernel_propRK11engine_kindRKNSt6vectorI11tensor_descEERKNSt13unordered_mapINSt6stringENSt6stringEEERKNSt6vectorI11postop_attrEE", "jd::operator_desc::operator_desc::ker_prop"], [280, 5, 1, "_CPPv4N2jd13operator_desc13operator_descERK11kernel_kindRK11kernel_propRK11engine_kindRK12runtime_kindRKNSt6vectorI11tensor_descEERKNSt13unordered_mapINSt6stringENSt6stringEEERKNSt6vectorI11postop_attrEE", "jd::operator_desc::operator_desc::runtime_kind"], [280, 5, 1, "_CPPv4N2jd13operator_desc13operator_descERK11kernel_kindRK11kernel_propRK11engine_kindRK12runtime_kindRKNSt6vectorI11tensor_descEERKNSt13unordered_mapINSt6stringENSt6stringEEERKNSt6vectorI11postop_attrEE", "jd::operator_desc::operator_desc::ts_descs"], [280, 5, 1, "_CPPv4N2jd13operator_desc13operator_descERK11kernel_kindRK11kernel_propRK11engine_kindRKNSt6vectorI11tensor_descEERKNSt13unordered_mapINSt6stringENSt6stringEEERKNSt6vectorI11postop_attrEE", "jd::operator_desc::operator_desc::ts_descs"], [280, 4, 1, "_CPPv4NK2jd13operator_desc12runtime_kindEv", "jd::operator_desc::runtime_kind"], [280, 7, 1, "_CPPv4N2jd13operator_desc13runtime_kind_E", "jd::operator_desc::runtime_kind_"], [280, 4, 1, "_CPPv4N2jd13operator_desc17set_binaryop_listERKNSt6vectorI13binaryop_attrEE", "jd::operator_desc::set_binaryop_list"], [280, 5, 1, "_CPPv4N2jd13operator_desc17set_binaryop_listERKNSt6vectorI13binaryop_attrEE", "jd::operator_desc::set_binaryop_list::binaryop_list"], [280, 4, 1, "_CPPv4NK2jd13operator_desc12tensor_descsEv", "jd::operator_desc::tensor_descs"], [280, 4, 1, "_CPPv4NK2jd13operator_desc13tensor_dtypesEv", "jd::operator_desc::tensor_dtypes"], [280, 4, 1, "_CPPv4NK2jd13operator_desc13tensor_ftypesEv", "jd::operator_desc::tensor_ftypes"], [280, 4, 1, "_CPPv4NK2jd13operator_desc13tensor_shapesEv", "jd::operator_desc::tensor_shapes"], [280, 7, 1, "_CPPv4N2jd13operator_desc9ts_descs_E", "jd::operator_desc::ts_descs_"], [280, 4, 1, "_CPPv4N2jd13operator_descD0Ev", "jd::operator_desc::~operator_desc"], [279, 3, 1, "_CPPv4I00EN2jd10proxy_baseE", "jd::proxy_base"], [279, 8, 1, "_CPPv4I00EN2jd10proxy_baseE", "jd::proxy_base::T"], [279, 8, 1, "_CPPv4I00EN2jd10proxy_baseE", "jd::proxy_base::arg_t"], [279, 4, 1, "_CPPv4N2jd10proxy_base19create_proxy_objectERNSt10shared_ptrIK1TEERK5arg_t", "jd::proxy_base::create_proxy_object"], [279, 5, 1, "_CPPv4N2jd10proxy_base19create_proxy_objectERNSt10shared_ptrIK1TEERK5arg_t", "jd::proxy_base::create_proxy_object::arg"], [279, 5, 1, "_CPPv4N2jd10proxy_base19create_proxy_objectERNSt10shared_ptrIK1TEERK5arg_t", "jd::proxy_base::create_proxy_object::result_ref"], [279, 7, 1, "_CPPv4N2jd10proxy_base12data_handle_E", "jd::proxy_base::data_handle_"], [279, 4, 1, "_CPPv4NK2jd10proxy_base6get_spEv", "jd::proxy_base::get_sp"], [279, 4, 1, "_CPPv4N2jd10proxy_base10proxy_baseEv", "jd::proxy_base::proxy_base"], [279, 4, 1, "_CPPv4N2jd10proxy_base8reset_spERKNSt10shared_ptrIK1TEE", "jd::proxy_base::reset_sp"], [279, 5, 1, "_CPPv4N2jd10proxy_base8reset_spERKNSt10shared_ptrIK1TEE", "jd::proxy_base::reset_sp::sp"], [279, 4, 1, "_CPPv4N2jd10proxy_baseD0Ev", "jd::proxy_base::~proxy_base"], [279, 3, 1, "_CPPv4N2jd5sliceE", "jd::slice"], [279, 4, 1, "_CPPv4N2jd5slice5sliceERK17kernel_desc_proxy", "jd::slice::slice"], [279, 4, 1, "_CPPv4N2jd5slice5sliceEv", "jd::slice::slice"], [279, 5, 1, "_CPPv4N2jd5slice5sliceERK17kernel_desc_proxy", "jd::slice::slice::kdp"], [279, 4, 1, "_CPPv4N2jd5sliceD0Ev", "jd::slice::~slice"], [279, 3, 1, "_CPPv4N2jd10slice_descE", "jd::slice_desc"], [279, 4, 1, "_CPPv4N2jd10slice_desc10slice_descERK13operator_desc", "jd::slice_desc::slice_desc"], [279, 4, 1, "_CPPv4N2jd10slice_desc10slice_descEv", "jd::slice_desc::slice_desc"], [279, 5, 1, "_CPPv4N2jd10slice_desc10slice_descERK13operator_desc", "jd::slice_desc::slice_desc::op_desc"], [279, 4, 1, "_CPPv4N2jd10slice_descD0Ev", "jd::slice_desc::~slice_desc"], [279, 3, 1, "_CPPv4N2jd7softmaxE", "jd::softmax"], [279, 4, 1, "_CPPv4N2jd7softmax7softmaxERK17kernel_desc_proxy", "jd::softmax::softmax"], [279, 4, 1, "_CPPv4N2jd7softmax7softmaxEv", "jd::softmax::softmax"], [279, 5, 1, "_CPPv4N2jd7softmax7softmaxERK17kernel_desc_proxy", "jd::softmax::softmax::kdp"], [279, 4, 1, "_CPPv4N2jd7softmaxD0Ev", "jd::softmax::~softmax"], [279, 3, 1, "_CPPv4N2jd12softmax_descE", "jd::softmax_desc"], [279, 4, 1, "_CPPv4N2jd12softmax_desc12softmax_descERK13operator_desc", "jd::softmax_desc::softmax_desc"], [279, 4, 1, "_CPPv4N2jd12softmax_desc12softmax_descEv", "jd::softmax_desc::softmax_desc"], [279, 5, 1, "_CPPv4N2jd12softmax_desc12softmax_descERK13operator_desc", "jd::softmax_desc::softmax_desc::op_desc"], [279, 4, 1, "_CPPv4N2jd12softmax_descD0Ev", "jd::softmax_desc::~softmax_desc"], [279, 3, 1, "_CPPv4N2jd13sparse_matmulE", "jd::sparse_matmul"], [279, 4, 1, "_CPPv4N2jd13sparse_matmul13sparse_matmulERK17kernel_desc_proxy", "jd::sparse_matmul::sparse_matmul"], [279, 4, 1, "_CPPv4N2jd13sparse_matmul13sparse_matmulEv", "jd::sparse_matmul::sparse_matmul"], [279, 5, 1, "_CPPv4N2jd13sparse_matmul13sparse_matmulERK17kernel_desc_proxy", "jd::sparse_matmul::sparse_matmul::kdp"], [279, 4, 1, "_CPPv4N2jd13sparse_matmulD0Ev", "jd::sparse_matmul::~sparse_matmul"], [279, 3, 1, "_CPPv4N2jd18sparse_matmul_descE", "jd::sparse_matmul_desc"], [279, 4, 1, "_CPPv4N2jd18sparse_matmul_desc18sparse_matmul_descERK13operator_desc", "jd::sparse_matmul_desc::sparse_matmul_desc"], [279, 4, 1, "_CPPv4N2jd18sparse_matmul_desc18sparse_matmul_descEv", "jd::sparse_matmul_desc::sparse_matmul_desc"], [279, 5, 1, "_CPPv4N2jd18sparse_matmul_desc18sparse_matmul_descERK13operator_desc", "jd::sparse_matmul_desc::sparse_matmul_desc::op_desc"], [279, 4, 1, "_CPPv4N2jd18sparse_matmul_descD0Ev", "jd::sparse_matmul_desc::~sparse_matmul_desc"], [281, 1, 1, "_CPPv4N2jd3ssdE", "jd::ssd"], [281, 1, 1, "_CPPv4N2jd3ssdE", "jd::ssd"], [281, 1, 1, "_CPPv4N2jd3ssdE", "jd::ssd"], [281, 1, 1, "_CPPv4N2jd3ssdE", "jd::ssd"], [281, 1, 1, "_CPPv4N2jd3ssdE", "jd::ssd"], [281, 1, 1, "_CPPv4N2jd3ssdE", "jd::ssd"], [281, 1, 1, "_CPPv4N2jd3ssdE", "jd::ssd"], [281, 7, 1, "_CPPv4N2jd3ssd4BIASE", "jd::ssd::BIAS"], [281, 7, 1, "_CPPv4N2jd3ssd3DSTE", "jd::ssd::DST"], [281, 7, 1, "_CPPv4N2jd3ssd6DST_M1E", "jd::ssd::DST_M1"], [281, 7, 1, "_CPPv4N2jd3ssd6DST_M2E", "jd::ssd::DST_M2"], [281, 7, 1, "_CPPv4N2jd3ssd6SCALESE", "jd::ssd::SCALES"], [281, 7, 1, "_CPPv4N2jd3ssd3SRCE", "jd::ssd::SRC"], [281, 7, 1, "_CPPv4N2jd3ssd3WEIE", "jd::ssd::WEI"], [281, 7, 1, "_CPPv4N2jd3ssd10WORK_SPACEE", "jd::ssd::WORK_SPACE"], [281, 1, 1, "_CPPv4N2jd3ssd17amx_bf16_params_tE", "jd::ssd::amx_bf16_params_t"], [281, 1, 1, "_CPPv4N2jd3ssd21amx_bf16bf16_inputs_tE", "jd::ssd::amx_bf16bf16_inputs_t"], [281, 1, 1, "_CPPv4N2jd3ssd20amx_bf16f32_inputs_tE", "jd::ssd::amx_bf16f32_inputs_t"], [281, 3, 1, "_CPPv4I0000EN2jd3ssd12amx_inputs_tE", "jd::ssd::amx_inputs_t"], [281, 8, 1, "_CPPv4I0000EN2jd3ssd12amx_inputs_tE", "jd::ssd::amx_inputs_t::bia_t"], [281, 7, 1, "_CPPv4N2jd3ssd12amx_inputs_t4biasE", "jd::ssd::amx_inputs_t::bias"], [281, 7, 1, "_CPPv4N2jd3ssd12amx_inputs_t3dstE", "jd::ssd::amx_inputs_t::dst"], [281, 8, 1, "_CPPv4I0000EN2jd3ssd12amx_inputs_tE", "jd::ssd::amx_inputs_t::dst_t"], [281, 7, 1, "_CPPv4N2jd3ssd12amx_inputs_t3srcE", "jd::ssd::amx_inputs_t::src"], [281, 8, 1, "_CPPv4I0000EN2jd3ssd12amx_inputs_tE", "jd::ssd::amx_inputs_t::src_t"], [281, 7, 1, "_CPPv4N2jd3ssd12amx_inputs_t6weightE", "jd::ssd::amx_inputs_t::weight"], [281, 8, 1, "_CPPv4I0000EN2jd3ssd12amx_inputs_tE", "jd::ssd::amx_inputs_t::wgt_t"], [281, 1, 1, "_CPPv4N2jd3ssd17amx_int8_params_tE", "jd::ssd::amx_int8_params_t"], [281, 3, 1, "_CPPv4I0EN2jd3ssd12amx_params_tE", "jd::ssd::amx_params_t"], [281, 8, 1, "_CPPv4I0EN2jd3ssd12amx_params_tE", "jd::ssd::amx_params_t::T"], [281, 7, 1, "_CPPv4N2jd3ssd12amx_params_t16blocks_per_groupE", "jd::ssd::amx_params_t::blocks_per_group"], [281, 7, 1, "_CPPv4N2jd3ssd12amx_params_t9blocksizeE", "jd::ssd::amx_params_t::blocksize"], [281, 7, 1, "_CPPv4N2jd3ssd12amx_params_t7colidxsE", "jd::ssd::amx_params_t::colidxs"], [281, 7, 1, "_CPPv4N2jd3ssd12amx_params_t12group_rowptrE", "jd::ssd::amx_params_t::group_rowptr"], [281, 7, 1, "_CPPv4N2jd3ssd12amx_params_t8has_biasE", "jd::ssd::amx_params_t::has_bias"], [281, 7, 1, "_CPPv4N2jd3ssd12amx_params_t9nnz_groupE", "jd::ssd::amx_params_t::nnz_group"], [281, 7, 1, "_CPPv4N2jd3ssd12amx_params_t7nrowptrE", "jd::ssd::amx_params_t::nrowptr"], [281, 7, 1, "_CPPv4N2jd3ssd12amx_params_t9num_tileME", "jd::ssd::amx_params_t::num_tileM"], [281, 7, 1, "_CPPv4N2jd3ssd12amx_params_t12postop_attrsE", "jd::ssd::amx_params_t::postop_attrs"], [281, 7, 1, "_CPPv4N2jd3ssd12amx_params_t14same_src_dtypeE", "jd::ssd::amx_params_t::same_src_dtype"], [281, 7, 1, "_CPPv4N2jd3ssd12amx_params_t5shapeE", "jd::ssd::amx_params_t::shape"], [281, 7, 1, "_CPPv4N2jd3ssd12amx_params_t5tileME", "jd::ssd::amx_params_t::tileM"], [281, 7, 1, "_CPPv4N2jd3ssd12amx_params_t5tileNE", "jd::ssd::amx_params_t::tileN"], [281, 7, 1, "_CPPv4N2jd3ssd12amx_params_t6weightE", "jd::ssd::amx_params_t::weight"], [281, 3, 1, "_CPPv4N2jd3ssd13avx512_data_tE", "jd::ssd::avx512_data_t"], [281, 7, 1, "_CPPv4N2jd3ssd13avx512_data_t4biasE", "jd::ssd::avx512_data_t::bias"], [281, 7, 1, "_CPPv4N2jd3ssd13avx512_data_t5denseE", "jd::ssd::avx512_data_t::dense"], [281, 7, 1, "_CPPv4N2jd3ssd13avx512_data_t3dstE", "jd::ssd::avx512_data_t::dst"], [281, 7, 1, "_CPPv4N2jd3ssd13avx512_data_t6sparseE", "jd::ssd::avx512_data_t::sparse"], [281, 3, 1, "_CPPv4N2jd3ssd20avx512_fp32_params_tE", "jd::ssd::avx512_fp32_params_t"], [281, 7, 1, "_CPPv4N2jd3ssd20avx512_fp32_params_t1KE", "jd::ssd::avx512_fp32_params_t::K"], [281, 7, 1, "_CPPv4N2jd3ssd20avx512_fp32_params_t1ME", "jd::ssd::avx512_fp32_params_t::M"], [281, 7, 1, "_CPPv4N2jd3ssd20avx512_fp32_params_t1NE", "jd::ssd::avx512_fp32_params_t::N"], [281, 7, 1, "_CPPv4N2jd3ssd20avx512_fp32_params_t8has_biasE", "jd::ssd::avx512_fp32_params_t::has_bias"], [281, 7, 1, "_CPPv4N2jd3ssd20avx512_fp32_params_t6im_endE", "jd::ssd::avx512_fp32_params_t::im_end"], [281, 7, 1, "_CPPv4N2jd3ssd20avx512_fp32_params_t8im_startE", "jd::ssd::avx512_fp32_params_t::im_start"], [281, 7, 1, "_CPPv4N2jd3ssd20avx512_fp32_params_t6in_endE", "jd::ssd::avx512_fp32_params_t::in_end"], [281, 7, 1, "_CPPv4N2jd3ssd20avx512_fp32_params_t8in_startE", "jd::ssd::avx512_fp32_params_t::in_start"], [281, 7, 1, "_CPPv4N2jd3ssd20avx512_fp32_params_t12postop_attrsE", "jd::ssd::avx512_fp32_params_t::postop_attrs"], [281, 7, 1, "_CPPv4N2jd3ssd20avx512_fp32_params_t10sparse_ptrE", "jd::ssd::avx512_fp32_params_t::sparse_ptr"], [281, 2, 1, "_CPPv4N2jd3ssd20spec_translnorm_type6directE", "jd::ssd::direct"], [281, 3, 1, "_CPPv4N2jd3ssd16eltwiseop_data_tE", "jd::ssd::eltwiseop_data_t"], [281, 7, 1, "_CPPv4N2jd3ssd16eltwiseop_data_t3dstE", "jd::ssd::eltwiseop_data_t::dst"], [281, 7, 1, "_CPPv4N2jd3ssd16eltwiseop_data_t11element_numE", "jd::ssd::eltwiseop_data_t::element_num"], [281, 7, 1, "_CPPv4N2jd3ssd16eltwiseop_data_t3srcE", "jd::ssd::eltwiseop_data_t::src"], [281, 3, 1, "_CPPv4N2jd3ssd17eltwiseop_param_tE", "jd::ssd::eltwiseop_param_t"], [281, 7, 1, "_CPPv4N2jd3ssd17eltwiseop_param_t11element_numE", "jd::ssd::eltwiseop_param_t::element_num"], [281, 7, 1, "_CPPv4N2jd3ssd17eltwiseop_param_t19element_num_each_thE", "jd::ssd::eltwiseop_param_t::element_num_each_th"], [281, 7, 1, "_CPPv4N2jd3ssd17eltwiseop_param_t5in_dtE", "jd::ssd::eltwiseop_param_t::in_dt"], [281, 7, 1, "_CPPv4N2jd3ssd17eltwiseop_param_t6out_dtE", "jd::ssd::eltwiseop_param_t::out_dt"], [281, 7, 1, "_CPPv4N2jd3ssd17eltwiseop_param_t12postop_attrsE", "jd::ssd::eltwiseop_param_t::postop_attrs"], [281, 7, 1, "_CPPv4N2jd3ssd17eltwiseop_param_t14remain_elementE", "jd::ssd::eltwiseop_param_t::remain_element"], [281, 3, 1, "_CPPv4N2jd3ssd19layernorm_ba_data_tE", "jd::ssd::layernorm_ba_data_t"], [281, 7, 1, "_CPPv4N2jd3ssd19layernorm_ba_data_tUt1_3E", "jd::ssd::layernorm_ba_data_t::[anonymous]"], [281, 7, 1, "_CPPv4N2jd3ssd19layernorm_ba_data_t5alphaE", "jd::ssd::layernorm_ba_data_t::alpha"], [281, 7, 1, "_CPPv4N2jd3ssd19layernorm_ba_data_t4betaE", "jd::ssd::layernorm_ba_data_t::beta"], [281, 7, 1, "_CPPv4N2jd3ssd19layernorm_ba_data_t3dstE", "jd::ssd::layernorm_ba_data_t::dst"], [281, 7, 1, "_CPPv4N2jd3ssd19layernorm_ba_data_t4dst2E", "jd::ssd::layernorm_ba_data_t::dst2"], [281, 7, 1, "_CPPv4N2jd3ssd19layernorm_ba_data_t3epsE", "jd::ssd::layernorm_ba_data_t::eps"], [281, 7, 1, "_CPPv4N2jd3ssd19layernorm_ba_data_t4meanE", "jd::ssd::layernorm_ba_data_t::mean"], [281, 7, 1, "_CPPv4N2jd3ssd19layernorm_ba_data_t1nE", "jd::ssd::layernorm_ba_data_t::n"], [281, 7, 1, "_CPPv4N2jd3ssd19layernorm_ba_data_t3oneE", "jd::ssd::layernorm_ba_data_t::one"], [281, 7, 1, "_CPPv4N2jd3ssd19layernorm_ba_data_t11process_rowE", "jd::ssd::layernorm_ba_data_t::process_row"], [281, 7, 1, "_CPPv4N2jd3ssd19layernorm_ba_data_t3srcE", "jd::ssd::layernorm_ba_data_t::src"], [281, 7, 1, "_CPPv4N2jd3ssd19layernorm_ba_data_t3varE", "jd::ssd::layernorm_ba_data_t::var"], [281, 3, 1, "_CPPv4N2jd3ssd20layernorm_ba_param_tE", "jd::ssd::layernorm_ba_param_t"], [281, 7, 1, "_CPPv4N2jd3ssd20layernorm_ba_param_t9batch_numE", "jd::ssd::layernorm_ba_param_t::batch_num"], [281, 7, 1, "_CPPv4N2jd3ssd20layernorm_ba_param_t14binaryop_attrsE", "jd::ssd::layernorm_ba_param_t::binaryop_attrs"], [281, 7, 1, "_CPPv4N2jd3ssd20layernorm_ba_param_t7col_numE", "jd::ssd::layernorm_ba_param_t::col_num"], [281, 7, 1, "_CPPv4N2jd3ssd20layernorm_ba_param_t18direct_process_rowE", "jd::ssd::layernorm_ba_param_t::direct_process_row"], [281, 7, 1, "_CPPv4N2jd3ssd20layernorm_ba_param_t8input_dtE", "jd::ssd::layernorm_ba_param_t::input_dt"], [281, 7, 1, "_CPPv4N2jd3ssd20layernorm_ba_param_t13ker_per_batchE", "jd::ssd::layernorm_ba_param_t::ker_per_batch"], [281, 7, 1, "_CPPv4N2jd3ssd20layernorm_ba_param_t10output2_dtE", "jd::ssd::layernorm_ba_param_t::output2_dt"], [281, 7, 1, "_CPPv4N2jd3ssd20layernorm_ba_param_t9output_dtE", "jd::ssd::layernorm_ba_param_t::output_dt"], [281, 7, 1, "_CPPv4N2jd3ssd20layernorm_ba_param_t12postop_attrsE", "jd::ssd::layernorm_ba_param_t::postop_attrs"], [281, 7, 1, "_CPPv4N2jd3ssd20layernorm_ba_param_t21process_batch_per_kerE", "jd::ssd::layernorm_ba_param_t::process_batch_per_ker"], [281, 7, 1, "_CPPv4N2jd3ssd20layernorm_ba_param_t11process_colE", "jd::ssd::layernorm_ba_param_t::process_col"], [281, 7, 1, "_CPPv4N2jd3ssd20layernorm_ba_param_t7row_numE", "jd::ssd::layernorm_ba_param_t::row_num"], [281, 7, 1, "_CPPv4N2jd3ssd20layernorm_ba_param_t9spec_typeE", "jd::ssd::layernorm_ba_param_t::spec_type"], [281, 7, 1, "_CPPv4N2jd3ssd20layernorm_ba_param_t12split_outputE", "jd::ssd::layernorm_ba_param_t::split_output"], [281, 7, 1, "_CPPv4N2jd3ssd20layernorm_ba_param_t17thread_elt_offsetE", "jd::ssd::layernorm_ba_param_t::thread_elt_offset"], [281, 2, 1, "_CPPv4N2jd3ssd17spec_softmax_type3lutE", "jd::ssd::lut"], [281, 3, 1, "_CPPv4N2jd3ssd13matmul_data_tE", "jd::ssd::matmul_data_t"], [281, 7, 1, "_CPPv4N2jd3ssd13matmul_data_t3dstE", "jd::ssd::matmul_data_t::dst"], [281, 7, 1, "_CPPv4N2jd3ssd13matmul_data_t4src0E", "jd::ssd::matmul_data_t::src0"], [281, 7, 1, "_CPPv4N2jd3ssd13matmul_data_t4src1E", "jd::ssd::matmul_data_t::src1"], [281, 7, 1, "_CPPv4N2jd3ssd13matmul_data_t4src2E", "jd::ssd::matmul_data_t::src2"], [281, 3, 1, "_CPPv4N2jd3ssd17matmul_fp8_data_tE", "jd::ssd::matmul_fp8_data_t"], [281, 7, 1, "_CPPv4N2jd3ssd17matmul_fp8_data_t5alphaE", "jd::ssd::matmul_fp8_data_t::alpha"], [281, 7, 1, "_CPPv4N2jd3ssd17matmul_fp8_data_t5astepE", "jd::ssd::matmul_fp8_data_t::astep"], [281, 7, 1, "_CPPv4N2jd3ssd17matmul_fp8_data_t4betaE", "jd::ssd::matmul_fp8_data_t::beta"], [281, 7, 1, "_CPPv4N2jd3ssd17matmul_fp8_data_t5bstepE", "jd::ssd::matmul_fp8_data_t::bstep"], [281, 7, 1, "_CPPv4N2jd3ssd17matmul_fp8_data_t5cstepE", "jd::ssd::matmul_fp8_data_t::cstep"], [281, 7, 1, "_CPPv4N2jd3ssd17matmul_fp8_data_t5dstepE", "jd::ssd::matmul_fp8_data_t::dstep"], [281, 7, 1, "_CPPv4N2jd3ssd17matmul_fp8_data_t1kE", "jd::ssd::matmul_fp8_data_t::k"], [281, 7, 1, "_CPPv4N2jd3ssd17matmul_fp8_data_t4kposE", "jd::ssd::matmul_fp8_data_t::kpos"], [281, 7, 1, "_CPPv4N2jd3ssd17matmul_fp8_data_t4matAE", "jd::ssd::matmul_fp8_data_t::matA"], [281, 7, 1, "_CPPv4N2jd3ssd17matmul_fp8_data_t4matBE", "jd::ssd::matmul_fp8_data_t::matB"], [281, 7, 1, "_CPPv4N2jd3ssd17matmul_fp8_data_t4matCE", "jd::ssd::matmul_fp8_data_t::matC"], [281, 7, 1, "_CPPv4N2jd3ssd17matmul_fp8_data_t4matDE", "jd::ssd::matmul_fp8_data_t::matD"], [281, 7, 1, "_CPPv4N2jd3ssd17matmul_fp8_data_t4matEE", "jd::ssd::matmul_fp8_data_t::matE"], [281, 7, 1, "_CPPv4N2jd3ssd17matmul_fp8_data_t1nE", "jd::ssd::matmul_fp8_data_t::n"], [281, 7, 1, "_CPPv4N2jd3ssd17matmul_fp8_data_t5scaleE", "jd::ssd::matmul_fp8_data_t::scale"], [281, 3, 1, "_CPPv4N2jd3ssd18matmul_fp8_param_tE", "jd::ssd::matmul_fp8_param_t"], [281, 7, 1, "_CPPv4N2jd3ssd18matmul_fp8_param_tUt1_5E", "jd::ssd::matmul_fp8_param_t::[anonymous]"], [281, 7, 1, "_CPPv4N2jd3ssd18matmul_fp8_param_t1KE", "jd::ssd::matmul_fp8_param_t::K"], [281, 7, 1, "_CPPv4N2jd3ssd18matmul_fp8_param_t1ME", "jd::ssd::matmul_fp8_param_t::M"], [281, 7, 1, "_CPPv4N2jd3ssd18matmul_fp8_param_t1NE", "jd::ssd::matmul_fp8_param_t::N"], [281, 7, 1, "_CPPv4N2jd3ssd18matmul_fp8_param_t5alphaE", "jd::ssd::matmul_fp8_param_t::alpha"], [281, 7, 1, "_CPPv4N2jd3ssd18matmul_fp8_param_t4betaE", "jd::ssd::matmul_fp8_param_t::beta"], [281, 7, 1, "_CPPv4N2jd3ssd18matmul_fp8_param_t14has_append_sumE", "jd::ssd::matmul_fp8_param_t::has_append_sum"], [281, 7, 1, "_CPPv4N2jd3ssd18matmul_fp8_param_t10has_scale0E", "jd::ssd::matmul_fp8_param_t::has_scale0"], [281, 7, 1, "_CPPv4N2jd3ssd18matmul_fp8_param_t12postop_attrsE", "jd::ssd::matmul_fp8_param_t::postop_attrs"], [281, 7, 1, "_CPPv4N2jd3ssd18matmul_fp8_param_t10thread_numE", "jd::ssd::matmul_fp8_param_t::thread_num"], [281, 7, 1, "_CPPv4N2jd3ssd18matmul_fp8_param_t11weight_8bitE", "jd::ssd::matmul_fp8_param_t::weight_8bit"], [281, 7, 1, "_CPPv4N2jd3ssd18matmul_fp8_param_t11weight_bf16E", "jd::ssd::matmul_fp8_param_t::weight_bf16"], [281, 7, 1, "_CPPv4N2jd3ssd18matmul_fp8_param_t14weight_f8_e4m3E", "jd::ssd::matmul_fp8_param_t::weight_f8_e4m3"], [281, 7, 1, "_CPPv4N2jd3ssd18matmul_fp8_param_t14weight_f8_e5m2E", "jd::ssd::matmul_fp8_param_t::weight_f8_e5m2"], [281, 7, 1, "_CPPv4N2jd3ssd18matmul_fp8_param_t11weight_int8E", "jd::ssd::matmul_fp8_param_t::weight_int8"], [281, 7, 1, "_CPPv4N2jd3ssd18matmul_fp8_param_t11weight_typeE", "jd::ssd::matmul_fp8_param_t::weight_type"], [281, 1, 1, "_CPPv4N2jd3ssd12matmul_inputE", "jd::ssd::matmul_input"], [281, 2, 1, "_CPPv4N2jd3ssd12matmul_input5input10APPEND_SUME", "jd::ssd::matmul_input::APPEND_SUM"], [281, 2, 1, "_CPPv4N2jd3ssd12matmul_input5input6SCALE0E", "jd::ssd::matmul_input::SCALE0"], [281, 2, 1, "_CPPv4N2jd3ssd12matmul_input5input4SRC0E", "jd::ssd::matmul_input::SRC0"], [281, 2, 1, "_CPPv4N2jd3ssd12matmul_input5input4SRC1E", "jd::ssd::matmul_input::SRC1"], [281, 2, 1, "_CPPv4N2jd3ssd12matmul_input5input4SRC2E", "jd::ssd::matmul_input::SRC2"], [281, 2, 1, "_CPPv4N2jd3ssd12matmul_input5input3ZP0E", "jd::ssd::matmul_input::ZP0"], [281, 6, 1, "_CPPv4N2jd3ssd12matmul_input5inputE", "jd::ssd::matmul_input::input"], [281, 2, 1, "_CPPv4N2jd3ssd12matmul_input5input10APPEND_SUME", "jd::ssd::matmul_input::input::APPEND_SUM"], [281, 2, 1, "_CPPv4N2jd3ssd12matmul_input5input6SCALE0E", "jd::ssd::matmul_input::input::SCALE0"], [281, 2, 1, "_CPPv4N2jd3ssd12matmul_input5input4SRC0E", "jd::ssd::matmul_input::input::SRC0"], [281, 2, 1, "_CPPv4N2jd3ssd12matmul_input5input4SRC1E", "jd::ssd::matmul_input::input::SRC1"], [281, 2, 1, "_CPPv4N2jd3ssd12matmul_input5input4SRC2E", "jd::ssd::matmul_input::input::SRC2"], [281, 2, 1, "_CPPv4N2jd3ssd12matmul_input5input3ZP0E", "jd::ssd::matmul_input::input::ZP0"], [281, 2, 1, "_CPPv4N2jd3ssd12matmul_input5input13matmul_io_MAXE", "jd::ssd::matmul_input::input::matmul_io_MAX"], [281, 2, 1, "_CPPv4N2jd3ssd12matmul_input5input13matmul_io_MAXE", "jd::ssd::matmul_input::matmul_io_MAX"], [281, 1, 1, "_CPPv4N2jd3ssd9matmul_ioE", "jd::ssd::matmul_io"], [281, 2, 1, "_CPPv4N2jd3ssd9matmul_io2io10APPEND_SUME", "jd::ssd::matmul_io::APPEND_SUM"], [281, 2, 1, "_CPPv4N2jd3ssd9matmul_io2io4DST0E", "jd::ssd::matmul_io::DST0"], [281, 2, 1, "_CPPv4N2jd3ssd9matmul_io2io6SCALE0E", "jd::ssd::matmul_io::SCALE0"], [281, 2, 1, "_CPPv4N2jd3ssd9matmul_io2io4SRC0E", "jd::ssd::matmul_io::SRC0"], [281, 2, 1, "_CPPv4N2jd3ssd9matmul_io2io4SRC1E", "jd::ssd::matmul_io::SRC1"], [281, 2, 1, "_CPPv4N2jd3ssd9matmul_io2io4SRC2E", "jd::ssd::matmul_io::SRC2"], [281, 2, 1, "_CPPv4N2jd3ssd9matmul_io2io3ZP0E", "jd::ssd::matmul_io::ZP0"], [281, 6, 1, "_CPPv4N2jd3ssd9matmul_io2ioE", "jd::ssd::matmul_io::io"], [281, 2, 1, "_CPPv4N2jd3ssd9matmul_io2io10APPEND_SUME", "jd::ssd::matmul_io::io::APPEND_SUM"], [281, 2, 1, "_CPPv4N2jd3ssd9matmul_io2io4DST0E", "jd::ssd::matmul_io::io::DST0"], [281, 2, 1, "_CPPv4N2jd3ssd9matmul_io2io6SCALE0E", "jd::ssd::matmul_io::io::SCALE0"], [281, 2, 1, "_CPPv4N2jd3ssd9matmul_io2io4SRC0E", "jd::ssd::matmul_io::io::SRC0"], [281, 2, 1, "_CPPv4N2jd3ssd9matmul_io2io4SRC1E", "jd::ssd::matmul_io::io::SRC1"], [281, 2, 1, "_CPPv4N2jd3ssd9matmul_io2io4SRC2E", "jd::ssd::matmul_io::io::SRC2"], [281, 2, 1, "_CPPv4N2jd3ssd9matmul_io2io3ZP0E", "jd::ssd::matmul_io::io::ZP0"], [281, 2, 1, "_CPPv4N2jd3ssd9matmul_io2io13matmul_io_MAXE", "jd::ssd::matmul_io::io::matmul_io_MAX"], [281, 2, 1, "_CPPv4N2jd3ssd9matmul_io2io13matmul_io_MAXE", "jd::ssd::matmul_io::matmul_io_MAX"], [281, 1, 1, "_CPPv4N2jd3ssd13matmul_outputE", "jd::ssd::matmul_output"], [281, 2, 1, "_CPPv4N2jd3ssd13matmul_output6output4DST0E", "jd::ssd::matmul_output::DST0"], [281, 6, 1, "_CPPv4N2jd3ssd13matmul_output6outputE", "jd::ssd::matmul_output::output"], [281, 2, 1, "_CPPv4N2jd3ssd13matmul_output6output4DST0E", "jd::ssd::matmul_output::output::DST0"], [281, 3, 1, "_CPPv4N2jd3ssd14matmul_param_tE", "jd::ssd::matmul_param_t"], [281, 7, 1, "_CPPv4N2jd3ssd14matmul_param_t1KE", "jd::ssd::matmul_param_t::K"], [281, 7, 1, "_CPPv4N2jd3ssd14matmul_param_t1ME", "jd::ssd::matmul_param_t::M"], [281, 7, 1, "_CPPv4N2jd3ssd14matmul_param_t1NE", "jd::ssd::matmul_param_t::N"], [281, 7, 1, "_CPPv4N2jd3ssd14matmul_param_t5alphaE", "jd::ssd::matmul_param_t::alpha"], [281, 7, 1, "_CPPv4N2jd3ssd14matmul_param_t5batchE", "jd::ssd::matmul_param_t::batch"], [281, 7, 1, "_CPPv4N2jd3ssd14matmul_param_t4betaE", "jd::ssd::matmul_param_t::beta"], [281, 7, 1, "_CPPv4N2jd3ssd14matmul_param_t6m_tileE", "jd::ssd::matmul_param_t::m_tile"], [281, 7, 1, "_CPPv4N2jd3ssd14matmul_param_t6n_tileE", "jd::ssd::matmul_param_t::n_tile"], [281, 3, 1, "_CPPv4N2jd3ssd16matmul_u8_data_tE", "jd::ssd::matmul_u8_data_t"], [281, 7, 1, "_CPPv4N2jd3ssd16matmul_u8_data_t3dstE", "jd::ssd::matmul_u8_data_t::dst"], [281, 7, 1, "_CPPv4N2jd3ssd16matmul_u8_data_t5scaleE", "jd::ssd::matmul_u8_data_t::scale"], [281, 7, 1, "_CPPv4N2jd3ssd16matmul_u8_data_t4src0E", "jd::ssd::matmul_u8_data_t::src0"], [281, 7, 1, "_CPPv4N2jd3ssd16matmul_u8_data_t4src1E", "jd::ssd::matmul_u8_data_t::src1"], [281, 7, 1, "_CPPv4N2jd3ssd16matmul_u8_data_t2zpE", "jd::ssd::matmul_u8_data_t::zp"], [281, 3, 1, "_CPPv4N2jd3ssd22mean_var_reduce_data_tE", "jd::ssd::mean_var_reduce_data_t"], [281, 7, 1, "_CPPv4N2jd3ssd22mean_var_reduce_data_t7mean_inE", "jd::ssd::mean_var_reduce_data_t::mean_in"], [281, 7, 1, "_CPPv4N2jd3ssd22mean_var_reduce_data_t8mean_outE", "jd::ssd::mean_var_reduce_data_t::mean_out"], [281, 7, 1, "_CPPv4N2jd3ssd22mean_var_reduce_data_t6var_inE", "jd::ssd::mean_var_reduce_data_t::var_in"], [281, 7, 1, "_CPPv4N2jd3ssd22mean_var_reduce_data_t7var_outE", "jd::ssd::mean_var_reduce_data_t::var_out"], [281, 3, 1, "_CPPv4N2jd3ssd23mean_var_reduce_param_tE", "jd::ssd::mean_var_reduce_param_t"], [281, 7, 1, "_CPPv4N2jd3ssd23mean_var_reduce_param_t2BME", "jd::ssd::mean_var_reduce_param_t::BM"], [281, 7, 1, "_CPPv4N2jd3ssd23mean_var_reduce_param_t2BNE", "jd::ssd::mean_var_reduce_param_t::BN"], [281, 7, 1, "_CPPv4N2jd3ssd23mean_var_reduce_param_t1ME", "jd::ssd::mean_var_reduce_param_t::M"], [281, 7, 1, "_CPPv4N2jd3ssd23mean_var_reduce_param_t1NE", "jd::ssd::mean_var_reduce_param_t::N"], [281, 7, 1, "_CPPv4N2jd3ssd23mean_var_reduce_param_t11element_numE", "jd::ssd::mean_var_reduce_param_t::element_num"], [281, 2, 1, "_CPPv4N2jd3ssd20spec_translnorm_type6normalE", "jd::ssd::normal"], [281, 3, 1, "_CPPv4N2jd3ssd20seq_vnni_copy_paramsE", "jd::ssd::seq_vnni_copy_params"], [281, 7, 1, "_CPPv4N2jd3ssd20seq_vnni_copy_params6dstptrE", "jd::ssd::seq_vnni_copy_params::dstptr"], [281, 7, 1, "_CPPv4N2jd3ssd20seq_vnni_copy_params9dststrideE", "jd::ssd::seq_vnni_copy_params::dststride"], [281, 7, 1, "_CPPv4N2jd3ssd20seq_vnni_copy_params1kE", "jd::ssd::seq_vnni_copy_params::k"], [281, 7, 1, "_CPPv4N2jd3ssd20seq_vnni_copy_params6srcptrE", "jd::ssd::seq_vnni_copy_params::srcptr"], [281, 7, 1, "_CPPv4N2jd3ssd20seq_vnni_copy_params9srcstrideE", "jd::ssd::seq_vnni_copy_params::srcstride"], [281, 3, 1, "_CPPv4N2jd3ssd14softmax_data_tE", "jd::ssd::softmax_data_t"], [281, 7, 1, "_CPPv4N2jd3ssd14softmax_data_t3dstE", "jd::ssd::softmax_data_t::dst"], [281, 7, 1, "_CPPv4N2jd3ssd14softmax_data_t3oneE", "jd::ssd::softmax_data_t::one"], [281, 7, 1, "_CPPv4N2jd3ssd14softmax_data_t15process_vec_numE", "jd::ssd::softmax_data_t::process_vec_num"], [281, 7, 1, "_CPPv4N2jd3ssd14softmax_data_t3srcE", "jd::ssd::softmax_data_t::src"], [281, 7, 1, "_CPPv4N2jd3ssd14softmax_data_t3tmpE", "jd::ssd::softmax_data_t::tmp"], [281, 3, 1, "_CPPv4N2jd3ssd15softmax_param_tE", "jd::ssd::softmax_param_t"], [281, 7, 1, "_CPPv4N2jd3ssd15softmax_param_t17get_lut_exp_attrsE", "jd::ssd::softmax_param_t::get_lut_exp_attrs"], [281, 7, 1, "_CPPv4N2jd3ssd15softmax_param_t8input_dtE", "jd::ssd::softmax_param_t::input_dt"], [281, 7, 1, "_CPPv4N2jd3ssd15softmax_param_t9output_dtE", "jd::ssd::softmax_param_t::output_dt"], [281, 7, 1, "_CPPv4N2jd3ssd15softmax_param_t12postop_attrsE", "jd::ssd::softmax_param_t::postop_attrs"], [281, 7, 1, "_CPPv4N2jd3ssd15softmax_param_t10scalar_numE", "jd::ssd::softmax_param_t::scalar_num"], [281, 7, 1, "_CPPv4N2jd3ssd15softmax_param_t9sepc_typeE", "jd::ssd::softmax_param_t::sepc_type"], [281, 7, 1, "_CPPv4N2jd3ssd15softmax_param_t13vec_align_lenE", "jd::ssd::softmax_param_t::vec_align_len"], [281, 7, 1, "_CPPv4N2jd3ssd15softmax_param_t15vec_num_per_thrE", "jd::ssd::softmax_param_t::vec_num_per_thr"], [281, 7, 1, "_CPPv4N2jd3ssd15softmax_param_t16vec_num_tail_thrE", "jd::ssd::softmax_param_t::vec_num_tail_thr"], [281, 7, 1, "_CPPv4N2jd3ssd15softmax_param_t12vec_tail_lenE", "jd::ssd::softmax_param_t::vec_tail_len"], [281, 6, 1, "_CPPv4N2jd3ssd13sparse_schemeE", "jd::ssd::sparse_scheme"], [281, 2, 1, "_CPPv4N2jd3ssd13sparse_scheme14dense_x_sparseE", "jd::ssd::sparse_scheme::dense_x_sparse"], [281, 2, 1, "_CPPv4N2jd3ssd13sparse_scheme14sparse_x_denseE", "jd::ssd::sparse_scheme::sparse_x_dense"], [281, 2, 1, "_CPPv4N2jd3ssd13sparse_scheme15sparse_x_sparseE", "jd::ssd::sparse_scheme::sparse_x_sparse"], [281, 2, 1, "_CPPv4N2jd3ssd13sparse_scheme5undefE", "jd::ssd::sparse_scheme::undef"], [281, 6, 1, "_CPPv4N2jd3ssd17spec_softmax_typeE", "jd::ssd::spec_softmax_type"], [281, 2, 1, "_CPPv4N2jd3ssd17spec_softmax_type3lutE", "jd::ssd::spec_softmax_type::lut"], [281, 6, 1, "_CPPv4N2jd3ssd20spec_translnorm_typeE", "jd::ssd::spec_translnorm_type"], [281, 2, 1, "_CPPv4N2jd3ssd20spec_translnorm_type6directE", "jd::ssd::spec_translnorm_type::direct"], [281, 2, 1, "_CPPv4N2jd3ssd20spec_translnorm_type6normalE", "jd::ssd::spec_translnorm_type::normal"], [281, 6, 1, "_CPPv4N2jd3ssd13subfunc_levelE", "jd::ssd::subfunc_level"], [281, 2, 1, "_CPPv4N2jd3ssd13subfunc_level5kdimsE", "jd::ssd::subfunc_level::kdims"], [281, 2, 1, "_CPPv4N2jd3ssd13subfunc_level9non_kdimsE", "jd::ssd::subfunc_level::non_kdims"], [281, 2, 1, "_CPPv4N2jd3ssd13subfunc_level4noneE", "jd::ssd::subfunc_level::none"], [281, 2, 1, "_CPPv4N2jd3ssd13subfunc_level17subfunc_level_MAXE", "jd::ssd::subfunc_level::subfunc_level_MAX"], [281, 3, 1, "_CPPv4N2jd3ssd21transpose_copy_paramsE", "jd::ssd::transpose_copy_params"], [281, 7, 1, "_CPPv4N2jd3ssd21transpose_copy_params6dstptrE", "jd::ssd::transpose_copy_params::dstptr"], [281, 7, 1, "_CPPv4N2jd3ssd21transpose_copy_params9dststrideE", "jd::ssd::transpose_copy_params::dststride"], [281, 7, 1, "_CPPv4N2jd3ssd21transpose_copy_params1kE", "jd::ssd::transpose_copy_params::k"], [281, 7, 1, "_CPPv4N2jd3ssd21transpose_copy_params6srcptrE", "jd::ssd::transpose_copy_params::srcptr"], [281, 7, 1, "_CPPv4N2jd3ssd21transpose_copy_params9srcstrideE", "jd::ssd::transpose_copy_params::srcstride"], [281, 1, 1, "_CPPv4N2jd3ssd16transpose_mha_ioE", "jd::ssd::transpose_mha_io"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io5BATCHE", "jd::ssd::transpose_mha_io::BATCH"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io3DSTE", "jd::ssd::transpose_mha_io::DST"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io8HEAD_NUME", "jd::ssd::transpose_mha_io::HEAD_NUM"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io9HEAD_SIZEE", "jd::ssd::transpose_mha_io::HEAD_SIZE"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io4MASKE", "jd::ssd::transpose_mha_io::MASK"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io9SCALE_DSTE", "jd::ssd::transpose_mha_io::SCALE_DST"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io7SCALE_KE", "jd::ssd::transpose_mha_io::SCALE_K"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io7SCALE_QE", "jd::ssd::transpose_mha_io::SCALE_Q"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io7SCALE_VE", "jd::ssd::transpose_mha_io::SCALE_V"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io7SEQ_LENE", "jd::ssd::transpose_mha_io::SEQ_LEN"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io6SL_PADE", "jd::ssd::transpose_mha_io::SL_PAD"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io5SRC_KE", "jd::ssd::transpose_mha_io::SRC_K"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io5SRC_QE", "jd::ssd::transpose_mha_io::SRC_Q"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io5SRC_VE", "jd::ssd::transpose_mha_io::SRC_V"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io5TMP2ME", "jd::ssd::transpose_mha_io::TMP2M"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io6ZP_DSTE", "jd::ssd::transpose_mha_io::ZP_DST"], [281, 6, 1, "_CPPv4N2jd3ssd16transpose_mha_io2ioE", "jd::ssd::transpose_mha_io::io"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io5BATCHE", "jd::ssd::transpose_mha_io::io::BATCH"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io3DSTE", "jd::ssd::transpose_mha_io::io::DST"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io8HEAD_NUME", "jd::ssd::transpose_mha_io::io::HEAD_NUM"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io9HEAD_SIZEE", "jd::ssd::transpose_mha_io::io::HEAD_SIZE"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io4MASKE", "jd::ssd::transpose_mha_io::io::MASK"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io9SCALE_DSTE", "jd::ssd::transpose_mha_io::io::SCALE_DST"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io7SCALE_KE", "jd::ssd::transpose_mha_io::io::SCALE_K"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io7SCALE_QE", "jd::ssd::transpose_mha_io::io::SCALE_Q"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io7SCALE_VE", "jd::ssd::transpose_mha_io::io::SCALE_V"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io7SEQ_LENE", "jd::ssd::transpose_mha_io::io::SEQ_LEN"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io6SL_PADE", "jd::ssd::transpose_mha_io::io::SL_PAD"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io5SRC_KE", "jd::ssd::transpose_mha_io::io::SRC_K"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io5SRC_QE", "jd::ssd::transpose_mha_io::io::SRC_Q"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io5SRC_VE", "jd::ssd::transpose_mha_io::io::SRC_V"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io5TMP2ME", "jd::ssd::transpose_mha_io::io::TMP2M"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io6ZP_DSTE", "jd::ssd::transpose_mha_io::io::ZP_DST"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io20transpose_mha_io_MAXE", "jd::ssd::transpose_mha_io::io::transpose_mha_io_MAX"], [281, 2, 1, "_CPPv4N2jd3ssd16transpose_mha_io2io20transpose_mha_io_MAXE", "jd::ssd::transpose_mha_io::transpose_mha_io_MAX"], [281, 3, 1, "_CPPv4N2jd3ssd26transpose_mha_step1_paramsE", "jd::ssd::transpose_mha_step1_params"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step1_params5astepE", "jd::ssd::transpose_mha_step1_params::astep"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step1_params6batchkE", "jd::ssd::transpose_mha_step1_params::batchk"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step1_params10cbatchstepE", "jd::ssd::transpose_mha_step1_params::cbatchstep"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step1_params3cfgE", "jd::ssd::transpose_mha_step1_params::cfg"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step1_params5cstepE", "jd::ssd::transpose_mha_step1_params::cstep"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step1_params6expsumE", "jd::ssd::transpose_mha_step1_params::expsum"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step1_params1kE", "jd::ssd::transpose_mha_step1_params::k"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step1_params1mE", "jd::ssd::transpose_mha_step1_params::m"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step1_params4matAE", "jd::ssd::transpose_mha_step1_params::matA"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step1_params4matBE", "jd::ssd::transpose_mha_step1_params::matB"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step1_params4matCE", "jd::ssd::transpose_mha_step1_params::matC"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step1_params4matDE", "jd::ssd::transpose_mha_step1_params::matD"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step1_params7scaleABE", "jd::ssd::transpose_mha_step1_params::scaleAB"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step1_params7sumstepE", "jd::ssd::transpose_mha_step1_params::sumstep"], [281, 3, 1, "_CPPv4N2jd3ssd26transpose_mha_step2_paramsE", "jd::ssd::transpose_mha_step2_params"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step2_params6dstptrE", "jd::ssd::transpose_mha_step2_params::dstptr"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step2_params9dststrideE", "jd::ssd::transpose_mha_step2_params::dststride"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step2_params1kE", "jd::ssd::transpose_mha_step2_params::k"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step2_params6srcptrE", "jd::ssd::transpose_mha_step2_params::srcptr"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step2_params9srcstrideE", "jd::ssd::transpose_mha_step2_params::srcstride"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step2_params6sumptrE", "jd::ssd::transpose_mha_step2_params::sumptr"], [281, 3, 1, "_CPPv4N2jd3ssd26transpose_mha_step3_paramsE", "jd::ssd::transpose_mha_step3_params"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step3_params5astepE", "jd::ssd::transpose_mha_step3_params::astep"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step3_params3cfgE", "jd::ssd::transpose_mha_step3_params::cfg"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step3_params5cstepE", "jd::ssd::transpose_mha_step3_params::cstep"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step3_params1kE", "jd::ssd::transpose_mha_step3_params::k"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step3_params4matAE", "jd::ssd::transpose_mha_step3_params::matA"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step3_params4matBE", "jd::ssd::transpose_mha_step3_params::matB"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step3_params4matCE", "jd::ssd::transpose_mha_step3_params::matC"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step3_params7scaleABE", "jd::ssd::transpose_mha_step3_params::scaleAB"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step3_params6scaleCE", "jd::ssd::transpose_mha_step3_params::scaleC"], [281, 7, 1, "_CPPv4N2jd3ssd26transpose_mha_step3_params10zeropointCE", "jd::ssd::transpose_mha_step3_params::zeropointC"], [281, 3, 1, "_CPPv4I0EN2jd3ssd11vnni_data_tE", "jd::ssd::vnni_data_t"], [281, 8, 1, "_CPPv4I0EN2jd3ssd11vnni_data_tE", "jd::ssd::vnni_data_t::dst_t"], [281, 7, 1, "_CPPv4N2jd3ssd11vnni_data_t8ptr_biasE", "jd::ssd::vnni_data_t::ptr_bias"], [281, 7, 1, "_CPPv4N2jd3ssd11vnni_data_t9ptr_denseE", "jd::ssd::vnni_data_t::ptr_dense"], [281, 7, 1, "_CPPv4N2jd3ssd11vnni_data_t7ptr_dstE", "jd::ssd::vnni_data_t::ptr_dst"], [281, 7, 1, "_CPPv4N2jd3ssd11vnni_data_t10ptr_dst_m1E", "jd::ssd::vnni_data_t::ptr_dst_m1"], [281, 7, 1, "_CPPv4N2jd3ssd11vnni_data_t10ptr_dst_m2E", "jd::ssd::vnni_data_t::ptr_dst_m2"], [281, 7, 1, "_CPPv4N2jd3ssd11vnni_data_t10ptr_scalesE", "jd::ssd::vnni_data_t::ptr_scales"], [281, 3, 1, "_CPPv4N2jd3ssd12vnni_param_tE", "jd::ssd::vnni_param_t"], [281, 7, 1, "_CPPv4N2jd3ssd12vnni_param_t2BME", "jd::ssd::vnni_param_t::BM"], [281, 7, 1, "_CPPv4N2jd3ssd12vnni_param_t2BNE", "jd::ssd::vnni_param_t::BN"], [281, 7, 1, "_CPPv4N2jd3ssd12vnni_param_t10append_sumE", "jd::ssd::vnni_param_t::append_sum"], [281, 7, 1, "_CPPv4N2jd3ssd12vnni_param_t9blocksizeE", "jd::ssd::vnni_param_t::blocksize"], [281, 7, 1, "_CPPv4N2jd3ssd12vnni_param_t8has_biasE", "jd::ssd::vnni_param_t::has_bias"], [281, 7, 1, "_CPPv4N2jd3ssd12vnni_param_t8im_startE", "jd::ssd::vnni_param_t::im_start"], [281, 7, 1, "_CPPv4N2jd3ssd12vnni_param_t7indicesE", "jd::ssd::vnni_param_t::indices"], [281, 7, 1, "_CPPv4N2jd3ssd12vnni_param_t6indptrE", "jd::ssd::vnni_param_t::indptr"], [281, 7, 1, "_CPPv4N2jd3ssd12vnni_param_t11output_typeE", "jd::ssd::vnni_param_t::output_type"], [281, 7, 1, "_CPPv4N2jd3ssd12vnni_param_t12postop_attrsE", "jd::ssd::vnni_param_t::postop_attrs"], [281, 7, 1, "_CPPv4N2jd3ssd12vnni_param_t8sub_funcE", "jd::ssd::vnni_param_t::sub_func"], [281, 7, 1, "_CPPv4N2jd3ssd12vnni_param_t6tile_wE", "jd::ssd::vnni_param_t::tile_w"], [281, 7, 1, "_CPPv4N2jd3ssd12vnni_param_t6weightE", "jd::ssd::vnni_param_t::weight"], [281, 7, 1, "_CPPv4N2jd3ssd12vnni_param_t7welfordE", "jd::ssd::vnni_param_t::welford"], [279, 3, 1, "_CPPv4N2jd16transpose_matmulE", "jd::transpose_matmul"], [279, 4, 1, "_CPPv4N2jd16transpose_matmul16transpose_matmulERK17kernel_desc_proxy", "jd::transpose_matmul::transpose_matmul"], [279, 4, 1, "_CPPv4N2jd16transpose_matmul16transpose_matmulEv", "jd::transpose_matmul::transpose_matmul"], [279, 5, 1, "_CPPv4N2jd16transpose_matmul16transpose_matmulERK17kernel_desc_proxy", "jd::transpose_matmul::transpose_matmul::kdp"], [279, 4, 1, "_CPPv4N2jd16transpose_matmulD0Ev", "jd::transpose_matmul::~transpose_matmul"], [279, 3, 1, "_CPPv4N2jd21transpose_matmul_descE", "jd::transpose_matmul_desc"], [279, 4, 1, "_CPPv4N2jd21transpose_matmul_desc21transpose_matmul_descERK13operator_desc", "jd::transpose_matmul_desc::transpose_matmul_desc"], [279, 4, 1, "_CPPv4N2jd21transpose_matmul_desc21transpose_matmul_descEv", "jd::transpose_matmul_desc::transpose_matmul_desc"], [279, 5, 1, "_CPPv4N2jd21transpose_matmul_desc21transpose_matmul_descERK13operator_desc", "jd::transpose_matmul_desc::transpose_matmul_desc::op_desc"], [279, 4, 1, "_CPPv4N2jd21transpose_matmul_descD0Ev", "jd::transpose_matmul_desc::~transpose_matmul_desc"], [279, 3, 1, "_CPPv4N2jd13transpose_mhaE", "jd::transpose_mha"], [279, 4, 1, "_CPPv4N2jd13transpose_mha13transpose_mhaERK17kernel_desc_proxy", "jd::transpose_mha::transpose_mha"], [279, 4, 1, "_CPPv4N2jd13transpose_mha13transpose_mhaEv", "jd::transpose_mha::transpose_mha"], [279, 5, 1, "_CPPv4N2jd13transpose_mha13transpose_mhaERK17kernel_desc_proxy", "jd::transpose_mha::transpose_mha::kdp"], [279, 4, 1, "_CPPv4N2jd13transpose_mhaD0Ev", "jd::transpose_mha::~transpose_mha"], [279, 3, 1, "_CPPv4N2jd18transpose_mha_descE", "jd::transpose_mha_desc"], [279, 4, 1, "_CPPv4N2jd18transpose_mha_desc18transpose_mha_descERK13operator_desc", "jd::transpose_mha_desc::transpose_mha_desc"], [279, 4, 1, "_CPPv4N2jd18transpose_mha_desc18transpose_mha_descEv", "jd::transpose_mha_desc::transpose_mha_desc"], [279, 5, 1, "_CPPv4N2jd18transpose_mha_desc18transpose_mha_descERK13operator_desc", "jd::transpose_mha_desc::transpose_mha_desc::op_desc"], [279, 4, 1, "_CPPv4N2jd18transpose_mha_descD0Ev", "jd::transpose_mha_desc::~transpose_mha_desc"], [0, 9, 0, "-", "conversation"], [1, 9, 0, "-", "gaudi_spawn"], [253, 9, 0, "-", "main_eval_only"], [254, 9, 0, "-", "main_parse_and_eval"], [262, 9, 0, "-", "text"]], "conversation": [[0, 10, 1, "", "Conversation"], [0, 10, 1, "", "SeparatorStyle"], [0, 12, 1, "", "get_conv_template"], [0, 12, 1, "", "register_conv_template"]], "conversation.Conversation": [[0, 11, 1, "", "append_message"], [0, 11, 1, "", "convert_image_to_base64"], [0, 11, 1, "", "get_prompt"], [0, 11, 1, "", "set_system_message"], [0, 11, 1, "", "to_gradio_chatbot"], [0, 11, 1, "", "to_openai_api_messages"], [0, 11, 1, "", "update_last_message"]], "gaudi_spawn": [[1, 12, 1, "", "parse_args"]], "intel_extension_for_transformers.langchain.langchain_community.retrievers": [[2, 9, 0, "-", "child_parent_retriever"]], "intel_extension_for_transformers.langchain.langchain_community.retrievers.child_parent_retriever": [[2, 10, 1, "", "ChildParentRetriever"], [2, 10, 1, "", "SearchType"]], "intel_extension_for_transformers.langchain.langchain_community.retrievers.child_parent_retriever.ChildParentRetriever": [[2, 13, 1, "", "search_kwargs"], [2, 13, 1, "", "search_type"]], "intel_extension_for_transformers.langchain.langchain_community.retrievers.child_parent_retriever.SearchType": [[2, 13, 1, "", "mmr"], [2, 13, 1, "", "similarity"]], "intel_extension_for_transformers.langchain.langchain_community.vectorstores": [[3, 9, 0, "-", "chroma"]], "intel_extension_for_transformers.neural_chat": [[4, 9, 0, "-", "chatbot"], [5, 9, 0, "-", "config"], [6, 9, 0, "-", "config_logging"], [7, 9, 0, "-", "errorcode"], [8, 9, 0, "-", "pipeline"]], "intel_extension_for_transformers.neural_chat.chatbot": [[4, 12, 1, "", "build_chatbot"], [4, 12, 1, "", "finetune_model"], [4, 12, 1, "", "optimize_model"]], "intel_extension_for_transformers.neural_chat.config": [[5, 10, 1, "", "AudioLanguageOptions"], [5, 10, 1, "", "BackendOptions"], [5, 10, 1, "", "DataArguments"], [5, 10, 1, "", "DeviceOptions"], [5, 10, 1, "", "FinetuningArguments"], [5, 10, 1, "", "ModelArguments"], [5, 10, 1, "", "RetrievalTypeOptions"]], "intel_extension_for_transformers.neural_chat.config_logging": [[6, 12, 1, "", "configure_logging"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.image2image": [[9, 9, 0, "-", "instructpix2pix_pipeline"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.image2image.instructpix2pix_pipeline": [[9, 10, 1, "", "StableDiffusionInstructPix2PixPipeline"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.image2image.instructpix2pix_pipeline.StableDiffusionInstructPix2PixPipeline": [[9, 11, 1, "", "enable_sequential_cpu_offload"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.memory": [[10, 9, 0, "-", "memory"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval": [[14, 9, 0, "-", "retriever_adapter"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.detector": [[11, 9, 0, "-", "intent_detection"], [12, 9, 0, "-", "query_explainer"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.parser": [[13, 9, 0, "-", "parser"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.retriever_adapter": [[14, 10, 1, "", "RetrieverAdapter"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.security": [[15, 9, 0, "-", "safety_checker"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.security.safety_checker": [[15, 12, 1, "", "convert_fullwidth_to_halfwidth"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d": [[18, 9, 0, "-", "util"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models": [[16, 9, 0, "-", "bfm"], [17, 9, 0, "-", "networks"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks": [[17, 12, 1, "", "resnet101"], [17, 12, 1, "", "resnet152"], [17, 12, 1, "", "resnet18"], [17, 12, 1, "", "resnet34"], [17, 12, 1, "", "resnet50"], [17, 12, 1, "", "resnext101_32x8d"], [17, 12, 1, "", "resnext50_32x4d"], [17, 12, 1, "", "wide_resnet101_2"], [17, 12, 1, "", "wide_resnet50_2"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util": [[19, 9, 0, "-", "load_mats"], [20, 9, 0, "-", "preprocess"], [21, 9, 0, "-", "util"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.preprocess": [[20, 12, 1, "", "align_img"]], "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.util": [[21, 12, 1, "", "draw_landmarks"], [21, 12, 1, "", "mkdir"], [21, 12, 1, "", "mkdirs"]], "intel_extension_for_transformers.neural_chat.server.restful": [[22, 9, 0, "-", "openai_protocol"]], "intel_extension_for_transformers.neural_chat.server.restful.openai_protocol": [[22, 10, 1, "", "ApiErrorCode"]], "intel_extension_for_transformers.neural_chat.tools.rome": [[23, 9, 0, "-", "repr_tools"]], "intel_extension_for_transformers.neural_chat.tools.rome.repr_tools": [[23, 12, 1, "", "get_reprs_at_idxs"], [23, 12, 1, "", "get_reprs_at_word_tokens"], [23, 12, 1, "", "get_words_idxs_in_templates"]], "intel_extension_for_transformers.neural_chat.tools.rome.utils": [[24, 9, 0, "-", "nethook"], [25, 9, 0, "-", "runningstats"]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook": [[24, 14, 1, "", "StopForward"], [24, 10, 1, "", "Trace"], [24, 10, 1, "", "TraceDict"], [24, 12, 1, "", "get_module"], [24, 12, 1, "", "get_parameter"], [24, 12, 1, "", "hierarchical_subsequence"], [24, 12, 1, "", "invoke_with_optional_args"], [24, 12, 1, "", "recursive_copy"], [24, 12, 1, "", "replace_module"], [24, 12, 1, "", "set_requires_grad"], [24, 12, 1, "", "subsequence"]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats": [[25, 10, 1, "", "Bincount"], [25, 10, 1, "", "CombinedStat"], [25, 10, 1, "", "Covariance"], [25, 10, 1, "", "CrossCovariance"], [25, 10, 1, "", "CrossIoU"], [25, 10, 1, "", "FixedRandomSubsetSampler"], [25, 10, 1, "", "FixedSubsetSampler"], [25, 10, 1, "", "History"], [25, 10, 1, "", "IoU"], [25, 10, 1, "", "Mean"], [25, 10, 1, "", "NormMean"], [25, 10, 1, "", "Quantile"], [25, 10, 1, "", "SecondMoment"], [25, 10, 1, "", "Stat"], [25, 10, 1, "", "TopK"], [25, 10, 1, "", "Variance"], [25, 12, 1, "", "box_numpy_null"], [25, 10, 1, "", "cache_load_enabled"], [25, 12, 1, "", "is_null_numpy_value"], [25, 12, 1, "", "load_cached_state"], [25, 12, 1, "", "make_loader"], [25, 12, 1, "", "pull_key_prefix"], [25, 12, 1, "", "push_key_prefix"], [25, 12, 1, "", "resolve_state_dict"], [25, 12, 1, "", "sample_portion"], [25, 12, 1, "", "save_cached_state"], [25, 12, 1, "", "tally"], [25, 12, 1, "", "unbox_numpy_null"]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Bincount": [[25, 11, 1, "", "add"], [25, 11, 1, "", "load_state_dict"], [25, 11, 1, "", "state_dict"], [25, 11, 1, "", "to_"]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.CombinedStat": [[25, 11, 1, "", "add"], [25, 11, 1, "", "load_state_dict"], [25, 11, 1, "", "state_dict"], [25, 11, 1, "", "to_"]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Covariance": [[25, 11, 1, "", "add"], [25, 11, 1, "", "load_state_dict"], [25, 11, 1, "", "state_dict"], [25, 11, 1, "", "to_"]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.CrossCovariance": [[25, 11, 1, "", "add"], [25, 11, 1, "", "load_state_dict"], [25, 11, 1, "", "state_dict"], [25, 11, 1, "", "to_"]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.CrossIoU": [[25, 11, 1, "", "add"], [25, 11, 1, "", "load_state_dict"], [25, 11, 1, "", "state_dict"], [25, 11, 1, "", "to_"]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.FixedRandomSubsetSampler": [[25, 11, 1, "", "class_subset"]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.FixedSubsetSampler": [[25, 11, 1, "", "dereference"]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.History": [[25, 11, 1, "", "add"], [25, 11, 1, "", "load_state_dict"], [25, 11, 1, "", "state_dict"], [25, 11, 1, "", "to_"]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.IoU": [[25, 11, 1, "", "add"], [25, 11, 1, "", "load_state_dict"], [25, 11, 1, "", "state_dict"], [25, 11, 1, "", "to_"]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Mean": [[25, 11, 1, "", "add"], [25, 11, 1, "", "load_state_dict"], [25, 11, 1, "", "state_dict"], [25, 11, 1, "", "to_"]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.NormMean": [[25, 11, 1, "", "add"]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Quantile": [[25, 11, 1, "", "add"], [25, 11, 1, "", "load_state_dict"], [25, 11, 1, "", "normalize"], [25, 11, 1, "", "state_dict"], [25, 11, 1, "", "to_"]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.SecondMoment": [[25, 11, 1, "", "add"], [25, 11, 1, "", "load_state_dict"], [25, 11, 1, "", "state_dict"], [25, 11, 1, "", "to_"]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Stat": [[25, 11, 1, "", "add"], [25, 11, 1, "", "cpu_"], [25, 11, 1, "", "cuda_"], [25, 11, 1, "", "load"], [25, 11, 1, "", "load_state_dict"], [25, 11, 1, "", "save"], [25, 11, 1, "", "state_dict"], [25, 11, 1, "", "to_"]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.TopK": [[25, 11, 1, "", "add"], [25, 11, 1, "", "topk"]], "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats.Variance": [[25, 11, 1, "", "add"], [25, 11, 1, "", "load_state_dict"], [25, 11, 1, "", "state_dict"], [25, 11, 1, "", "to_"]], "intel_extension_for_transformers.tools": [[26, 9, 0, "-", "utils"]], "intel_extension_for_transformers.transformers": [[27, 9, 0, "-", "benchmark"], [28, 9, 0, "-", "config"], [31, 9, 0, "-", "dynamic"], [34, 9, 0, "-", "modeling"], [45, 9, 0, "-", "pipeline"], [46, 9, 0, "-", "pruner"], [48, 9, 0, "-", "quantization"], [245, 9, 0, "-", "runtime"], [246, 9, 0, "-", "trainer"], [249, 9, 0, "-", "utils"]], "intel_extension_for_transformers.transformers.benchmark": [[27, 12, 1, "", "benchmark"], [27, 12, 1, "", "get_example_inputs"], [27, 12, 1, "", "preprocess_model"], [27, 12, 1, "", "refactor_batch_size"]], "intel_extension_for_transformers.transformers.config": [[28, 10, 1, "", "BenchmarkConfig"], [28, 10, 1, "", "DynamicLengthConfig"], [28, 10, 1, "", "Provider"], [28, 10, 1, "", "PrunerV2"], [28, 10, 1, "", "WeightPruningConfig"], [28, 12, 1, "", "check_value"]], "intel_extension_for_transformers.transformers.dynamic": [[29, 9, 0, "-", "drop_and_restore_utils"], [30, 9, 0, "-", "evolution"]], "intel_extension_for_transformers.transformers.dynamic.drop_and_restore_utils": [[29, 12, 1, "", "sample_layer_configuration"], [29, 12, 1, "", "sample_length_configuration"]], "intel_extension_for_transformers.transformers.dynamic.evolution": [[30, 10, 1, "", "Evolution"], [30, 12, 1, "", "approx_ratio"], [30, 12, 1, "", "inverse"], [30, 12, 1, "", "store2str"]], "intel_extension_for_transformers.transformers.dynamic.evolution.Evolution": [[30, 11, 1, "", "add_gene"], [30, 11, 1, "", "convex_hull"], [30, 11, 1, "", "crossover"], [30, 11, 1, "", "get_store"], [30, 11, 1, "", "load_store"], [30, 11, 1, "", "mutate"], [30, 11, 1, "", "pareto_frontier"], [30, 11, 1, "", "save_population"], [30, 11, 1, "", "save_store"], [30, 11, 1, "", "set_lower_constraint"], [30, 11, 1, "", "set_upper_constraint"]], "intel_extension_for_transformers.transformers.kv_cache_compression.models": [[32, 9, 0, "-", "modeling_llama"]], "intel_extension_for_transformers.transformers.kv_cache_compression.models.modeling_llama": [[32, 10, 1, "", "LlamaAttention"], [32, 10, 1, "", "LlamaFlashAttention2"], [32, 10, 1, "", "LlamaSdpaAttention"], [32, 12, 1, "", "apply_rotary_pos_emb"]], "intel_extension_for_transformers.transformers.modeling": [[35, 9, 0, "-", "model"], [36, 9, 0, "-", "modeling_bert_dynamic"], [44, 9, 0, "-", "modeling_roberta_dynamic"]], "intel_extension_for_transformers.transformers.modeling.gpt_bigcode": [[33, 9, 0, "-", "modeling_gpt_bigcode"]], "intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode": [[33, 10, 1, "", "GPTBigCodeForCausalLM"], [33, 10, 1, "", "GPTBigCodeForSequenceClassification"], [33, 10, 1, "", "GPTBigCodeForTokenClassification"], [33, 10, 1, "", "GPTBigCodeModel"], [33, 10, 1, "", "GPTBigCodePreTrainedModel"]], "intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode.GPTBigCodeForCausalLM": [[33, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode.GPTBigCodeForSequenceClassification": [[33, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode.GPTBigCodeForTokenClassification": [[33, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.model": [[35, 10, 1, "", "OptimizedModel"]], "intel_extension_for_transformers.transformers.modeling.model.OptimizedModel": [[35, 11, 1, "", "from_pretrained"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic": [[36, 10, 1, "", "BertAttention"], [36, 10, 1, "", "BertEmbeddings"], [36, 10, 1, "", "BertEncoder"], [36, 10, 1, "", "BertForMaskedLM"], [36, 10, 1, "", "BertForMultipleChoice"], [36, 10, 1, "", "BertForNextSentencePrediction"], [36, 10, 1, "", "BertForPreTraining"], [36, 10, 1, "", "BertForPreTrainingOutput"], [36, 10, 1, "", "BertForQuestionAnswering"], [36, 10, 1, "", "BertForSequenceClassification"], [36, 10, 1, "", "BertForTokenClassification"], [36, 10, 1, "", "BertIntermediate"], [36, 10, 1, "", "BertLMHeadModel"], [36, 10, 1, "", "BertLMPredictionHead"], [36, 10, 1, "", "BertLayer"], [36, 10, 1, "", "BertModel"], [36, 10, 1, "", "BertOnlyMLMHead"], [36, 10, 1, "", "BertOnlyNSPHead"], [36, 10, 1, "", "BertOutput"], [36, 10, 1, "", "BertPooler"], [36, 10, 1, "", "BertPreTrainedModel"], [36, 10, 1, "", "BertPreTrainingHeads"], [36, 10, 1, "", "BertPredictionHeadTransform"], [36, 10, 1, "", "BertSelfAttention"], [36, 10, 1, "", "BertSelfOutput"], [36, 12, 1, "", "expand_gather"], [36, 12, 1, "", "load_tf_weights_in_bert"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertAttention": [[36, 11, 1, "", "forward"], [36, 11, 1, "", "prune_heads"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertEmbeddings": [[36, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertEncoder": [[36, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForMaskedLM": [[36, 11, 1, "", "forward"], [36, 11, 1, "", "get_output_embeddings"], [36, 11, 1, "", "prepare_inputs_for_generation"], [36, 11, 1, "", "set_output_embeddings"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForMultipleChoice": [[36, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForNextSentencePrediction": [[36, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForPreTraining": [[36, 11, 1, "", "forward"], [36, 11, 1, "", "get_output_embeddings"], [36, 11, 1, "", "set_output_embeddings"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForQuestionAnswering": [[36, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForSequenceClassification": [[36, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertForTokenClassification": [[36, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertIntermediate": [[36, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertLMHeadModel": [[36, 11, 1, "", "forward"], [36, 11, 1, "", "get_output_embeddings"], [36, 11, 1, "", "prepare_inputs_for_generation"], [36, 11, 1, "", "set_output_embeddings"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertLMPredictionHead": [[36, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertLayer": [[36, 11, 1, "", "feed_forward_chunk"], [36, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertModel": [[36, 11, 1, "", "forward"], [36, 11, 1, "", "get_input_embeddings"], [36, 11, 1, "", "set_input_embeddings"], [36, 11, 1, "", "set_length_config"], [36, 11, 1, "", "set_output_attentions"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertOnlyMLMHead": [[36, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertOnlyNSPHead": [[36, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertOutput": [[36, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertPooler": [[36, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertPreTrainingHeads": [[36, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertPredictionHeadTransform": [[36, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertSelfAttention": [[36, 11, 1, "", "forward"], [36, 11, 1, "", "transpose_for_scores"]], "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic.BertSelfOutput": [[36, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi": [[43, 9, 0, "-", "streaming_llm"]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.bart": [[37, 9, 0, "-", "modeling_bart"]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.bart.modeling_bart": [[37, 12, 1, "", "gaudi_BartAttention_forward"], [37, 10, 1, "", "gaudi_BartLearnedPositionalEmbedding"]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.bart.modeling_bart.gaudi_BartLearnedPositionalEmbedding": [[37, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.llama": [[38, 9, 0, "-", "pos_shift_llama"]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mistral": [[39, 9, 0, "-", "modeling_mistral"]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mistral.modeling_mistral": [[39, 12, 1, "", "gaudi_mistral_repeat_kv"], [39, 12, 1, "", "gaudi_mistral_rmsnorm_forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral": [[40, 9, 0, "-", "modeling_mixtral"]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral.modeling_mixtral": [[40, 10, 1, "", "GaudiMixtralForCausalLM"], [40, 12, 1, "", "gaudi_mixtral_attention_forward"], [40, 12, 1, "", "gaudi_mixtral_block_sparse_moe_forward"], [40, 12, 1, "", "gaudi_mixtral_decoder_layer_forward"], [40, 12, 1, "", "gaudi_mixtral_model_forward"], [40, 12, 1, "", "gaudi_mixtral_repeat_kv"], [40, 12, 1, "", "gaudi_mixtral_rmsnorm_forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.phi": [[41, 9, 0, "-", "modeling_phi"]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.phi.modeling_phi": [[41, 12, 1, "", "gaudi_phi_attention_forward"], [41, 12, 1, "", "gaudi_phi_decoder_layer_forward"], [41, 12, 1, "", "gaudi_phi_model_forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.swin": [[42, 9, 0, "-", "modeling_swin"]], "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.swin.modeling_swin": [[42, 12, 1, "", "gaudi_swin_get_attn_mask"]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic": [[44, 10, 1, "", "RobertaAttention"], [44, 10, 1, "", "RobertaClassificationHead"], [44, 10, 1, "", "RobertaEmbeddings"], [44, 10, 1, "", "RobertaEncoder"], [44, 10, 1, "", "RobertaForCausalLM"], [44, 10, 1, "", "RobertaForMaskedLM"], [44, 10, 1, "", "RobertaForMultipleChoice"], [44, 10, 1, "", "RobertaForQuestionAnswering"], [44, 10, 1, "", "RobertaForSequenceClassification"], [44, 10, 1, "", "RobertaForTokenClassification"], [44, 10, 1, "", "RobertaIntermediate"], [44, 10, 1, "", "RobertaLMHead"], [44, 10, 1, "", "RobertaLayer"], [44, 10, 1, "", "RobertaModel"], [44, 10, 1, "", "RobertaOutput"], [44, 10, 1, "", "RobertaPooler"], [44, 10, 1, "", "RobertaPreTrainedModel"], [44, 10, 1, "", "RobertaSelfAttention"], [44, 10, 1, "", "RobertaSelfOutput"], [44, 12, 1, "", "create_position_ids_from_input_ids"], [44, 12, 1, "", "expand_gather"]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaAttention": [[44, 11, 1, "", "forward"], [44, 11, 1, "", "prune_heads"]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaClassificationHead": [[44, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaEmbeddings": [[44, 11, 1, "", "create_position_ids_from_inputs_embeds"], [44, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaEncoder": [[44, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForCausalLM": [[44, 11, 1, "", "forward"], [44, 11, 1, "", "get_output_embeddings"], [44, 11, 1, "", "prepare_inputs_for_generation"], [44, 11, 1, "", "set_output_embeddings"]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForMaskedLM": [[44, 11, 1, "", "forward"], [44, 11, 1, "", "get_output_embeddings"], [44, 11, 1, "", "set_output_embeddings"]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForMultipleChoice": [[44, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForQuestionAnswering": [[44, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForSequenceClassification": [[44, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaForTokenClassification": [[44, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaIntermediate": [[44, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaLMHead": [[44, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaLayer": [[44, 11, 1, "", "feed_forward_chunk"], [44, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaModel": [[44, 11, 1, "", "forward"], [44, 11, 1, "", "get_input_embeddings"], [44, 11, 1, "", "set_input_embeddings"], [44, 11, 1, "", "set_length_config"], [44, 11, 1, "", "set_output_attentions"]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaOutput": [[44, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaPooler": [[44, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaPreTrainedModel": [[44, 11, 1, "", "update_keys_to_ignore"]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaSelfAttention": [[44, 11, 1, "", "forward"], [44, 11, 1, "", "transpose_for_scores"]], "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic.RobertaSelfOutput": [[44, 11, 1, "", "forward"]], "intel_extension_for_transformers.transformers.pipeline": [[45, 12, 1, "", "infer_framework_load_model"]], "intel_extension_for_transformers.transformers.pruner": [[47, 9, 0, "-", "pruning"]], "intel_extension_for_transformers.transformers.pruner.pruning": [[47, 10, 1, "", "Pruning"]], "intel_extension_for_transformers.transformers.pruner.pruning.Pruning": [[47, 13, 1, "", "config_file_path"], [47, 11, 1, "", "get_sparsity_ratio"], [47, 13, 1, "", "model"], [47, 11, 1, "", "on_after_eval"], [47, 11, 1, "", "on_after_optimizer_step"], [47, 11, 1, "", "on_before_eval"], [47, 11, 1, "", "on_before_optimizer_step"], [47, 11, 1, "", "on_epoch_begin"], [47, 11, 1, "", "on_epoch_end"], [47, 11, 1, "", "on_step_begin"], [47, 11, 1, "", "on_step_end"], [47, 11, 1, "", "on_train_begin"], [47, 11, 1, "", "on_train_end"], [47, 13, 1, "", "pruner_info"], [47, 13, 1, "", "pruners"], [47, 11, 1, "", "update_config"]], "intel_extension_for_transformers.transformers.runtime": [[58, 9, 0, "-", "compile"], [245, 12, 1, "", "neural_engine_bin"]], "intel_extension_for_transformers.transformers.runtime.compile": [[49, 9, 0, "-", "compile"], [51, 9, 0, "-", "extractors"], [56, 9, 0, "-", "graph"], [57, 9, 0, "-", "graph_utils"], [59, 9, 0, "-", "loaders"], [61, 9, 0, "-", "logger"], [62, 9, 0, "-", "onnx_utils"], [83, 9, 0, "-", "ops"], [128, 9, 0, "-", "optimizer"], [150, 9, 0, "-", "sub_graph"], [243, 9, 0, "-", "tf_utils"], [244, 9, 0, "-", "torch_utils"]], "intel_extension_for_transformers.transformers.runtime.compile.compile": [[49, 12, 1, "", "compile"], [49, 12, 1, "", "start_pipeline"]], "intel_extension_for_transformers.transformers.runtime.compile.extractors": [[50, 9, 0, "-", "extractor"], [52, 9, 0, "-", "onnx_extractor"], [53, 9, 0, "-", "tf_extractor"], [54, 9, 0, "-", "torch_extractor"]], "intel_extension_for_transformers.transformers.runtime.compile.extractors.extractor": [[50, 10, 1, "", "Extractor"]], "intel_extension_for_transformers.transformers.runtime.compile.extractors.onnx_extractor": [[52, 10, 1, "", "ONNXExtractor"]], "intel_extension_for_transformers.transformers.runtime.compile.extractors.tf_extractor": [[53, 10, 1, "", "TensorflowExtractor"]], "intel_extension_for_transformers.transformers.runtime.compile.extractors.torch_extractor": [[54, 10, 1, "", "TorchExtractor"]], "intel_extension_for_transformers.transformers.runtime.compile.graph": [[55, 9, 0, "-", "graph"]], "intel_extension_for_transformers.transformers.runtime.compile.graph.graph": [[55, 10, 1, "", "Graph"]], "intel_extension_for_transformers.transformers.runtime.compile.graph.graph.Graph": [[55, 11, 1, "", "add_config_item"], [55, 11, 1, "", "change_node_input_tensors"], [55, 11, 1, "", "change_node_output_tensors"], [55, 11, 1, "", "dump_tensor"], [55, 11, 1, "", "engine_init"], [55, 11, 1, "", "generate"], [55, 11, 1, "", "get_next_node_names"], [55, 11, 1, "", "get_node_by_name"], [55, 11, 1, "", "get_node_id"], [55, 11, 1, "", "get_pre_node_names"], [55, 11, 1, "", "get_sparse_nodes_name"], [55, 11, 1, "", "get_tensor_idx"], [55, 11, 1, "", "graph_dispatch"], [55, 11, 1, "", "graph_init"], [55, 11, 1, "", "inference"], [55, 11, 1, "", "inquire_config_item"], [55, 11, 1, "", "insert_nodes"], [55, 11, 1, "", "modify_node_connections"], [55, 11, 1, "", "remove_nodes"], [55, 11, 1, "", "rename_node"], [55, 11, 1, "", "save"], [55, 11, 1, "", "transpose_mode_int8"]], "intel_extension_for_transformers.transformers.runtime.compile.graph_utils": [[57, 10, 1, "", "LazyImport"], [57, 12, 1, "", "autocast_init"], [57, 12, 1, "", "construct_node"], [57, 12, 1, "", "del_environ_var"], [57, 12, 1, "", "del_environ_vars"], [57, 12, 1, "", "environ_info_init"], [57, 12, 1, "", "get_autocast_info"], [57, 12, 1, "", "get_data_dtype"], [57, 12, 1, "", "get_environ_info"], [57, 12, 1, "", "get_model_fwk_name"], [57, 12, 1, "", "get_quant_info"], [57, 12, 1, "", "insert_environ_info"], [57, 12, 1, "", "insert_pattern"], [57, 12, 1, "", "insert_quant_info"], [57, 12, 1, "", "list2str"], [57, 12, 1, "", "names_from_input"], [57, 12, 1, "", "pattern_mapping"], [57, 12, 1, "", "pattern_mapping_conf_validation"], [57, 12, 1, "", "quant_info_init"], [57, 12, 1, "", "remove_environ_info_item"], [57, 12, 1, "", "remove_environ_info_items"], [57, 12, 1, "", "search_pattern"], [57, 12, 1, "", "search_straight_pattern"], [57, 12, 1, "", "set_autocast"], [57, 12, 1, "", "set_environ_var"], [57, 12, 1, "", "set_environ_vars"], [57, 12, 1, "", "str2list"]], "intel_extension_for_transformers.transformers.runtime.compile.loaders": [[60, 9, 0, "-", "loader"]], "intel_extension_for_transformers.transformers.runtime.compile.loaders.loader": [[60, 10, 1, "", "Loader"]], "intel_extension_for_transformers.transformers.runtime.compile.logger": [[61, 10, 1, "", "Logger"], [61, 12, 1, "", "debug"], [61, 12, 1, "", "error"], [61, 12, 1, "", "fatal"], [61, 12, 1, "", "info"], [61, 12, 1, "", "log"], [61, 12, 1, "", "warn"], [61, 12, 1, "", "warning"]], "intel_extension_for_transformers.transformers.runtime.compile.logger.Logger": [[61, 11, 1, "", "get_logger"]], "intel_extension_for_transformers.transformers.runtime.compile.onnx_utils": [[62, 12, 1, "", "bias_to_int32"], [62, 12, 1, "", "change_num_name"], [62, 12, 1, "", "get_children"], [62, 12, 1, "", "get_initializer_children_names"], [62, 12, 1, "", "get_node_children_names"], [62, 12, 1, "", "graph_node_names_details"], [62, 12, 1, "", "is_supported_onnx_graph"], [62, 12, 1, "", "is_supported_onnx_node"], [62, 12, 1, "", "onnx_extract_operator"]], "intel_extension_for_transformers.transformers.runtime.compile.ops": [[63, 9, 0, "-", "all"], [64, 9, 0, "-", "assert"], [65, 9, 0, "-", "baddbmm"], [66, 9, 0, "-", "batch_matmul"], [67, 9, 0, "-", "batch_matmul_v2"], [68, 9, 0, "-", "bias_add"], [69, 9, 0, "-", "cast"], [70, 9, 0, "-", "concat"], [71, 9, 0, "-", "conv"], [72, 9, 0, "-", "cos"], [73, 9, 0, "-", "empty_ops"], [74, 9, 0, "-", "expand_dims"], [75, 9, 0, "-", "fused_batch_matmul_v2"], [76, 9, 0, "-", "fused_batch_norm_v3"], [77, 9, 0, "-", "fused_gemm"], [78, 9, 0, "-", "fused_matmul"], [79, 9, 0, "-", "gather"], [80, 9, 0, "-", "gather_elements"], [81, 9, 0, "-", "gelu"], [82, 9, 0, "-", "gemm"], [84, 9, 0, "-", "iterator_get_next"], [85, 9, 0, "-", "iterator_v2"], [86, 9, 0, "-", "layer_normalization"], [87, 9, 0, "-", "log_softmax"], [88, 9, 0, "-", "map_and_batch_dataset"], [89, 9, 0, "-", "matmul"], [90, 9, 0, "-", "mean"], [91, 9, 0, "-", "mkl_layer_norm"], [92, 9, 0, "-", "model_dataset"], [93, 9, 0, "-", "one_hot"], [94, 9, 0, "-", "onnx_input"], [95, 9, 0, "-", "op"], [96, 9, 0, "-", "optimize_dataset"], [97, 9, 0, "-", "pack"], [98, 9, 0, "-", "padding_sequence"], [99, 9, 0, "-", "placeholder"], [100, 9, 0, "-", "pos_embed"], [101, 9, 0, "-", "pow"], [102, 9, 0, "-", "quantize_linear"], [103, 9, 0, "-", "quantize_v2"], [104, 9, 0, "-", "quantized_fused_matmul_and_dequantize"], [105, 9, 0, "-", "quantized_matmul_with_bias_and_dequantize"], [106, 9, 0, "-", "reduce_mean"], [107, 9, 0, "-", "reduce_sum"], [108, 9, 0, "-", "reorder"], [109, 9, 0, "-", "reshape"], [110, 9, 0, "-", "resize"], [111, 9, 0, "-", "rsub"], [112, 9, 0, "-", "scatter_elements"], [113, 9, 0, "-", "shape"], [114, 9, 0, "-", "sin"], [115, 9, 0, "-", "size"], [116, 9, 0, "-", "slice_position_ids"], [117, 9, 0, "-", "softmax"], [118, 9, 0, "-", "split"], [119, 9, 0, "-", "squeeze"], [120, 9, 0, "-", "strided_slice"], [121, 9, 0, "-", "tensor"], [122, 9, 0, "-", "top_k"], [123, 9, 0, "-", "transpose"], [124, 9, 0, "-", "unpack"], [125, 9, 0, "-", "unsqueeze"], [126, 9, 0, "-", "view"], [127, 9, 0, "-", "where"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.all": [[63, 10, 1, "", "All"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.all.All": [[63, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.assert": [[64, 10, 1, "", "Assert"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.assert.Assert": [[64, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.baddbmm": [[65, 10, 1, "", "Baddbmm"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul": [[66, 10, 1, "", "BatchMatMul"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul.BatchMatMul": [[66, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul_v2": [[67, 10, 1, "", "BatchMatMulV2"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul_v2.BatchMatMulV2": [[67, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.bias_add": [[68, 10, 1, "", "BiasAdd"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.bias_add.BiasAdd": [[68, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.cast": [[69, 10, 1, "", "Cast"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.cast.Cast": [[69, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.concat": [[70, 10, 1, "", "Concat"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.concat.Concat": [[70, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.conv": [[71, 10, 1, "", "Conv"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.conv.Conv": [[71, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.cos": [[72, 10, 1, "", "Cos"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.cos.Cos": [[72, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops": [[73, 10, 1, "", "Add"], [73, 10, 1, "", "AddV2"], [73, 10, 1, "", "Arange"], [73, 10, 1, "", "BinaryAdd"], [73, 10, 1, "", "Constant"], [73, 10, 1, "", "ConstantOfShape"], [73, 10, 1, "", "Convolution"], [73, 10, 1, "", "CumSum"], [73, 10, 1, "", "Dequantize"], [73, 10, 1, "", "DequantizeLinear"], [73, 10, 1, "", "Einsum"], [73, 10, 1, "", "EmbeddingBag"], [73, 10, 1, "", "Erf"], [73, 10, 1, "", "Expand"], [73, 10, 1, "", "ExpandIndices"], [73, 10, 1, "", "Fill"], [73, 10, 1, "", "FlatMapDataset"], [73, 10, 1, "", "Flatten"], [73, 10, 1, "", "Floor_divide"], [73, 10, 1, "", "Identity"], [73, 10, 1, "", "InnerProduct"], [73, 10, 1, "", "Input"], [73, 10, 1, "", "LatRange"], [73, 10, 1, "", "ListConstruct"], [73, 10, 1, "", "ListUnpack"], [73, 10, 1, "", "Loop"], [73, 10, 1, "", "MakeIterator"], [73, 10, 1, "", "Masked_fill"], [73, 10, 1, "", "MatMulWithBias"], [73, 10, 1, "", "MatMulWithBiasAdd"], [73, 10, 1, "", "MatMulWithBiasGelu"], [73, 10, 1, "", "MatMulWithBiasRelu"], [73, 10, 1, "", "MatMulWithBiasSigmoid"], [73, 10, 1, "", "MatMulWithBiasTanh"], [73, 10, 1, "", "Matmul"], [73, 10, 1, "", "Max"], [73, 10, 1, "", "MergedEmbeddingbag"], [73, 10, 1, "", "MultiHeadAttenion"], [73, 10, 1, "", "Onehot"], [73, 10, 1, "", "OpAny"], [73, 10, 1, "", "Output"], [73, 10, 1, "", "PositionIds"], [73, 10, 1, "", "QLinearAdd"], [73, 10, 1, "", "QLinearMatMul"], [73, 10, 1, "", "QLinearMul"], [73, 10, 1, "", "Range"], [73, 10, 1, "", "RealDiv"], [73, 10, 1, "", "Reciprocal"], [73, 10, 1, "", "Relu"], [73, 10, 1, "", "Repeat"], [73, 10, 1, "", "Rsqrt"], [73, 10, 1, "", "SequenceLength"], [73, 10, 1, "", "Sigmoid"], [73, 10, 1, "", "Silu"], [73, 10, 1, "", "Sqrt"], [73, 10, 1, "", "Square"], [73, 10, 1, "", "SquaredDifference"], [73, 10, 1, "", "Stack"], [73, 10, 1, "", "StopGradient"], [73, 10, 1, "", "Tanh"], [73, 10, 1, "", "TensorSliceDataset"], [73, 10, 1, "", "Tile"], [73, 10, 1, "", "TokenTypeIds"], [73, 10, 1, "", "TransposeBatchMatMul"], [73, 10, 1, "", "Zeros"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.expand_dims": [[74, 10, 1, "", "ExpandDims"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.expand_dims.ExpandDims": [[74, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_batch_norm_v3": [[76, 10, 1, "", "FusedBatchNormV3"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_batch_norm_v3.FusedBatchNormV3": [[76, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_gemm": [[77, 10, 1, "", "FusedGemm"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_gemm.FusedGemm": [[77, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_matmul": [[78, 10, 1, "", "FusedMatMul"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_matmul.FusedMatMul": [[78, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.gather": [[79, 10, 1, "", "Gather"], [79, 10, 1, "", "GatherV2"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.gather.Gather": [[79, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.gather.GatherV2": [[79, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.gather_elements": [[80, 10, 1, "", "GatherElements"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.gather_elements.GatherElements": [[80, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.gelu": [[81, 10, 1, "", "Gelu"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.gelu.Gelu": [[81, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.gemm": [[82, 10, 1, "", "Gemm"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.gemm.Gemm": [[82, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_get_next": [[84, 10, 1, "", "IteratorGetNext"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_get_next.IteratorGetNext": [[84, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_v2": [[85, 10, 1, "", "IteratorV2"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_v2.IteratorV2": [[85, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.layer_normalization": [[86, 10, 1, "", "LayerNorm"], [86, 10, 1, "", "LayerNormalization"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.layer_normalization.LayerNorm": [[86, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.layer_normalization.LayerNormalization": [[86, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.log_softmax": [[87, 10, 1, "", "LogSoftmax"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.log_softmax.LogSoftmax": [[87, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.map_and_batch_dataset": [[88, 10, 1, "", "MapAndBatchDataset"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.map_and_batch_dataset.MapAndBatchDataset": [[88, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.matmul": [[89, 10, 1, "", "MatMul"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.matmul.MatMul": [[89, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.mean": [[90, 10, 1, "", "Mean"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.mean.Mean": [[90, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.model_dataset": [[92, 10, 1, "", "ModelDataset"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.model_dataset.ModelDataset": [[92, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.one_hot": [[93, 10, 1, "", "OneHot"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.one_hot.OneHot": [[93, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.onnx_input": [[94, 10, 1, "", "ONNXINPUT"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.onnx_input.ONNXINPUT": [[94, 11, 1, "", "extract"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.op": [[95, 10, 1, "", "Operator"], [95, 12, 1, "", "operator_registry"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.op.Operator": [[95, 11, 1, "", "construct"], [95, 11, 1, "", "extract"], [95, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.optimize_dataset": [[96, 10, 1, "", "OptimizeDataset"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.optimize_dataset.OptimizeDataset": [[96, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.pack": [[97, 10, 1, "", "Pack"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.pack.Pack": [[97, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.padding_sequence": [[98, 10, 1, "", "PaddingSequence"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.padding_sequence.PaddingSequence": [[98, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.placeholder": [[99, 10, 1, "", "Placeholder"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.placeholder.Placeholder": [[99, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.pos_embed": [[100, 10, 1, "", "PackagePositionEmbedding"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.pos_embed.PackagePositionEmbedding": [[100, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.pow": [[101, 10, 1, "", "Pow"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.pow.Pow": [[101, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_linear": [[102, 10, 1, "", "Quantize"], [102, 10, 1, "", "QuantizeLinear"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_linear.Quantize": [[102, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_linear.QuantizeLinear": [[102, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_v2": [[103, 10, 1, "", "QuantizeV2"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_v2.QuantizeV2": [[103, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.quantized_matmul_with_bias_and_dequantize": [[105, 10, 1, "", "QuantizedMatMulWithBiasAndDequantize"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.quantized_matmul_with_bias_and_dequantize.QuantizedMatMulWithBiasAndDequantize": [[105, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_mean": [[106, 10, 1, "", "ReduceMean"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_mean.ReduceMean": [[106, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_sum": [[107, 10, 1, "", "ReduceSum"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_sum.ReduceSum": [[107, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.reorder": [[108, 10, 1, "", "Reorder"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.reorder.Reorder": [[108, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.reshape": [[109, 10, 1, "", "Reshape"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.reshape.Reshape": [[109, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.resize": [[110, 10, 1, "", "Resize"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.resize.Resize": [[110, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.rsub": [[111, 10, 1, "", "Rsub"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.rsub.Rsub": [[111, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.scatter_elements": [[112, 10, 1, "", "ScatterElements"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.scatter_elements.ScatterElements": [[112, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.shape": [[113, 10, 1, "", "Shape"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.shape.Shape": [[113, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.sin": [[114, 10, 1, "", "Sin"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.sin.Sin": [[114, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.size": [[115, 10, 1, "", "Size"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.size.Size": [[115, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.slice_position_ids": [[116, 10, 1, "", "SlicePositionIds"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.slice_position_ids.SlicePositionIds": [[116, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.softmax": [[117, 10, 1, "", "Softmax"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.softmax.Softmax": [[117, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.split": [[118, 10, 1, "", "Split"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.split.Split": [[118, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.squeeze": [[119, 10, 1, "", "Squeeze"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.squeeze.Squeeze": [[119, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.strided_slice": [[120, 10, 1, "", "StridedSlice"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.strided_slice.StridedSlice": [[120, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.tensor": [[121, 10, 1, "", "Tensor"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.top_k": [[122, 10, 1, "", "TopK"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.top_k.TopK": [[122, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.transpose": [[123, 10, 1, "", "Transpose"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.transpose.Transpose": [[123, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.unpack": [[124, 10, 1, "", "Unpack"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.unpack.Unpack": [[124, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.unsqueeze": [[125, 10, 1, "", "Unsqueeze"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.unsqueeze.Unsqueeze": [[125, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.view": [[126, 10, 1, "", "View"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.view.View": [[126, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.where": [[127, 10, 1, "", "Where"]], "intel_extension_for_transformers.transformers.runtime.compile.ops.where.Where": [[127, 11, 1, "", "set_attr"]], "intel_extension_for_transformers.transformers.runtime.compile.optimizer": [[128, 10, 1, "", "Optimizer"]], "intel_extension_for_transformers.transformers.runtime.compile.optimizer.Optimizer": [[128, 11, 1, "", "optimize"], [128, 11, 1, "", "weight_optimization"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph": [[129, 9, 0, "-", "InnerproductReshapeFusion"], [130, 9, 0, "-", "add_cls_token"], [131, 9, 0, "-", "add_embeddings"], [132, 9, 0, "-", "arangewithreciprocal"], [133, 9, 0, "-", "attentionBlock_AttentionMaskAddReshape"], [134, 9, 0, "-", "attentionBlock_ConstantOfShapeWithMul"], [135, 9, 0, "-", "attentionBlock_QKVPreReshape"], [136, 9, 0, "-", "attentionBlock_QKVReshape"], [137, 9, 0, "-", "attentionBlock_WeightReshapeTo4D"], [138, 9, 0, "-", "attention_mask_length_adaptive_keep_indices"], [139, 9, 0, "-", "attention_output_layer_norm_length_adaptive_keep_indices"], [140, 9, 0, "-", "attention_reshape"], [141, 9, 0, "-", "cast_to"], [142, 9, 0, "-", "collect_quant_info"], [143, 9, 0, "-", "conv_reshape"], [144, 9, 0, "-", "decoder_attn_reshape"], [145, 9, 0, "-", "einsumwitharange"], [146, 9, 0, "-", "embeddingbag"], [147, 9, 0, "-", "embeddings_to_2d_before_inner_product"], [148, 9, 0, "-", "gelu"], [149, 9, 0, "-", "generate_sequence"], [151, 9, 0, "-", "innerproductwithbiasgelu"], [152, 9, 0, "-", "innerproductwithslice"], [153, 9, 0, "-", "innerproductwithswish"], [154, 9, 0, "-", "input_data"], [155, 9, 0, "-", "input_file"], [156, 9, 0, "-", "insert_bf16_node"], [157, 9, 0, "-", "insert_quant_node"], [158, 9, 0, "-", "int8_bf16_mixed_precision_checker"], [159, 9, 0, "-", "interact_features"], [160, 9, 0, "-", "last_layer_shape"], [161, 9, 0, "-", "layer_norm"], [162, 9, 0, "-", "layer_norm_with_reduce_mean"], [163, 9, 0, "-", "layer_norm_with_transpose"], [164, 9, 0, "-", "llama_embeding"], [165, 9, 0, "-", "llama_matmulwithtranspose"], [166, 9, 0, "-", "llama_postprocess"], [167, 9, 0, "-", "llama_rotary_pos_emb"], [168, 9, 0, "-", "lower_all_tuples"], [169, 9, 0, "-", "matmul_with_bias"], [170, 9, 0, "-", "matmul_with_bias_add"], [171, 9, 0, "-", "matmul_with_bias_gelu"], [172, 9, 0, "-", "matmul_with_bias_relu"], [173, 9, 0, "-", "matmul_with_bias_sigmoid"], [174, 9, 0, "-", "matmul_with_bias_tanh"], [175, 9, 0, "-", "matmul_with_bias_unsqueeze"], [176, 9, 0, "-", "matmul_with_transpose"], [177, 9, 0, "-", "matmul_with_transpose_scale_add"], [178, 9, 0, "-", "merged_embeddingbag"], [179, 9, 0, "-", "neox_reorder_change"], [180, 9, 0, "-", "neox_rotary_pos_emb"], [181, 9, 0, "-", "operator_adaptor"], [182, 9, 0, "-", "output_data"], [183, 9, 0, "-", "padding_sequence"], [184, 9, 0, "-", "pattern"], [185, 9, 0, "-", "position_embeddings"], [186, 9, 0, "-", "position_embeddings_v1"], [187, 9, 0, "-", "qkv_merge"], [188, 9, 0, "-", "qkv_reshape"], [189, 9, 0, "-", "quant_gather_to_bf16"], [190, 9, 0, "-", "quantize_fusion"], [191, 9, 0, "-", "quantized_graph_dtype_refactor"], [192, 9, 0, "-", "remove_constant_op"], [193, 9, 0, "-", "remove_last_view"], [194, 9, 0, "-", "remove_range"], [195, 9, 0, "-", "remove_unused_operator"], [196, 9, 0, "-", "remove_zeros"], [197, 9, 0, "-", "removeslice"], [198, 9, 0, "-", "reshape_after_restore_hidden_states"], [199, 9, 0, "-", "reshape_before_and_after_attention_out_layer_norm_gather_elements"], [200, 9, 0, "-", "reshape_before_restore_hidden_states"], [201, 9, 0, "-", "reshape_fusion"], [202, 9, 0, "-", "restore_hidden_states_in_length_adaptive_update_indices"], [203, 9, 0, "-", "rms_norm"], [204, 9, 0, "-", "rotary_pos_emb"], [205, 9, 0, "-", "slicemask"], [206, 9, 0, "-", "stableDiffusion_ExplicitNHWCTranspose"], [207, 9, 0, "-", "stableDiffusion_ExplicitNHWCTransposeQAT"], [208, 9, 0, "-", "stableDiffusion_MHAReshape"], [209, 9, 0, "-", "stableDiffusion_QuantizeFusion"], [210, 9, 0, "-", "stableDiffusion_ReshapeFusion"], [211, 9, 0, "-", "stableDiffusion_bf16Convert"], [212, 9, 0, "-", "stableDiffusion_collectQDQInfo"], [213, 9, 0, "-", "stableDiffusion_insertQuantNode"], [214, 9, 0, "-", "start_end_logits"], [215, 9, 0, "-", "subgraph_matcher"], [216, 9, 0, "-", "textEncdoer_word_embedding"], [217, 9, 0, "-", "textEncoder_AttentionMaskAddReshape"], [218, 9, 0, "-", "textEncoder_AttentionReshape"], [219, 9, 0, "-", "textEncoder_KVReshape"], [220, 9, 0, "-", "textEncoder_MulReshape"], [221, 9, 0, "-", "textEncoder_QReshape"], [222, 9, 0, "-", "textEncoder_SoftmaxReshape"], [223, 9, 0, "-", "textEncoder_causal_attention_mask"], [224, 9, 0, "-", "token_type_embeddings"], [225, 9, 0, "-", "token_type_embeddings_v1"], [226, 9, 0, "-", "torch_embedding"], [227, 9, 0, "-", "torch_ip_insert_bias"], [228, 9, 0, "-", "torch_unpack_baddbmm"], [229, 9, 0, "-", "torchinsertbf16node"], [230, 9, 0, "-", "torchpaddingsquence"], [231, 9, 0, "-", "transformer2Dmodel_AttentionMaskAddReshape"], [232, 9, 0, "-", "transformer2Dmodel_ConstantOfShapeWithMul"], [233, 9, 0, "-", "transformer2Dmodel_FFNSlice"], [234, 9, 0, "-", "transformer2Dmodel_FFNSlice_1"], [235, 9, 0, "-", "transformer2Dmodel_QKVPreReshape"], [236, 9, 0, "-", "transformer2Dmodel_QKVReshape"], [237, 9, 0, "-", "transformer2Dmodel_QKVReshape4D"], [238, 9, 0, "-", "transformer2Dmodel_encoderHiddenStatesReshape"], [239, 9, 0, "-", "transformer2Dmodel_getSampleBatch"], [240, 9, 0, "-", "transformer2Dmodel_sampleSlice"], [241, 9, 0, "-", "transpose_batch_matmul"], [242, 9, 0, "-", "word_embeddings"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.InnerproductReshapeFusion": [[129, 10, 1, "", "InnerproductReshapeFusion"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.add_cls_token": [[130, 10, 1, "", "AddClsToken"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.add_embeddings": [[131, 10, 1, "", "AddEmbeddings"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.arangewithreciprocal": [[132, 10, 1, "", "ArangewithReciprocal"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_AttentionMaskAddReshape": [[133, 10, 1, "", "AttentionBlock_AttentionMaskAddReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_ConstantOfShapeWithMul": [[134, 10, 1, "", "AttentionBlock_ConstantOfShapeWithMul"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_QKVPreReshape": [[135, 10, 1, "", "AttentionBlock_QKVPreReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_QKVReshape": [[136, 10, 1, "", "AttentionBlock_QKVReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_WeightReshapeTo4D": [[137, 10, 1, "", "AttentionBlock_WeightReshapeTo4D"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_mask_length_adaptive_keep_indices": [[138, 10, 1, "", "AttentionMaskLengthAdaptiveExpandIndices"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_output_layer_norm_length_adaptive_keep_indices": [[139, 10, 1, "", "AttentionOutputLayerNormLengthAdaptiveExpandIndices"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_reshape": [[140, 10, 1, "", "AttentionReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.cast_to": [[141, 10, 1, "", "CastTo"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.collect_quant_info": [[142, 10, 1, "", "CollectQuantInfo"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.conv_reshape": [[143, 10, 1, "", "ConvReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.decoder_attn_reshape": [[144, 10, 1, "", "DecoderAttnReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.einsumwitharange": [[145, 10, 1, "", "EinsumwithArange"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.embeddingbag": [[146, 10, 1, "", "EmbeddingBag"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.embeddings_to_2d_before_inner_product": [[147, 10, 1, "", "EmbeddingsTo2DBeforeInnerProduct"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.gelu": [[148, 10, 1, "", "Gelu"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.generate_sequence": [[149, 10, 1, "", "GenerateSequence"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithbiasgelu": [[151, 10, 1, "", "InnerproductWithBiasGelu"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithslice": [[152, 10, 1, "", "InnerproductwithSlice"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithswish": [[153, 10, 1, "", "InnerproductWithSwish"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.input_data": [[154, 10, 1, "", "InputData"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.input_file": [[155, 10, 1, "", "InputFile"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.insert_bf16_node": [[156, 10, 1, "", "InsertBF16Node"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.insert_quant_node": [[157, 10, 1, "", "InsertQuantNode"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.int8_bf16_mixed_precision_checker": [[158, 10, 1, "", "Int8BF16MixedPrecisionChecker"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.interact_features": [[159, 10, 1, "", "InteractFeatures"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.last_layer_shape": [[160, 10, 1, "", "LastLayerShape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm": [[161, 10, 1, "", "LayerNorm"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm_with_reduce_mean": [[162, 10, 1, "", "LayerNormWithReduceMean"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm_with_transpose": [[163, 10, 1, "", "LayerNormWithTranspose"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_embeding": [[164, 10, 1, "", "LlamaEmbeddings"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_matmulwithtranspose": [[165, 10, 1, "", "LlamaMatMulWithTranspose"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_postprocess": [[166, 10, 1, "", "LlamaPostprocess"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_rotary_pos_emb": [[167, 10, 1, "", "LlamaRoraryPosEmb"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.lower_all_tuples": [[168, 10, 1, "", "LowerAllTuples"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias": [[169, 10, 1, "", "MatMulWithBias"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_add": [[170, 10, 1, "", "MatMulWithBiasAdd"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_gelu": [[171, 10, 1, "", "MatMulWithBiasGelu"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_relu": [[172, 10, 1, "", "MatMulWithBiasRelu"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_sigmoid": [[173, 10, 1, "", "MatMulWithBiasSigmoid"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_tanh": [[174, 10, 1, "", "MatmulWithBiasTanh"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_unsqueeze": [[175, 10, 1, "", "MatMulWithBiasUnsqueeze"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_transpose": [[176, 10, 1, "", "MatMulWithTranspose"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_transpose_scale_add": [[177, 10, 1, "", "MatMulWithTranspose"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.merged_embeddingbag": [[178, 10, 1, "", "MergedEmbeddingbag"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.neox_reorder_change": [[179, 10, 1, "", "NeoxReorderChange"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.neox_rotary_pos_emb": [[180, 10, 1, "", "NeoxRoraryPosEmb"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.operator_adaptor": [[181, 10, 1, "", "OperatorAdaptor"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.output_data": [[182, 10, 1, "", "OutputData"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.padding_sequence": [[183, 10, 1, "", "PaddingSequence"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.pattern": [[184, 10, 1, "", "Pattern"], [184, 12, 1, "", "pattern_registry"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.position_embeddings": [[185, 10, 1, "", "PositionEmbeddings"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.position_embeddings_v1": [[186, 10, 1, "", "PositionEmbeddingsV1"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.qkv_merge": [[187, 10, 1, "", "QKVMerge"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.qkv_reshape": [[188, 10, 1, "", "QKVReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quant_gather_to_bf16": [[189, 10, 1, "", "TorchInsertBF16Node"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quantize_fusion": [[190, 10, 1, "", "QuantizeFusion"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quantized_graph_dtype_refactor": [[191, 10, 1, "", "QuantizedGraphDtypeCheck"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_constant_op": [[192, 10, 1, "", "RemoveConstantOP"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_last_view": [[193, 10, 1, "", "RemoveLastView"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_range": [[194, 10, 1, "", "RemoveRange"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_unused_operator": [[195, 10, 1, "", "RemoveUnusedOperator"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_zeros": [[196, 10, 1, "", "RemoveZeros"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.removeslice": [[197, 10, 1, "", "RemoveSlice"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_after_restore_hidden_states": [[198, 10, 1, "", "ReshapeAfterRestoreHiddenStates"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_before_and_after_attention_out_layer_norm_gather_elements": [[199, 10, 1, "", "ReshapeBeforeAndAfterAttentionOutLayerNormGatherElements"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_before_restore_hidden_states": [[200, 10, 1, "", "ReshapeBeforeRestoreHiddenStates"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_fusion": [[201, 10, 1, "", "ReshapeFusion"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.restore_hidden_states_in_length_adaptive_update_indices": [[202, 10, 1, "", "RestoreHiddenStatesInLengthAdaptive"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.rms_norm": [[203, 10, 1, "", "RmsNorm"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.rotary_pos_emb": [[204, 10, 1, "", "RoraryPosEmb"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.slicemask": [[205, 10, 1, "", "SliceMask"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_ExplicitNHWCTranspose": [[206, 10, 1, "", "ExplicitNHWCTransposeForConv"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_ExplicitNHWCTransposeQAT": [[207, 10, 1, "", "ExplicitNHWCTransposeForConvQAT"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_MHAReshape": [[208, 10, 1, "", "StableDiffusion_MHAReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_QuantizeFusion": [[209, 10, 1, "", "StableDiffusion_QuantizeFusion"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_ReshapeFusion": [[210, 10, 1, "", "StableDiffusion_ReshapeFusion"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_bf16Convert": [[211, 10, 1, "", "StableDiffusion_bf16Convert"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_collectQDQInfo": [[212, 10, 1, "", "StableDiffusion_CollectQuantInfo"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_insertQuantNode": [[213, 10, 1, "", "StableDiffusion_InsertQuantNode"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.start_end_logits": [[214, 10, 1, "", "StartEndLogits"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.subgraph_matcher": [[215, 10, 1, "", "SubGraphMatcher"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncdoer_word_embedding": [[216, 10, 1, "", "TextEncoder_WordEmbedding"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_AttentionMaskAddReshape": [[217, 10, 1, "", "TextEncoder_AttentionMaskAddReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_AttentionReshape": [[218, 10, 1, "", "TextEncoder_AttentionReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_KVReshape": [[219, 10, 1, "", "TextEncoder_KVReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_MulReshape": [[220, 10, 1, "", "TextEncoder_MulReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_QReshape": [[221, 10, 1, "", "TextEncoder_QReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_SoftmaxReshape": [[222, 10, 1, "", "TextEncoder_SoftmaxReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_causal_attention_mask": [[223, 10, 1, "", "TextEncoder_CasualAttentionMask"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.token_type_embeddings": [[224, 10, 1, "", "TokenTypeEmbeddings"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.token_type_embeddings_v1": [[225, 10, 1, "", "TokenTypeEmbeddingsV1"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_embedding": [[226, 10, 1, "", "TorchEmbedding"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_ip_insert_bias": [[227, 10, 1, "", "TorchInnerProductInsertBias"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_unpack_baddbmm": [[228, 10, 1, "", "TorchUnpackBaddbmm"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torchinsertbf16node": [[229, 10, 1, "", "TorchInsertBF16Node"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torchpaddingsquence": [[230, 10, 1, "", "TorchPaddingSequence"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_AttentionMaskAddReshape": [[231, 10, 1, "", "Transformer2Dmodel_AttentionMaskAddReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_ConstantOfShapeWithMul": [[232, 10, 1, "", "Transformer2Dmodel_ConstantOfShapeWithMul"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_FFNSlice": [[233, 10, 1, "", "Transformer2Dmodel_FFNInputSlice"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_FFNSlice_1": [[234, 10, 1, "", "Transformer2Dmodel_FFNInputSlice_1"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_QKVPreReshape": [[235, 10, 1, "", "Transformer2Dmodel_QKVPreReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_QKVReshape": [[236, 10, 1, "", "Transformer2Dmodel_QKVReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_QKVReshape4D": [[237, 10, 1, "", "Transformer2Dmodel_QKVReshapeTo4D"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_encoderHiddenStatesReshape": [[238, 10, 1, "", "Transformer2Dmodel_EncoderHiddenStatesReshape"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_getSampleBatch": [[239, 10, 1, "", "Transformer2Dmodel_GetSampleBatch"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_sampleSlice": [[240, 10, 1, "", "Transformer2Dmodel_SampleSlice"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transpose_batch_matmul": [[241, 10, 1, "", "TransposeBatchMatMul"]], "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.word_embeddings": [[242, 10, 1, "", "WordEmbeddings"]], "intel_extension_for_transformers.transformers.runtime.compile.tf_utils": [[243, 15, 1, "", "TF_DTYPE_ID"], [243, 12, 1, "", "create_tf_node"], [243, 12, 1, "", "get_tensor_dest_op"], [243, 12, 1, "", "graph_node_names_details"], [243, 12, 1, "", "tf_extract_operator"]], "intel_extension_for_transformers.transformers.runtime.compile.torch_utils": [[244, 12, 1, "", "torch_extract_operator"]], "intel_extension_for_transformers.transformers.trainer": [[246, 10, 1, "", "BaseTrainer"], [246, 10, 1, "", "NLPSeq2SeqTrainer"], [246, 10, 1, "", "NLPTrainer"]], "intel_extension_for_transformers.transformers.trainer.BaseTrainer": [[246, 11, 1, "", "benchmark"], [246, 11, 1, "", "builtin_eval_func"], [246, 11, 1, "", "builtin_train_func"], [246, 11, 1, "", "compute_loss"], [246, 11, 1, "", "distill"], [246, 11, 1, "", "export_to_bf16_onnx"], [246, 11, 1, "", "export_to_fp32_onnx"], [246, 11, 1, "", "export_to_int8_onnx"], [246, 11, 1, "", "export_to_jit"], [246, 11, 1, "", "export_to_onnx"], [246, 11, 1, "", "get_export_args"], [246, 11, 1, "", "infer_task"], [246, 11, 1, "", "orchestrate_optimizations"], [246, 11, 1, "", "prune"], [246, 11, 1, "", "quantize"], [246, 11, 1, "", "run_evolutionary_search"], [246, 11, 1, "", "set_dynamic_config"], [246, 11, 1, "", "train"], [246, 11, 1, "", "training_step"], [246, 11, 1, "", "training_step_length_adaptive"]], "intel_extension_for_transformers.transformers.trainer.NLPSeq2SeqTrainer": [[246, 11, 1, "", "builtin_eval_func"]], "intel_extension_for_transformers.transformers.utils": [[247, 9, 0, "-", "config"], [248, 9, 0, "-", "get_throughput"], [250, 9, 0, "-", "metrics"], [251, 9, 0, "-", "objectives"], [252, 9, 0, "-", "utility"]], "intel_extension_for_transformers.transformers.utils.config": [[247, 10, 1, "", "AutoRoundConfig"], [247, 10, 1, "", "AwqConfig"], [247, 10, 1, "", "DynamicQuantConfig"], [247, 10, 1, "", "GPTQConfig"], [247, 10, 1, "", "ITREXQuantizationConfigMixin"], [247, 10, 1, "", "QuantAwareTrainingConfig"], [247, 10, 1, "", "QuantizationMethod"], [247, 10, 1, "", "RtnConfig"], [247, 10, 1, "", "SmoothQuantConfig"], [247, 10, 1, "", "StaticQuantConfig"], [247, 10, 1, "", "TeqConfig"]], "intel_extension_for_transformers.transformers.utils.config.AutoRoundConfig": [[247, 11, 1, "", "to_diff_dict"]], "intel_extension_for_transformers.transformers.utils.config.AwqConfig": [[247, 11, 1, "", "to_diff_dict"]], "intel_extension_for_transformers.transformers.utils.config.GPTQConfig": [[247, 11, 1, "", "post_init_gptq"], [247, 11, 1, "", "to_diff_dict"]], "intel_extension_for_transformers.transformers.utils.config.ITREXQuantizationConfigMixin": [[247, 11, 1, "", "post_init_cpu"], [247, 11, 1, "", "post_init_runtime"], [247, 11, 1, "", "post_init_xpu"], [247, 11, 1, "", "save_pretrained"], [247, 11, 1, "", "to_json_file"], [247, 11, 1, "", "update"]], "intel_extension_for_transformers.transformers.utils.config.RtnConfig": [[247, 11, 1, "", "to_diff_dict"]], "intel_extension_for_transformers.transformers.utils.config.TeqConfig": [[247, 11, 1, "", "to_diff_dict"]], "intel_extension_for_transformers.transformers.utils.metrics": [[250, 10, 1, "", "Metric"]], "intel_extension_for_transformers.transformers.utils.objectives": [[251, 10, 1, "", "Objective"]], "intel_extension_for_transformers.transformers.utils.objectives.Objective": [[251, 11, 1, "", "modelsize"], [251, 11, 1, "", "performance"]], "intel_extension_for_transformers.transformers.utils.utility": [[252, 12, 1, "", "distributed_init"]], "models": [[255, 9, 0, "-", "backbone"], [256, 9, 0, "-", "detr"], [257, 9, 0, "-", "detr_multi"], [258, 9, 0, "-", "matcher"], [259, 9, 0, "-", "position_encoding"], [260, 9, 0, "-", "segmentation"], [261, 9, 0, "-", "transformer"]], "models.backbone": [[255, 10, 1, "", "Backbone"], [255, 10, 1, "", "FrozenBatchNorm2d"]], "models.detr": [[256, 10, 1, "", "DETR"], [256, 10, 1, "", "MLP"], [256, 10, 1, "", "PostProcess"], [256, 10, 1, "", "SetCriterion"]], "models.detr.DETR": [[256, 11, 1, "", "forward"]], "models.detr.PostProcess": [[256, 11, 1, "", "forward"]], "models.detr.SetCriterion": [[256, 11, 1, "", "forward"], [256, 11, 1, "", "loss_boxes"], [256, 11, 1, "", "loss_cardinality"], [256, 11, 1, "", "loss_labels"], [256, 11, 1, "", "loss_masks"]], "models.detr_multi": [[257, 10, 1, "", "DETRMulti"], [257, 10, 1, "", "MLP"], [257, 10, 1, "", "PostProcess"], [257, 10, 1, "", "SetCriterion"]], "models.detr_multi.DETRMulti": [[257, 11, 1, "", "forward"]], "models.detr_multi.PostProcess": [[257, 11, 1, "", "forward"]], "models.detr_multi.SetCriterion": [[257, 11, 1, "", "forward"], [257, 11, 1, "", "loss_boxes"], [257, 11, 1, "", "loss_cardinality"], [257, 11, 1, "", "loss_labels"], [257, 11, 1, "", "loss_masks"]], "models.matcher": [[258, 10, 1, "", "HungarianMatcher"]], "models.matcher.HungarianMatcher": [[258, 11, 1, "", "forward"]], "models.position_encoding": [[259, 10, 1, "", "PositionEmbeddingLearned"], [259, 10, 1, "", "PositionEmbeddingSine"]], "models.segmentation": [[260, 10, 1, "", "MHAttentionMap"], [260, 10, 1, "", "MaskHeadSmallConv"], [260, 10, 1, "", "PostProcessPanoptic"], [260, 12, 1, "", "dice_loss"], [260, 12, 1, "", "sigmoid_focal_loss"]], "models.segmentation.PostProcessPanoptic": [[260, 11, 1, "", "forward"]], "text": [[262, 12, 1, "", "text_to_sequence"]], "util": [[263, 9, 0, "-", "box_ops"], [264, 9, 0, "-", "misc"], [265, 9, 0, "-", "plot_utils"], [266, 9, 0, "-", "postprocess"]], "util.box_ops": [[263, 12, 1, "", "generalized_box_iou"], [263, 12, 1, "", "masks_to_boxes"]], "util.misc": [[264, 10, 1, "", "SmoothedValue"], [264, 12, 1, "", "accuracy"], [264, 12, 1, "", "all_gather"], [264, 12, 1, "", "interpolate"], [264, 12, 1, "", "reduce_dict"], [264, 12, 1, "", "setup_for_distributed"]], "util.misc.SmoothedValue": [[264, 11, 1, "", "synchronize_between_processes"]], "util.plot_utils": [[265, 12, 1, "", "plot_logs"]], "util.postprocess": [[266, 12, 1, "", "align_columns"], [266, 12, 1, "", "align_headers"], [266, 12, 1, "", "align_rows"], [266, 12, 1, "", "align_supercells"], [266, 12, 1, "", "apply_class_thresholds"], [266, 12, 1, "", "apply_threshold"], [266, 12, 1, "", "extract_text_from_spans"], [266, 12, 1, "", "extract_text_inside_bbox"], [266, 12, 1, "", "get_bbox_span_subset"], [266, 12, 1, "", "header_supercell_tree"], [266, 12, 1, "", "iob"], [266, 12, 1, "", "iou"], [266, 12, 1, "", "nms"], [266, 12, 1, "", "nms_by_containment"], [266, 12, 1, "", "nms_supercells"], [266, 12, 1, "", "objects_to_cells"], [266, 12, 1, "", "objects_to_table_structures"], [266, 12, 1, "", "overlaps"], [266, 12, 1, "", "refine_columns"], [266, 12, 1, "", "refine_rows"], [266, 12, 1, "", "refine_table_structures"], [266, 12, 1, "", "remove_objects_without_content"], [266, 12, 1, "", "remove_supercell_overlap"], [266, 12, 1, "", "slot_into_containers"], [266, 12, 1, "", "sort_objects_by_score"], [266, 12, 1, "", "sort_objects_left_to_right"], [266, 12, 1, "", "sort_objects_top_to_bottom"], [266, 12, 1, "", "table_structure_to_cells"]], "utils": [[267, 9, 0, "-", "data_utils"], [268, 9, 0, "-", "eval_utils"]], "utils.data_utils": [[267, 12, 1, "", "get_multi_choice_info"], [267, 12, 1, "", "save_jsonl"]], "utils.eval_utils": [[268, 12, 1, "", "calculate_ins_level_acc"], [268, 12, 1, "", "check_is_number"], [268, 12, 1, "", "eval_multi_choice"], [268, 12, 1, "", "eval_open"], [268, 12, 1, "", "evaluate"], [268, 12, 1, "", "extract_numbers"], [268, 12, 1, "", "normalize_str"], [268, 12, 1, "", "parse_multi_choice_response"], [268, 12, 1, "", "parse_open_response"]]}, "objnames": {"0": ["c", "macro", "C macro"], "1": ["cpp", "type", "C++ type"], "2": ["cpp", "enumerator", "C++ enumerator"], "3": ["cpp", "class", "C++ class"], "4": ["cpp", "function", "C++ function"], "5": ["cpp", "functionParam", "C++ function parameter"], "6": ["cpp", "enum", "C++ enum"], "7": ["cpp", "member", "C++ member"], "8": ["cpp", "templateParam", "C++ template parameter"], "9": ["py", "module", "Python module"], "10": ["py", "class", "Python class"], "11": ["py", "method", "Python method"], "12": ["py", "function", "Python function"], "13": ["py", "attribute", "Python attribute"], "14": ["py", "exception", "Python exception"], "15": ["py", "data", "Python data"]}, "objtypes": {"0": "c:macro", "1": "cpp:type", "2": "cpp:enumerator", "3": "cpp:class", "4": "cpp:function", "5": "cpp:functionParam", "6": "cpp:enum", "7": "cpp:member", "8": "cpp:templateParam", "9": "py:module", "10": "py:class", "11": "py:method", "12": "py:function", "13": "py:attribute", "14": "py:exception", "15": "py:data"}, "terms": {"": [22, 24, 25, 28, 44, 57, 62, 95, 147, 243, 246, 247, 256, 257, 260, 265, 269, 270, 272, 279, 298, 302, 303, 309, 313, 314, 316, 319, 321, 323, 324, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 335, 336, 338, 339, 340, 341, 342, 343, 344, 345, 346, 348, 349, 354, 355, 356, 361, 363, 369, 370, 372, 376, 378, 379, 380, 381, 382, 383, 384, 387, 388, 389, 391, 392, 394, 396, 397, 402, 406, 408, 411, 413, 414, 418, 420, 421, 423, 425, 426], "0": [9, 20, 21, 24, 25, 28, 30, 33, 36, 37, 44, 55, 57, 243, 247, 250, 252, 256, 257, 260, 265, 266, 278, 279, 281, 289, 302, 303, 306, 308, 309, 313, 314, 315, 316, 321, 322, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 346, 347, 348, 349, 351, 352, 353, 354, 355, 358, 359, 361, 362, 363, 365, 366, 369, 371, 372, 375, 376, 383, 384, 385, 387, 388, 389, 390, 391, 393, 394, 395, 397, 401, 402, 403, 404, 405, 409, 410, 411, 412, 413, 415, 416, 418, 419, 421, 423, 425, 426, 427, 428, 432], "00": [288, 361, 366, 389, 425], "000": [25, 389], "0003575115963544682": 385, "00035751489124038457": 385, "00163713": 411, "00164658": 411, "00171023": 411, "00179382": 411, "00180316": 411, "00198061": 411, "00203027": 411, "00216633": 411, "00217889": 411, "00223598": 411, "00226557": 411, "00235812": 411, "00241": 411, "00243581": 411, "00245821": 411, "0025": 432, "00252331": 411, "00261406": 411, "00265488": 411, "00269113": 411, "00270954": 411, "00289114": 411, "00291005": 411, "00292684": 411, "00293671": 411, "0029515": 411, "00297233": 411, "00297784": 411, "003": [289, 314, 349], "00308582": 411, "00310676": 411, "00315343": 411, "0031551": 411, "00317296": 411, "00317463": 411, "00332212": 411, "00338962": 411, "00340452": 411, "00341811": 411, "00343822": 411, "00344785": 411, "0034657": 411, "00348609": 411, "00350486": 411, "00367406": 411, "00368479": 411, "00385131": 411, "00389863": 411, "00393276": 411, "00393589": 411, "00394876": 411, "00395783": 411, "00396819": 411, "0040612": 411, "00407149": 411, "00418212": 411, "0042401": 411, "00425647": 411, "00437142": 411, "00443796": 411, "00447544": 411, "00448956": 411, "00449335": 411, "00451855": 411, "00464705": 411, "00466269": 411, "00480098": 411, "00481074": 411, "00483104": 411, "00484669": 411, "00488058": 411, "00493861": 411, "004m": 389, "00500361": 411, "00502113": 411, "00502473": 411, "00503604": 411, "00514768": 411, "0051719": 411, "00517637": 411, "00526697": 411, "00535855": 411, "00542595": 411, "00548478": 411, "0054935": 411, "00555918": 411, "00561026": 411, "00565044": 411, "00570175": 411, "00570293": 411, "00578904": 411, "00579899": 411, "0058452": 411, "00584761": 411, "00593063": 411, "00609695": 411, "00633179": 411, "00643591": 411, "00651108": 411, "00653312": 411, "0065352": 411, "00655363": 411, "00655654": 411, "00656544": 411, "00657187": 411, "00659512": 411, "00667871": 411, "00672351": 411, "00677631": 411, "00693265": 411, "00698123": 411, "00701343": 411, "00716987": 411, "00727645": 411, "00731429": 411, "00741956": 411, "00744553": 411, "0074474": 411, "00745008": 411, "00749636": 411, "00755406": 411, "00759056": 411, "00760217": 411, "00761117": 411, "00764146": 411, "00781277": 411, "00785878": 411, "00794258": 411, "00811779": 411, "00821985": 411, "00826017": 411, "00828943": 411, "00835933": 411, "00850778": 411, "00860835": 411, "00869796": 411, "00879332": 411, "00893304": 411, "00896329": 411, "00897444": 411, "0090376": 411, "00908195": 411, "00910648": 411, "00914975": 411, "00920252": 411, "00921101": 411, "00923343": 411, "00925277": 411, "0092883": 411, "009382": 411, "00940157": 411, "00940534": 411, "00947462": 411, "00959948": 411, "00978": 432, "00979113": 411, "00980134": 411, "00992419": 411, "00e": 425, "00x": 425, "01": [250, 288, 306, 361, 371, 411, 416, 423, 425], "0101487": 411, "010269": 411, "0103377": 411, "0103961": 411, "0104209": 411, "0105324": 411, "010552": 411, "0105865": 411, "0106293": 411, "0107115": 411, "0107712": 411, "0109527": 411, "0109669": 411, "0109927": 411, "0110537": 411, "0111132": 411, "0112255": 411, "0114194": 411, "011443": 411, "0116008": 411, "0116365": 411, "0116466": 411, "0116589": 411, "011705": 411, "0117535": 411, "011932": 411, "0119455": 411, "0120042": 411, "0120525": 411, "012078": 411, "0120946": 411, "0123966": 411, "0125696": 411, "0126225": 411, "0127448": 411, "0127799": 411, "0128144": 411, "0129116": 411, "0129936": 411, "013": 397, "0130778": 411, "0131335": 411, "0131446": 411, "0132428": 411, "0132869": 411, "0134367": 411, "013504": 411, "0135348": 411, "0135801": 411, "0137027": 411, "0137122": 411, "013742": 411, "0137691": 411, "0139037": 411, "0140129": 411, "0142343": 411, "0142667": 411, "0143274": 411, "0144483": 411, "0145757": 411, "0147718": 411, "0147951": 411, "0148329": 411, "0149058": 411, "015": 397, "0150624": 411, "0150693": 411, "0152068": 411, "0152199": 411, "0152997": 411, "0154121": 411, "0158702": 411, "0158714": 411, "0158773": 411, "0158951": 411, "016": [397, 411], "0161277": 411, "0161691": 411, "0161696": 411, "016186": 411, "0164591": 411, "0164699": 411, "0166254": 411, "0166666": 411, "0167419": 411, "0168147": 411, "0168219": 411, "0168348": 411, "016901": 411, "0169214": 411, "0170105": 411, "0170807": 411, "0170987": 411, "0171018": 411, "0176505": 411, "0177431": 411, "0177477": 411, "0177873": 411, "0179766": 411, "0180933": 411, "018228": 411, "0183481": 411, "0183895": 411, "0184267": 411, "0184384": 411, "018464": 411, "0187415": 411, "0192313": 411, "0192409": 411, "0192593": 411, "0192628": 411, "0193516": 411, "0193761": 411, "01_quickstart_neuralchat": 322, "01x": 425, "02": [288, 376, 411, 416, 425], "02002": 260, "0200457": 411, "0203923": 411, "0204832": 411, "0206321": 411, "0207462": 411, "0207504": 411, "0207815": 411, "0207876": 411, "0208901": 411, "021": 397, "0210726": 411, "0211151": 411, "0211298": 411, "0213786": 411, "0215163": 411, "0217062": 411, "0217468": 411, "0217822": 411, "0218703": 411, "0218969": 411, "02197": 411, "0220014": 411, "0221319": 411, "0222103": 411, "0222947": 411, "0223472": 411, "0224431": 411, "0231199": 411, "0231282": 411, "023182": 411, "0231979": 411, "0232584": 411, "0234498": 411, "0240415": 411, "024706": 411, "0247063": 411, "0248571": 411, "0249397": 411, "025032": 411, "0250395": 411, "0252901": 411, "0256871": 411, "0257188": 411, "0257262": 411, "0258341": 411, "0258802": 411, "0260486": 411, "0261888": 411, "0262706": 411, "0263137": 411, "0265272": 411, "0266731": 411, "0266886": 411, "0267483": 411, "0268136": 411, "0269904": 411, "0270028": 411, "027025": 411, "0270492": 411, "0274874": 411, "0275282": 411, "027535": 411, "0275467": 411, "0275881": 411, "0276086": 411, "028": [389, 411], "028166": 411, "028483": 411, "028568": 411, "0289719": 411, "0291396": 411, "0292454": 411, "0295362": 411, "0296385": 411, "02x": 425, "03": [288, 314, 349, 361, 411, 425], "0302293": 411, "0302746": 411, "0309886": 411, "0310083": 411, "031279": 411, "0314895": 411, "0317559": 411, "0317602": 411, "0318745": 411, "0319455": 411, "0321109": 411, "0321377": 411, "0323642": 411, "0325741": 411, "0326952": 411, "0329699": 411, "033": 397, "0333436": 411, "0336342": 411, "0340362": 411, "0341169": 411, "0341912": 411, "0342908": 411, "0345669": 411, "0346142": 411, "03474": 411, "0348388": 411, "0354192": 411, "0357023": 411, "0358603": 411, "0358752": 411, "03588": 411, "0363329": 411, "0364227": 411, "0365834": 411, "0366748": 411, "0367258": 411, "036978": 411, "036992": 411, "037": 397, "037334": 411, "0373579": 411, "0373802": 411, "0373823": 411, "0374397": 411, "0375": 389, "0375093": 411, "0375683": 411, "0376119": 411, "03762": 36, "0376949": 411, "0381385": 411, "03849": 411, "0387886": 411, "0389357": 411, "039": 397, "03923": 411, "0394101": 411, "039411": 411, "0395342": 411, "0397992": 411, "04": [288, 302, 304, 313, 314, 315, 316, 317, 318, 349, 366, 397, 411, 425], "0401657": 411, "0402931": 411, "0404778": 411, "0407051": 411, "0411331": 411, "0414047": 411, "0414834": 411, "0416614": 411, "0417964": 411, "0421644": 411, "042188": 411, "0423267": 411, "0426942": 411, "0427839": 411, "0428737": 411, "0429436": 411, "0429916": 411, "043787": 411, "044": 389, "044154": 411, "044202": 411, "0444861": 411, "0445693": 411, "0447282": 411, "0447548": 411, "044m": 389, "0451228": 411, "0454416": 411, "0454583": 411, "0455066": 411, "0458481": 411, "0459135": 411, "046": 411, "0460811": 411, "046201": 411, "0465882": 411, "0467291": 411, "0467462": 411, "0467998": 411, "0473412": 411, "0475549": 411, "0476463": 411, "0483781": 411, "0484067": 411, "0487342": 411, "04874": 411, "0487727": 411, "0489938": 411, "0490096": 411, "0496581": 411, "0497077": 411, "05": [266, 288, 346, 347, 397, 411, 425], "050021": 411, "0510217": 411, "0514668": 411, "0516788": 411, "0521326": 411, "0521595": 411, "0521945": 411, "0524509": 411, "0526609": 411, "053": 397, "0530097": 411, "0532543": 411, "0533513": 411, "053639": 411, "0537321": 411, "0537768": 411, "0538146": 411, "0538395": 411, "0539197": 411, "0543977": 411, "0549107": 411, "05516": 432, "0553082": 411, "0556653": 411, "0558945": 411, "0560297": 411, "0574189": 411, "0580473": 411, "0588583": 411, "0589148": 411, "0591283": 411, "0592912": 411, "0595001": 411, "0596004": 411, "059613": 411, "0596185": 411, "0597882": 411, "06": [288, 411, 425], "0600772": 411, "0603517": 411, "0603789": 411, "0604759": 411, "0609618": 411, "0609701": 411, "0610684": 411, "0612457": 411, "061272": 411, "0613803": 411, "0614806": 411, "0616695": 411, "0616923": 411, "062": 411, "0620034": 411, "0622484": 411, "0624729": 411, "0625579": 411, "0626013": 411, "063": 377, "0633017": 411, "0637226": 411, "0640577": 411, "0642402": 411, "0651551": 411, "0656322": 411, "066": 397, "0660571": 411, "06648": 411, "0665519": 411, "0668515": 411, "0677547": 411, "0677766": 411, "068": 411, "0687866": 411, "068835": 411, "069": 411, "0692752": 411, "0698868": 411, "06x": 425, "07": [288, 346, 397, 411, 425], "0700283": 411, "07006": 411, "0701429": 411, "0710327": 411, "0712915": 411, "0713578": 411, "0713821": 411, "0714324": 411, "0716356": 411, "0717247": 411, "0721208": 411, "0723144": 411, "0725632": 411, "0728843": 411, "0736189": 411, "0739962": 411, "074": 411, "0740655": 411, "0747271": 411, "075": 389, "0759107": 411, "076": [397, 411], "0760123": 411, "0765083": 411, "0765841": 411, "0771592": 411, "0780751": 411, "078109": 411, "0781101": 411, "0784417": 411, "0796627": 411, "08": [288, 361, 411, 425], "080936": 411, "0811198": 411, "0813271": 411, "0819725": 411, "0822007": 411, "0825026": 411, "0825665": 411, "0832193": 411, "0835321": 411, "0836219": 411, "0840322": 411, "0843776": 411, "0845544": 411, "0849766": 411, "085": 411, "0852": 411, "0854403": 411, "0854876": 411, "0855686": 411, "0870121": 411, "0873881": 411, "0876727": 411, "0879386": 411, "08794": 411, "0881114": 411, "0893092": 411, "0893345": 411, "08991": 411, "0899513": 411, "09": [288, 346, 411, 425, 426], "091": 397, "0922471": 411, "0923655": 411, "0933483": 411, "0933565": 411, "0938959": 411, "0943305": 411, "0946983": 411, "0948318": 411, "09557": 432, "0955952": 411, "0958787": 411, "096": 397, "0961662": 411, "09719": 411, "097692": 411, "0977256": 411, "0994565": 411, "0995304": 411, "0999998": 411, "0a0": [361, 432], "0e": [314, 349], "0f": 402, "0m": 389, "0x10": 410, "0x100": 410, "0x14": 410, "0x140": 410, "0x18": 410, "0x180": 410, "0x1c": 410, "0x1c0": 410, "0x20": 410, "0x200": 410, "0x24": 410, "0x240": 410, "0x28": 410, "0x280": 410, "0x2b0001b0": [425, 426], "0x2c": 410, "0x2c0": 410, "0x30": 410, "0x34": 410, "0x38": 410, "0x3c": 410, "0x4": 410, "0x40": 410, "0x400": 410, "0x8": 410, "0x80": 410, "0xc": 410, "0xc0": 410, "0xd000331": [397, 411], "1": [9, 14, 25, 27, 28, 32, 33, 36, 38, 39, 40, 44, 57, 246, 247, 252, 256, 257, 258, 260, 264, 266, 269, 281, 289, 298, 300, 303, 304, 305, 306, 308, 309, 313, 316, 319, 320, 321, 324, 326, 327, 328, 329, 330, 332, 334, 336, 337, 338, 340, 343, 344, 345, 350, 359, 363, 364, 365, 366, 369, 371, 372, 373, 375, 383, 384, 386, 387, 390, 391, 392, 395, 396, 397, 399, 401, 402, 403, 404, 405, 406, 408, 409, 410, 411, 413, 416, 418, 419, 421, 422, 423, 426, 428, 429, 432], "10": [288, 302, 308, 309, 314, 322, 346, 347, 349, 354, 361, 376, 388, 389, 397, 403, 411, 413, 425, 426], "100": [25, 33, 36, 44, 246, 247, 302, 314, 346, 347, 348, 349, 352, 354, 361, 376, 413, 422, 423, 425, 428, 429, 432], "1000": [346, 347], "10000": 259, "10004": [304, 305, 432], "1001": 411, "1002": 425, "1004": 411, "100424": 411, "10045": 425, "10049": 411, "1006": 411, "1007": 425, "10072": 397, "1008": 411, "101": [17, 255, 410], "101071": 411, "10117": 411, "1012": 411, "101206": 411, "10127": 411, "101434": 411, "1015": 411, "10159": 411, "1018": 411, "101844": 411, "1019": 411, "102": 20, "1020": 411, "1021": [397, 411], "102244": 411, "10231": 425, "1024": [17, 25, 346, 347, 372, 388, 389, 390, 411, 413, 425], "1024x256": 389, "1025": 411, "10259": 411, "1027": 411, "10270": 411, "10272": 411, "103": [309, 361, 366, 425], "103035": 411, "103083": 411, "103125": 411, "103126": 411, "1032": 411, "103379": 411, "103385": 411, "10370": 425, "10372": 411, "103927": 411, "104": [304, 425], "104267837": 319, "10428": 411, "104294": 411, "1043": 411, "1046": 411, "1047": 411, "10474": 411, "1048": 411, "10488": 425, "105": 425, "1050": [411, 425], "1051": 411, "105192": 411, "1053": 411, "1056": 411, "105656": 411, "10566": 425, "1057": 411, "1058": 411, "105849": 411, "106": [397, 411, 425], "1060": 425, "106089": 411, "1062": 425, "10621": 425, "10672": 411, "107": [410, 425], "1070": 411, "10703": 411, "10713": 425, "1072": 411, "10742": 411, "107514": 411, "1076": 411, "10763": 411, "108": 425, "1081": 397, "1082": 411, "1083": 411, "1085": 411, "1086": 397, "10860": 411, "1087": 411, "108718": 411, "1088": 411, "108899": 411, "109": 425, "1091": 397, "10917": 411, "1092": 411, "109308": 411, "1094": 411, "10940": 425, "10944": 432, "10947": 411, "1095": 411, "1096": 411, "10962": 411, "1097": 411, "1098": 411, "1099": 425, "10999": 411, "10e": 410, "10k": [247, 288, 425, 428], "10m": 397, "10x": 425, "11": [302, 304, 308, 340, 347, 361, 364, 365, 393, 403, 411, 425, 426, 427], "110": 425, "1100": 411, "11009": 425, "1102": [397, 411], "1103": 411, "11059": 411, "1106": [397, 411], "11064": 411, "1108": 411, "111": 425, "11116": 411, "111186": 411, "111211": 411, "1113": 411, "1114": 411, "1115": 411, "11180": 411, "112": [397, 411, 425], "1120": 411, "1123": 411, "1124": 411, "1125": 411, "1126": 397, "1128": 411, "112882": 411, "113": [397, 425], "1130": 397, "113174": 411, "1132": [411, 425], "11320": 411, "11322": 411, "11323": 397, "11327": 425, "1136": 425, "11368": 411, "1137": 397, "1138": 411, "11386": 425, "114": 410, "1140": 411, "11401": 411, "1142": 411, "1143": 411, "1144": 411, "11444": 411, "1145": 411, "11458": 411, "1147": 411, "11476": 411, "11484": 411, "115": [304, 425], "11503": 411, "1154": 411, "1156": 411, "1159": 411, "116": [389, 411, 425], "1160": 411, "116019": 411, "1162": 411, "11624": 411, "1163": 411, "11660": 425, "116701": 411, "11684": 411, "1169": 411, "117": [397, 425], "11707": 411, "11737": 411, "11741": 425, "1176": 411, "11793": 411, "118": [410, 425], "1184": 411, "118402": 411, "118429": 411, "1185": 411, "11860": 411, "11868": 425, "1188": 411, "119": [397, 411, 425], "11914": 425, "1192": [411, 425], "11943": 425, "11950": 411, "1196": 411, "119678": 411, "11970": 425, "1199": [411, 425], "119951": 411, "11a": 410, "12": [9, 30, 288, 308, 314, 332, 337, 349, 361, 386, 389, 397, 403, 407, 410, 411, 413, 425], "120": [410, 425], "1202": 425, "1203": 411, "12058": 425, "1207": 411, "12086": 411, "121": 425, "1210": 411, "12102": 397, "12104": 425, "1213": 411, "12147": 425, "1215": 411, "1218": 411, "1219": 411, "12190": 425, "122": 411, "1220": 397, "1224": 425, "122421": 411, "1226": 411, "12261": 411, "1228": 411, "1230": 397, "1232": 425, "1234": 371, "123429": 411, "12345": 252, "1235": 411, "123554": 411, "123585": 411, "1236": 411, "124": 397, "124072": 411, "1242": [411, 425], "124238": 411, "1244": 411, "1247": 411, "124749": 411, "124m": 428, "1250": 411, "125018112": 388, "1251": 425, "12526": 411, "1253": 411, "125344": 411, "12535": 397, "12537": 425, "12541": 425, "12548": 411, "12567": 411, "1257": 411, "125772": 411, "125m": [304, 428], "126545": 411, "126819": 411, "1269": 411, "127": [252, 309, 313, 314, 315, 316, 324, 326, 327, 328, 329, 334, 336, 337, 338, 340, 343, 344, 353, 361, 375, 389, 410, 411, 423, 425], "12702": 425, "1271": 411, "1273": 411, "1278": 397, "12788": 425, "128": [247, 302, 352, 388, 389, 393, 396, 397, 411, 413, 423, 425], "1280": [411, 413], "1281": 411, "1286": 411, "1287": 411, "1288": 411, "129": [411, 425], "1291": 411, "1292": 411, "1293": 397, "129767": 411, "1298": 411, "129806": 411, "12d": 410, "12k": [346, 347, 352], "12xlarg": [397, 411], "13": [288, 308, 349, 351, 361, 372, 397, 403, 411, 425, 426], "13001": 425, "1302": 411, "13031": 425, "1304": 397, "13064": 425, "1307": 411, "130834": 411, "130863": 411, "131": 397, "1310": 425, "13129": 397, "1313": 411, "13142": 425, "1315": [411, 425], "13154": 397, "1316": 425, "1319": 397, "132": 411, "1320": 411, "132552": 411, "1328969a": 319, "1329": 411, "133": 410, "1330": 425, "133295": 411, "1334": 411, "133647": 411, "1337": 411, "13381": 397, "134": 425, "1342": [397, 411], "134442": 411, "1345": 425, "1346": 411, "13466": 411, "1347": [397, 411], "134716": 411, "135054": 411, "13524": 425, "13529": 425, "135495": 411, "135532": 411, "13582": 425, "135839": 411, "13586": 425, "135864": 411, "1359": 425, "136": [279, 385], "13616": 425, "13621": 425, "13638": 425, "13639": 425, "13650": 425, "13674": 425, "13675": 425, "13686": 425, "137": 425, "13703": 425, "1371": 411, "13717": 425, "137361": 411, "138": 389, "1381": 411, "1382": 411, "13825": 425, "1383": 425, "1384": 411, "1385": 411, "1386": 411, "1387": 411, "13871": 425, "1388": 411, "139": 410, "139021": 411, "1392": 411, "139298": 411, "1393": 425, "1394": 411, "1397": 397, "13990": 397, "13b": [288, 323, 332, 346, 347, 351, 352, 428], "13k": 425, "14": [246, 288, 305, 350, 397, 403, 410, 411, 425], "140": [410, 425], "1403": 425, "1407": 411, "1408": 425, "1409": 397, "141": 397, "141097": 411, "1412": 411, "14124194128933833351": 390, "1413": 411, "141333": 411, "1414": 411, "1415": 411, "1417": 411, "141966": 411, "142": [304, 411, 425], "1422": 411, "1425": 411, "1426": [411, 425], "14263": 425, "1427": 411, "142778": 411, "143": 397, "1430": 411, "1435": 411, "1436": 411, "1437": 425, "144": 425, "1440": 411, "1441": [411, 425], "144231": 411, "1443": 411, "1444": 411, "1446": 411, "1449": 411, "1450": [411, 425], "145322": 411, "1456": 411, "1457": 411, "145836": 411, "1459": 411, "146": [410, 425], "1461": 411, "1464": 411, "146452": 411, "1465": 411, "146781": 411, "146935": 411, "147": 425, "1470": 411, "14737": 425, "1474": 397, "147474": 411, "1476": 411, "1478": 411, "148115": 411, "148369": 411, "1484": 397, "148512": 411, "1487": [397, 411], "14896": 425, "14905": 411, "1492": 411, "1495": 411, "1498": 411, "14993": 425, "14c": 410, "15": [38, 288, 369, 397, 403, 404, 409, 411, 425], "1501": 411, "150549": 411, "1506": 411, "1508": 411, "150k": 350, "1513": 411, "151649": 411, "15180": 425, "152": [17, 410, 425], "1523": 411, "1526": 411, "1527": 411, "15278": 425, "152848": 411, "152925": 411, "153086": 411, "1531": 411, "1534": 411, "1536": 347, "1539": 411, "154": 425, "1540": 411, "1544268": 361, "1545": 411, "15460": 397, "15462": 425, "1547": 411, "1549": 411, "155": 411, "15506": 425, "15525": 411, "1559": 411, "156168": 411, "156368": 411, "1565": 411, "157": 425, "157349": 411, "15748": 411, "157518": 411, "1578": 411, "1579": 411, "158": 425, "1581": 397, "158162": 411, "15834": 425, "1585": 411, "158502": 411, "158668": 411, "1589": 411, "159": [304, 410], "1594": 397, "159566": 411, "159911": 411, "16": [281, 288, 289, 304, 305, 314, 346, 347, 348, 349, 361, 388, 397, 403, 404, 405, 406, 409, 410, 411, 413, 423, 425], "160": [397, 410], "16004": 397, "1601": 411, "1602": 411, "160705": 411, "1609": 411, "161251": 411, "161443": 411, "1617": 411, "162": 411, "1622": 425, "1624": 411, "1627": 397, "163": 411, "163369": 411, "1637": 411, "1650": 425, "165192": 411, "165648": 411, "1658": 425, "1659": 397, "16591": 425, "166": [397, 425], "166153": 411, "1662": 411, "167": [410, 411, 425], "1671": [397, 411], "167473": 411, "167575": 411, "16771": 397, "168": [330, 332], "1680": 425, "16901": 425, "169119": 411, "1696": 411, "1698": 411, "169874": 411, "1699": 425, "16e": 410, "16gb": 325, "16x1": [403, 407], "16x16": 407, "16x16gb": [425, 426], "16x32": 403, "16x32x16": 407, "16x4": 409, "16xn": 405, "16xpad_n": 405, "17": [288, 317, 346, 347, 363, 389, 397, 403, 411, 425], "170": 425, "1702": 425, "1703": 425, "1706": [36, 397, 411], "1708": 260, "1710750809": 361, "1712": 411, "171434": 411, "17178": 425, "1719": 397, "172": 425, "172356": 411, "17245": 425, "17281": 425, "173": 425, "17323": 432, "17364": 411, "174": 410, "174091": 411, "174101": 411, "174215": 411, "1743": 411, "17436": 411, "17454": 425, "17468": 397, "1747": 411, "17496": 425, "175": [314, 349, 425], "1758": 425, "17585": 425, "17598": 425, "1760": 425, "176031": 411, "1762": 411, "176292": 411, "1763": 425, "176b": [272, 302], "177": 411, "17764": 411, "1777": 397, "178": 425, "178324": 411, "1786": 411, "1787": 411, "179": 397, "1792": 411, "1793": 425, "1795": 411, "179525": 411, "179593": 411, "179695": 411, "1797": 411, "17a": 410, "18": [17, 255, 288, 361, 397, 403, 411, 425], "180": 410, "1801": 411, "1804": 411, "1805": 411, "180921": 411, "181": 397, "18119": 425, "1813": 411, "1816": 411, "181783": 411, "182": 411, "1823": 411, "1825": 411, "1826": 411, "1828": 411, "1829": 411, "183": 425, "183003": 411, "183193": 411, "18324": 411, "18336": 425, "184": 411, "184256": 411, "184412": 411, "185": 389, "1850": 411, "1851": 411, "1857": 411, "18575": 397, "1858": 411, "186": 425, "18672": 425, "1869": 411, "187": [397, 410], "18708": 425, "1872": 397, "187933": 411, "188": 397, "1881": 411, "18824": 425, "1885": 411, "18868": 425, "188745": 411, "18876": 425, "1889": [377, 411], "18939": 425, "1895": 411, "1899": 411, "18d": 410, "19": [288, 397, 403, 411, 425, 426], "1904": 411, "190508": 411, "191": 361, "1910": 397, "1913": 411, "191564": 411, "1918": 411, "1919": 411, "192": [330, 332, 425], "1920": 411, "1924": 411, "193": [410, 425], "1930": 411, "193579": 411, "1936": 411, "193713": 411, "1938": 397, "1942": 411, "19463": 411, "195": [289, 425], "1952": 411, "195271": 411, "19536": 411, "1956": 411, "1964": 411, "197": [397, 411], "1971": 411, "1972": 411, "1979": 411, "198": 411, "1983": 25, "198303": 411, "1987": 411, "198987": 411, "199": 410, "1993": 397, "1994": 411, "19_": 369, "19x": 425, "1_1": 369, "1a": 410, "1a0": 410, "1a6": 410, "1ac": 410, "1b2": 410, "1b7": 428, "1b9": 410, "1bf": 410, "1c5": 410, "1cb": 410, "1d2": 410, "1d9": 410, "1e": [281, 314, 348, 349, 352, 376, 422], "1e0": 410, "1e7": 410, "1ed": 410, "1f": 369, "1f3": 410, "1f9": 410, "1m": 397, "1ubuntu2": 369, "1x": [304, 425, 426], "1x1": [17, 303, 390], "1x16": [403, 409], "1x4": [304, 409], "1\u6a21\u578b\u63d0\u4f9b\u52a0\u901f": 420, "2": [17, 20, 21, 25, 28, 29, 32, 36, 57, 256, 257, 260, 266, 269, 281, 289, 300, 303, 304, 305, 306, 308, 309, 320, 323, 327, 328, 329, 330, 332, 337, 340, 347, 350, 358, 366, 372, 373, 384, 386, 387, 389, 390, 391, 392, 395, 396, 397, 402, 403, 404, 409, 410, 411, 413, 415, 416, 418, 419, 420, 421, 422, 425, 426, 428, 432], "20": [28, 264, 288, 289, 302, 324, 361, 388, 393, 397, 403, 410, 411, 425], "200": [247, 361, 364, 365, 366, 410, 425, 432], "2000": [314, 349, 352], "20013": 425, "2003": 410, "2005": 411, "2007": 425, "2009": 411, "200k": 349, "2010": [397, 411], "2012": 425, "2013": 407, "2016": [25, 411], "2017": 361, "2019": 377, "202": 304, "2021": [266, 272, 302], "20210514": [425, 426], "2022": [272, 302, 358, 372, 397, 411, 432], "2023": [272, 316, 347, 415, 425, 426, 432], "202306": 432, "2024": [319, 361, 362], "2025": 411, "2031": 411, "2038": 411, "203901": 411, "2044": 411, "2048": [17, 247, 288, 413, 432], "204966": 411, "204973": 411, "205": [397, 411], "20505": 397, "2055": [411, 425], "206": 410, "2060": 411, "206049": 411, "207": 397, "2071": 411, "20787": 425, "20824": 411, "2085": 425, "208555": 411, "2086": 411, "2089": 411, "209526": 411, "20b": [314, 315, 349], "20c": 410, "20k": 349, "20m": 397, "21": [9, 288, 403, 411, 425], "210": 425, "211": [397, 411, 425], "2110": 411, "2116": 397, "2118": 425, "211893": 411, "2119": 411, "212": 410, "2120": 425, "2121": 425, "212152": 411, "21269": 411, "2129": 425, "2131": [397, 411], "2134": 411, "21341": 425, "213454": 411, "214": 425, "214208": 411, "21431": 411, "2146": 411, "2148": 411, "215": 411, "2150": 397, "2156": 411, "21568": 425, "2160": 411, "2163": 411, "216338": 411, "2165": 411, "217": 397, "2174": 425, "2181": 397, "218765": 411, "219": [410, 411, 425], "219777": 411, "21f": 410, "21x": 304, "22": [288, 314, 315, 349, 361, 372, 403, 411, 425], "2201": 411, "220585": 411, "2206": 432, "220947": 411, "220994": 411, "221": [397, 425], "2210": [411, 432], "2211": 411, "222": 425, "2220": 411, "22241": 411, "222661": 411, "2229": 411, "2232": 411, "223615": 411, "22389": 425, "2239": 411, "224": [20, 410, 411], "224925": 411, "22499": 425, "225": 410, "225023": 411, "2251": 411, "2263": 397, "2266": 411, "2267": 411, "227": 425, "2271": 411, "2274": 425, "22776": 425, "227976": 411, "228043": 411, "2284": 385, "2285": 411, "228752": 411, "2290": 425, "22951": 425, "229837": 411, "22b": 410, "23": [288, 309, 337, 351, 361, 364, 365, 372, 403, 411, 425, 426], "2301": 411, "2306": 432, "2308": 425, "2309": 432, "230945": 411, "231": 425, "2310": 432, "232": [410, 411], "2320": 411, "2326": 411, "233057": 411, "233231": 411, "234": 425, "2342": 425, "2345": 349, "235": 397, "2351": 397, "2354": 425, "2357": 411, "2359": 425, "236101": 411, "236418": 410, "2365": 425, "2369": 411, "237": [366, 411], "2377": 411, "23772": 425, "238": [410, 425], "238855": 411, "23e": 410, "24": [57, 288, 361, 395, 397, 403, 411, 425], "24038": 411, "2404": 411, "240739": 411, "2409": 397, "241": 397, "2415": 411, "242": [304, 397, 411], "2420": 411, "2421": 425, "242512": 411, "2427": 425, "2429": 425, "243012": 411, "2433": 411, "2435": 411, "2439": 411, "244": [397, 410, 425], "2449": 411, "245": [397, 411], "2463": 411, "2467": 425, "247251": 411, "247491": 411, "2475": 411, "24910": 397, "24b": 410, "25": [260, 288, 346, 350, 372, 403, 411, 413, 425], "2504": 425, "2505": [397, 411], "2507": 411, "250t": 351, "251": [304, 411], "2510": 411, "251221": 411, "2513": [411, 425], "252": [304, 410], "2525": 411, "253": 304, "2537": 411, "254835": 411, "25485": 397, "255": [21, 397, 408, 423], "255199": 411, "255598": 411, "2558": 411, "256": [247, 259, 376, 389, 411, 413], "256619": 411, "256635": 411, "256715": 411, "2568": 411, "256gb": [425, 426], "256px": 348, "256x1024": 389, "256x256": [389, 413], "257138": 411, "2576": 397, "2578": [411, 425], "257989": 411, "25799": 411, "258": 397, "2580": 411, "2582": 411, "259": [397, 410, 411], "259051": 411, "2594": 411, "26": [288, 361, 403, 410, 411, 425], "260": 410, "26056": 411, "2608": 411, "261": 397, "261028": 411, "2612": 411, "261265": 411, "2615": 411, "262": [397, 425], "2624": 411, "263": 397, "2633": 411, "263316": 411, "264": 411, "2642": 411, "2643": [411, 425], "2652": 397, "2653": [411, 425], "26552": 425, "266": 397, "2663": 411, "2665": 425, "2669": 425, "266945": 411, "267": 410, "267289": 411, "2673": 411, "267367": 411, "2677": 425, "2678": 411, "2683386": 25, "2686": 425, "2689": 425, "269": 304, "2693": 425, "2694": 411, "269504": 411, "2697": 411, "26974": 411, "2698": [411, 425], "2699": 411, "26e": 410, "27": [288, 361, 403, 411, 425], "2701": 425, "2703": 425, "2704": 411, "2706": 425, "2709": 425, "271": 411, "271587": 411, "2718": 411, "2720": 425, "2721": 425, "2725": 425, "27264": 425, "2728": 425, "2729": 425, "2730": 425, "273363": 411, "2735": 411, "2737": 425, "274": 397, "2741": 397, "27412": 385, "2742": 425, "2743": 411, "274441": 411, "2746": 411, "275": [397, 410, 425], "2751": [411, 425], "2753": 425, "27579": 425, "2758": 425, "2763": 425, "2768": 425, "2774": 411, "277815": 411, "2783": 411, "2784": 411, "2795": 411, "2796": 353, "27c": 410, "28": [288, 304, 337, 361, 397, 403, 411, 425], "28032": 425, "2804": 397, "280686": 411, "2807": 411, "281": 425, "2813": 411, "2815": 411, "282": 397, "2821": 411, "2822": 411, "282241": 411, "2824": 411, "2825": 425, "2828": 411, "283": 410, "283046": 411, "2831": 411, "28321": 411, "2834": 411, "283445": 411, "2835": 411, "2836": 411, "28399": 397, "284": 411, "2842": [411, 425], "2844": 411, "2846": 411, "28479": 425, "2850": 411, "2854": 411, "2856": 411, "2858": 411, "28593": 411, "286141": 411, "286461": 411, "2866": 411, "2867": 425, "2868": 425, "2869": 411, "286973": 411, "287": 411, "2870": 411, "2871": 411, "2876": [411, 425], "2879": 411, "2882": 411, "288236": 411, "2889": 411, "289": 411, "2896": [411, 425], "2898": 411, "28a": 410, "29": [288, 403, 411, 425, 426, 427], "2901": 411, "2902": 411, "2906": 411, "291": [410, 425], "2918": 397, "2919": 411, "2921": 397, "29220": 410, "2923": 411, "2928": 397, "293": 425, "2930": 411, "2931": 411, "2935": 411, "2944": 411, "29501": 347, "2953": 411, "2954": 411, "2958": 411, "296": 397, "2962": 411, "2965": [397, 411], "2969": 411, "297": 411, "2970": 411, "2974": 411, "2975": 411, "298": [410, 411], "2980": 411, "2983": 411, "298489": 411, "2988": 411, "298907": 411, "2994": 411, "2995": 411, "299561": 411, "29a": 410, "29c": 410, "29e": 410, "29gvlhfosjhehtgql4hgxp": 361, "2a0": 410, "2a1": 410, "2a2": 410, "2a5": 410, "2b": 349, "2b_peft_finetuned_model": 349, "2c": 410, "2d": [260, 399, 413], "2e": [314, 376], "2nd": [25, 405, 408], "2x1": 409, "2xk": 402, "3": [20, 21, 25, 57, 256, 257, 281, 289, 300, 303, 304, 305, 308, 309, 320, 322, 323, 324, 327, 328, 329, 330, 331, 332, 337, 340, 345, 347, 350, 351, 362, 371, 383, 384, 385, 386, 387, 389, 390, 391, 392, 393, 394, 395, 396, 397, 401, 403, 404, 409, 411, 413, 414, 416, 420, 421, 422, 425, 426, 432], "30": [28, 288, 351, 369, 403, 425], "300": [376, 429, 432], "3008": 411, "300k": 349, "301": 397, "3010": 411, "3011": 411, "3018": [411, 425], "302": 411, "3025159985633461085": 390, "3026": 425, "303": 397, "3030": 411, "303455": 411, "3035": 411, "30458": 411, "3046": 425, "3049": 411, "3050": 411, "30522": 388, "3053": 411, "3058": 411, "3060": 425, "3064": 425, "307141": 411, "3072": [25, 411], "3077": 411, "307908": 411, "308": 425, "3080": 425, "3085": 411, "309195": 411, "3093": 411, "30b": [288, 428], "31": [288, 332, 346, 385, 397, 403, 404, 411, 425], "310": 425, "3113": 411, "311348": 411, "3113761e": 354, "311691": 411, "3117": 411, "3121": 411, "31211": 397, "3125": 425, "313": 425, "3130": 411, "3132": 411, "313656": 411, "31382": 425, "3147": 411, "3148": 411, "315": 304, "31592": 411, "316": 397, "317": 411, "317204": 411, "317837": 411, "318": [397, 411, 425], "318094": 411, "3185": 411, "3191": 425, "31929": 411, "319865": 411, "31x": 425, "32": [247, 288, 305, 371, 385, 388, 396, 397, 403, 404, 406, 407, 408, 409, 410, 411, 413, 425, 426, 427, 428, 432], "320": [397, 425], "3219": 411, "3226": 397, "3227": 411, "3230": 411, "323476": 411, "3235": 411, "3237": 411, "324": [377, 411, 425], "3240": 411, "3241": 411, "3245": 411, "3255": 397, "3264": 397, "326917": 411, "3276": 411, "328": 425, "3284": 411, "3288": 411, "3290": 425, "32966": 425, "32x16": 403, "32x4d": 17, "32x8d": 17, "33": [288, 304, 346, 350, 361, 396, 411, 425], "330": 425, "3300": [397, 411], "3306": 334, "3307": 425, "3314": 411, "332": [397, 411], "332153": 411, "3322": 411, "33246": 411, "3325": 411, "333": 411, "33386": 411, "3341": 411, "3348": 425, "3353": 411, "336": 350, "336519": 411, "3368": 397, "3369": 411, "336px": 350, "337": 411, "337529": 411, "3377": 411, "338": 425, "3382": 411, "3389": 425, "339": 397, "3391": 411, "3393": 411, "3394": 411, "3399": [411, 425], "33x": 304, "34": [17, 255, 288, 304, 324, 326, 343, 346, 348, 349, 352, 372, 384, 411, 425], "3405": 425, "3408": 425, "340939": 411, "3412": 425, "342843": 411, "3433": [411, 425], "3436": 411, "3441": 411, "34423": 411, "3448": 411, "345": 411, "3453": 411, "3462": 411, "346369": 411, "3467": 411, "3479": 425, "348": [411, 425], "3487": 411, "3489": 411, "349": 397, "3494": [411, 425], "34b": [309, 326, 330, 332], "35": [288, 304, 323, 327, 328, 329, 330, 366, 397, 411, 425], "350": 385, "350147": 411, "350m": [323, 428], "351": 411, "3519": 411, "3522": [411, 425], "353": 397, "3532": 411, "3538": 411, "354": 428, "3542": 428, "3543": 411, "355651": 411, "3557": 411, "3563": 425, "3572": 411, "357348": 411, "3576": 411, "358": 411, "3583": 411, "3584": 411, "3585": 425, "35873": 425, "358769": 411, "358932": 411, "3590": 411, "359791": 411, "36": [288, 345, 351, 361, 383, 385, 397, 411, 425], "360": [397, 420], "3601": 411, "3604": 411, "3606": 411, "3616": 411, "3617": 425, "3626": 411, "363": 397, "36322": 411, "3634": [397, 411], "364": 397, "3641": 411, "3642": 411, "3646": 411, "3647": [397, 425], "3650": 425, "3651": [411, 425], "3659": 411, "366": 389, "366328": 411, "3678": 397, "368": [411, 425], "3681": 397, "3684": 397, "3694": 411, "369429": 411, "369466": 411, "3698": 411, "37": [288, 346, 397, 411, 425], "3701": 411, "3712": 411, "3725": 425, "3730": 411, "3732": 411, "37333": 411, "3736": 411, "3739": 425, "3741": 411, "375": 411, "3752": 425, "375284": 411, "37537": 411, "3755": 411, "3757": 428, "3758": 411, "3761": 425, "376539": 411, "377": 411, "379": 428, "379699": 411, "3797": 425, "3798": 425, "379899": 411, "3799": 425, "37m": 386, "38": [288, 346, 361, 397, 410, 411, 425], "3800": 411, "3804": 428, "380582": 411, "3813": 425, "3822": 411, "382208": 411, "3823": 411, "3829": 411, "3833": 411, "384": [30, 390, 397, 411], "3848": 411, "3849": 411, "3850": 411, "3855": 411, "386": 411, "3868": 411, "387": 411, "3882": 411, "3887": 428, "3889": 411, "389": 397, "3894": 411, "3898": 411, "3899": 425, "39": [288, 346, 372, 397, 411, 425], "390": 425, "39024": 411, "391055": 411, "3912": 411, "391387": 411, "3919": 411, "392": 397, "39218": 397, "3927": 411, "393": 425, "3930": 428, "3933a071": 25, "3934": 411, "3940": 411, "3943": 411, "3947": [411, 428], "3952": 411, "3956": 411, "396634": 411, "397": 425, "3979": 425, "398": [361, 425], "3983": 411, "398509": 411, "3986": 411, "3991": 411, "39914": 425, "3993": 411, "3999": 425, "3a14": [425, 426], "3b": 428, "3b3f03e3f12": 319, "3c89": 319, "3d": [16, 19, 406, 413, 437], "3e": [376, 410], "4": [28, 36, 44, 57, 246, 247, 256, 257, 258, 263, 281, 289, 298, 300, 303, 304, 308, 313, 316, 319, 320, 323, 324, 326, 327, 328, 329, 330, 332, 337, 343, 347, 348, 349, 350, 354, 363, 366, 372, 374, 387, 389, 390, 391, 394, 395, 396, 397, 403, 404, 405, 406, 409, 410, 413, 420, 421, 422, 425, 426, 427, 428, 429, 432], "40": [288, 346, 390, 397, 420, 425], "4018": 425, "402406": 411, "4036": [411, 425], "4041": 411, "4047": [411, 425], "4049": 425, "405": 397, "4050": 411, "4057": 411, "4061": 411, "407388": 411, "408": 411, "408357": 411, "4084": 411, "409": 397, "4090": 411, "4096": [304, 411, 425], "41": [288, 304, 321, 346, 397, 411, 425], "410": 425, "4101": 425, "4107": 411, "412174": 411, "412912": 411, "4132": 411, "4133": 425, "4142": 411, "4147": 411, "4149": 428, "415": [304, 397, 425], "4154": 410, "4155": 410, "4156": 410, "4157": 410, "41598": 397, "415c": 410, "415d": 410, "415e": 410, "415f": 410, "416": 425, "4161": 411, "4164": 411, "416571": 411, "4167": 411, "4172": 428, "4176": 411, "418491": 411, "419": 425, "4191": 425, "41x": 425, "42": [288, 346, 397, 411, 425], "4200": 411, "4201": 411, "420619": 411, "4208": 411, "42134": 411, "42145": 411, "421781": 411, "422": 361, "4221": 411, "4224": 411, "4225": 411, "422517": 411, "4226": 411, "4228": 425, "423052": 411, "4248": 411, "4253": 411, "4262": 411, "4269": 425, "4275": 425, "4285": 411, "42874": 425, "429": [397, 410], "429166": 411, "4294": 411, "43": [288, 346, 397, 411, 425], "430": 425, "430288": 411, "432": 425, "4321": 411, "4334": 411, "433492": 411, "4339": 397, "4347": 411, "4352": 411, "435488": 366, "4356": 411, "4361": 425, "4366": 411, "437": 411, "4370": 411, "4373": 411, "4374": 425, "4383": 411, "4384": 425, "4389": 425, "439": 425, "4395": 425, "4398": 411, "44": [288, 304, 361, 372, 389, 397, 410, 411, 425], "4402": 411, "4409": 425, "4418": 411, "442": 397, "4430": 411, "44309": 411, "4433": 411, "4435": 425, "444133": 411, "4445": 411, "4448": 425, "4457": 411, "446": [397, 425], "4460": 397, "446442": 411, "4466": 411, "447": 425, "4481": 411, "4483": [411, 425], "4485": 411, "45": [288, 346, 351, 397, 411, 425], "4500": 411, "4501": 411, "451": 411, "4516": 428, "4517": 411, "4520": 411, "4521": 411, "4523": 425, "4526": 411, "4533": 428, "454": 361, "45434": 425, "4551": 411, "4553": [411, 425], "4559": 411, "456": [397, 411], "4561": 425, "4568": 411, "4579": 411, "458": 425, "4582": 411, "4586": 411, "459915": 411, "46": [288, 346, 397, 411, 425], "461b": 319, "462": 411, "4627": 411, "462737": 411, "4628": 411, "4632": 411, "4634": 428, "4636": 411, "4638": 411, "465": 411, "4650": 411, "4654": 411, "4658": [410, 411], "467": 397, "4683": 411, "46x": 425, "47": [288, 304, 331, 346, 397, 411, 425], "4701": 411, "4707": 425, "4714": 411, "4723": 425, "472466": 411, "4727": 411, "473": 397, "4737": 425, "4746": 425, "475": 385, "4750": 397, "475444": 411, "4769": 411, "47752": 411, "4784": 411, "4786": 411, "48": [28, 288, 331, 397, 411, 425], "4800": [425, 426], "4802": 411, "480308": 411, "4806": 411, "4807": 425, "4808": 397, "4822": 425, "4828": 428, "4829": 411, "483": 411, "483053": 411, "4834": 425, "4838": 411, "484": 397, "4858": 411, "48699": 425, "487": 397, "4873": 411, "4876": 411, "488558": 411, "489b": 319, "48b9": 319, "48x": 425, "49": [288, 304, 346, 395, 397, 411, 425], "4904": 411, "4906": 428, "49120": 397, "4913": 425, "4914": 425, "4920": 397, "4936": 428, "4940": 425, "4948": 411, "4951": 425, "4971": 425, "497127": 411, "4972": 411, "4980": [425, 428], "499": 397, "4990": [411, 425], "4993": [411, 425], "4997": 425, "4_bit_llama2": 432, "4a": 410, "4bbb": 332, "4c8b3f": 410, "4c8b6f10": 410, "4c8b7708": 410, "4d": 413, "4ddp": [314, 349], "4e": 352, "4g": 366, "4th": [272, 302, 349, 354], "4x": 409, "4x1": [28, 389, 399, 409], "4x16": [408, 409], "4x4": 409, "5": [9, 25, 28, 57, 133, 134, 135, 136, 216, 217, 218, 221, 222, 223, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 247, 266, 281, 288, 289, 303, 314, 320, 321, 324, 337, 346, 347, 349, 350, 351, 354, 369, 372, 387, 388, 389, 391, 394, 395, 397, 403, 411, 413, 425, 426, 428, 432], "50": [17, 25, 255, 266, 288, 302, 346, 410, 425], "500": [354, 422], "5000": 354, "5005": 425, "500698": 411, "501": 397, "5011": 425, "5018": 428, "5019": 425, "501ff80d3f56": 332, "5024": 425, "5025": 425, "50257": 354, "5031": 411, "503341": 411, "5045": 425, "5046": 411, "5048": 428, "505": 411, "5050": 411, "5057": 428, "506": 411, "5071": 289, "5076": 411, "508": 411, "5084": 425, "5085": 425, "5087": 411, "509": 397, "5094": 411, "50m": 397, "51": [288, 346, 397, 411, 425], "510039": 411, "51009": 411, "5108": 411, "5119": 425, "512": [17, 247, 288, 332, 346, 372, 375, 389, 404, 406, 409, 411, 413, 425], "513": 397, "5137": 425, "514108": 411, "5147": 425, "5149": 411, "515": 397, "5151": 425, "5153": 411, "5156": 411, "515k": 350, "5164": 425, "517278": 411, "5173": 425, "5176": 425, "518": 428, "518276": 411, "5184": 425, "5185": 428, "518614": 411, "5190": 425, "5192": 425, "5193": 425, "5197": 425, "5199": 425, "52": [288, 346, 371, 411, 425], "5202": 411, "5206": 425, "5207": 425, "5210": 425, "5212": 425, "5213": 425, "5222": 425, "5223": 425, "5225": 425, "5227": 425, "5228": 425, "5230": 425, "5231": 425, "5232": 425, "5234": 425, "5241": 425, "5242": 425, "5243": 425, "5244": 425, "5245": 425, "525": 411, "52509": 411, "5252": 425, "5252279507a7": 319, "5253": 425, "52532": 361, "5257": 425, "526": 411, "5262": 411, "527": [397, 411], "5271": 411, "5274": 425, "5280": 397, "5282": 425, "5284": 411, "529": 411, "52k": [314, 349], "53": [288, 353, 410, 411, 425], "5303": 411, "53204": 411, "533329": 411, "5337": 411, "5346": 425, "534615": 411, "5361": 411, "5366": 411, "537405": 411, "5376": 411, "538": 332, "53x": 425, "54": [288, 346, 354, 410, 411, 425], "5408": 411, "5418": 411, "54288": 411, "5432": 425, "5436": 428, "543634": 411, "5439": 411, "5440": 397, "544194": 411, "5443": 428, "545": 397, "5457": 411, "5478": 411, "5482": 397, "5488": 425, "549": 397, "5498": 411, "54x": 425, "55": [288, 304, 346, 372, 410, 411, 425], "5503": 411, "5507": 425, "5513": 411, "5518": 411, "5521": 411, "5535": 425, "5541": 411, "5544": 425, "5552": 428, "5555": [411, 425], "556249": 411, "55628": 397, "5566": 425, "557": 397, "5578": 411, "558061": 411, "558473": 411, "558k": 350, "5593": 428, "5594": 425, "55it": 361, "56": [288, 304, 310, 314, 346, 349, 361, 385, 410, 411, 425, 426], "5600": 411, "5604": 411, "560m": 428, "561317": 411, "5615": 411, "5617": 411, "561805": 411, "562": 397, "5624": 411, "5633": 411, "564": 411, "5644": 411, "564787": 411, "5652": 411, "5662": 411, "5672": 411, "569": 411, "5692": 411, "5695": 411, "56982": 410, "57": [288, 351, 411, 425], "5703": 411, "5713": 411, "573": 411, "5733": 425, "5742": 428, "5748": 411, "5764": 428, "5770": 411, "5772": 411, "578": [397, 411], "5781": [411, 425], "5789": [411, 428], "57x": 425, "58": [288, 304, 366, 411, 425], "5805": 411, "581": 411, "5810": 411, "5811": 425, "5820": 411, "5822": 411, "582871": 411, "583": 411, "5843": 425, "586": 411, "5861": 411, "587": [397, 411], "5876": 411, "588": 411, "5884": 411, "589": 397, "589803": 411, "59": [288, 372, 411, 425], "5912": 411, "592": 411, "592043": 411, "5923": 411, "5933": 411, "5953": 411, "5956": 411, "5959": 411, "59625": 411, "596568": 411, "5968": 411, "5969": 411, "5970": 411, "5977": [411, 428], "598": 411, "5980": 411, "598168": 411, "5986": 411, "599": 397, "59902": 361, "5993": 425, "5_13b": 351, "5_13b_val": 351, "5_adam": 352, "5_finetun": 376, "5b": [410, 428], "5c": 410, "5d": 410, "5e": [346, 347], "5ghz": [397, 411], "5x": [272, 420], "6": [57, 281, 303, 304, 313, 320, 330, 331, 347, 361, 369, 386, 387, 391, 395, 397, 401, 403, 410, 411, 423, 425, 426, 427, 428], "60": [288, 346, 376, 425], "600": [330, 332, 411, 423], "601": 411, "602": 397, "6023": 411, "6026": 397, "6034": 425, "6055": 425, "606477": 411, "608": 411, "6080": 397, "6081": 397, "60813": 411, "609": [397, 411], "61": [288, 346, 372, 411, 425], "6100": 411, "611059": 411, "6114": 425, "611718": 411, "613": 397, "6133": 411, "614": 397, "614109": 411, "6146": 411, "615338": 411, "616": 411, "6161": [411, 425], "6162": 411, "618": 397, "619": 397, "62": [288, 372, 410, 411, 425], "620": 411, "6201": 425, "62123": 411, "62126d40b8d7": 410, "62126d40b8f7": 410, "62127540b8cf": 410, "62127540b8ef": 410, "62127d40b8c7": 410, "62127d40b8e7": 410, "6221": 397, "62241": 411, "62409": 411, "62427d48183f": 410, "62427d48187f01": 410, "62427d48187f02": 410, "62427d48187f03": 410, "62427d48187f04": 410, "62427d48187f05": 410, "62427d48187f06": 410, "62427d48187f07": 410, "62427d48187f08": 410, "62427d48187f09": 410, "62427d48187f0a": 410, "62427d48187f0b": 410, "62427d48187f0c": 410, "62427d48187f0d": 410, "62427d48187f0e": 410, "62427d48187f0f": 410, "6246": 397, "6247": 428, "625089": 411, "62510d48eff6": 410, "62511548efe": 410, "62511d48efe4": 410, "62512d48efd2": 410, "62513548efc9": 410, "62513d48efc0": 410, "62517c48114506": 410, "62517c48114d07": 410, "62517c48115508": 410, "62517c48116509": 410, "62517c48116d0a": 410, "62517c4811750b": 410, "626": 366, "6263": [397, 411], "627": 397, "628": 397, "6289": 425, "629": 411, "6290": 397, "62926d40b8d7": 410, "62926d40b8f7": 410, "62927540b8cf": 410, "62927540b8ef": 410, "62927d40b8c7": 410, "62927d40b8e7": 410, "6297": 428, "62c17c481006": 410, "62c17c48104603": 410, "62c17c48104606": 410, "62c17c48104609": 410, "62c17c48104e01": 410, "62c17c48104e04": 410, "62c17c48104e07": 410, "62c17c48104e0a": 410, "62c17c48105602": 410, "62c17c48105605": 410, "62c17c48105608": 410, "62c17c4810560b": 410, "62d17c48114500": 410, "62d17c48114d01": 410, "62d17c48115502": 410, "62d17c48116503": 410, "62d17c48116d04": 410, "62d17c48117505": 410, "62f14d48eff6": 410, "62f15548efe": 410, "62f15d48efe4": 410, "62f16d48efd2": 410, "62f17548efc9": 410, "62f17d48efc0": 410, "63": [288, 372, 396, 397, 411, 425], "6313": 411, "6316": 411, "632": 411, "6322": 397, "63282": 411, "633": 397, "634": 361, "6341": 411, "6342": 425, "635554": 411, "635729": 411, "6362": [411, 425], "6365": 428, "6374": 397, "6378": 425, "638": 411, "6392": 428, "63x": 425, "64": [25, 259, 281, 288, 321, 327, 328, 329, 349, 372, 376, 389, 396, 397, 405, 407, 408, 410, 411, 413, 425, 426], "6404": 428, "641": 389, "641585": 411, "64247": 411, "64253": 411, "642672": 411, "6432": 411, "6437": 428, "644": 411, "6449": 397, "645": 411, "6462": 411, "6477": 397, "6487": 411, "64963": 411, "6499": 428, "64byte": 399, "65": [288, 411, 425], "65059": 411, "6509": 411, "6510": 385, "6518": 411, "652": 411, "6542": 428, "6543": 411, "655": 428, "6569": 428, "658": 411, "65b": [288, 428], "65k": 348, "65x": 425, "66": [288, 411, 425], "661b400b8983": 319, "6621": 428, "6633": 411, "6637": 425, "664": 411, "6659": 397, "668": 411, "66b": 428, "67": [288, 372, 411, 425], "6702": 411, "6718": 428, "6735": 428, "6737": 397, "6740": 428, "675": 411, "6757": 425, "6759": 411, "6760": 411, "6769": 428, "679": 411, "6796": 411, "6798": 425, "67x": 425, "68": [20, 21, 288, 304, 410, 411, 425], "680": 397, "6804": 428, "6814": 428, "682": 411, "6821": 428, "6831": 428, "68383": 411, "684": 397, "6847": 425, "685": 411, "685382": 411, "686": 411, "6860": 425, "6866": 428, "687": 397, "6872": 428, "6895": 428, "69": [288, 346, 411, 425], "690": [397, 411], "6917": 411, "69186": 411, "6923": 411, "693": 389, "694533": 411, "6947": 425, "695": 411, "6953": 428, "6968": 425, "697": 411, "697876": 411, "698": [397, 411], "699579": 411, "6b": [272, 302, 304, 309, 354, 428, 432], "6f": 410, "7": [55, 57, 281, 288, 304, 320, 347, 351, 361, 386, 387, 391, 393, 395, 397, 403, 411, 413, 416, 423, 425, 428], "70": [288, 389, 397, 425], "700": [330, 332], "7009": 411, "701": [397, 411], "701639": 411, "7019": [411, 425], "702": 411, "7021": 411, "703": [411, 425], "703207": 411, "704": 411, "70404": 411, "705": 411, "708": 411, "7081": 425, "70b": [288, 309, 326, 330, 332, 349], "71": [288, 397, 410, 411, 425], "711": 411, "711146": 411, "712": 411, "7121": 411, "7128": 428, "7143": 428, "7149": 428, "718776": 411, "718893": 411, "719": 397, "7192": 397, "72": [288, 411, 425], "720963": 411, "7213": 411, "722": 411, "7221": 428, "7225": 411, "724": [397, 411], "7256": 411, "726": 389, "7261": 411, "7262": 428, "7265": 411, "727": 411, "7282": 411, "729": [397, 425], "73": [288, 411, 425], "730678": 411, "7307": 411, "73162": 411, "7324": 411, "7326": 428, "7330": 428, "7334": 411, "7336": 376, "734": 411, "7341": 411, "735": 411, "7354": 411, "7357": 428, "7361": 428, "7369": [411, 428], "737": 411, "737943": 411, "738": 411, "7385": 376, "738939": 411, "7398": 428, "73x": 425, "74": [288, 397, 411, 425], "741": 411, "742": [411, 425], "743": 411, "7442": 411, "7445": 411, "745": [361, 411], "745357": 411, "7466": 411, "747": [397, 411], "74845": 411, "7488": 411, "749f02a5": 332, "75": [288, 346, 351, 385, 397, 411, 425], "750": 397, "75007": 377, "7502": 411, "7512": 411, "7516": 425, "7518": 411, "752": 397, "7520": 411, "753": 411, "75328": 411, "753487": 411, "75384": 397, "754": 411, "755": [411, 425], "756": [397, 411], "75786": 411, "758": 411, "759": 411, "7590": 428, "7599": 425, "75x": 425, "76": [288, 304, 346, 410, 411, 425], "760": 425, "7600": 425, "7608": 411, "761": 411, "7627": 428, "7637": 411, "764": 411, "76407": 411, "7643": 411, "7647": 411, "765": 411, "7651": 411, "767569": 411, "768": [372, 397, 411], "769": 397, "7690": 411, "77": [288, 346, 411, 425], "77082": 411, "771439": 411, "77317": 411, "77444": 411, "774m": 428, "775294": 411, "7759": 428, "7770": 411, "7774": 411, "7777": 336, "778244": 411, "7794": 411, "7799": 411, "77it": 361, "78": [288, 411, 425], "7803": 411, "781": 411, "7815": 411, "7833": 411, "784": 411, "7840": 428, "7850": 411, "786": [397, 411, 425], "7860": [345, 383, 384], "787": 411, "788": [397, 411], "789777": 411, "79": [288, 346, 372, 411, 425], "7901": 397, "7908": 428, "7924": 425, "7929": 425, "793": 425, "793822": 411, "7941": 411, "7957": 428, "7965": 411, "797": 411, "7978": 411, "798": 397, "799": 411, "7b": [288, 309, 313, 314, 315, 316, 319, 321, 324, 327, 328, 329, 331, 334, 338, 340, 343, 344, 345, 346, 350, 352, 361, 363, 364, 366, 371, 375, 377, 383, 384, 396, 420, 422, 427, 428, 429, 432], "7b1": 428, "7b86016aa1d2107440c1928694a7bba926509887": 427, "7c": 410, "7th": 377, "8": [57, 246, 247, 281, 302, 304, 305, 314, 320, 326, 330, 346, 347, 348, 349, 352, 354, 366, 387, 389, 391, 393, 395, 397, 401, 402, 403, 409, 410, 411, 413, 422, 423, 425, 426, 428, 432], "80": [288, 304, 321, 345, 383, 384, 389, 397, 411, 425], "8000": [309, 313, 316, 318, 323, 324, 326, 327, 328, 329, 330, 331, 332, 337, 338, 343, 344, 361, 363, 367, 375], "8008": 361, "801": 389, "8011": 425, "802": 411, "8021": [364, 365, 366, 425], "8029": 411, "8033": 411, "8034": 411, "804": 411, "8045": 425, "805": 411, "806": 411, "8061": 425, "807": 397, "81": [288, 361, 411, 425], "8101": 425, "8102": 411, "8105": 425, "811": 411, "8127": 425, "813": 411, "8148": 411, "816": 397, "817": [397, 411], "818": 411, "819": [397, 411], "82": [288, 397, 410, 411, 425], "821": [397, 411], "822": 411, "823": [411, 425], "8237": 425, "8240": 411, "8253": 425, "8255": 411, "826": [411, 425], "8262": 411, "8275": 376, "8280": 304, "8282": 397, "8297": 376, "82x": 304, "83": [288, 411, 425], "8300": 411, "83024": 411, "831": 411, "832": 411, "8325": 411, "832701": 411, "833": 397, "834": [411, 425], "835": 411, "8363": 425, "836616": 411, "8375c": [397, 411], "8381": 411, "8395": 425, "8399": 411, "84": [288, 304, 411, 425], "840": 411, "841": 411, "8412": 411, "84121": 411, "842": 425, "8426": [397, 411], "842936": 411, "843": 397, "844": 411, "8441": 425, "8447": 425, "845": 411, "8456": 319, "8466": 411, "848": 397, "8480": [310, 425, 426], "8481": 411, "8482": 425, "84983": 411, "85": [260, 288, 304, 397, 411, 425], "8507": 425, "8515": 425, "853": [411, 425], "853916": 411, "855": 411, "857": 411, "858": 411, "859": 411, "8598": 397, "86": [288, 304, 411, 425], "861": [411, 425], "862": [397, 411, 425], "863": [397, 411], "865": 397, "8652": 411, "867": 397, "868": [397, 411], "8689": 397, "87": [288, 304, 411, 425], "870": 397, "8711": 411, "8715": 425, "8728": 411, "87335": 411, "8736": 397, "874": 411, "87429": 411, "875": 411, "876": 411, "8768": 425, "87685": 411, "877": [397, 411], "8775": 425, "878": 411, "8798": 411, "87x": 425, "88": [288, 304, 397, 410, 411, 425], "880": 411, "880179": 411, "880185": 411, "881": 411, "8818": 411, "8823": 411, "883258": 411, "884": 411, "8841": 411, "885": 411, "887": 425, "8888": [322, 340], "889": 411, "88x": 425, "89": [288, 304, 411, 425], "890": [397, 411], "891": 411, "8916": 425, "892": 411, "8923": 425, "893": 425, "893959": 411, "894": [397, 411], "8940": 411, "895": 411, "8972": 397, "898": [397, 411], "8989": 425, "8b": 432, "8e": 410, "8ghz": [425, 426], "8x7b": [309, 349], "9": [28, 57, 302, 306, 308, 320, 323, 327, 328, 329, 330, 331, 332, 337, 348, 349, 361, 362, 377, 387, 395, 397, 403, 411, 413, 419, 425, 426, 427, 428, 432], "90": [288, 302, 304, 389, 397, 419], "900": [411, 425], "9000": 334, "9018": 411, "902": 411, "902588": 411, "9026": 425, "9031": 425, "904": [397, 411], "905": [397, 411], "906": 397, "9060": 411, "907": 411, "908": 397, "9088": 411, "909": 411, "909941": 411, "90ghz": [397, 411], "91": [288, 397, 411, 425], "910": [397, 411], "911": 411, "9110": 411, "912": 411, "913": [397, 411], "913626": 411, "914": [397, 411], "915": 411, "916": 411, "917": 397, "9176": 411, "9183": 411, "91db": 332, "92": [288, 411, 425], "9206": 425, "92067": 411, "921": [397, 411], "9213": 425, "922": 425, "923": 411, "924": 397, "926": 397, "926038": 411, "927": 411, "928": 397, "9283": 425, "929398": 411, "93": [288, 411, 425], "930": 411, "931": [397, 411], "933": [289, 397], "935": [397, 411], "936": [397, 411], "937": 411, "937824": 411, "938": [397, 411], "939": 411, "94": [288, 411, 425], "940": 411, "94057": 411, "941": 411, "9418": 425, "943": 411, "945": [397, 411], "946": 411, "947": [397, 411], "94733": 411, "949": 411, "94x": 304, "95": [288, 346, 351, 410, 411, 425], "951": 411, "9513": 319, "952": 397, "955251": 411, "956": 411, "957": [397, 411], "958": [397, 425], "959": 411, "96": [288, 372, 397, 411, 425], "9609": 425, "961": 397, "965": [397, 411], "966": 397, "967": 397, "968": 411, "96945": 411, "97": [288, 411, 425], "971": [397, 411], "972": [397, 411], "973": 397, "974": [397, 411], "975": [397, 411], "9761": 411, "977": 411, "978": 397, "979": [397, 411], "98": [28, 288, 385, 411, 425], "980": 425, "9817": 425, "982": [397, 411], "983": 397, "985": [397, 411], "9857": 425, "987": 397, "9876": 363, "988": 397, "989": 397, "9890": 411, "99": [288, 302, 304, 411, 425], "9919": 411, "993": [397, 411], "994": 411, "994935": 411, "995": 397, "996": [397, 411], "996979": 411, "997": 411, "998": [397, 425], "999": 411, "9998425245285034": 418, "9998886585235596": 418, "99x": 425, "9b": 410, "9ghz": [397, 411], "A": [0, 1, 4, 9, 24, 25, 29, 36, 45, 47, 50, 57, 246, 258, 260, 266, 268, 309, 348, 349, 351, 375, 387, 388, 395, 397, 399, 402, 403, 409, 411, 413, 420, 428], "And": [158, 319, 347, 361, 389, 390, 391, 392, 395, 400], "As": [57, 303, 316, 342, 361, 371, 387, 389, 391, 392, 403, 407, 409, 432], "At": [25, 300, 361, 405, 406, 408], "Be": [62, 243], "Being": 298, "But": [57, 388, 399, 418], "By": [24, 246, 272, 351, 366, 379, 420], "FOR": [403, 404, 409], "For": [32, 52, 53, 57, 62, 181, 256, 257, 258, 266, 270, 271, 288, 298, 302, 306, 307, 308, 309, 314, 319, 325, 330, 332, 345, 346, 347, 348, 350, 352, 355, 357, 359, 361, 369, 372, 376, 378, 383, 384, 387, 390, 391, 395, 396, 397, 398, 400, 403, 407, 408, 409, 410, 411, 418, 425, 426, 427, 428], "If": [0, 17, 24, 25, 29, 33, 36, 44, 57, 246, 247, 260, 266, 269, 300, 303, 305, 308, 309, 313, 314, 315, 316, 317, 318, 330, 332, 335, 340, 349, 350, 351, 361, 363, 369, 372, 373, 375, 376, 380, 387, 389, 390, 391, 392, 395, 400, 406, 413, 415, 416, 419, 421, 423, 426, 427, 432], "In": [24, 36, 57, 258, 298, 302, 309, 314, 316, 321, 322, 330, 332, 338, 347, 349, 351, 356, 366, 368, 369, 372, 376, 387, 388, 389, 390, 391, 392, 395, 396, 399, 400, 401, 402, 403, 404, 405, 406, 407, 408, 409, 416, 417, 423, 425, 426, 429, 432], "It": [25, 35, 57, 147, 256, 257, 289, 303, 307, 319, 350, 354, 367, 369, 372, 376, 377, 378, 387, 389, 390, 391, 394, 395, 396, 404, 405, 407, 408, 413, 432], "Its": 391, "No": [269, 304, 361, 372], "Not": 404, "OF": 322, "ON": [398, 413], "OR": 351, "Of": [363, 389, 395, 402], "On": [325, 376, 420, 425, 426], "One": [319, 361, 365, 377, 379], "Or": [313, 316, 395, 420], "Such": 323, "TO": [403, 427], "That": [57, 408, 409], "The": [0, 2, 3, 4, 6, 10, 14, 17, 25, 28, 29, 30, 32, 35, 36, 39, 40, 41, 42, 44, 47, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, 246, 250, 251, 256, 257, 260, 263, 266, 267, 272, 274, 277, 279, 280, 282, 286, 289, 298, 302, 303, 308, 309, 311, 314, 315, 316, 319, 321, 334, 338, 340, 345, 346, 347, 348, 349, 353, 354, 355, 356, 358, 361, 366, 369, 370, 371, 372, 373, 375, 376, 377, 378, 379, 382, 383, 384, 385, 386, 387, 388, 389, 390, 391, 392, 393, 394, 395, 396, 399, 400, 401, 403, 404, 405, 406, 407, 408, 409, 413, 416, 418, 419, 420, 423, 429, 432], "Then": [32, 49, 57, 303, 309, 321, 330, 332, 351, 355, 364, 365, 366, 372, 375, 391, 392, 408, 409, 413, 419, 423], "There": [49, 303, 373, 376, 387, 388, 389, 406, 410, 413, 417, 419, 426], "These": [25, 256, 257, 270, 309, 327, 328, 329, 357, 361, 372, 387, 391, 395, 402, 408], "To": [24, 25, 36, 44, 270, 300, 307, 314, 319, 321, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 337, 338, 340, 342, 343, 344, 345, 347, 348, 349, 351, 358, 361, 363, 364, 365, 369, 370, 372, 373, 383, 384, 387, 402, 405, 407, 409, 413, 414, 422, 426, 427, 429, 432], "Will": [25, 420], "With": [272, 314, 349, 371, 408, 420, 423], "_": [24, 44, 270, 303, 307, 309, 314, 317, 318, 321, 322, 323, 324, 327, 328, 329, 331, 332, 336, 345, 346, 347, 348, 349, 350, 351, 352, 354, 361, 363, 366, 372, 376, 383, 384, 385, 386, 387, 388, 389, 390, 391, 392, 394, 395, 398, 399, 400, 401, 405, 406, 407, 408, 413, 416, 417, 418, 419, 423, 426, 428, 432], "__call__": 387, "__file__": [314, 332, 349], "__global": 402, "__init__": 387, "__kernel": 402, "__local": 402, "__m256i": 403, "__m512i": 403, "__str__": 247, "__version__": 421, "_attr": 387, "_create_out_pattern": 391, "_datatyp": 28, "_devic": 25, "_get_pattern_info": 391, "_mm256_loadu_epi": 403, "_mm512_castsi256_si512": 403, "_mm512_inserti32x8": 403, "_mm512_permutexvar_epi16": 403, "_mm512_set_epi16": 403, "_mm512_storeu_epi32": 403, "_n": 57, "_replace_pattern": 391, "_set_attr": 387, "a1": 410, "a100": [320, 379], "a1ef": 319, "a32543254": 299, "a7": 410, "a_node_name_1": 395, "a_node_name_2": 395, "a_node_name_n": 395, "a_scal": 62, "ab": [36, 260, 423], "abi": 361, "abil": [358, 372, 382, 428], "abl": [266, 330, 332, 335, 336, 363, 380, 407, 423], "about": [4, 25, 57, 298, 302, 309, 311, 316, 318, 319, 321, 337, 338, 345, 358, 361, 364, 365, 366, 367, 370, 372, 375, 383, 384, 387, 391, 394, 397, 401, 409, 411, 420, 425, 426, 429], "abov": [36, 44, 49, 57, 256, 257, 266, 309, 322, 364, 377, 386, 387, 390, 391, 395, 402, 403, 405, 406, 407, 412], "absolut": [256, 257, 259, 314, 315, 416, 423], "absorb": 372, "absorb_to_lay": 247, "abspath": [314, 332, 349], "abstract": [25, 33, 36, 44], "abus": 298, "academ": 350, "acb8": 332, "acc": [402, 413, 414], "acc91": 304, "acceler": [9, 38, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 151, 152, 153, 154, 155, 156, 157, 159, 160, 161, 162, 163, 164, 165, 166, 167, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 182, 183, 185, 186, 187, 188, 189, 190, 191, 193, 194, 196, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 293, 302, 304, 305, 314, 315, 327, 328, 329, 349, 352, 357, 366, 385, 392, 400, 401, 405, 407, 420, 421, 423, 428, 432, 436], "accent": 369, "accept": [24, 246, 298, 361, 372, 376, 418, 428, 432], "access": [264, 266, 270, 313, 314, 316, 330, 332, 335, 345, 348, 349, 361, 370, 372, 373, 380, 383, 384, 399, 404, 405], "accommod": [306, 370, 372], "accompani": 415, "accomplish": 371, "accord": [25, 55, 313, 316, 317, 318, 330, 335, 349, 371, 376, 379, 380, 387, 390, 409], "accordingli": [331, 332], "account": [36, 44, 298, 351], "accumul": [25, 307, 402, 407, 409], "accur": [25, 316, 372, 432], "accuraci": [25, 246, 264, 268, 304, 306, 307, 313, 319, 338, 354, 357, 358, 371, 372, 373, 376, 405, 413, 416, 417, 425, 428, 429, 432], "accuracy_criterion": 423, "accuracycriterion": 423, "achiev": [246, 302, 321, 354, 402, 417, 425, 428], "acoust": 369, "acquir": [348, 349], "across": [312, 319, 320, 323, 325, 326, 330, 331, 332, 376, 379, 428], "act": [288, 298, 404], "action": [269, 298], "activ": [256, 257, 261, 309, 323, 330, 331, 332, 337, 354, 361, 362, 372, 388, 392, 393, 396, 399, 403, 404, 405, 408, 413, 421, 423, 428, 432], "activation_dag": 396, "activation_mem_compress": 396, "actual": [345, 377, 383, 384, 387, 388, 406], "ad": [25, 36, 44, 255, 376, 378, 387, 391, 395, 400, 409, 413, 418], "adafactor": 352, "adapt": [4, 28, 32, 38, 43, 272, 298, 302, 309, 314, 331, 349, 352, 353, 354, 357, 359, 360, 372, 374, 420, 422, 430], "adapter_model_nam": 352, "add": [25, 30, 39, 40, 41, 47, 55, 57, 73, 269, 303, 314, 315, 317, 318, 347, 364, 366, 369, 371, 376, 388, 389, 390, 391, 395, 398, 400, 401, 408, 413, 414, 421, 422, 439], "add_1": 395, "add_284": 389, "add_37": 389, "add_bia": [413, 421], "add_cls_token": 150, "add_config_item": 55, "add_cross_attent": [36, 44], "add_embed": 150, "add_execut": 388, "add_gen": 30, "add_pooling_lay": [36, 44], "addclstoken": [130, 138], "addembed": 131, "addit": [50, 246, 247, 302, 304, 309, 359, 372, 378, 389, 402, 406, 414, 424, 432], "addition": [25, 319, 325, 357, 370, 373, 432], "additional_cmd": 27, "addr": 400, "addr_dst": 401, "addr_ptr": 400, "addr_src": 401, "addr_typ": 400, "address": [272, 288, 298, 314, 319, 322, 325, 337, 338, 345, 349, 358, 370, 372, 373, 375, 377, 383, 384, 400, 405, 406, 420, 429], "addv2": [57, 73, 395], "adher": [270, 300, 307, 319, 372], "aditya": 301, "adjac": 409, "adjust": [266, 288, 289, 309, 314, 331, 332, 345, 349, 357, 369, 372, 383, 384, 394, 423, 432], "adopt": [25, 364, 365, 366, 370, 399, 404, 409, 432], "advanc": [272, 298, 302, 309, 335, 357, 361, 372, 378, 380, 398, 410, 420, 432], "advantag": 370, "adventur": 361, "advis": 349, "ae": 410, "affect": [350, 390, 405, 408, 414], "affin": [255, 423], "aforement": 355, "after": [0, 24, 25, 36, 47, 57, 147, 181, 195, 220, 247, 256, 257, 260, 264, 305, 309, 313, 322, 347, 348, 349, 350, 355, 357, 363, 366, 372, 376, 382, 386, 389, 390, 391, 392, 394, 395, 399, 401, 406, 408, 409, 412, 413, 414, 423, 427, 428], "after_lay": 24, "afterward": [332, 395], "ag": 298, "again": [363, 406], "against": 409, "agent": [361, 372, 420], "agent_qa": 372, "aggreg": [17, 25], "agnost": 303, "agreement": [335, 380], "ahouzi": 301, "ai": [272, 302, 309, 315, 319, 325, 333, 334, 346, 361, 372, 420, 425, 426], "ai_photo": 334, "aid": [309, 321], "aidan": [36, 44], "aim": [306, 377, 387, 389, 391, 428], "aipc": 309, "airmeng": 299, "akarx23": 301, "akdlm": 332, "al": 432, "alapaca": 314, "alg": 400, "algo": 401, "algorithm": [25, 57, 95, 184, 281, 346, 347, 369, 370, 372, 376, 377, 390, 391, 394, 395, 399, 400, 406, 413, 419, 423, 427, 429], "algorithm_": 394, "alia": 394, "alibaba": [272, 420], "alibi": 429, "align": [266, 298, 346, 347, 350, 352, 372, 399, 401, 409, 420], "align_column": 266, "align_corn": 264, "align_head": 266, "align_img": 20, "align_row": 266, "align_supercel": 266, "all": [0, 1, 9, 24, 25, 32, 33, 36, 38, 44, 47, 52, 53, 54, 55, 57, 83, 95, 181, 184, 195, 246, 247, 256, 257, 259, 261, 264, 266, 268, 270, 272, 288, 298, 300, 301, 302, 303, 307, 314, 315, 316, 318, 322, 334, 335, 347, 349, 350, 351, 355, 359, 365, 366, 372, 376, 380, 386, 387, 388, 389, 391, 395, 397, 400, 401, 402, 403, 405, 408, 411, 416, 419, 420, 423, 429, 430], "all_choic": [267, 268, 351], "all_gath": 264, "alloc": [332, 396, 401, 402], "allow": [24, 35, 266, 289, 319, 322, 349, 357, 361, 369, 370, 372], "along": [25, 32, 247, 314, 349, 361, 373, 390, 404, 407, 409], "alpaca": [314, 349, 425], "alpaca_data": [314, 349, 422], "alpha": [111, 247, 260, 281, 309, 406, 413, 425, 428], "alpha_max": 247, "alpha_min": 247, "alpha_step": 247, "alreadi": [314, 315, 330, 332, 345, 355, 364, 372, 383, 384, 395, 421], "also": [24, 25, 57, 247, 256, 257, 266, 270, 300, 302, 309, 314, 319, 324, 336, 337, 345, 347, 349, 350, 351, 354, 358, 363, 364, 365, 366, 370, 372, 373, 376, 382, 383, 384, 387, 388, 389, 391, 392, 394, 395, 396, 399, 400, 401, 402, 405, 408, 409, 410, 417, 421, 423, 427, 428, 432], "alter": 377, "altern": [345, 370, 372, 373, 383, 384, 403, 409, 420], "although": 340, "alwai": [372, 373, 376, 395, 405, 414], "always_keep_cls_token": [36, 44], "amaz": [345, 383, 384], "amazon": [359, 397, 411], "ammbashankar": 301, "among": [361, 372, 405, 408], "amount": [25, 304, 402], "amx": [398, 405, 408, 413, 437], "amx_bf16": 421, "amx_bf16_params_t": 281, "amx_bf16_x16": 413, "amx_bf16bf16_inputs_t": 281, "amx_bf16f32_inputs_t": 281, "amx_inputs_t": 281, "amx_int8": 421, "amx_int8_params_t": 281, "amx_params_t": 281, "an": [0, 8, 24, 25, 33, 36, 44, 50, 57, 62, 243, 246, 258, 268, 270, 272, 298, 302, 304, 305, 306, 307, 309, 313, 314, 316, 317, 318, 319, 320, 323, 325, 335, 336, 338, 348, 349, 351, 354, 355, 358, 363, 365, 366, 369, 370, 372, 373, 376, 377, 378, 380, 387, 388, 389, 390, 391, 394, 395, 396, 399, 400, 401, 405, 406, 409, 414, 416, 418, 420, 421, 422, 428, 432, 439], "ana": 301, "anaconda": [323, 324, 326, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 357, 363, 368, 383, 384], "anaconda3": 361, "analys": 410, "analysi": [272, 302, 420], "analyz": [316, 389], "anatol": 377, "ani": [0, 17, 25, 32, 35, 36, 44, 57, 246, 247, 255, 256, 257, 264, 266, 270, 298, 309, 316, 335, 351, 363, 375, 377, 380, 387, 395, 418], "anim": 321, "annot": [350, 372], "annual": [316, 358, 372], "annual_report": [358, 372], "anomali": [319, 325], "anonym": 281, "anoth": [319, 321, 325, 387, 391, 396, 410], "answer": [36, 44, 298, 304, 316, 319, 324, 335, 338, 351, 361, 370, 372, 380, 430], "anyth": [266, 279, 377], "anywher": 419, "apach": [372, 415], "api": [4, 22, 32, 52, 53, 55, 256, 257, 260, 270, 272, 300, 302, 311, 315, 316, 319, 320, 334, 335, 336, 337, 338, 354, 356, 357, 358, 363, 369, 370, 372, 375, 380, 388, 390, 391, 392, 394, 395, 400, 401, 418, 420, 435], "api_kei": [309, 321], "api_open": 361, "apierrorcod": 22, "apolog": 377, "app": [6, 309, 345, 353, 361, 383, 384], "appear": [298, 373], "append": [0, 39, 40, 309, 321, 324, 338, 372, 406, 413, 414], "append_loop_len": 400, "append_messag": 0, "append_op": 389, "append_sum": 281, "append_vec": 400, "appl": 420, "appli": [5, 32, 57, 256, 257, 266, 298, 304, 306, 319, 346, 347, 354, 400, 401, 405, 406, 407, 409, 413, 419, 423, 432], "applic": [6, 309, 313, 316, 317, 318, 319, 321, 324, 325, 338, 340, 358, 361, 363, 367, 369, 370, 372, 420, 432], "apply_class_threshold": 266, "apply_lora": 347, "apply_postop_list": 401, "apply_postops_list": [280, 401], "apply_postops_list_": [280, 401], "apply_rotary_pos_emb": 32, "apply_threshold": 266, "appoint": 298, "approach": [260, 288, 303, 304, 306, 307, 311, 314, 349, 370, 372, 373, 375, 378, 401, 402, 420, 422, 429], "appropri": [298, 372, 376, 387, 400, 408], "approv": [270, 299, 300, 348, 349], "approx_ratio": 30, "approxim": [30, 408], "apr": 420, "april": [272, 420], "apt": [308, 309, 323, 324, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 357, 359, 363, 368, 369], "ar": [5, 9, 23, 24, 25, 29, 30, 32, 33, 36, 39, 40, 41, 44, 47, 57, 195, 247, 255, 256, 257, 258, 261, 263, 265, 266, 269, 270, 272, 298, 300, 302, 303, 307, 308, 309, 314, 315, 316, 319, 321, 324, 327, 328, 329, 330, 332, 335, 336, 346, 347, 352, 354, 355, 356, 357, 361, 363, 369, 371, 372, 373, 377, 378, 380, 384, 385, 386, 387, 388, 389, 390, 391, 392, 395, 396, 399, 400, 401, 402, 403, 404, 405, 406, 407, 408, 409, 413, 414, 415, 416, 417, 419, 420, 421, 422, 423, 426, 432], "arang": 73, "arangewithreciproc": 150, "arbitrari": [25, 260, 264, 355, 369, 387], "arc": [288, 346, 432], "arch": 323, "architectur": [9, 36, 44, 270, 302, 316, 323, 349, 350, 399, 406, 408, 432], "archiv": 369, "arcturus22": 301, "area": 266, "areg": 402, "arg": [24, 25, 32, 35, 39, 40, 41, 47, 61, 128, 246, 279, 307, 313, 314, 315, 316, 317, 318, 319, 324, 336, 338, 340, 347, 349, 358, 371, 372, 375, 389, 394, 421], "arg1": 1, "arg2": 1, "arg3": 1, "arg_t": 279, "argmax": [302, 306], "argument": [1, 2, 5, 24, 25, 32, 36, 44, 47, 246, 247, 303, 314, 355, 361, 371, 372, 376, 389, 414, 416, 417, 419, 422], "argumentpars": 1, "ariel": 301, "aris": 372, "arithmet": 400, "around": [263, 350, 361], "arrai": [20, 21, 24, 25, 62, 388], "arrondiss": 377, "art": [302, 372], "articl": [316, 349, 402, 420, 423], "artifact": 35, "artifici": [272, 420], "arxiv": [36, 260, 420, 432], "aryaman": 301, "ashimin": 301, "ashish": [36, 44], "askdoc": 324, "aspect": [316, 372], "asr": [309, 319, 322, 334, 336, 340, 355, 375], "assembli": [293, 398, 402, 404, 409, 410, 436], "assert": [36, 83, 421], "asset": [311, 319, 375], "assign": [256, 257, 258, 314, 349, 405], "assign_reg": 401, "assist": [309, 319, 323, 324, 325, 350, 361, 377, 424], "assistant_model": 323, "associ": [266, 399], "assum": [266, 387, 395, 402], "assur": 319, "astep": 281, "astunpars": [323, 324, 331, 334, 336, 337, 338, 340, 343, 344, 357, 363, 368], "asub": 402, "asym": 421, "asymmetr": [421, 423], "aten": 111, "atom": 415, "atributt": 247, "ats": 432, "atsm": 361, "attach": [24, 349, 387, 395], "attack": 298, "attempt": [35, 266], "attenion": 44, "attent": [32, 36, 44, 57, 259, 260, 279, 298, 307, 367, 389, 395, 407, 429], "attention_desc": 279, "attention_io": 281, "attention_mask": [33, 36, 37, 39, 40, 41, 44, 396], "attention_mask_length_adaptive_keep_indic": 150, "attention_output": [36, 44], "attention_output_layer_norm_length_adaptive_keep_indic": 150, "attention_reshap": 150, "attention_sink": 38, "attentionblock_attentionmaskaddreshap": 150, "attentionblock_constantofshapewithmul": 150, "attentionblock_qkvprereshap": 150, "attentionblock_qkvreshap": 150, "attentionblock_weightreshapeto4d": 150, "attentionmasklengthadaptiveexpandindic": 138, "attentionoutputlayernormlengthadaptiveexpandindic": 139, "attentionreshap": 140, "attr": [57, 63, 64, 66, 67, 68, 69, 70, 71, 72, 74, 76, 77, 78, 79, 80, 81, 82, 84, 85, 86, 87, 88, 89, 90, 92, 93, 95, 96, 97, 98, 99, 100, 101, 102, 103, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 122, 123, 124, 125, 126, 127, 280, 387, 388, 400, 401], "attract": [338, 356, 358], "attribut": [50, 52, 53, 54, 57, 95, 247, 278, 279, 389, 391, 394, 401], "attrs_": [280, 401], "attrs_map": 394, "audio": [309, 311, 315, 319, 321, 322, 355, 369], "audio2text": 369, "audio_name_0": 355, "audio_name_1": 355, "audio_name_2": 355, "audio_path": 369, "audio_url": [341, 381], "audiolanguageopt": 5, "audiospeechrecognit": 369, "aug": [272, 420], "augment": [256, 257, 260, 270, 309, 319, 321, 324, 372, 376], "augmented_exampl": 376, "authent": 321, "author": [353, 359, 360, 374, 415], "authorized_kei": [330, 332], "auto": [9, 304, 314, 317, 318, 324, 334, 336, 338, 347, 349, 369, 375, 389, 394, 401, 428, 432], "auto_alpha_arg": 247, "auto_clip": 247, "auto_round": 427, "auto_scal": 247, "autoawq": 432, "autocast_init": 57, "autoconf": 337, "autoconfig": [45, 302, 306, 418, 428, 432], "autodistil": 270, "autoencoderkl": 9, "automat": [246, 289, 309, 316, 338, 346, 347, 352, 358, 372, 382, 389, 390, 391, 400, 413, 428], "automata": 373, "automativ": 246, "automodelforcausallm": [422, 428, 429, 432], "automodelforsequenceclassif": [302, 306], "autoregress": [354, 429], "autoround": 427, "autoroundconfig": [247, 432], "autotoken": [289, 302, 418, 428, 429, 432], "aux_loss": [256, 257], "aux_output": [256, 257], "auxiliari": [256, 257], "avail": [57, 274, 277, 282, 286, 302, 308, 309, 313, 316, 317, 318, 330, 332, 361, 366, 370, 371, 372, 388, 404, 423], "avatar": [341, 381], "avenu": 377, "averag": [23, 25, 36, 264, 289, 302, 346, 372, 376, 385, 389], "avg": 288, "avoid": [25, 36, 38, 44, 335, 354, 364, 372, 373, 380, 395, 399, 401, 405, 407, 408, 413], "avx": 410, "avx2": 421, "avx512": [398, 399, 403, 423], "avx512_data_t": 281, "avx512_fp32_params_t": 281, "avx512evex": 410, "avx512f": [398, 407, 413, 421, 437], "avx512f_p2031_p2013": 413, "aw": [397, 411], "awai": [264, 307], "awar": [272, 302, 309, 335, 380, 432], "awq": [288, 432], "awqconfig": [247, 432], "ax": [246, 387, 407], "axi": [302, 387, 389, 407, 408], "ayaan": 301, "azur": 269, "b": [21, 25, 36, 55, 57, 62, 268, 301, 330, 351, 387, 395, 399, 402, 403, 404, 408, 409, 413], "b1": 399, "b2": 399, "b4": 410, "b5a3f2c4": 319, "b_node_name_1": 395, "b_node_name_2": 395, "b_node_name_n": 395, "b_scale": 62, "ba": [269, 406, 410, 413], "baai": [14, 376], "back": [0, 361, 372, 405, 406, 407, 408], "backbon": [256, 257, 369], "backend": [28, 181, 246, 247, 252, 289, 302, 304, 314, 320, 323, 324, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 337, 338, 339, 340, 342, 343, 344, 345, 348, 349, 370, 383, 384, 386, 392, 423, 425, 426, 432], "backendopt": 5, "background": [345, 383, 384], "backpropag": [422, 432], "backup": 299, "backward": [24, 340, 423], "bad": 409, "badd_dim": 413, "baddbmm": 83, "badg": 435, "baeseong": 432, "balanc": [260, 307, 319, 320, 372, 373, 397, 411, 425, 428, 432], "ban": 298, "bandit": 269, "bandwidth": [406, 408, 423, 432], "bar": [17, 272, 420], "barrier": 402, "base": [2, 3, 4, 9, 14, 25, 35, 36, 40, 44, 45, 50, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 151, 152, 153, 154, 155, 156, 157, 159, 160, 161, 162, 163, 164, 165, 166, 167, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 182, 183, 185, 186, 187, 188, 189, 190, 191, 193, 194, 196, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 216, 217, 218, 221, 222, 223, 224, 225, 226, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 246, 266, 279, 302, 304, 305, 306, 307, 309, 312, 319, 321, 323, 325, 327, 328, 329, 331, 335, 336, 337, 338, 340, 341, 347, 357, 358, 361, 362, 367, 369, 370, 372, 376, 377, 379, 380, 381, 382, 392, 394, 397, 402, 404, 405, 406, 407, 408, 410, 411, 418, 420, 428, 429, 430, 432], "base64": 0, "base_finetuned_model": 348, "base_model": 319, "base_model_path": 315, "base_url": [309, 321, 382], "basefinetuningconfig": 4, "baselin": 350, "basemodeloutputwithpast": 41, "basemodeloutputwithpastandcrossattent": [36, 44], "basemodeloutputwithpoolingandcrossattent": [36, 44], "basetrain": 246, "bash": [313, 314, 315, 322, 323, 324, 326, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 349, 350, 357, 358, 360, 363, 368, 383, 384, 392, 393, 427], "bashrc": [323, 324, 326, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 357, 363, 368, 383, 384], "basi": 372, "basic": [21, 25, 44, 302, 308, 309, 335, 356, 358, 361, 372, 380, 394, 405], "basicmagnitud": [304, 306], "basketbal": 23, "bass": 184, "batch": [25, 27, 37, 39, 40, 246, 255, 256, 257, 258, 260, 264, 268, 281, 302, 367, 376, 388, 404, 405, 407, 414, 425, 426, 432], "batch_decod": [428, 432], "batch_first": 23, "batch_matmul": 83, "batch_matmul_v2": 83, "batch_num": 281, "batch_siz": [25, 27, 28, 32, 33, 36, 44, 246, 247, 256, 257, 258, 289, 302, 352, 388, 389, 393, 407, 413, 427], "batched_data": 27, "batched_gen": 352, "batched_valu": 27, "batchk": [281, 408], "batchmatmul": 66, "batchmatmulv2": 67, "batchnorm": [255, 395], "batchnorm2d": 255, "batchsiz": 397, "bbox": 266, "bbox1": 266, "bbox2": 266, "bd00040000": 410, "beam": [350, 396, 432], "bear": 419, "beat": 420, "beauti": [361, 418], "becaus": [57, 258, 347, 349, 387, 394, 400, 403, 408, 423], "becom": [302, 319, 370, 391, 396, 409, 432], "been": [24, 36, 44, 57, 266, 303, 309, 319, 349, 350, 352, 354, 357, 372, 377, 401, 405, 418, 421], "befor": [24, 25, 28, 36, 47, 57, 246, 255, 256, 257, 260, 289, 300, 303, 305, 313, 314, 316, 317, 318, 330, 332, 336, 338, 345, 349, 358, 361, 370, 371, 372, 376, 383, 384, 387, 390, 391, 392, 395, 400, 401, 402, 403, 405, 406, 408, 413, 423, 426, 432], "begin": [44, 47, 400, 401], "behav": [36, 44, 352], "behavior": [246, 266, 298, 303, 370, 372, 375, 399, 400, 405, 419, 423], "behaviour": 372, "behind": 361, "being": [25, 246, 372], "believ": [377, 418], "belong": [57, 387, 423], "below": [57, 266, 272, 300, 302, 303, 308, 309, 314, 316, 317, 318, 319, 330, 332, 345, 347, 348, 349, 351, 354, 358, 359, 363, 366, 371, 372, 377, 383, 384, 387, 388, 390, 392, 395, 399, 404, 406, 407, 408, 409, 417, 422, 428, 432], "bench_": 413, "benchmark": [28, 246, 248, 269, 270, 302, 390, 397, 398, 411, 414, 420, 425, 426, 432, 434], "benchmark_dir": 413, "benchmark_it": 413, "benchmark_no_refresh": 413, "benchmark_util": 413, "benchmarkconfig": [27, 28, 289], "benedikt": 301, "benefici": 319, "benefit": [0, 370, 405, 409, 423], "bert": [36, 302, 303, 304, 340, 369, 388, 389, 390, 393, 395, 396, 397, 400, 405, 406, 407, 408, 430], "bert_large_model_path": 395, "bert_large_squad": 57, "bert_model": 388, "bertattent": 36, "bertembed": [36, 44], "bertencod": 36, "bertformaskedlm": 36, "bertformultiplechoic": 36, "bertfornextsentencepredict": 36, "bertforpretrain": 36, "bertforpretrainingoutput": 36, "bertforquestionansw": 36, "bertforsequenceclassif": 36, "bertfortokenclassif": 36, "bertintermedi": 36, "bertlay": 36, "bertlmheadmodel": 36, "bertlmpredictionhead": 36, "bertmodel": 36, "bertonlymlmhead": 36, "bertonlynsphead": 36, "bertoutput": 36, "bertpool": 36, "bertpredictionheadtransform": 36, "bertpretrainedmodel": 36, "bertpretraininghead": 36, "bertselfattent": 36, "bertselfoutput": 36, "berttoken": 36, "besid": [266, 303, 376, 401, 432], "best": [246, 258, 298, 304, 335, 340, 354, 372, 378, 380], "best_model": 428, "bestla": 421, "beta": [281, 406, 413], "better": [25, 57, 147, 246, 247, 319, 346, 347, 352, 361, 365, 371, 376, 387, 388, 389, 390, 399, 405, 406, 407, 408, 412, 416, 417, 423, 432], "between": [24, 25, 36, 44, 52, 53, 256, 257, 258, 266, 281, 303, 336, 354, 369, 370, 372, 379, 406, 409, 413, 423], "beyond": [361, 420], "bf16": [28, 158, 246, 302, 304, 314, 315, 320, 332, 336, 337, 340, 346, 347, 348, 349, 352, 354, 374, 375, 376, 392, 398, 401, 403, 405, 408, 413, 421, 422, 425, 426], "bf16_exp": [401, 413], "bf16_exp_attr": 401, "bf16_gelu": 401, "bf16_gelu_attr": 401, "bfloat": 305, "bfloat16": [305, 331, 346, 369, 371, 422], "bfloat16_t": 281, "bge": [14, 372, 376, 420], "bhadresh": 304, "bhargav": 301, "bia": [57, 62, 260, 281, 389, 413, 421], "bia_t": 281, "bianryop": 400, "bianryop_attr_list": 400, "bias_add": [62, 83], "bias_nod": 62, "bias_to_int32": 62, "biasadd": [57, 68, 391, 395], "bibtex": 415, "big": [49, 314, 315, 361, 390, 391, 396], "bigcod": [309, 349], "bigscienc": 428, "bin": [49, 55, 308, 309, 313, 314, 315, 388, 389, 390, 392, 410, 412], "binari": [25, 256, 257, 260, 308, 336, 401, 408, 413, 437], "binary_add": 400, "binary_injector": 400, "binary_injector_init": 400, "binaryadd": [73, 400], "binaryop": 400, "binaryop_addr": 400, "binaryop_alg": 400, "binaryop_attr": [280, 281, 400], "binaryop_injector": 400, "binaryop_list": [280, 400], "binaryop_list_": [280, 400], "bincount": 25, "bind": [314, 349, 388], "bio": [397, 411, 425, 426], "bit": [25, 247, 305, 306, 319, 348, 399, 400, 406, 409, 420, 421, 422, 423, 432], "bitsandbyt": [320, 432], "bitsandbytesconfig": 432, "blank": 413, "blip": 350, "blob": [22, 319], "block": [17, 25, 36, 44, 47, 304, 307, 396, 399, 402, 403, 404, 405, 406, 408, 409, 419, 432], "blocks_per_group": 281, "blocksiz": [247, 281, 421], "blockwise_over_matmul_gemm_conv": 47, "blog": [272, 302, 349, 420], "bloom": [272, 302, 363, 428], "bloom_1b7": 425, "bloom_7b1": 425, "bloomz": 428, "blue": [21, 36], "bm": 281, "bm25": 372, "bn": 281, "bnb_4bit_quant_typ": 432, "bo": 415, "boast": [309, 432], "bodi": [298, 316, 361, 407], "bolder": 407, "bond": 361, "bool": [0, 4, 9, 17, 23, 28, 33, 35, 36, 37, 40, 41, 44, 246, 247, 250, 251, 255, 264, 278, 279, 280, 281, 289, 303, 371, 372, 387, 400, 401, 416, 417, 421], "boolean": 4, "boolq": 288, "boost": [309, 357, 372, 407, 420], "boost_inc_dir": 388, "border": 407, "bordoloi": 301, "bori": 377, "bot": [341, 381], "both": [23, 36, 44, 265, 298, 306, 309, 319, 321, 330, 332, 333, 335, 339, 341, 342, 350, 354, 357, 369, 372, 373, 379, 380, 381, 382, 384, 405, 407, 412, 413, 414, 416, 423, 432], "bottleneck": [17, 404, 406, 432], "bottom": [266, 382], "bound": [29, 256, 257, 263, 266], "boundari": 266, "box": [256, 257, 258, 263, 266, 335, 380, 382, 398], "box_numpy_nul": 25, "boxes1": 263, "boxes2": 263, "brain": [419, 432], "branch": [35, 314, 315, 413], "brand": [24, 302, 415], "breadth": 372, "break": [353, 428], "breg": 402, "brief": [372, 410, 420], "bring": [25, 376, 390, 404, 408, 409, 420, 429], "broadcast": [32, 39, 40, 400, 404, 409, 410, 413], "broaden": 361, "broader": 432, "brought": [361, 406, 423], "brown": 340, "brows": 400, "browser": [322, 361], "bs0": 413, "bs1": 413, "bsc": 404, "bsc_data_t": 281, "bsmock": 359, "bsr": 404, "bstep": 281, "bsub": 402, "bsz": 37, "bubbl": 409, "budget": 306, "buffer": [247, 405, 406, 408], "buffers": 25, "bug": [300, 302], "build": [4, 27, 270, 302, 308, 309, 313, 316, 317, 318, 319, 320, 330, 336, 337, 338, 347, 366, 370, 378, 386, 399, 405, 406, 410, 417, 420, 426, 432], "build_chatbot": [4, 319, 356, 358, 372], "build_ext": 330, "build_with_cpu": 432, "built": [25, 361, 377, 417, 421, 428], "builtin_eval_func": 246, "builtin_train_func": 246, "bundl": [25, 361], "burden": 370, "busi": [314, 349, 376], "bxkxm": 399, "bxm": 399, "byeongwook": 432, "byte": [403, 409], "bytes_or_buff": 247, "c": [25, 38, 57, 245, 266, 268, 282, 288, 302, 308, 314, 319, 323, 324, 327, 328, 329, 331, 332, 334, 336, 337, 338, 340, 343, 344, 349, 351, 354, 357, 358, 359, 361, 363, 368, 385, 386, 387, 388, 390, 395, 397, 402, 404, 411, 413, 426, 427], "c0": 410, "c1": 57, "c2": 57, "c3": [57, 410], "c5f877": 410, "c6i": [397, 411], "c7": 410, "c_node_name_1": 395, "c_node_name_2": 395, "c_node_name_n": 395, "cach": [25, 35, 40, 41, 279, 307, 309, 320, 330, 340, 342, 344, 347, 364, 375, 394, 402, 405, 406, 407, 413, 421, 427, 429], "cache_config": 375, "cache_dir": 35, "cache_load_en": 25, "cache_plugin": 370, "cache_util": [40, 41], "cachefil": 25, "cacheplugin": 370, "cai": 432, "calc": 385, "calc_flop": 413, "calcul": [30, 47, 52, 53, 268, 349, 389, 395, 399, 401, 402, 405, 406, 409, 413, 423, 428], "calculate_ins_level_acc": 268, "calculate_scale_on_tmp_buf": 405, "calib": 428, "calib_dataload": [246, 247, 428], "calib_dataset": 247, "calib_func": [247, 428], "calib_it": [247, 432], "calib_len": [247, 432], "calib_pad": 247, "calib_pad_v": 247, "calib_shuffl": 247, "calibr": [246, 288, 423, 428], "calibrate_method": 246, "call": [9, 24, 25, 32, 57, 147, 177, 256, 257, 305, 319, 352, 354, 363, 370, 387, 390, 391, 396, 399, 400, 401, 408, 409, 420, 423, 432], "callabl": [25, 246], "calle": 24, "caller": 24, "can": [0, 9, 24, 25, 32, 33, 35, 36, 39, 40, 44, 47, 49, 57, 158, 181, 246, 247, 264, 266, 270, 302, 303, 305, 307, 309, 313, 314, 315, 316, 317, 318, 319, 321, 322, 323, 324, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 335, 336, 337, 338, 339, 340, 342, 343, 344, 345, 346, 347, 348, 349, 350, 351, 352, 354, 355, 356, 358, 361, 363, 364, 365, 366, 369, 371, 372, 373, 375, 376, 377, 380, 382, 383, 384, 385, 387, 388, 390, 391, 392, 393, 394, 395, 396, 399, 400, 401, 402, 403, 404, 405, 406, 407, 408, 410, 413, 417, 418, 419, 420, 421, 423, 425, 426, 428, 432], "candid": [372, 376], "candidate_context": 376, "cannot": [24, 25, 44, 281, 319, 324, 330, 332, 338, 373, 399, 405, 409, 414], "cao": 301, "cap": [314, 315, 347, 366], "capabl": [289, 309, 319, 325, 342, 357, 361, 369, 372, 373, 406, 409], "capac": 372, "caption": [267, 348, 350], "captur": 316, "carbon": 420, "card": [9, 320, 326, 346, 347, 352], "cardin": [256, 257], "carefulli": [309, 357, 373], "cascad": 302, "case": [32, 36, 44, 258, 268, 303, 304, 314, 316, 322, 348, 349, 351, 372, 376, 389, 390, 396, 399, 401, 402, 403, 413, 414], "cast": 83, "cast_to": 150, "castto": 141, "casual": [57, 372], "cat": [314, 330, 332, 349], "catalog": 302, "catch": 24, "categori": [351, 371, 387, 389], "category_nam": 351, "cater": [361, 369, 372], "caus": [24, 266], "causal": 44, "causal_lm": 422, "causallmoutputwithcrossattent": [33, 36, 44], "cbatchstep": 281, "cc": [350, 366, 388], "ccl": [314, 330, 348, 349], "ccl_torch2": 330, "ccl_worker_count": [314, 349], "ccontain": 57, "cd": [302, 308, 309, 314, 315, 322, 327, 328, 329, 330, 331, 332, 334, 335, 341, 347, 354, 361, 362, 364, 365, 366, 374, 379, 380, 381, 382, 386, 387, 388, 393, 398, 410, 413, 426, 427, 432], "ce": [303, 319, 420], "ceil": 25, "celebr": 361, "cell": [266, 322, 407, 409], "center": [25, 271, 309, 323, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 336, 338, 339, 340, 342, 343, 344, 361, 363], "center_i": [256, 257], "center_x": [256, 257], "cento": [302, 369, 425, 426], "central": 361, "centric": 309, "certain": [266, 319, 371, 372, 377, 387, 395, 432], "cffi": [323, 324, 331, 334, 336, 337, 338, 340, 343, 344, 357, 363, 368], "cfg": 281, "chain": [57, 309, 372, 400, 401], "challeng": [338, 358, 361, 372], "champ": 377, "chan": 25, "chang": [0, 32, 300, 309, 349, 355, 366, 369, 372, 377, 387, 399, 400, 409, 414, 415], "change_node_input_tensor": 55, "change_node_output_tensor": 55, "change_num_nam": 62, "changeabl": 400, "channel": [17, 25, 37, 361, 400, 404, 409, 413, 419, 428], "channel_num": 413, "chapter": 320, "charact": [15, 341, 381], "characterist": [298, 314, 349], "chart": 405, "chat": [0, 4, 5, 6, 7, 8, 10, 309, 313, 314, 315, 317, 318, 319, 321, 324, 331, 332, 334, 335, 338, 340, 341, 343, 344, 345, 346, 349, 352, 356, 358, 363, 364, 366, 367, 369, 371, 372, 377, 378, 379, 380, 381, 382, 383, 384, 420, 427, 428, 429, 432], "chat_a100_url": 379, "chat_gaudi2_url": 379, "chatbat": 361, "chatbot": [0, 8, 272, 302, 309, 312, 314, 316, 320, 324, 333, 338, 340, 341, 342, 343, 344, 345, 349, 357, 363, 370, 373, 375, 378, 381, 383, 384, 420], "chatbot_finetun": 347, "chatbot_serv": 361, "chatcmpl": 361, "chatglm2": 309, "chatglm3": 309, "chatgpt": [335, 338, 352, 358, 380], "chatqna": [309, 316, 382], "check": [9, 11, 15, 28, 39, 40, 57, 62, 158, 246, 268, 269, 295, 300, 302, 319, 330, 331, 332, 337, 340, 351, 353, 355, 361, 364, 366, 372, 373, 387, 390, 391, 395, 401, 421, 427, 438], "check_is_numb": 268, "check_result_": 413, "check_torch_compat": 421, "check_valu": 28, "checker": [9, 247, 309, 373], "checkout": [332, 372], "checkpoint": [36, 246, 348, 349, 360, 361, 369], "chen": 301, "cheng": 432, "chi": 359, "chian": 401, "child": [2, 372], "child_docu": [309, 372], "child_document_stor": 14, "child_par": 372, "childparentretriev": [2, 309, 357], "chines": [353, 359, 369], "chitchat": 372, "chmod": [330, 332], "choic": [36, 44, 267, 268, 304, 309, 321, 351, 361, 379, 396, 430], "choos": [313, 316, 335, 349, 369, 371, 372, 378, 380, 384, 395, 426], "chosen": [346, 347, 352, 372], "chroma": [14, 309, 357], "chrome": 389, "chuck": 372, "chunk": [36, 372], "ci": [315, 414], "circumst": 298, "citi": 377, "cjangcjengh": 353, "cl": [95, 184, 309, 361], "claim": [302, 415], "clamp": [36, 44], "clangformat": 269, "clarifi": [12, 298], "class": [34, 45, 62, 243, 261, 266, 281, 282, 289, 303, 361, 369, 373, 387, 394, 400, 401, 432], "class_error": 265, "class_filt": 25, "class_map": 266, "class_nam": 266, "class_subset": 25, "class_threshold": 266, "classif": [9, 33, 36, 44, 256, 257, 258, 260, 272, 302, 303, 304, 393, 418, 430], "classifi": [266, 371], "classmethod": 35, "claud": 352, "cleaner": 262, "cleaner_nam": 262, "clear": [316, 335, 380, 382, 408], "cli": 311, "cli_command": 311, "click": [322, 327, 328, 329, 382, 410], "client": [319, 323, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 336, 338, 339, 340, 342, 343, 344, 363], "clip": [9, 350], "clipfeatureextractor": 9, "cliptextmodel": 9, "cliptoken": 9, "clk_local_mem_f": 402, "clm": 304, "cloc": 269, "clone": [24, 302, 308, 322, 327, 328, 329, 330, 331, 332, 334, 335, 338, 340, 341, 345, 347, 348, 349, 354, 359, 361, 362, 363, 366, 369, 379, 380, 381, 382, 383, 384, 386, 387, 432], "cloud": [269, 319, 325, 361, 385], "cluster": [314, 349], "clx": 408, "cmake": [323, 324, 331, 334, 336, 337, 338, 340, 343, 344, 357, 363, 368, 386, 388, 398, 410, 413], "cmake_thread_libs_init": 388, "cmakelist": 388, "cn": 353, "cnn": [304, 349], "cnn_dailymail": 304, "co": [9, 25, 32, 35, 83, 127, 302, 334, 338, 345, 348, 349, 351, 359, 363, 383, 384], "coco": [256, 257, 260, 350], "code": [7, 20, 22, 25, 265, 270, 278, 279, 280, 281, 302, 306, 309, 321, 325, 336, 337, 338, 345, 347, 349, 350, 353, 356, 370, 371, 372, 377, 383, 384, 385, 387, 390, 401, 402, 403, 404, 405, 410, 413, 415, 426, 432], "code_chat": 309, "code_gen": 313, "code_gener": [309, 312, 313], "codealpaca": 349, "codegen": [309, 312, 313, 321, 323], "codegen2": 309, "codegen25": 350, "codellama": [309, 326, 327, 328, 329, 330, 332], "codellama_peft_finetuned_model": 349, "codenam": 272, "coeffici": 413, "coher": [272, 372, 406, 420], "col": [402, 403, 406, 408], "col_num": 281, "cola": 304, "colidx": 281, "collabor": [300, 420], "collect": [0, 18, 28, 266, 289, 372, 387, 389, 423], "collect_quant_info": 150, "collectquantinfo": 142, "color": [21, 265, 407, 409], "colsb": 403, "column": [265, 266, 348, 399, 404, 406, 409], "com": [22, 38, 43, 262, 302, 308, 314, 315, 319, 321, 322, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 335, 336, 337, 338, 340, 343, 344, 345, 347, 348, 349, 352, 354, 357, 361, 362, 363, 366, 368, 369, 380, 383, 384, 386, 388, 397, 411, 415, 420, 424, 425, 426, 432], "combin": [9, 246, 319, 325, 354, 369, 372, 378, 390, 395, 400, 401], "combinatori": 307, "combinedstat": 25, "come": [260, 361, 372, 387], "command": [1, 308, 309, 313, 314, 315, 316, 317, 318, 319, 321, 323, 324, 325, 326, 327, 328, 329, 330, 331, 332, 334, 335, 336, 337, 338, 340, 341, 343, 344, 345, 348, 349, 351, 354, 355, 358, 359, 363, 364, 366, 369, 376, 377, 379, 380, 381, 382, 383, 384, 385, 387, 388, 392, 414, 426], "comment": [298, 319, 325, 432], "commit": [35, 269, 298, 373, 414, 420], "common": [25, 298, 302, 319, 372, 423, 428], "commonli": [370, 372, 373], "commun": [0, 272, 298, 330, 332, 345, 378, 383, 384, 420], "compact": [303, 354], "compar": [266, 304, 319, 342, 347, 361, 370, 372, 378, 379, 399, 402, 413, 423, 432], "comparison": [288, 378, 389, 409], "compassion": 361, "compat": [300, 316, 319, 372, 421, 432], "competitor": 420, "compil": [245, 274, 306, 322, 324, 327, 328, 329, 338, 386, 387, 388, 390, 391, 393, 395, 396, 432, 439], "compiler_vers": 426, "complaint": 298, "complet": [0, 25, 57, 309, 316, 318, 321, 324, 346, 347, 349, 352, 354, 361, 367, 387, 397, 402, 405, 408, 411, 425, 426], "completion_token": 361, "complex": [369, 370], "compli": [335, 380], "complianc": 300, "complic": [57, 387, 395], "compon": [25, 269, 270, 299, 300, 319, 333, 339, 342, 369, 371, 378, 400, 415], "compos": [52, 53, 54, 369, 387, 392, 408], "comprehens": [270, 309, 319, 357, 372, 378], "compress": [272, 303, 306, 399, 403, 405, 409, 412, 423, 428, 432], "compression_manag": 246, "compressionmanag": 246, "compressor": [35, 272, 289, 302, 319, 366, 417, 419, 423, 425, 428, 432], "compris": 32, "comput": [23, 24, 25, 33, 36, 44, 49, 57, 243, 246, 256, 257, 258, 260, 263, 264, 266, 293, 302, 306, 312, 320, 361, 371, 372, 395, 398, 399, 400, 401, 402, 405, 407, 408, 412, 418, 421, 423, 425, 429, 432, 436], "comput_vector": 400, "computation": [346, 347], "compute_dtyp": [247, 319, 327, 328, 329, 371, 375, 429, 432], "compute_loss": 246, "compute_metr": 302, "compute_perform": 302, "compute_typ": 421, "compute_vector": 400, "concat": [83, 387], "concaten": [25, 314, 349, 403, 409, 413, 425], "concentr": [314, 349, 372], "concept": [372, 407, 409], "conceptu": 407, "concern": [338, 358, 372, 373], "concis": [316, 372, 395], "conclud": 372, "conclus": 409, "concurr": [354, 379], "concurrency_count": 361, "conda": [308, 354, 358, 360, 361, 362, 374, 393, 426], "conda_prefix": [354, 426, 427], "condens": 316, "condit": [9, 266, 390, 391, 415], "conduct": [270, 314, 347, 349, 376], "conf": [55, 319, 388, 389, 390, 394, 412], "conf_dict": 57, "confid": 266, "confidenti": 298, "config": [4, 27, 29, 32, 33, 36, 44, 45, 47, 49, 55, 57, 129, 130, 131, 132, 133, 134, 135, 136, 137, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 151, 152, 153, 154, 155, 156, 157, 159, 160, 161, 162, 163, 164, 165, 166, 167, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 182, 183, 185, 186, 187, 188, 189, 190, 191, 193, 194, 196, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 216, 217, 218, 221, 222, 223, 224, 225, 226, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 246, 249, 286, 289, 302, 303, 304, 306, 309, 319, 330, 332, 345, 348, 349, 351, 354, 361, 364, 365, 366, 375, 377, 383, 384, 388, 389, 391, 394, 400, 413, 418, 419, 423, 428], "config_dir": 375, "config_fil": [309, 360, 367, 375], "config_file_path": 47, "config_hpu": 366, "config_list": 246, "config_nam": 354, "config_path": 351, "configur": [4, 6, 28, 29, 35, 36, 44, 47, 49, 138, 246, 247, 281, 304, 305, 309, 314, 317, 318, 337, 348, 349, 357, 361, 365, 366, 368, 370, 371, 372, 375, 390, 399, 400, 404, 424, 425, 426, 432], "configuration_llama": 32, "configure_log": 6, "conflict": [266, 281], "confus": 377, "conjunct": 416, "conll03": 304, "conll2003": 304, "connect": [57, 330, 332, 350, 419, 420], "connector": 350, "consecut": [57, 403, 409], "consequ": 429, "conserv": 377, "consid": [9, 25, 298, 319, 377, 390, 399, 401, 403, 414], "consider": 373, "consist": [49, 256, 257, 266, 300, 319, 350, 354, 370, 372], "consol": [355, 361], "const": [57, 62, 243, 278, 279, 280, 281, 394, 398, 400, 401, 402, 403], "const_cast": 394, "const_rat": 28, "constant": [7, 73, 192, 246], "constantofshap": 73, "constexpr": 281, "constitut": 350, "constraint": [28, 30, 288], "construct": [0, 36, 57, 95, 267, 298, 314, 338, 347, 349, 371, 372, 400, 401], "construct_default_prompt": 371, "construct_nod": 57, "constructor": [361, 394], "consult": 270, "consum": [396, 403], "consumpt": [361, 385], "contact": [269, 298, 300, 324, 338, 424], "contain": [20, 21, 23, 24, 25, 32, 36, 44, 47, 49, 57, 62, 243, 244, 246, 247, 256, 257, 258, 265, 266, 267, 270, 293, 303, 313, 316, 319, 347, 348, 349, 350, 351, 355, 361, 364, 365, 372, 373, 375, 387, 388, 390, 391, 395, 398, 400, 412, 413, 414, 419, 423, 436], "container_object": 266, "content": [309, 313, 314, 316, 317, 318, 319, 321, 324, 336, 338, 340, 358, 361, 363, 367, 370, 372, 382, 389, 420, 432], "context": [23, 24, 25, 316, 319, 335, 372, 373, 376, 380, 382], "context_dim": 260, "context_templ": 23, "contextu": 372, "contigu": 407, "conting": 404, "continu": [0, 36, 354, 361, 367, 399, 402, 406, 407], "contradict": 354, "contrast": 315, "contribut": [0, 298, 299, 378], "contributor": 301, "control": [24, 27, 40, 57, 369, 376, 387], "conv": [28, 83, 389, 390, 401], "conv_reshap": 150, "conveni": [25, 349, 356, 372], "convent": [24, 25, 278, 279, 280, 281], "convers": [300, 319, 335, 340, 341, 349, 353, 369, 372, 378, 379, 380, 381, 382, 423, 428, 432], "convert": [0, 15, 25, 27, 49, 52, 53, 57, 62, 243, 256, 257, 260, 262, 266, 305, 340, 369, 370, 374, 393, 396, 408, 413, 423, 428], "convert_fullwidth_to_halfwidth": 15, "convert_image_to_base64": 0, "convex": 266, "convex_hul": 30, "convolut": [17, 73, 260, 303, 390], "convreshap": 143, "cooper": [302, 405], "coordin": [256, 257, 258], "copi": [0, 24, 255, 261, 264, 300, 330, 332, 345, 379, 383, 384, 391, 407], "copilot": [309, 319, 325, 420], "copyright": [266, 269, 415], "core": [28, 269, 289, 308, 310, 331, 371, 388, 397, 399, 405, 406, 411, 414, 415, 420, 425, 426], "cores_per_inst": [28, 246, 289], "corner": 322, "corpor": [266, 415], "corpu": [324, 372], "correct": [247, 298, 314, 372, 391, 395, 407, 426, 427], "correct_answ": 319, "correctli": [32, 316, 340, 387], "correl": 25, "correspond": [49, 52, 53, 57, 247, 258, 260, 262, 330, 335, 341, 346, 347, 352, 355, 372, 373, 379, 380, 381, 382, 384, 387, 391, 395, 398, 405, 409, 412, 423, 426], "correspondingli": 402, "cosin": [32, 346, 347], "cosmo": 353, "cost": [258, 314, 315, 342, 361, 409, 420], "cost_bbox": 258, "cost_class": 258, "cost_giou": 258, "costom": 371, "could": [9, 25, 47, 57, 279, 298, 314, 315, 336, 337, 338, 348, 349, 352, 358, 361, 363, 371, 372, 375, 385, 387, 388, 389, 391, 392, 395, 403, 412, 413, 419, 423, 428, 432], "count": [25, 314, 349, 365, 366, 389, 394, 396], "countri": 385, "courag": 361, "cours": [363, 389, 395, 402], "covari": 25, "cover": [269, 270], "coverag": 269, "cozi": 361, "cp": [364, 365, 366, 388], "cpp": [269, 270, 413, 420, 432], "cpplint": 269, "cpu": [9, 25, 28, 272, 280, 289, 302, 306, 308, 309, 310, 312, 313, 314, 315, 316, 317, 320, 323, 327, 328, 329, 330, 331, 332, 336, 340, 343, 344, 348, 349, 354, 360, 363, 367, 369, 374, 375, 376, 385, 388, 394, 397, 399, 401, 410, 411, 418, 420, 421, 425, 427], "cpu_": 25, "cpu_engine_t": 278, "cpu_inst": 278, "cpython": 386, "craft": [309, 314, 349, 357, 372], "crash": 25, "creat": [21, 24, 25, 62, 243, 247, 272, 298, 309, 316, 319, 320, 321, 323, 327, 328, 329, 330, 331, 332, 336, 337, 338, 340, 348, 349, 351, 356, 358, 361, 362, 372, 387, 393, 394, 404, 413, 416, 420, 421], "create_embed": 340, "create_kernel": 278, "create_memory_storag": 278, "create_position_ids_from_input_id": 44, "create_position_ids_from_inputs_emb": 44, "create_proxy_object": 279, "create_stream": 278, "create_tf_nod": 243, "creation": [309, 319], "creativ": 378, "criteria": [269, 417], "criterion": [250, 256, 257, 306, 416, 423], "criterion_reduce_typ": 28, "critic": [298, 307], "crop": 350, "cross": [33, 36, 44], "crosscovari": 25, "crossiou": 25, "crossov": [28, 30], "crossover_s": 28, "crucial": [372, 429], "cstep": 281, "csv": [266, 319, 372], "ctrl": [304, 361], "ctx": 279, "ctx_size": 429, "cuda": [25, 309, 318, 374, 375, 376, 422], "cuda_": 25, "cuda_visible_devic": 351, "cudatoolkit": 340, "cultur": 361, "cumsum": 73, "curat": 309, "curios": 361, "curl": [313, 316, 317, 318, 324, 340, 361, 363, 364, 365, 366, 367], "current": [47, 314, 332, 335, 340, 341, 349, 350, 353, 364, 365, 366, 367, 369, 372, 377, 379, 380, 381, 382, 389, 400, 401, 402, 404, 405, 407, 412, 413, 421, 422, 432], "current_working_directori": 322, "custom": [57, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 151, 152, 153, 154, 155, 156, 157, 159, 160, 161, 162, 163, 164, 165, 166, 167, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 182, 183, 185, 186, 187, 188, 189, 190, 191, 193, 194, 196, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 246, 272, 302, 316, 319, 320, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 335, 336, 338, 341, 343, 344, 348, 355, 361, 363, 366, 369, 371, 372, 373, 375, 378, 380, 381, 382, 389, 391, 420, 428, 439], "custom_port": 349, "customiz": [266, 309, 420], "cut": 369, "cute": [36, 44], "cutoff": 266, "cwd": [314, 349], "cxx11": 361, "cybersecur": 420, "cycl": [396, 410], "d": [25, 57, 268, 303, 309, 313, 316, 317, 318, 321, 324, 331, 340, 351, 361, 363, 364, 365, 367, 399, 407], "d0": 413, "d0xd1x": 413, "d1": [57, 413], "d12c0123": 319, "d2": 57, "d3": [57, 410], "d37": 304, "d9": 410, "d_conf": [303, 306], "da": 301, "daemon": [314, 315], "dag": 396, "dai": 361, "daili": [349, 372], "dailymail": 349, "dalvishruti14": 301, "damag": 419, "damp_perc": 247, "daniel": 301, "dash": 265, "data": [5, 25, 27, 38, 57, 121, 247, 256, 257, 260, 264, 266, 267, 270, 281, 288, 302, 304, 309, 314, 323, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 336, 338, 339, 340, 342, 343, 344, 346, 347, 348, 349, 354, 361, 363, 369, 372, 377, 387, 388, 390, 392, 393, 394, 396, 399, 400, 401, 402, 405, 406, 409, 413, 421, 422, 423, 425, 426, 438], "data0": 398, "data0_desc": 401, "data1": 398, "data1_desc": 401, "data2": 398, "data3": 398, "data4": 398, "data_dir": 393, "data_handle_": 279, "data_param": 400, "data_ptr": 394, "data_sourc": 25, "data_typ": [280, 281, 394, 400, 401, 413], "dataargu": 5, "databas": [14, 338, 372], "databrick": 428, "dataclass": [323, 324, 331, 334, 336, 337, 338, 340, 343, 344, 357, 363, 368], "datalearn": 420, "dataload": [25, 27, 246, 289, 302], "dataloader_drop_last": 376, "dataset": [25, 246, 247, 270, 289, 304, 315, 347, 350, 351, 355, 361, 366, 369, 372, 416, 423, 425, 426, 428, 432], "dataset_concaten": [314, 349, 352, 354], "dataset_config_nam": 354, "dataset_fil": 314, "dataset_nam": [289, 346, 347, 348, 349, 352, 354, 422], "datatyp": [28, 304, 305, 425, 426, 432], "dataxf": 410, "date": [347, 361, 372], "davinci": [314, 349], "day2": 420, "dcmake_vtune_hom": 410, "dco": 269, "ddatabas": 358, "ddimschedul": 9, "ddp": [314, 349], "ddp_backend": [314, 349], "ddp_find_unused_paramet": 352, "ddr4": 361, "ddr5": [425, 426], "de": [57, 369, 377], "deal": [32, 243, 389], "deb": 369, "debug": [61, 388, 396], "dec": [272, 302, 420], "decapoda": 422, "deci": 309, "decid": [361, 372, 404, 405], "decilm": 309, "decis": 372, "decod": [9, 36, 44, 247, 256, 257, 261, 354, 396, 410, 429], "decoder_attn_reshap": 150, "decoder_input_id": [36, 44], "decoderattnreshap": 144, "decompos": [57, 354, 387], "decor": [52, 53, 54, 62, 95, 184, 243, 244], "decreas": [370, 371], "deem": 298, "deep": [17, 272, 302, 317, 363, 392, 401, 417, 420, 423], "deep3dfacerecon_pytorch": [16, 17, 19, 20, 21], "deeper": [319, 325], "deeplearn": 399, "deepspe": [330, 347, 349, 350, 361], "deepspeed_hpu_zero3_sync_mark_step_requir": 349, "def": [28, 289, 302, 313, 387, 428], "default": [6, 14, 24, 25, 27, 28, 32, 35, 36, 44, 45, 246, 247, 260, 265, 266, 289, 302, 303, 309, 319, 334, 336, 340, 345, 347, 349, 355, 361, 364, 366, 369, 371, 372, 375, 376, 383, 384, 385, 387, 390, 396, 397, 401, 405, 409, 410, 411, 413, 416, 417, 419, 423], "defaultli": 373, "defin": [5, 16, 17, 47, 57, 246, 247, 270, 278, 298, 302, 303, 314, 349, 369, 371, 372, 387, 388, 394, 395, 409, 414, 416, 419], "definit": [55, 121, 128, 260, 288, 401, 428, 432], "defog": 309, "degrad": 423, "degre": [372, 404], "del_environ_var": 57, "del_keys_to_ignor": 44, "delet": [35, 57, 419], "delimit": 413, "deliv": 372, "delv": [387, 430], "demand": [361, 371, 432], "demeanor": 270, "demo": [319, 331, 332, 334, 338, 361, 378, 420], "demonstr": [302, 304, 306, 319, 323, 326, 327, 328, 329, 330, 331, 332, 348, 349, 350, 354, 356, 358, 368, 370, 377, 407, 409, 426], "demystifi": 420, "denois": 9, "denomin": 387, "dens": [14, 260, 281, 372, 375, 389, 390, 395, 398, 409, 413], "dense_x_spars": 281, "densiti": 409, "dep": 426, "depend": [29, 38, 256, 257, 300, 309, 335, 341, 356, 358, 372, 379, 380, 381, 382, 386], "depict": 406, "deploi": [269, 303, 319, 320, 323, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 336, 337, 338, 339, 340, 342, 343, 344, 358, 363, 364, 365, 366, 420, 432, 439], "deploy": [302, 306, 309, 319, 325, 372, 390, 393, 420], "deprec": [32, 36, 44, 246, 361], "depth": 24, "dequ": 264, "dequant": [73, 400, 401, 405, 413], "dequantize_tile_on_tmp_buf": 405, "dequantizelinear": [73, 392], "derefer": 25, "deriv": [5, 279, 346, 347, 406, 407], "derogatori": 298, "desc": [394, 400, 401], "desc_act": 247, "descend": [24, 25], "descent": [24, 432], "describ": [36, 44, 270, 280, 402, 404, 407, 413, 416, 417], "descript": [270, 303, 309, 319, 320, 371, 372, 389, 409, 416, 417, 419], "descriptor": [280, 394, 413], "design": [305, 316, 319, 327, 328, 329, 340, 342, 354, 361, 369, 371, 372, 373, 378, 387, 400, 401, 412, 429, 432], "desir": [25, 314, 349, 369, 370, 377, 389], "despit": 372, "dest": [52, 53, 54, 57, 62, 243], "dest_op": 121, "dest_op_nam": 243, "destin": [399, 404, 407, 413], "destructor": 394, "detach": 24, "detail": [9, 23, 25, 50, 52, 53, 57, 269, 270, 293, 295, 298, 300, 302, 303, 304, 306, 307, 309, 312, 317, 319, 320, 345, 346, 347, 349, 351, 352, 361, 365, 372, 378, 383, 384, 385, 386, 387, 389, 390, 391, 394, 395, 397, 398, 403, 410, 411, 413, 419, 420, 421, 423, 429, 432, 436, 438], "detect": [57, 256, 257, 260, 266, 269, 300, 369, 373, 420], "detection_model_path": 359, "determin": [266, 298, 370, 372, 390, 410], "determinist": [25, 373], "detr": [257, 261], "detrmulti": 257, "dev": [308, 314, 315, 332, 335, 341, 379, 380, 381, 382], "dev0": [426, 427], "devel": 308, "develop": [302, 309, 312, 313, 314, 316, 319, 320, 348, 349, 369, 372, 373, 394, 420, 422, 432], "devic": [9, 25, 30, 303, 309, 313, 316, 317, 318, 319, 323, 324, 325, 326, 327, 328, 329, 330, 331, 332, 334, 336, 338, 340, 343, 344, 347, 349, 352, 363, 365, 369, 374, 375, 385, 388, 390, 418, 422], "device_map": 432, "deviceopt": 5, "df": [57, 395], "diagon": 25, "dialog": 382, "dialogu": [341, 381, 382], "dic": 25, "dice": [256, 257, 260], "dice_loss": 260, "dict": [2, 9, 25, 36, 44, 45, 47, 57, 62, 243, 244, 246, 247, 256, 257, 258, 260, 264, 266, 267, 268, 351, 372, 373, 376, 387, 388, 419, 428, 432], "dict_path": 373, "dictionari": [25, 246, 247, 256, 257, 264, 267, 372, 373, 428], "did": 361, "didn": [21, 372], "differ": [25, 29, 39, 40, 41, 42, 49, 55, 60, 129, 130, 131, 132, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 151, 152, 153, 154, 155, 156, 157, 159, 160, 161, 162, 163, 164, 165, 166, 167, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 185, 186, 187, 188, 189, 190, 191, 193, 194, 196, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 224, 225, 226, 228, 229, 230, 241, 242, 265, 270, 298, 302, 303, 306, 309, 314, 316, 319, 325, 335, 346, 349, 351, 361, 369, 371, 372, 377, 378, 380, 387, 388, 390, 391, 392, 395, 399, 406, 409, 416, 417, 419, 423, 428], "difficult": 428, "difficulti": [376, 409, 428], "diffus": [9, 272, 302, 304, 405, 420], "diffusionpipelin": 9, "diffusionv1": [133, 134, 135, 136, 217, 218, 221, 222, 223, 231, 232, 235, 236, 237], "digest": 316, "digit": 420, "dilat": 255, "dim": [25, 36, 44, 256, 257, 258, 260, 388, 394, 399, 405, 409, 413], "dim_t": [280, 281], "dimens": [25, 32, 36, 44, 256, 257, 263, 303, 372, 378, 390, 404, 405, 407, 409, 413], "dimension": 25, "diminish": 429, "dino": 301, "dir": [265, 340, 348, 388], "direct": [20, 24, 281, 315, 319], "direct_process_row": 281, "directli": [24, 44, 260, 321, 335, 361, 369, 372, 380, 400, 403, 406, 428, 432], "directori": [21, 35, 247, 309, 314, 315, 321, 322, 345, 353, 355, 372, 375, 383, 384, 392, 412, 427], "dirnam": [314, 332, 349], "disabl": [24, 25, 264, 298, 314, 330, 362, 397, 411, 413], "disable_quanted_input": 247, "disast": 403, "discard": 414, "disclaim": [335, 380], "discontinu": 399, "discov": [312, 320], "discoveri": 361, "discrep": 372, "discret": 406, "discuss": [270, 409], "disjoint": 266, "disk": [390, 392, 396], "dispatch": 55, "dispatch_table_file_root": 390, "displai": [17, 382], "disregard": [256, 257, 270], "distanc": [303, 372], "distil": [246, 270, 302, 393, 430, 434], "distil_bert_bas": 387, "distilbert": [272, 302, 304, 306, 392, 418, 420, 430], "distilbert_bas": 387, "distilbert_base_uncas": 393, "distilgpt2": 304, "distillation_config": [246, 303, 306], "distillationconfig": [246, 306], "distillbert": 397, "distilledtextattack": 304, "distilroberta": 304, "distinct": 372, "distinguish": [369, 413], "distribut": [1, 25, 28, 252, 264, 303, 307, 314, 348, 354, 376, 406, 423, 424, 428], "distributed_init": 252, "distributed_world_s": 28, "div": [387, 391], "div2": 402, "dive": [312, 320], "diverg": 303, "divers": [314, 349, 369, 372, 378], "divid": [347, 372, 387, 395, 399, 404, 405, 408, 414], "divis": [372, 405], "dl": [400, 425, 426], "dlsa": [272, 302], "dm": 304, "dne_with_sparselib": [386, 413], "dne_with_sparselib_benchmark": [398, 413], "dne_with_sparselib_onli": [398, 413], "dne_with_sparselib_vtun": 410, "dne_with_test": 398, "dnnl": [279, 390, 394], "dnnl_arg_dst": 394, "dnnl_arg_src": 394, "do": [30, 47, 50, 246, 258, 264, 269, 298, 305, 316, 349, 350, 365, 369, 370, 387, 388, 390, 391, 395, 396, 400, 402, 405, 419, 420, 423, 428, 429], "do_blockwis": 247, "do_constant_fold": [246, 305], "do_ev": [349, 354], "do_lm_ev": 349, "do_sampl": [317, 363, 428, 432], "do_train": [314, 348, 349, 352, 354, 422], "doc": [9, 14, 22, 256, 257, 260, 270, 309, 321, 322, 324, 338, 372, 375, 387, 391, 400, 409], "docker": [312, 347, 364, 365], "docker_build_arg": [313, 316], "docker_cach": 366, "docker_run_env": [313, 314, 316], "dockerfil": [313, 314, 315, 316, 317, 318, 347, 349, 366], "dockerfile_tgi": 317, "dockerfile_vllm": 318, "dockerignor": [314, 315], "docstr": [36, 44], "document": [2, 9, 13, 14, 246, 272, 273, 302, 303, 306, 309, 319, 338, 348, 357, 358, 361, 371, 372, 376, 405, 407, 408, 409, 419, 423], "document_stor": 14, "docx": 372, "doe": [25, 247, 264, 350, 371, 387, 400, 401, 402, 403, 404, 407, 413], "doesn": [57, 256, 257, 349, 400, 409, 413], "dog": [36, 44, 340], "dolli": 428, "domain": [272, 293, 302, 309, 314, 335, 349, 372, 380, 398, 436], "don": [21, 36, 44, 57, 258, 266, 350, 361, 394, 396, 400], "done": [260, 303, 390, 405, 413, 423], "dong": 415, "dongsoo": 432, "dot": [24, 25, 407, 423], "doubl": [25, 327, 328, 329], "double_quant_bit": 247, "double_quant_dtyp": 247, "double_quant_group_s": 247, "double_quant_scale_dtyp": 247, "double_quant_use_sym": 247, "down": 57, "downgrad": 332, "download": [9, 17, 33, 35, 36, 44, 314, 315, 319, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 337, 340, 343, 344, 345, 350, 355, 357, 358, 364, 368, 369, 371, 372, 375, 383, 384, 424, 426, 427], "download_model": 360, "doxygenfil": 281, "dp_tile_n": 405, "dpc": [324, 338], "dpcpp": 432, "dpkg": 369, "dpo": 352, "dpo_clm": [346, 347], "dpo_pipelin": 347, "dpython_execut": 386, "draw_landmark": 21, "drawn": 407, "dream": 361, "driven": [309, 420], "driven_audio": 374, "driver": [361, 362, 425], "drop": [28, 29, 269, 304, 307, 309, 321, 406, 409, 416, 428, 429], "drop_and_restore_util": 31, "dropout": [28, 29, 260], "ds_build_cpu_adam": 330, "ds_build_util": 330, "dslim": 304, "dsparse_lib_use_amx": 413, "dst": [281, 394, 400, 401, 403, 404, 405, 408, 409, 413], "dst0": 281, "dst2": 281, "dst_data": 394, "dst_dt": 413, "dst_m1": 281, "dst_m2": 281, "dst_m_": 394, "dst_shape": [387, 388, 394], "dst_stride": 394, "dst_t": 281, "dst_tensor_ptr": 394, "dst_type": 413, "dstep": 281, "dstptr": 281, "dststride": 281, "dt": [400, 401], "dt1op1": 401, "dt2op2": 401, "dtype": [42, 57, 62, 121, 243, 244, 246, 302, 305, 371, 388, 389, 390, 394, 421, 432], "dtypes_dict": 57, "du": 405, "dual": 361, "due": [36, 57, 288, 314, 349, 351, 370, 372, 373, 391, 395, 399, 423, 426, 427, 432], "dummydataload": 302, "dump": [55, 269, 423], "dump_activation_dag": 396, "dump_tensor": 55, "duplic": 57, "durat": 389, "dure": [24, 39, 40, 246, 288, 307, 349, 350, 372, 388, 396, 405, 409, 414, 417, 419, 423], "dword": [409, 410], "dynam": [28, 38, 40, 246, 300, 304, 305, 307, 378, 394, 396, 398, 400, 404, 406, 413, 430, 437], "dynamic_config": [246, 306], "dynamic_length_config": 306, "dynamic_qu": 279, "dynamic_quant_desc": 279, "dynamic_quant_matmul": 279, "dynamic_quant_matmul_desc": 279, "dynamic_train": 28, "dynamiclengthconfig": [28, 246, 306], "dynamicquantconfig": 247, "e": [17, 23, 25, 33, 57, 267, 268, 269, 288, 298, 302, 303, 309, 313, 314, 315, 316, 318, 327, 328, 329, 331, 332, 347, 350, 351, 364, 365, 366, 369, 376, 390, 400, 401, 406, 407, 413, 414], "e0": 410, "e5": [372, 376], "e60": 319, "e7": 410, "each": [23, 25, 28, 36, 44, 57, 256, 257, 258, 260, 264, 265, 266, 299, 309, 314, 316, 330, 332, 349, 351, 355, 371, 372, 376, 379, 389, 390, 391, 399, 402, 404, 405, 406, 407, 408, 409, 412, 413, 414], "eager": 423, "earli": [369, 423, 430], "earlier": 319, "early_stop": 352, "eas": [370, 378], "easi": [260, 302, 316, 319, 338, 357, 358, 372, 390, 392], "easier": 382, "easiest": 319, "easili": [246, 316, 321, 369, 370, 372, 373, 399, 400, 432], "ebp": 410, "ec2": [397, 411], "echarlaix": 304, "econom": [372, 396], "edg": 49, "edit": [9, 298, 332, 361, 365], "edit_output": 24, "editor": 377, "edu": 263, "educ": 298, "ee": 410, "ee32e42": 425, "eea": 420, "effect": [272, 302, 342, 363, 369, 370, 372, 387, 413, 420, 432], "effici": [258, 272, 302, 306, 309, 314, 319, 349, 357, 361, 367, 370, 372, 396, 420, 422, 432], "effort": [372, 432], "effortlessli": 309, "ehdwns1516": 304, "einsum": 73, "einsumwitharang": 150, "either": [266, 309, 348, 349, 350, 372, 423], "elast": 304, "elec": 351, "electr": 385, "electra_base_chinese_discrimin": 425, "electra_base_chinese_gener": 425, "electron": [298, 351], "element": [47, 57, 246, 256, 257, 258, 260, 387, 395, 398, 402, 404, 407, 409, 413, 437], "element_num": [281, 401], "element_num_each_th": 281, "elementwise_over_al": 47, "elementwise_over_matmul_gemm_conv": 47, "eleuth": 346, "eleutherai": [304, 309, 314, 315, 349, 354, 428], "elia": 432, "elig": 395, "elimin": 266, "ellipsi": 407, "els": [289, 374, 377, 387, 394], "eltociear": 301, "eltwis": 401, "eltwise_forward": 394, "eltwise_gelu_erf": 394, "eltwise_gelu_tanh": 394, "eltwise_injector": [400, 401], "eltwise_injector_init": 401, "eltwiseop": [279, 400, 401], "eltwiseop_data_t": 281, "eltwiseop_desc": 279, "eltwiseop_kd_t": 401, "eltwiseop_param_t": [281, 401], "elucid": [319, 325], "embed": [2, 32, 36, 37, 44, 57, 259, 309, 336, 340, 357, 369, 370, 372, 375, 388, 391, 395, 400, 420], "embed_size_per_head": [36, 44], "embedding_dim": 37, "embedding_model": [372, 375], "embedding_model_dir": 375, "embeddingbag": [73, 150], "embeddings_reshap": 391, "embeddings_to_2d_before_inner_product": 150, "embeddingsto2dbeforeinnerproduct": 147, "embrac": 361, "emiss": 385, "emit": [57, 391], "emitt": 392, "emot": 304, "empathi": 298, "emphas": [354, 376], "emphasi": 372, "emphasized_weight": [256, 257], "emploi": [314, 319, 325, 347, 349, 370], "empow": [309, 378, 420], "empti": [21, 25, 57, 73, 256, 257, 264, 266, 309, 321, 391, 395, 401, 414, 421], "empty_list": 278, "empty_op": [83, 387], "emul": 423, "en": [9, 304, 353, 369, 372, 376, 420], "en_core_web_lg": [334, 371, 375], "enabl": [25, 28, 40, 288, 289, 305, 306, 309, 314, 315, 319, 320, 324, 330, 332, 334, 336, 338, 340, 342, 344, 349, 350, 358, 361, 363, 369, 372, 373, 375, 376, 378, 397, 399, 405, 406, 410, 411], "enable_bf16": 305, "enable_executor": [302, 305], "enable_mask": 400, "enable_op_tun": 390, "enable_rerank": [14, 372], "enable_sequential_cpu_offload": 9, "encapsul": 406, "encod": [0, 9, 36, 44, 247, 259, 261, 350, 389, 395, 429], "encoder_attention_mask": [33, 36, 44], "encoder_hidden_st": [33, 36, 44], "encount": [319, 338, 358, 361, 370, 429], "encourag": 372, "end": [25, 36, 44, 47, 57, 261, 272, 302, 320, 354, 369, 389, 392, 394, 395, 401, 410, 432], "end_posit": [36, 44], "end_step": [28, 419], "endfor": [403, 404, 409], "endpoint": [309, 363], "energi": 361, "eng_": 394, "eng_kind": [280, 398, 401], "eng_kind_": 401, "engag": [319, 372, 378], "engin": [49, 50, 51, 52, 53, 54, 55, 56, 57, 59, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 150, 181, 215, 243, 244, 245, 269, 273, 282, 297, 299, 302, 337, 367, 372, 377, 387, 389, 391, 394, 396, 400, 405, 408, 412, 432, 433, 440], "engine_dispatch_t": 390, "engine_graph": [62, 94, 95, 243, 244], "engine_init": 55, "engine_integr": 388, "engine_kind": [278, 280, 401], "engine_kind_": [278, 280], "engine_t": 278, "english": [302, 304, 306, 349, 359, 418], "enhanc": [306, 309, 319, 325, 338, 342, 349, 357, 358, 361, 369, 370, 371, 372, 374, 378, 404, 432], "enlarg": 390, "enough": [396, 405, 423, 432], "enrich": [361, 372], "ensur": [47, 266, 307, 309, 316, 319, 322, 327, 328, 329, 351, 357, 370, 372, 373, 405], "entail": [266, 354], "enter": [313, 335, 347, 355, 364, 365, 366, 380, 387], "enterpris": 420, "entir": [25, 389, 399, 405, 406, 419, 428], "entiti": [309, 361, 371], "entranc": 413, "entri": [36, 44, 245, 246, 258, 415], "entropi": [33, 36, 44], "enum": [281, 400, 401], "enumer": [2, 5, 25, 281, 428], "env": [57, 314, 332, 335, 341, 349, 351, 361, 379, 380, 381, 382, 393], "env_setup": 322, "environ": [57, 252, 298, 323, 324, 326, 327, 328, 329, 330, 331, 332, 335, 340, 341, 343, 344, 345, 379, 380, 381, 382, 383, 384, 388, 413, 414, 425], "environ_info_init": 57, "environment": 361, "environment_vari": 413, "enviton": 361, "eos_coef": [256, 257], "ep": [255, 281], "epoch": [47, 314, 348, 349, 353, 354, 419, 422, 425], "epsilon": 387, "equal": [57, 246, 369, 391, 399, 400], "equat": [407, 423], "equival": [264, 340, 387, 409, 428, 432], "eras": 401, "erf": [73, 394], "error": [7, 22, 61, 247, 256, 257, 308, 309, 361, 372, 394, 410, 426, 427, 432], "escap": [400, 401], "escape_eras": 401, "escape_reg": 401, "especi": [319, 394, 409, 421], "essenti": [316, 319, 327, 328, 329, 372, 408], "establish": [345, 372, 383, 384], "estim": [9, 25, 385, 389, 409], "et": 432, "etc": [9, 25, 246, 266, 303, 308, 335, 371, 380, 389], "ethnic": 298, "euclidean": 303, "eval": [5, 349, 351, 416], "eval_accuraci": [302, 303, 306, 419], "eval_dataset": [302, 306], "eval_f1": [28, 30, 306, 416], "eval_func": 246, "eval_loss": 425, "eval_metr": 30, "eval_multi_choic": 268, "eval_open": 268, "evalpredict": 302, "evalu": [30, 47, 246, 253, 254, 256, 257, 268, 303, 354, 389, 407, 416, 417, 423, 426, 427], "evaluation_strategi": [314, 349, 352], "even": [24, 309, 319, 357, 372, 387, 399], "evenli": 399, "event": [298, 372], "eventu": [264, 361], "ever": 420, "everi": [17, 25, 47, 57, 266, 355, 364, 365, 387, 399, 413], "everydai": [338, 358], "everyon": [298, 319, 373], "everywher": 420, "evict": 307, "evo_eval_metr": 28, "evo_it": 28, "evoc": 418, "evol": 349, "evolust": 30, "evolustionari": 30, "evolut": [28, 31, 304], "evolutionari": [30, 246], "ewm_col": 265, "ex": [327, 328, 329, 349, 423], "exact": [268, 370, 377], "exactli": 395, "examin": 373, "exampl": [4, 25, 28, 32, 36, 44, 55, 57, 181, 260, 266, 269, 270, 272, 273, 292, 298, 302, 303, 306, 307, 309, 313, 314, 317, 318, 319, 321, 322, 323, 324, 330, 332, 333, 336, 337, 338, 339, 342, 346, 347, 348, 349, 350, 351, 352, 353, 354, 355, 356, 358, 359, 361, 363, 364, 365, 366, 368, 370, 371, 372, 377, 378, 387, 388, 390, 391, 393, 394, 395, 396, 398, 400, 402, 416, 419, 423, 426, 427, 435], "example_input": [27, 247, 289], "example_output": 351, "example_persist": [324, 338], "exc": 361, "exce": 429, "except": [17, 314, 349, 372, 390, 400], "excerpt": [319, 325], "excess": [372, 429], "excit": 378, "excluded_op_nam": 28, "excluded_precis": 247, "exclus": [24, 341, 381], "exec": [313, 314, 315, 410], "exec_context_t": 279, "execut": [24, 279, 302, 309, 313, 316, 317, 318, 330, 332, 335, 341, 347, 351, 358, 365, 379, 380, 381, 382, 390, 394, 398, 400, 401, 405, 406, 408, 410, 413, 414, 423, 426, 432], "execution_mod": 396, "execution_opt": [390, 396], "executor": [55, 375, 387, 388, 389], "executorbenchmark": 27, "exhaust": 25, "exhibit": [319, 370], "exist": [21, 35, 57, 247, 306, 307, 345, 355, 363, 372, 383, 384, 387, 418, 426, 427, 432], "exit": [385, 430], "exp": [28, 401, 408, 413], "expand": [36, 39, 40, 44, 73, 309, 357, 370, 372], "expand_dim": 83, "expand_gath": [36, 44], "expanddim": 74, "expandindic": 73, "expans": [39, 40], "expect": [36, 37, 44, 246, 256, 257, 260, 298, 300, 352, 372, 409, 417], "expens": [303, 319, 370], "experi": [272, 298, 302, 309, 319, 347, 357, 361, 369, 372, 378, 403, 408, 409], "experiment": [401, 432], "expert": 40, "explain": [270, 316], "explan": 407, "explicit": [279, 298, 394, 401], "explicitli": [314, 319, 349], "explicitnhwctransposeforconv": 206, "explicitnhwctransposeforconvqat": 207, "exploit": [49, 395], "explor": [270, 306, 319, 320, 361, 432], "explos": 420, "expon": 260, "exponenti": 265, "export": [246, 266, 270, 302, 313, 314, 316, 322, 324, 331, 332, 334, 337, 349, 353, 354, 366, 389, 392, 418, 426, 427, 432, 434], "export_model": 337, "export_to_bf16_onnx": 246, "export_to_fp32_onnx": 246, "export_to_int8_onnx": 246, "export_to_jit": 246, "export_to_onnx": [246, 302, 305], "expos": [247, 400, 401], "exposur": 372, "express": [298, 371, 372], "expsum": 281, "exteion": 331, "extend": [272, 302, 376, 402, 421, 428, 432], "extens": [247, 269, 270, 299, 300, 306, 313, 314, 315, 316, 319, 320, 325, 330, 331, 332, 337, 347, 349, 352, 354, 355, 364, 365, 366, 374, 386, 387, 388, 415, 417, 418, 419, 420, 424, 425, 426, 427, 428, 429, 432], "extern": [320, 372], "extra": [24, 261, 332, 372, 396, 405], "extract": [9, 23, 49, 50, 52, 53, 63, 64, 66, 67, 68, 69, 70, 71, 72, 74, 76, 77, 78, 79, 80, 81, 82, 84, 85, 86, 87, 88, 89, 90, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 122, 123, 124, 125, 126, 127, 266, 371, 372, 387, 390, 392], "extract_numb": 268, "extract_t": 359, "extract_text_from_span": 266, "extract_text_inside_bbox": 266, "extractor": [57, 58, 390, 392, 395], "extrem": [302, 306, 427], "f": [57, 281, 303, 313, 314, 315, 316, 317, 318, 340, 347, 348, 349, 361, 394, 410, 423, 432], "f1": [304, 416], "f32": [389, 394], "f5": 410, "f7e0": 319, "fac": 304, "face": [16, 19, 35, 247, 272, 273, 298, 302, 309, 321, 345, 348, 349, 354, 361, 372, 383, 384, 392, 420, 429], "face_anim": [309, 360, 374], "faceanim": [309, 321], "facebook": [323, 428], "facet": 372, "facilit": [319, 325, 405, 408], "fact": [307, 408, 423], "factor": [260, 372, 397, 411, 425, 426, 428], "fail": [351, 361], "failur": 269, "fair": [288, 298, 377], "fairseq": 44, "faiss": 376, "faith": 298, "fake": 423, "falcon": [363, 428], "falcon_peft_finetuned_model": 349, "fall": 266, "fallback": 372, "fals": [0, 4, 14, 17, 24, 25, 28, 30, 35, 36, 37, 40, 41, 44, 55, 246, 247, 256, 257, 259, 266, 281, 289, 303, 307, 314, 316, 336, 340, 346, 347, 349, 352, 354, 361, 363, 371, 372, 375, 387, 390, 400, 401, 413, 416, 422, 428, 432], "famili": 361, "familiar": 361, "faq": 298, "far": 408, "fascin": 361, "fast": [272, 302, 306, 319, 420, 432], "fastapi": [309, 366], "fastchat": [0, 22], "fastedit": 377, "faster": [361, 396, 420], "fastrag": 309, "fatal": 61, "father": 387, "fault": 269, "fb": 410, "feasibl": 372, "featur": [9, 25, 44, 272, 297, 300, 302, 303, 309, 319, 324, 327, 328, 329, 335, 341, 350, 361, 369, 372, 378, 379, 380, 381, 382, 392, 395, 399, 406, 410, 418, 421, 424, 426, 430, 440], "feature_extractor": 9, "feature_mxfp4_poc": 332, "fed": 246, "feed": [25, 36, 44, 303, 324, 388], "feed_forward_chunk": [36, 44], "feedback": [319, 346, 347], "feel": [341, 378, 381, 405, 413, 430], "feet": 377, "feng": 301, "fetch": [305, 370, 372, 399, 402], "few": [57, 336, 337, 338, 340, 358, 369, 404], "fewer": [266, 342, 406], "fewest": 266, "ffmpeg": [308, 360, 369, 374], "ffn": [256, 257], "fictiti": 372, "fid": 425, "field": [57, 264, 265, 266, 361, 365, 375, 400, 401], "figur": [372, 402, 405, 406, 407, 409, 412], "file": [0, 6, 13, 25, 28, 30, 35, 47, 49, 50, 52, 53, 54, 55, 60, 61, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 184, 215, 246, 247, 248, 260, 265, 267, 278, 279, 280, 281, 302, 309, 314, 315, 319, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 338, 340, 343, 344, 345, 348, 349, 350, 355, 363, 366, 369, 370, 372, 375, 376, 378, 379, 382, 383, 384, 387, 388, 389, 390, 392, 394, 400, 401, 412, 413, 414, 415], "file_nam": 428, "file_root": 390, "filenam": [25, 267], "fill": [73, 361, 391, 407], "filter": [25, 266, 319, 372, 373, 419], "final": [49, 260, 266, 350, 351, 390, 391, 392, 394, 395, 402, 405, 406, 408, 416, 426], "find": [24, 270, 281, 314, 324, 327, 328, 329, 338, 345, 349, 364, 365, 366, 372, 383, 384, 387, 390, 394, 395, 404, 419, 426, 427], "fine": [5, 272, 302, 309, 320, 357, 372, 376, 420, 422, 423, 432], "finetun": [4, 5, 246, 302, 304, 306, 309, 312, 319, 320, 321, 340, 346, 347, 350, 352, 354, 369, 372, 375, 418, 422], "finetune_cfg": 319, "finetune_clm": [314, 348, 349, 352, 354, 422], "finetune_lora": 350, "finetune_model": [4, 319], "finetune_neuralchat_v3": 347, "finetune_seq2seq": [314, 349], "finetuned_model": [347, 355], "finetuned_model_lora": 347, "finetuned_model_lora_plus_dpo": [346, 347], "finetuning_data": 350, "finetuningargu": 5, "finish": [387, 391, 393], "finish_reason": 361, "finit": [25, 373], "first": [24, 25, 57, 177, 246, 266, 299, 309, 321, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 335, 336, 337, 338, 340, 343, 344, 345, 347, 351, 357, 363, 366, 368, 370, 372, 375, 380, 383, 384, 385, 386, 387, 390, 391, 393, 395, 396, 401, 402, 403, 404, 405, 406, 407, 408, 409, 410, 414, 423, 425, 428, 432], "first_lay": 24, "firstli": 49, "fist": 57, "fit": [25, 402, 432], "five": 356, "fix": [25, 37, 255, 390, 403, 420, 428], "fixedrandomsubsetsampl": 25, "fixedsubsetsampl": 25, "fl": 385, "flag": [305, 315, 321, 364], "flan": [272, 302, 314], "flan_t5_larg": 425, "flash": 32, "flat": 25, "flatmapdataset": 73, "flatten": [25, 73], "flexibl": [309, 319, 325, 372, 405, 429], "float": [25, 28, 47, 246, 247, 250, 251, 258, 260, 264, 268, 281, 303, 305, 371, 387, 393, 400, 401, 402, 416, 417, 419, 423, 428], "float16": 432, "float32": [371, 407, 423], "float4": 422, "float8_e4m3_t": 281, "float8_e5m2_t": 281, "floattensor": [36, 40, 41, 44], "floor_divid": 73, "flow": [38, 49, 57, 387, 391], "fluent": 372, "fly": 403, "fma": 409, "fmt": 264, "fn": [24, 25], "foc": 25, "focal": [256, 257], "focu": [378, 393, 407, 416, 432], "focus": [298, 314, 349, 369, 372, 378], "fold": [246, 413, 428], "folder": [270, 316, 335, 341, 348, 353, 355, 364, 365, 366, 372, 373, 379, 380, 381, 382, 386, 387, 388, 389, 390, 392, 432], "follow": [9, 24, 25, 36, 44, 57, 129, 130, 131, 132, 133, 134, 135, 136, 137, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 151, 152, 153, 154, 155, 156, 157, 159, 160, 161, 162, 163, 164, 165, 166, 167, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 182, 183, 185, 186, 187, 188, 189, 190, 191, 193, 194, 196, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 216, 217, 218, 221, 222, 223, 224, 225, 226, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 256, 257, 274, 277, 282, 286, 289, 298, 300, 302, 303, 308, 309, 313, 314, 315, 316, 319, 321, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 335, 336, 337, 338, 340, 343, 344, 345, 346, 348, 349, 350, 351, 352, 354, 355, 356, 357, 358, 361, 363, 364, 365, 366, 368, 369, 370, 372, 373, 376, 380, 383, 384, 385, 387, 389, 390, 391, 392, 394, 400, 401, 403, 404, 405, 406, 407, 408, 409, 410, 413, 414, 415, 423, 426, 432], "followt": 366, "footprint": [307, 420], "forc": [25, 35, 315, 353], "force_download": 35, "forced_assign": 266, "forg": [323, 324, 331, 334, 336, 337, 338, 340, 343, 344, 354, 357, 358, 363, 368, 426], "forget": [387, 391, 394], "fork": 315, "form": [49, 57, 246, 266, 268, 303, 314, 316, 340, 349, 351, 361, 372, 378, 389, 395, 399, 404, 408, 413], "formal": 36, "format": [0, 23, 61, 128, 246, 256, 257, 260, 263, 266, 269, 319, 351, 355, 363, 372, 389, 407, 408, 411, 412, 421, 423, 428], "format_typ": 280, "formatt": 6, "former": [57, 355, 395], "formerli": [293, 302, 398, 436], "formul": [346, 347, 372], "formula": [385, 405], "forth": 361, "forward": [9, 32, 33, 36, 37, 40, 44, 256, 257, 258, 260, 389, 394, 423], "forward_infer": [280, 394, 401], "forward_tim": 389, "foster": 298, "found": [270, 302, 308, 309, 321, 372, 386, 387, 409, 426, 427], "four": [409, 426, 429], "fourth": 420, "fox": 340, "fp16": [288, 423, 432], "fp32": [28, 158, 246, 269, 302, 304, 337, 375, 388, 389, 390, 392, 394, 400, 401, 403, 404, 406, 408, 413, 418, 421, 423, 425, 428, 432], "fp32_bia": 62, "fp32_exp": 401, "fp32_exp_attr": 401, "fp32_gelu": [401, 413], "fp32_gelu_attr": 401, "fp32_relu": 413, "fp32_weight": 421, "fp4": 422, "fp4_e2m1": 421, "fp8": [425, 426], "fpn": 260, "fpn_dim": 260, "fr": 369, "frac": [399, 407], "fraction": [25, 266], "fragment": 354, "framework": [45, 49, 57, 60, 62, 63, 64, 66, 67, 68, 69, 70, 71, 72, 74, 76, 77, 78, 79, 80, 81, 82, 84, 85, 86, 87, 88, 89, 90, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 122, 123, 124, 125, 126, 127, 181, 246, 252, 293, 304, 308, 309, 313, 319, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 346, 350, 372, 387, 388, 392, 396, 398, 418, 420, 423, 425, 426, 436], "framework_model": [62, 94, 95, 243], "framework_modeling_config": 55, "franc": 377, "frantar": 432, "fraud": 420, "free": [298, 316, 341, 372, 378, 381, 400, 413, 428, 430], "freedom": 378, "frequenc": [397, 411, 419], "frequencygovern": [397, 411], "frequent": [370, 399], "friend": 361, "friendli": [309, 319, 357, 372, 389, 405, 406], "from": [0, 2, 4, 5, 9, 17, 22, 23, 24, 25, 27, 30, 32, 35, 36, 38, 39, 40, 43, 44, 47, 55, 57, 62, 63, 64, 66, 67, 68, 69, 70, 71, 72, 74, 76, 77, 78, 79, 80, 81, 82, 84, 85, 86, 87, 88, 89, 90, 92, 93, 95, 96, 97, 99, 100, 101, 102, 103, 105, 106, 107, 108, 109, 110, 111, 112, 114, 115, 117, 118, 119, 120, 122, 123, 124, 125, 126, 127, 243, 244, 246, 247, 255, 260, 261, 262, 263, 264, 265, 266, 268, 270, 288, 289, 298, 299, 300, 303, 304, 306, 307, 309, 311, 316, 319, 327, 328, 329, 330, 331, 332, 334, 335, 337, 345, 346, 347, 350, 353, 354, 355, 356, 358, 360, 361, 367, 369, 370, 371, 372, 373, 374, 375, 376, 380, 383, 384, 387, 388, 389, 390, 391, 392, 394, 395, 396, 400, 401, 404, 406, 407, 408, 409, 416, 417, 418, 419, 420, 422, 423, 425, 426, 427, 428, 429, 430, 432], "from_llm": [309, 372], "from_output": 55, "from_pretrain": [35, 36, 44, 247, 289, 302, 306, 307, 418, 422, 428, 429, 432], "front": 400, "frontend": [60, 320, 333, 339, 342, 378, 404, 432], "frozen": [9, 57, 255, 350, 388, 392, 422], "frozenbatchnorm2d": 255, "fschat": 366, "ftl": 385, "fulfil": 390, "full": [15, 25, 327, 328, 329, 372, 402, 415, 423], "full_finetun": 348, "fulli": [25, 372, 399, 408, 409, 426], "func": [57, 390], "function": [2, 10, 18, 35, 47, 246, 278, 279, 280, 303, 309, 319, 335, 342, 357, 360, 361, 369, 370, 371, 372, 373, 374, 380, 387, 390, 391, 394, 395, 396, 400, 401, 413, 416, 419, 420, 421, 423, 428, 432], "further": [298, 306, 313, 372, 390, 404, 432], "fuse": [39, 40, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 151, 152, 153, 154, 155, 156, 157, 159, 160, 161, 162, 163, 164, 165, 166, 167, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 182, 183, 185, 186, 187, 188, 189, 190, 191, 193, 194, 196, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 392, 400, 401], "fused_batch_matmul_v2": 83, "fused_batch_norm_v3": 83, "fused_gemm": 83, "fused_matmul": 83, "fusedbatchnormv3": 76, "fusedgemm": 77, "fusedmatmul": 78, "fusion": [57, 138, 181, 195, 394, 395, 400, 401, 406, 439], "futur": [323, 324, 331, 334, 336, 337, 338, 340, 343, 344, 357, 361, 363, 367, 368, 392, 400, 401, 410], "futurewarn": 361, "fwk": 57, "fx": 423, "g": [17, 23, 25, 267, 268, 303, 309, 327, 328, 329, 331, 332, 350, 351, 369, 376, 400, 401, 406, 414, 425], "g_idx": 421, "gadget": 420, "gain": [372, 406, 423, 425], "gamma": 260, "gaodrew": 348, "gap": 406, "gather": [36, 44, 83, 246, 264, 279, 387, 400], "gather_desc": 279, "gather_el": [83, 387], "gather_typ": 281, "gatherel": [80, 387], "gatherv2": [79, 387], "gatherwithadd": 147, "gaudi": [309, 313, 316, 323, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 336, 338, 339, 340, 342, 343, 344, 363, 366, 420], "gaudi1": 320, "gaudi2": [308, 319, 320, 349, 379, 420, 425], "gaudi_bartattention_forward": 37, "gaudi_bartlearnedpositionalembed": 37, "gaudi_mistral_repeat_kv": 39, "gaudi_mistral_rmsnorm_forward": 39, "gaudi_mixtral_attention_forward": 40, "gaudi_mixtral_block_sparse_moe_forward": 40, "gaudi_mixtral_decoder_layer_forward": 40, "gaudi_mixtral_model_forward": 40, "gaudi_mixtral_repeat_kv": 40, "gaudi_mixtral_rmsnorm_forward": 40, "gaudi_phi_attention_forward": 41, "gaudi_phi_decoder_layer_forward": 41, "gaudi_phi_model_forward": 41, "gaudi_spawn": [346, 349, 352], "gaudi_swin_get_attn_mask": 42, "gaudimixtralforcausallm": 40, "gb": 372, "gcc": [308, 337, 386, 425, 426, 427], "gchhablani": 304, "geeki": 420, "gelu": [83, 150, 387, 394, 398, 401, 413], "gelu_algorithm": 394, "gelu_d": 394, "gelu_erf": 394, "gelu_p_": 394, "gelu_pd": 394, "gelu_tanh": [389, 394], "geluoper": 394, "gemm": [83, 305, 399, 402, 405, 408, 413, 421, 437], "gemma": 349, "gen": [302, 323, 349, 354, 401], "gen_cas": 401, "gen_case_": 413, "gen_id": [428, 432], "gen_text": [428, 432], "genai": 420, "gender": 298, "gene": 30, "gener": [0, 5, 9, 27, 28, 36, 44, 47, 55, 57, 246, 258, 259, 260, 263, 268, 269, 270, 289, 302, 303, 309, 314, 315, 317, 321, 324, 325, 335, 338, 340, 346, 347, 348, 349, 350, 351, 352, 354, 355, 358, 361, 363, 368, 369, 371, 372, 373, 380, 384, 387, 391, 395, 400, 401, 404, 405, 408, 410, 412, 413, 416, 417, 420, 423, 426, 427, 428, 429, 432], "generalized_box_i": 263, "generate_kwarg": [428, 432], "generate_sequ": 150, "generatesequ": 149, "genv": [314, 349], "get": [0, 25, 29, 30, 49, 55, 57, 61, 62, 243, 244, 246, 251, 270, 299, 305, 308, 319, 321, 327, 328, 329, 334, 335, 337, 341, 348, 349, 356, 359, 361, 365, 369, 370, 379, 380, 381, 382, 387, 390, 391, 392, 394, 395, 400, 407, 409, 414, 418, 426, 435], "get_addr": 400, "get_autocast_info": 57, "get_bbox_span_subset": 266, "get_binaryop_list": [280, 400], "get_children": 62, "get_conv_templ": 0, "get_data_dtyp": 57, "get_data_s": 400, "get_engine_kind": 278, "get_environ_info": 57, "get_example_input": [27, 289], "get_export_arg": 246, "get_global_id": 402, "get_group_id": 402, "get_implementation_list": 278, "get_initializer_children_nam": 62, "get_input_embed": [36, 44], "get_last_word_idx_in_templ": 23, "get_local_id": 402, "get_logg": 61, "get_lut_exp_attr": 281, "get_model_fwk_nam": 57, "get_modul": 24, "get_multi_choice_info": [267, 351], "get_next_node_nam": 55, "get_node_by_nam": 55, "get_node_children_nam": 62, "get_node_id": [55, 387], "get_output_embed": [36, 44], "get_paramet": 24, "get_peft_model": 422, "get_pre_node_nam": 55, "get_prompt": 0, "get_quant_info": 57, "get_refresh_data_idx": 413, "get_relevant_docu": [309, 372], "get_reprs_at_idx": 23, "get_reprs_at_word_token": 23, "get_runtime_kind": 278, "get_sp": 279, "get_sparse_nodes_nam": 55, "get_sparsity_ratio": 47, "get_stor": 30, "get_tensor_dest_op": 243, "get_tensor_idx": 55, "get_throughput": 249, "get_true_data": 401, "get_true_data_": 413, "get_words_idxs_in_templ": 23, "get_workspace_s": 279, "getdefaultencod": 247, "getidx": 401, "getmemori": 396, "getstrid": 394, "getter": [36, 44], "gflag": 388, "gflop": [304, 411, 414], "gfpgan": [360, 374], "ggml": 426, "gha": 269, "gidx": 421, "gigant": 428, "giou": [256, 257, 263], "girl": [361, 426, 427, 428, 429, 432], "git": [35, 269, 302, 308, 314, 315, 322, 327, 328, 329, 330, 331, 332, 334, 338, 347, 349, 354, 357, 359, 361, 362, 363, 366, 368, 386, 388, 432], "github": [22, 38, 43, 262, 269, 300, 302, 308, 314, 315, 319, 322, 325, 327, 328, 329, 330, 331, 332, 345, 347, 348, 349, 352, 354, 361, 362, 366, 383, 384, 386, 388, 394, 415, 424, 432], "give": [24, 57, 316, 349, 387, 391, 399], "given": [0, 4, 23, 24, 25, 28, 35, 36, 44, 247, 267, 268, 316, 322, 351, 354, 376, 385, 395, 401, 404, 407, 409], "glibcxx_3": [426, 427], "global": [28, 264, 330, 369, 425], "globalcol": 402, "globalrow": 402, "glog": 388, "glog_minloglevel": [302, 388, 393], "gloo": 252, "glue": [302, 354], "glx": [308, 309, 334], "gmt": 361, "gnr": 332, "gnu": 386, "go": [5, 39, 40, 57, 264, 300, 351, 361, 363, 402], "goal": [246, 316, 372, 419], "goe": 372, "gold_i": 268, "golub": 25, "gomez": [36, 44], "good": [25, 298, 321, 369, 373, 402, 403, 425], "googl": [314, 334, 349, 372], "google_api_kei": 334, "got": 361, "govindh": 420, "gp": 334, "gperftool": [323, 324, 331, 334, 336, 337, 338, 340, 343, 344, 354, 357, 358, 363, 368], "gpt": [272, 302, 304, 309, 314, 315, 324, 346, 347, 349, 350, 352, 363, 420, 428, 432], "gpt_j_6b": 425, "gpt_j_6b_clm": 425, "gpt_j_6b_url": [335, 380], "gpt_neox_clm": 425, "gptbigcod": 33, "gptbigcodeforcausallm": 33, "gptbigcodeforsequenceclassif": 33, "gptbigcodefortokenclassif": 33, "gptbigcodemodel": 33, "gptbigcodepretrainedmodel": 33, "gptcach": [319, 370], "gptj": 354, "gptj_ft_env": 354, "gptj_peft_finetuned_model": 354, "gptneotoken": 349, "gptneoxtoken": 349, "gptneoxtokenizerfast": 349, "gptq": [288, 319, 421, 432], "gptqconfig": [247, 432], "gpu": [9, 25, 309, 312, 320, 323, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 336, 338, 339, 340, 342, 343, 344, 347, 350, 363, 365, 367, 376, 385, 402, 420], "gpu_id": 9, "gpu_ocl_engine_t": 278, "gqa": 350, "gracefulli": 298, "grad": 24, "gradient": [24, 256, 257, 422, 432], "gradient_accumulation_step": [314, 346, 347, 348, 349, 352, 354, 422], "gradient_checkpoint": [346, 347, 349], "gradient_checkpointing_en": 422, "gradio": [0, 345, 361, 378, 383, 384], "gradio_cli": 361, "gradio_web_serv": [345, 361, 383, 384], "gradiodeprecationwarn": 361, "granular": 354, "graph": [4, 24, 38, 49, 50, 52, 53, 54, 57, 58, 59, 62, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 182, 183, 185, 186, 187, 188, 189, 190, 191, 193, 194, 196, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 269, 274, 302, 303, 320, 387, 392, 395, 396, 404, 407, 409, 439], "graph_def": [52, 53, 54, 243], "graph_dispatch": [55, 390], "graph_init": [55, 388, 390], "graph_node_names_detail": [62, 243, 244], "graph_nodes_dict": [62, 243], "graph_util": [58, 387, 391, 395], "great": [353, 359, 360, 374, 418, 420], "greater": [391, 416, 417], "greater_is_bett": [250, 251, 416, 417, 423], "greatli": [403, 408], "grep": [302, 345, 383, 426, 427], "grew": 361, "grid": 266, "ground": [256, 257, 258, 376], "group": [260, 304, 349, 361, 365, 366, 376, 387, 395, 402, 407, 409, 425, 426, 432], "group_by_length": [314, 349], "group_by_modality_length": 350, "group_dim": 247, "group_rowptr": 281, "group_siz": [247, 426, 432], "grouplasso": 419, "groupnorm": 279, "groupnorm_desc": 279, "grow": [25, 395, 432], "gt": [401, 428], "gte": [372, 376], "gtest": 269, "guarante": 373, "guard": 361, "gui": [320, 409, 410], "guid": [22, 270, 292, 302, 303, 312, 314, 320, 323, 324, 326, 330, 331, 332, 333, 334, 336, 337, 338, 339, 340, 342, 343, 344, 348, 349, 356, 363, 387, 401, 403, 420, 435], "guidanc": [270, 314, 335, 346, 349, 352, 378, 380], "guidelin": [270, 271, 319, 372], "guimar\u00e3": 301, "gunho": 432, "guskin": 301, "gxx": 426, "gxx_linux": 426, "h": [21, 256, 257, 263, 309, 313, 316, 317, 318, 321, 324, 340, 361, 363, 367, 385, 390, 432], "h100": 420, "h2": 307, "h2o_config": 307, "h2o_min_seqlen": 307, "h2oconfig": 307, "h384": 304, "ha": [9, 17, 52, 53, 54, 57, 266, 302, 309, 310, 319, 323, 330, 332, 338, 350, 351, 354, 355, 357, 358, 369, 372, 377, 387, 390, 391, 393, 394, 395, 399, 401, 405, 413, 421, 423, 429], "habana": [39, 40, 313, 316, 320, 323, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 336, 338, 339, 340, 342, 343, 344, 347, 363, 366], "habana_visible_devic": [314, 315, 347, 363, 366], "habanaai": 366, "had": [57, 361, 387], "haihao": 415, "half": [15, 402, 408, 429], "hallucin": [338, 350, 358, 372, 420], "hammer": 301, "han": 43, "hand": 420, "handcraft": 432, "handl": [24, 25, 33, 36, 44, 246, 271, 279, 309, 338, 354, 357, 358, 359, 372, 394, 403, 404], "handler": [6, 247], "hanwen": 415, "happen": [57, 256, 257, 389, 409], "happi": 378, "har": 346, "harass": 298, "hard": 260, "hardik": 301, "hardwar": [303, 309, 325, 357, 410, 412, 420], "harm": [9, 298, 319, 373, 409], "has_append_sum": [281, 413], "has_bia": 281, "has_binary_add": 413, "has_scale0": 281, "hash": [390, 400, 401], "hat": [425, 426], "have": [0, 9, 24, 32, 36, 44, 49, 57, 243, 264, 266, 269, 288, 298, 300, 302, 308, 309, 314, 315, 319, 323, 325, 330, 332, 335, 338, 345, 348, 349, 352, 355, 357, 358, 361, 364, 371, 372, 373, 376, 378, 380, 383, 384, 387, 388, 389, 390, 391, 392, 395, 396, 401, 405, 406, 407, 408, 412, 413, 415, 416, 417, 418, 419, 422, 423, 428, 432], "haven": 351, "haystack": [319, 372], "hbm": 302, "he": [377, 432], "head": [32, 36, 44, 57, 260, 395, 401, 407, 408, 426, 427], "head_dim": [32, 39, 40], "head_mask": [33, 36, 44], "head_num": [281, 407, 413], "head_nun": 407, "head_siz": [281, 407, 413], "header": [266, 309, 372], "header_supercell_tre": 266, "health": [364, 365, 366], "heart": 361, "heavy_ratio": 307, "height": [42, 256, 257, 266, 377], "hella": 288, "hellaswag": 346, "hello": [36, 44, 369], "helloswag": 426, "help": [24, 302, 309, 316, 319, 321, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 337, 338, 340, 342, 343, 344, 345, 351, 358, 361, 363, 372, 377, 383, 384, 385, 395, 400, 412], "helper": [1, 18, 24, 264], "helsinki": [304, 353], "henc": [319, 370], "hengyu": 415, "her": 361, "here": [57, 246, 295, 299, 302, 313, 317, 318, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 335, 336, 338, 340, 341, 343, 344, 349, 355, 356, 358, 361, 363, 366, 367, 369, 370, 371, 372, 379, 380, 381, 382, 386, 387, 390, 391, 392, 394, 395, 401, 409, 421, 423, 424, 426, 427, 429, 432, 438], "hessian": 288, "hf": [309, 314, 327, 328, 329, 332, 346, 347, 352, 377, 422, 432], "hf_access_token": 352, "hf_home": 427, "hidden": [29, 36, 44, 402, 425], "hidden_dim": [256, 257, 260], "hidden_s": [36, 44], "hidden_st": [36, 37, 39, 40, 41, 44], "hide": [36, 44, 246], "hierarchi": 25, "hierarchical_subsequ": 24, "high": [266, 293, 302, 319, 337, 361, 363, 367, 369, 372, 388, 396, 398, 405, 406, 409, 426, 427, 436], "higher": [25, 266, 319, 337, 346, 348, 349, 352, 370, 376, 377, 390, 407, 409, 413, 423, 426, 429], "higher_is_bett": 423, "highli": [305, 372, 421], "highlight": [316, 409], "hill": 361, "hint": [335, 380], "histogram": 25, "histor": 382, "histori": [0, 10, 25, 335, 380, 382], "hit": [370, 376], "hkunlp": 375, "hold": [258, 266], "home": [334, 361, 393, 427], "hook": 24, "hope": 405, "horizon": 361, "horizont": 400, "host": [35, 313, 314, 315, 316, 317, 318, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 338, 340, 343, 344, 347, 361, 363, 364, 365, 366, 372, 375], "host_dir": 315, "hostfil": [1, 314], "hostnam": 314, "hotmap": 412, "hour": 425, "hover": [335, 380], "how": [29, 246, 256, 257, 266, 269, 270, 271, 300, 308, 312, 319, 320, 323, 324, 326, 327, 328, 329, 330, 331, 332, 333, 335, 339, 342, 343, 348, 349, 350, 352, 353, 354, 355, 356, 358, 367, 368, 369, 376, 377, 380, 387, 388, 389, 392, 393, 395, 401, 402, 403, 413, 416, 419, 426], "howev": [47, 57, 319, 335, 338, 358, 370, 372, 373, 377, 380, 390, 391, 395, 396, 399, 403, 406, 409, 428, 432], "howpublish": 415, "hpp": [279, 280, 281, 390, 398, 413], "hpu": [1, 38, 42, 309, 312, 313, 314, 315, 316, 323, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 336, 338, 339, 340, 342, 343, 344, 347, 349, 352, 363, 375], "ht": [425, 426], "html": [270, 309, 316, 317, 319, 364, 365, 366, 370, 372, 389, 392, 394, 419, 420], "html64": 369, "htmlon": 351, "http": [9, 22, 25, 36, 38, 43, 260, 262, 263, 302, 308, 309, 313, 314, 315, 316, 317, 318, 319, 321, 322, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 335, 336, 337, 338, 340, 343, 344, 345, 347, 348, 349, 352, 354, 357, 359, 361, 362, 363, 364, 365, 366, 367, 368, 369, 372, 380, 383, 384, 386, 388, 394, 415, 420, 424, 432], "http_proxi": [313, 314, 315, 316, 317, 318, 363], "https_proxi": [313, 314, 315, 316, 317, 318], "hub": [35, 247, 270, 348, 364, 418, 426, 427], "hug": [35, 247, 272, 302, 309, 345, 348, 349, 354, 383, 384, 392, 420, 429], "huge": 25, "hugginfac": [314, 337, 349], "huggingfac": [9, 35, 302, 309, 314, 334, 338, 348, 349, 351, 359, 363, 364, 369, 371, 372, 393, 416, 418, 420, 426, 427, 432], "huggingface_pipelin": [309, 372], "huggingfaceh4": [347, 349], "huggingfacepipelin": [309, 372], "huiyan": 301, "hull": 266, "human": [319, 346, 347, 349, 369], "hungarian": [256, 257], "hungarianmatch": 258, "hw": [306, 308, 309, 312], "hybrid": [304, 319, 325, 420], "hypeparamet": 432, "hyperparamet": [246, 428, 432], "hypothesi": [354, 419], "i": [0, 17, 19, 20, 23, 24, 25, 28, 32, 33, 36, 37, 42, 44, 47, 49, 50, 52, 53, 54, 57, 62, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 151, 152, 153, 154, 155, 156, 157, 159, 160, 161, 162, 163, 164, 165, 166, 167, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 182, 183, 185, 186, 187, 188, 189, 190, 191, 193, 194, 196, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 216, 217, 218, 220, 221, 222, 223, 224, 225, 226, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 246, 247, 256, 257, 258, 259, 260, 261, 263, 266, 267, 272, 274, 277, 278, 279, 280, 281, 282, 286, 288, 289, 293, 298, 299, 300, 302, 303, 304, 305, 306, 307, 309, 313, 314, 315, 316, 317, 318, 319, 321, 322, 324, 325, 326, 327, 328, 329, 330, 332, 334, 335, 336, 337, 338, 340, 342, 343, 344, 345, 346, 347, 348, 349, 350, 351, 353, 354, 355, 357, 358, 359, 360, 362, 363, 364, 366, 367, 369, 371, 372, 373, 374, 375, 376, 377, 378, 380, 383, 384, 385, 386, 387, 388, 389, 390, 391, 392, 394, 395, 396, 398, 399, 400, 401, 402, 403, 404, 405, 406, 407, 408, 409, 410, 412, 413, 414, 415, 416, 417, 418, 419, 420, 421, 422, 423, 426, 427, 428, 429, 432, 436], "i0103": 366, "ic": 302, "icelak": 308, "icon": [377, 382], "icx": [320, 345, 383, 384], "icx02": 361, "id": [35, 44, 55, 57, 262, 327, 328, 329, 330, 332, 351, 361, 372, 375, 401, 402], "id2label": [302, 306], "id_rsa": [330, 332], "idea": [401, 409, 419], "ideal": 372, "ident": [73, 298, 303], "identif": 372, "identifi": [35, 57, 307, 319, 325, 370, 372, 402, 429], "idm": [358, 372], "idx": [23, 38, 57, 401], "ie": [256, 257, 260], "ieee": 25, "igeni": 301, "ignor": [33, 36, 44, 246, 321, 361, 387], "ignore_keys_for_ev": 246, "ikko": 301, "illia": [36, 44], "illustr": 407, "im_end": 281, "im_start": 281, "imag": [0, 9, 17, 20, 256, 257, 259, 260, 267, 302, 304, 313, 317, 318, 319, 321, 335, 337, 347, 350, 361, 378, 380, 392, 395, 403, 409], "image2imag": [309, 321], "image2text": 348, "image_aspect_ratio": 350, "image_nam": [349, 366], "image_root_path": 334, "image_server_ip": 334, "image_tag": 349, "imagenet": 17, "imageri": 298, "imbal": 372, "imbusch": 301, "img": [20, 21], "img_mask": 42, "img_new": 20, "immedi": 24, "impact": [319, 354, 361, 372, 403], "impl_list_": 279, "impl_list_item_t": [278, 279], "impl_nthr": 280, "impl_nthr_": [280, 401], "implement": [9, 25, 47, 293, 302, 307, 326, 327, 328, 329, 330, 332, 345, 354, 367, 372, 377, 383, 384, 387, 390, 391, 395, 398, 399, 400, 402, 404, 405, 406, 407, 408, 410, 413, 432, 436], "implicit": 404, "import": [0, 4, 36, 44, 45, 55, 57, 289, 300, 302, 303, 306, 307, 309, 311, 314, 316, 319, 321, 332, 347, 349, 361, 369, 370, 371, 373, 374, 375, 376, 387, 388, 390, 392, 395, 396, 400, 401, 405, 416, 417, 418, 419, 421, 422, 423, 426, 427, 428, 429, 432], "importerror": [426, 427], "impos": 432, "impract": 372, "imprecis": 390, "impress": 432, "improv": [300, 302, 319, 347, 350, 361, 369, 370, 372, 376, 389, 400, 402, 404, 405, 409, 423, 429, 432], "in8": 158, "in_dt": 281, "in_end": 281, "in_pattern": 57, "in_start": 281, "inaccuraci": 372, "inappropri": 298, "inc": [28, 35, 246, 393], "incid": 298, "incit": 428, "includ": [18, 24, 25, 256, 257, 258, 264, 270, 279, 280, 281, 298, 301, 302, 304, 309, 314, 315, 320, 323, 324, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 336, 337, 338, 339, 340, 342, 343, 344, 347, 354, 357, 361, 363, 368, 369, 372, 373, 375, 376, 388, 389, 390, 398, 401, 409, 413, 415, 421, 423], "inclus": [24, 298], "incom": [25, 367, 408], "incomplet": [35, 372], "inconsist": 406, "incorpor": [25, 429], "incorrect": [338, 358, 361, 395], "increas": [307, 370, 402, 432], "independ": 372, "indetermin": 243, "index": [13, 23, 25, 36, 44, 55, 268, 281, 319, 332, 361, 372, 391, 394, 395, 421, 428], "index2an": [267, 268, 351], "index_file_jsonl_path": 376, "index_i": 258, "index_j": 258, "indexa": 402, "indexb": 402, "indic": [4, 23, 25, 33, 36, 44, 256, 257, 258, 281, 288, 314, 349, 395, 400, 401, 407, 409, 413, 416], "individu": [256, 257, 265, 298, 372], "indptr": 281, "ineffici": 429, "inevit": 405, "infer": [27, 38, 44, 49, 55, 60, 246, 272, 304, 306, 309, 312, 317, 318, 321, 325, 327, 328, 329, 337, 347, 358, 363, 367, 369, 385, 386, 387, 389, 390, 391, 392, 396, 403, 405, 406, 408, 413, 417, 420, 423, 428, 432, 437], "infer_framework_load_model": 45, "infer_task": 246, "inferen": 421, "inferenc": [158, 371], "inference_asr": 353, "inference_transl": 353, "inference_translation_revers": 353, "inference_tt": 353, "influenc": [372, 376, 387, 391], "influenti": 307, "info": [6, 57, 61, 62, 243, 244, 302, 307, 314, 345, 348, 349, 352, 354, 361, 383, 384, 387, 410, 422], "inform": [47, 256, 257, 270, 271, 274, 277, 282, 286, 298, 300, 302, 303, 316, 334, 338, 345, 358, 361, 371, 372, 378, 383, 384, 388, 389, 397, 401, 404, 411, 412, 413, 419, 420, 423, 424, 425, 426, 435], "inher": [319, 372, 373, 377], "inherit": [9, 32, 289, 303, 387, 394, 418, 419, 423], "init": [46, 252, 330, 332, 386, 388, 401, 432], "init_alpha": 247, "init_db_ai_photo": 334, "init_method": 252, "init_quant": 400, "init_similar_cache_from_config": 370, "initi": [33, 36, 44, 55, 57, 62, 86, 95, 281, 309, 321, 347, 357, 365, 369, 372, 373, 382, 400, 401, 405, 418, 419, 429], "initialis": 402, "inject": [246, 401], "injector": 437, "inlin": [278, 279, 280, 400], "inner": 407, "innerproduct": [55, 73, 158, 389, 390, 398], "innerproductreshapefus": [145, 150], "innerproductwithbiasgelu": 150, "innerproductwithslic": 150, "innerproductwithswish": 150, "innov": [272, 302, 378, 420, 421], "inp": 24, "inplac": 432, "input": [5, 9, 11, 12, 15, 23, 24, 25, 27, 32, 33, 36, 37, 44, 45, 52, 53, 55, 57, 62, 73, 181, 243, 244, 246, 260, 264, 265, 281, 289, 302, 303, 305, 306, 317, 319, 335, 336, 349, 350, 355, 356, 361, 363, 372, 373, 375, 380, 382, 388, 389, 390, 391, 394, 396, 404, 406, 407, 409, 413, 418, 421, 425, 428, 429, 432], "input_0": [55, 388, 390], "input_1": [55, 388, 390], "input_2": [55, 388, 390], "input_data": [55, 57, 150, 388], "input_dict": 264, "input_dim": [256, 257], "input_dt": [281, 400, 413], "input_fil": [150, 376], "input_id": [33, 36, 37, 40, 41, 44, 57, 306, 388, 396, 428, 429, 432], "input_mask": [57, 306, 388], "input_model": [302, 337, 389, 392, 393], "input_name_to_nod": 62, "input_path": [316, 319, 324, 338, 358, 372, 375], "input_pattern": [57, 395], "input_shap": [128, 389, 390, 413], "input_tensor": [36, 44, 52, 53, 54, 57, 62, 95, 243, 244, 387, 391], "input_tensor_nam": 389, "input_typ": 389, "inputdata": [154, 387], "inputfil": 155, "inputs_emb": [33, 36, 40, 41, 44], "inputs_shap": [55, 390], "inquire_config_item": 55, "insert": [55, 57, 392, 394, 395, 400, 401, 423], "insert_bf16_nod": 150, "insert_environ_info": 57, "insert_nod": 55, "insert_pattern": 57, "insert_quant_info": 57, "insert_quant_nod": 150, "insertbf16nod": 156, "insertquantnod": 157, "insid": [33, 57, 266, 314, 315, 349, 366, 391, 394, 404, 406], "insight": [319, 325, 378], "inspir": [314, 319, 345, 349, 361, 372, 383, 384], "inst": 389, "instal": [313, 314, 315, 316, 317, 318, 321, 335, 341, 346, 348, 349, 350, 351, 352, 358, 359, 360, 366, 367, 372, 374, 376, 377, 379, 380, 381, 382, 387, 412, 420, 426, 427, 432, 435], "install_chatbot_cpu": 361, "install_chatbot_gpu": 361, "install_rag_gpu": 362, "instanc": [6, 28, 246, 247, 248, 268, 289, 298, 309, 315, 322, 332, 348, 351, 365, 366, 372, 388, 389, 397, 411, 414, 416, 417, 418, 425, 426], "instance_group": [365, 366], "instanti": 35, "instead": [0, 25, 36, 44, 314, 349, 350, 361, 372, 402], "instruct": [9, 268, 295, 303, 309, 314, 315, 316, 319, 327, 328, 329, 330, 332, 335, 336, 337, 338, 342, 346, 351, 352, 358, 376, 380, 391, 400, 403, 405, 408, 409, 410, 413, 420, 421, 423, 428, 432, 438], "instruction_tuning_pipelin": 314, "instructor": [372, 375], "instrument": 24, "insult": 298, "int": [6, 23, 27, 28, 32, 36, 37, 39, 40, 57, 246, 247, 264, 281, 289, 371, 372, 387, 400, 401, 402, 405, 421], "int32": [62, 302, 388, 421], "int32_bia": 62, "int32_t": 281, "int4": [319, 320, 345, 375, 383, 384, 422, 425, 429, 432], "int4_clip": [421, 432], "int4_fullrang": [319, 421], "int4_gptq": 427, "int64_t": 281, "int8": [28, 45, 62, 246, 269, 270, 302, 304, 319, 375, 389, 390, 392, 398, 401, 406, 407, 413, 420, 421, 422, 423, 425, 426, 428, 429, 432, 439], "int8_bf16_mixed_precision_check": 150, "int8_bia": 62, "int8_bias_scal": 62, "int8_bias_zero_point": 62, "int8_lut": 401, "int8_lut_acc_test": 394, "int8_lut_optim": 394, "int8_model_path": 390, "int8_t": 281, "int8bf16mixedprecisioncheck": 158, "intact": 361, "integ": [25, 262, 305, 314, 349, 375, 390, 391, 409, 413, 419, 423, 428, 432], "integr": [309, 319, 335, 338, 357, 358, 360, 361, 363, 369, 372, 374, 380, 429, 439], "intel": [4, 8, 35, 247, 269, 270, 271, 289, 299, 300, 306, 309, 310, 311, 313, 314, 315, 316, 317, 318, 319, 320, 321, 323, 324, 325, 326, 330, 331, 332, 333, 334, 336, 337, 338, 339, 340, 342, 343, 344, 345, 346, 348, 350, 352, 354, 355, 360, 363, 364, 365, 366, 367, 369, 370, 372, 374, 375, 383, 384, 386, 387, 388, 397, 399, 410, 411, 415, 417, 418, 419, 420, 423, 424, 425, 426, 427, 428, 429], "intel_domain": [314, 349], "intel_extension_for_pytorch": 432, "intel_extension_for_transform": [289, 302, 303, 306, 307, 308, 309, 315, 317, 319, 322, 340, 347, 349, 354, 356, 358, 361, 362, 364, 365, 366, 369, 370, 371, 372, 373, 374, 387, 388, 390, 392, 395, 396, 398, 413, 416, 417, 418, 419, 421, 422, 423, 428, 429, 432], "intellig": [272, 420], "intend": [256, 257, 300, 334, 336, 337, 338, 340, 343, 344, 363], "intens": [372, 385], "intent": [11, 15, 372], "intentdetector": 372, "interact": [309, 316, 321, 378], "interact_featur": 150, "interactfeatur": 159, "interconnect": 372, "interest": [25, 298, 316], "interfac": [28, 33, 36, 44, 49, 245, 279, 305, 355, 369, 378, 386, 398], "intermedi": [36, 44, 387, 392, 395, 409, 423], "intermediatelayersknowledgedistillationlossconfig": 303, "intermediatelayersloss": 303, "intern": [25, 57, 376, 391, 405], "internation": 376, "internet": [313, 314, 316, 355, 372], "interpol": 264, "interpret": 316, "intersect": [25, 266], "interv": [24, 413], "intrins": 403, "introduc": [295, 309, 319, 333, 336, 337, 338, 339, 342, 353, 357, 358, 361, 372, 399, 400, 401, 402, 403, 405, 407, 408, 409, 423, 428, 438], "introduct": 413, "intuit": [395, 405, 432], "invalid": 413, "invers": [30, 408], "investig": [298, 409], "invit": 270, "invok": 24, "invoke_with_optional_arg": 24, "involv": [316, 319, 321, 325, 372], "io": [281, 361, 364, 365, 394], "iob": 266, "iou": [25, 260, 263, 266], "ip": [313, 314, 316, 317, 318, 322, 330, 332, 334, 337, 345, 349, 363, 375, 383, 384], "ipc": [313, 314, 315, 316, 317, 318, 347, 366], "ipex": [269, 308, 314, 348, 349, 369, 423, 428, 432], "ipex_opt_llm": 247, "ipykernel": 322, "ipynb": 322, "ir": [55, 337, 387, 388, 389, 390, 396, 410, 412, 439], "ir_path": 392, "irc_na": 332, "irq": [397, 411], "is_avail": 374, "is_decod": [36, 44], "is_mast": 264, "is_null_numpy_valu": 25, "is_rel": [250, 306, 416], "is_supported_onnx_graph": 62, "is_supported_onnx_nod": 62, "is_thing_map": 260, "isa": [390, 398, 400, 405, 408, 409, 410], "ise": [309, 313, 331], "issu": [271, 288, 295, 298, 300, 302, 314, 332, 335, 349, 361, 370, 372, 380, 406, 423, 429, 438], "it_per_cor": 414, "itai": 301, "itali": [36, 377], "item": [25, 57, 289, 302, 306, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 338, 340, 343, 344, 363, 372, 376], "iter": [25, 28, 247, 266, 289, 302, 388, 390, 394, 396, 400, 404, 407, 408, 413, 414, 428], "iteration4": 389, "iterator_get_next": 83, "iterator_v2": [83, 387], "iteratorgetnext": 84, "iteratorv2": [85, 387], "itrex": [302, 308, 320, 322, 348, 349, 361, 362, 366, 386, 388, 421], "itrex_v": [314, 315, 347], "itrexquantizationconfigmixin": 247, "its": [9, 24, 57, 181, 266, 298, 300, 303, 309, 313, 321, 335, 346, 347, 352, 355, 360, 369, 372, 374, 377, 380, 387, 388, 391, 392, 395, 404, 405, 406, 409, 412, 413, 415, 429, 432], "itself": [369, 372, 410], "itt": 410, "j": [272, 302, 304, 309, 386, 387, 388, 398, 404, 409, 410, 413, 428, 432], "j8": 330, "jakob": [36, 44], "jan": 420, "japanes": 369, "jax": 354, "jd": [278, 279, 280, 281, 401, 413], "jemalloc": [323, 324, 331, 334, 336, 337, 338, 340, 343, 344, 354, 357, 358, 363, 368], "ji": 432, "jiafu": 301, "jianyu": 301, "jiqe": 301, "jit": [27, 28, 246, 278, 279, 280, 281, 289, 293, 315, 398, 400, 401, 404, 408, 413, 436], "jit_binary_injector": 400, "jit_domain": 281, "jit_eltwiseop_t": 401, "jit_gener": [400, 401], "job": 313, "jogesh": 301, "johnson": 377, "join_with_spac": 266, "jonatasgrosman": 353, "jonathan": 301, "jone": [36, 44], "journalist": 349, "journei": 361, "jpg": [350, 412], "json": [247, 267, 309, 313, 314, 316, 317, 318, 319, 321, 324, 340, 349, 350, 351, 354, 361, 363, 367, 372, 375, 376, 377, 422], "json_file_path": 247, "jsonl": [319, 372, 376], "jstor": 25, "juli": [377, 420], "jump": 340, "jun": 361, "june": 420, "jung": 432, "jupyt": [320, 322, 347], "just": [24, 25, 44, 57, 302, 314, 315, 316, 349, 355, 356, 366, 372, 387, 388, 389, 390, 391, 392, 395, 401, 409, 412, 416, 420, 422, 430, 432], "k": [25, 32, 264, 281, 372, 376, 390, 400, 402, 403, 404, 405, 407, 408, 409, 411, 413, 432], "k_bia": 281, "k_dim_dp": 405, "k_proj": [346, 347, 349, 354], "k_scale": 281, "k_weight": 281, "kaiser": [36, 44], "kamboj": 301, "karnin": 25, "karrasdiffusionschedul": 9, "kd": [279, 303], "kdim": 281, "kdp": 279, "keep": [0, 25, 266, 340, 345, 367, 383, 384, 391, 427], "keep_dim": 387, "keep_high": 266, "kei": [25, 32, 36, 39, 40, 44, 47, 55, 57, 62, 243, 246, 247, 256, 257, 267, 272, 302, 316, 321, 330, 332, 334, 335, 361, 367, 369, 372, 380, 389, 390, 391, 400, 401, 403, 429], "keithito": 262, "kept": 29, "ker_kind": [280, 398, 401], "ker_kind_": [280, 401], "ker_per_batch": 281, "ker_prop": [280, 398, 401], "ker_prop_": [280, 401], "kernel": [269, 273, 281, 297, 299, 319, 322, 388, 389, 394, 397, 399, 400, 401, 403, 404, 406, 407, 408, 409, 410, 411, 412, 440], "kernel_config": [390, 413], "kernel_desc_proxi": 279, "kernel_desc_t": 279, "kernel_kind": [279, 280, 401], "kernel_nam": [390, 413], "kernel_prop": [280, 401], "kernel_proxi": 279, "kernel_t": [278, 279], "kernel_typ": [413, 414], "kevin": 301, "kevinintel": 299, "key_stat": [39, 40], "key_value_st": 37, "keyboard": 355, "keygen": [330, 332], "keynot": 420, "keyword": [2, 25, 246, 372], "kgco2e": 385, "kim": 432, "kind": [57, 138, 280, 314, 349, 361, 365, 366, 394, 406, 413], "kind_cpu": 366, "kind_gpu": 365, "kindli": [0, 270], "kll": 25, "km": 390, "kmp_affin": 354, "kmp_blocktim": 354, "kmp_set": 354, "kn": 390, "know": [390, 396, 403], "knowledg": [246, 272, 302, 316, 319, 320, 335, 341, 361, 372, 379, 380, 381, 432], "knowledge_a100_url": 379, "knowledge_gaudi2_url": 379, "knowledge_url": [335, 380], "knowledgebas": 372, "knowledgedistillationloss": 303, "knowledgedistillationlossconfig": 303, "knowledgeloss": 303, "known": [293, 302, 338, 358, 361, 398, 436], "korat": 301, "kpo": 281, "krishna2020": 304, "kullback": 303, "kv": [39, 40, 57, 307, 429], "kv_cache_compress": 307, "kv_cache_inc_s": 332, "kwarg": [14, 17, 24, 25, 28, 32, 33, 35, 36, 40, 41, 44, 47, 61, 128, 246, 247, 372, 432], "kwon": 432, "kxn": [413, 421], "l": [30, 303, 331, 350], "l1": [256, 257, 399], "l2": [399, 405, 413], "l6": 304, "l_mpi_oneapi_p_2021": 332, "la": [304, 402], "lab": [43, 349, 420], "label": [33, 36, 44, 246, 256, 257, 258, 260, 266, 372, 414, 418], "label2id": [302, 306], "label_id": 302, "labor": 372, "lack": 372, "laion": 350, "lake": 302, "lamabda": 372, "lamb": 288, "lambada": [289, 426, 428], "lambada_openai": [426, 427], "lambda": [302, 425], "lambdalab": 304, "lamini": 428, "landmark": [21, 377], "lang": 25, "langchain": [14, 319, 320, 357], "langchain_commun": [309, 372], "langchain_cor": [309, 372], "languag": [4, 33, 36, 44, 272, 298, 302, 304, 319, 325, 340, 342, 346, 349, 350, 352, 354, 359, 363, 368, 371, 372, 373, 418, 420, 421, 422, 428, 429, 430, 432], "laptop": [327, 328, 329, 420], "larg": [4, 9, 14, 25, 303, 304, 319, 325, 346, 349, 350, 353, 354, 363, 371, 372, 373, 375, 376, 395, 396, 397, 399, 402, 405, 406, 407, 413, 420, 421, 422, 428, 430, 432], "large_wei_threshold": 405, "large_weight_threshold": 413, "larger": [17, 25, 246, 372, 376, 428], "largest": [25, 266], "lasso": 304, "last": [0, 17, 23, 24, 25, 36, 44, 57, 246, 361, 391, 395, 396, 399, 404, 405, 407, 423, 428], "last_lay": 24, "last_layer_shap": 150, "lastlayershap": 160, "latanc": 389, "latenc": [28, 289, 302, 304, 373, 382, 385, 389, 397, 402, 420, 423, 425, 429], "latency_constraint": 28, "latent": [9, 369], "later": [24, 25, 57, 266, 349, 387, 395], "latest": [315, 317, 318, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 347, 357, 363, 366, 368, 372, 383, 384, 410, 424, 426, 427], "latrang": 73, "latter": [355, 408], "launch": [314, 316, 321, 327, 328, 329, 345, 347, 348, 349, 352, 361, 363, 383, 384, 408, 420], "launcher": 1, "law": 307, "layer": [23, 24, 28, 29, 36, 44, 47, 57, 256, 257, 261, 350, 389, 395, 400, 404, 407, 419, 420, 421, 428, 437], "layer1": 24, "layer2": 24, "layer_0": 395, "layer_1": 395, "layer_2": 395, "layer_config": [29, 36, 44], "layer_dropout": 29, "layer_dropout_bound": [28, 29], "layer_dropout_prob": [28, 29], "layer_head_mask": 37, "layer_idx": 32, "layer_norm": [83, 150, 387], "layer_norm_with_reduce_mean": [150, 387], "layer_norm_with_transpos": 150, "layer_wis": 247, "layernam": 24, "layernorm": [49, 57, 86, 161, 387, 391, 395, 398, 413], "layernorm_ba": 279, "layernorm_ba_data_t": 281, "layernorm_ba_desc": [279, 400], "layernorm_ba_param_t": 281, "layernormalized_spmm": 279, "layernormalized_spmm_desc": 279, "layernormwithreducemean": [162, 387], "layernormwithtranspos": 163, "layernrom": 406, "layout": [403, 406, 407, 408], "layternorm": 406, "lazi": [57, 340], "lazyimport": 57, "ld_preload": [354, 426, 427], "le": 401, "lead": [309, 372, 377, 396, 407, 409, 432], "leaderboard": [309, 346, 347, 372, 420], "leadership": 298, "learn": [17, 37, 259, 272, 317, 319, 320, 346, 347, 361, 363, 372, 392, 401, 417, 420, 423, 425], "learning_r": [314, 346, 347, 348, 349, 352, 354, 376, 422], "least": [25, 47, 57, 258, 266, 300, 406], "leav": [25, 351, 361, 391, 407, 409, 413], "lecun": 419, "lee": 432, "left": [23, 24, 36, 44, 57, 260, 266, 335, 361, 380, 403, 407, 409], "legaci": [36, 44], "legal": 435, "legend": 407, "leibler": 303, "len": [256, 257, 258, 263, 387, 388, 395, 407], "length": [28, 29, 36, 44, 57, 272, 288, 302, 314, 349, 350, 361, 372, 376, 391, 395, 400, 413, 420, 423, 425, 429, 430], "length_config": [28, 36, 44, 306], "length_drop_prob": 29, "length_drop_ratio": 29, "length_drop_ratio_bound": [28, 29], "lengthi": 316, "lengthier": 372, "less": [29, 266, 289, 303, 370, 390, 405, 409, 419], "lesson": 361, "let": [355, 369, 389, 394, 402, 403, 428], "level": [6, 9, 44, 61, 268, 270, 272, 298, 302, 319, 326, 330, 332, 370, 378, 390, 401, 404, 412, 432], "levequ": 25, "leverag": [272, 302, 303, 306, 309, 319, 337, 359, 370, 372, 420, 421, 426, 429], "lf": [334, 338, 348, 349, 357, 363, 368], "lh": 407, "li": 403, "liangliang": 301, "lib": [300, 354, 361, 386, 388], "lib64": [426, 427], "liberti": 25, "libgl": [308, 334], "libgl1": [308, 309, 334], "libiomp": 354, "libiomp5": 354, "libjpeg": 361, "libkernellib": 386, "libneural_engin": 386, "libpng": 361, "librari": [8, 9, 293, 308, 321, 323, 324, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 348, 349, 354, 357, 359, 363, 368, 370, 400, 401, 421, 426, 427, 436], "libsm": 308, "libsm6": 308, "libssl1": 369, "libstdc": [426, 427], "libstdc_path_": [426, 427], "libstdcxx": 426, "libtcmalloc": 354, "libxext": 308, "libxext6": 308, "libxrend": 308, "libxsmm": 332, "licens": [270, 300, 372], "life": [361, 396], "lifelong": 361, "lifengwang": 301, "lifetim": 361, "light": 36, "lighter": 407, "lightweight": [346, 347], "lihongzhi": 319, "like": [25, 49, 52, 53, 54, 57, 243, 302, 303, 306, 309, 314, 317, 318, 319, 338, 345, 347, 349, 352, 353, 354, 355, 358, 363, 364, 365, 366, 369, 370, 372, 376, 383, 384, 385, 387, 388, 389, 390, 391, 392, 395, 396, 400, 401, 403, 410, 416, 417, 419, 423, 428, 432], "likelihood": [256, 257, 346, 347, 372], "limit": [25, 28, 303, 335, 372, 380, 404, 408, 426, 427, 429, 432], "limitless": 420, "lin": 432, "line": [1, 265, 267, 314, 319, 336, 337, 338, 348, 349, 356, 358, 361, 376, 385, 387, 390, 399, 406, 407, 409, 414, 432], "linear": [28, 303, 401, 404, 407, 421, 432], "lineup": [309, 321], "link": [270, 300, 319, 320, 321, 322, 325, 327, 328, 329, 333, 338, 339, 342, 361, 365, 372, 379, 388, 394, 430, 432], "linux": [308, 323, 324, 326, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 357, 361, 363, 368, 383, 384, 386], "list": [21, 23, 24, 25, 28, 36, 40, 41, 44, 47, 52, 53, 54, 55, 57, 62, 95, 243, 244, 246, 256, 257, 258, 260, 262, 264, 265, 267, 268, 270, 302, 303, 309, 321, 336, 337, 345, 348, 351, 371, 372, 375, 376, 383, 386, 387, 388, 391, 396, 400, 401, 414, 419, 426, 427, 428], "list2str": [57, 387], "listconstruct": 73, "listen": 355, "listunpack": 73, "liter": 23, "littl": [361, 423, 426, 427, 428, 429, 432], "liuhaotian": [350, 351], "live": 361, "livestream": 420, "lkk12014402": 299, "ll": [327, 328, 329, 332, 402], "llama": [32, 309, 314, 319, 323, 332, 346, 347, 350, 352, 363, 372, 377, 396, 420, 422, 428, 432], "llama2": [314, 366], "llama2_7b_rm": 352, "llama2_7b_s": 352, "llama2_ds_zero3_config": 349, "llama2_peft_finetuned_model": [314, 349], "llama_embed": 150, "llama_matmulwithtranspos": 150, "llama_peft_finetuned_model": 422, "llama_postprocess": 150, "llama_rotary_pos_emb": 150, "llamaattent": 32, "llamaconfig": 32, "llamaembed": 164, "llamaflashattention2": 32, "llamaforcausallm": 307, "llamamatmulwithtranspos": 165, "llamapostprocess": 166, "llamaroraryposemb": 167, "llamasdpaattent": 32, "llava": 350, "llava1": 351, "llion": [36, 44], "llm": [4, 8, 11, 12, 15, 43, 309, 314, 320, 321, 323, 324, 325, 326, 327, 328, 329, 330, 332, 336, 337, 338, 340, 346, 347, 349, 350, 354, 358, 361, 363, 364, 366, 367, 370, 371, 372, 373, 375, 420, 421, 422, 427, 428, 432], "llm_carbon_calc": 385, "llm_tt": 340, "llma_url": [335, 380], "lm": [20, 22, 325, 349], "lm3d": 20, "lm_eval_task": 349, "lm_new": 20, "lmsdiscreteschedul": 9, "ln": [261, 308, 309], "ln_node_idx": 387, "ln_pattern": 395, "lo": 403, "load": [9, 10, 19, 25, 30, 33, 36, 44, 45, 60, 246, 247, 267, 302, 309, 320, 361, 372, 387, 388, 389, 390, 392, 396, 399, 401, 402, 403, 404, 409, 428, 432], "load_cached_st": 25, "load_dataset": [289, 302], "load_graph": 302, "load_in_4bit": [422, 432], "load_in_8bit": 432, "load_mat": 18, "load_metr": 302, "load_param": 401, "load_state_dict": 25, "load_stor": 30, "load_store_fil": 28, "load_tf_weights_in_bert": 36, "load_weight": 55, "loaded_model": 432, "loader": [25, 49, 58, 390, 392, 395], "loading_config": 319, "loadingmodelconfig": 319, "loc": 361, "local": [35, 246, 269, 314, 315, 316, 319, 321, 325, 330, 332, 334, 338, 345, 348, 349, 351, 358, 359, 361, 363, 366, 370, 371, 372, 375, 379, 383, 384, 387, 399, 402, 405, 419, 427, 429], "local_step": 47, "localhost": [313, 314, 315, 316, 318, 321, 322, 324, 330, 332, 340, 347, 353, 361, 364, 365, 366, 367, 372], "localmemori": 402, "locat": [57, 121, 309, 312, 327, 328, 329, 332, 371, 372, 373, 377, 387, 388, 391, 395, 409, 413, 424], "lock": 419, "log": [6, 61, 256, 257, 265, 302, 309, 330, 345, 349, 361, 375, 383, 384, 388, 394], "log_fil": [6, 309, 375], "log_level": [6, 314, 348, 349, 352, 354, 422], "log_nam": 265, "log_softmax": 83, "log_with": 352, "logger": [6, 58, 410], "logging_step": [314, 346, 347, 348, 349, 352, 354, 376, 422], "logic": [39, 40, 351, 408, 410], "login": 334, "logit": [36, 44, 256, 257, 258, 302, 303, 306, 388], "logo": 415, "logsoftmax": [87, 279], "logsoftmax_desc": 279, "long": [24, 314, 315, 372, 395], "longer": [314, 349, 395], "longest": [57, 395], "longform": 304, "longtensor": [36, 40, 41, 44], "look": [314, 349, 372, 387, 389, 401, 402], "loop": [25, 73, 387, 400, 402, 407], "lora": [314, 349, 350, 352, 354, 422, 425], "lora_all_linear": [346, 347], "lora_alpha": [346, 347, 349, 354], "lora_dropout": [346, 347], "lora_rank": [346, 347, 349], "lora_target_modul": [346, 347, 349, 354], "loraconfig": 422, "loss": [33, 36, 44, 246, 256, 257, 260, 303, 314, 349, 423, 432], "loss_bbox_unsc": 265, "loss_box": [256, 257], "loss_cardin": [256, 257], "loss_label": [256, 257], "loss_mask": [256, 257], "lossi": 423, "lot": 403, "louie": 301, "low": [246, 266, 302, 306, 354, 370, 399, 406, 408, 417, 420, 421, 422, 423, 432, 439], "lower": [30, 266, 268, 354, 372, 409, 417, 423, 432], "lower_all_tupl": 150, "lower_bound": 413, "lower_constraint": 30, "loweralltupl": 168, "lpta": 402, "lr": [247, 432], "lr_scheduler_typ": [346, 347], "lsap": 258, "lscpu": 331, "lt": [397, 411], "luca": 301, "lukasz": [36, 44], "luoyu": 299, "lut": [281, 398, 400, 401, 413], "lv": 432, "lvliang": 299, "lvwerra": 304, "m": [25, 57, 263, 281, 302, 303, 304, 331, 332, 334, 348, 349, 355, 371, 385, 389, 390, 397, 399, 402, 403, 404, 405, 406, 408, 409, 411, 413, 425, 426, 427, 432], "m150": 432, "m_tile": 281, "ma": 301, "mac": 30, "machin": [361, 365, 366, 379, 413], "made": [25, 314, 349, 361, 372, 423], "magicod": [309, 313, 331], "magnitud": 419, "mai": [49, 57, 278, 279, 280, 281, 298, 300, 302, 322, 335, 351, 367, 372, 373, 380, 387, 390, 395, 396, 402, 403, 404, 406, 407, 408, 409, 413, 415, 420, 423, 429, 432], "mail": [298, 349], "main": [22, 36, 43, 44, 47, 57, 246, 316, 319, 347, 352, 354, 356, 357, 360, 368, 369, 391, 406, 413], "main_eval_onli": 351, "main_parse_and_ev": 351, "mainli": [347, 372, 390, 405, 406], "maintain": [0, 269, 270, 288, 298, 300, 302, 307, 372, 373, 391, 396, 424, 432], "major": [399, 405, 406, 408, 409, 423], "majotr": 406, "make": [13, 24, 32, 38, 57, 95, 181, 246, 247, 266, 268, 289, 298, 308, 309, 314, 315, 316, 317, 318, 321, 323, 324, 326, 330, 332, 335, 337, 340, 343, 345, 349, 350, 355, 361, 362, 366, 369, 372, 377, 380, 382, 383, 384, 386, 387, 388, 398, 399, 400, 401, 402, 404, 405, 406, 407, 410, 413, 428], "make_load": 25, "make_posit": 44, "makeiter": 73, "maktukmak": 301, "malloc": [354, 396], "mamou": 301, "manag": [0, 24, 25, 345, 367, 372, 383, 384, 394, 396], "mandarin": 353, "mandatori": [57, 314, 349, 371], "mani": [29, 338, 358, 361, 372, 387, 389, 391, 400, 402, 403, 406, 408, 413, 428], "manipul": [263, 369], "manual": [314, 347, 349, 355, 410], "manufactur": [397, 411], "map": [52, 53, 57, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 74, 75, 76, 77, 78, 79, 80, 81, 82, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 122, 123, 124, 125, 126, 127, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 151, 152, 153, 154, 155, 156, 157, 159, 160, 161, 162, 163, 164, 165, 166, 167, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 182, 183, 185, 186, 187, 188, 189, 190, 191, 193, 194, 196, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 216, 217, 218, 221, 222, 223, 224, 225, 226, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 246, 265, 302, 303, 365, 373, 396, 399, 423], "map_and_batch_dataset": [83, 387], "mapandbatchdataset": [88, 387], "mapping_config": 57, "mapping_dict": 57, "mar": 377, "march": 420, "margin": 2, "mark": [24, 361], "markdown": [319, 372], "marktechpost": 420, "marvel": 377, "mask": [20, 33, 36, 44, 256, 257, 260, 263, 281, 304, 373, 400, 401, 403, 405, 408], "mask_mock1": 401, "mask_new": 20, "masked_fil": 73, "maskedlmoutput": [36, 44], "maskheadsmallconv": 260, "maskinun": 304, "masks_to_box": 263, "master": [264, 314, 349, 419], "master_addr": [252, 314, 349], "master_address": [314, 349], "master_port": [252, 347], "mata": 281, "matb": 281, "matc": 281, "match": [24, 25, 57, 247, 256, 257, 258, 303, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 338, 340, 341, 343, 344, 363, 370, 372, 381, 390, 391, 395, 404, 407], "match_criteria": 266, "match_result": 57, "match_threshold": 266, "matcher": [215, 256, 257, 392], "matchtyp": 373, "matd": 281, "mate": 281, "math": 423, "mathemat": [401, 428], "matmul": [39, 40, 62, 73, 83, 158, 305, 389, 391, 392, 395, 398, 408, 413, 432, 437], "matmul_346": 389, "matmul_357": 389, "matmul_358": 389, "matmul_data_t": 281, "matmul_fp8_data_t": 281, "matmul_fp8_param_t": 281, "matmul_input": 281, "matmul_io": 281, "matmul_io_max": 281, "matmul_output": 281, "matmul_param_t": 281, "matmul_u8_data_t": 281, "matmul_with_bia": 150, "matmul_with_bias_add": 150, "matmul_with_bias_gelu": 150, "matmul_with_bias_relu": 150, "matmul_with_bias_sigmoid": 150, "matmul_with_bias_tanh": 150, "matmul_with_bias_unsqueez": 150, "matmul_with_transpos": 150, "matmul_with_transpose_scale_add": 150, "matmulwithbia": [73, 169], "matmulwithbiasadd": [73, 170], "matmulwithbiasgelu": [73, 171], "matmulwithbiasrelu": [73, 172], "matmulwithbiassigmoid": [73, 173], "matmulwithbiastanh": [73, 174], "matmulwithbiasunsqueez": 175, "matmulwithtranspos": [176, 177], "matmulwithtransposescaleadd": 177, "matplotlib": 265, "matric": [62, 354, 402, 407, 408, 432], "matrix": [25, 263, 288, 302, 306, 399, 402, 403, 404, 406, 407, 408, 409, 413, 419, 433], "matter": 430, "max": [25, 29, 73, 246, 247, 302, 308, 309, 324, 349, 371, 372, 376, 396, 397, 400, 402, 404, 409, 411, 423, 432], "max_chuck_s": 372, "max_eval_sampl": 354, "max_input_length": 432, "max_input_shapes_list": 396, "max_length": [28, 289, 302, 346, 347, 375], "max_new_token": [317, 363, 371, 426, 427, 428, 429, 432], "max_prompt_length": [346, 347], "max_seq_length": [29, 30], "max_sparsity_ratio_per_op": 28, "max_step": [346, 347], "max_thread": 361, "max_tile_k": 405, "max_token": 321, "max_train_sampl": [354, 422], "max_trial": 423, "maxim": 2, "maxima": 266, "maximum": [28, 37, 346, 347, 349, 372, 396, 397, 411, 423], "mayb": [57, 347, 390, 409, 420], "mb": [304, 385, 425], "mbzuai": 428, "mc": 346, "mc1": 425, "mc2": 425, "me": [4, 309, 311, 316, 318, 319, 321, 361, 364, 365, 366, 367, 370, 371, 375], "mean": [25, 33, 36, 44, 57, 83, 266, 281, 316, 365, 376, 387, 388, 389, 390, 391, 395, 396, 399, 400, 402, 406, 409, 413, 416, 419, 425], "mean_in": 281, "mean_out": 281, "mean_var_reduce_data_t": 281, "mean_var_reduce_param_t": 281, "meanwhil": [304, 372, 399, 405], "measur": [266, 270, 289, 303, 385, 398, 416, 417, 419, 423], "mechan": 376, "media": 298, "median": 25, "medic": 371, "medium": [272, 302, 414, 420], "medium_n": 414, "meet": [266, 278, 279, 280, 281, 308, 309, 361, 371, 372, 387, 403, 405, 409, 421, 429, 432], "mem": 385, "member": [279, 280, 281, 298, 394, 400, 401], "memori": [9, 25, 288, 307, 325, 361, 367, 372, 385, 394, 396, 400, 401, 402, 403, 404, 406, 407, 408, 409, 417, 422, 423, 425, 426, 428, 429, 432], "memory_args_": 394, "memory_storage_t": 278, "meng": 415, "mention": [270, 371], "merg": [340, 352, 390, 395], "merge_dst": 281, "merge_peft_adapt": 352, "merge_src": 281, "merged_embeddingbag": 150, "mergedembeddingbag": [73, 178], "mesa": [308, 309, 334], "mesh": 354, "messag": [0, 309, 316, 321, 324, 361, 385], "met": [295, 361, 438], "meta": [9, 309, 314, 332, 346, 347, 352, 377, 420, 422, 432], "metadata": [348, 372], "meter": 377, "method": [9, 25, 30, 57, 246, 247, 270, 288, 304, 307, 314, 316, 319, 349, 352, 361, 372, 373, 376, 400, 403, 405, 408, 410, 423, 428, 432], "meticul": 319, "metric": [28, 246, 249, 266, 270, 302, 306, 346, 376, 423, 434], "mha": [398, 437], "mha_dens": [279, 413], "mha_dense_desc": 279, "mhattent": 261, "mhattentionmap": 260, "micro": [390, 399, 404, 409], "micro_b": 413, "micro_oc": 413, "microarchitectur": 361, "microcod": [397, 411, 425, 426], "microkernel": 404, "microsoft": [266, 309, 327, 328, 329, 330], "midst": 372, "might": [24, 295, 319, 361, 370, 391, 438], "migrat": [309, 428], "miko\u0142aj": 301, "mimic": 423, "min": [25, 29, 246, 258, 372, 423, 432], "min_chuck_s": 372, "min_length": 29, "min_sparsity_ratio_per_op": 28, "mind": [0, 319, 420], "mine_hard_neg": 376, "mini": [304, 385, 389, 393, 397, 430], "mini_batch_s": 352, "miniconda": [323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 354, 357, 363, 368, 383, 384], "miniconda3": [323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 357, 363, 368, 383, 384], "minilm": [272, 302, 304, 306, 420, 430], "minilmv2": 304, "minim": [303, 361, 370, 371, 409], "minimum": [325, 372], "minmax": [246, 432], "minmax_lr": [247, 432], "minor": [314, 349], "minut": [309, 430], "misc": [255, 256, 257, 415], "miscellan": 18, "misinterpret": 372, "miss": [349, 361, 399, 409], "mistral": [309, 347, 350, 420], "mistral_peft_finetuned_model": 349, "mistralai": [309, 347, 349], "mit": 43, "mitig": [372, 432], "mix": [158, 314, 320, 341, 349, 361, 381, 390], "mix665k": 350, "mixedprecisionconfig": 319, "mixin": 247, "mixtral": [309, 349], "mixtral_peft_finetuned_model": 349, "mixtur": 350, "mk": 390, "mkdir": [21, 334, 364, 365, 366, 386, 388, 398, 410, 413], "mkl": [323, 324, 331, 334, 336, 337, 338, 340, 343, 344, 354, 357, 363, 368], "mkl_layer_norm": 83, "ml": [345, 383, 384], "mleffici": [272, 302, 420], "mlm": 304, "mlp": [256, 257, 350], "mlp2x_gelu": 350, "mlperf": [272, 420], "mm_projector_typ": 350, "mmkmb": 390, "mmlu": [288, 346], "mmmu_ev": 350, "mmr": [2, 372], "mmxmb": 390, "mnli": [304, 354], "moat": 420, "mobil": [303, 378], "mobilebert": 303, "mod2": 402, "modal": [319, 420], "mode": [27, 48, 55, 264, 307, 331, 332, 335, 380, 389, 393, 406, 408, 413, 414, 423], "model": [0, 4, 5, 9, 19, 23, 24, 27, 28, 30, 45, 47, 49, 52, 53, 54, 55, 57, 60, 62, 129, 130, 131, 132, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 182, 183, 185, 186, 187, 188, 189, 190, 191, 193, 194, 196, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 224, 225, 226, 228, 229, 230, 241, 242, 244, 246, 247, 266, 268, 269, 270, 286, 303, 304, 306, 313, 314, 315, 316, 319, 320, 321, 323, 324, 325, 326, 330, 331, 332, 335, 340, 342, 345, 346, 347, 351, 354, 356, 357, 358, 361, 364, 365, 366, 369, 371, 372, 373, 375, 380, 383, 384, 386, 387, 390, 391, 395, 396, 397, 400, 405, 406, 407, 408, 411, 415, 416, 417, 419, 420, 421, 422, 423, 426, 427, 429, 430, 432, 439], "model_and_token": [389, 392, 393], "model_class": 45, "model_dataset": 83, "model_dir": 314, "model_doc": 9, "model_format": [426, 427], "model_id": 429, "model_infer": 389, "model_input": 40, "model_kwarg": [36, 44, 45, 418], "model_max_length": 350, "model_nam": [289, 351, 352, 376, 418, 426, 427, 429, 432], "model_name_or_path": [27, 35, 246, 289, 309, 314, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 338, 340, 343, 344, 346, 347, 348, 349, 352, 354, 363, 371, 375, 376, 422, 428, 432], "model_path": [351, 375, 390, 396, 426, 427, 432], "model_pix2pix": 337, "modelargu": 5, "modeldataset": 92, "modeling_bert_dynam": 34, "modeling_output": [33, 36, 40, 41, 44], "modeling_roberta_dynam": 34, "models": [251, 304, 417], "moder": 309, "modern": [399, 432], "modif": [9, 261, 309, 314, 349, 377, 389], "modifi": [24, 44, 47, 50, 55, 57, 158, 181, 300, 309, 335, 337, 341, 349, 371, 372, 373, 375, 377, 379, 380, 381, 382, 388, 389, 392], "modify_node_connect": 55, "modul": [51, 56, 58, 59, 83, 150, 245, 303, 337, 361, 369, 392, 393, 421, 432], "module_nam": 57, "module_templ": 23, "moemodeloutputwithpast": 40, "moment": [25, 395], "momentum": 419, "monetari": 371, "more": [23, 25, 50, 52, 53, 57, 258, 259, 266, 271, 300, 303, 306, 307, 309, 314, 316, 319, 325, 345, 346, 349, 357, 363, 369, 372, 373, 376, 383, 384, 385, 386, 387, 389, 391, 392, 394, 395, 397, 398, 399, 400, 403, 405, 406, 407, 409, 411, 412, 413, 420, 421, 425, 426, 428, 429, 432], "mosaicml": [309, 314, 315, 346, 349, 428], "moshew": [304, 393], "most": [246, 266, 302, 316, 347, 363, 372, 376, 377, 391, 395, 396, 400, 401, 402, 405, 407, 418, 420, 429], "mostli": [57, 264, 359, 360, 374, 395], "motiv": 432, "mount": [314, 315, 316, 366], "mount_dir": 315, "mov": [400, 410], "move": [9, 25, 42, 400], "mp3": [340, 355, 369], "mp4": 374, "mpi": [314, 332, 349], "mpirun": [314, 349], "mpnet": 376, "mpt": [309, 314, 315, 428], "mpt_7b": 346, "mpt_peft_finetuned_model": [314, 349], "mrpc": [304, 392, 393], "mrr": 376, "mse_rang": 247, "msft": 359, "msg": [61, 361], "mt": [304, 353, 412, 425, 426], "much": [25, 266, 372, 392, 402], "mul": [57, 387, 391, 395, 400], "mul_1": 395, "mul_2": 395, "mult": [350, 372], "multi": [1, 32, 256, 257, 319, 326, 340, 351, 352, 354, 359, 361, 388, 389, 390, 420], "multi_gpu": 352, "multiheadattenion": 73, "multilang": 336, "multilangtexttospeech": 369, "multimod": [321, 350, 418], "multipart": 340, "multipl": [2, 24, 25, 36, 44, 243, 248, 260, 266, 267, 268, 289, 304, 319, 320, 351, 361, 369, 372, 379, 387, 389, 401, 402, 404, 405, 406, 407, 408, 409, 413, 416, 417, 430], "multiplechoicemodeloutput": [36, 44], "multipli": [399, 405, 409, 423], "multius": 361, "must": [57, 247, 256, 257, 266, 289, 299, 308, 319, 356, 371, 372, 391, 395, 399, 400, 402, 409, 421], "mutable_data": 394, "mutat": [28, 30], "mutation_prob": [28, 30], "mutation_s": 28, "mutual": 372, "mxfp4": 332, "mxk": [399, 413], "mxkxn": 409, "mxn": [402, 408, 413, 421], "my": [36, 44, 349], "mydataset": 25, "myenv": [327, 328, 329], "mymean": 25, "mysql_db": 334, "mysql_host": 334, "mysql_password": 334, "mysql_port": 334, "mysql_us": 334, "n": [25, 30, 36, 44, 57, 255, 263, 281, 303, 314, 322, 323, 327, 328, 329, 330, 331, 332, 337, 349, 354, 361, 362, 385, 390, 391, 393, 397, 399, 402, 403, 404, 405, 408, 409, 411, 413, 421, 426, 427], "n1": 361, "n2": 361, "n3": 361, "n4": 361, "n5": 361, "n_discard": 429, "n_keep": 429, "n_layer": [36, 44], "n_rep": [39, 40], "n_sampl": 247, "n_tile": 281, "na": [57, 319, 398], "naiv": 406, "naive_gemm": 402, "nalamati": 301, "name": [0, 6, 24, 28, 35, 45, 52, 53, 54, 55, 57, 62, 95, 121, 184, 243, 246, 247, 250, 251, 255, 262, 265, 269, 302, 303, 304, 306, 307, 309, 313, 314, 315, 316, 331, 332, 348, 349, 351, 353, 355, 366, 371, 372, 373, 375, 376, 387, 388, 389, 390, 391, 393, 395, 397, 401, 411, 412, 415, 416, 417, 418, 419, 423, 427, 432], "name1": 24, "name2": 24, "namedentityrecognit": 371, "namedentityrecognitionint": 371, "namedtupl": 387, "names_from_input": 57, "namespac": [247, 278, 279, 280, 281], "nan": [25, 255], "nation": 298, "nativ": [264, 367, 372, 407], "natur": [302, 304, 354, 368, 369, 372, 406, 420], "navig": [309, 322, 372], "nb_target_box": [256, 257], "nbsp": [304, 397, 411], "nd": 391, "ne": 388, "ne_root": 388, "nearest": [264, 432], "necessari": [266, 298, 314, 336, 338, 345, 349, 358, 366, 372, 383, 384, 394, 409, 413, 422, 428], "necessarili": 264, "necessit": 432, "need": [0, 24, 25, 32, 36, 39, 40, 44, 57, 158, 259, 266, 269, 302, 303, 309, 313, 314, 315, 316, 317, 318, 322, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 346, 347, 348, 349, 350, 355, 357, 359, 361, 363, 366, 368, 369, 370, 371, 372, 373, 383, 384, 387, 389, 390, 391, 392, 398, 399, 400, 401, 402, 403, 406, 407, 408, 409, 413, 421, 423, 424, 432], "needn": 405, "neelnanda": [247, 425, 428], "neg": [23, 25, 256, 257, 260, 302, 306, 413], "negative_numb": 376, "negatives_cross_devic": 376, "neo": [301, 304], "neox": [272, 302, 314, 315, 349, 363], "neox_reorder_chang": 150, "neox_rotary_pos_emb": 150, "neoxreorderchang": 179, "neoxroraryposemb": 180, "ner": [304, 309, 334, 371, 375], "ner_int": [371, 375], "ner_obj": 371, "nerual": [337, 388], "nest": 24, "nestedtensor": [256, 257], "nesterov": 301, "nestl": 361, "net": [9, 24, 313, 314, 315, 316, 317, 318, 347, 364, 365, 366], "net_info": 55, "netron": 392, "network": [24, 258, 303, 314, 316, 330, 332, 349, 361, 372, 387, 388, 389, 391, 404, 419, 423], "neualspe": 427, "neural": [4, 5, 6, 7, 8, 17, 35, 49, 50, 51, 52, 53, 54, 55, 56, 57, 59, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 150, 181, 215, 243, 244, 245, 269, 272, 289, 297, 302, 303, 309, 313, 314, 315, 316, 317, 318, 319, 321, 324, 331, 334, 335, 337, 338, 340, 343, 344, 345, 346, 352, 356, 358, 363, 364, 366, 369, 371, 372, 375, 378, 380, 383, 384, 387, 389, 390, 391, 392, 396, 404, 412, 417, 419, 420, 423, 425, 426, 427, 428, 429, 432, 433, 440], "neural_chat": [309, 315, 317, 319, 322, 340, 347, 349, 354, 356, 358, 361, 362, 364, 365, 366, 369, 370, 371, 372, 373, 374], "neural_compressor": [246, 302, 303, 306, 419, 423], "neural_engin": [302, 388, 389], "neural_engine_bin": [245, 386], "neural_engine_exampl": 388, "neural_engine_pi": 386, "neural_spe": [366, 426, 427], "neural_speed_verbos": 427, "neuralchat": [272, 299, 302, 308, 312, 319, 321, 323, 324, 325, 326, 327, 328, 329, 330, 334, 336, 337, 338, 340, 343, 344, 345, 360, 368, 372, 374, 378, 383, 384, 420], "neuralchat_cli": 375, "neuralchat_infer": 315, "neuralchat_serv": [309, 360, 367, 375], "neuralchat_tgi": 317, "neuralchat_vllm": 318, "neuralchatserverexecutor": [309, 375], "neurip": [272, 302, 420, 426], "neutral": [316, 354], "never": 361, "nevertheless": 319, "new": [0, 5, 24, 39, 40, 41, 49, 52, 53, 54, 57, 62, 247, 269, 300, 301, 313, 320, 327, 328, 329, 335, 349, 350, 361, 369, 370, 372, 380, 395, 396, 400, 401, 414, 420, 424, 432], "new_embed": [36, 44], "new_input_fil": 414, "new_modul": 24, "new_nam": 55, "new_nod": 57, "new_node_nam": 387, "newer": 24, "newgraph": 55, "newli": 413, "newsroom": 420, "next": [36, 44, 49, 55, 323, 324, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 354, 357, 363, 368, 385, 391, 392, 400, 402, 404, 406, 407, 408, 409, 425], "next_input_id": 40, "next_position_id": 40, "next_sent": 36, "next_sentence_label": 36, "nextsentencepredictoroutput": 36, "nf4": [421, 422, 432], "nfs_imag": 334, "ng": 426, "nhwc": 390, "nightli": 314, "niki": [36, 44], "ninja": [323, 324, 331, 334, 336, 337, 338, 340, 343, 344, 357, 363, 368], "niroop": 301, "nk": 390, "nl": 385, "nli": 354, "nlp": [246, 270, 272, 302, 304, 306, 353, 354, 371, 388, 420, 423], "nlp_executor": 388, "nlpseq2seqtrain": 246, "nlptrainer": [246, 302, 303, 306, 419, 423], "nm": 266, "nms_by_contain": 266, "nms_supercel": 266, "nn": [27, 32, 246, 261, 264, 303, 404], "nncf": 28, "nnode": 352, "nnz_group": 281, "no_cuda": [314, 349, 422], "no_object": 258, "no_proxi": [313, 314, 315, 316, 353, 361], "noam": [36, 44], "nod": 391, "node": [1, 49, 52, 53, 54, 55, 57, 62, 63, 64, 66, 67, 68, 69, 70, 71, 72, 74, 76, 77, 78, 79, 80, 81, 82, 84, 85, 86, 87, 88, 89, 90, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 122, 123, 124, 125, 126, 127, 220, 243, 244, 305, 320, 395, 397, 411, 425, 426, 427], "node0": [314, 349], "node1": [314, 349], "node2": [314, 349], "node3": [314, 349], "node_nam": [55, 57, 62, 243, 387, 391], "node_name_list": 55, "node_names_detail": [62, 243], "node_rank": [314, 349], "nodedef": [57, 243], "nodefil": [314, 349], "nodeproto": [62, 244], "nodes_dict": [62, 94, 95, 243, 244], "nohup": [323, 324, 326, 327, 328, 329, 330, 334, 336, 337, 338, 340, 343, 344, 345, 351, 353, 363, 383, 384], "nois": 369, "non": [24, 25, 44, 256, 257, 258, 266, 288, 350, 404, 407, 409, 413, 414, 432], "non_kdim": 281, "none": [0, 4, 14, 20, 23, 24, 25, 27, 28, 29, 30, 32, 33, 36, 37, 40, 41, 44, 45, 49, 55, 57, 62, 94, 95, 121, 128, 243, 244, 246, 247, 250, 251, 252, 259, 260, 264, 281, 303, 304, 305, 314, 315, 346, 347, 366, 372, 374, 389, 416, 417, 423], "nonetyp": 247, "nonexist": [338, 358], "nonneg": 25, "nonzero": 403, "noperm": [407, 413], "norm": [25, 260], "normal": [25, 256, 257, 259, 268, 281, 319, 375, 376, 400, 408, 422, 432, 437], "normalfloat": 422, "normalize_str": 268, "normmean": 25, "not_quant": [426, 427], "notat": 288, "note": [32, 33, 38, 47, 57, 270, 289, 304, 308, 309, 314, 315, 319, 324, 326, 331, 332, 335, 337, 343, 345, 346, 347, 348, 349, 350, 352, 359, 372, 375, 376, 377, 380, 383, 384, 387, 388, 389, 390, 391, 393, 394, 395, 400, 401, 407, 408, 409, 413, 423, 425, 426, 427, 428, 432], "notebook": [309, 322, 347, 430], "noth": [376, 387, 395], "notic": [52, 53, 392, 400, 407, 408, 415], "nov": [272, 302, 309, 420], "novel": 307, "novemb": [272, 420], "noveral": 361, "now": [57, 266, 314, 316, 330, 332, 349, 355, 361, 363, 366, 386, 387, 388, 390, 391, 392, 400, 401, 408, 413, 418, 432], "np": 302, "npm": [335, 341, 379, 380, 381, 382], "nproc_per_nod": 352, "npz": 25, "nrowptr": 281, "nsampl": 432, "nsome": 361, "nthe": 361, "nthr": 410, "ntl": 385, "null": 332, "null_inst": 278, "null_numpy_valu": 25, "nullptr": [279, 281, 400], "num": [28, 314, 349, 366, 389, 399, 401, 407, 432], "num_beam": [247, 428, 432], "num_box": [256, 257, 260], "num_cards_you_hav": 1, "num_choic": [36, 44], "num_class": [256, 257, 258], "num_cpu": 28, "num_embed": 37, "num_head": [36, 39, 40, 44, 260], "num_hidden_lay": 29, "num_iter": 390, "num_key_value_head": [39, 40], "num_label": [33, 36, 44, 302, 306], "num_lay": [256, 257], "num_machin": 352, "num_nod": [314, 349], "num_of_inst": [28, 246, 289], "num_pos_feat": 259, "num_process": 352, "num_processes_per_nod": [314, 349], "num_queri": [256, 257, 258], "num_sandwich": 28, "num_shard": 363, "num_target_box": 258, "num_tilem": 281, "num_train_epoch": [314, 346, 348, 349, 352, 354, 376, 422], "num_work": 25, "numa": [332, 397, 411], "numactl": [388, 426, 427], "number": [17, 23, 25, 28, 29, 30, 44, 62, 256, 257, 258, 263, 266, 268, 289, 314, 331, 332, 337, 346, 349, 354, 370, 371, 372, 375, 376, 385, 390, 391, 395, 399, 402, 408, 409, 413, 414, 423], "numer": [25, 387, 423], "numpi": [20, 21, 25, 57, 62, 302, 388], "numtil": 402, "nuqmm": 432, "nv": 420, "nvcr": [364, 365], "nvgpu": [314, 315], "nvidia": [309, 320, 323, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 336, 338, 339, 340, 342, 343, 344, 347, 363, 364, 365, 366], "nxhxw": 390, "nxm": 399, "nz2": 369, "o": [57, 247, 302, 308, 314, 332, 349, 369, 397, 401, 406, 411], "o_proj": 349, "obj": [25, 266], "object": [24, 25, 28, 45, 47, 52, 53, 54, 243, 246, 247, 249, 256, 257, 258, 264, 265, 266, 270, 289, 302, 303, 306, 346, 347, 361, 387, 394, 434], "object2_overlap": 266, "objects_in_t": 266, "objects_to_cel": 266, "objects_to_table_structur": 266, "oblig": 298, "observ": [25, 309, 321, 350], "obtain": [302, 304, 335, 349, 369, 372, 380, 389, 408, 427, 428], "obvious": [401, 402, 406], "oc": [390, 413], "occasion": [372, 432], "occupi": 266, "occur": [372, 395, 399, 406, 432], "occurr": 25, "ocr": [350, 359], "ocr_vqa": 350, "oct": 420, "off": [25, 269, 319, 390, 432], "offens": [9, 298], "offer": [270, 309, 313, 316, 317, 318, 321, 338, 340, 342, 357, 358, 361, 363, 369, 370, 372, 373, 378, 432], "offic": 377, "offici": [298, 319, 348, 351, 371, 372], "offlin": [298, 403, 409, 423, 428], "offload": 9, "offset": [402, 406, 407], "offset_exp": 400, "offsetm": 402, "offsetn": 402, "often": [303, 370, 372], "ok": [364, 365, 366], "old": [57, 309], "old_batch_s": 27, "old_nam": 55, "old_node_index": 391, "older": 361, "omp": [314, 349, 390], "omp_get_max_thread": 401, "omp_get_num_proc": 401, "omp_num_thread": [314, 331, 349, 388], "ompi_mca_btl_vader_single_copy_mechan": [314, 315, 347, 366], "on_after_ev": 47, "on_after_optimizer_step": 47, "on_before_ev": 47, "on_before_optimizer_step": 47, "on_epoch_begin": 47, "on_epoch_end": 47, "on_step_begin": 47, "on_step_end": 47, "on_train_begin": 47, "on_train_end": 47, "onc": [24, 272, 302, 309, 321, 327, 328, 329, 335, 345, 348, 349, 361, 370, 372, 380, 383, 384, 389, 408, 420, 426, 427, 428, 429, 430, 432], "one": [9, 23, 24, 25, 36, 47, 49, 52, 53, 57, 259, 266, 281, 302, 303, 306, 314, 315, 340, 348, 351, 355, 371, 376, 377, 385, 386, 387, 389, 390, 391, 395, 396, 400, 402, 403, 408, 412, 413, 418, 426], "one_hot": 83, "oneapi": [279, 324, 338, 361, 362, 394, 410, 432], "oneccl": [314, 330, 349], "oneccl_bind_pt": [332, 348, 349], "oneccl_bindings_for_pytorch": [314, 332, 349], "oneccl_bindings_for_pytorch_path": [314, 349], "onednn": [279, 394], "onehot": [73, 93], "ones": [395, 432], "onli": [9, 24, 25, 32, 33, 36, 39, 40, 41, 42, 44, 57, 256, 257, 260, 270, 272, 289, 308, 309, 314, 316, 320, 336, 337, 349, 350, 354, 363, 367, 369, 372, 388, 390, 391, 392, 394, 396, 398, 400, 401, 402, 405, 407, 408, 409, 413, 416, 418, 420, 421, 422, 425, 428], "onlin": [298, 302, 370, 372, 406], "onnx": [50, 52, 62, 244, 246, 270, 302, 306, 337, 387, 389, 390, 407, 418, 425, 432, 434, 439], "onnx_extract_oper": 62, "onnx_extractor": [50, 51], "onnx_input": 83, "onnx_util": 58, "onnxextractor": 52, "onnxinput": 94, "onnxmodel": [52, 62], "onnxrt": [425, 426], "onnxruntim": [70, 71, 72, 78, 80, 101, 102, 107, 108, 110, 111, 112, 114, 118, 122, 123, 125, 126, 302, 305, 308, 387, 393], "op": [49, 52, 53, 54, 57, 58, 62, 158, 181, 192, 243, 244, 246, 255, 281, 389, 394, 395, 396, 400, 401, 413, 414, 423, 432], "op_alg": [400, 401], "op_attr": [398, 400, 401, 407], "op_desc": [278, 279, 398, 401], "op_desc_": 401, "op_dt": 400, "op_idx": 57, "op_nam": 28, "op_name_dict": 247, "op_typ": [57, 62, 95, 243, 244, 387, 390, 391, 401], "op_type1": 395, "op_type2": 395, "op_type_dict": 247, "opani": 73, "open": [8, 268, 288, 298, 309, 322, 346, 347, 351, 352, 354, 361, 363, 372, 388, 392, 420], "openai": [0, 9, 22, 316, 335, 336, 350, 355, 369, 375, 380, 425, 426, 428], "openai_api_kei": 324, "openai_api_protocol": 22, "openai_org": 324, "opencl": 402, "opencv": 334, "openmp": 404, "openorca": [346, 347, 352], "openssf": 435, "openssl": 369, "oper": [50, 52, 53, 57, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 122, 123, 124, 125, 126, 127, 181, 195, 243, 244, 266, 280, 282, 293, 304, 309, 327, 328, 329, 347, 357, 361, 369, 372, 386, 387, 388, 390, 392, 398, 400, 401, 402, 404, 405, 406, 407, 408, 409, 413, 421, 423, 428, 436, 439], "operand": [400, 404], "operator_adaptor": 150, "operator_conf_": 394, "operator_desc": [278, 279, 282, 398], "operator_registri": [95, 387], "operator_typ": [95, 387], "operatoradaptor": [147, 181], "operatorconfig": 394, "opinion": 316, "opmask": [400, 401], "opportun": [306, 307], "opposit": 20, "opset_vers": [246, 305], "opt": [323, 361, 362, 364, 365, 366, 410, 428, 432], "opt_1": 425, "opt_2": 425, "opt_6": 425, "optim": [4, 25, 28, 40, 47, 58, 246, 249, 250, 251, 272, 302, 304, 305, 306, 309, 314, 320, 325, 327, 328, 329, 331, 349, 354, 357, 360, 361, 367, 369, 372, 374, 388, 391, 392, 393, 396, 400, 401, 402, 404, 416, 417, 419, 420, 421, 423, 428, 432], "optimization_config": 319, "optimization_typ": [327, 328, 329], "optimizationconfig": 4, "optimize_dataset": [83, 387], "optimize_model": 4, "optimize_transform": 432, "optimizedataset": [96, 387], "optimizedmodel": 35, "optimum": [314, 346, 349, 352, 366], "option": [1, 6, 24, 25, 27, 28, 32, 33, 35, 36, 44, 45, 57, 246, 247, 256, 257, 260, 265, 267, 289, 319, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 350, 351, 363, 372, 373, 383, 384, 385, 389, 395, 400, 409, 413, 419], "option1": 421, "option2": 421, "optuna": 246, "opu": [304, 353], "orac": 307, "orca": [346, 347, 352], "orca_dpo_pair": [346, 347, 352], "orchestr": 246, "orchestrate_optim": 246, "order": [21, 24, 52, 53, 55, 57, 258, 266, 288, 330, 332, 347, 366, 387, 389, 395, 399, 405, 406, 408, 409, 432], "ordereddict": [95, 387], "ordinari": 24, "org": [25, 36, 260, 332], "organ": [350, 361, 371, 399], "orient": [298, 350], "origin": [24, 25, 27, 47, 52, 53, 57, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 151, 152, 153, 154, 155, 156, 157, 159, 160, 161, 162, 163, 164, 165, 166, 167, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 182, 183, 185, 186, 187, 188, 189, 190, 191, 193, 194, 196, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 256, 257, 288, 303, 309, 350, 357, 372, 377, 387, 392, 406, 407, 420, 423], "other": [1, 24, 25, 35, 57, 111, 158, 255, 258, 266, 281, 288, 298, 300, 302, 316, 317, 319, 324, 330, 332, 334, 336, 337, 338, 340, 343, 344, 345, 349, 357, 359, 361, 363, 368, 369, 372, 375, 383, 384, 387, 388, 389, 390, 391, 395, 396, 397, 405, 408, 409, 411, 413, 415, 420, 423, 425, 426, 432], "otherwis": [24, 57, 247, 266, 298, 361, 369, 387, 390, 405, 413], "our": [5, 49, 288, 305, 312, 320, 325, 336, 337, 338, 340, 346, 350, 351, 355, 358, 369, 372, 373, 378, 395, 400, 402, 403, 405, 407, 408, 409, 418, 429, 432], "out": [23, 25, 55, 57, 266, 300, 302, 330, 361, 372, 387, 388, 391, 398, 407, 423], "out_dt": 281, "out_pattern": 57, "out_proj": 354, "outcom": [319, 350, 395], "outdat": 377, "outer": 17, "outlier": 428, "outlin": 270, "output": [0, 24, 25, 36, 44, 55, 57, 62, 73, 243, 244, 246, 256, 257, 258, 260, 264, 265, 281, 303, 306, 319, 321, 347, 361, 369, 372, 373, 375, 379, 385, 387, 388, 389, 390, 391, 392, 393, 394, 395, 396, 404, 405, 406, 407, 409, 413, 418, 421, 425, 427, 429, 432], "output2_dt": 281, "output_attent": [33, 36, 37, 40, 41, 44], "output_audio": [336, 340, 375], "output_audio_path": [311, 319, 336, 340, 369, 375], "output_bf16": 413, "output_data": [150, 388], "output_dim": [256, 257], "output_dir": [55, 246, 314, 346, 347, 348, 349, 352, 354, 376, 392, 393, 422, 428], "output_dt": [281, 413], "output_fil": 376, "output_fp32": 413, "output_hidden_st": [33, 36, 40, 41, 44], "output_length": [36, 44], "output_max_length": 352, "output_nam": [62, 352, 388], "output_path": [337, 351], "output_router_logit": 40, "output_shap": 389, "output_tensor": [52, 53, 54, 57, 62, 95, 243, 244, 387, 391], "output_tensor_nam": 389, "output_typ": [281, 389], "output_video_path": 374, "outputdata": [182, 387], "outsid": [36, 44, 57, 391, 395], "over": [24, 25, 264, 266, 340, 349, 402, 404, 407], "overal": [309, 357, 370, 372, 406], "overflow": 423, "overhead": [400, 406, 407, 408, 409], "overlap": 266, "overlap_threshold": 266, "overlook": 377, "overrid": [0, 35, 39, 40, 246, 278, 279, 394], "overview": [369, 370], "overwrit": 414, "overwrite_output_dir": [314, 348, 349, 352, 354, 422], "ow": 319, "own": [57, 272, 309, 324, 335, 338, 341, 345, 351, 355, 372, 373, 380, 381, 383, 384, 387, 391, 392, 400, 406, 417, 420], "owner": [270, 300], "p": [25, 270, 302, 314, 349, 364, 365, 366], "p1302": [407, 413], "p2013": [407, 413], "p2031": [407, 413], "p50": 302, "p90": [302, 425], "p99": [302, 425], "p_conf": 306, "p_num": 374, "p_t": 260, "pack": [49, 83, 409], "pack_weight": 421, "packag": [18, 31, 272, 302, 327, 328, 329, 349, 351, 361, 371], "package_object": 266, "packagepositionembed": 100, "pad": [23, 32, 36, 44, 247, 256, 257, 289, 302, 350, 389, 405, 409, 413], "pad_max": [346, 347, 350], "padding_idx": 44, "padding_mask": 413, "padding_sequ": [83, 150, 388], "paddingsequ": [57, 98, 183, 388], "page": [266, 298, 300, 302, 306, 319, 327, 328, 329, 359], "page_span": 266, "pagedattent": 367, "pain": 423, "pair": [36, 55, 247, 256, 257, 346, 347, 348, 352, 354, 372, 388, 401, 409], "pairwis": 263, "palm2": 372, "panda": 266, "panopt": 260, "paper": [25, 32, 259, 272, 302, 306, 307, 420, 426, 428], "parallel": [314, 326, 330, 332, 349, 354, 361, 404, 405, 406, 409, 413, 421], "param": [21, 25, 62, 243, 246, 256, 257, 258, 260, 264, 363, 400, 401], "param_": [400, 401], "paramet": [4, 6, 9, 17, 20, 21, 24, 25, 27, 28, 29, 30, 32, 35, 36, 44, 45, 52, 53, 54, 57, 62, 95, 184, 243, 244, 246, 247, 255, 256, 257, 260, 262, 264, 267, 272, 281, 289, 302, 303, 309, 316, 317, 319, 335, 354, 363, 369, 375, 380, 385, 389, 395, 416, 419, 428, 432], "parameter": [346, 347], "parametr": 16, "params_": 401, "parent": [2, 30, 266, 372], "parent_docu": [309, 372], "parentstor": [309, 372], "pareto_fronti": 30, "pari": 377, "park": [301, 432], "parm": 373, "parmar": [36, 44], "pars": [1, 13, 62, 63, 64, 66, 67, 68, 69, 70, 71, 72, 74, 76, 77, 78, 79, 80, 81, 82, 84, 85, 86, 87, 88, 89, 90, 92, 93, 94, 96, 97, 99, 100, 102, 103, 105, 106, 107, 108, 109, 110, 112, 114, 115, 117, 118, 119, 120, 122, 123, 124, 125, 127, 243, 253, 254, 268, 372, 388, 394], "parse_arg": 1, "parse_multi_choice_respons": 268, "parse_open_respons": 268, "parsed_output": 351, "parser": 359, "part": [32, 57, 349, 361, 391, 394, 395, 396, 408, 409], "part1": [350, 389, 394], "part2": [350, 394], "parti": [300, 377, 415], "partial": [369, 372], "particip": [298, 402], "particular": [9, 272, 372, 432], "particularli": [302, 372], "pass": [2, 24, 25, 32, 36, 44, 47, 247, 260, 261, 269, 281, 300, 314, 348, 349, 355, 365, 369, 371, 372, 396, 400, 401, 418, 423], "passag": 376, "passage_max_len": 376, "passion": 361, "password": 334, "passwordless": [314, 349], "past": [36, 44, 255, 261, 264], "past_k_v_0": 396, "past_k_v_1": 396, "past_key_valu": [33, 36, 37, 40, 41, 44], "past_key_values_length": [36, 37, 44], "pat": 369, "patch": 350, "patch14": [9, 350], "path": [21, 28, 35, 47, 57, 246, 247, 265, 267, 302, 306, 316, 319, 331, 332, 334, 347, 348, 349, 351, 359, 364, 365, 366, 369, 371, 372, 375, 376, 388, 389, 390, 392, 396, 410, 412, 413, 422, 426, 427], "path_to_hostfil": 1, "pathlik": 247, "patient": 303, "pattern": [25, 28, 57, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 279, 392, 399, 402, 412, 419, 439], "pattern_dict": 387, "pattern_list": 57, "pattern_map": [57, 387, 391], "pattern_mapping_conf_valid": 57, "pattern_mapping_config": 387, "pattern_nam": 57, "pattern_registri": [184, 387], "pattern_typ": [184, 387], "patternlock": 304, "payload": 372, "pb": [57, 306], "pbtxt": [364, 365, 366], "pc": [319, 320, 325, 327, 328, 329, 420], "pdf": [25, 319, 358, 372], "pdf_file": 359, "peak": 325, "peer": 432, "peft": [272, 302, 314, 349, 352, 354, 375, 422], "peft_config": 422, "peft_model_path": 375, "pegasu": 304, "penalti": 371, "penghuicheng": 299, "pentium": 415, "peopl": [361, 423], "per": [309, 314, 349, 389, 397, 400, 403, 411, 413, 414, 428], "per_channel_dequ": 400, "per_channel_qu": 400, "per_device_eval_batch_s": [314, 348, 349, 352, 354, 422], "per_device_train_batch_s": [314, 346, 347, 348, 349, 352, 354, 376, 422], "percentag": [348, 349, 371], "perceptron": [256, 257], "perf": [389, 409, 413, 414], "perform": [2, 36, 42, 44, 57, 246, 251, 256, 257, 258, 269, 270, 289, 293, 302, 303, 305, 306, 309, 314, 315, 319, 325, 347, 349, 354, 355, 357, 361, 363, 369, 370, 371, 372, 376, 378, 379, 388, 389, 390, 393, 399, 402, 403, 404, 405, 406, 407, 408, 409, 413, 416, 417, 419, 420, 423, 432, 436], "perhap": 399, "peripher": 404, "perm": [389, 407], "perm1302": 407, "perm2013": 407, "perm2031": 407, "perman": 298, "permiss": [269, 298], "permit": 372, "permut": [389, 403, 407, 413], "perplex": 349, "persist": 372, "persist_dir": [324, 338, 375], "person": [298, 316, 355, 371], "perspect": [314, 349, 361], "pertain": 5, "pertin": 372, "phase": [47, 303, 429], "phenomenon": 391, "phi": [309, 349, 415], "philschmid": 304, "phind": [309, 326, 330, 332], "photo": [333, 334, 371], "photoai": [338, 375], "phrase": 389, "physic": [289, 298, 407], "pick": [348, 349, 409], "picklabl": 264, "pictur": [390, 399, 412], "piec": [354, 407], "pil": 20, "pile": [247, 288, 354, 425, 428], "pin_memori": 25, "ping": [330, 332, 402], "pip": [302, 308, 309, 315, 321, 322, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 346, 348, 349, 350, 351, 352, 353, 354, 357, 358, 359, 360, 361, 363, 366, 367, 368, 369, 370, 371, 374, 376, 377, 383, 384, 387, 393, 412, 426, 427, 432], "pipel": 357, "pipelin": [4, 49, 270, 309, 315, 319, 332, 351, 354, 358, 369, 370, 371, 372, 374, 375, 434], "pipeline_cfg": 319, "pipeline_config": 319, "pipelineconfig": [4, 319, 358, 372], "piqa": [288, 426], "pitch": 369, "pix2pix": 337, "pixel": [9, 256, 257], "pizza": 36, "place": [0, 24, 38, 372, 377, 396, 401, 406, 419], "placehold": 83, "plai": [23, 372], "plain": 25, "plan": [335, 361, 380], "platform": [22, 302, 312, 319, 320, 321, 323, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 336, 338, 339, 340, 342, 343, 344, 354, 363, 372, 378, 412, 420, 421, 423], "platinum": [304, 310, 397, 411, 425, 426], "pleas": [0, 9, 47, 50, 52, 53, 269, 270, 271, 289, 300, 303, 304, 306, 307, 308, 309, 316, 317, 319, 324, 326, 331, 335, 336, 337, 338, 340, 342, 343, 345, 347, 348, 349, 350, 351, 353, 355, 358, 361, 365, 372, 376, 378, 380, 383, 384, 387, 391, 394, 398, 399, 400, 401, 405, 408, 413, 419, 421, 423, 426, 427, 428, 429, 432], "plm": 304, "plot": 265, "plot_log": 265, "plu": [25, 36, 350], "plugin": [315, 320, 321, 322, 324, 332, 334, 342, 344, 357, 358, 360, 363, 369, 372, 373, 374, 375], "plugin_audio": 336, "pndmschedul": 9, "po": [259, 376], "podcast": 420, "point": [36, 44, 245, 246, 265, 316, 372, 378, 391, 400, 401, 405, 408, 421, 423, 428], "pointer": 396, "pokemon": 304, "polici": [298, 307, 335, 346, 347, 380, 435], "polish": [12, 372], "polit": 298, "polosukhin": [36, 44], "polynomi": 408, "pong": 402, "pool": 369, "pooled_output": 36, "pooler": [36, 44], "poor": 406, "pop": 410, "popul": [24, 28, 30], "popular": [302, 309, 319, 325, 338, 358, 363, 370, 420, 432], "population_fil": 30, "population_s": 28, "port": [313, 316, 317, 318, 322, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 361, 363, 364, 365, 366, 372, 375], "portion": [9, 24], "pos_emb": 83, "pose": 372, "posit": [32, 36, 37, 44, 259, 260, 261, 270, 288, 298, 302, 306, 376, 390, 395, 413, 418, 429, 432], "position_embed": 150, "position_embedding_typ": [36, 44], "position_embeddings_v1": 150, "position_id": [32, 33, 36, 40, 41, 44], "positionembed": 185, "positionembeddinglearn": 259, "positionembeddingsin": 259, "positionembeddingsv1": 186, "positionid": 73, "possibl": [256, 257, 268, 375], "possibli": 25, "post": [23, 298, 309, 313, 316, 317, 318, 334, 361, 363, 367, 372, 389, 413, 428, 430, 432], "post_init_cpu": 247, "post_init_gptq": 247, "post_init_runtim": 247, "post_init_xpu": 247, "postambl": 401, "postman": 363, "postop": [400, 401, 413], "postop_alg": 401, "postop_attr": [280, 281, 401], "postop_idx": 401, "postop_list": 401, "postop_typ": 401, "postprocess": [256, 257], "postprocesspanopt": 260, "posttrainingquantconfig": [246, 302, 306, 423], "potenti": [281, 361, 406, 420], "pow": [83, 387, 391], "power": [303, 304, 307, 309, 319, 352, 361, 420], "ppn": [314, 349, 425], "ppo_epoch": 352, "pr": [300, 413], "practic": [270, 340], "pragma": 402, "pre": [17, 272, 302, 354, 371, 372, 391, 402, 412, 420, 432], "preambl": 401, "precis": [25, 158, 246, 264, 305, 314, 320, 336, 337, 349, 371, 372, 378, 392, 417, 421, 423, 425, 432, 439], "precomput": [36, 44], "pred": 302, "pred_box": [256, 257, 258], "pred_i": 268, "pred_logit": [256, 257, 258], "predecessor": 361, "predefin": [319, 372], "predict": [4, 36, 44, 246, 256, 257, 258, 260, 268, 302, 303, 311, 319, 351, 354, 356, 358, 370, 372], "prediction_logit": [36, 44], "predominantli": 372, "pref": 389, "prefer": [316, 319, 372, 395, 407], "prefix": [25, 314, 349, 372, 413], "premis": [319, 325, 354], "prepar": [36, 44, 389, 391, 394, 400, 401, 409, 423], "prepare_dataset": 393, "prepare_inputs_for_gener": [36, 44], "prepare_model": [337, 392, 393], "prepare_model_for_kbit_train": 422, "prepare_t": 401, "preprint": 432, "preprocess": [18, 27, 288, 302, 408], "preprocess_model": 27, "prerequisit": 366, "present": [36, 246, 319, 408], "preserv": 372, "press": [355, 361], "pretrain": [17, 33, 36, 44, 309, 319, 347, 348, 349, 369, 387, 429], "pretrainedconfig": 247, "pretrainedmodel": 23, "pretrainedtoken": 23, "pretraining_data": 350, "preval": 432, "prevent": [25, 372, 390], "previou": [55, 246, 302, 349, 388, 405, 424], "previous": 332, "price": [335, 380], "primari": [316, 370, 378], "primarili": [369, 372, 429], "primconst": 115, "primit": [281, 332, 394], "primitive_desc": 394, "print": [24, 25, 264, 309, 314, 319, 321, 332, 349, 369, 371, 387, 395, 428, 432], "print_hello_world": 313, "print_result": 351, "prior": [408, 429], "priorit": [319, 325], "prioriti": 24, "privat": [279, 280, 298, 349, 388, 394, 399, 400, 401, 405, 406, 420], "privileg": 314, "proactiv": [314, 349], "probabl": [25, 28, 29, 319, 346, 347, 370, 406, 432], "problem": [307, 338, 358, 372, 409, 413, 429], "problemat": 372, "proce": 347, "procedur": [316, 385, 413], "procedurein": 385, "process": [27, 28, 47, 246, 256, 257, 264, 266, 267, 270, 302, 304, 309, 314, 319, 321, 322, 327, 328, 329, 332, 334, 338, 340, 347, 349, 351, 354, 355, 358, 361, 369, 370, 371, 372, 373, 375, 379, 387, 388, 390, 391, 395, 396, 399, 400, 402, 405, 406, 409, 419, 420, 423, 432], "process_batch_per_k": 281, "process_col": [281, 400], "process_row": 281, "process_vec_num": 281, "processed_s": 260, "processed_text": 373, "processor": [4, 272, 302, 304, 308, 309, 310, 311, 316, 318, 319, 321, 323, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 336, 337, 338, 339, 340, 342, 343, 344, 347, 349, 358, 361, 363, 364, 365, 366, 367, 370, 375, 397, 411, 420], "produc": [25, 255, 266, 303, 316, 372, 409], "product": [302, 372, 397, 407, 411, 417, 423], "profession": 298, "profil": [295, 398, 438, 439], "proflil": 389, "program": [399, 415, 432], "progress": 17, "project": [270, 298, 300, 309, 327, 328, 329, 335, 345, 369, 380, 383, 384], "project_source_dir": 388, "promis": [404, 405], "prompt": [0, 36, 267, 309, 313, 314, 318, 321, 335, 349, 354, 355, 364, 365, 366, 367, 370, 371, 372, 375, 380, 426, 427, 428, 429, 432], "prompt_token": 361, "prop_kind": 394, "propag": [256, 257], "proper": [321, 371], "properli": [32, 335, 340, 380], "properti": [302, 388, 415], "proport": [416, 417], "propos": [303, 350, 372, 399, 420, 428], "prot": 361, "protect": [278, 279, 361], "protobuf": [308, 393], "protocol": [22, 406], "prove": 369, "provid": [4, 24, 27, 28, 30, 35, 36, 44, 246, 260, 263, 264, 270, 272, 289, 302, 304, 305, 306, 307, 309, 314, 316, 319, 321, 325, 327, 328, 329, 335, 336, 337, 342, 345, 349, 351, 356, 357, 365, 367, 369, 370, 371, 372, 373, 375, 376, 378, 380, 383, 384, 385, 387, 396, 398, 401, 406, 408, 416, 421, 423, 427, 428, 432], "proxi": [279, 293, 313, 314, 315, 316, 317, 318, 361, 398, 436], "proxy_bas": 279, "prune": [36, 44, 46, 246, 270, 272, 302, 420, 425, 430, 434], "prune_config": 307, "prune_head": [36, 44], "prune_typ": 306, "pruneofa": 304, "pruner": [303, 419], "pruner_config": 306, "pruner_info": 47, "prunerconfig": 306, "prunerv2": 28, "pruning_conf": 419, "pruning_config": [28, 246, 306, 419], "pruning_frequ": 28, "pruning_op_typ": 28, "pruning_scop": [28, 419], "pruning_typ": [28, 419], "pruningconf": 246, "pruningconfig": 306, "psedorandom": 25, "pseudo": 405, "pseudorandom": 25, "pt": [36, 44, 289, 302, 306, 355, 418, 428, 429, 432], "pt_hpu_max_compound_op_s": 349, "pth": [353, 359], "ptq": [392, 428], "ptr": [400, 401, 410], "ptr_bia": 281, "ptr_dens": 281, "ptr_dst": 281, "ptr_dst_m1": 281, "ptr_dst_m2": 281, "ptr_scale": 281, "ptun": [314, 349], "pub": [270, 330, 332], "public": [32, 270, 278, 279, 280, 281, 298, 319, 325, 349, 361, 394, 400, 401], "publish": [272, 298, 302, 415, 420], "pubtables1m": 359, "pubtables1m_detection_detr_r18": 359, "pull": [313, 316], "pull_key_prefix": 25, "pure": [319, 386, 401], "purif": [272, 420], "purpos": [256, 257, 378, 391, 395, 400, 405], "push": [247, 300, 410], "push_back": 401, "push_key_prefix": 25, "push_to_hub": 247, "pushtohubmixin": 247, "put": [266, 353, 370, 387, 388, 391], "pvc": 432, "pwd": [364, 365], "py": [1, 22, 50, 314, 315, 326, 327, 328, 329, 330, 331, 332, 337, 340, 345, 346, 347, 348, 349, 351, 352, 353, 354, 355, 356, 357, 358, 359, 360, 361, 362, 364, 365, 366, 368, 371, 372, 376, 377, 383, 384, 385, 387, 389, 393, 412, 419, 422, 426, 427, 432], "py3": [364, 365], "pyakurel": 301, "pybind": 55, "pydant": 361, "pyg": 340, "pylint": [269, 300], "pypi": [330, 337], "pytest": 269, "python": [1, 6, 8, 24, 36, 44, 52, 53, 57, 247, 277, 286, 300, 302, 308, 311, 314, 315, 316, 319, 321, 322, 346, 347, 348, 349, 351, 352, 353, 354, 355, 356, 358, 359, 360, 361, 362, 364, 365, 366, 371, 374, 375, 376, 377, 385, 386, 387, 388, 390, 392, 393, 412, 426, 427, 432], "python3": [308, 309, 314, 349, 352, 361, 386], "pythonpath": [364, 365, 366], "pytorch": [24, 25, 30, 32, 33, 35, 36, 37, 39, 40, 41, 42, 44, 246, 252, 264, 269, 270, 299, 302, 305, 308, 309, 314, 331, 332, 340, 349, 361, 393, 412, 418, 420, 423, 426, 427, 428, 432], "pytorchbenchmark": 27, "pyyaml": [323, 324, 331, 334, 336, 337, 338, 340, 343, 344, 357, 363, 368], "q": [25, 32, 340, 407, 408], "q_bia": 281, "q_config": [302, 306, 423], "q_k_scale": 281, "q_k_src2": 281, "q_model": 35, "q_proj": [346, 347, 349, 354], "q_scale": 281, "q_weight": 281, "qa": 372, "qat": [304, 305, 423], "qdq": [246, 305, 392], "qk": 407, "qk_v_output_scal": 281, "qk_v_output_zero_point": 281, "qkv": [220, 390, 392], "qkv_merg": 150, "qkv_reshap": 150, "qkvmerg": 187, "qkvreshap": 188, "qlinear": [305, 392], "qlinearadd": 73, "qlinearmatmul": [73, 392], "qlinearmul": 73, "qmodel": 432, "qnli": 304, "qqp": 304, "quaint": 361, "quala": [272, 302, 304, 306, 420], "qualiti": [337, 355, 369, 372, 376, 379], "quanstion": 44, "quant": [57, 158, 398, 413, 423, 432, 437], "quant_config": [246, 302, 306, 423], "quant_format": [246, 305], "quant_gather_to_bf16": 150, "quant_info_init": 57, "quant_lm_head": 247, "quant_tile_n": 405, "quantawaretrainingconfig": 247, "quantgathertobf16": [147, 189], "quantif": [428, 432], "quantil": 25, "quantiti": 371, "quantiz": [28, 35, 102, 246, 247, 270, 272, 302, 305, 309, 357, 400, 401, 405, 406, 408, 413, 416, 420, 421, 422, 426, 428, 430, 434, 439], "quantization_config": [306, 428, 429, 432], "quantizationawaretrainingconfig": [246, 423], "quantizationconfig": 246, "quantizationmethod": 247, "quantize_dim_elt_num": 413, "quantize_fus": 150, "quantize_linear": [83, 387], "quantize_on_tmp_buf": 405, "quantize_to_packed_weight": 421, "quantize_v2": 83, "quantized_fused_matmul_and_dequant": 83, "quantized_graph_dtype_refactor": 150, "quantized_matmul_with_bias_and_dequant": 83, "quantized_weight": 421, "quantizedgraphdtypecheck": 191, "quantizedgraphdtyperefactor": [158, 191], "quantizedmatmulwithbiasanddequant": 105, "quantizefus": 190, "quantizelinear": [102, 387, 392], "quantizev2": 103, "quarter": [25, 406], "queri": [4, 11, 12, 15, 32, 39, 40, 256, 257, 309, 311, 316, 319, 324, 325, 338, 342, 370, 371, 372, 373, 376], "query_dim": 260, "query_emb": 257, "query_file_jsonl_path": 376, "query_instruction_for_retriev": 376, "query_max_len": 376, "query_st": [39, 40], "question": [36, 267, 268, 298, 300, 302, 304, 316, 319, 335, 351, 370, 372, 380, 403, 430], "question_typ": 351, "questionansweringmodeloutput": [36, 44], "queue": 361, "quick": [306, 356, 372, 387], "quick_start": [361, 362], "quickli": 316, "quiet": 25, "quit": [361, 385, 396, 400, 432], "qweight": 421, "qwen": [309, 432], "qwen2": 420, "qword": 410, "r": [21, 25, 266, 302, 308, 309, 315, 322, 323, 324, 327, 328, 329, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 346, 348, 349, 351, 352, 353, 354, 357, 358, 359, 360, 361, 363, 368, 370, 371, 374, 376, 377, 383, 384, 387, 393, 397, 411, 412, 422, 423, 425, 426, 427, 432], "r10": 401, "r12": 410, "r13": 410, "r14": [401, 410], "r15": [401, 410], "race": 298, "rag": [309, 316, 320, 321, 324, 357, 358, 361, 420], "rag_doc": 316, "rai": 30, "rais": [24, 272, 338, 358, 420, 426, 427], "ramakrishna": 301, "random": [25, 36, 288, 371, 376, 406], "random_sampl": 25, "rang": [21, 73, 246, 260, 302, 309, 312, 319, 320, 369, 372, 387, 390, 396, 413, 423], "range_for_sampl": 376, "rank": [252, 264, 309, 314, 347, 349, 354, 372, 376, 377, 422, 425], "ransform": 432, "rapid": [272, 302, 308, 349], "rate": [370, 406, 425], "rather": [25, 391, 400], "ratio": [28, 29, 30, 47, 303, 376, 411, 413, 416, 417], "raw": [256, 257], "raw_cmd": 27, "raw_dataset": [302, 306], "raw_h": 20, "raw_w": 20, "rbp": 410, "rbx": 410, "rcx": 401, "rdi": [401, 410], "rdx": 401, "re": [35, 247, 319, 335, 341, 378, 379, 380, 381, 382, 403], "reach": [300, 302], "read": [25, 47, 302, 313, 317, 318, 388, 420], "readabl": 247, "readi": [335, 348, 349, 364, 365, 366, 372, 380], "readm": [269, 316, 317, 319, 323, 326, 327, 328, 329, 330, 331, 332, 333, 334, 336, 337, 338, 339, 340, 342, 343, 344, 356, 358, 363, 364, 365, 366, 370, 378, 389, 392, 432], "real": [279, 307, 317, 318, 319, 325, 363, 390, 405, 406, 407, 410], "real_drop": 307, "realdiv": 73, "realiz": 391, "realli": [256, 257], "reason": [258, 298, 372, 373, 391, 394, 406], "receiv": [35, 181, 364, 365, 366, 391, 395, 396], "recent": [307, 372, 429, 432], "recent_ratio": 307, "recip": [246, 390, 417], "reciproc": [73, 376], "recogn": [372, 387, 439], "recognit": [17, 266, 309, 355, 371, 391, 395], "recommend": [308, 313, 314, 325, 345, 348, 349, 354, 375, 376, 377, 383, 384, 387, 390, 393, 395, 396, 406, 410, 426, 427, 429], "recomput": 25, "record": [52, 53, 54, 382, 389, 390], "recruit": 395, "rectifi": [319, 325], "recurs": [24, 25, 330, 386, 388, 395, 432], "recursive_copi": 24, "red": [21, 425, 426], "redevelop": 372, "redpajama": 428, "reduc": [9, 264, 266, 302, 307, 314, 315, 342, 350, 354, 361, 370, 372, 376, 394, 399, 400, 402, 404, 405, 406, 408, 409, 420, 422, 423, 428, 432], "reduce_dict": 264, "reduce_mean": [83, 387], "reduce_sum": 83, "reducemean": [106, 387, 391], "reducesum": 107, "reduct": [264, 306, 369, 404, 407], "redund": [402, 419], "refactor": [278, 279, 280, 281, 319, 325, 372], "refactor_batch_s": 27, "refer": [9, 24, 47, 50, 52, 53, 264, 269, 270, 302, 303, 306, 307, 308, 309, 317, 319, 321, 337, 338, 342, 345, 347, 348, 349, 350, 351, 358, 372, 376, 378, 383, 384, 385, 391, 394, 403, 405, 408, 413, 415, 419, 421, 423, 429], "refin": [266, 372], "refine_column": 266, "refine_row": 266, "refine_table_structur": 266, "reflect": 25, "refresh": [382, 413], "refresh_model": 55, "reg": [400, 401], "reg64": [400, 401], "reg64_mock1": 401, "reg_idx": 401, "reg_param": 401, "reg_src": 401, "reg_typ": [28, 401], "regard": [298, 346, 347, 352], "regardless": 298, "regener": [335, 341, 380, 381], "regex": 268, "regexp": 400, "region": 428, "regist": [0, 65, 73, 86, 95, 98, 101, 102, 111, 113, 116, 126, 184, 399, 400, 401, 402, 404, 405, 406, 407, 409, 439], "register_conv_templ": 0, "register_operator_class": 394, "registr": [95, 184, 387], "registrationcent": 332, "regress": [33, 36, 44, 256, 257, 269], "regul": [335, 380], "reinforc": [319, 346, 347, 420], "reinstal": [315, 387], "reinterpret_cast": 394, "reject": [298, 346, 347, 352], "rel": [256, 257, 340, 369, 416, 423, 425, 429], "relat": [52, 53, 246, 256, 257, 270, 288, 295, 303, 314, 319, 325, 335, 354, 359, 360, 369, 370, 371, 372, 374, 375, 380, 387, 391, 395, 396, 403, 408, 419, 423, 438], "relationship": 57, "releas": [270, 272, 302, 309, 319, 332, 349, 372, 420, 425, 435], "relev": [2, 302, 335, 372, 380], "reli": 372, "reliabl": 372, "relianc": [372, 432], "relief": 404, "religion": 298, "reload": 25, "relu": [73, 401, 413], "remain": [24, 372, 420, 432], "remain_el": 281, "remain_element_num": 401, "remain_task_mask": 401, "remark": [372, 390, 429], "rememb": [25, 313, 316, 317, 318, 345, 363, 366, 378, 383, 384], "remot": [349, 379], "remov": [25, 40, 44, 55, 57, 192, 195, 247, 261, 266, 298, 361, 401, 419], "remove_constant_op": 150, "remove_environ_info_item": 57, "remove_integer_superscript": 266, "remove_last_view": 150, "remove_nod": 55, "remove_objects_without_cont": 266, "remove_rang": 150, "remove_supercell_overlap": 266, "remove_unused_oper": 150, "remove_zero": 150, "removeconstantop": 192, "removelastview": 193, "removerang": 194, "removeslic": 150, "removeunusedoper": 195, "removezero": 196, "rename_nod": 55, "rencetli": 429, "reorder": [55, 83, 405, 406], "repack": 421, "repack_quantized_weight": 421, "repeat": [25, 73, 402, 414], "repeatedli": 25, "repercuss": 298, "repetit": 371, "repetition_penalti": 371, "replac": [24, 25, 44, 57, 247, 302, 303, 306, 309, 321, 345, 346, 347, 355, 357, 366, 369, 372, 375, 383, 384, 387, 391, 419, 420, 421, 423, 432], "replace_modul": 24, "replacechar": 373, "replc": 315, "repo": [292, 300, 322, 323, 324, 326, 330, 331, 332, 334, 335, 336, 337, 338, 340, 341, 343, 344, 345, 347, 351, 353, 354, 357, 363, 366, 368, 379, 380, 381, 382, 383, 384, 387, 413, 435], "repo_id": 247, "repo_path": [314, 315], "report": [298, 300, 302, 316, 358, 372], "report_to": 346, "repositori": [35, 247, 302, 314, 315, 345, 348, 349, 364, 365, 366, 383, 384], "repr": 247, "repres": [25, 47, 57, 256, 257, 298, 319, 325, 345, 375, 383, 384, 389, 391, 395, 399, 401, 402, 405, 423], "represent": [9, 23, 24, 57, 266, 298, 306, 387, 391, 392], "representtaion": 57, "reproduc": [351, 426], "request": [0, 24, 260, 295, 302, 316, 323, 324, 331, 334, 336, 337, 338, 340, 342, 343, 344, 348, 349, 357, 363, 366, 367, 368, 370, 379, 426, 427, 438], "requir": [4, 24, 32, 57, 181, 266, 289, 306, 313, 314, 315, 316, 322, 323, 324, 326, 327, 328, 329, 330, 332, 334, 335, 336, 337, 338, 340, 341, 343, 344, 345, 346, 347, 348, 349, 352, 354, 356, 357, 358, 359, 360, 361, 362, 363, 366, 368, 369, 370, 371, 372, 374, 377, 378, 379, 380, 381, 382, 383, 384, 387, 391, 393, 395, 397, 399, 402, 403, 405, 412, 413, 421, 423, 426, 427, 432], "requirements_cpu": [309, 331, 332, 376], "requirements_cuda": 376, "requirements_hpu": [309, 326], "requirements_win": [309, 327, 328, 329], "requirements_xpu": 309, "requires_grad": 24, "requires_safety_check": 9, "rerank": [2, 14, 372], "reranker_model": [14, 372], "rerun": 432, "rescale_factor": 20, "research": [314, 349, 415, 420, 422], "reset_sp": 279, "reshap": [39, 40, 57, 83, 100, 220, 387, 388, 389, 394], "reshape_0": [57, 391], "reshape_after_restore_hidden_st": 150, "reshape_before_and_after_attention_out_layer_norm_gather_el": 150, "reshape_before_restore_hidden_st": 150, "reshape_fus": 150, "reshape_input": 281, "reshape_tim": 389, "reshapeafterrestorehiddenst": 198, "reshapebeforeandafterattentionoutlayernormgatherel": 199, "reshapebeforerestorehiddenst": 200, "reshapefus": 201, "residu": [17, 398], "resili": 361, "resiz": 83, "resnet": [17, 255], "resnet101": 17, "resnet152": 17, "resnet18": 17, "resnet34": 17, "resnet50": 17, "resnext": 17, "resnext101_32x8d": 17, "resnext50_32x4d": 17, "resolut": 25, "resolv": [24, 25, 266, 271], "resolve_state_dict": 25, "resourc": [302, 303, 361, 372, 402], "respect": [298, 351, 384, 388, 391, 392], "respectfulli": 407, "respond": [349, 356], "respons": [0, 4, 268, 309, 311, 316, 319, 335, 346, 347, 349, 351, 352, 356, 358, 359, 361, 370, 372, 374, 375, 379, 380, 399, 405, 406, 408, 420], "response_templ": [324, 338, 372], "responsibli": 373, "rest": [24, 316, 340, 363, 390, 391, 395, 407, 409], "restart": [335, 380], "restaur": 36, "restor": [29, 304, 427], "restore_hidden_states_in_length_adaptive_update_indic": 150, "restorehiddenstatesinlengthadapt": 202, "result": [24, 57, 246, 247, 260, 264, 265, 268, 272, 289, 298, 302, 304, 338, 342, 358, 369, 370, 371, 372, 373, 374, 376, 387, 390, 391, 397, 400, 401, 402, 405, 406, 407, 408, 409, 411, 415, 420, 423, 425, 426, 429], "result_dir": 374, "result_ref": 279, "resum": [35, 246], "resume_download": 35, "resume_from_checkpoint": 246, "resume_from_pruned_checkpoint": 28, "ret": [24, 57, 395, 410], "ret_old_nod": 387, "retain": [24, 25, 307, 382, 429], "retain_grad": 24, "retain_input": 24, "retain_output": 24, "retinanet": 260, "retriev": [3, 23, 256, 257, 316, 319, 321, 322, 324, 332, 334, 338, 357, 376, 387], "retrieval_chat": 358, "retrieval_file_path": 334, "retrieval_typ": [14, 372, 375], "retrievalqa": [309, 372], "retrievaltypeopt": 5, "retrieveradapt": 14, "return": [0, 4, 6, 17, 20, 21, 23, 24, 25, 27, 29, 32, 35, 36, 44, 45, 47, 52, 53, 54, 57, 62, 95, 184, 243, 244, 246, 247, 256, 257, 258, 260, 261, 262, 263, 264, 266, 267, 268, 289, 302, 336, 361, 370, 372, 373, 387, 391, 395, 400, 401, 416, 421], "return_dict": [33, 36, 40, 41, 44], "return_interm_lay": 255, "return_output": 246, "return_tensor": [36, 44, 289, 302, 306, 428, 429, 432], "retval": 1, "reus": [388, 396, 405, 421], "revamp": 372, "revers": [25, 266], "review": [298, 300, 319, 432], "revis": [24, 35], "reward": [346, 347], "reward_model": 352, "reward_model_nam": 352, "rewrit": 387, "rf": 330, "rf_data": 401, "rgb": 21, "rh": [280, 407], "rhel": 308, "rich": [302, 321], "richer": 372, "right": [23, 36, 44, 57, 266, 298, 316, 322, 403, 407, 409, 418], "rishi": 377, "river": 377, "rl": 352, "rl_train": 352, "rlhf": [346, 347], "rm": [330, 364, 365, 407], "rms_norm": 150, "rmsnorm": [39, 40, 203], "ro": 304, "roberata": 44, "roberta": [44, 304, 430], "robertaattent": 44, "robertaclassificationhead": 44, "robertaconfig": 44, "robertaembed": 44, "robertaencod": 44, "robertaforcausallm": 44, "robertaformaskedlm": 44, "robertaformultiplechoic": 44, "robertaforquestionansw": 44, "robertaforsequenceclassif": 44, "robertafortokenclassif": 44, "robertaintermedi": 44, "robertalay": 44, "robertalmhead": 44, "robertamodel": 44, "robertaoutput": 44, "robertapool": 44, "robertapretrainedmodel": 44, "robertaselfattent": 44, "robertaselfoutput": 44, "robertatoken": 44, "robot": [341, 378, 381], "robust": [319, 325, 372], "rocketknight1": 304, "roco": 348, "rohan": 301, "role": [0, 309, 316, 321, 324, 361, 372], "roll": [361, 407, 429], "rome": 377, "root": [322, 334, 361, 364, 388], "rope": 429, "roraryposemb": [167, 180, 204], "rotari": 32, "rotary_pos_emb": 150, "rotat": 32, "rotten": 315, "roug": 349, "rougelsum": 425, "rough": 385, "roughli": [387, 405], "round": [401, 423, 432], "row": [266, 390, 402, 403, 405, 409], "row_num": 281, "rqsrt": 255, "rsi": 401, "rsp": 401, "rsqrt": [57, 73, 395], "rsub": 83, "rt_data": [279, 398], "rte": 288, "rtn": [288, 421, 432], "rtn_config": 429, "rtnconfig": [247, 319, 429, 432], "rubric": 25, "rule": [24, 25, 373, 395], "run": [9, 23, 24, 25, 246, 262, 264, 269, 289, 307, 308, 309, 313, 314, 316, 317, 318, 319, 321, 335, 341, 347, 348, 349, 350, 352, 354, 355, 364, 365, 369, 379, 380, 381, 382, 413, 414, 423, 432], "run_accuraci": [426, 427], "run_autoround": 427, "run_bench_": 413, "run_ci": 414, "run_code_gen": [326, 327, 328, 329, 330, 331], "run_evolutionary_search": 246, "run_executor": [389, 393], "run_generation_gpu_woq": 432, "run_infer": [426, 427], "run_llava": 351, "run_retrieval_on_cpu": 358, "runscript": 354, "runtim": [4, 272, 281, 302, 306, 314, 315, 318, 327, 328, 329, 347, 366, 386, 387, 388, 392, 395, 396, 398, 410, 413, 423, 426, 432], "runtime_kind": [278, 280], "runtime_kind_": [278, 280], "runtime_output_directori": 388, "runwayml": 9, "s8": [158, 392, 400, 401, 405, 413], "s8s8": [246, 305, 405], "s8s8bf16": 405, "sadhu": 301, "sadtalk": [360, 374], "safe": [300, 319, 349, 373], "safeti": [9, 247, 309, 373], "safety_check": [9, 319, 373, 375], "safetycheck": 373, "sahil2801": 349, "sai": 409, "said": 391, "salesforc": [309, 350], "salient": 419, "samanwai": 301, "same": [2, 17, 24, 25, 44, 57, 260, 264, 266, 303, 305, 314, 316, 323, 330, 332, 346, 350, 351, 355, 370, 372, 376, 387, 388, 389, 391, 392, 395, 399, 402, 405, 406, 409, 412, 413, 414], "same_src_dtyp": 281, "sampl": [25, 29, 246, 256, 257, 268, 288, 289, 302, 306, 311, 314, 319, 349, 350, 355, 369, 376, 397, 407, 423, 425], "sample_1": 340, "sample_layer_configur": 29, "sample_length_configur": 29, "sample_port": 25, "sample_s": [25, 246], "sample_zh_cn": 340, "sampler": [25, 350], "samsum": [304, 425], "sandesh": 301, "sandeshpyakurel": 301, "sandwich": 28, "sangjun": 301, "sapphir": [272, 302, 308, 349], "satisfact": 372, "satisfactori": 405, "satisfi": [308, 395, 405], "satur": 401, "savani": 304, "save": [9, 10, 25, 30, 55, 246, 247, 267, 302, 316, 350, 372, 376, 387, 388, 389, 392, 396, 403, 407, 409, 423, 427, 428, 432], "save_cached_st": 25, "save_directori": 247, "save_freq": 352, "save_jsonl": 267, "save_model": 302, "save_path": [246, 305], "save_popul": 30, "save_pretrain": [247, 428, 432], "save_step": [314, 346, 347, 349, 352], "save_stor": 30, "save_strategi": [314, 348, 349, 352, 354, 422], "save_total_limit": [314, 348, 349, 352, 354, 422], "saved_dir": 432, "saved_result": [302, 428], "say_hello": [311, 375], "sbu": 350, "scabl": 370, "scalabl": [4, 272, 302, 304, 308, 309, 311, 316, 319, 321, 323, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 336, 337, 338, 339, 340, 342, 343, 344, 354, 361, 363, 364, 365, 366, 370, 400], "scalar": [25, 111, 400], "scalar_num": 281, "scale": [20, 25, 62, 246, 259, 281, 370, 376, 400, 405, 408, 420, 421, 423, 428, 432], "scale0": 281, "scale_dst": 281, "scale_dtyp": [247, 432], "scale_factor": 264, "scale_k": 281, "scale_map": [246, 302], "scale_q": 281, "scale_reduce_quant": 405, "scale_shar": 247, "scale_typ": 421, "scale_v": 281, "scaleab": 281, "scalec": 281, "scaled_dot_product_attent": 32, "scan": 269, "scatter_el": 83, "scatterel": 112, "scenario": [281, 372, 405], "scene": [406, 429], "schedul": [9, 246, 293, 398, 436], "schedulermixin": 9, "scheme": [266, 354, 372], "scope": 269, "score": [30, 36, 44, 266, 307, 372, 376, 418], "score_threshold": 266, "scour": 372, "scr2": 413, "scratch": 5, "scratch_": 401, "screen": 395, "screenshot": [335, 380], "script": [1, 16, 17, 19, 20, 21, 269, 300, 315, 331, 332, 347, 349, 350, 351, 358, 376, 385, 390, 392, 412, 427, 432], "scriptmodul": 27, "scroll": 382, "sd": 304, "sdk": [345, 364, 365, 383, 384], "sdpa": 32, "se": [352, 432], "seamless": [272, 302, 319], "seamlessli": [309, 319, 372, 378, 429], "search": [2, 28, 30, 57, 129, 130, 131, 132, 133, 134, 135, 136, 137, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 151, 152, 153, 154, 155, 156, 157, 159, 160, 161, 162, 163, 164, 165, 166, 167, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 182, 183, 185, 186, 187, 188, 189, 190, 191, 193, 194, 196, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 216, 217, 218, 221, 222, 223, 224, 225, 226, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 246, 304, 307, 350, 370, 372, 376, 391], "search_kwarg": [2, 309, 372], "search_mod": [57, 387, 391], "search_pattern": [57, 395], "search_straight_pattern": [57, 395], "search_typ": [2, 309, 372], "searchtyp": 2, "sec": [302, 397, 425], "second": [36, 44, 57, 289, 299, 337, 347, 385, 386, 391, 393, 394, 395, 403, 404, 407, 409, 410, 413, 432], "secondmo": 25, "secretli": 346, "section": [302, 320, 325, 333, 339, 342, 348, 349, 361, 398, 409, 410], "secur": [349, 361, 373, 435], "see": [23, 24, 36, 44, 49, 57, 256, 257, 260, 271, 298, 300, 302, 309, 314, 340, 349, 355, 366, 372, 376, 387, 389, 390, 391, 392, 395, 397, 399, 404, 408, 410, 411, 412, 413, 415, 421, 428, 432], "seed": [25, 288, 314, 349, 352, 371], "seek": [338, 358, 361, 371, 372], "seen": 418, "segment": [269, 319, 325, 372], "segment_id": [57, 306, 388], "sein": 377, "select": [25, 33, 36, 44, 246, 258, 314, 322, 335, 346, 347, 349, 351, 352, 372, 376, 380, 401, 413, 426], "self": [25, 28, 36, 37, 39, 40, 41, 42, 44, 111, 369, 387, 389], "semant": [319, 370, 372], "semi": 389, "semidefinit": 432, "send": [300, 366], "sensit": [319, 361, 373], "sensitive_check": 373, "sensitive_filt": 373, "sent": [342, 361, 370, 391], "sentenc": [36, 44, 289, 302, 314, 315, 335, 349, 354, 355, 372, 376, 380], "sentiment": [272, 302], "sep": 420, "separ": [0, 24, 298, 395, 409, 415], "separatorstyl": 0, "sepc_typ": 281, "seq": [349, 388, 407, 425], "seq2seq": [36, 44, 246], "seq_len": [32, 247, 281, 302, 388, 389, 393, 407, 413], "seq_relationship_logit": 36, "seq_vnni_copy_param": 281, "seqenti": 288, "seqlen": [37, 39, 40], "sequenc": [25, 29, 33, 36, 44, 57, 262, 288, 302, 306, 349, 350, 372, 387, 391, 395, 404, 413, 429], "sequence_length": [33, 36, 44], "sequence_output": 36, "sequenceclassifieroutput": [36, 44], "sequenceclassifieroutputwithpast": 33, "sequencelength": [73, 397], "sequenti": [24, 44, 391, 400, 401, 404], "sergei": 301, "seri": [264, 302, 308, 309, 349, 350, 361, 400, 403, 413], "serial": 247, "serv": [36, 309, 312, 319, 323, 326, 330, 331, 332, 340, 363, 372], "server": [309, 316, 319, 325, 347, 370, 371], "server_executor": [309, 375], "server_ip": 375, "server_nam": 361, "server_port": 361, "servic": [312, 319, 325, 335, 340, 342, 345, 366, 369, 370, 372, 380, 383, 384], "session": [348, 388, 396], "set": [0, 2, 24, 25, 29, 32, 33, 36, 44, 57, 95, 246, 266, 289, 298, 302, 307, 309, 313, 314, 315, 316, 317, 318, 323, 324, 326, 327, 328, 329, 330, 331, 332, 333, 334, 336, 337, 338, 339, 340, 342, 343, 344, 345, 347, 348, 349, 350, 351, 354, 357, 362, 363, 366, 371, 373, 375, 376, 383, 384, 388, 390, 391, 392, 394, 395, 396, 399, 400, 401, 404, 413, 428, 432], "set_attr": [63, 64, 66, 67, 68, 69, 70, 71, 72, 74, 76, 77, 78, 79, 80, 81, 82, 84, 85, 86, 87, 88, 89, 90, 92, 93, 95, 96, 97, 98, 99, 100, 101, 102, 103, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 122, 123, 124, 125, 126, 127, 387], "set_autocast": 57, "set_binaryop_list": [280, 400], "set_data_handl": 394, "set_dtyp": 394, "set_dynamic_config": [246, 306], "set_environ_var": 57, "set_input_embed": [36, 44], "set_length_config": [36, 44], "set_log_fil": 302, "set_lower_constraint": 30, "set_mask": 400, "set_output_attent": [36, 44], "set_output_embed": [36, 44], "set_requires_grad": 24, "set_scal": 400, "set_shap": 394, "set_system_messag": 0, "set_target_properti": 388, "set_upper_constraint": 30, "set_zp": 400, "setcriterion": [256, 257], "setfit": [272, 302, 430], "setp": 396, "settabl": 389, "setter": [30, 36, 44], "setup": [325, 333, 339, 342, 348, 349, 372, 432], "setup_and_instal": 366, "setup_for_distribut": 264, "setuptool": [323, 324, 331, 334, 336, 337, 338, 340, 343, 344, 357, 363, 368], "setvar": [314, 332, 349, 361, 362, 432], "sever": [57, 309, 314, 321, 349, 387, 392, 395, 396, 399, 413, 423], "sex": 298, "sexual": 298, "sf": [308, 309], "sgx": 361, "sh": [314, 322, 323, 324, 326, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 349, 350, 354, 357, 358, 360, 361, 362, 363, 368, 383, 384, 392, 393, 414, 427, 432], "shanghai": [356, 371], "shanghai_": 353, "shape": [32, 33, 36, 37, 38, 40, 44, 57, 83, 121, 256, 257, 260, 281, 302, 350, 388, 389, 390, 394, 396, 399, 405, 407, 413, 421], "shape_0": 390, "shape_1": 390, "shape_2": 390, "shape_256_256_128": 410, "shard": [361, 363], "share": [266, 316, 347, 361, 372, 376, 402], "share_weight": 24, "shared_criterion": 247, "shared_ptr": [278, 279, 394], "sharegpt": 350, "sharma": 301, "shazeer": [36, 44], "she": 361, "shell": [309, 393], "shen": [415, 432], "shift": [33, 369], "shira": 301, "shirin": 301, "shm": [314, 315, 366], "short": [351, 372], "shorter": 36, "shot": [340, 369], "should": [24, 25, 33, 35, 36, 44, 57, 246, 256, 257, 263, 266, 267, 314, 316, 347, 349, 350, 355, 364, 365, 366, 372, 376, 387, 388, 390, 391, 394, 395, 399, 400, 401, 406, 413, 414, 416, 417, 423, 432], "show": [298, 302, 315, 324, 343, 355, 361, 367, 371, 376, 385, 387, 388, 391, 392, 395, 403, 405, 407], "showcas": 432, "shown": [303, 332, 351, 372, 390, 404, 408, 409, 428], "shrestha": 301, "shrink": 266, "shrunk": 266, "shuffl": [247, 421], "sid": 369, "siddhi": 301, "side": [325, 378, 379], "sidebar": [335, 380], "sight": 361, "sigmoid": 73, "sigmoid_focal_loss": 260, "sign": [269, 409, 423, 432], "signal": 25, "signextend16": 409, "signific": [303, 319, 325, 361, 372, 428], "significantli": [9, 302, 307, 354, 372, 406, 408], "silu": 73, "sim": 359, "simd": [399, 400, 404], "similar": [2, 28, 259, 260, 279, 319, 325, 347, 361, 370, 372, 375, 376, 391, 400, 403, 404, 406, 407, 419], "similarli": 32, "simpl": [1, 33, 36, 44, 256, 257, 260, 302, 356, 372, 376, 378, 385, 388, 400, 408, 418, 432], "simplest": [311, 375], "simpli": [340, 345, 346, 351, 352, 383, 384], "simplic": [355, 432], "simplifi": [270, 309, 319, 338, 358, 359, 387, 391, 420], "simul": [307, 390, 409, 410], "sin": [32, 83], "sinc": [35, 303, 349, 377, 384, 405, 406, 408, 432], "sine": 32, "singl": [1, 21, 266, 320, 336, 337, 350, 352, 361, 372, 402, 407], "single_lay": 24, "singlenod": 354, "sink": 429, "site": [361, 424], "situat": [316, 391, 406], "six": [323, 324, 331, 334, 336, 337, 338, 340, 343, 344, 357, 363, 368], "size": [25, 27, 28, 36, 37, 44, 83, 246, 256, 257, 258, 260, 264, 298, 302, 306, 314, 315, 354, 366, 372, 376, 385, 388, 390, 396, 399, 402, 404, 406, 407, 408, 413, 423, 425, 426, 429, 432], "size_t": [279, 281, 390, 401], "sizeof": 281, "skill": 316, "skip": [28, 289, 314, 315, 361, 402, 414, 432], "skip_special_token": [428, 432], "sky": 36, "skylak": [302, 361], "skylin": 377, "sl_pad": 281, "slice": [24, 40, 279, 396], "slice_desc": 279, "slice_position_id": 83, "slicemask": 150, "slicepositionid": 116, "slide": 382, "slight": 369, "slightli": [350, 351, 371, 388], "slimorca": 347, "slot": [266, 330, 332], "slot_into_contain": 266, "slow": [319, 370], "small": [25, 304, 307, 314, 336, 340, 349, 369, 372, 375, 376, 390, 405, 407, 420, 430], "smaller": [246, 303, 354, 372, 420], "smallest": 25, "smart": 409, "smooth": [264, 265, 327, 328, 329, 372], "smoothedvalu": 264, "smoothieewastaken": 301, "smoothquant": [270, 428], "smoothquantconfig": [247, 428], "snapshot": 427, "snip": 419, "snip_momentum": 28, "snippet": [319, 370, 402], "so": [0, 23, 25, 32, 35, 39, 40, 44, 57, 247, 264, 266, 303, 309, 314, 316, 321, 322, 332, 334, 349, 354, 355, 364, 371, 376, 386, 387, 390, 391, 394, 395, 400, 402, 403, 404, 405, 406, 408, 409, 410, 413, 416, 417, 419, 423, 426, 427, 428, 432], "social": 298, "socioeconom": 298, "sock": [317, 318], "socket": [314, 332, 349, 361, 397, 411, 425, 426], "softmax": [36, 83, 260, 279, 303, 398, 407, 408], "softmax_data_t": 281, "softmax_desc": 279, "softmax_param_t": 281, "softwar": [272, 302, 319, 361, 364, 365, 366, 369, 371, 415, 420], "solar": 309, "solid": 265, "solut": [319, 336, 337, 338, 358, 359, 361, 372, 406, 409, 420, 426, 427, 428], "solv": [258, 300, 354, 405, 406, 423], "some": [44, 57, 181, 195, 247, 266, 270, 302, 322, 332, 335, 338, 341, 345, 351, 358, 370, 372, 376, 379, 380, 381, 382, 383, 384, 385, 387, 388, 389, 390, 391, 394, 395, 396, 400, 401, 405, 409, 423], "someth": [243, 361], "sometim": [57, 391, 423], "soni": 301, "soon": 425, "sort": [25, 426, 427], "sort_objects_by_scor": 266, "sort_objects_left_to_right": 266, "sort_objects_top_to_bottom": 266, "sota": 354, "sound": 369, "soundfil": 369, "sourc": [0, 1, 2, 4, 5, 6, 8, 9, 14, 15, 17, 20, 21, 22, 23, 24, 25, 27, 28, 29, 30, 32, 33, 35, 36, 37, 39, 40, 41, 42, 44, 45, 47, 49, 50, 52, 53, 54, 55, 57, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 76, 77, 78, 79, 80, 81, 82, 84, 85, 86, 87, 88, 89, 90, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, 246, 247, 250, 251, 252, 255, 256, 257, 258, 259, 260, 262, 263, 264, 265, 266, 267, 268, 314, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 347, 349, 354, 357, 361, 362, 363, 368, 372, 383, 384, 385, 388, 400, 407, 413, 415, 420, 426, 432], "source_imag": 374, "source_op": 121, "sp": 279, "space": [298, 300, 390, 399, 402, 432], "spaci": [334, 371, 375], "spacy_model": [371, 375], "span": [36, 44, 266], "sparelib": 407, "spars": [55, 270, 272, 281, 302, 390, 398, 399, 408, 413, 420, 437], "sparse_lib_dump": 410, "sparse_lib_verbos": 410, "sparse_lib_vtun": 410, "sparse_matmul": [279, 410], "sparse_matmul_desc": [279, 398], "sparse_matmul_desc_t": 279, "sparse_matmul_t": 279, "sparse_ptr": 281, "sparse_ratio": 413, "sparse_schem": 281, "sparse_x_dens": 281, "sparse_x_spars": 281, "sparselib": [293, 390, 398, 436], "sparselib_verbos": 410, "sparsiti": [47, 55, 309, 397, 413, 419], "sparsity_al": 412, "sparsity_decay_typ": 28, "spatial": [263, 399, 405], "speak": 355, "speaker": [340, 369, 375], "spec_softmax_typ": 281, "spec_translnorm_typ": 281, "spec_typ": [281, 413], "special": [57, 248, 369, 372, 401, 407], "specif": [9, 35, 57, 256, 257, 265, 269, 270, 280, 282, 298, 299, 303, 309, 314, 316, 319, 326, 327, 328, 329, 330, 331, 332, 340, 349, 357, 361, 369, 371, 372, 385, 387, 390, 391, 399, 404, 405, 406, 412, 413, 416, 417, 418, 423, 429, 432], "specifi": [6, 23, 25, 29, 32, 48, 57, 246, 247, 264, 269, 270, 313, 316, 317, 318, 319, 330, 332, 334, 351, 365, 366, 369, 371, 372, 375, 391, 392, 396, 401, 405, 407, 413, 423, 430], "speech": [309, 319, 321, 340, 418, 420], "speechbrain": 369, "speecht5": [340, 355, 369], "speecht5_tt": 309, "speed": [36, 44, 315, 325, 337, 350, 361, 369, 387, 391, 425, 426, 427, 432], "speedup": [304, 314, 340, 349], "spell": 269, "spk_id": 375, "splice": 57, "split": [24, 57, 83, 289, 348, 349, 369, 372, 390, 399, 403, 405, 406, 428], "split_batch": 25, "split_output": 281, "spmm": [399, 407, 413], "spmm_desc": 398, "spmm_kern": 398, "spmm_type": 281, "spmm_vnni": 281, "spoken": 369, "spot": 428, "spr": [320, 337, 408], "spycsh": [364, 365], "sq": 428, "sq_config": 428, "sq_model": 428, "sql": [309, 334, 368], "sqlcoder": [309, 368], "sqlcoder2": 309, "sqrt": [73, 387, 391, 407], "squad": 304, "squadv1": 304, "squar": [33, 36, 44, 73, 350], "squareddiffer": [57, 73, 395], "squeez": 83, "src": [28, 281, 388, 401, 409, 413], "src0": [281, 413], "src1": [281, 389, 400, 413], "src1_perm": 389, "src2": [281, 400, 413], "src_data": 394, "src_data_typ": 413, "src_dt": 413, "src_k": 281, "src_m_": 394, "src_perm": 57, "src_q": 281, "src_shape": 394, "src_str": 57, "src_stride": 394, "src_t": 281, "src_v": 281, "srcptr": 281, "srcstride": 281, "srikanth": 301, "ssd": [281, 401, 413], "ssh": [314, 322, 410], "sshd_port": 349, "sshleifer": 304, "sst": [302, 304, 306, 418], "sst2": [289, 302, 304, 389, 393], "st": 391, "stabil": [387, 429], "stabilityai": 428, "stabl": [9, 25, 133, 134, 135, 136, 216, 217, 218, 221, 222, 223, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 272, 302, 308, 332, 346, 347, 348, 349, 396, 405, 420, 432], "stable_diffus": 9, "stable_diffusion_v1_4": 425, "stable_diffusion_v1_5": 425, "stable_diffusion_v2_1": 425, "stablediffusion_bf16convert": 150, "stablediffusion_collectqdqinfo": 150, "stablediffusion_collectquantinfo": 212, "stablediffusion_explicitnhwctranspos": 150, "stablediffusion_explicitnhwctransposeqat": 150, "stablediffusion_insertquantnod": 150, "stablediffusion_mhareshap": 150, "stablediffusion_quantizefus": 150, "stablediffusion_reshapefus": 150, "stablediffusioninstructpix2pixpipelin": 9, "stablediffusionsafetycheck": 9, "stablelm": 428, "stack": [73, 261, 408], "stage": [347, 350, 369], "stai": [32, 432], "stand": [57, 372, 377, 387], "standard": [35, 259, 342, 370], "stanford": [263, 314, 349], "stanford_alpaca": [314, 349], "star": 304, "starcod": [309, 363], "starcoder_peft_finetuned_model": 349, "start": [25, 36, 44, 57, 270, 314, 319, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 335, 336, 337, 338, 340, 341, 343, 344, 347, 349, 355, 356, 360, 363, 366, 367, 370, 378, 379, 380, 381, 382, 384, 389, 395, 414, 435], "start_end_logit": 150, "start_pipelin": 49, "start_posit": [36, 44], "start_step": [28, 419], "startendlogit": 214, "startup": [361, 372], "stat": [25, 410], "state": [9, 25, 36, 39, 40, 44, 246, 302, 372, 397, 411, 429], "state_dict": 25, "static": [38, 57, 251, 278, 281, 300, 304, 305, 306, 350, 389, 392, 400, 403, 405, 418, 430], "static_addr": 400, "static_group": 247, "staticquantconfig": 247, "statist": [25, 255], "statsit": 25, "statu": [270, 298, 334, 366, 417, 423], "status_update_r": 361, "std": [278, 279, 280, 281, 398, 400, 401], "stderr": [17, 361], "stdev": 25, "stdout": [345, 361, 383, 384], "steadili": 340, "stella": [372, 376], "step": [21, 25, 47, 57, 246, 256, 257, 307, 314, 315, 321, 345, 347, 349, 352, 372, 383, 384, 386, 387, 389, 391, 392, 393, 394, 395, 396, 400, 405, 407, 408, 413, 420, 425, 432], "step0": 401, "step1": [400, 401, 408], "step2": [40, 400, 401, 408], "step3": [401, 408], "still": [47, 57, 269, 272, 340, 382, 395, 402, 420, 423, 427], "stop": [24, 361], "stopforward": 24, "stopgradi": 73, "storag": [25, 278, 370, 372], "store": [25, 28, 30, 35, 243, 260, 319, 334, 338, 357, 358, 370, 387, 391, 392, 395, 396, 399, 400, 401, 402, 403, 405, 406, 407, 409, 429], "store2str": 30, "store_fil": 30, "stori": 361, "str": [0, 6, 21, 23, 27, 28, 35, 36, 44, 45, 57, 95, 184, 246, 247, 250, 251, 255, 264, 267, 268, 289, 371, 372, 376, 421], "str2list": 57, "straight": 57, "straightforward": [309, 319, 357, 378], "strategi": [129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 151, 152, 153, 154, 155, 156, 157, 159, 160, 161, 162, 163, 164, 165, 166, 167, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 182, 183, 185, 186, 187, 188, 189, 190, 191, 193, 194, 196, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 216, 217, 218, 221, 222, 223, 224, 225, 226, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 330, 332, 370, 372, 402], "stream": [25, 43, 316, 340, 363, 394, 425, 426], "stream_mod": [336, 340, 375], "stream_t": 278, "streamer": [429, 432], "streamlin": [347, 372], "strict": [247, 373], "strictli": [351, 394], "stride": [394, 399], "strided_slic": 83, "stridedslic": 120, "string": [0, 23, 30, 47, 57, 62, 243, 244, 247, 262, 266, 268, 280, 303, 336, 351, 371, 387, 390, 391, 394, 401, 416, 417, 419], "strong": 372, "stronger": 376, "strongli": [345, 383, 384], "struct": [279, 281, 400, 401], "structur": [57, 266, 304, 319, 364, 365, 366, 372, 387, 388, 390, 404, 408, 412, 419], "structure_model_path": 359, "student": [303, 304], "studio": [308, 327, 328, 329], "style": [0, 25, 300, 345, 346, 347, 349, 352, 361, 383, 384], "sub": [49, 57, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 151, 152, 153, 154, 155, 156, 157, 159, 160, 161, 162, 163, 164, 165, 166, 167, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 182, 183, 185, 186, 187, 188, 189, 190, 191, 193, 194, 196, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 266, 337, 375, 387, 390, 391, 400, 407, 408], "sub_func": 281, "sub_func_level": 413, "sub_graph": [58, 387, 390], "subclass": [25, 95, 184, 246, 278, 279], "subdir": 413, "subdirectori": 386, "subfold": 351, "subfunc_level": [281, 413], "subfunc_level_max": [281, 413], "subgraph": [49, 57, 215, 390, 392], "subgraph_match": [150, 390], "subgraphmatch": [215, 390], "subject": [268, 309, 351, 415], "submit": [300, 302, 347], "submodul": [9, 24, 330, 332, 386, 388, 432], "suboptim": 407, "subsampl": 25, "subsequ": [24, 390, 405, 408], "subset": [25, 350], "subsidiari": 415, "substanti": [319, 370, 372, 429], "substitut": [23, 313, 316, 317, 318, 363], "subtask": 371, "subtoken": 23, "succeed": 302, "success": [308, 378], "successfulli": [313, 316, 317, 318, 363, 366, 387, 395], "successor": 403, "sudhanshu": 301, "sudo": [323, 324, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 357, 363, 368, 369], "suggest": [305, 307, 309, 346, 348, 349, 352, 400], "suit": [309, 357, 369, 372, 378], "suitabl": [324, 327, 328, 329, 338, 361, 372, 405], "sum": [36, 264, 289, 303, 389, 406, 408, 409, 413], "summar": [303, 304, 309, 316, 319, 349, 430], "summari": [289, 425], "summit": 432, "sumptr": 281, "sumstep": 281, "sunak": 377, "super": [50, 387, 390], "supercel": 266, "supercell1": 266, "supercell2": 266, "supercharg": 420, "superclass": 9, "superior": [342, 370], "supervis": [256, 257, 420], "suppli": [24, 25, 391, 395, 396], "support": [8, 24, 25, 28, 30, 45, 48, 57, 62, 184, 264, 266, 281, 306, 308, 314, 319, 321, 325, 336, 337, 345, 349, 350, 354, 361, 364, 365, 366, 367, 369, 371, 372, 375, 378, 383, 384, 386, 387, 388, 389, 390, 394, 395, 401, 405, 406, 408, 410, 412, 413, 418, 419, 420, 421, 422, 429, 433], "supported_pattern": 387, "supported_typ": 28, "supported_valu": 28, "suppress": 266, "surav": 301, "sure": [57, 181, 266, 289, 308, 309, 314, 315, 316, 317, 318, 321, 323, 324, 326, 330, 332, 335, 337, 343, 349, 355, 361, 362, 366, 372, 380, 387, 402, 413], "surfac": 319, "surgeon": 432, "surround": 361, "sw": 306, "swag": 304, "sweep": 181, "sweet": [428, 430], "sweetnotebook": 304, "switch": [25, 314, 331, 332, 335, 349, 380], "swizzl": 409, "sy": [22, 247, 361], "sym": 247, "symbol": [44, 262, 373], "symmetr": [57, 395, 405, 413, 423], "synaps": 425, "sync": [330, 405], "synchron": 264, "synchronis": 402, "synchronize_between_process": 264, "synthes": 369, "synthesi": 369, "sys_nic": [314, 315, 347, 366], "sysroot_linux": 426, "system": [0, 35, 302, 313, 314, 316, 324, 327, 328, 329, 333, 339, 342, 361, 370, 372, 378, 386, 425], "system_messag": 0, "systemat": 428, "systemctl": 334, "t": [21, 25, 36, 44, 57, 256, 257, 258, 266, 279, 281, 303, 313, 314, 315, 316, 317, 318, 347, 349, 350, 351, 361, 372, 385, 394, 396, 399, 400, 402, 405, 407, 408, 409, 413, 426], "t5": [272, 302, 304, 314, 363, 430], "ta": 301, "tab": [335, 380], "tabl": [266, 323, 324, 326, 327, 328, 329, 330, 331, 332, 336, 338, 340, 343, 344, 363, 366, 372, 390, 401, 409], "table_bbox": 266, "table_object": 266, "table_span": 266, "table_structur": 266, "table_structure_to_cel": 266, "tabul": 351, "tacotron": 262, "tag": 35, "tail": [57, 395, 410], "tailor": [309, 319, 357, 361, 372], "take": [24, 25, 298, 303, 314, 319, 335, 337, 349, 370, 380, 389, 391, 394, 400, 408, 409], "taken": [25, 36, 44, 371, 379], "talent": 409, "talk": [341, 381, 387, 420], "talker": 355, "talkingbot": [309, 319, 320, 339, 340, 369], "talli": 25, "tangobert": 430, "tangobertnotebook": 304, "tanh": [73, 394, 401, 413], "tanspos": 413, "target": [27, 246, 256, 257, 258, 260, 264, 314, 315, 347, 349, 389, 409, 419], "target_include_directori": 388, "target_link_librari": 388, "target_node_nam": 57, "target_s": [20, 256, 257, 260], "target_spars": [28, 419], "target_sparsity_ratio": 306, "task": [44, 45, 246, 270, 302, 303, 304, 309, 314, 316, 319, 321, 340, 346, 348, 349, 350, 354, 368, 369, 371, 372, 375, 393, 401, 407, 410, 418, 426, 427, 428, 430], "task_nam": [392, 393], "task_typ": 422, "tasks_list": [309, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 338, 340, 343, 344, 375], "tasktyp": 422, "tatr": 359, "tatsu": 349, "taught": 361, "tbd": 322, "tce": 361, "tcmalloc": 354, "tdp": 385, "tdpbf16p": 403, "tdpbssd": 405, "teach": 350, "teacher": [303, 304], "teacher_model": [246, 303, 306], "team": 298, "tech": [272, 319, 420], "techcrunch": 420, "technic": 420, "techniqu": [270, 302, 304, 306, 354, 423], "technologi": [319, 325, 405], "tee": 330, "tel2p1": [425, 426], "tell": [4, 309, 311, 316, 318, 319, 321, 361, 364, 365, 366, 367, 370, 375, 389, 391, 400, 401], "temperatur": [259, 303, 321, 371, 376, 428, 432], "templat": [0, 23, 279, 281, 314, 341, 349, 381], "temporari": [298, 407], "temporarili": 298, "ten": [369, 420], "tendenc": 372, "tendorflow": 243, "tensor": [23, 24, 25, 27, 32, 33, 36, 37, 38, 39, 40, 41, 44, 49, 52, 53, 54, 55, 57, 62, 83, 111, 181, 243, 244, 246, 256, 257, 258, 260, 263, 264, 281, 332, 387, 388, 389, 391, 392, 394, 396, 407, 412, 413, 421, 423], "tensor_desc": [280, 401], "tensor_dtyp": 280, "tensor_ftyp": 280, "tensor_list": 55, "tensor_nam": [55, 62, 243], "tensor_shap": 280, "tensorflow": [50, 53, 63, 64, 66, 67, 68, 69, 74, 76, 81, 84, 85, 88, 89, 90, 92, 93, 96, 97, 99, 100, 103, 105, 106, 109, 119, 120, 124, 243, 269, 299, 303, 306, 308, 361, 388, 395, 423], "tensorflowextractor": 53, "tensorflowmodel": [53, 243], "tensorslicedataset": 73, "tent": 247, "teq": 432, "teqconfig": [247, 432], "term": [303, 347, 372, 404, 407, 409, 415, 416, 417, 421, 423], "termin": 313, "tesseract": 359, "test": [265, 266, 269, 299, 302, 304, 319, 325, 330, 332, 346, 358, 372, 397, 411, 413, 414, 425, 426], "test_": 413, "test_doc": 338, "test_finetuning_data": 314, "test_infer": 315, "test_spmm_vnni_kernel": 398, "text": [9, 266, 272, 289, 302, 304, 309, 314, 319, 320, 321, 324, 331, 332, 333, 336, 340, 342, 343, 344, 348, 349, 350, 354, 363, 370, 371, 372, 376, 378, 379, 382, 393, 401, 410, 415, 418, 426, 427, 428, 430, 432], "text2imag": [309, 321, 375], "text2speech": 369, "text_classifi": 418, "text_encod": 9, "text_gen": 316, "text_gen_qa": 316, "text_gen_summari": 316, "text_gener": [312, 316, 354, 364, 365, 366], "text_to_sequ": 262, "text_to_speak": 369, "textattack": [304, 392], "textbot": [340, 345, 383, 384], "textbot_vllm": 367, "textchat": [309, 311, 321, 323, 326, 327, 328, 329, 330, 338, 375], "textchatclientexecutor": 375, "textencdoer_word_embed": 150, "textencoder_attentionmaskaddreshap": 150, "textencoder_attentionreshap": 150, "textencoder_casualattentionmask": 223, "textencoder_causal_attention_mask": 150, "textencoder_kvreshap": 150, "textencoder_mulreshap": 150, "textencoder_qreshap": 150, "textencoder_softmaxreshap": 150, "textencoder_wordembed": 216, "textencoderv1": [216, 233, 234, 238, 239, 240], "textgen": [309, 312], "textgenerationfinetuningconfig": 319, "textract": 359, "textstream": [429, 432], "texttospeech": 369, "textual": [349, 379], "textunderscor": 407, "textvoicechatexecutor": 311, "textvqa": 350, "tf": [36, 57], "tf_checkpoint_path": 36, "tf_dtype": [62, 243, 244], "tf_dtype_id": 243, "tf_extract_oper": 243, "tf_extractor": [50, 51], "tf_util": 58, "tgi": [309, 312], "tgi_engine_param": 363, "tgi_serv": 312, "th": [57, 391], "than": [25, 29, 255, 258, 265, 266, 289, 337, 346, 348, 349, 352, 359, 372, 376, 389, 390, 391, 400, 405, 407, 412, 413, 423, 426, 432], "thank": [301, 353, 359, 360, 374], "thch": 369, "theblackcat102": 349, "thei": [21, 32, 35, 57, 247, 266, 298, 303, 316, 355, 361, 372, 386, 395, 396, 399, 400, 401, 403, 407, 413], "them": [24, 25, 32, 39, 40, 49, 52, 53, 57, 266, 268, 270, 335, 347, 350, 352, 361, 372, 380, 387, 388, 391, 400, 403, 405, 408, 409, 423], "therebi": [370, 372], "therefor": [57, 305, 319, 372, 399, 404, 407, 409, 423], "thi": [0, 5, 9, 16, 17, 18, 19, 20, 21, 24, 25, 32, 36, 37, 44, 47, 50, 57, 220, 246, 247, 256, 257, 258, 259, 260, 264, 266, 270, 278, 279, 280, 281, 298, 300, 302, 303, 305, 307, 309, 316, 319, 322, 323, 324, 326, 327, 328, 329, 330, 331, 332, 333, 334, 335, 336, 337, 338, 339, 340, 342, 343, 344, 345, 347, 348, 349, 350, 353, 354, 355, 356, 357, 358, 359, 360, 361, 363, 364, 365, 368, 369, 371, 372, 373, 374, 375, 376, 377, 378, 380, 383, 384, 385, 387, 388, 389, 390, 391, 394, 395, 396, 399, 400, 401, 402, 403, 404, 405, 406, 407, 408, 409, 412, 413, 415, 416, 418, 420, 423, 425, 426, 427, 428, 429, 432], "thing": 390, "think": 377, "thinner": 407, "third": [57, 300, 391, 404, 409, 415], "those": [24, 25, 36, 44, 266, 348, 349, 355, 361, 407, 423], "though": [336, 337, 338, 358], "thought": 391, "thread": [314, 349, 371, 397, 421], "thread_elt_offset": [281, 400], "thread_num": 281, "threat": 361, "threaten": 298, "three": [47, 302, 337, 340, 352, 356, 372, 387, 391, 395, 407, 408, 423], "threshold": [55, 260, 266, 372, 413, 429], "through": [23, 262, 270, 272, 289, 302, 309, 314, 324, 327, 328, 329, 333, 334, 336, 337, 338, 339, 340, 342, 343, 344, 349, 352, 361, 363, 365, 372, 377, 387, 400, 403, 404, 405, 410, 422], "throughout": 361, "throughput": [289, 302, 307, 342, 365, 367, 370, 397, 405, 425], "througput": 319, "throw": [24, 25], "thu": [258, 354, 372, 404, 423], "thudm": 309, "tid": 402, "tidm": 402, "tidn": 402, "tight": 402, "tiiuae": [349, 428], "tile": [65, 73, 111, 399, 403, 405, 407, 408, 409, 413], "tile_gemm": 402, "tile_k": 405, "tile_m": [405, 413], "tile_n": 413, "tile_w": 281, "tiledcol": 402, "tiledindex": 402, "tiledrow": 402, "tileloadd": 403, "tilem": 281, "tilen": 281, "tilestor": 405, "till": 57, "timbrook": 337, "time": [24, 25, 37, 314, 315, 316, 319, 321, 325, 354, 361, 364, 370, 371, 372, 379, 382, 385, 389, 396, 399, 400, 402, 403, 404, 405, 406, 407, 408, 409, 411, 413, 414, 420, 423, 425, 426, 427, 428, 429, 432], "tini": [44, 304, 355, 430], "tinybert_general_4l_312d": 304, "titl": [389, 415], "titsworth": 301, "tmp": [281, 401, 403, 405, 408], "tmp1": 409, "tmp2": 409, "tmp2m": 281, "tmp3": 409, "tmp4": 409, "tmp_trainer": 302, "to_": 25, "to_diff_dict": 247, "to_gradio_chatbot": 0, "to_json_fil": 247, "to_openai_api_messag": 0, "todai": 371, "todo": [281, 431], "togeth": [24, 25, 302, 361, 369], "togethercomput": 428, "toi": 372, "token": [5, 9, 23, 32, 36, 44, 247, 266, 289, 302, 304, 306, 307, 314, 324, 342, 348, 349, 354, 370, 371, 372, 375, 385, 392, 396, 418, 425, 428, 429, 430, 432], "token_idx": [37, 40, 41], "token_typ": 36, "token_type_embed": [150, 387], "token_type_embeddings_v1": [150, 387], "token_type_id": [33, 36, 44, 396], "tokenclassifieroutput": [33, 36, 44], "tokenizer_class": 349, "tokenizer_config": 349, "tokenizer_dir": 393, "tokenizer_nam": [314, 315, 349, 352], "tokenizer_name_or_path": 375, "tokens_in_t": 266, "tokentypeembed": [224, 387], "tokentypeembeddingsv1": [225, 387], "tokentypeid": 73, "toler": 416, "tolerable_loss": 423, "tomaarsen": 38, "tone": 369, "too": [57, 314, 315, 372, 387, 399, 400, 405], "tool": [302, 304, 314, 315, 321, 349, 369, 372, 373, 389, 396, 398, 413, 420], "toolkit": [272, 302, 304, 361, 362, 363, 420], "top": [25, 57, 266, 272, 302, 356, 369, 376, 382, 404, 420, 421, 425], "top1": 347, "top2": 376, "top200": 376, "top60": 376, "top_k": [83, 371], "top_n": [14, 372], "topic": [270, 335, 372, 380], "topk": [25, 122, 264], "topologi": 50, "tor": 246, "torch": [9, 23, 24, 25, 27, 28, 32, 33, 36, 37, 39, 40, 41, 44, 54, 127, 244, 246, 260, 261, 264, 289, 302, 303, 314, 315, 330, 332, 340, 349, 353, 374, 418, 421, 422, 426, 427, 432], "torch_ccl": [314, 349], "torch_ccl_path": [314, 332, 349], "torch_cuda_arch_list": 330, "torch_dtyp": [346, 422, 432], "torch_embed": 150, "torch_extract_oper": 244, "torch_extractor": 51, "torch_ip_insert_bia": 150, "torch_unpack_baddbmm": 150, "torch_util": 58, "torchaudio": [332, 340], "torchembed": 226, "torchextractor": 54, "torchinnerproductinsertbia": 227, "torchinsertbf16nod": [150, 189], "torchpaddingsequ": 230, "torchpaddingsqu": 150, "torchprofil": 304, "torchrun": 352, "torchscript": [27, 28, 54, 115, 244, 246, 289], "torchunpackbaddbmm": 228, "torchvis": [255, 264, 332, 361], "total": [28, 29, 36, 57, 289, 310, 314, 349, 385, 391, 395, 402, 409, 410, 425, 426], "total_token": 361, "total_val_output": 351, "toward": 298, "tpp": 332, "tpp_cache_remapped_weight": 332, "tr": 24, "trace": [24, 27, 28, 289, 315, 345, 372, 383, 384, 389], "tracedict": 24, "track": [23, 25, 264, 319], "trade": [319, 432], "trademark": 302, "tradeoff": [306, 403], "tradit": [361, 370, 401, 429], "traffic": [319, 349, 370], "train": [1, 5, 17, 25, 28, 47, 246, 265, 272, 302, 303, 306, 314, 319, 348, 354, 372, 412, 419, 420, 428, 430, 432], "train2017": 350, "train_asr": 353, "train_backbon": 255, "train_batch_s": 247, "train_data": 376, "train_dataload": 247, "train_dataset": [247, 302, 306], "train_dir": 348, "train_fil": [314, 349], "train_func": [246, 247], "train_group_s": 376, "train_imag": 350, "train_it": 247, "train_len": 247, "train_pad": 247, "train_pad_v": 247, "train_shuffl": 247, "train_transl": 353, "train_translation_revers": 353, "trainabl": [354, 432], "trainer": [286, 302, 304, 305, 306], "training_step": 246, "training_step_length_adapt": 246, "trainingargu": 376, "tranform": [330, 332], "transcrib": 369, "transcript": [309, 321, 340, 369], "transfer": [246, 303, 336, 361], "transform": [9, 17, 23, 256, 257, 259, 269, 270, 289, 293, 299, 300, 303, 307, 309, 313, 314, 315, 316, 319, 320, 323, 324, 326, 330, 331, 332, 337, 343, 346, 347, 348, 349, 350, 352, 354, 355, 359, 364, 365, 366, 369, 372, 374, 376, 386, 387, 388, 390, 392, 395, 396, 400, 401, 406, 407, 408, 409, 413, 415, 416, 417, 418, 419, 420, 422, 423, 424, 425, 426, 427, 428, 429, 432, 436], "transformer2dmodel_attentionmaskaddreshap": 150, "transformer2dmodel_constantofshapewithmul": 150, "transformer2dmodel_encoderhiddenstatesreshap": 150, "transformer2dmodel_ffninputslic": 233, "transformer2dmodel_ffninputslice_1": 234, "transformer2dmodel_ffnslic": 150, "transformer2dmodel_ffnslice_1": 150, "transformer2dmodel_getsamplebatch": 150, "transformer2dmodel_qkvprereshap": 150, "transformer2dmodel_qkvreshap": 150, "transformer2dmodel_qkvreshape4d": 150, "transformer2dmodel_qkvreshapeto4d": 237, "transformer2dmodel_sampleslic": 150, "translat": [25, 304, 309, 321, 340, 430], "transparam": 20, "transpos": [36, 44, 55, 83, 108, 389, 390, 398, 399, 403, 405, 406, 409, 413, 421, 437], "transpose_4b_8x8": 407, "transpose_batch_matmul": [150, 387], "transpose_copy_param": 281, "transpose_for_scor": [36, 44], "transpose_matmul": 279, "transpose_matmul_desc": 279, "transpose_mha": 279, "transpose_mha_desc": 279, "transpose_mha_io": 281, "transpose_mha_io_max": 281, "transpose_mha_step1_param": 281, "transpose_mha_step2_param": 281, "transpose_mha_step3_param": 281, "transpose_mode_int8": 55, "transposebatchmatmul": [73, 241, 387], "transposit": 409, "travel": 361, "treat": [258, 387], "tree": [43, 266, 315, 352, 354, 373], "trend": 370, "trial": 246, "trie": 373, "trigger": [248, 372], "tripathi": 301, "triton": 309, "triton_backend": 366, "triton_cli": 366, "triton_inference_serv": 366, "triton_neuralchat": 364, "triton_neuralchat_gpu": 365, "tritoncli": 366, "tritonserv": [364, 365, 366], "triumph": 361, "troll": 298, "true": [9, 17, 23, 24, 25, 28, 36, 44, 55, 246, 247, 250, 251, 256, 257, 260, 264, 266, 288, 302, 305, 306, 307, 314, 317, 319, 324, 326, 330, 332, 334, 336, 338, 340, 344, 346, 347, 348, 349, 350, 352, 354, 358, 361, 363, 366, 371, 372, 373, 374, 375, 376, 386, 387, 389, 390, 396, 400, 401, 407, 410, 413, 416, 417, 422, 423, 428, 429, 432], "true_sequenti": 247, "truncat": [29, 302, 349], "trust": [314, 349, 420], "trust_remote_cod": [307, 349, 429, 432], "truth": [256, 257, 258, 288, 376], "truthfulqa": [346, 425], "truthfulqa_mc": 349, "try": [309, 361, 363, 372, 375, 390, 423], "ts_desc": [280, 398, 401], "ts_descs_": [280, 401], "tsai": 301, "tsmodelforcausallm": 428, "tt": [309, 319, 322, 334, 336, 340, 375, 402], "tts_finetun": 355, "tts_multilang": [336, 340, 369, 375], "ttsdatasetargu": 355, "ttsmodelargu": 355, "tune": [5, 55, 246, 270, 272, 302, 303, 309, 320, 357, 372, 376, 415, 416, 417, 419, 420, 422, 428, 432, 439], "tune_metr": [302, 419], "tuning_criterion": 423, "tuningcriterion": 423, "tunnel": 322, "tupl": [29, 32, 33, 36, 37, 40, 41, 44, 45, 57, 258, 260], "turbo": [324, 397, 411, 425, 426], "turn": [370, 407], "tutori": [270, 302, 347, 388, 426, 427], "tweak": 44, "twice": [17, 402, 408], "two": [24, 25, 57, 256, 257, 266, 270, 300, 303, 314, 316, 330, 332, 340, 347, 349, 350, 351, 355, 369, 371, 372, 373, 376, 384, 387, 390, 391, 393, 394, 396, 400, 401, 403, 406, 407, 408, 409, 417, 418, 419, 423], "twofold": 407, "tx": 20, "txt": [265, 302, 308, 309, 315, 319, 322, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 346, 348, 349, 352, 353, 354, 355, 357, 358, 359, 360, 361, 363, 368, 370, 371, 372, 373, 374, 376, 377, 383, 384, 387, 388, 390, 393, 410, 412, 426, 427, 432], "txt2img": [335, 380], "ty": 20, "tyler": 301, "type": [2, 4, 21, 25, 27, 28, 29, 35, 36, 45, 52, 53, 54, 57, 62, 95, 184, 243, 244, 246, 247, 258, 264, 282, 302, 303, 304, 305, 309, 313, 316, 317, 318, 319, 321, 324, 340, 361, 363, 367, 372, 375, 388, 389, 390, 392, 395, 398, 400, 401, 406, 412, 413, 416, 417, 419, 421, 422, 423, 425], "type1": 395, "type2": 395, "typedef": 281, "typeerror": 24, "typenam": [279, 281], "typic": [0, 319, 369, 370, 372, 409, 432], "typing_extens": [323, 324, 331, 334, 336, 337, 338, 340, 343, 344, 357, 363, 368], "u": [9, 332, 340, 351, 353, 355, 412, 420, 428], "u8": [158, 392, 394, 401, 408, 413], "u8s8": 305, "u8u8": 305, "ubuntu": [302, 369, 397, 411], "ubuntu22": [313, 314, 315, 316, 317, 318], "ubuntu_v": [314, 315, 349], "ui": [378, 382], "uint64_t": 280, "uint8": [28, 407, 423], "uint8_t": [281, 400, 401], "uiuc": [309, 313, 331], "uk": 377, "ultim": [309, 357, 423], "ultra": 420, "ultrachat": 349, "un": [258, 412], "unabl": 363, "unaccept": 298, "unawar": 372, "unbox_numpy_nul": 25, "unbreak": 361, "uncas": [36, 302, 304, 306, 392, 418], "uncased_swag": 304, "uncertain": 300, "unchang": 25, "undef": [280, 281, 400, 401], "under": [158, 246, 302, 309, 340, 351, 353, 355, 364, 365, 366, 372, 373, 387, 388, 389, 392, 406, 413, 415], "understand": [316, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 338, 340, 343, 344, 363, 372, 402, 405], "unencumb": 349, "unet": 9, "unet2dconditionmodel": 9, "unexpect": 373, "unfix": 57, "unfortun": 409, "unifi": [346, 407], "uniform": 432, "uniformli": 25, "uninstal": 332, "unintellig": 402, "union": [25, 246, 266, 281], "uniqu": [266, 272, 302, 349, 372, 378], "unique_assign": 266, "unit": [269, 319, 325, 361, 398, 405], "unit_test_util": 401, "unittest": [314, 315], "univers": [266, 314, 349], "unknown": 361, "unleash": 420, "unlik": [400, 429], "unlock": [420, 430], "unnorm": [256, 257], "unordered_map": [280, 401], "unpack": [83, 246, 387], "unprocess": 361, "unquant": 432, "unreach": 375, "unref": 394, "unref_tensor": 394, "unrefernc": 394, "unrel": 354, "unrol": [390, 402, 404], "unseen": 423, "unset": 363, "unsign": [409, 413], "unslic": 36, "unsqueez": [32, 83, 387], "unsqueeze_dim": 32, "unstructur": [304, 371, 372, 419], "until": 405, "untouch": 32, "unus": [32, 247, 401], "unwelcom": 298, "up": [24, 25, 36, 37, 44, 247, 315, 323, 324, 326, 327, 328, 329, 330, 331, 332, 333, 334, 336, 337, 338, 339, 340, 342, 343, 344, 350, 351, 363, 364, 366, 372, 387, 389, 391, 396, 401, 413], "updat": [0, 25, 38, 39, 40, 247, 308, 309, 330, 332, 334, 345, 372, 377, 383, 384, 386, 388, 406, 419, 426, 432], "update_config": 47, "update_keys_to_ignor": 44, "update_last_messag": 0, "upgrad": [309, 315, 321, 426], "upload": [13, 321, 334, 336, 341, 372, 378, 379, 381, 382], "upload_link": 309, "upon": [321, 361, 426, 427, 428, 429, 432], "upper": [30, 322, 401], "upper_bound": 413, "upper_constraint": 30, "upsampl": 260, "upstag": 309, "upto": 24, "upto_lay": 24, "url": [309, 319, 332, 345, 359, 361, 364, 365, 366, 372, 383, 384, 415], "url_of_pdf": 359, "us": [0, 4, 9, 14, 18, 23, 24, 25, 27, 28, 29, 32, 35, 36, 38, 44, 49, 57, 62, 95, 184, 195, 220, 243, 246, 247, 259, 260, 265, 269, 270, 288, 289, 298, 300, 302, 303, 307, 308, 311, 312, 313, 316, 317, 318, 319, 320, 322, 323, 324, 325, 326, 327, 328, 329, 330, 331, 334, 335, 336, 337, 338, 340, 343, 344, 345, 346, 347, 348, 349, 350, 351, 352, 354, 355, 356, 357, 358, 361, 363, 366, 368, 369, 370, 371, 372, 373, 375, 376, 377, 379, 380, 383, 384, 385, 386, 387, 389, 390, 391, 392, 393, 394, 395, 396, 397, 399, 400, 401, 402, 403, 404, 406, 407, 408, 409, 410, 411, 413, 415, 416, 417, 418, 419, 420, 421, 422, 423, 425, 426, 427, 428, 429, 432], "usag": [9, 269, 300, 302, 308, 335, 380, 416, 417, 421, 422, 429, 432], "use_aot_devlist": 432, "use_auth_token": [346, 352], "use_cach": [33, 36, 40, 41, 44], "use_cpu": [346, 354], "use_deepspe": [1, 326, 330, 349], "use_diff": 247, "use_double_qu": 247, "use_fast_token": [314, 349, 352, 354, 422], "use_full_rang": 247, "use_ggml": 247, "use_gptq": [426, 427], "use_gpu_for_search": 376, "use_gradient_checkpoint": 422, "use_habana": [314, 346, 347, 349, 350, 352], "use_hpu_graph": 315, "use_hpu_graphs_for_train": 346, "use_inbatch_neg": 376, "use_kv_cach": 315, "use_lazy_mod": [314, 346, 347, 349, 350, 352], "use_mpi": [1, 346, 349, 352], "use_mse_search": 247, "use_mxfp4": 332, "use_neural_spe": [4, 247, 319, 327, 328, 329, 422], "use_qu": 247, "use_tpp": 332, "useless": [314, 315, 401], "user": [11, 12, 13, 15, 36, 44, 47, 57, 272, 273, 289, 292, 295, 302, 305, 309, 316, 319, 321, 324, 325, 330, 332, 334, 336, 337, 338, 347, 348, 349, 355, 356, 357, 358, 361, 369, 370, 372, 373, 375, 377, 378, 382, 385, 387, 389, 391, 393, 396, 405, 407, 410, 413, 417, 418, 421, 430, 435, 438], "user_model": 307, "userwarn": 361, "usr": [308, 309], "usual": [303, 314, 349, 391, 399, 409, 423], "uszkoreit": [36, 44], "ut": [398, 401], "util": [23, 29, 44, 57, 62, 243, 244, 256, 257, 270, 288, 309, 319, 321, 323, 326, 330, 331, 332, 334, 338, 345, 357, 358, 361, 370, 371, 372, 373, 383, 384, 387, 395, 399, 406, 409, 413, 432], "uvicorn": [361, 366], "v": [20, 25, 260, 302, 308, 313, 314, 315, 316, 317, 318, 322, 330, 337, 351, 364, 365, 366, 387, 407, 408, 420], "v0": [309, 347, 349, 427, 428], "v1": [9, 38, 288, 304, 309, 313, 316, 317, 318, 321, 324, 340, 346, 349, 350, 351, 359, 361, 363, 364, 367, 372, 376, 421, 425, 427, 428, 429], "v2": [269, 288, 309, 326, 330, 332, 364, 365, 366, 372, 376, 428], "v3": [309, 316, 319, 321, 324, 331, 334, 338, 340, 343, 344, 345, 361, 363, 364, 366, 371, 375, 383, 384, 420, 427, 432], "v4": 9, "v5": 269, "v_bia": 281, "v_proj": [346, 347, 349, 354], "v_scale": 281, "v_weight": 281, "vaddp": 400, "vae": 9, "val": [55, 57, 350], "valhalla": 304, "valid": [57, 246, 289, 303, 306, 309, 321, 348, 354, 375, 395, 415, 424, 432, 438], "validation_accounting_1": 351, "validation_architecture_and_engineering_14": 351, "validation_dir": 348, "validation_electronics_28": 351, "validation_electronics_29": 351, "validation_split_percentag": 349, "valu": [7, 24, 25, 27, 28, 36, 39, 40, 44, 47, 57, 62, 243, 244, 246, 247, 256, 257, 260, 264, 267, 281, 302, 303, 321, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 338, 340, 343, 344, 345, 363, 367, 371, 372, 375, 376, 383, 384, 387, 388, 389, 390, 391, 400, 401, 402, 403, 405, 407, 408, 413, 416, 417, 419, 423, 428, 429, 432], "valuabl": [0, 361], "value_error": 361, "value_st": [39, 40], "var": [57, 281, 317, 318], "var_in": 281, "var_out": 281, "vari": [371, 397, 411, 425, 426, 428], "variabl": [281, 314, 322, 324, 335, 341, 349, 361, 370, 379, 380, 381, 382, 388, 391, 394, 413, 414], "varianc": [25, 395, 406], "variant": [9, 404], "variat": 9, "varieti": [325, 372], "variou": [259, 268, 270, 309, 312, 319, 320, 323, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 336, 338, 339, 340, 342, 343, 344, 361, 363, 369, 372, 378], "vast": 372, "vastli": [314, 349], "vaswani": [36, 44], "vbroadcastss": 410, "ve": [319, 372, 418], "vec": [25, 403], "vec_align_len": 281, "vec_num_per_thr": 281, "vec_num_tail_thr": 281, "vec_tail_len": 281, "veca": 402, "vecb": 402, "vectara": 420, "vector": [23, 25, 278, 279, 280, 281, 357, 370, 394, 398, 400, 401, 402, 404, 407, 409], "vector_comput": [400, 401], "vector_databas": 372, "vectorstor": [309, 372], "vectorstoreretriev": [309, 357], "velankar": 301, "ventur": 361, "verbos": [246, 305], "veri": [25, 256, 257, 259, 316, 351, 396, 399, 401, 402, 405], "verifi": [346, 354, 410], "versatil": [361, 369, 372], "version": [24, 25, 35, 246, 259, 266, 269, 298, 302, 308, 314, 324, 326, 330, 337, 343, 345, 348, 349, 361, 362, 366, 372, 377, 383, 384, 390, 397, 411, 415, 425, 426, 427, 428], "versu": 266, "vfma": 410, "vfmadd": 404, "vfmadd231p": [404, 410], "vg": 350, "vg_100k": 350, "vg_100k_2": 350, "via": [25, 298, 330, 332, 348, 349, 351, 363, 372, 373, 400, 403, 410, 421, 432], "viath": 351, "video": [319, 321, 360, 369, 374, 420], "view": [38, 83, 300, 335, 380, 382, 389, 399, 424], "viewpoint": 298, "villag": 361, "vim": [330, 332], "vincyzhang": 299, "violat": [266, 335, 380], "virtual": [278, 279, 280, 323, 330, 361, 394, 400, 401], "vision": [350, 418, 420], "vision_tow": 350, "visit": [270, 302, 327, 328, 329, 345, 356, 383, 384, 397, 411, 425, 426], "visual": [256, 257, 265, 308], "visualgenom": 350, "vit": [9, 350, 369], "vits2": [340, 369], "vllm": [309, 312], "vllm_serv": 312, "vmovdqu32": 403, "vmovup": [401, 410], "vmware": 420, "vnni": [398, 399, 403, 407, 408, 411, 413, 421, 423, 437], "vnni_data_t": 281, "vnni_noperm_p2013_p1302": 407, "vnni_noperm_p2031_p1302": 413, "vnni_param_t": 281, "vocab": 354, "vocab_s": [33, 36, 44], "vocabulari": 36, "vocod": 369, "voic": [321, 336, 340, 341, 369, 378, 381], "voicechat": [309, 311, 321, 334, 375], "voicechat_api": 340, "void": [279, 280, 281, 394, 398, 400, 401, 402], "volatil": 395, "volum": 370, "vpaddb": 400, "vpxord": 410, "vqa": 350, "vtune": 415, "vv": 361, "vzeroupp": 410, "w": [21, 256, 257, 263, 385, 388, 389, 390, 399, 402, 408, 428, 432], "w4g32": 432, "w8": 432, "w8a8": [319, 432], "wa": [36, 266, 372, 377], "wai": [302, 309, 314, 319, 338, 349, 356, 358, 361, 363, 371, 372, 389, 390, 391, 395, 399, 401, 407, 410], "wait": 361, "walk": [327, 328, 329], "wall": 389, "wandb": 352, "wang": 301, "want": [24, 25, 28, 247, 289, 295, 309, 314, 340, 347, 350, 351, 369, 387, 389, 390, 392, 395, 396, 399, 400, 401, 413, 416, 421, 438], "warm": 396, "warm_up": 302, "warmup": [28, 289, 390, 396], "warmup_it": 390, "warmup_ratio": [314, 349], "warmup_step": [346, 347], "warn": [61, 264, 361], "wast": [396, 405, 406], "watt": [351, 385], "wav": [311, 319, 336, 340, 369, 375], "wav2vec2": 353, "wavelength": 36, "we": [0, 5, 23, 35, 44, 256, 257, 258, 266, 269, 270, 288, 295, 298, 302, 305, 309, 314, 315, 319, 324, 325, 326, 327, 328, 329, 330, 332, 335, 337, 338, 340, 345, 346, 347, 349, 350, 351, 352, 353, 354, 355, 357, 359, 360, 361, 364, 365, 366, 367, 368, 369, 370, 372, 373, 374, 376, 377, 380, 383, 384, 387, 388, 389, 390, 391, 392, 393, 394, 395, 396, 398, 399, 400, 401, 402, 403, 404, 405, 406, 407, 408, 409, 410, 413, 416, 417, 418, 422, 423, 425, 426, 427, 428, 429, 432, 438], "weak": 340, "web": [322, 372, 378], "webpag": 319, "websit": [345, 383, 384], "wed": 361, "week": 420, "wei": [281, 413], "weight": [32, 33, 36, 44, 128, 260, 265, 270, 281, 303, 305, 320, 354, 360, 369, 371, 377, 389, 390, 392, 399, 402, 403, 404, 408, 409, 413, 416, 417, 420, 421, 423, 428], "weight_8bit": 281, "weight_bf16": 281, "weight_data": 55, "weight_decai": [314, 349], "weight_dict": [256, 257], "weight_dtyp": [247, 319, 327, 328, 329, 371, 375, 429, 432], "weight_f8_e4m3": 281, "weight_f8_e5m2": 281, "weight_int8": 281, "weight_optim": 128, "weight_ratio": [250, 251, 416, 417, 423], "weight_typ": [281, 421], "weightonlyqu": 270, "weightpruningconfig": [28, 246], "welcom": [298, 300, 312, 320, 333, 339, 342, 369, 420], "welford": [281, 406], "well": [25, 36, 44, 260, 266, 305, 350, 372, 423, 424, 432], "wenxin": 415, "were": [247, 260, 361, 376], "wget": [319, 323, 324, 326, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 357, 363, 368, 369, 383, 384], "wgt_t": 281, "what": [5, 298, 316, 317, 324, 356, 358, 363, 372, 400, 401, 407, 419], "when": [0, 9, 23, 24, 25, 28, 36, 40, 243, 246, 256, 257, 264, 289, 298, 304, 309, 314, 315, 316, 317, 318, 319, 332, 349, 350, 363, 366, 370, 372, 385, 387, 388, 390, 391, 394, 395, 396, 399, 401, 402, 404, 405, 406, 408, 409, 413, 416, 417, 423, 425, 426, 427, 429], "where": [25, 32, 36, 44, 57, 83, 247, 255, 258, 263, 267, 303, 314, 322, 334, 338, 349, 355, 358, 376, 377, 382, 391, 399, 401, 404, 405, 407, 409, 413, 414, 422], "whether": [4, 9, 28, 35, 247, 264, 295, 316, 330, 332, 340, 349, 364, 371, 372, 373, 376, 378, 387, 389, 395, 413, 421, 438], "which": [5, 17, 24, 25, 29, 32, 35, 36, 44, 49, 52, 53, 54, 57, 195, 243, 246, 247, 255, 256, 257, 260, 265, 266, 270, 298, 303, 307, 308, 309, 314, 316, 319, 325, 330, 331, 332, 334, 335, 336, 340, 346, 347, 349, 350, 354, 361, 370, 372, 375, 376, 377, 380, 386, 387, 388, 390, 391, 395, 396, 399, 400, 401, 402, 403, 404, 405, 406, 408, 409, 412, 413, 416, 419, 421, 423], "while": [25, 57, 258, 288, 307, 335, 338, 358, 371, 372, 378, 380, 388, 391, 395, 402, 405, 408, 413, 423, 427, 432], "whisper": [309, 336, 340, 355, 369, 375], "whisper_larg": 425, "white": 402, "whitespac": 414, "whl": [332, 348, 349, 432], "who": [295, 298, 319, 361, 377, 401, 438], "whole": [25, 57, 304, 316, 389, 390, 404, 405, 406, 408, 410], "whom": 377, "whose": [266, 314, 349, 395], "wide": [17, 302, 303, 312, 319, 320, 364, 365, 366, 372, 376, 401, 402, 423, 432], "wide_resnet101_2": 17, "wide_resnet50_2": 17, "width": [15, 42, 256, 257, 266, 399, 400, 404, 406, 423], "wiki": 298, "wikitext": [304, 425], "window": [264, 302, 308, 309, 320, 327, 328, 329, 386, 420, 429], "window_s": 264, "wino": 288, "winogrand": 426, "wip": [304, 349, 403], "wisdom": 361, "wise": [361, 398, 413, 420, 428, 432, 437], "wish": [0, 415], "witch": 320, "within": [24, 25, 266, 298, 309, 358, 369, 372, 404, 428, 432], "without": [24, 25, 50, 255, 298, 303, 314, 319, 327, 328, 329, 335, 349, 354, 372, 374, 380, 382, 387, 388, 406, 409, 410, 413, 420], "wm": 402, "wmt16": 304, "wn": 402, "woman": 361, "won": [314, 349], "wondrou": 361, "woq": 421, "woq_config": 432, "woq_linear": 421, "woq_model": 432, "word": [23, 36, 44, 247, 266, 304, 319, 369, 372, 373, 409], "word_embed": [150, 388], "wordembed": 242, "work": [25, 259, 271, 317, 318, 347, 353, 359, 360, 363, 374, 377, 396, 401, 418, 425, 427], "work_spac": 281, "workaround": 409, "workdir": 398, "worker": [314, 349], "workflow": [272, 299, 302, 303, 390, 392, 407], "workload": [361, 402, 407], "workshop": 322, "workspac": [322, 364, 365, 390], "workstat": 361, "world": [28, 361, 372, 377, 385], "world_siz": [1, 252, 326, 330, 346, 349, 352], "wors": 432, "worst": 399, "worth": 408, "would": [24, 32, 57, 266, 347, 352, 361, 365, 372, 387, 391, 392, 395, 396, 410, 428], "wrap": 25, "wrapper": [2, 3, 13, 14, 400], "write": [25, 57, 355, 387, 395, 405, 406, 408], "write_back_scal": 405, "write_row_and_zero": 403, "write_tile_to_dst": 405, "write_tile_to_tmp_buf": 405, "written": [24, 313, 316, 349, 369, 382, 391], "wrong": [361, 387, 395], "ws2": 361, "www": [25, 319, 397, 411, 420, 425, 426], "x": [24, 25, 30, 36, 37, 44, 256, 257, 309, 313, 316, 317, 318, 361, 363, 367, 390, 401, 404, 405, 407, 408, 413, 423, 432], "x0": 263, "x1": 263, "x16": 413, "x86": [327, 328, 329, 421], "x86_64": [323, 324, 326, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 357, 363, 368, 383, 384, 386], "xbyak": 401, "xdi": 410, "xed3": 410, "xed64": 410, "xeon": [4, 272, 302, 304, 308, 309, 310, 311, 316, 318, 319, 320, 321, 323, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 336, 337, 338, 339, 340, 342, 343, 344, 354, 360, 361, 363, 364, 365, 366, 367, 369, 370, 374, 375, 399, 408, 411, 415, 420, 423, 425, 426], "xeonplatinum": 397, "xigui": 301, "xiguiwang": 361, "xin3h": 299, "xk": 399, "xl": [314, 349, 372], "xl_peft_finetuned_model": [314, 349], "xlnet": 304, "xlsr": 353, "xlsx": [319, 372, 389], "xpu": [309, 320, 375, 432], "xsum": 304, "xuehaosun": 299, "xxx": [302, 309, 314, 319, 324, 334, 349, 371, 372], "xxxxx_sampl": 355, "xxxxxx": 352, "xyxi": 263, "y": [20, 57, 308, 309, 322, 323, 324, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 354, 357, 358, 363, 368, 401, 426, 432], "y0": 263, "y1": 263, "yac": 366, "yaml": [49, 55, 57, 246, 309, 313, 316, 317, 318, 319, 351, 360, 367, 375, 389, 390, 392, 396, 412], "yann": 419, "ye": 425, "year": [415, 420], "yet": 349, "yi": [281, 301], "yield": [319, 423], "ymal": 55, "you": [0, 24, 25, 28, 32, 33, 36, 44, 57, 247, 259, 269, 270, 289, 300, 302, 303, 305, 308, 309, 313, 314, 315, 316, 317, 318, 321, 322, 323, 324, 326, 327, 328, 329, 330, 331, 332, 333, 334, 335, 336, 337, 338, 339, 340, 342, 343, 344, 345, 346, 347, 348, 349, 350, 351, 352, 355, 356, 357, 358, 361, 363, 364, 365, 366, 367, 368, 369, 370, 371, 372, 375, 376, 377, 378, 380, 383, 384, 385, 387, 388, 390, 391, 392, 395, 396, 400, 401, 403, 410, 412, 413, 415, 416, 418, 419, 423, 424, 426, 427, 428, 432], "you_repo_path": [314, 315], "you_work_dir": 387, "young": 361, "youngjoo": 432, "your": [1, 57, 246, 247, 270, 272, 300, 302, 306, 308, 309, 313, 314, 315, 316, 317, 318, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 335, 336, 337, 338, 340, 341, 343, 344, 346, 350, 351, 355, 358, 363, 364, 365, 366, 369, 370, 371, 372, 376, 378, 380, 381, 385, 387, 388, 389, 390, 391, 400, 401, 413, 415, 418, 420, 432], "your_branch": [314, 315], "your_env_nam": [323, 330], "your_ip": [317, 363], "your_kernel_log": 401, "your_port": [317, 363, 366], "your_pytorch_model_path_or_hf_model_nam": 432, "your_saved_model_dir": 432, "your_training_script": 1, "yourself": [387, 395], "youtub": 420, "yum": 308, "z": 303, "zaker": 420, "zero": [47, 73, 340, 349, 350, 401, 402, 404, 405, 409, 421], "zero2": 347, "zero_point": 247, "zero_tileconfig_start": 403, "zero_upper_row": 403, "zeroextend16": 409, "zeropoint": 423, "zeropointc": 281, "zeroth": 25, "zh": [353, 369], "zhang": [301, 415, 432], "zhenwei": 299, "zmm": [400, 401, 404, 406, 409], "zmm0": 410, "zmm1": 410, "zmm10": 410, "zmm12": 410, "zmm13": 410, "zmm14": 410, "zmm16": 410, "zmm17": 410, "zmm18": 410, "zmm2": 410, "zmm31": 410, "zmm4": 410, "zmm5": 410, "zmm6": 410, "zmm8": 410, "zmm9": 410, "zmm_byte_s": 400, "zmm_mock1": 401, "zmm_src": 401, "zmm_src1": 400, "zmmword": 410, "zoom": [335, 380], "zp": [281, 400, 421], "zp0": 281, "zp_dst": 281, "\u017cyczy\u0144ski": 301, "\u03b1x": 401, "\u03b2": 401, "\u3053\u3093\u306b\u3061\u306f": 369, "\u6b22\u8fce\u6765\u5230\u82f1\u7279\u5c14": [340, 369], "\u89e3\u51b3\u65b9\u6848\u4e3a\u6700\u65b0meta": 420}, "titles": ["conversation", "gaudi_spawn", "intel_extension_for_transformers.langchain.langchain_community.retrievers.child_parent_retriever", "intel_extension_for_transformers.langchain.langchain_community.vectorstores.chroma", "intel_extension_for_transformers.neural_chat.chatbot", "intel_extension_for_transformers.neural_chat.config", "intel_extension_for_transformers.neural_chat.config_logging", "intel_extension_for_transformers.neural_chat.errorcode", "intel_extension_for_transformers.neural_chat.pipeline", "intel_extension_for_transformers.neural_chat.pipeline.plugins.image2image.instructpix2pix_pipeline", "intel_extension_for_transformers.neural_chat.pipeline.plugins.memory.memory", "intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.detector.intent_detection", "intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.detector.query_explainer", "intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.parser.parser", "intel_extension_for_transformers.neural_chat.pipeline.plugins.retrieval.retriever_adapter", "intel_extension_for_transformers.neural_chat.pipeline.plugins.security.safety_checker", "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.bfm", "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.models.networks", "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util", "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.load_mats", "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.preprocess", "intel_extension_for_transformers.neural_chat.pipeline.plugins.video.face_animation.src.face3d.util.util", "intel_extension_for_transformers.neural_chat.server.restful.openai_protocol", "intel_extension_for_transformers.neural_chat.tools.rome.repr_tools", "intel_extension_for_transformers.neural_chat.tools.rome.utils.nethook", "intel_extension_for_transformers.neural_chat.tools.rome.utils.runningstats", "intel_extension_for_transformers.tools.utils", "intel_extension_for_transformers.transformers.benchmark", "intel_extension_for_transformers.transformers.config", "intel_extension_for_transformers.transformers.dynamic.drop_and_restore_utils", "intel_extension_for_transformers.transformers.dynamic.evolution", "intel_extension_for_transformers.transformers.dynamic", "intel_extension_for_transformers.transformers.kv_cache_compression.models.modeling_llama", "intel_extension_for_transformers.transformers.modeling.gpt_bigcode.modeling_gpt_bigcode", "intel_extension_for_transformers.transformers.modeling", "intel_extension_for_transformers.transformers.modeling.model", "intel_extension_for_transformers.transformers.modeling.modeling_bert_dynamic", "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.bart.modeling_bart", "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.llama.pos_shift_llama", "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mistral.modeling_mistral", "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.mixtral.modeling_mixtral", "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.phi.modeling_phi", "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.models.swin.modeling_swin", "intel_extension_for_transformers.transformers.modeling.modeling_gaudi.streaming_llm", "intel_extension_for_transformers.transformers.modeling.modeling_roberta_dynamic", "intel_extension_for_transformers.transformers.pipeline", "intel_extension_for_transformers.transformers.pruner", "intel_extension_for_transformers.transformers.pruner.pruning", "intel_extension_for_transformers.transformers.quantization", "intel_extension_for_transformers.transformers.runtime.compile.compile", "intel_extension_for_transformers.transformers.runtime.compile.extractors.extractor", "intel_extension_for_transformers.transformers.runtime.compile.extractors", "intel_extension_for_transformers.transformers.runtime.compile.extractors.onnx_extractor", "intel_extension_for_transformers.transformers.runtime.compile.extractors.tf_extractor", "intel_extension_for_transformers.transformers.runtime.compile.extractors.torch_extractor", "intel_extension_for_transformers.transformers.runtime.compile.graph.graph", "intel_extension_for_transformers.transformers.runtime.compile.graph", "intel_extension_for_transformers.transformers.runtime.compile.graph_utils", "intel_extension_for_transformers.transformers.runtime.compile", "intel_extension_for_transformers.transformers.runtime.compile.loaders", "intel_extension_for_transformers.transformers.runtime.compile.loaders.loader", "intel_extension_for_transformers.transformers.runtime.compile.logger", "intel_extension_for_transformers.transformers.runtime.compile.onnx_utils", "intel_extension_for_transformers.transformers.runtime.compile.ops.all", "intel_extension_for_transformers.transformers.runtime.compile.ops.assert", "intel_extension_for_transformers.transformers.runtime.compile.ops.baddbmm", "intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul", "intel_extension_for_transformers.transformers.runtime.compile.ops.batch_matmul_v2", "intel_extension_for_transformers.transformers.runtime.compile.ops.bias_add", "intel_extension_for_transformers.transformers.runtime.compile.ops.cast", "intel_extension_for_transformers.transformers.runtime.compile.ops.concat", "intel_extension_for_transformers.transformers.runtime.compile.ops.conv", "intel_extension_for_transformers.transformers.runtime.compile.ops.cos", "intel_extension_for_transformers.transformers.runtime.compile.ops.empty_ops", "intel_extension_for_transformers.transformers.runtime.compile.ops.expand_dims", "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_batch_matmul_v2", "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_batch_norm_v3", "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_gemm", "intel_extension_for_transformers.transformers.runtime.compile.ops.fused_matmul", "intel_extension_for_transformers.transformers.runtime.compile.ops.gather", "intel_extension_for_transformers.transformers.runtime.compile.ops.gather_elements", "intel_extension_for_transformers.transformers.runtime.compile.ops.gelu", "intel_extension_for_transformers.transformers.runtime.compile.ops.gemm", "intel_extension_for_transformers.transformers.runtime.compile.ops", "intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_get_next", "intel_extension_for_transformers.transformers.runtime.compile.ops.iterator_v2", "intel_extension_for_transformers.transformers.runtime.compile.ops.layer_normalization", "intel_extension_for_transformers.transformers.runtime.compile.ops.log_softmax", "intel_extension_for_transformers.transformers.runtime.compile.ops.map_and_batch_dataset", "intel_extension_for_transformers.transformers.runtime.compile.ops.matmul", "intel_extension_for_transformers.transformers.runtime.compile.ops.mean", "intel_extension_for_transformers.transformers.runtime.compile.ops.mkl_layer_norm", "intel_extension_for_transformers.transformers.runtime.compile.ops.model_dataset", "intel_extension_for_transformers.transformers.runtime.compile.ops.one_hot", "intel_extension_for_transformers.transformers.runtime.compile.ops.onnx_input", "intel_extension_for_transformers.transformers.runtime.compile.ops.op", "intel_extension_for_transformers.transformers.runtime.compile.ops.optimize_dataset", "intel_extension_for_transformers.transformers.runtime.compile.ops.pack", "intel_extension_for_transformers.transformers.runtime.compile.ops.padding_sequence", "intel_extension_for_transformers.transformers.runtime.compile.ops.placeholder", "intel_extension_for_transformers.transformers.runtime.compile.ops.pos_embed", "intel_extension_for_transformers.transformers.runtime.compile.ops.pow", "intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_linear", "intel_extension_for_transformers.transformers.runtime.compile.ops.quantize_v2", "intel_extension_for_transformers.transformers.runtime.compile.ops.quantized_fused_matmul_and_dequantize", "intel_extension_for_transformers.transformers.runtime.compile.ops.quantized_matmul_with_bias_and_dequantize", "intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_mean", "intel_extension_for_transformers.transformers.runtime.compile.ops.reduce_sum", "intel_extension_for_transformers.transformers.runtime.compile.ops.reorder", "intel_extension_for_transformers.transformers.runtime.compile.ops.reshape", "intel_extension_for_transformers.transformers.runtime.compile.ops.resize", "intel_extension_for_transformers.transformers.runtime.compile.ops.rsub", "intel_extension_for_transformers.transformers.runtime.compile.ops.scatter_elements", "intel_extension_for_transformers.transformers.runtime.compile.ops.shape", "intel_extension_for_transformers.transformers.runtime.compile.ops.sin", "intel_extension_for_transformers.transformers.runtime.compile.ops.size", "intel_extension_for_transformers.transformers.runtime.compile.ops.slice_position_ids", "intel_extension_for_transformers.transformers.runtime.compile.ops.softmax", "intel_extension_for_transformers.transformers.runtime.compile.ops.split", "intel_extension_for_transformers.transformers.runtime.compile.ops.squeeze", "intel_extension_for_transformers.transformers.runtime.compile.ops.strided_slice", "intel_extension_for_transformers.transformers.runtime.compile.ops.tensor", "intel_extension_for_transformers.transformers.runtime.compile.ops.top_k", "intel_extension_for_transformers.transformers.runtime.compile.ops.transpose", "intel_extension_for_transformers.transformers.runtime.compile.ops.unpack", "intel_extension_for_transformers.transformers.runtime.compile.ops.unsqueeze", "intel_extension_for_transformers.transformers.runtime.compile.ops.view", "intel_extension_for_transformers.transformers.runtime.compile.ops.where", "intel_extension_for_transformers.transformers.runtime.compile.optimizer", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.InnerproductReshapeFusion", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.add_cls_token", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.add_embeddings", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.arangewithreciprocal", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_AttentionMaskAddReshape", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_ConstantOfShapeWithMul", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_QKVPreReshape", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_QKVReshape", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attentionBlock_WeightReshapeTo4D", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_mask_length_adaptive_keep_indices", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_output_layer_norm_length_adaptive_keep_indices", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.attention_reshape", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.cast_to", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.collect_quant_info", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.conv_reshape", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.decoder_attn_reshape", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.einsumwitharange", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.embeddingbag", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.embeddings_to_2d_before_inner_product", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.gelu", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.generate_sequence", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithbiasgelu", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithslice", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.innerproductwithswish", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.input_data", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.input_file", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.insert_bf16_node", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.insert_quant_node", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.int8_bf16_mixed_precision_checker", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.interact_features", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.last_layer_shape", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm_with_reduce_mean", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.layer_norm_with_transpose", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_embeding", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_matmulwithtranspose", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_postprocess", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.llama_rotary_pos_emb", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.lower_all_tuples", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_add", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_gelu", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_relu", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_sigmoid", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_tanh", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_bias_unsqueeze", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_transpose", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.matmul_with_transpose_scale_add", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.merged_embeddingbag", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.neox_reorder_change", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.neox_rotary_pos_emb", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.operator_adaptor", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.output_data", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.padding_sequence", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.pattern", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.position_embeddings", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.position_embeddings_v1", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.qkv_merge", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.qkv_reshape", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quant_gather_to_bf16", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quantize_fusion", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.quantized_graph_dtype_refactor", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_constant_op", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_last_view", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_range", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_unused_operator", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.remove_zeros", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.removeslice", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_after_restore_hidden_states", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_before_and_after_attention_out_layer_norm_gather_elements", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_before_restore_hidden_states", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.reshape_fusion", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.restore_hidden_states_in_length_adaptive_update_indices", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.rms_norm", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.rotary_pos_emb", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.slicemask", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_ExplicitNHWCTranspose", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_ExplicitNHWCTransposeQAT", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_MHAReshape", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_QuantizeFusion", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_ReshapeFusion", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_bf16Convert", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_collectQDQInfo", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.stableDiffusion_insertQuantNode", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.start_end_logits", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.subgraph_matcher", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncdoer_word_embedding", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_AttentionMaskAddReshape", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_AttentionReshape", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_KVReshape", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_MulReshape", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_QReshape", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_SoftmaxReshape", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.textEncoder_causal_attention_mask", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.token_type_embeddings", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.token_type_embeddings_v1", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_embedding", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_ip_insert_bias", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torch_unpack_baddbmm", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torchinsertbf16node", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.torchpaddingsquence", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_AttentionMaskAddReshape", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_ConstantOfShapeWithMul", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_FFNSlice", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_FFNSlice_1", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_QKVPreReshape", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_QKVReshape", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_QKVReshape4D", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_encoderHiddenStatesReshape", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_getSampleBatch", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transformer2Dmodel_sampleSlice", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.transpose_batch_matmul", "intel_extension_for_transformers.transformers.runtime.compile.sub_graph.word_embeddings", "intel_extension_for_transformers.transformers.runtime.compile.tf_utils", "intel_extension_for_transformers.transformers.runtime.compile.torch_utils", "intel_extension_for_transformers.transformers.runtime", "intel_extension_for_transformers.transformers.trainer", "intel_extension_for_transformers.transformers.utils.config", "intel_extension_for_transformers.transformers.utils.get_throughput", "intel_extension_for_transformers.transformers.utils", "intel_extension_for_transformers.transformers.utils.metrics", "intel_extension_for_transformers.transformers.utils.objectives", "intel_extension_for_transformers.transformers.utils.utility", "main_eval_only", "main_parse_and_eval", "models.backbone", "models.detr", "models.detr_multi", "models.matcher", "models.position_encoding", "models.segmentation", "models.transformer", "text", "util.box_ops", "util.misc", "util.plot_utils", "util.postprocess", "utils.data_utils", "utils.eval_utils", "CI Introduction", "Documentation Overview and Installation", "OpenSSF Badge", "Intel\u00ae Extension for Transformers: Accelerating Transformer-based Models on Intel Platforms", "API", "Python APIs", "Compile", "Graph", "Engine API", "Class engine", "Class Kernel", "Class operator_desc", "Operator Specific Types", "Kernel APIs", "Config", "Model", "Trainer", "User-facing API", "Architecture of Intel\u00ae Extension for Transformers", "1. Accuracies $\\uparrow$ across 11 tasks(0-shot) of LLaMA and Mistral models at W4G-1.", "Benchmark", "Example", "Features", "Welcome to Intel\u00ae Extension for Transformers\u2019 documentation!", "Kernels", "Implementation Details", "Performance", "Neural Engine", "User Guide", "Contributor Covenant Code of Conduct", "Module Owner Matrix", "Contribution Guidelines", "<no title>", "Intel\u00ae Extension for Transformers", "Distillation", "Examples", "Export to ONNX", "Getting Started", "H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models", "Installation", "NeuralChat", "<no title>", "NeuralChat Command Line", "Intel Neural Chat Dockerfile", "Start NeuralChat and Code Generation Service with Docker", "Prerequisite\u200b", "Do chatbot inference with Docker", "Start NeuralChat Text Generation Service with Docker", "Start NeuralChat and TGI serving with Docker", "Start NeuralChat and vLLM serving with Docker", "Plugins", "NeuralChat Notebooks", "Building RESTful API Server", "QuickStart: Intel\u00ae Extension For Transformers*: NeuralChat on 4th Generation Intel\u00ae Xeon\u00ae Scalable Processors", "Setup Conda", "Setup Conda", "<no title>", "Setup Conda", "Setup Conda", "Setup Conda", "Setup Conda", "Setup Conda", "Setup Conda", "Setup Conda", "<no title>", "Setup Environment", "<no title>", "Introduction", "Introduction", "Introduction", "<no title>", "Setup Conda", "\ud83d\udcf8 Project Screenshots", "<no title>", "Setup Conda", "Setup Conda", "Deploy on Huggingface Space", "Direct Preference Optimization (DPO)", "How to train Intel/neural-chat-7b-v3-1 on Intel Gaudi2", "NeuralChat Fine-tuning", "NeuralChat Fine-tuning", "Multi-Modal", "Evaluation Guidelines", "Reinforcement Learning from Human Feedback (RLHF)", "Shanghainese ASR (Audio-Speech-Recognition) and TTS (Text-To-Speech) finetuning/inference", "GPT-J fine-tuning and inference", "Voice Cloning by finetuning a Text-To-Speech (TTS) model", "Installation", "Introduction", "Introduction", "Extract Tables From PDF File", "Face Animation", "Build Your Chatbot with Intel\u00ae Extension for Transformers neural-chat", "Build RAG (retriveval augment generation) example with Intel\u00ae Extension for Transformers neural-chat on Intel GPU", "Introduction", "Serving NeuralChat Text Generation with Triton Inference Server", "Serving NeuralChat Text Generation with Triton Inference Server (CUDA)", "Serving NeuralChat Text Generation with Triton Inference Server on HPU", "vllm serving for NeuralChat", "Setup Environment", "Install System Dependency", "\ud83d\ude80 What is caching plugin?", "\ud83c\udfe0Introduction", "Introduction", "Introduction", "Face Animation", "NeuralChat Server Command Line", "Finetune Embedding Model on Task-Specific Datasets", "Prerequisite\u200b", "\ud83d\udd21 TextBot", "\ud83d\udcf8 Project Screenshots", "<no title>", "\ud83d\udcf8 Project Screenshots", "\ud83d\udcf8 Project Screenshots", "Deploy on Huggingface Space", "Deploy on Huggingface Space", "LLM Carbon Calculator", "Installation", "Add Customized Pattern", "Deploy and Integration", "Profiling", "Engine Tuning", "Graph Fusion", "Compile an ONNX model to Engine IR", "Quantize a ONNX model to engine low precision/int8 IR", "Customized Operators Register", "Pattern Recognize", "Static Compressed Buffer", "Neural Engine Support Matrix", "Transformers-Accelerated Libraries", "3D Inference", "Binary Injectors", "Element-wise Injector", "Introduction", "Sparse GEMM AMX", "Sparse GEMM AVX512F", "Dynamic Quant Matmul", "Sparse GEMM with Layer-Normalize", "Transposed MatMul", "Transposed MHA", "Sparse GEMM VNNI", "Performance and Profiling", "Validated Performance Data", "How to visualize weights distribution of sparse model", "Benchmark for Kernels", "Inputs format", "Legal Information", "Metrics", "Objective", "Pipeline", "Pruning", "Full Publications/Events (51)", "QBits", "QLoRA on CPU", "Quantization", "Release", "Validated Model Performance", "Efficient LLM Inference on CPUs", "Step-by-Step", "Smooth Quant", "Streaming LLM", "Tutorials", "User Guide", "Weight Only Quantization (WOQ)", "Example", "Features", "Welcome to Intel\u00ae Extension for Transformers\u2019 documentation!", "Kernels", "Implementation Details", "Performance", "Neural Engine", "User Guide"], "titleterms": {"": [377, 400, 401], "0": 288, "04": 308, "1": [288, 302, 314, 315, 346, 347, 348, 349, 352, 353, 354, 361, 362, 376, 377, 388, 389, 393, 394, 412, 420, 425, 427], "11": 288, "15": 420, "2": [288, 302, 314, 315, 346, 348, 349, 352, 353, 354, 361, 362, 376, 377, 388, 393, 394, 412, 427], "20": 308, "2021": 420, "2022": 420, "2023": 420, "2024": 420, "20b": 425, "22": 308, "3": [288, 302, 314, 315, 346, 348, 349, 352, 353, 354, 361, 376, 377, 388, 412, 427], "34": 420, "3b": 425, "3d": 399, "4": [288, 302, 314, 346, 352, 376, 377, 388, 411], "4th": 322, "5": [352, 376, 420], "51": 420, "6": 376, "6b": 425, "7b": [347, 349, 425], "8": 308, "A": 316, "AND": 432, "For": [305, 322, 349, 413, 432], "On": [314, 315, 409], "To": [302, 353, 355, 393], "acceler": [272, 306, 358, 398, 402], "accept": 300, "access": [309, 321, 375], "accuraci": [288, 393, 423, 426, 427], "acknowledg": [353, 359, 360, 374], "across": 288, "activ": [322, 409], "adapt": [304, 306], "add": [349, 387, 394], "add_cls_token": 130, "add_embed": 131, "addit": 321, "advanc": 319, "after": [377, 387], "ai": 378, "algorithm": 432, "all": 63, "alpha": 401, "amp": 319, "amx": 403, "an": [303, 392, 419, 423], "analysi": 412, "anim": [360, 374], "api": [273, 274, 277, 282, 286, 289, 305, 309, 321, 389, 398, 422], "applic": [345, 383, 384], "approach": 423, "arangewithreciproc": 132, "arc": 349, "architectur": [287, 346, 388], "argument": [348, 349], "askdoc": 338, "asr": [353, 369], "assert": 64, "assisted_gen": 323, "attent": 413, "attention_mask_length_adaptive_keep_indic": 138, "attention_output_layer_norm_length_adaptive_keep_indic": 139, "attention_reshap": 140, "attentionblock_attentionmaskaddreshap": 133, "attentionblock_constantofshapewithmul": 134, "attentionblock_qkvprereshap": 135, "attentionblock_qkvreshap": 136, "attentionblock_weightreshapeto4d": 137, "attribut": [243, 298, 387], "audio": [336, 340, 353], "augment": 362, "automat": [319, 369], "autoround": 432, "avx512f": 404, "aw": 349, "awar": 423, "backbon": 255, "backend": [305, 366, 388, 418], "baddbmm": 65, "badg": 271, "bare": [348, 349, 386], "baremet": 322, "bart": 37, "base": [272, 348, 425], "baselin": 426, "batch_matmul": 66, "batch_matmul_v2": 67, "befor": [377, 389], "beforehand": 407, "below": 412, "benchmark": [27, 289, 393, 413], "best": 390, "beta": 401, "between": [330, 332], "bf16": [305, 371], "bfm": 16, "bias_add": 68, "binari": [386, 388, 400], "bot": 378, "box_op": 263, "brief": 403, "buffer": 396, "build": [314, 315, 321, 327, 328, 329, 348, 349, 356, 358, 361, 362, 372, 388, 398, 413], "c": 389, "cach": [319, 370, 399], "calcul": [385, 408], "call": [336, 337], "can": [370, 389], "candid": 409, "carbon": 385, "card": [349, 365], "cast": 69, "cast_to": 141, "cento": 308, "chain": 395, "chat": [311, 312, 316, 347, 361, 362, 375, 422], "chatbot": [4, 315, 319, 323, 326, 327, 328, 329, 330, 331, 332, 356, 358, 361, 372], "check": [345, 365, 383, 384], "checker": 319, "checklist": 300, "child_parent_retriev": 2, "childparentretriev": 372, "chroma": [3, 372], "ci": [269, 300], "citat": 415, "class": [0, 2, 5, 9, 14, 22, 24, 25, 28, 30, 32, 33, 35, 36, 37, 40, 44, 47, 50, 52, 53, 54, 55, 57, 60, 61, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 76, 77, 78, 79, 80, 81, 82, 84, 85, 86, 87, 88, 89, 90, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 246, 247, 250, 251, 255, 256, 257, 258, 259, 260, 264, 278, 279, 280, 416], "client": [309, 361, 364, 365, 366, 375], "clone": [314, 315, 355], "co": 72, "code": [298, 300, 313, 319, 323, 326, 327, 328, 329, 330, 331, 332, 358], "codegen": [326, 327, 328, 329, 330, 331, 332], "codellama": 349, "collect_quant_info": 142, "command": [311, 361, 362, 375, 412], "compat": [309, 321, 340, 425], "compil": [49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 275, 337, 392], "complet": 358, "compress": [302, 396], "comput": 406, "concat": 70, "conda": [302, 322, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 357, 363, 368, 383, 384], "conduct": [298, 300], "config": [5, 28, 247, 283, 358, 372, 387, 390], "config_log": 6, "configur": [313, 316, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 338, 340, 343, 344, 345, 363, 383, 384, 397, 411], "connect": [361, 391], "constrain": 421, "construct": [376, 391], "consum": [313, 316, 317, 318, 363], "contain": [314, 315, 366], "content": [0, 1, 2, 4, 5, 6, 9, 14, 15, 17, 20, 21, 22, 23, 24, 25, 27, 28, 29, 30, 32, 33, 35, 36, 37, 39, 40, 41, 42, 44, 45, 47, 49, 50, 52, 53, 54, 55, 57, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 76, 77, 78, 79, 80, 81, 82, 84, 85, 86, 87, 88, 89, 90, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, 246, 247, 250, 251, 252, 255, 256, 257, 258, 259, 260, 262, 263, 264, 265, 266, 267, 268, 434, 439], "contribut": [270, 300], "contributor": [298, 300], "conv": 71, "conv_reshap": 143, "convers": 0, "coven": [298, 300], "cpp": [327, 328, 329, 394], "cpu": [346, 361, 422, 426, 432], "creat": [303, 314, 315, 322, 334, 345, 354, 366, 383, 384, 391, 419, 423], "criteria": 300, "criterion": 303, "csv": 389, "cuda": [352, 365, 432], "curl": [309, 321], "custom": [309, 340, 349, 387, 388, 394], "data": [350, 353, 355, 370, 376, 404, 411], "data_util": 267, "databas": 334, "dataset": [302, 314, 346, 348, 349, 352, 354, 376, 393], "decoder_attn_reshap": 144, "demo": 353, "dens": [304, 402], "depend": [323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 357, 361, 362, 363, 368, 369, 371, 383, 384, 432], "deploi": [309, 345, 360, 367, 383, 384, 386, 388], "deploy": 304, "descript": [405, 406, 408], "design": 393, "detail": [294, 405, 408, 437], "detector": [11, 12], "detr": 256, "detr_multi": 257, "develop": [400, 401, 413], "devic": 432, "dict": 391, "differ": 405, "diffus": [337, 425], "direct": [346, 347, 406], "dispatch": 390, "distil": [303, 304, 306], "distillationconfig": 303, "distribut": [349, 412], "dl1": [314, 349], "do": [315, 353], "docker": [313, 314, 315, 316, 317, 318, 319, 322, 348, 349, 366], "dockerfil": 312, "document": [270, 292, 313, 316, 435], "doe": 370, "dolli": 425, "download": [338, 353, 354, 363], "dpo": [346, 347], "drop_and_restore_util": 29, "duplic": 395, "dynam": [29, 30, 31, 405, 423], "dynamic_qu": 413, "dynamic_quant_matmul": 413, "each": 395, "earli": 304, "edit": 377, "effici": [307, 402, 426], "eiffel": 377, "einsumwitharang": 145, "electra": 425, "element": 401, "eltwiseop": 413, "embed": 376, "embeddingbag": 146, "embeddings_to_2d_before_inner_product": 147, "empty_op": 73, "endpoint": 340, "enforc": 298, "engin": [277, 278, 296, 304, 306, 386, 388, 390, 392, 393, 397, 439], "engine_profil": 389, "english": 369, "environ": [302, 308, 313, 314, 315, 316, 317, 318, 322, 334, 336, 337, 338, 346, 347, 348, 349, 352, 353, 354, 357, 359, 360, 361, 362, 363, 367, 368, 374, 377, 393, 426], "errorcod": 7, "establish": 391, "eval_util": 268, "evalu": [346, 349, 350, 351, 376], "event": [272, 420], "evolut": 30, "exampl": [289, 290, 304, 305, 316, 362, 376, 385, 389, 392, 413, 417, 418, 421, 422, 428, 429, 432, 433], "except": 24, "executor": [305, 394, 418], "exist": [348, 349], "exit": 304, "expand_dim": 74, "expect": 302, "export": 305, "extens": [272, 287, 292, 302, 304, 308, 309, 322, 327, 328, 329, 357, 361, 362, 368, 372, 435], "extract": 359, "extractor": [50, 51, 52, 53, 54], "face": [286, 360, 374], "face3d": [16, 17, 18, 19, 20, 21], "face_anim": [16, 17, 18, 19, 20, 21], "falcon": [349, 425], "faq": [269, 300], "featur": [291, 400, 401, 423, 434], "feedback": 352, "file": [313, 316, 351, 359], "fine": [314, 319, 347, 348, 349, 352, 354], "finetun": [314, 348, 349, 353, 355, 376, 425], "flan": 349, "fly": 409, "folder": 351, "format": [392, 404, 414], "fp32": [305, 371, 426, 427], "framework": [363, 400, 401, 428], "from": [302, 308, 314, 315, 322, 348, 349, 352, 359, 386], "frontend": [345, 383, 384], "full": 420, "function": [0, 1, 4, 6, 15, 17, 20, 21, 23, 24, 25, 27, 28, 29, 30, 32, 36, 37, 39, 40, 41, 42, 44, 45, 49, 57, 61, 62, 95, 184, 243, 244, 245, 252, 260, 262, 263, 264, 265, 266, 267, 268, 270], "fundament": 423, "fuse": 387, "fused_batch_matmul_v2": 75, "fused_batch_norm_v3": 76, "fused_gemm": 77, "fused_matmul": 78, "fusion": [387, 391], "gather": 79, "gather_el": 80, "gaudi": [314, 315], "gaudi2": [347, 350], "gaudi_spawn": 1, "gelu": [81, 148], "gemm": [82, 403, 404, 406, 409], "gener": [300, 307, 313, 316, 319, 322, 323, 326, 327, 328, 329, 330, 331, 332, 362, 364, 365, 366, 388], "generate_sequ": 149, "get": [289, 302, 306, 309, 322, 354, 367, 389, 393, 416, 423], "get_throughput": 248, "ggml": 425, "git": 348, "gpt": [354, 425], "gpt_bigcod": 33, "gpu": [314, 315, 318, 346, 349, 361, 362, 432], "graph": [55, 56, 276, 388, 390, 391], "graph_util": 57, "guid": [297, 431, 440], "guidelin": [300, 351], "h": 394, "h2o": 307, "habana": [314, 315, 346, 349, 352], "hard": 376, "hardwar": [302, 308], "heavi": 307, "help": [311, 370, 375], "hf": 349, "hitter": 307, "hostfil": [330, 332], "how": [302, 347, 370, 390, 396, 412], "hpp": [400, 401], "hpu": 366, "hub": [314, 315], "huggingfac": [345, 383, 384], "human": 352, "i": [361, 365, 370], "imag": [314, 315, 316, 334, 348, 349, 366], "image2imag": [9, 337], "implement": [294, 437], "import": [356, 358, 372], "inbound": 349, "includ": 394, "infer": [270, 302, 307, 315, 319, 353, 354, 355, 364, 365, 366, 371, 388, 399, 418, 425, 426, 427], "inform": [391, 415], "initi": 370, "injector": [400, 401], "innerproductreshapefus": 129, "innerproductwithbiasgelu": 151, "innerproductwithslic": 152, "innerproductwithswish": 153, "input": [340, 414], "input_data": 154, "input_fil": 155, "insert": 391, "insert_bf16_nod": 156, "insert_quant_nod": 157, "instal": [270, 302, 308, 309, 322, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 353, 354, 356, 357, 361, 362, 363, 368, 369, 370, 371, 383, 384, 386, 393, 398], "instanc": [303, 349, 356, 419, 423], "instruct": [349, 350, 404], "instructpix2pix_pipelin": 9, "int4": [371, 426, 427], "int8": [305, 371, 393, 418], "int8_bf16_mixed_precision_check": 158, "integr": 388, "intel": [272, 287, 292, 302, 304, 308, 312, 322, 327, 328, 329, 347, 349, 357, 358, 361, 362, 368, 432, 435], "intel_extension_for_transform": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, 246, 247, 248, 249, 250, 251, 252, 386], "intent_detect": 11, "interact": [356, 358, 372], "interact_featur": 159, "intermedi": 303, "introduct": [269, 289, 300, 303, 305, 307, 309, 336, 337, 338, 354, 357, 358, 363, 371, 372, 373, 376, 387, 389, 390, 391, 392, 395, 396, 398, 400, 401, 402, 403, 407, 412, 416, 417, 418, 419, 421, 422, 423, 428, 429, 432], "ipex": [289, 304], "ir": [392, 393], "isa": 403, "issu": 399, "iter": 389, "iterator_get_next": 84, "iterator_v2": 85, "itrex": [314, 315, 331, 332], "its": 393, "j": [354, 425], "jit": 405, "jit_binaryop_injector": 400, "jit_eltwise_injector": 401, "json": 389, "kei": [324, 404], "kernel": [279, 282, 293, 390, 398, 402, 405, 413, 436], "kingdom": 377, "knowledg": [303, 304, 377], "kv_cache_compress": 32, "langchain": [2, 3, 309, 372], "langchain_commun": [2, 3], "languag": [307, 369], "larg": 307, "last_layer_shap": 160, "launch": [309, 366], "layer": [303, 406], "layer_norm": [86, 161], "layer_norm_with_reduce_mean": 162, "layer_norm_with_transpos": 163, "layernorm": 406, "layernorm_ba": [406, 413], "layout": 399, "learn": [302, 352], "legal": [270, 415], "length": [304, 306], "level": 389, "librari": [309, 398], "licens": 415, "line": [311, 375], "list": [349, 350, 395], "llama": [38, 288, 349], "llama2": 349, "llama3": 432, "llama_embed": 164, "llama_matmulwithtranspos": 165, "llama_postprocess": 166, "llama_rotary_pos_emb": 167, "llava": 351, "llm": [319, 377, 385, 425, 426, 429], "load_mat": 19, "loader": [59, 60], "log_softmax": 87, "logger": 61, "loop": 404, "lora": 347, "low": 393, "lower_all_tupl": 168, "m7i": 349, "main": 395, "main_eval_onli": 253, "main_parse_and_ev": 254, "mandarian": 353, "manual": 388, "map": [387, 391], "map_and_batch_dataset": 88, "matcher": 258, "matmul": [89, 405, 406, 407], "matmul_avx512f_p2031_p2013": [407, 413], "matmul_noperm_p2031_p1302": 407, "matmul_p2031_2013": 407, "matmul_vnni_noperm_p2013_p1302": 407, "matmul_vnni_noperm_p2031_p1302": 413, "matmul_with_bia": 169, "matmul_with_bias_add": 170, "matmul_with_bias_gelu": 171, "matmul_with_bias_relu": 172, "matmul_with_bias_sigmoid": 173, "matmul_with_bias_tanh": 174, "matmul_with_bias_unsqueez": 175, "matmul_with_transpos": 176, "matmul_with_transpose_scale_add": 177, "matrix": [299, 305, 397, 398, 405, 417, 423, 428], "mean": [90, 401], "mechan": 390, "memori": [10, 399], "merg": 347, "merged_embeddingbag": 178, "meta": 349, "metal": [348, 349, 386], "metric": [250, 303, 349, 416, 419], "mha": [408, 413], "microsoft": 348, "mine": 376, "minist": 377, "misc": 264, "mistral": [39, 288, 349], "mix": 319, "mixtral": 40, "mkl_layer_norm": 91, "mmmu": 350, "modal": 350, "mode": [361, 362, 372, 425], "model": [16, 17, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 255, 256, 257, 258, 259, 260, 261, 272, 284, 288, 289, 302, 305, 307, 309, 334, 337, 338, 348, 349, 350, 352, 353, 355, 359, 360, 363, 376, 377, 388, 389, 392, 393, 412, 418, 425, 428], "model_dataset": 92, "modeling_bart": 37, "modeling_bert_dynam": 36, "modeling_gaudi": [37, 38, 39, 40, 41, 42, 43], "modeling_gpt_bigcod": 33, "modeling_llama": 32, "modeling_mistr": 39, "modeling_mixtr": 40, "modeling_phi": 41, "modeling_roberta_dynam": 44, "modeling_swin": 42, "modifi": [330, 332], "modul": [0, 1, 2, 4, 5, 6, 9, 14, 15, 17, 20, 21, 22, 23, 24, 25, 27, 28, 29, 30, 32, 33, 35, 36, 37, 39, 40, 41, 42, 44, 45, 47, 49, 50, 52, 53, 54, 55, 57, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 76, 77, 78, 79, 80, 81, 82, 84, 85, 86, 87, 88, 89, 90, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 246, 247, 250, 251, 252, 255, 256, 257, 258, 259, 260, 263, 264, 265, 266, 267, 268, 299, 356, 358, 372], "more": [302, 390, 396, 402], "mpt": [346, 349, 425], "mtl": 432, "multi": [314, 330, 332, 349, 350, 365, 369, 411], "multimod": [309, 319], "mysql": 334, "naiv": 402, "necessari": 391, "neg": 376, "neox": 425, "neox_reorder_chang": 179, "neox_rotary_pos_emb": 180, "nethook": 24, "network": 17, "neural": [296, 304, 306, 312, 347, 361, 362, 386, 388, 397, 422, 439], "neural_chat": [4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25], "neuralchat": [309, 311, 313, 316, 317, 318, 320, 322, 331, 332, 348, 349, 363, 364, 365, 366, 367, 375], "new": [345, 383, 384, 387, 391], "next": 302, "node": [314, 330, 332, 349, 354, 387, 391], "normal": 406, "note": 424, "notebook": 320, "numactl": [323, 324, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 357, 363, 368], "numanod": 332, "nvidia": [314, 315, 318], "object": [251, 417, 423], "obtain": 391, "offici": 321, "ok": 361, "old": 391, "one": [349, 405], "one_hot": 93, "onli": [319, 351, 389, 432], "onnx": [305, 388, 392, 393], "onnx_extractor": 52, "onnx_input": 94, "onnx_util": 62, "op": [63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 387, 390], "openai": [309, 321, 324, 340], "openai_protocol": 22, "openssf": 271, "oper": [281, 389, 394], "operator_adaptor": 181, "operator_desc": [280, 400, 401], "opt": 425, "optim": [128, 270, 319, 346, 347], "optimize_dataset": 96, "option": [303, 315, 348, 349, 365, 390, 396, 423], "oracl": 307, "orchestr": 304, "other": 270, "our": 298, "output": [289, 302, 340, 351, 377], "output_data": 182, "overview": [270, 302], "owner": 299, "pack": 97, "packag": [245, 262, 354, 432], "padding_sequ": [98, 183], "param_typ": [400, 401], "paramet": [371, 372], "pars": [351, 395], "parser": 13, "part": 389, "path": [314, 315, 405], "pattern": [184, 387, 390, 391, 395, 403, 404, 409], "pdf": 359, "per": 402, "perform": [295, 358, 397, 398, 410, 411, 425, 426, 438], "perspect": [400, 401], "phi": 41, "photo": 378, "photoai": 334, "pip": 386, "pipelin": [8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 45, 340, 418], "placehold": 99, "platform": [272, 361, 397, 411], "pleas": [314, 315], "pledg": 298, "plot_util": 265, "plugin": [9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 309, 319, 336, 337, 370, 371], "polici": 271, "port": 349, "pos_emb": 100, "pos_shift_llama": 38, "position_embed": 185, "position_embeddings_v1": 186, "position_encod": 259, "post": 423, "postprocess": 266, "pow": 101, "pre": [353, 406], "precis": [319, 393], "prefer": [346, 347, 352], "prefetch": 402, "prepar": [302, 313, 314, 316, 322, 337, 346, 347, 348, 349, 350, 352, 353, 354, 355, 359, 360, 364, 365, 366, 367, 374, 392, 393, 412, 426, 432], "preprocess": [20, 405], "prerequisit": [302, 308, 314, 348, 349, 361, 362, 377, 386, 393, 405, 427], "pretrain": 350, "prime": 377, "print": 351, "problem": [405, 406, 407, 408], "processor": 322, "profil": [389, 410], "project": [341, 378, 379, 381, 382], "prune": [47, 304, 306, 419], "pruner": [46, 47], "public": [272, 420], "pull": [300, 314, 315, 348, 349], "pypi": [302, 308], "python": [274, 309, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 357, 363, 368, 383, 384, 389, 422], "pytorch": [289, 303, 304, 421, 425], "q": 316, "qbit": 421, "qdrant": 372, "qkv_merg": 187, "qkv_reshap": 188, "qlora": 422, "quant": [405, 428], "quant_gather_to_bf16": 189, "quantiz": [48, 304, 306, 319, 393, 423, 425, 427, 432], "quantizationconfig": 423, "quantize_fus": 190, "quantize_linear": 102, "quantize_v2": 103, "quantized_fused_matmul_and_dequant": 104, "quantized_graph_dtype_refactor": 191, "quantized_matmul_with_bias_and_dequant": 105, "query_explain": 12, "quick": [340, 365], "quickstart": 322, "rag": [319, 362, 372], "ratio": 389, "recogn": 395, "recognit": [353, 369], "recommend": 302, "reduce_mean": 106, "reduce_sum": 107, "refer": [304, 346, 352, 398, 432], "regard": 377, "regist": [387, 394], "reinforc": 352, "relat": [348, 349, 353, 390], "releas": 424, "remov": [391, 395], "remove_constant_op": 192, "remove_last_view": 193, "remove_rang": 194, "remove_unused_oper": 195, "remove_zero": 196, "removeslic": 197, "reorder": [108, 403, 407, 408, 409], "repo": [314, 315], "report": 271, "repositori": 354, "repr_tool": 23, "represent": 395, "request": [300, 309, 361, 364, 365], "requir": [302, 308, 309, 353, 376], "reshap": 109, "reshape_after_restore_hidden_st": 198, "reshape_before_and_after_attention_out_layer_norm_gather_el": 199, "reshape_before_restore_hidden_st": 200, "reshape_fus": 201, "resiz": 110, "respons": 298, "rest": [22, 309, 321], "restore_hidden_states_in_length_adaptive_update_indic": 202, "result": [351, 367, 377, 395, 412], "retriev": [2, 11, 12, 13, 14, 309, 358, 362, 370, 372, 375], "retriever_adapt": 14, "retrivev": 362, "reward": 352, "rich": 309, "rlhf": 352, "rm": 352, "rms_norm": 203, "rome": [23, 24, 25], "rotary_pos_emb": 204, "rsub": 111, "rule": 349, "run": [302, 315, 322, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 351, 358, 359, 360, 361, 362, 363, 366, 383, 384, 388, 389, 393, 412, 426, 427], "runningstat": 25, "runtim": [49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, 319, 390, 425], "safeti": 319, "safety_check": 15, "same": 349, "sampl": 377, "scalabl": 322, "scale": 401, "scatter_el": 112, "scope": 298, "scratch": [348, 349], "screenshot": [341, 378, 379, 381, 382], "script": [303, 322, 359, 360, 364, 365, 366, 419, 423], "sde": 410, "sdk": 321, "search": 395, "section": [292, 435], "secur": [15, 271], "segment": 260, "select": 272, "send": [364, 365], "sentenc": 377, "serv": [317, 318, 364, 365, 366, 367], "server": [22, 321, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 349, 360, 361, 363, 364, 365, 366, 375, 383, 384], "servic": [309, 313, 316, 317, 318, 336, 337, 361, 363, 375], "session": 349, "set": [322, 358, 361, 372, 387, 389], "setup": [313, 315, 316, 317, 318, 323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 337, 338, 340, 343, 344, 345, 357, 361, 362, 363, 368, 383, 384, 412], "sft": [347, 352], "shanghaines": 353, "shape": 113, "shot": 288, "side": 361, "sidebysid": 378, "simpl": [313, 314, 315, 316], "simpli": 360, "sin": 114, "singl": [314, 332, 349, 354, 411], "size": [115, 405], "slice_position_id": 116, "slicemask": 205, "smooth": 428, "softmax": [117, 413], "softwar": [308, 354], "sourc": [302, 308, 322], "space": [345, 383, 384], "spars": [304, 389, 402, 403, 404, 406, 409, 412], "sparse_matmul": [398, 413], "specif": [281, 376], "speech": [353, 355, 369], "splice": 395, "split": 118, "spmm": 406, "spmm_amx_bf16_x16": 413, "spmm_avx512f": 413, "spmm_vnni": [399, 413], "spr": [313, 314, 315, 317, 346, 349, 358], "squeez": 119, "src": [16, 17, 18, 19, 20, 21, 394], "ssh": [330, 332, 349], "stabl": [337, 386, 425], "stablediffusion_bf16convert": 211, "stablediffusion_collectqdqinfo": 212, "stablediffusion_explicitnhwctranspos": 206, "stablediffusion_explicitnhwctransposeqat": 207, "stablediffusion_insertquantnod": 213, "stablediffusion_mhareshap": 208, "stablediffusion_quantizefus": 209, "stablediffusion_reshapefus": 210, "stage": 405, "standard": 298, "starcod": [349, 425], "start": [289, 302, 306, 309, 313, 316, 317, 318, 322, 350, 354, 361, 364, 365, 375, 416, 423], "start_end_logit": 214, "statement": 407, "static": [396, 413, 423], "step": [302, 426, 427], "stock": [289, 304], "store": [309, 372], "straight": 395, "stream": 429, "streaming_llm": 43, "strided_slic": 120, "structur": 351, "sub": 395, "sub_graph": [129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242], "subgraph_match": 215, "submodul": [18, 31, 34, 46, 51, 56, 58, 59, 83, 150, 249], "subpackag": [58, 245], "summari": [302, 316, 416, 426], "supervis": [347, 352], "support": [300, 302, 305, 309, 392, 397, 398, 416, 417, 423, 428, 432], "swin": 42, "system": [308, 309, 369, 426], "t5": 349, "tabl": [334, 359], "talk": 378, "task": [288, 376], "templat": 300, "tensor": 121, "tensorflow": 304, "test": [313, 314, 315, 316, 324, 340, 356, 357, 360, 361, 368, 398], "text": [262, 311, 316, 353, 355, 364, 365, 366, 369, 375], "textbot": [343, 344, 367, 378], "textchat": [324, 343, 344], "textencdoer_word_embed": 216, "textencoder_attentionmaskaddreshap": 217, "textencoder_attentionreshap": 218, "textencoder_causal_attention_mask": 223, "textencoder_kvreshap": 219, "textencoder_mulreshap": 220, "textencoder_qreshap": 221, "textencoder_softmaxreshap": 222, "tf": 388, "tf_extractor": 53, "tf_util": 243, "tgi": [317, 363], "thi": [314, 315, 370], "thread": [402, 411], "through": 388, "tile": 402, "token_type_embed": 224, "token_type_embeddings_v1": 225, "tool": [23, 24, 25, 26, 327, 328, 329], "top_k": 122, "topic": 319, "torch_embed": 226, "torch_extractor": 54, "torch_ip_insert_bia": 227, "torch_unpack_baddbmm": 228, "torch_util": 244, "torchinsertbf16nod": 229, "torchpaddingsqu": 230, "total": 389, "tower": 377, "trademark": 415, "train": [346, 347, 349, 350, 352, 376, 423], "trainer": [246, 285, 303, 419, 423], "transform": [27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, 246, 247, 248, 249, 250, 251, 252, 261, 272, 287, 292, 302, 304, 306, 308, 322, 327, 328, 329, 357, 361, 362, 368, 398, 435], "transformer2dmodel_attentionmaskaddreshap": 231, "transformer2dmodel_constantofshapewithmul": 232, "transformer2dmodel_encoderhiddenstatesreshap": 238, "transformer2dmodel_ffnslic": 233, "transformer2dmodel_ffnslice_1": 234, "transformer2dmodel_getsamplebatch": 239, "transformer2dmodel_qkvprereshap": 235, "transformer2dmodel_qkvreshap": 236, "transformer2dmodel_qkvreshape4d": 237, "transformer2dmodel_sampleslic": 240, "translat": 353, "transpos": [123, 407, 408], "transpose_batch_matmul": 241, "transpose_matmul": 413, "triton": [364, 365, 366], "tt": [353, 355, 369], "tune": [314, 319, 347, 348, 349, 350, 352, 354, 390, 393, 423], "turn": [390, 396], "tutori": 430, "two": 405, "type": [281, 387], "ubuntu": 308, "ui": 361, "unit": 377, "unpack": 124, "unsqueez": 125, "up": [322, 361, 365], "uparrow": 288, "us": [309, 314, 315, 321, 332, 364, 365, 388, 405], "usag": [303, 305, 307, 356, 358, 359, 360, 361, 362, 369, 370, 371, 372, 373, 374, 377, 385, 400, 401, 413, 419], "user": [286, 297, 398, 400, 401, 431, 440], "util": [18, 19, 20, 21, 24, 25, 26, 247, 248, 249, 250, 251, 252, 263, 264, 265, 266, 267, 268], "v2": 425, "v3": 347, "valid": [302, 308, 349, 350, 411, 425, 428], "variabl": 334, "vector": [309, 372], "vectorstor": 3, "vectorstoreretriev": 372, "verbos": 410, "verifi": [361, 376], "version": [386, 421], "video": [16, 17, 18, 19, 20, 21], "view": 126, "visual": [327, 328, 329, 350, 412], "vit": 353, "vllm": [318, 367], "vnni": 409, "voic": [311, 355, 375], "voicebot": 340, "voicechat": 340, "vtune": 410, "vulner": 271, "w2g128": 288, "w3g128": 288, "w4g": 288, "w4g128": 288, "web": 361, "weight": [319, 347, 388, 405, 412, 432], "weightpruningconfig": 419, "welcom": [292, 435], "what": 370, "where": 127, "whether": 365, "wise": 401, "woq": 432, "word_embed": 242, "work": [302, 370, 402], "workflow": 354, "xeon": [313, 314, 315, 317, 322, 349, 358], "yaml": [323, 324, 326, 327, 328, 329, 330, 331, 332, 334, 336, 338, 340, 343, 344, 363, 388], "you": 389, "your": [345, 361, 383, 384]}}) \ No newline at end of file diff --git a/latest/user_guide.html b/latest/user_guide.html index 660f92bd075..7a3857d7aad 100644 --- a/latest/user_guide.html +++ b/latest/user_guide.html @@ -4,7 +4,7 @@ - User Guide — Intel® Extension for Transformers 0.1.dev1+g408e5f1 documentation + User Guide — Intel® Extension for Transformers 0.1.dev1+ge6cbde1 documentation @@ -124,7 +124,7 @@

      User GuideSphinx using a theme provided by Read the Docs. - +