parmeet issues

Results 7 issues of


                                            parmeet

Regarding adding shuffling and sharding datapipes to in-built datasets

## 🚀 Feature **Motivation** * To avoid pitfall with shuffling and sharding of datapipes in distributed training environments * To ensure consistent experience of TorchData based datasets across domains. **Pitch**...

[Do not Merge] Testing signals for release branch 0.12

cla signed

ciflow/default

add projection layer to roberta encoder

User may want to additionally project the features from encoder. This PR add support for projecting features to different dimensional space.

cla signed

ciflow/default

[DO NOT MERGE] getting signals for release/0.10

cla signed

[WIP] Fixing google drive download issue

https://github.com/pytorch/text/issues/1359

cla signed

[WIP] using std::array in vocab for additional speed-ups

We replace container type of stoi_ from std::vector to std::array. This bring in slight additional improvements in look-up speed

cla signed

Convert iterator-style raw datasets to map-style raw datasets

## 🚀 Feature **Motivation** torchtext provide several open source nlp datasets in raw form. These datasets are provide as [Iterables](https://pytorch.org/docs/stable/data.html#iterable-style-datasets). Although there are times when user may prefer [map-style](https://pytorch.org/docs/stable/data.html#map-style-datasets) datasets....

datasets

feature request

need discussions

good first issue