Wendy Mak issues

Results 12 issues of


                                            Wendy Mak

[MAINT] min_area filter in sol.vector.mask.mask_to_poly_geojson drops all polygons if crs is epsg:4326: would be nice to add crs check

## Maintenance request summary the `sol.vector.mask.mask_to_poly_geojson` currently has a default `min_area` of 40. While this is useful if you are only using the pixel version or you are using a...

Type: Maintenance

Status: Review Needed

are the pretrained weights for HRNet V1 or V2?

(the other repositories seem to have v2?).

Handling the target variable in data generation (question)

Hi, When I am trying to generate the synthetic data, do I need to treat the target column differently? Or would a correctly tuned generator take care of generating the...

Tweet text data parsing/cleaning for nlp

- Look through data available at https://data.world/data4democracy/far-right as data from the discursive project Some of the tasks we might do are: - Stem - Tokenize - Remove stop words -...

help wanted

status-in-progress

Construct word2vec model with tweets for groups of people (e.g. far right) and compare with models trained on the overall twitterverse (e.g. http://fredericgodin.com/papers/Named%20Entity%20Recognition%20for%20Twitter%20Microposts%20using%20Distributed%20Word%20Representations.pdf) Some things to try: clustering tweets with...

help wanted

status-in-progress

Wendy Mak

Fix full transformer timeout

[MAINT] min_area filter in sol.vector.mask.mask_to_poly_geojson drops all polygons if crs is epsg:4326: would be nice to add crs check

are the pretrained weights for HRNet V1 or V2?

Handling the target variable in data generation (question)

Tweet text data parsing/cleaning for nlp

Word2Vec models

code for dataset generation

Implement Colbert for Optimum

Support for colbert style late interaction models in rerank endpoint

Support for custom SentenceTransformer models