Conversation

@ArshdeepSekhon
Collaborator

No description provided.

@jinyongyoo
Collaborator

jinyongyoo commented Oct 23, 2020

I'm also wondering if it's possible to achieve the same behavior using the --num-examples argument, since that is always processed together with a dataset. Having both --num-examples and --test-on-full-dataset seems a bit redundant to me. Instead of --num-examples always being an int, we could allow passing a string like all (or -1, as a Pythonic way to index the last element) to specify that we want to attack all samples. Then, in the parse_dataset_from_args function in textattack/commands/attack/attack_args_helpers.py, we can set args.num_examples=len(dataset). That way, we support not only the eval command but also attack, and we don't have to worry about keeping track of two arguments.
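A minimal sketch of that idea with argparse (the `num_examples_type` helper and the `None` sentinel here are hypothetical illustrations, not TextAttack's actual code):

```python
import argparse

def num_examples_type(value):
    """Parse --num-examples, treating 'all' or -1 as 'use the whole dataset'.

    Returns None as an internal sentinel for the full-dataset case.
    """
    if value == "all":
        return None
    n = int(value)
    if n == -1:
        return None
    if n <= 0:
        raise argparse.ArgumentTypeError(
            "--num-examples must be a positive int, -1, or 'all'"
        )
    return n

parser = argparse.ArgumentParser()
parser.add_argument("--num-examples", type=num_examples_type)
args = parser.parse_args(["--num-examples", "all"])

# Later, once the dataset is loaded (e.g. inside parse_dataset_from_args),
# resolve the sentinel to the actual dataset length:
dataset = ["<example>"] * 25000  # stand-in for a real dataset split
if args.num_examples is None:
    args.num_examples = len(dataset)
```

Resolving the sentinel only after the dataset is loaded keeps the parsing step independent of dataset size, which is why a single argument can cover both commands.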

@jxmorris12
Collaborator

> I'm also wondering if it's just possible to achieve the same behavior using --num-examples argument since that is always processed together with a dataset. Having both --num-examples and --test-on-full-dataset seems a bit redundant to me. Instead of --num-examples always being a int, we can maybe allow passing a string like all (or -1 as a Pythonic way to index last element) to specify that we want to attack all samples. Then, in the parse_dataset_from_args function in textattack/commands/attack/attack_args_helpers.py, we can set args.num_examples=len(dataset). That way, we support not only eval command, but also attack and we don't have to worry about keeping track of two arguments.

I see what you mean Jeffrey! @ArshdeepSekhon can you add one more thing: a check to make sure both arguments aren't set? If someone calls `textattack eval --num-examples=50 --test-on-full-dataset`, we should throw an error because we don't know which one to honor.
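A sketch of such a check, assuming a hypothetical `validate_eval_args` helper and plain attribute access on the parsed args (not TextAttack's actual validation code):

```python
from types import SimpleNamespace

def validate_eval_args(args):
    """Reject the ambiguous combination of both flags (hypothetical helper)."""
    if getattr(args, "test_on_full_dataset", False) and args.num_examples is not None:
        raise ValueError(
            "Pass either --num-examples or --test-on-full-dataset, not both: "
            "it is ambiguous how many examples to evaluate."
        )

# e.g. `textattack eval --num-examples=50 --test-on-full-dataset`
args = SimpleNamespace(num_examples=50, test_on_full_dataset=True)
try:
    validate_eval_args(args)
except ValueError as err:
    print(f"error: {err}")
```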

Btw @jinyongyoo I think if we switched to click or another command-line package we'd have better support for this type of thing!

qiyanjun and others added 23 commits October 23, 2020 21:18
small typo in the Update 0_End_to_End.ipynb (3 epochs not 5)
black conf.py
(one manually selected, one autodoc-generated)
add sphinx autodoc generated rest
major docstring clean up / plus reorganize the folder structure under docs
a major shift to rst files generated by sphinx-apidoc
@qiyanjun
Member

@ArshdeepSekhon I will close this since you merged similar changes into #324

@qiyanjun qiyanjun closed this Nov 25, 2020
@Opdoop
Contributor

Opdoop commented Nov 27, 2020

In the end, how do we eval on the entire dataset?
I tried the three ways mentioned above with textattack 0.2.14:

  • --num-examples -1
  • --num-examples all
  • --test-on-full-dataset

Sadly, none of the above works.

@qiyanjun
Member

@Opdoop we put this on hold due to final exams.. will update in a week

@qiyanjun qiyanjun reopened this Nov 27, 2020
@ArshdeepSekhon
Collaborator Author

Added `-1` for attack and eval on the entire dataset: if the user specifies --num-examples as -1, args.num_examples is set to len(dataset).
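The behavior described above can be sketched as follows (the `resolve_num_examples` helper name is illustrative, not the actual function name in the PR):

```python
def resolve_num_examples(num_examples, dataset):
    """Map the sentinel -1 to the full dataset length; pass other values through."""
    return len(dataset) if num_examples == -1 else num_examples

dataset = ["<example>"] * 25000  # stand-in for the real test split
print(resolve_num_examples(-1, dataset))   # full dataset: 25000
print(resolve_num_examples(100, dataset))  # explicit count: 100
```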

@qiyanjun
Member

@Opdoop please try now!

@qiyanjun qiyanjun merged commit efe3ac7 into QData:master Nov 28, 2020
@Opdoop
Contributor

Opdoop commented Nov 28, 2020

@qiyanjun Success with --num-examples -1
log:

textattack eval --model lstm-imdb --num-examples -1
textattack: train_args.json not found in model path models/classification/lstm/imdb. Defaulting to 2 labels.
textattack: Loading pre-trained TextAttack LSTM: lstm-imdb
Reusing dataset imdb (/home/nano/.cache/huggingface/datasets/imdb/plain_text/1.0.0/90099cb476936b753383ba2ae6ab2eae419b2e87f71cd5189cb9c8e5814d12a3)
textattack: Loading datasets dataset imdb, split test.
textattack: Got 25000 predictions.
textattack: Successes 20535/25000 (82.14%)

One thing I want to confirm: Successes here means the classification result on the normal test/val set, so the accuracy of the available lstm-imdb model is 82.14%. Is that correct?
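For reference, the reported percentage in the log is just the success count divided by the total number of predictions:

```python
# Numbers taken from the eval log above.
successes, total = 20535, 25000
accuracy = successes / total
print(f"Successes {successes}/{total} ({accuracy:.2%})")  # 82.14%
```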

@qiyanjun
Member

qiyanjun commented Nov 28, 2020 via email

@jinyongyoo
Collaborator

Is it possible to rebase this on the master branch and clean up the commits? Seems like we have way too many commits (169!) coming from master.

@jinyongyoo
Collaborator

Nvm didn't see that this PR was already merged.

@qiyanjun
Member

@jinyongyoo I wondered that too. @jinyongyoo @ArshdeepSekhon is it because this branch is way behind master?

@qiyanjun
Member

@jinyongyoo I merged it because the requested functions are in and all tests pass. Should we revert?

@Opdoop
Contributor

Opdoop commented Nov 28, 2020

@qiyanjun Em... I have seen this doc before. Successes and Accuracy look confusing to me, so I just want to confirm that these two mean the same thing.

@ArshdeepSekhon
Collaborator Author

@qiyanjun Yes, this branch was behind; I rebased it onto master.

@qiyanjun
Member

qiyanjun commented Nov 28, 2020 via email
