Releases · arthur-ai/bench

16 Feb 15:00

mcgrawia

0.3.1

eae4a6e

0.3.1 Latest

What's Changed

updated score_result field by @vivirose in #105
Add async scoring methods by @cheungrowan in #109
add async methods of llm based scorers by @cheungrowan in #110

New Contributors

@mcgrawia made their first contribution in #111

Full Changelog: 0.3.0...0.3.1

Contributors

mcgrawia, vivirose, and cheungrowan

20 Dec 21:22

cheungrowan

0.3.0

ce9ad52

0.3.0

What's Changed

BENCH-324-styling-fixes by @cheungrowan in #97
BENCH-316 added scorer info to details modal by @cheungrowan in #98
Parameters for hedging language are now editable by @Mymoza in #100
LLM-818 - Add run id filter to summary endpoint by @edinpeter in #102
BENCH-148 added support for categorical charts by @vivirose in #101
patch num test cases fetch by @cheungrowan in #104
compare test runs by @vivirose in #103
add sort param to get test run by id by @cheungrowan in #106
remove duckdb from local client by @cheungrowan in #107

Full Changelog: 0.2.3...0.3.0

Contributors

edinpeter, Mymoza, and 2 other contributors

11 Oct 18:39

cheungrowan

0.2.3

2d42358

0.2.3

What's Changed

Fix UI breaks when page is refreshed on non-root route by @fryz in #88
BENCH-315 styling fixes by @vivirose in #89
BENCH-305 smooth line chart by @cheungrowan in #90

Full Changelog: 0.2.2...0.2.3

Contributors

fryz, vivirose, and cheungrowan

29 Sep 22:19

cheungrowan

0.2.2rc1

57715e5

0.2.2

What's Changed

add discord link by @cheungrowan in #66
Added footer and help icon by @vivirose in #65
fix broken discord link by @max-arthurai in #71
Allow summary quality to use custom LLM by @maccam912 in #73
make llm evaluator configurable in qa correctness scorer by @cheungrowan in #75
add environment variable for toggling remote vs local storage by @cheungrowan in #76
Added Project.urls to pyproject.toml by @packdl in #78
add arthur bench remote logging documentation by @cheungrowan in #79
Styling fixes by @vivirose in #72
bump version and docs by @cheungrowan in #80
updating Discord link by @hgupta18 in #82
resolve front end dependencies by @cheungrowan in #81
fix for scorer type inference on windows os by @cheungrowan in #85

Full Changelog: 0.2.1...0.2.2rc1

Contributors

maccam912, packdl, and 4 other contributors

17 Aug 16:24

cheungrowan

0.2.1

add8dce

0.2.1

What's Changed

add warn kwarg to python unit tester by @cheungrowan in #56
Docs typo fixes by @max-arthurai in #57
scoring headers properly formatted by @max-arthurai in #59
updating docs requirement by @hgupta18 in #60
add api references and copy buttons, edit theme by @cheungrowan in #58
update read the docs config by @cheungrowan in #63
increase patch version by @cheungrowan in #64

Full Changelog: 0.2.0...0.2.1

Contributors

hgupta18, max-arthurai, and cheungrowan

17 Aug 00:35

hgupta18

0.2.0

9875145

0.2.0

What's Changed

sync develop with main by @cheungrowan in #26
adding Reading Ease and Word Count metrics by @teresadatta100 in #16
Adding hedging language scoring method by @hgupta18 in #23
Delete LICENSE by @cheungrowan in #31
Hallucination scoring method by @max-arthurai in #19
Code cleanup by @cheungrowan in #32
Scoring method configurability by @cheungrowan in #28
Python unit testing scoring method by @max-arthurai in #21
Update UI by @cheungrowan in #34
updating gitignore by @hgupta18 in #36
adding hedging language demo by @hgupta18 in #38
Getting started ux by @cheungrowan in #35
scoring config logging by @cheungrowan in #37
UI patches and automation by @cheungrowan in #40
remove extra field discrepancy with REST API by @cheungrowan in #43
Check for existing run by @max-arthurai in #39
adding specificity metric by @teresadatta100 in #33
Add GitHub pipelines by @cheungrowan in #41
reduce scorer warnings by @cheungrowan in #45
Naming makeover by @cheungrowan in #46
Error handling for hallucination and Summary quality clean prompt by @max-arthurai in #44
Docs cleanup by @max-arthurai in #42
move exceptions to own module by @cheungrowan in #47
adding readthedocs by @hgupta18 in #49
updating links in readme by @hgupta18 in #50
updating links by @hgupta18 in #51
fix tooltip by @cheungrowan in #53
Lint by @cheungrowan in #52
enable pypi and bump version by @cheungrowan in #54
release 0.2.0 by @cheungrowan in #55

Full Changelog: 0.1.1...0.2.0

Contributors

hgupta18, max-arthurai, and 2 other contributors

09 Aug 15:16

cheungrowan

0.1.1

e7ec144

0.1.1

Fix a bug when using a directory that doesn't exist yet; the directory is now created.
Add links in documentation.

08 Aug 23:06

cheungrowan

0.1.0

0aa7ed0

0.1.0

full local UI build

