Releases: arthur-ai/bench
Releases · arthur-ai/bench
0.3.1
What's Changed
- updated score_result field by @vivirose in #105
- Add async scoring methods by @cheungrowan in #109
- add async methods of llm based scorers by @cheungrowan in #110
New Contributors
Full Changelog: 0.3.0...0.3.1
0.3.0
What's Changed
- BENCH-324-styling-fixes by @cheungrowan in #97
- BENCH-316 added scorer info to details modal by @cheungrowan in #98
- Parameters for hedging language are now editable by @Mymoza in #100
- LLM-818 - Add run id filter to summary endpoint by @edinpeter in #102
- BENCH-148 added support for categorical charts by @vivirose in #101
- patch num test cases fetch by @cheungrowan in #104
- compare test runs by @vivirose in #103
- add sort param to get test run by id by @cheungrowan in #106
- remove duckdb from local client by @cheungrowan in #107
Full Changelog: 0.2.3...0.3.0
0.2.3
What's Changed
- Fix UI breaks when page is refreshed on non-root route by @fryz in #88
- BENCH-315 styling fixes by @vivirose in #89
- BENCH-305 smooth line chart by @cheungrowan in #90
Full Changelog: 0.2.2...0.2.3
0.2.2
What's Changed
- add discord link by @cheungrowan in #66
- Added footer and help icon by @vivirose in #65
- fix broken discord link by @max-arthurai in #71
- Allow summary quality to use custom LLM by @maccam912 in #73
- make llm evaluator configurable in qa correctness scorer by @cheungrowan in #75
- add environment variable for toggling remote vs local storage by @cheungrowan in #76
- Added Project.urls to pyproject.toml by @packdl in #78
- add arthur bench remote logging documentation by @cheungrowan in #79
- Styling fixes by @vivirose in #72
- bump version and docs by @cheungrowan in #80
- updating Discord link by @hgupta18 in #82
- resolve front end dependencies by @cheungrowan in #81
- fix for scorer type inference on windows os by @cheungrowan in #85
Full Changelog: 0.2.1...0.2.2rc1
0.2.1
What's Changed
- add warn kwarg to python unit tester by @cheungrowan in #56
- Docs typo fixes by @max-arthurai in #57
- scoring headers properly formatted by @max-arthurai in #59
- updating docs requirement by @hgupta18 in #60
- add api references and copy buttons, edit theme by @cheungrowan in #58
- update read the docs config by @cheungrowan in #63
- increase patch version by @cheungrowan in #64
Full Changelog: 0.2.0...0.2.1
0.2.0
What's Changed
- sync develop with main by @cheungrowan in #26
- adding Reading Ease and Word Count metrics by @teresadatta100 in #16
- Adding hedging language scoring method by @hgupta18 in #23
- Delete LICENSE by @cheungrowan in #31
- Hallucination scoring method by @max-arthurai in #19
- Code cleanup by @cheungrowan in #32
- Scoring method configurability by @cheungrowan in #28
- Python unit testing scoring method by @max-arthurai in #21
- Update UI by @cheungrowan in #34
- updating gitignore by @hgupta18 in #36
- adding hedging language demo by @hgupta18 in #38
- Getting started ux by @cheungrowan in #35
- scoring config logging by @cheungrowan in #37
- UI patches and automation by @cheungrowan in #40
- remove extra field discrepancy with REST API by @cheungrowan in #43
- Check for existing run by @max-arthurai in #39
- adding specificity metric by @teresadatta100 in #33
- Add GitHub pipelines by @cheungrowan in #41
- reduce scorer warnings by @cheungrowan in #45
- Naming makeover by @cheungrowan in #46
- Error handling for hallucination and Summary quality clean prompt by @max-arthurai in #44
- Docs cleanup by @max-arthurai in #42
- move exceptions to own module by @cheungrowan in #47
- adding readthedocs by @hgupta18 in #49
- updating links in readme by @hgupta18 in #50
- updating links by @hgupta18 in #51
- fix tooltip by @cheungrowan in #53
- Lint by @cheungrowan in #52
- enable pypi and bump version by @cheungrowan in #54
- release 0.2.0 by @cheungrowan in #55
Full Changelog: 0.1.1...0.2.0
0.1.1
- fix bug when using a directory that doesn't exist yet. directory will now be created
- add links in documentation
0.1.0
- full local UI build