Differences between revisions 564 and 566 (spanning 2 versions)
Size: 5740
Comment:
|
Size: 6757
Comment:
|
Deletions are marked like this. | Additions are marked like this. |
Line 9: | Line 9: |
||<style="border:0;padding-left:30px;padding-bottom:5px">[[http://zil.ipipan.waw.pl/seminarium-online|{{attachment:seminarium-archiwum/teams.png}}]] '''Trurl.ai Fine-tuning large language models on multilingual instruction datasets'''  {{attachment:seminarium-archiwum/icon-pl.gif|Talk delivered in Polish.}}|| | ||<style="border:0;padding-left:30px;padding-bottom:5px">[[http://zil.ipipan.waw.pl/seminarium-online|{{attachment:seminarium-archiwum/teams.png}}]] '''TRURL.AI: Fine-tuning large language models on multilingual instruction datasets'''  {{attachment:seminarium-archiwum/icon-pl.gif|Talk delivered in Polish.}}|| |
Line 14: | Line 14: |
||<style="border:0;padding-left:30px;padding-bottom:5px">'''Evaluation of information retrieval models in zero-shot settings on different documents domains'''  {{attachment:seminarium-archiwum/icon-pl.gif|Talk delivered in Polish.}}|| ||<style="border:0;padding-left:30px;padding-bottom:15px">The summary will be available soon.|| |
||<style="border:0;padding-left:30px;padding-bottom:5px">[[http://zil.ipipan.waw.pl/seminarium-online|{{attachment:seminarium-archiwum/teams.png}}]] '''Evaluation of information retrieval models in zero-shot settings on different documents domains'''  {{attachment:seminarium-archiwum/icon-en.gif|Talk delivered in English.}}|| ||<style="border:0;padding-left:30px;padding-bottom:15px">Information Retrieval over large collections of documents is an extremely important research direction in the field of natural language processing. It is a key component in question-answering systems, where the answering model often relies on information contained in a database with up-to-date knowledge. This not only allows for updating the knowledge upon which the system responds to user queries but also limits its hallucinations. Currently, information retrieval models are neural networks and require significant training resources. For many years, lexical matching methods like BM25 outperformed trained neural models in Open Domain setting, but current architectures and extensive datasets allow surpassing lexical solutions. In the presentation, I will introduce available datasets for the evaluation and training of modern information retrieval architectures in document collections from various domains, as well as future development directions.|| |
Natural Language Processing Seminar 2023–2024
The NLP Seminar is organised by the Linguistic Engineering Group at the Institute of Computer Science, Polish Academy of Sciences (ICS PAS). It takes place on (some) Mondays, usually at 10:15 am, often online – please use the link next to the presentation title. All recorded talks are available on YouTube. |
30 October 2023 |
Agnieszka Faleńska (University of Stuttgart) |
Steps towards Bias-Aware NLP Systems |
The summary will be available soon. |
13 November 2023 |
Piotr Rybak (Institute of Computer Science, Polish Academy of Sciences) |
Advancing Polish Question Answering: Datasets and Models |
The summary will be available soon. |
Please see also the talks given in 2000–2015 and 2015–2023. |