Skip to Main Content

Emerging AI Tools for Literature Review: Comparison of LLMs

This guide consolidates the teaching materials for library workshop Emerging AI Tools for Literature Review.

This page covers:

Which LLM is more powerful?

Chatbot Arena LLM Leaderboard compares available LLMs based on their performance on user voting, grading model responses to challenging questions, and measuring multitask accuracy.

As of 19 Nov 2025, the Top 10 for text generation are:

  1. gemini-3-pro
  2. grok-4.1-thinking
  3. grok-4.1
  4. gemini-2.5-pro
  5. claude-sonnet-4-5-20250929-thinking-32k
     
  6. claude-opus-4-1-20250805-thinking-16k
  7. claude-sonnet-4-5-20250929
  8. gpt-4.5-preview-2025-02-27
  9. claude-opus-4-1-20250805
  10. chatgpt-4o-latest-20250326
LibGuide content by HKUST Library is licensed under CC BY-NC-SA 4.0, unless otherwise noted.