Skip to content

Releases: ipa-lab/hackingBuddyGPT

v0.5.0

27 Aug 13:09
d0ff901

Choose a tag to compare

v0.5.0 Pre-release
Pre-release

Big Changes

  • big update for @DianaStrauss's web api testing work
  • new tmux-based local command execution capability

What's Changed

New Contributors

Full Changelog: v0.4.0...v0.5.0

v0.4.0

24 Apr 19:56
7a79d22

Choose a tag to compare

v0.4.0 Pre-release
Pre-release

Overview

Good news everyone! There's a new (and long overdue) new version of hackingBuddyGPT out!

The big changes for this release are @Neverbolt's rework of the logging/configuration system and @Qsan1's new prototype that enables small LLMs to perform linux priv-esc attacks. The next use-case is already being prepared and will be in the next release.. stay tuned.

To summarize the big changes:

  • @Neverbolt did extensive work on the configuration and logging system:

    • Overwork of the configuration system
    • Added a visual and live web based log viewer, which can be started with wintermute Viewer
    • Updated the configuration system. The new configuration system now allows loading parameters from a .json file as well as choosing which logging backend should be used
  • @lloydchang with @pardaz-banu, @halifrieri, @toluwalopeoolagbegi and @tushcmd added support for dev containers

  • @jamfish added support for key-based SSH access (to the target system)

  • @Qsan1 added a new use-case, focusing on enabling linux priv-esc with small-language models, to quote:

    • Added an extended linux-privesc usecase. It is based on 'privesc', but extends it with multiple components that can be freely switch on or off:
      - Analyze: After each iteration the LLM is asked to analyze the output of that round.
      - Retrieval Augmented Generation (RAG): After each iteration the LLM is prompted and asked to generate a search query for a vector store. The search query is then used to retrieve relevant documents from the vector store and the information is included in the prompt for the Analyze component (Only works if Analyze is enabled).
      - Chain of thought (CoT): Instead of simply asking the LLM for the next command, we use CoT to generate the next action.
      - History Compression: Instead of including all commands and their respective output in the prompt, it removes all outputs except the most recent one.
      - Structure via Prompt: Include an initial set of command recommendations in query_next_command

I thank all our contributors (and hopefully haven't forgotten too many). Enjoy!

What's Changed

New Contributors

Full Changelog: v0.3.1...v0.4.0

v0.3.0

29 Aug 12:27
550a517

Choose a tag to compare

v0.3.0 Pre-release
Pre-release

HackingBuddyGPT 0.3.0

European Summer'24 Heatwave edition

Version 0.3 contains a massive refactoring and rewrite of our code-base (by @Neverbolt and @andreashappe), laying the groundwork for future features. Meanwhile, @DianaStrauss was improving the Web API testing agent.

Also see our latest hacking benchmark results for multiple models.

Notable user-visible changes:

  • GitHub Models are now supported as LLM backend
  • the hackingBuddyGPT CLI is installed (alias to the wintermute CLI) to provide consistency
  • massively improved web api testing agent:
    • new response analyzer to create a report from the testing findings
    • prompt engineering: improved prompt creation, now categorized into task and state planning prompts
    • Streamlined the OpenAPI documentation generation (reconnaissance) process
  • restructured introductory usecases and agents
    • moved them into src/hackingbuddygpt/usecases/examples
    • their names (used by the CLI) start with Ex
  • bump minimal python version to python 3.10

Notable developer-visible changes:

  • allow for streaming responses from LLMs
  • The class hierarchy has been refactored. UseCases showcase the different hacking behaviors provided hackingBuddyGPT. They offer developers flexibility how they implement their hacking techniques. To streamline development, we introduce the Agent base-class. Agents perform hacking in steps/rounds. Developers can automatically wrap an Agent within a usecases to integrate new agents with minimal development overhead into hackingBuddyGPT. For more information, see our documentation.
  • massively increased test coverage

What's Changed

Full Changelog: v0.2.1...v0.3.0

v0.2.1

27 Aug 13:32
c4fc650

Choose a tag to compare

v0.2.1 Pre-release
Pre-release

What's Changed

New Contributors

Full Changelog: v0.1.0...v0.2.1

v0.1.0-fse23ivr

09 Aug 10:53
c54a608

Choose a tag to compare

v0.1.0-fse23ivr Pre-release
Pre-release
  • this was the code submitted to FSE23 IVR

Full Changelog: https://github.com/ipa-lab/hackingBuddyGPT/commits/v0.1.0