Skip to content

Abstraction layer for LLM with Windows-based applications #379

@gnh1201

Description

@gnh1201

Summary

There is a growing need to integrate LLMs with various Windows-based applications and APIs, including but not limited to:

  • Adobe Photoshop
  • Autodesk AutoCAD
  • hwp.exe (the de facto word processor in Korea)
  • Stock market APIs commonly used in Korea, typically exposed as Windows COM/DLL modules [lib/stock-kr] Korean Stock Market #113

Currently, controlling each of these applications requires writing application-specific glue code. This approach does not scale and quickly becomes difficult to maintain.

What is needed is an abstraction layer that allows the capabilities, interfaces, and constraints of each target application to be defined declaratively, rather than by implementing custom logic for every application. The goal is to describe what an application can do and how it can be driven, without rewriting control code each time.

As a candidate for the declarative layer, Cucumber / Gherkin-style specifications are being considered. A behavior-driven, human-readable format could make it easier to:

  • Describe application actions, states, and preconditions
  • Map high-level LLM intents to concrete Windows automation steps
  • Lower the learning curve for users defining new integrations

WelsonJS already supports automation of certain Windows applications (e.g., Microsoft Office), so the technical feasibility is proven. However, from a user perspective, the current model still exposes too much complexity. Reducing this complexity is essential for wider adoption.

Supporting MCP (Model Context Protocol) or a similar standardized interface may also be a viable approach, especially if combined with a declarative specification layer such as Cucumber, to provide a common contract between LLMs and Windows-based software. #233

Expected outcome:

  • A declarative specification language for Windows application control
  • Reduced per-application custom automation code
  • Improved usability and extensibility for LLM ↔ Windows application integration
  • Evaluation of Cucumber/Gherkin and MCP as complementary or alternative solutions

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions