You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Currently, controlling each of these applications requires writing application-specific glue code. This approach does not scale and quickly becomes difficult to maintain.
What is needed is an abstraction layer that allows the capabilities, interfaces, and constraints of each target application to be defined declaratively, rather than by implementing custom logic for every application. The goal is to describe what an application can do and how it can be driven, without rewriting control code each time.
As a candidate for the declarative layer, Cucumber / Gherkin-style specifications are being considered. A behavior-driven, human-readable format could make it easier to:
Describe application actions, states, and preconditions
Map high-level LLM intents to concrete Windows automation steps
Lower the learning curve for users defining new integrations
WelsonJS already supports automation of certain Windows applications (e.g., Microsoft Office), so the technical feasibility is proven. However, from a user perspective, the current model still exposes too much complexity. Reducing this complexity is essential for wider adoption.
Supporting MCP (Model Context Protocol) or a similar standardized interface may also be a viable approach, especially if combined with a declarative specification layer such as Cucumber, to provide a common contract between LLMs and Windows-based software. #233
Expected outcome:
A declarative specification language for Windows application control
Reduced per-application custom automation code
Improved usability and extensibility for LLM ↔ Windows application integration
Evaluation of Cucumber/Gherkin and MCP as complementary or alternative solutions
Summary
There is a growing need to integrate LLMs with various Windows-based applications and APIs, including but not limited to:
hwp.exe(the de facto word processor in Korea)Currently, controlling each of these applications requires writing application-specific glue code. This approach does not scale and quickly becomes difficult to maintain.
What is needed is an abstraction layer that allows the capabilities, interfaces, and constraints of each target application to be defined declaratively, rather than by implementing custom logic for every application. The goal is to describe what an application can do and how it can be driven, without rewriting control code each time.
As a candidate for the declarative layer, Cucumber / Gherkin-style specifications are being considered. A behavior-driven, human-readable format could make it easier to:
WelsonJS already supports automation of certain Windows applications (e.g., Microsoft Office), so the technical feasibility is proven. However, from a user perspective, the current model still exposes too much complexity. Reducing this complexity is essential for wider adoption.
Supporting MCP (Model Context Protocol) or a similar standardized interface may also be a viable approach, especially if combined with a declarative specification layer such as Cucumber, to provide a common contract between LLMs and Windows-based software. #233
Expected outcome: