The Bus Factor Problem in Integration Systems

Your integration system works great. Does anyone else know how

Sumit Shinde
Back to Blog Posts
Image representing null

The bus factor is a simple idea. If one person leaves your team, how much breaks? If the answer is "a lot," your bus factor is one. Most integration systems have a bus factor of one.

It's not because teams are sloppy. Integration work just tends to accumulate in corners. Systems get built quickly to solve real problems, and the person who builds them naturally becomes the person who understands them. They know why the retry logic is set the way it is, why data gets reshaped at that particular step, why two systems connect the way they do. That knowledge rarely makes it into documentation because there's always something more urgent to work on.

Time passes. The system keeps running. Nobody questions it. Then that person moves to another team, takes a two-week vacation, or just gets sick at the wrong moment, and suddenly a routine maintenance task turns into a two-day investigation.

The problem isn't unique to integration systems, but it hits them harder. Integration logic lives between other systems, it's rarely the focus of code reviews, and it's usually the last thing anyone thinks to document. Knowledge concentrates fast and spreads slowly.

The good news is it's a solvable problem. The first step is knowing you have it. Before you read further, take two minutes to assess where your team stands today.

Now you know where you stand. The rest of this article explains how integration systems end up here, why documentation alone never fixes it, and what it actually takes to change it.

Why Integration Systems Are Especially Vulnerable

Most engineering work leaves a trail. Pull requests, code reviews, commit messages, test cases. Software teams have spent decades building habits and tooling that make their work legible to others. Integration work tends to skip all of that.

A flow gets built by one person to solve a specific problem: connect a PLC to a database, route sensor data to a dashboard, translate one protocol into another. It works, it gets deployed, and then it runs quietly in the background while everyone moves on. The person who built it moves on too, to the next urgent problem, the next system, the next deadline.

"Illustration of an integration flow acting as a black box between PLCs, databases, and dashboards, highlighting that knowledge about the connections is concentrated in a single person."

The people doing this work aren't cutting corners. They're IT engineers, automation specialists, OT technicians, and process engineers working under real time pressure. Their job is to get systems talking, not to produce documentation for a future colleague they've never met. In most organizations, nobody has ever defined what good engineering practice looks like for integration work, so the default is to ship and move on.

There's a structural problem compounding this. Integration flows don't belong cleanly to any one team. They're not part of the application codebase. They're not clearly IT's responsibility or OT's. They live in their own runtime, often on edge devices or gateways distributed across a facility, and they connect systems that different teams own. Nobody has obvious ownership, so nobody feels obvious accountability for making the knowledge transferable.

The result is a system that works fine as long as the right person is available. The moment they're not, you find out just how much was living only in their head.

What Happens When the Knowledge Walks Out the Door

The immediate impact is visible: something breaks and nobody knows how to fix it. But the slower damage is harder to measure.

Teams start treating their own integration flows as black boxes. They're afraid to touch them because they don't understand them well enough to know what will break. New team members spend weeks or months just trying to figure out what exists and what it does. Simple changes take longer than they should because every modification requires archaeology first.

Over time, organizations start building new flows around existing ones instead of modifying them, because modifying them feels risky. The system gets more complex. More people build more flows that only they understand. The bus factor gets worse, not better.

In manufacturing environments, this isn't just a productivity problem. Integration flows connect real machines, real production lines, and real safety systems. A flow that nobody understands is a flow that nobody can safely change, debug, or improve. That's a real operational risk, and it compounds with scale.

The Documentation Problem

The obvious answer is: write better documentation. And yes, that's true. But saying "just document it" has never actually solved the problem, and it's worth understanding why.

Documentation is a separate task. That's the core issue. When an engineer finishes building a flow, the work feels done. The flow works. Sitting down to document it means context-switching into a different mode of thinking, opening a different tool, and writing for an audience that doesn't exist yet. Most of the time, that just doesn't happen.

Even when documentation does exist, it goes stale. A flow gets modified. The documentation doesn't. Six months later, the documentation describes something that no longer exists, which is sometimes worse than no documentation at all.

The teams that actually maintain good documentation tend to do it because it's built into how they work, not because they decided to be more disciplined. The process makes it easy and the tooling keeps it close to the work.

How FlowFuse Approaches the Problem

FlowFuse is an industrial data platform built on Node-RED. It's designed for teams who need to connect machines, systems, and data across IT and OT environments at scale, with the governance and reliability that production environments require.

What makes it relevant to the bus factor problem is where FlowFuse has chosen to put the intelligence.

FlowFuse Expert is an AI assistant built directly into the Node-RED editor inside FlowFuse. It is not a generic AI. It is specifically trained on FlowFuse's own documentation, blog posts, and resources, which means it understands the platform, its nodes, and the industrial context it operates in. When you ask it a question, you are not talking to a general-purpose chatbot - you are talking to something that knows what a Function node is, what an MQTT broker does, and what your specific flow is actually doing.

That last part matters. FlowFuse Expert can take your flow as context. Select a flow, add it to the conversation, and ask plain-language questions about it - what does this flow do, why is this node outputting a number instead of a string, where could this logic fail. It can also take your palette as context, so it knows which nodes you actually have installed and can give advice based on what is available to you rather than what exists in theory.

For a team inheriting flows they didn't build, this changes the equation significantly. Instead of spending days reverse-engineering what a flow does, a new team member can get a working explanation in seconds and ask follow-up questions in plain language.

The video above shows just one example - asking questions about a selected flow. FlowFuse Expert goes much further than that. You can query your live operational data through MCP tools in Insights mode, get inline code completions as you type inside Function nodes, generate CSS and HTML for FlowFuse Dashboard templates, build Function nodes from plain-language descriptions, and get intelligent suggestions for which node to add next. The more you use it, the less the work depends on any single person remembering how something was built.

Beyond the chat interface, FlowFuse Expert also brings AI directly into Node-RED through Node-RED Embedded AI. This is AI that works where you already are - inside node editors and on the canvas - without needing to open a separate panel. It can write Function node code from a plain-language description, generate CSS and HTML for FlowFuse Dashboard templates, suggest which node to add next in a flow, and produce realistic test data to validate logic. It handles the JavaScript, the regex, the CSS, the JSON - so engineers can focus on the domain knowledge that actually requires a human.

Reducing the Bus Factor in Practice

FlowFuse Expert is one piece of a broader approach FlowFuse takes to making integration knowledge more durable.

The platform includes version control for flows, so changes are tracked and teams can roll back when something goes wrong without needing to call the person who made the change. It supports team collaboration on a single Node-RED instance, which means flows don't have to be the work of one person in isolation. DevOps pipelines let teams move flows through development, staging, and production environments with proper governance rather than deploying directly to production from someone's laptop.

Put together, these features don't just reduce the bus factor. They change the conditions that create it. When flows are version-controlled, when multiple people work on them together, when an AI can explain them on demand, the concentration of knowledge in one person's head becomes much harder to sustain accidentally.

The bus factor never fully goes away. Key people will always carry important knowledge. But there's a meaningful difference between a team where one person is the only one who understands something, and a team where one person understands it best but the system can explain itself well enough that others can follow, contribute, and take over.

The goal isn't to make everyone interchangeable. It's to make sure that when someone is out, the work can continue.

Closing Thought

Integration systems are the connective tissue of industrial operations. When they're well understood and well documented, they're an asset. When they're opaque and person-dependent, they're a liability that just hasn't failed yet.

The bus factor problem doesn't go away on its own. Knowledge concentrates by default, distributing it requires the right conditions, the right process, and tooling that makes the good behavior easier than the bad. That's exactly what FlowFuse is built to do.

If you're ready to see how it works in practice, book a demo and we'll show you how teams are using FlowFuse to make their integration systems something anyone on the team can understand, maintain, and build on.

About the Author

Sumit Shinde

Technical Writer

Sumit is a Technical Writer at FlowFuse who helps engineers adopt Node-RED for industrial automation projects. He has authored over 100 articles covering industrial protocols (OPC UA, MQTT, Modbus), Unified Namespace architectures, and practical manufacturing solutions. Through his writing, he makes complex industrial concepts accessible, helping teams connect legacy equipment, build real-time dashboards, and implement Industry 4.0 strategies.