SIOS SANless clusters

SIOS SANless clusters High-availability Machine Learning monitoring

  • Home
  • Products
    • SIOS DataKeeper for Windows
    • SIOS Protection Suite for Linux
  • News and Events
  • Clustering Simplified
  • Success Stories
  • Contact Us
  • English
  • 中文 (中国)
  • 中文 (台灣)
  • 한국어
  • Bahasa Indonesia
  • ไทย

Reliable Data Replication with SIOS DataKeeper: Why Communication (and Ports) Matter

July 22, 2025 by Jason Aw Leave a Comment

Reliable Data Replication with SIOS DataKeeper Why Communication (and Ports) Matter

Reliable Data Replication with SIOS DataKeeper: Why Communication (and Ports) Matter

In nearly every aspect of IT, communication is key, and when it comes to data replication, it’s critical. For DataKeeper, ensuring that data stays synchronized between nodes in a high availability cluster starts with ensuring the systems can talk to each other over the network.

Whether you’re replicating data across regions or data centers, your first task is to enable secure, reliable communication between all participating nodes. At the heart of this communication lies the TCP/IP protocol. DataKeeper uses a predefined set of TCP ports to establish and maintain replication.

What Are TCP Ports, and Why Are They Important for Data Replication?

A TCP port is a numeric identifier that serves as an endpoint used by network protocols to route traffic to specific applications running on a system. Think of them as addresses for an apartment building. You can send a message to the apartment building, but it probably won’t make it out of the lobby. With the address, you can ensure your message gets to the resident. If the desired address (port) is blocked, the data never gets to where it needs to go.

In the context of DataKeeper, these ports serve as the designated pathways through which nodes exchange critical replication data. Without open and correctly routed ports, the nodes won’t be able to communicate, causing replication to fail or stall.

Which Ports Does SIOS DataKeeper Use?

To establish replication and maintain communication between nodes, DataKeeper requires the following TCP ports to be open:

  • 137, 138, 139, 445 – These are Windows networking ports used for file and printer sharing (NetBIOS and SMB).
  • 9999 – This is the default port used by the DataKeeper service for control and status updates.
  • 10000–10025 – These ports are used for the actual replication traffic. Each port in this range corresponds to a drive letter:

10000 = Volume A

…

10025 = Volume Z

If you’re replicating volume F, for example, you’ll need to ensure that port 10005 is open between nodes.

What to Check When Data Replication Isn’t Working

If replication isn’t starting or is repeatedly disconnecting, consider the following:

  1. Firewall Configuration
    1. Check that Windows Firewall is not blocking any required ports. You can create an inbound rule to allow traffic on the needed ports:
      1. Open Windows Defender Firewall with Advanced Security
      2. Go to Inbound Rules > New Rule
      3. Choose Port, select TCP, and specify:

137, 138, 139, 445, 9999, 10000-10025

  1. Allow the connection and apply the rule to all profiles (Domain, Private, Public).
  1. Network Security Groups / Cloud Firewalls

If your nodes are hosted in cloud environments like AWS, Azure, or GCP, make sure the security groups or NSGs also allow the above ports between the relevant IP addresses.

  1. Ping and Connectivity Tests
    1. Use ping or Test-NetConnection in PowerShell to verify network reachability.
    2. Use telnet or Test-NetConnection -Port to check if specific ports are open.

Best Practices for a Smooth SIOS DataKeeper Deployment

Beyond enabling TCP traffic, there are a couple of other networking best practices that can improve your DataKeeper experience. To ensure reliable replication with DataKeeper, start by verifying that all nodes can resolve each other’s hostnames consistently. This can either be through DNS or static entries in the hosts file. Name resolution issues can be a common source of silent failures and should be addressed early. Additionally, think about configuring a dedicated network interface for replication traffic whenever possible. Separating replication from production traffic not only improves performance and reduces latency but can also enhance security and reliability by isolating data transfer from user and application activity.

Ensure Port Connectivity for Reliable SIOS DataKeeper ReplicationIn Summary

For DataKeeper to perform reliably, network communication must be unrestricted across the defined set of TCP ports. Understanding and configuring these ports, especially the volume-specific replication ports, is essential for avoiding downtime and ensuring your high-availability setup delivers on its promise.

Taking a few minutes to audit your firewall rules and confirm connectivity can save you hours of troubleshooting when replication suddenly stalls. As with all things in IT, clear communication, both between people and between systems, makes all the difference.

Want to take the next step? Consider how high-availability strategies, such as clustering, can support safer, disruption-free patching in your environment. Request a demo today to see how SIOS can help you protect critical workloads, minimize downtime, and ensure seamless patching.

Author: Tristan Allen, Associate Customer Experience Software Engineer at SIOS Technology Corp.

Reproduced with permission from SIOS

Filed Under: Clustering Simplified Tagged With: data replication

How does Data Replication between Nodes Work?

June 19, 2022 by Jason Aw Leave a Comment

How does Data Replication between Nodes Work?

How does Data Replication between Nodes Work?

In the traditional datacenter scenario, data is commonly stored on a storage area network (SAN). The cloud environment doesn’t typically support shared storage.

SIOS DataKeeper presents ‘shared’ storage using replication technology to create a copy of the currently active data. It creates a NetRAID device that works as a RAID1 device (data mirrored across devices).

Data changes are replicated from the Mirror Source (disk device on the active node – Node A in the diagram below) to the Mirror Target (disk device on the standby node – Node B in the diagram below).

In order to guarantee consistency of data across both devices, only the active node has write access to the replicated device (/datakeeper mount point in the example below). Access to the replicated device (the /datakeeper mount point) is not allowed while it is a Mirror Target (i.e., on the standby node).

Reproduced with permission from SIOS

Filed Under: Clustering Simplified Tagged With: data replication

Data Replication

December 13, 2021 by Jason Aw Leave a Comment

Data Replication

 

 

Data Replication

Real-Time Data Replication for High Availability

What is Data Replication

Data replication is the process by which data residing on a physical/virtual server(s) or cloud instance (primary instance) is continuously replicated or copied to a secondary server(s) or cloud instance (standby instance). Organizations replicate data to support high availability, backup, and/or disaster recovery.  Depending on the location of the secondary instance, data is either synchronously or asynchronously replicated. How the data is replicated impacts Recovery Time Objectives (RTOs) and Recovery Point Objectives (RPO).

For example, if you need to recover from a system failure, your standby instance should be on your local area network (LAN). For critical database applications, you can then replicate data synchronously from the primary instance across the LAN to the secondary instance. This makes your standby instance “hot” and in sync with your active instance, so it is ready to take over immediately in the event of a failure. This is referred to as high availability (HA).

In the event of a disaster, you want to be sure that your secondary instance is not co-located with your primary instance. This means you want your secondary instance in a geographic site away from the primary instance or in a cloud instance connected via a WAN. To avoid negatively impacting throughput performance, data replication on a WAN is asynchronous. This means that updates to standby instances will lag updates made to the active instance, resulting in a delay during the recovery process.

Why Replicate Data to the Cloud?

There are five reasons why you want to replicate your data to the cloud.

  1. As we discussed above, cloud replication keeps your data offsite and away from the company’s site. While a major disaster, such as a fire, flood, storm, etc., can devastate your primary instance, your secondary instance is safe in the cloud and can be used to recover the data and applications impacted by the disaster.
  2. Cloud replication is less expensive than replicating data to your own data center. You can eliminate the costs associated with maintaining a secondary data center, including the hardware, maintenance, and support costs.
  3. For smaller businesses, replicating data to the cloud can be more secure especially if you do not have security expertise on staff. Both the physical and network security provided by cloud providers is unmatched.
  4. Replicating data to the cloud provides on-demand scalability. As your business grows or contracts, you do not need to invest in additional hardware to support your secondary instance or have that hardware sit idle if business slows down. You also have no long-term contracts.
  5. When replicating data to the cloud, you have many geographic choices, including having a cloud instance in the next city, across the country, or in another country as your business dictates.

Why Replicate Data Between Cloud Instances?

While cloud providers take every precaution to ensure 100 percent up-time, it is possible for individual cloud servers to fail as a result of physical damage to the hardware and software glitches – all the same reasons why on-premises hardware would fail. For this reason, organizations that run their mission-critical applications in the cloud should replicate their cloud data to support high availability and disaster recovery. You can replicate data between availability zones in a single region, between regions in the cloud, between different cloud platforms, to on-premise systems, or any hybrid combination.

SIOS Real-Time Data Replication for High Availability and Disaster Recovery

SIOS Datakeeper™ uses efficient, block-level, data replication to keep your primary and secondary instances synchronized. If a failover happens, the secondary instance(s) continues to operate, providing users with access to the most recent data. With SIOS solutions, RPO is always zero and RTO is dependent on the application but typically 30 seconds to a few minutes.

SIOS products uniquely protect any Windows- or Linux-based application operating in physical, virtual, cloud or hybrid cloud environments and in any combination of site or disaster recovery scenarios, enabling high availability and disaster recovery for applications such as SAP and databases, including Oracle, HANA, MaxDB, SQL Server, DB2, and many others. The “out-of-the-box” simplicity, configuration flexibility, reliability, performance, and cost-effectiveness of SIOS products set them apart from other clustering software.

In a Windows environment, SIOS DataKeeper Cluster Edition seamlessly integrates with and extends Windows Server Failover Clustering (WSFC) by providing a performance-optimized, host-based data replication mechanism. While WSFC manages the software cluster, SIOS performs the data replication to enable disaster protection and ensure zero data loss in cases where shared storage clusters are impossible or impractical, such as in cloud, virtual, and high-performance storage environments.

In a Linux environment, SIOS LifeKeeper and SIOS DataKeeper provide a tightly integrated combination of high availability failover clustering, continuous application monitoring, data replication, and configurable recovery policies, protecting your business-critical applications from downtime and disasters.

———————————————————————————————————————————

Here is a real-world example of how one leading manufacturing company uses SIOS to create a high availability solution in the cloud using real-time data replication.

How to Achieve HA in a Cloud Environment with Real-Time Data Replication

Bonfiglioli is a leading Italian design, manufacturing, and distribution company, specializing in industrial automation, mobile machinery, and wind energy products and employing over 3,600 employees in locations around the globe. To run its business, the company relies on various mission-critical applications, including its SAP ERP system. The company’s IT infrastructure includes an on-premises VMware data center and a remote data center for business continuity and disaster protection. Since most of their applications run in a Windows environment, Bonfiglioli used guest-level Windows Server failover clustering in their VMware environment to provide high availability and disaster protection.

The company’s IT team implemented a program to move part of its IT operations into the Microsoft Azure cloud and to leverage Azure as their disaster recovery site. An important requirement of the company’s migration plan was to ensure the cloud architecture could provide better high availability protection than before and ensure Bonfiglioli could continue to meet its strict Service Level Agreements (SLAs).

In its on-premises environment, the company uses VMware clustering, which allows Windows Server Failover Clustering (WSFC) to manage failover to a secondary server in the event of an infrastructure failure. However, it was a challenge to provide this type of protection in the cloud because using guest-clustering with shared-bus disks is not a viable cloud solution. Creating a cluster in VMware using Raw Device Mapping and shared-bus disks (RDM) is challenging and creates limitations for backing up the virtual machines.

The Solution

After evaluating several solutions, Bonfiglioli chose SIOS DataKeeper as their cloud high availability and disaster recovery solution upon learning that SIOS DataKeeper is the only certified high availability clustering solution for SAP in a public cloud. In addition, Bonfiglioli’s management consulting partner, BGP, had experience with SIOS DataKeeper and knew that it is easy to install, transparent to the operating system, and a proven, highly effective solution.

With SIOS, the IT team fashioned a cluster environment without RDM. They created a two-node cluster in VMware and added SIOS DataKeeper Cluster Edition to synchronize storage via real-time data replication in each cluster instance. In an on-premises environment, synchronized storage appears to WSFC as a single shared storage disk.

SIOS DataKeeper also provides high availability protection for the company’s SAP instance and eliminates single point of failure. Using SIOS DataKeeper, the IT team replicated an SSD-tiered disk partition in the company’s on-premises data center using real-time data replication. This allows Bonfiglioli to restore their virtual machines to Microsoft Azure in the event of a disaster.

The Results

Daniele Bovina, Systems Architect at Bonfiglioli, comments about the results, “SIOS DataKeeper gave us an easy way to move our business-critical SAP system to the Microsoft Azure cloud while meeting our stringent SLAs for availability, disaster recovery, and performance.”

—————————————————————————————————————————–

For more information about SIOS Clustering Solutions, contact us or request a free trial.

References

  • https://storageservers.wordpress.com/2018/02/12/difference-between-backup-and-replication-2/
  • http://www.bbc.co.uk/newsbeat/article/16838342/could-the-digital-cloud-used-for-storage-ever-crash

Reproduced from SIOS

Filed Under: Clustering Simplified Tagged With: data replication, High Availability

Glossary: Data Replication

May 19, 2021 by Jason Aw Leave a Comment

glossary Data Replication

Glossary of Terms: Data Replication

Definition: The practices of copying information between redundant servers and keeping the copies consistent to improve reliability, fault-tolerance, or accessibility.

Reproduced from SIOS

Filed Under: Clustering Simplified Tagged With: data replication, glossary

Case Study: Chris O’Brien Lifehouse Hospital Ensures High Availability in the AWS Cloud with SIOS DataKeeper

May 12, 2020 by Jason Aw Leave a Comment

Chris O’Brien Lifehouse Hospital Ensures High Availability in the AWS Cloud with SIOS DataKeeper

Case Study: Chris O’Brien Lifehouse Hospital Ensures High Availability in the AWS Cloud with SIOS DataKeeper

SIOS Chosen for its Ability to Deliver both High Availability and High Performance

ChrisOBrien-Lifehouse logoChris O’Brien Lifehouse (www.mylifehouse.org.au) is an integrated and focused center of excellence specializing in state-of-the-art treatment and research for patients who are suffering from rare and complex cancer cases. Lifehouse offers everything a cancer patient might need in one place, including advanced oncology-surgery, chemotherapy, radiation therapy, clinical trials, research, education, complementary therapies and psychosocial support. Situated alongside Royal Prince Alfred Hospital and the University of Sydney in Camperdown, the not-for-profit hospital sees more than 40,000 patients annually for screening, diagnosis and treatment. As one of Australia’s largest clinical trial centers, Lifehouse also provides its patients access to the world’s latest cancer treatment breakthroughs.

The Environment

Lifehouse uses the MEDITECH healthcare Electronic Medical Record and patient administration system, which stores the electronic health records for all patients in a database.

“The health information system and database are vital to the care we provide, and if either goes down, patient records would not be accessible, and that would paralyze the hospital’s operations,” explains Peter Singer, Director Information Technology at Lifehouse.

In the hospital’s datacenter, mission-critical uptime has been provided by Windows Server Failover Clustering (WSFC) running on a Storage Area Network (SAN). But like many organizations, Lifehouse wanted to migrate to the cloud to take advantage of its superior agility and affordability.

The Challenge

Lifehouse chose Amazon Web Services as its cloud service provider, and had hoped to “lift and shift” its environment directly to the AWS cloud. To simulate its on-premises configuration, Peter chose a “cloud volumes” service available in the AWS Marketplace. Failover clusters were configured using software defined storage volumes to share data between active and standby instances, and testing proved that the approach could provide the automatic failover needed to satisfy the hospital’s demanding recovery point and recovery time objectives.

There was a problem, however: The use of software-defined cloud volumes had a substantial adverse impact on throughput performance. With so many elements and layers involved, performance problems are notoriously difficult to troubleshoot in software defined configurations deployed in the cloud. With the “No Protection” option specified, the cloud volumes performed well. But “No Protection” was not really an option for the Chris O’Brien Lifehouse Ensures High Availability in the AWS Cloud with SIOS DataKeeper

“We were able to go from testing to production in a matter of days. Ongoing maintenance is also quite simple, which we expect will minimize our operational expenditures associated with high availability and disaster recovery,” said Peter who is responsible for mission-critical MEDITECH application and its database. “We made every reasonable effort to find and fix the root cause, and eventually concluded that software-defined storage would never be able to deliver the throughput performance we needed,” Peter recalls. So the team at Lifehouse began looking for another solution.

The Evaluation

In its search for another solution capable of providing both high availability and high performance, Lifehouse established three criteria:

  • Validation for use in the AWS cloud
  • Ability to work across multiple Availability Zones
  • Performance that was as good as or better than what had been achieved on-premises
  • Security / Privacy with support for encryption in motion and at rest

Validation was important to minimize risk associated with using a third-party solution in the cloud. The ability to work across multiple Availability Zones would assure business continuity in the event an entire AWS datacenter was impacted by a localized disaster. The sub-millisecond latency AWS delivers between Availability Zones would be critical to being able to replicate data synchronously to “hot” standby instances to meet the hospital’s demanding recovery time and recovery point objectives.

After conducting an exhaustive search, Peter concluded that the best available solution was SIOS DataKeeper Cluster Edition from SIOS Technology. SIOS DataKeeper was available on the AWS Marketplace, which assured it was proven to operate reliably in the AWS cloud. And because it did not use software-defined storage, Peter was confident SIOS DataKeeper would be able to deliver the performance Lifehouse needed.

The Solution

SIOS DataKeeper provides the high-performance, synchronous data replication Lifehouse needs. By using real-time, block-level data mirroring between the local storage attached to all active and standby instances, the solution overcomes the problems caused by the lack of a SAN in the cloud, including the poor performance that often plagues software-defined storage. The resulting SANless cluster is compatible with Windows Server Failover Clustering, provides continuous monitoring for detecting failures at the application and database levels, and offers configurable policies for failover and failback.

Lifehouse currently has eight instances in SANless failover clusters to support its MEDITECH application and database across different AWS availability zones to protect against widespread disasters. The latency inherent across the long distances involved normally requires the use of asynchronous data replication to avoid delaying commits to the active instance of the database. But the real-time, block level data mirroring technology used in SIOS DataKeeper still enables Peter Singer to achieve a near-zero recovery point.

The Results

Unlike software-defined shared storage, SIOS DataKeeper is purpose-built for high performance high availability, so it came as no surprise to Peter Singer that the cloudbased configuration now works as needed. What was a bit surprising was just how easy the solution has been to implement and operate: “We were able to go from testing to production in a matter of days. Ongoing maintenance is also quite simple, which we expect will minimize our operational expenditures associated with high availability and disaster recovery.”

SIOS DataKeeper has enabled Lifehouse to take full advantage of the economies of scale afforded in the cloud without sacrificing uptime or performance. “If it were not for SIOS, we might not have been able to migrate our environment to the cloud,” Peter Singer concluded.

Download the pdf

Filed Under: Success Stories Tagged With: data replication, High Availability

  • 1
  • 2
  • 3
  • Next Page »

Recent Posts

  • Ensuring IT Resilience and Service Continuity in State and Local Government
  • SIOS LifeKeeper vs. Pacemaker in SUSE and Red Hat Environments
  • The Power of Approximation in Business Decisions and Communication
  • SAP Disaster Recovery: Techniques and Best Practices
  • Designing for High Availability and Disaster Recovery

Most Popular Posts

Maximise replication performance for Linux Clustering with Fusion-io
Failover Clustering with VMware High Availability
create A 2-Node MySQL Cluster Without Shared Storage
create A 2-Node MySQL Cluster Without Shared Storage
SAP for High Availability Solutions For Linux
Bandwidth To Support Real-Time Replication
The Availability Equation – High Availability Solutions.jpg
Choosing Platforms To Replicate Data - Host-Based Or Storage-Based?
Guide To Connect To An iSCSI Target Using Open-iSCSI Initiator Software
Best Practices to Eliminate SPoF In Cluster Architecture
Step-By-Step How To Configure A Linux Failover Cluster In Microsoft Azure IaaS Without Shared Storage azure sanless
Take Action Before SQL Server 20082008 R2 Support Expires
How To Cluster MaxDB On Windows In The Cloud

Join Our Mailing List

Copyright © 2026 · Enterprise Pro Theme on Genesis Framework · WordPress · Log in