Addressing Data Center Relocation Challenges Insights from H3C's bilibili Project

2025-03-06 6 min read
Topics:

    Bilibili is a popular platform that serves as a unique blend of social media and video-sharing site within the Chinese internet landscape. It offers a wide range of content, including diverse activities, lifestyle insights, gaming, entertainment, and technology knowledge. Users can engage with a variety of topics as well as participate in community-driven content through user-generated videos (PUGV) and professionally produced videos (OGV). The platform not only supports content consumption but also emphasizes commercial video production, allowing creators to reach their audiences directly and engage in monetization opportunities.

    After 18 months of work across multiple regions and the relocation of tens of thousands of servers and switching equipment, the bilibili data center has successfully completed its relocation project. The new data center features more advanced infrastructure and enhanced technical support. This upgrade will optimize the business layout, support overall remote multi-active operations, improve resource utilization and operational stability, and provide a better access service experience for bilibili users.

    Data center relocation is a complex systematic project that not only involves the relocation of servers, switches, routers, firewalls, and storage devices, but also requires consideration of data and business migration, network connection migration, computer room environment adjustment, and other aspects. Improper operations during relocation may lead to serious consequences such as equipment damage, data loss, and business interruption.

    To ensure the shortest downtime and zero impact and interruption to user services, bilibili collaborated with H3C to carefully plan the migration and relocation process. This partnership leveraged H3C's extensive experience in data center migrations and its strong team support. The project team devised a comprehensive emergency plan that addressed various scenarios, including unexpected changes in business operations, adjustments to data center policies, network outages under special circumstances, alterations in entry and exit procedures, and emergency repairs for transportation equipment in the data center. Each potential risk and issue was met with a clear and detailed emergency strategy, ensuring the smooth execution of the relocation while safeguarding data security and maintaining business continuity.

    During the 18-month multi-batch rolling migration, the project team effectively addressed various challenges, including complex scenarios, lengthy cycles, multiple coordinating parties, and difficult execution. The team executed the process step by step on-site.

    For instance, one batch of relocations involved over 1,700 devices, which required completing the entire process—from equipment removal to business re-launch—within one week. Team members dedicated themselves to the tasks at hand, which included shutting down businesses, backing up data, dismantling servers and switches, as well as transporting, installing, and shelving the equipment.

    All processes were carried out in an orderly manner, allowing the team to complete all relocation tasks on time. They achieved an impressive failure rate of less than 0.1% while ensuring a smooth business restart.

    With the national "dual carbon" strategic goal in mind, the new data center at the bilibili is focused on green energy conservation. The project incorporates the principles of a low-carbon economy along with energy conservation and emission reduction into its design and construction. Through careful layout planning, the use of advanced energy-saving equipment, and efficient operation and maintenance management, the overall Power Usage Effectiveness (PUE) of the computer room has been reduced from 1.5 to below 1.25. This improvement significantly lowers energy consumption and carbon emissions while enhancing the service level agreement (SLA) of the computer room.

    Additionally, the new data center employs state-of-the-art network equipment, which greatly improves network transmission efficiency and response times. The optimization of network topology and security measures has also considerably decreased the risk of network failures and downtime.

    It is noteworthy that H3C seized the opportunity to assist Bilibili in conducting a comprehensive management overhaul of the servers. This included replacing faulty hardware in batches, updating problematic firmware versions, standardizing host BMC/BIOS configurations, and aligning kernel versions and system environments. These steps were taken to ensure consistency across the system, simplify operation and maintenance management, and ultimately improve the operating efficiency and stability of the new computer room.

    You may also like

    Intelligent Computing | The DeepSeek Catalyst: Competing in the Age of AI Inference

    2025-09-26
    The emergence of DeepSeek is injecting vitality into the tech industry and profoundly reshaping the industrial landscape and daily life. As an open-source and highly efficient AI model, DeepSeek has not only significantly reduced the cost of training and inference but has also driven the widespread adoption and application of AI technology. However, this technological breakthrough has not diminished the demand for computing power. Instead, it has further highlighted the rigid need for more accessible computing resources amid new application scenarios and a thriving ecosystem. This demand places greater emphasis on flexibility and scalability, prompting the industry to shift its strategy from a "computing power race" to an "inference-centric" approach. How enterprises can find their development path in this transformation has become a critical issue in todays’ AI industry.

    REDnote Partners With H3C to Deploy the First DDC Architecture AI Computing Network Cluster Globally

    2025-09-26
    As a leading community platform in the industry, REDnote has always been committed to the innovation and application of AI technology. The platform has not only deeply integrated AIGC technology into content recommendation and intelligent creation processes to continuously enhance user experience, but has also actively pursued high-performance networking solutions since 2023. Balancing technological advancement and versatility, the platform promotes the large-scale implementation of efficient AI infrastructure. To address the new challenges in computing power networks brought by the development of large models, REDnote collaborated with H3C to successfully complete the large-scale validation of an intelligent computing network based on the DDC architecture, achieving the first cluster deployment of its kind globally.

    Addressing High AI Computing Costs and Inefficient Multi-Tenant Operations with 400G RoCE Network Solutions

    2025-03-20
    With the widespread application of large model such as ChatGPT, DeepSeek, etc. the market demand structure for computing power has changed significantly. The proportion of demand for inference computing power has increased significantly, and the overall computing power demand has also grown rapidly due to the high efficiency of the model and the lower application threshold. Against this background, the leasing business of intelligent computing centers has become popular. At present, many companies with high demand for computing power, including companies and individual developers engaged in model training, film and television special effects, virtual digital humans and other fields, have chosen to rent computing power from this intelligent computing center.

    Addressing Data Center Relocation Challenges Insights from H3C's bilibili Project

    2025-03-06
    Bilibili is a popular platform that serves as a unique blend of social media and video-sharing site within the Chinese internet landscape. It offers a wide range of content, including diverse activities, lifestyle insights, gaming, entertainment, and technology knowledge.
    新华三官网