Skip to main content
The Daily Kuala Lumpur

All of Kuala Lumpur, every day

News

The Numbers Problem: How Duplicate Images Are Quietly Draining Malaysia's Digital Economy

Behind every repeated photograph in a government database or e-commerce platform lies a measurable cost — and new data shows the scale of the problem is larger than most organisations admit.

Share

By Kuala Lumpur News Desk · Published 5 July 2026, 5:28 am

4 min read

Updated 2 h ago· 5 July 2026, 1:36 pm

How we reported this

This article was generated by AI from the linked public sources. The Daily Kuala Lumpur is independently owned and covers Kuala Lumpur news free from advertiser or sponsor influence. Read our editorial standards →

The Numbers Problem: How Duplicate Images Are Quietly Draining Malaysia's Digital Economy
Photo: Photo by Sasha Zilov on Pexels

Malaysia's push to digitise everything from land titles to pasar malam vendor licences has produced an unexpected byproduct: millions of duplicate images clogging government servers, slowing public platforms, and inflating storage bills that ultimately land on the taxpayer's tab. The precise scale is difficult to pin down, but industry analysts tracking Southeast Asian cloud infrastructure usage say duplicated visual assets — photographs, scanned documents, product images — routinely account for between 20 and 35 percent of total file storage in large institutional databases.

The timing matters. The Anwar Ibrahim administration has made Malaysia's digital economy a centrepiece of its economic agenda, with Putrajaya setting a target to contribute 25.5 percent of GDP from the digital sector by 2025 under the Malaysia Digital Economy Blueprint. Against that backdrop, wasted storage is not merely a housekeeping annoyance. It is a structural inefficiency that erodes the credibility of the very infrastructure the government is spending to build.

What the Numbers Actually Look Like

Walk into any medium-sized e-commerce operation on Jalan Ampang or a logistics company off Persiaran KLCC, and the story is the same. Product catalogues uploaded by vendors frequently contain the same base image renamed dozens of times — different filenames, identical pixel data. One mid-tier marketplace operating out of Bangsar South confirmed to industry consultants last year that a single audit of its product image library returned a duplication rate above 28 percent across roughly 4.2 million stored files. That is more than 1.1 million redundant images consuming server space for no commercial return.

The public sector picture is, by most technical assessments, worse. The MyDigital initiative, launched under the 12th Malaysia Plan, centralised scores of previously siloed agency databases. Migrating legacy systems into unified platforms meant that duplicate scans of the same IC document, the same land parcel photograph, or the same infrastructure inspection image were often imported multiple times — once from each originating agency. Cloud storage on enterprise contracts in Malaysia typically runs between RM 0.10 and RM 0.23 per gigabyte per month depending on tier and provider. At scale, even a modest reduction in duplicate files translates to six-figure annual savings.

The detection technology itself has matured considerably. Perceptual hashing — an algorithmic method that generates a fingerprint for each image and flags near-identical matches even when filenames or metadata differ — is now embedded in open-source tools as well as commercial platforms. Cyberjaya-based managed service providers have reported that running a deduplication pass on a 500GB image archive typically resolves in under four hours and can cut usable storage requirements by between 15 and 40 percent depending on how organically the archive grew.

Why Organisations Keep Delaying the Fix

The answer is largely institutional. Deduplication requires someone to decide which copy is canonical — and in large organisations, that decision touches on data ownership, departmental authority, and occasionally legal liability around document retention. At Kuala Lumpur City Hall, which manages visual records for thousands of development applications across the Federal Territory, the question of which department holds the master copy of an inspection photograph is not trivial. Delete the wrong instance and an audit trail breaks.

MDEC, the Malaysia Digital Economy Corporation headquartered in Cyberjaya, has been rolling out digital governance frameworks that include data quality standards, but specific mandates around image deduplication have not yet been formalised into procurement or operational guidelines as of mid-2026.

For businesses operating on platforms like Lazada's regional infrastructure or Shopee's Malaysian nodes — both of which process millions of product image uploads weekly — the commercial incentive is clearer and the correction faster. For government agencies, the push will likely need to come from the top, tied to storage cost audits that finance ministries are beginning to demand as part of broader fiscal discipline under the subsidy rationalisation agenda.

Organisations that want to start now without waiting for a top-down mandate have a practical path. A phased approach — audit first, flag duplicates with human review before deletion, then automate detection on future uploads — takes roughly eight to twelve weeks for a database of five million files. The upfront cost is modest. The ongoing saving compounds every month. The longer the delay, the deeper the pile grows.

You might also like

Editorial picks

How did this story land?

Spread the word

Share

Have your say

Loading comments…

Sources

About this article

Published by The Daily Kuala Lumpur

Covering news in Kuala Lumpur. This article was generated by AI from the linked sources and was not reviewed by a human editor before publishing. See our editorial standards.

Spread the word

Share

See something wrong? Suggest a correction.

Daily brief

Enjoyed this? Wake up to Kuala Lumpur news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Kuala Lumpur and accept our Privacy Policy. Unsubscribe anytime.

The Daily Network