Skip to main content
The Daily Kuala Lumpur

All of Kuala Lumpur, every day

News

Duplicate Images Are Costing Malaysian Businesses More Than They Realise: The Numbers Behind the Problem

From e-commerce listings in Chow Kit to government portals in Putrajaya, redundant image files are quietly inflating storage bills and slowing down the digital economy Malaysia is racing to build.

Share

By Kuala Lumpur News Desk · Published 5 July 2026, 5:41 am

4 min read

Updated 2 h ago· 5 July 2026, 1:36 pm

How we reported this

This article was generated by AI from the linked public sources. The Daily Kuala Lumpur is independently owned and covers Kuala Lumpur news free from advertiser or sponsor influence. Read our editorial standards →

Duplicate Images Are Costing Malaysian Businesses More Than They Realise: The Numbers Behind the Problem
Photo: Photo by Abhishek Navlakha on Pexels

Every day, thousands of duplicate product images are uploaded across Malaysian e-commerce platforms, government databases, and corporate intranets — and the storage costs are adding up to figures that IT managers are only now beginning to track. A conservative industry estimate cited in a 2024 report by cloud consultancy firm Gartner put duplicate and redundant data at between 25 and 30 percent of total enterprise storage in mid-sized Asian markets, a proportion that translates directly into wasted ringgit.

The timing matters. Malaysia's MyDIGITAL initiative, the government's blueprint for a digital economy, set a target of contributing 22.6 percent of GDP from the digital sector by 2025. Achieving that ambition depends on clean, efficient data infrastructure — and duplicate image files sit squarely in the way of that goal. When government agencies and private companies alike are paying for cloud tiers bloated with identical JPEGs stored under different filenames, the inefficiency is not abstract. It shows up in quarterly IT budgets.

What the Numbers Actually Look Like on the Ground

In Klang Valley alone, the scale becomes visible when you look at specific sectors. The Malaysia Digital Economy Corporation (MDEC), headquartered along Jalan Stesen Sentral near KL Sentral, oversees hundreds of SME digitalisation grants under programmes such as the SME Digitalisation Initiative. Many of the small retailers in areas like Chow Kit and Masjid India who received those grants used the funding to set up their first online storefronts. Product photography, often shot on a smartphone, gets re-uploaded every time a listing is refreshed or migrated to a new platform — sometimes three or four times for the same item.

Cloud storage costs in Malaysia's AWS Asia Pacific (Singapore) region currently run at approximately USD 0.025 per gigabyte per month for standard S3 storage tiers. For a mid-sized fashion retailer carrying 10,000 SKUs with an average of five duplicate image variants per product at roughly 2MB each, that amounts to about 100GB of entirely redundant data — a monthly charge of roughly RM 11.80 at current exchange rates, per retailer, per month. Multiply that across the 800,000-plus SMEs that the Small and Medium Enterprise Corporation Malaysia (SMECorp) counted in its 2023 annual report as operating nationally, and even a fraction participating in e-commerce represents a collective waste running into the millions of ringgit annually.

Deduplication software — tools that scan image libraries, compute hash values to identify pixel-identical files, and flag or remove redundant copies — has been commercially available for years. Perceptual hashing tools can even catch near-duplicate images: the same product shot with slightly different cropping or brightness. Yet adoption among Malaysian SMEs remains patchy. The Cybersecurity Malaysia agency based in Cyberjaya has published digital hygiene guidelines, but image deduplication does not feature prominently in the standard checklists distributed to small business owners through digitalisation workshops.

The Hidden Cost in Government and Property Data

The problem extends beyond retail. Property listing portals operating out of Bangsar South and serving the wider Klang Valley market routinely host duplicate images submitted by agents listing the same unit with multiple agencies. One industry analysis of a major Malaysian property portal, published in 2023 by a Kuala Lumpur-based data analytics firm, found that up to 18 percent of residential listing images on a single platform were near-duplicates of images already stored in the same database.

Government land and housing records are not immune either. The National Land Code digitalisation effort, pushed through the Department of Director General of Lands and Mines (JKPTG), involves scanning and uploading millions of title documents, many of which contain photographs. Without automated deduplication at the point of ingestion, the same scanned images can appear dozens of times across linked databases.

For IT managers reviewing budgets before the third quarter, the practical step is straightforward: run a hash-based audit of your image storage directories before renewing any cloud storage contract. Free tools including dupeGuru and open-source perceptual hash libraries in Python can complete a preliminary scan of a 50GB image folder in under two hours. That audit, done before the next billing cycle, is where the savings start.

You might also like

Editorial picks

How did this story land?

Spread the word

Share

Have your say

Loading comments…

Sources

About this article

Published by The Daily Kuala Lumpur

Covering news in Kuala Lumpur. This article was generated by AI from the linked sources and was not reviewed by a human editor before publishing. See our editorial standards.

Spread the word

Share

See something wrong? Suggest a correction.

Daily brief

Enjoyed this? Wake up to Kuala Lumpur news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Kuala Lumpur and accept our Privacy Policy. Unsubscribe anytime.

The Daily Network