Kijito Citation Lab.

← Back to the index

Research note 09

How does AI represent WhatsApp-first SMEs

WhatsApp-first Kenyan SMEs can be real, active and locally visible while remaining hard for answer engines to cite cleanly. Their evidence is often fragmented across social posts, map listings, marketplace traces and phone-based contact paths, so AI answers may rely on proxies or leave the business weakly supported.

Recorded by Kijito Citation Lab March 26, 2026

A WhatsApp-first business can be active every day and still look faint to an answer engine. The problem is not absence from the market. It is the lack of a stable, citeable page that ties name, category, location and claim together.

A small Nairobi merchant in the lab’s composite notes sells through WhatsApp, posts product photos on social platforms, keeps a map listing and appears in a few marketplace traces. Customers know the number. Repeat buyers forward screenshots. The public web knows pieces of the business, but rarely on one page. When an answer engine is asked for a Kenyan supplier in that category, the merchant may appear as a name with very little support, or disappear behind a marketplace listing that says more about the product than the seller.

A second composite picture is even rougher. The business has no conventional website. The strongest evidence is a Google Business profile, a string of social posts and a contact route that starts with “send a WhatsApp message.” The answer engine can describe the category, sometimes the area, occasionally the merchant’s name. Then the citation points to a platform page, a social profile preview or a directory snippet that supports only part of the visible answer. The business is not invisible. It is hard to quote.

WhatsApp-first does not mean evidence-free

Kenyan SMEs often use WhatsApp because it fits the way commerce actually happens: quick enquiries, negotiated details, product photos, delivery coordination, repeat customers and trust carried through direct contact. For many small businesses, a website is less central than a phone number that works. Public evidence accumulates sideways: posts, stories, map entries, marketplace pages, directory fragments, customer mentions and occasional supplier listings.

A WhatsApp-first SME is a business whose public customer path is organised around direct messaging, because its strongest commercial evidence sits in phone-based contact and fragmented platform traces rather than a conventional website.

The definition needs care. The lab is not treating these businesses as less formal by default. Some are registered. Some pay taxes. Some have supplier relationships or county-facing traces. Others are informal but locally known. “WhatsApp-first” describes the public evidence path, not the worth of the business.

Answer engines, however, need citeable surfaces. They cannot cite a private chat thread. They usually cannot see forwarded catalogues, closed-group recommendations or disappearing story formats as stable evidence. Even when a WhatsApp number is public, the number itself does not prove stock, service quality, registration, delivery area or continuity. The model must look around the number for pages it can name.

That creates a gap between commercial reality and citation reality. The customer may know the business through a direct thread. The answer engine may know it through a thin map listing. The owner may think of the business as highly visible because customers message every day. The AI answer may treat it as weakly supported because the public, citeable trail is scattered.

Kijito Citation Lab studies that gap under its broader source-dependency frame. The question is not whether WhatsApp-first SMEs are real. The question is which sources are allowed to speak for them when the business itself has no strong, self-authored page.

The source path breaks into fragments

In the lab’s observations, WhatsApp-first businesses tend to create broken source paths. One source carries the name. Another carries the product category. A third carries the location. A fourth carries current activity. None carries the full claim. When an answer engine writes a smooth sentence, it may be smoothing over a trail that is actually in pieces.

A map listing can confirm a place-like presence. It may show the name, area and sometimes reviews. It often says little about the full service range. A social profile can show current posts, but the content may be hard to cite cleanly or hidden behind login walls, previews and changing feeds. A marketplace listing can show products, but it may frame the merchant as a seller inside the platform rather than as a local business with its own identity. A directory trace can mention the name and phone number, yet be stale or copied from another source.

The result is a sentence that feels plausible but has weak joints. “This Nairobi seller offers household appliances through WhatsApp” might be built from a social bio, a marketplace product page and a map trace. If only one of those is cited, the reader cannot see the whole support path. If the cited page supports only the product, the WhatsApp operating model remains inferred. If it supports only the contact route, the product claim may be thin.

This is where the lab uses the Citation Source Role Typology. A local record may include a Kenyan-owned page, registry trace, county reference, licence cue, trade-body mention or supplier profile. A local story can be a Kenyan press, community or sector mention. A platform proxy is an international platform, marketplace, directory or booking profile that speaks for the business when stronger local evidence is absent, ignored or harder to retrieve. An unsupported echo is a repeated claim without a cited page that can carry it.

For WhatsApp-first SMEs, the platform proxy role appears often because platforms provide the stable pages that private messaging cannot. A marketplace page may become the witness for a seller. A map profile may become the witness for local presence. A social profile preview may become the witness for current activity. These sources can be useful, but the lab keeps asking whether they support the specific claim or merely point toward it.

The same business may move between roles depending on the prompt. Ask about “where to buy a product,” and a marketplace listing may be good evidence. Ask whether the business is locally owned, officially registered or reliable, and the same listing becomes too thin. The role is claim-dependent.

What the answer engine tends to infer

The lab is cautious about claiming intent inside a model. It can inspect visible outputs and cited sources, not hidden reasoning. Still, repeated review of composite WhatsApp-first cases shows a recurring pattern: when public evidence is fragmented, the answer often fills gaps with category-shaped language. It describes what a business of that type normally does, then attaches the available name or citation.

A phone-accessory seller becomes “a retailer of mobile accessories.” A home baker becomes “a cake and pastry business.” A small repair operator becomes “a local repair service.” These may be true descriptions. The issue is support. The cited page may show only a few posts or a directory category. The answer’s wording can make the business sound more settled, broader or more formally described than the source allows.

This is especially visible when the prompt asks for recommendations. The model may prefer businesses with cleaner citations, which can push WhatsApp-first SMEs behind larger platforms or better-indexed competitors. If the SME does appear, the recommendation may lean on a map listing or marketplace page. The answer can then inherit the platform’s categories and blind spots.

In Object A, the composite Nairobi home-services SME, WhatsApp sits beside a simple company page rather than replacing it completely. That produces a better source path. The company page can act as a local record for identity and service category, while the map listing and social posts add activity cues. The answer may still cite a directory if the company page is thin, but the local record exists. The business is easier to quote.

A pure WhatsApp-first merchant is different. Without a stable page connecting name, category, location and contact route, the answer engine has to assemble identity from fragments. Sometimes it refuses and gives a generic answer. Sometimes it names the business but cites a weak source. Sometimes it picks a platform-visible competitor instead. None of these outcomes prove the merchant lacks customers. They show that its evidence is hard for machines to hold.

AI visibility for WhatsApp-first SMEs depends less on how active the business is in private channels and more on what public evidence can be cited.

That sentence is not a moral judgment. It is a source-path observation. A business can be commercially alive and citation-poor at the same time.

Swahili and local phrasing can change the merchant the model sees

WhatsApp-first commerce often relies on local phrasing, short names, product nicknames and mixed-language descriptions. A seller may post in English, Swahili, Sheng-influenced wording or a practical blend that customers understand instantly. Answer engines may not connect those variants cleanly. A prompt in English may retrieve marketplace categories. A Swahili prompt may retrieve broader local-language traces, or it may lose the business if the public evidence is mostly English-labelled.

The lab marks these as language-sensitive cases when English and Swahili prompt variants lead to different source choice, entity match or claim strength. For WhatsApp-first SMEs, language sensitivity can appear in small ways. The business name may stay the same, but the category changes. The source may shift from marketplace to map listing. The answer may become more cautious in Swahili because the source path has fewer clean pages. Or the Swahili query may better match how customers actually describe the product.

The lab does not flatten that into a simple language penalty. A Swahili phrasing can change the intended query. It may ask for a local buying route rather than a formal business profile. It may favour conversational commerce over company pages. That shift is part of the observation. The team records prompt wording, language variant, cited source and visible claim together because separating them would make the comparison misleading.

Local naming also complicates entity resolution. A WhatsApp-first business may use a nickname, a handle and a registered name if it has one. A social page may show one spelling. A map listing may show another. Customers may search by the founder’s name. The answer engine may choose the most indexed variant, which is not always the variant the business uses publicly with customers.

This matters for citation because a source can look like a match while pointing to a parallel identity. A marketplace seller handle may resemble the social handle but belong to a different operator. A map listing may preserve an older business name. A directory may copy a phone number but mislabel the category. The lab treats these as entity-collision risks, even when the business is small.

WhatsApp-first SMEs therefore sit at the edge of two problems: fragmented evidence and unstable naming. The more the business depends on direct messaging, the more important the public fragments become. Each fragment must carry more weight than it was designed to carry.

What this means for agencies, owners and trade bodies

For an agency reviewing AI visibility, the first mistake is to ask only whether the SME appears. Presence is not the whole issue. A WhatsApp-first merchant may appear through a platform proxy that distorts the business category, hides local ownership or overstates what the cited page can prove. The audit has to inspect the source role. Is the answer using a local record, a local story, a platform proxy or an unsupported echo?

For an owner, the lesson is uncomfortable but practical. A phone number and active customer chats do not automatically create citeable evidence. If the business wants answer engines to represent it accurately, at least one public surface should connect the basic facts in readable text: name, category, location, contact route and the claim the owner most needs supported. That does not require a large website. It does require a stable page that can be cited.

Trade bodies face a wider version of the same problem. Many member businesses may be active through WhatsApp and social platforms but underrepresented in AI answers because their formal traces are missing, inconsistent or too hard to connect. A trade-body page, supplier profile or member directory can become a local record if it identifies the business clearly and states the relevant category. If it only lists names without context, answer engines may still fall back on proxies.

The lab avoids turning this into a checklist article because the research question is about representation, not advice alone. Still, the implication is plain: the more fragmented the business evidence, the more likely a platform will speak for it. The owner may not mind when the claim is simple, such as product availability. The risk rises when the answer makes stronger claims about trust, status, coverage or suitability.

A WhatsApp-first business can also be harmed by over-smoothing. The answer may make it look more formal than it is, or less active than it is, depending on which public trace gets cited. Both are representation problems. The lab is not interested only in under-visibility. Misplaced confidence can be just as damaging.

The strongest public evidence for these SMEs is often modest. A clear profile. A stable business page. A trade listing with category and county. A map listing that matches the current name. A social bio that states the contact route plainly. None of these guarantees an answer engine will cite the business. They make the source path less brittle.

Limits of the observation

The lab cannot inspect private WhatsApp conversations, closed customer groups or non-public sales records. That is a hard boundary. Its method works with visible answers, prompt wording, language variants, cited pages and public claims. A business may have strong private demand and still look weak in the lab’s citation review because the question is public evidence, not actual revenue or customer loyalty.

The material also does not claim that WhatsApp-first SMEs are uniformly absent from AI answers. Some appear clearly, especially when supported by map listings, marketplace pages, local stories or a simple site. Others appear only as category examples. Some do not surface at all. The lab describes these as observed patterns across composite scenarios, not as measured percentages.

There is a freshness problem too. Social posts and product availability change. A cited page may show a product that is no longer sold, or miss a service that has become central to the business. Answer engines change their citation behaviour as well. The lab therefore records dates, model surfaces, language variants and major answer changes during repeated runs, while avoiding broad claims that pretend the system is fixed.

Swahili comparisons carry their own uncertainty. A Swahili prompt can shift commercial intent, not merely translate an English query. The lab labels such cases language-sensitive when source choice, entity match or claim strength changes. It does not force every difference into a single explanation.

The cautious finding is that WhatsApp-first Kenyan SMEs are not evidence-free; they are often evidence-fragmented. AI systems can represent them only through the public pieces they can retrieve and cite. When those pieces are scattered across messaging routes, social posts, map profiles and platforms, the answer may lean on proxies, stretch a claim or leave the business unresolved. The business may be alive in the phone. The citation path may still be barely stitched.

Kijito Citation Lab
responsible for the record
Kijito Citation Lab · Kenya · March 26, 2026