Working with results

Every workflow writes JSON Lines: one file per app, one JSON object per line. JSONL is append-safe, streams well at corpus scale, and loads directly into pandas. Every record carries provenance — the input path, the workflow name, the toolkit version, and a timestamp — so any number in a paper can be traced to the run that produced it.

Loading a corpus into pandas

import json, pathlib
import pandas as pd

records = []
for path in pathlib.Path("results").glob("*.metadata.jsonl"):
    with open(path, encoding="utf-8") as fh:
        # skip blank lines, which json.loads would reject
        records.extend(json.loads(line) for line in fh if line.strip())

df = pd.DataFrame(records)

From there, typical questions are one-liners. Which A/B frameworks are most common across the corpus:

df.explode("ab")["ab"].value_counts()

Language coverage per app (the localisation field holds [language, region, device] triples):

df["languages"] = df["localisation"].apply(
    lambda locs: sorted({l[0] for l in locs}))
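
To make the triple layout concrete, here is a minimal self-contained sketch on a hand-built frame. The package names and triples are invented for illustration; only the [language, region, device] layout comes from the description above. The helper languages_of is hypothetical and also tolerates records where the localisation field is missing.

```python
import pandas as pd

# Hypothetical records; only the [language, region, device] triple layout
# is taken from the documentation.
toy = pd.DataFrame({
    "pkg": ["com.example.one", "com.example.two"],
    "localisation": [
        [["en", "GB", "phone"], ["en", "US", "phone"], ["fr", "FR", "tablet"]],
        [["de", "DE", "phone"]],
    ],
})

def languages_of(locs):
    """Return the sorted distinct language codes from a list of triples."""
    if not isinstance(locs, list):  # missing field (NaN or None): no languages
        return []
    return sorted({triple[0] for triple in locs})

toy["languages"] = toy["localisation"].apply(languages_of)
toy["n_languages"] = toy["languages"].apply(len)
```

The n_languages column can then be summarised or plotted like any other numeric column.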

Tracking change across versions of the same app, the core use of app histories, means grouping on the package name and sorting by version code:

history = (df.sort_values("version_code")
             .groupby("pkg")["ab"].apply(list))
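
One way to turn per-package histories into change events is to compare each version with the one before it. The sketch below runs on a hand-built frame: the package names, version codes, and framework names are invented, and it assumes the ab column holds a list of framework names per record. The helper ab_changes is hypothetical, not part of the toolkit.

```python
import pandas as pd

# Invented data; only the pkg, version_code, and ab column names come
# from the snippet above.
toy = pd.DataFrame({
    "pkg": ["com.example.app"] * 3 + ["com.example.other"],
    "version_code": [30, 10, 20, 5],
    "ab": [["framework_b"], ["framework_a"],
           ["framework_a", "framework_b"], []],
})

def ab_changes(df):
    """Yield (pkg, version_code, added, removed) wherever the A/B set changed."""
    ordered = df.sort_values(["pkg", "version_code"])
    for pkg, group in ordered.groupby("pkg", sort=True):
        previous = None
        for version, frameworks in zip(group["version_code"], group["ab"]):
            current = set(frameworks)
            # The first version of a package has nothing to compare against.
            if previous is not None and current != previous:
                yield (pkg, int(version),
                       sorted(current - previous),
                       sorted(previous - current))
            previous = current

changes = list(ab_changes(toy))
```

Each tuple names the version at which a framework appeared or disappeared, which is usually easier to tabulate than the raw per-version lists.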

The flows graph

Each *.flows.jsonl record contains a graph (nodes and links), a summary, and sankey edges ready for plotting libraries. Every link carries its evidence:

with open("results/MyApp.flows.jsonl") as fh:
    rec = json.loads(fh.readline())  # one JSON object per line; take the first
for link in rec["graph"]["links"]:
    if link["kind"] == "feeds":
        print(link["source"], "->", link["target"],
              link["score"], link["evidence"]["keywords"])

The score is an evidence count, not a probability: a link backed by an API reference, two keywords, and a corroborating permission scores higher than one backed by a single keyword, and the evidence lists let you audit exactly why a link exists. When reporting findings, audit a sample of links by hand first.
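
A manual audit is easier with a reproducible sample. The sketch below draws one from the "feeds" links of a single record and lists the weakest evidence first. The record is hand-built to mirror the fields used in the snippet above; the node names, keywords, and the "other" link kind are invented, and audit_sample is a hypothetical helper.

```python
import random

# Hand-built record mirroring the link fields shown above; values are invented.
rec = {
    "graph": {
        "links": [
            {"kind": "feeds", "source": "microphone", "target": "sdk_a",
             "score": 4, "evidence": {"keywords": ["record", "audio"]}},
            {"kind": "feeds", "source": "location", "target": "sdk_b",
             "score": 1, "evidence": {"keywords": ["gps"]}},
            {"kind": "other", "source": "sdk_a", "target": "sdk_b",
             "score": 2, "evidence": {"keywords": []}},
        ]
    }
}

def audit_sample(record, k, seed=0):
    """Pick up to k 'feeds' links at random, sorted by ascending score."""
    feeds = [link for link in record["graph"]["links"]
             if link["kind"] == "feeds"]
    rng = random.Random(seed)  # fixed seed so the sample can be reproduced
    sample = rng.sample(feeds, min(k, len(feeds)))
    return sorted(sample, key=lambda link: link["score"])

for link in audit_sample(rec, k=5):
    print(link["score"], link["source"], "->", link["target"],
          link["evidence"]["keywords"])
```

Recording the seed alongside the audit notes lets someone else draw the same sample later.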

Errors and skips

A failed app produces <app>.<workflow>.error.jsonl containing the input path and a traceback, rather than halting the batch. Count and inspect them before analysis:

ls results/*.error.jsonl | wc -l

A healthy corpus run ends with a summary line on stderr, e.g. done: {'ok': 9961, 'skipped': 0, 'error': 39}. Skipped means an output file already existed (resume behaviour); errors deserve a look — malformed APKs and truncated downloads are the usual causes.
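
To see which failures dominate, the error files can be tallied by the last line of each traceback. This is a sketch under assumptions: the text above says each error record holds the input path and a traceback, but the key name "traceback" used here is a guess, and summarise_errors is a hypothetical helper.

```python
import collections
import json
import pathlib

def summarise_errors(results_dir):
    """Count error records by the final line of their traceback."""
    counts = collections.Counter()
    for path in sorted(pathlib.Path(results_dir).glob("*.error.jsonl")):
        with open(path, encoding="utf-8") as fh:
            for line in fh:
                if not line.strip():
                    continue  # ignore blank lines
                record = json.loads(line)
                # "traceback" is an assumed key name; adjust to the real schema.
                tb = (record.get("traceback") or "").strip()
                last = tb.splitlines()[-1] if tb else "<no traceback>"
                counts[last] += 1
    return counts
```

Calling summarise_errors("results").most_common(10) would then list the ten most frequent exception messages, which is usually enough to separate malformed inputs from genuine bugs.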

Visualising

For a quick per-app picture, the Sankey viewer renders any flows or listening result file in the browser with no setup.

Analysing an APK in a notebook

The CLI is the batch path, but for ad-hoc analysis in a notebook or script the analyse module gives one-call entry points that do the same ingestion (split-APK merging, DEX URL extraction including runtime-assembled URLs):

from cim_app_histories.analyse import analyse_flows, analyse_listening

graph = analyse_flows("MyApp.apk")        # input -> module -> endpoint graph
listening = analyse_listening("MyApp.apk") # audio chain with parameters

graph["summary"], graph.get("warning")  # warning is None when absent

Both return the same structures the CLI writes, including a warning key when the file looks like an incomplete App Bundle base.
