Usage
Basic Example
The following example builds a network for all airports reachable within two hops of Winnipeg (YWG):
from wikipediaGATN.wikipedia_network_level import iterate_search_until_distance_N
from wikipediaGATN.result_processing_network import (
    create_outbound_connections_list,
    run_two_pass_iata_extraction,
    create_outbound_adjacency_matrix,
)
# 1. Crawl Wikipedia
iterate_search_until_distance_N("YWG", dist=2, delay=0.5, verbose=True)
# 2. Build connections list
create_outbound_connections_list(verbose=True)
# 3. Recover missing IATA codes
run_two_pass_iata_extraction(batch_size=50, delay=0.5, verbose=True)
# 4. Rebuild the connections list using the enriched IATA mapping
create_outbound_connections_list(verbose=True)
# 5. Export adjacency matrix
create_outbound_adjacency_matrix(symmetric=False, verbose=True)
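To make the result of step 5 more concrete, here is a minimal, library-independent sketch of how an outbound adjacency matrix can be derived from a list of directed connections. The function `build_adjacency_matrix` and the sample routes are hypothetical and do not reflect wikipediaGATN's internals or file formats. The `symmetric` flag is modelled on the call above, but its exact behavior in the library is an assumption.

```python
def build_adjacency_matrix(connections, symmetric=False):
    """Return (codes, matrix) for a list of (origin, destination) pairs.

    Hypothetical helper for illustration only; not part of wikipediaGATN.
    """
    # Collect every airport code that appears on either side of a route.
    codes = sorted({code for pair in connections for code in pair})
    index = {code: i for i, code in enumerate(codes)}

    # matrix[i][j] == 1 means there is a route from codes[i] to codes[j].
    matrix = [[0] * len(codes) for _ in codes]
    for origin, destination in connections:
        matrix[index[origin]][index[destination]] = 1
        if symmetric:
            # Assumed meaning of symmetric=True: mirror each route.
            matrix[index[destination]][index[origin]] = 1
    return codes, matrix


# Made-up sample routes, not data produced by the crawler.
routes = [("YWG", "YYZ"), ("YWG", "YVR"), ("YYZ", "LHR")]
codes, matrix = build_adjacency_matrix(routes, symmetric=False)
```

With `symmetric=False`, the row for an airport lists only its outbound routes, so an airport that appears solely as a destination gets an all-zero row.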
Global Crawl
To perform a full global crawl (this may take several hours):
from wikipediaGATN.wikipedia_network_level import iterate_search_until_empty
iterate_search_until_empty("YWG", delay=0.5, verbose=True)
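The difference between a distance-limited crawl and a crawl that runs until nothing is left to visit can be sketched on a toy graph. This assumes the crawl behaves like a breadth-first search, which the source does not state. The function `crawl`, its `max_dist` parameter, and the `toy` graph are hypothetical, and the sketch makes no network requests.

```python
from collections import deque


def crawl(start, neighbors, max_dist=None):
    """Breadth-first search returning {airport: hop distance from start}.

    Hypothetical illustration; not the wikipediaGATN implementation.
    """
    distance = {start: 0}
    frontier = deque([start])
    while frontier:
        current = frontier.popleft()
        # With a limit, airports at the maximum distance are not expanded.
        if max_dist is not None and distance[current] >= max_dist:
            continue
        for nxt in neighbors.get(current, []):
            if nxt not in distance:
                distance[nxt] = distance[current] + 1
                frontier.append(nxt)
    return distance


# Made-up outbound routes for illustration only.
toy = {"YWG": ["YYZ", "YVR"], "YYZ": ["LHR", "YWG"], "LHR": ["SIN"], "SIN": []}

limited = crawl("YWG", toy, max_dist=2)  # stops expanding at two hops
full = crawl("YWG", toy)                 # runs until the frontier is empty
```

In this toy graph the limited crawl never reaches SIN, which is three hops from YWG, while the unbounded crawl visits every airport reachable from the start. On real data the unbounded frontier can grow to the whole network, which is consistent with the multi-hour runtime noted above.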