diff --git a/docs/build/doctrees/404.doctree b/docs/build/doctrees/404.doctree index ade5e4e..5084b28 100644 Binary files a/docs/build/doctrees/404.doctree and b/docs/build/doctrees/404.doctree differ diff --git a/docs/build/doctrees/about/installation.doctree b/docs/build/doctrees/about/installation.doctree index afc2d43..285a6e8 100644 Binary files a/docs/build/doctrees/about/installation.doctree and b/docs/build/doctrees/about/installation.doctree differ diff --git a/docs/build/doctrees/about/introduction.doctree b/docs/build/doctrees/about/introduction.doctree index 63ee405..012555a 100644 Binary files a/docs/build/doctrees/about/introduction.doctree and b/docs/build/doctrees/about/introduction.doctree differ diff --git a/docs/build/doctrees/about/pipeline.doctree b/docs/build/doctrees/about/pipeline.doctree index 3158263..2b10359 100644 Binary files a/docs/build/doctrees/about/pipeline.doctree and b/docs/build/doctrees/about/pipeline.doctree differ diff --git a/docs/build/doctrees/environment.pickle b/docs/build/doctrees/environment.pickle index fa143e8..6959e38 100644 Binary files a/docs/build/doctrees/environment.pickle and b/docs/build/doctrees/environment.pickle differ diff --git a/docs/build/doctrees/index.doctree b/docs/build/doctrees/index.doctree index d002da6..e803e74 100644 Binary files a/docs/build/doctrees/index.doctree and b/docs/build/doctrees/index.doctree differ diff --git a/docs/build/doctrees/modules/GeoDataGetter.doctree b/docs/build/doctrees/modules/GeoDataGetter.doctree index 07943e3..b1ceb99 100644 Binary files a/docs/build/doctrees/modules/GeoDataGetter.doctree and b/docs/build/doctrees/modules/GeoDataGetter.doctree differ diff --git a/docs/build/doctrees/modules/Geocoder_special.doctree b/docs/build/doctrees/modules/Geocoder_special.doctree index 0a7ce45..4d517b3 100644 Binary files a/docs/build/doctrees/modules/Geocoder_special.doctree and b/docs/build/doctrees/modules/Geocoder_special.doctree differ diff --git 
a/docs/build/doctrees/modules/OtherGeoObjects.doctree b/docs/build/doctrees/modules/OtherGeoObjects.doctree index 8e0a17d..4383ba9 100644 Binary files a/docs/build/doctrees/modules/OtherGeoObjects.doctree and b/docs/build/doctrees/modules/OtherGeoObjects.doctree differ diff --git a/docs/build/doctrees/modules/StreetExtractor.doctree b/docs/build/doctrees/modules/StreetExtractor.doctree index 4bd0a10..a269aa7 100644 Binary files a/docs/build/doctrees/modules/StreetExtractor.doctree and b/docs/build/doctrees/modules/StreetExtractor.doctree differ diff --git a/docs/build/doctrees/modules/Streets.doctree b/docs/build/doctrees/modules/Streets.doctree index 85d7be3..bd47615 100644 Binary files a/docs/build/doctrees/modules/Streets.doctree and b/docs/build/doctrees/modules/Streets.doctree differ diff --git a/docs/build/doctrees/modules/VKParser.doctree b/docs/build/doctrees/modules/VKParser.doctree index 120600f..e3d07df 100644 Binary files a/docs/build/doctrees/modules/VKParser.doctree and b/docs/build/doctrees/modules/VKParser.doctree differ diff --git a/docs/build/doctrees/modules/city_services_extract.doctree b/docs/build/doctrees/modules/city_services_extract.doctree index fb32248..c4815e1 100644 Binary files a/docs/build/doctrees/modules/city_services_extract.doctree and b/docs/build/doctrees/modules/city_services_extract.doctree differ diff --git a/docs/build/doctrees/modules/data_getter.doctree b/docs/build/doctrees/modules/data_getter.doctree index 81c37bc..3c5f780 100644 Binary files a/docs/build/doctrees/modules/data_getter.doctree and b/docs/build/doctrees/modules/data_getter.doctree differ diff --git a/docs/build/doctrees/modules/emotion_classifier.doctree b/docs/build/doctrees/modules/emotion_classifier.doctree index 6322205..d8ffb76 100644 Binary files a/docs/build/doctrees/modules/emotion_classifier.doctree and b/docs/build/doctrees/modules/emotion_classifier.doctree differ diff --git a/docs/build/doctrees/modules/event_dynamic_prediction.doctree 
b/docs/build/doctrees/modules/event_dynamic_prediction.doctree index b2ea3a2..8587362 100644 Binary files a/docs/build/doctrees/modules/event_dynamic_prediction.doctree and b/docs/build/doctrees/modules/event_dynamic_prediction.doctree differ diff --git a/docs/build/doctrees/modules/geocoder.doctree b/docs/build/doctrees/modules/geocoder.doctree index fc6558e..d392f75 100644 Binary files a/docs/build/doctrees/modules/geocoder.doctree and b/docs/build/doctrees/modules/geocoder.doctree differ diff --git a/docs/build/doctrees/modules/regional_activity.doctree b/docs/build/doctrees/modules/regional_activity.doctree index f3311f8..5e1a7d1 100644 Binary files a/docs/build/doctrees/modules/regional_activity.doctree and b/docs/build/doctrees/modules/regional_activity.doctree differ diff --git a/docs/build/doctrees/modules/semantic_graph.doctree b/docs/build/doctrees/modules/semantic_graph.doctree index b4018e5..5582b85 100644 Binary files a/docs/build/doctrees/modules/semantic_graph.doctree and b/docs/build/doctrees/modules/semantic_graph.doctree differ diff --git a/docs/build/doctrees/modules/text_classifier.doctree b/docs/build/doctrees/modules/text_classifier.doctree index acc4c19..dad37b2 100644 Binary files a/docs/build/doctrees/modules/text_classifier.doctree and b/docs/build/doctrees/modules/text_classifier.doctree differ diff --git a/docs/build/doctrees/modules/topic_modeler.doctree b/docs/build/doctrees/modules/topic_modeler.doctree index 7d28a01..1c6bbad 100644 Binary files a/docs/build/doctrees/modules/topic_modeler.doctree and b/docs/build/doctrees/modules/topic_modeler.doctree differ diff --git a/docs/build/doctrees/modules/visualize_graph.doctree b/docs/build/doctrees/modules/visualize_graph.doctree index 379b138..8ae8ad0 100644 Binary files a/docs/build/doctrees/modules/visualize_graph.doctree and b/docs/build/doctrees/modules/visualize_graph.doctree differ diff --git a/docs/build/html/.buildinfo b/docs/build/html/.buildinfo index fc68774..8ae4c39 100644 --- 
a/docs/build/html/.buildinfo +++ b/docs/build/html/.buildinfo @@ -1,4 +1,4 @@ # Sphinx build info version 1 # This file hashes the configuration used when building these files. When it is not found, a full rebuild will be done. -config: 34fe29b96d534ed85aad7b7243c9abd1 +config: f61a6e53f7100386fd88718507394ad6 tags: 645f666f9bcd5a90fca523b33c5a78b7 diff --git a/docs/build/html/404.html b/docs/build/html/404.html index 07aaf15..a98403c 100644 --- a/docs/build/html/404.html +++ b/docs/build/html/404.html @@ -5,11 +5,11 @@ - - Page Not Found - sloyka 0.1.0 documentation + + Page Not Found - sloyka 0.2.1 documentation - - + + @@ -89,19 +89,6 @@ - - Auto light/dark mode - - - - - - - - - - Auto light/dark, in light mode - @@ -180,7 +167,7 @@
@@ -207,7 +194,7 @@
- sloyka 0.1.0 documentation + sloyka 0.2.1 documentation
- + \ No newline at end of file diff --git a/docs/build/html/_modules/index.html b/docs/build/html/_modules/index.html index 8026f93..2d5b4e7 100644 --- a/docs/build/html/_modules/index.html +++ b/docs/build/html/_modules/index.html @@ -4,11 +4,11 @@ - - Overview: module code - sloyka 0.1.0 documentation + + Overview: module code - sloyka 0.2.1 documentation - - + + @@ -88,19 +88,6 @@
- - Auto light/dark mode - - - - - - - - - - Auto light/dark, in light mode - @@ -179,7 +166,7 @@
-
sloyka 0.1.0 documentation
+
sloyka 0.2.1 documentation
@@ -206,7 +193,7 @@
- sloyka 0.1.0 documentation + sloyka 0.2.1 documentation @@ -294,6 +281,9 @@

All modules for which code is available

  • sloyka.src.risks.event_detector
  • sloyka.src.risks.text_classifier
  • sloyka.src.semantic_graph.semantic_graph_builder
  • +
  • sloyka.src.utils.data_getter.geo_data_getter
  • +
  • sloyka.src.utils.data_getter.street_getter
  • +
  • sloyka.src.utils.data_getter.vk_data_getter
  • sloyka.src.utils.data_processing.city_services_extract
  • sloyka.src.visual.graph_visualization
  • @@ -327,9 +317,9 @@

    All modules for which code is available

    - + - + \ No newline at end of file diff --git a/docs/build/html/_modules/sloyka/src/geocoder/city_objects_extractor.html b/docs/build/html/_modules/sloyka/src/geocoder/city_objects_extractor.html index a96e67f..0ea86f3 100644 --- a/docs/build/html/_modules/sloyka/src/geocoder/city_objects_extractor.html +++ b/docs/build/html/_modules/sloyka/src/geocoder/city_objects_extractor.html @@ -4,11 +4,11 @@ - - sloyka.src.geocoder.city_objects_extractor - sloyka 0.1.0 documentation + + sloyka.src.geocoder.city_objects_extractor - sloyka 0.2.1 documentation - - + + @@ -88,19 +88,6 @@ - - Auto light/dark mode - - - - - - - - - - Auto light/dark, in light mode - @@ -179,7 +166,7 @@
    @@ -206,7 +193,7 @@
    - sloyka 0.1.0 documentation + sloyka 0.2.1 documentation @@ -290,7 +277,6 @@

    Source code for sloyka.src.geocoder.city_objects_extractor

    from typing import List import re import pandas as pd -import osmnx as ox from shapely.geometry import Point, Polygon, MultiPolygon from loguru import logger import pymorphy2 @@ -411,20 +397,23 @@

    Source code for sloyka.src.geocoder.city_objects_extractor

    @staticmethod def restoration_of_normal_form(other_geo_obj, osm_combined_df, threshold=0.7) -> List[str]: """ - This function compares the extracted location entity with an OSM array and returns a normalized form if the percentage of similarity is at least 70%. + This function compares the extracted location entity with an OSM array + and returns a normalized form if the percentage of similarity is at least 70%. """ osm_name_obj = osm_combined_df["name"].tolist() similarity_matrix = np.zeros((len(other_geo_obj), len(osm_name_obj))) def extract_numbers(s): return re.findall(r"\d+", s) + + percents = 100 for i, word1 in enumerate(other_geo_obj): numbers_from_extraction = extract_numbers(word1) for j, word2 in enumerate(osm_name_obj): numbers_from_OSM_name = extract_numbers(word2) if numbers_from_extraction == numbers_from_OSM_name: - similarity = fuzz.ratio(word1, word2) / 100.0 + similarity = fuzz.ratio(word1, word2) / percents else: similarity = 0 similarity_matrix[i, j] = similarity @@ -440,10 +429,13 @@

    Source code for sloyka.src.geocoder.city_objects_extractor

    return restoration_list
    +
    +[docs] @staticmethod - def find_num_city_obj(text, NUM_CITY_OBJ) -> List[str]: + def find_num_city_obj(text) -> List[str]: """ - This function searches for urban objects in the text, the names of which are represented as a number. For example, "school No. 6". + This function searches for urban objects in the text, + the names of which are represented as a number. For example, "school No. 6". """ text = str(text) text = text.lower() @@ -464,8 +456,11 @@

    Source code for sloyka.src.geocoder.city_objects_extractor

    else: num_obj_list_clear[key] = obj - return list(num_obj_list_clear.values()) + return list(num_obj_list_clear.values())
    + +
    +[docs] @staticmethod def combine_city_obj(df_obj) -> pd.DataFrame: """ @@ -473,8 +468,11 @@

    Source code for sloyka.src.geocoder.city_objects_extractor

    """ df_obj["other_geo_obj"] = df_obj["other_geo_obj"] + df_obj["other_geo_obj_num"] df_obj.drop(columns=["other_geo_obj_num"], inplace=True) - return df_obj + return df_obj
    + +
    +[docs] @staticmethod def expand_toponym(df_obj) -> pd.DataFrame: """ @@ -485,8 +483,11 @@

    Source code for sloyka.src.geocoder.city_objects_extractor

    lambda x: x if isinstance(x, list) and x else None ) expanded_df = expanded_df.explode("other_geo_obj").reset_index(drop=True) - return expanded_df + return expanded_df
    + +
    +[docs] @staticmethod def find_geometry(toponym, osm_combined_df) -> Point: """ @@ -498,8 +499,11 @@

    Source code for sloyka.src.geocoder.city_objects_extractor

    if not match.empty: return match.iloc[0, 1] else: - return None + return None
    + +
    +[docs] @staticmethod def find_geo_obj_tag(toponym, osm_combined_df) -> str: """ @@ -511,11 +515,15 @@

    Source code for sloyka.src.geocoder.city_objects_extractor

    if not match.empty: return match.iloc[0, -1] else: - return None + return None
    + +
    +[docs] @staticmethod def get_unique_part_types(df): - return df["other_geo_obj"].unique() + return df["other_geo_obj"].unique()
    +
    [docs] @@ -530,7 +538,7 @@

    Source code for sloyka.src.geocoder.city_objects_extractor

    df_obj["other_geo_obj"] = df_obj[text_column].apply(OtherGeoObjects.extract_geo_obj) df_obj["other_geo_obj_num"] = df_obj[text_column].apply( - lambda x: OtherGeoObjects.find_num_city_obj(x, NUM_CITY_OBJ) + lambda x: OtherGeoObjects.find_num_city_obj(x) ) df_obj = OtherGeoObjects.combine_city_obj(df_obj) @@ -583,9 +591,9 @@

    Source code for sloyka.src.geocoder.city_objects_extractor

    +
    - + \ No newline at end of file diff --git a/docs/build/html/_modules/sloyka/src/geocoder/geocoder.html b/docs/build/html/_modules/sloyka/src/geocoder/geocoder.html index 387b2ab..a39cebf 100644 --- a/docs/build/html/_modules/sloyka/src/geocoder/geocoder.html +++ b/docs/build/html/_modules/sloyka/src/geocoder/geocoder.html @@ -4,11 +4,11 @@ - - sloyka.src.geocoder.geocoder - sloyka 0.1.0 documentation + + sloyka.src.geocoder.geocoder - sloyka 0.2.1 documentation - - + + @@ -88,19 +88,6 @@ - - Auto light/dark mode - - - - - - - - - - Auto light/dark, in light mode - @@ -179,7 +166,7 @@
    @@ -206,7 +193,7 @@
    - sloyka 0.1.0 documentation + sloyka 0.2.1 documentation @@ -214,15 +201,21 @@
    @@ -206,7 +193,7 @@
    - sloyka 0.1.0 documentation + sloyka 0.2.1 documentation -
    -
    +
    - + \ No newline at end of file diff --git a/docs/build/html/modules/Geocoder_special.html b/docs/build/html/modules/Geocoder_special.html index 9f359fd..2bee8c5 100644 --- a/docs/build/html/modules/Geocoder_special.html +++ b/docs/build/html/modules/Geocoder_special.html @@ -5,11 +5,11 @@ - - Geocoder - sloyka 0.1.0 documentation + + Geocoder - sloyka 0.2.1 documentation - - + + @@ -89,19 +89,6 @@ - - Auto light/dark mode - - - - - - - - - - Auto light/dark, in light mode - @@ -180,7 +167,7 @@
    @@ -207,7 +194,7 @@
    - sloyka 0.1.0 documentation + sloyka 0.2.1 documentation @@ -297,12 +284,13 @@

    Geocoder

    -class sloyka.src.geocoder.geocoder.Geocoder(df, model_path: str = 'Geor111y/flair-ner-addresses-extractor', device: str = 'cpu', osm_id: int = None, city_tags: dict = {'place': ['state']})[source]
    +class sloyka.src.geocoder.geocoder.Geocoder(df, model_path: str = 'Geor111y/flair-ner-addresses-extractor', device: str = 'cpu', osm_id: int | None = None, city_tags: dict = {'place': ['state']}, stemmer_lang: str = 'russian', text_column_name: str = 'text')[source]

    This class provides a functionality of simple geocoder

    assign_street()[source]
    -
    +

    Simple workaround

    +
    @@ -310,11 +298,6 @@

    Function simply creates gdf from the recognised geocoded geometries.

    -
    -
    -dir_path = '/Users/test/Documents/code/sloyka/sloyka/src/geocoder'
    -
    -
    get_df_areas(osm_id, tags)[source]
    @@ -356,11 +339,6 @@ names dictionary (df).

    -
    -
    -global_crs: int = 4326
    -
    -
    match_group_to_area(group_name, df_areas)[source]
    @@ -434,7 +412,7 @@
    -run(df: DataFrame = None, tags: dict | None = None, text_column: str = 'text', group_column: str | None = 'group_name', search_for_objects=False)[source]
    +run(df: DataFrame | None = None, tags: dict | None = None, group_column: str | None = 'group_name', search_for_objects=False)[source]

    Runs the data processing pipeline on the input DataFrame.

    Parameters:
    @@ -469,7 +447,7 @@
    -

    Back to all Geocoding

    +

    Back to all geocoder

    @@ -530,11 +508,9 @@
  • Geocoder
  • -
    +
    - + \ No newline at end of file diff --git a/docs/build/html/modules/OtherGeoObjects.html b/docs/build/html/modules/OtherGeoObjects.html index 30d6e7a..6d03f1e 100644 --- a/docs/build/html/modules/OtherGeoObjects.html +++ b/docs/build/html/modules/OtherGeoObjects.html @@ -5,11 +5,11 @@ - - OtherGeoObjects - sloyka 0.1.0 documentation + + OtherGeoObjects - sloyka 0.2.1 documentation - - + + @@ -89,19 +89,6 @@ - - Auto light/dark mode - - - - - - - - - - Auto light/dark, in light mode - @@ -180,7 +167,7 @@
    @@ -207,7 +194,7 @@
    - sloyka 0.1.0 documentation + sloyka 0.2.1 documentation @@ -295,34 +282,113 @@

    OtherGeoObjects

    +
    +
    +class sloyka.src.geocoder.city_objects_extractor.OtherGeoObjects[source]
    +
    +
    +static calculate_centroid(geometry) Point[source]
    +

    Calculates the centroid for polygons.

    +
    + +
    +
    +static combine_city_obj(df_obj) DataFrame[source]
    +

    Combines the found named urban objects and urban objects whose names are in the form of numbers.

    +
    + +
    +
    +static expand_toponym(df_obj) DataFrame[source]
    +

    Splits the list of found entities into different rows for further analysis.

    +
    + +
    +
    +static extract_geo_obj(text) List[str][source]
    +

    The function extracts location entities from the text, using the Natasha library.

    +
    + +
    +
    +static find_geo_obj_tag(toponym, osm_combined_df) str[source]
    +

    Finds the geo_obj_tag in the OSM array by the name of the city object.

    +
    + +
    +
    +static find_geometry(toponym, osm_combined_df) Point[source]
    +

    Finds the coordinate in the OSM array by the name of the city object.

    +
    + +
    +
    +static find_num_city_obj(text) List[str][source]
    +

    This function searches for urban objects in the text, +the names of which are represented as a number. For example, “school No. 6”.
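A minimal sketch of the idea behind `find_num_city_obj`, for readers of this diff. The real implementation is not visible in this hunk, so everything below is an assumption made for illustration: the name `find_numbered_objects`, the `OBJECT_WORDS` vocabulary, and the regular expression are all hypothetical. Only the `str(text)` and `lower()` normalization mirrors the source shown in the diff.

```python
import re
from typing import List

# Hypothetical vocabulary of object types; the library's own list is not shown in this diff.
OBJECT_WORDS = ("school", "hospital", "kindergarten")

# Object word, optional number marker ("no.", "№", "#"), then the number itself.
PATTERN = re.compile(
    r"\b(%s)\s*(?:no\.?|№|#)?\s*(\d+)\b" % "|".join(OBJECT_WORDS)
)


def find_numbered_objects(text) -> List[str]:
    """Return objects named by a number, e.g. 'school 6' (illustrative only)."""
    # Same normalization as the documented function: coerce to str, then lowercase.
    text = str(text).lower()
    return [f"{word} {number}" for word, number in PATTERN.findall(text)]


found = find_numbered_objects("Near School No. 6 and hospital 12")
```

Here `found` holds one normalized string per match, with the number marker dropped, which is one plausible way to make "school No. 6" and "school 6" comparable later.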

    +
    +
    -static OtherGeoObjects.get_and_process_osm_data(osm_id: int, tags: dict) DataFrame[source]
    +static get_and_process_osm_data(osm_id: int, tags: dict) DataFrame[source]

    Retrieves and processes OSM data for different urban objects.

    +
    +
    +static get_unique_part_types(df)[source]
    +
    + +
    +
    +static restoration_of_normal_form(other_geo_obj, osm_combined_df, threshold=0.7) List[str][source]
    +

    This function compares the extracted location entity with an OSM array +and returns a normalized form if the percentage of similarity is at least 70%.

    +
    + +
    +
    +static run(osm_id: int, df: DataFrame, text_column: str) DataFrame[source]
    +

    Launches the module for extracting urban objects from texts that do not relate to streets.

    +
    +
    -static OtherGeoObjects.run_osm_dfs(osm_id: int) DataFrame[source]
    +static run_osm_dfs(osm_id: int) DataFrame[source]

    Collects dataframes with OSM spatial data, finds centroids and combines them into one.

    +
    +
    -
    -static OtherGeoObjects.calculate_centroid(geometry) Point[source]
    +
    +static OtherGeoObjects.get_and_process_osm_data(osm_id: int, tags: dict) DataFrame[source]
    +

    Retrieves and processes OSM data for different urban objects.

    +
    + +
    +
    +static OtherGeoObjects.run_osm_dfs(osm_id: int) DataFrame[source]
    +

    Collects dataframes with OSM spatial data, finds centroids and combines them into one.

    +
    + +
    +
    +static OtherGeoObjects.calculate_centroid(geometry) Point[source]

    Calculates the centroid for polygons.

    -
    -static OtherGeoObjects.extract_geo_obj(text) List[str][source]
    +
    +static OtherGeoObjects.extract_geo_obj(text) List[str][source]

    The function extracts location entities from the text, using the Natasha library.

    -
    -static OtherGeoObjects.restoration_of_normal_form(other_geo_obj, osm_combined_df, threshold=0.7) List[str][source]
    -

    This function compares the extracted location entity with an OSM array and returns a normalized form if the percentage of similarity is at least 70%.

    +
    +static OtherGeoObjects.restoration_of_normal_form(other_geo_obj, osm_combined_df, threshold=0.7) List[str][source]
    +

    This function compares the extracted location entity with an OSM array +and returns a normalized form if the percentage of similarity is at least 70%.
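The matching rule described above can be sketched as follows. This is an illustration under stated assumptions, not the library's code: it swaps `fuzz.ratio` for the standard library's `difflib.SequenceMatcher` so the snippet runs without extra packages, the helper name `best_osm_match` is hypothetical, and returning `None` below the threshold is an assumption (the diff does not show what the real function does in that case). The digit check follows the source: two names are only compared when the numbers they contain are identical.

```python
import re
from difflib import SequenceMatcher
from typing import List, Optional


def extract_numbers(s: str) -> List[str]:
    # Same helper as in the source: all digit runs, in order.
    return re.findall(r"\d+", s)


def best_osm_match(
    name: str, osm_names: List[str], threshold: float = 0.7
) -> Optional[str]:
    """Return the most similar OSM name, or None if nothing reaches the threshold."""
    best_name, best_score = None, 0.0
    numbers = extract_numbers(name)
    for candidate in osm_names:
        # Names with different numbers get similarity 0, so they are skipped.
        if extract_numbers(candidate) != numbers:
            continue
        # Stand-in for fuzz.ratio(...) / 100; already scaled to the 0..1 range.
        score = SequenceMatcher(None, name, candidate).ratio()
        if score > best_score:
            best_name, best_score = candidate, score
    return best_name if best_score >= threshold else None


match = best_osm_match("school 6", ["school no. 6", "school no. 7"])
```

The number check is what keeps "school 6" from being normalized to "school no. 7", even though the two strings are textually almost as close as the correct match.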

    @@ -382,11 +448,26 @@ -
    +
    - + \ No newline at end of file diff --git a/docs/build/html/modules/StreetExtractor.html b/docs/build/html/modules/StreetExtractor.html index 2b2c22a..af7c170 100644 --- a/docs/build/html/modules/StreetExtractor.html +++ b/docs/build/html/modules/StreetExtractor.html @@ -5,11 +5,11 @@ - - StreetExtractor - sloyka 0.1.0 documentation + + StreetExtractor - sloyka 0.2.1 documentation - - + + @@ -89,19 +89,6 @@ - - Auto light/dark mode - - - - - - - - - - Auto light/dark, in light mode - @@ -180,7 +167,7 @@
    @@ -207,7 +194,7 @@
    - sloyka 0.1.0 documentation + sloyka 0.2.1 documentation @@ -430,9 +417,9 @@
    -
    +
    - + \ No newline at end of file diff --git a/docs/build/html/modules/Streets.html b/docs/build/html/modules/Streets.html index 9b7e525..4bde4ae 100644 --- a/docs/build/html/modules/Streets.html +++ b/docs/build/html/modules/Streets.html @@ -5,11 +5,11 @@ - - Streets - sloyka 0.1.0 documentation + + Streets - sloyka 0.2.1 documentation - - + + @@ -89,19 +89,6 @@
    - - Auto light/dark mode - - - - - - - - - - Auto light/dark, in light mode - @@ -180,7 +167,7 @@
    @@ -192,7 +179,7 @@
    -
    - sloyka 0.1.0 documentation + sloyka 0.2.1 documentation @@ -287,7 +274,7 @@ -
    - - Auto light/dark mode - - - - - - - - - - Auto light/dark, in light mode - @@ -180,7 +167,7 @@
    @@ -207,7 +194,7 @@
    - sloyka 0.1.0 documentation + sloyka 0.2.1 documentation @@ -303,7 +290,7 @@ category using multiple models.

    -class sloyka.src.risks.emotion_classifier.EmotionRecognizer(model_name=Text.Bert_Large, device=None)[source]
    +class sloyka.src.risks.emotion_classifier.EmotionRecognizer(model_name=Text.Bert_Tiny, device='cpu')[source]

    This class is designed to categorise input texts into emotion categories.

    Attributes:

    @@ -417,9 +404,9 @@

    Example

    -
    + - + \ No newline at end of file diff --git a/docs/build/html/modules/event_dynamic_prediction.html b/docs/build/html/modules/event_dynamic_prediction.html index 95b1c25..1387f5a 100644 --- a/docs/build/html/modules/event_dynamic_prediction.html +++ b/docs/build/html/modules/event_dynamic_prediction.html @@ -5,11 +5,11 @@ - - Event detector - sloyka 0.1.0 documentation + + Event detector - sloyka 0.2.1 documentation - - + + @@ -89,19 +89,6 @@
    - - Auto light/dark mode - - - - - - - - - - Auto light/dark, in light mode - @@ -180,7 +167,7 @@
    @@ -207,7 +194,7 @@
    - sloyka 0.1.0 documentation + sloyka 0.2.1 documentation @@ -393,9 +380,9 @@
    - + - + \ No newline at end of file diff --git a/docs/build/html/modules/geocoder.html b/docs/build/html/modules/geocoder.html index f27980b..6c7c2f1 100644 --- a/docs/build/html/modules/geocoder.html +++ b/docs/build/html/modules/geocoder.html @@ -5,11 +5,11 @@ - - Geocoding - sloyka 0.1.0 documentation + + Geocoding - sloyka 0.2.1 documentation - - + + @@ -89,19 +89,6 @@ - - Auto light/dark mode - - - - - - - - - - Auto light/dark, in light mode - @@ -180,7 +167,7 @@
    @@ -207,7 +194,7 @@
    - sloyka 0.1.0 documentation + sloyka 0.2.1 documentation @@ -293,19 +280,19 @@
    -
    -

    Geocoding

    -

    To perform classification with generalized linear models, see -Geocoder.

    +
    +
    +

    Geocoding

    +

    To perform classification with generalized linear models, see Geocoder.

    Geocoder

    -class sloyka.src.geocoder.geocoder.Geocoder(df, model_path: str = 'Geor111y/flair-ner-addresses-extractor', device: str = 'cpu', osm_id: int = None, city_tags: dict = {'place': ['state']})[source]
    +class sloyka.src.geocoder.geocoder.Geocoder(df, model_path: str = 'Geor111y/flair-ner-addresses-extractor', device: str = 'cpu', osm_id: int | None = None, city_tags: dict = {'place': ['state']}, stemmer_lang: str = 'russian', text_column_name: str = 'text')[source]

    This class provides a functionality of simple geocoder

    -run(df: DataFrame = None, tags: dict | None = None, text_column: str = 'text', group_column: str | None = 'group_name', search_for_objects=False)[source]
    +run(df: DataFrame | None = None, tags: dict | None = None, group_column: str | None = 'group_name', search_for_objects=False)[source]

    Runs the data processing pipeline on the input DataFrame.

    Parameters:
    @@ -453,9 +440,9 @@

    more:< - + - + \ No newline at end of file diff --git a/docs/build/html/modules/regional_activity.html b/docs/build/html/modules/regional_activity.html index 25a80b0..a2a1e1c 100644 --- a/docs/build/html/modules/regional_activity.html +++ b/docs/build/html/modules/regional_activity.html @@ -5,11 +5,11 @@ - - Regional activity - sloyka 0.1.0 documentation + + Regional activity - sloyka 0.2.1 documentation - - + + @@ -89,19 +89,6 @@ - - Auto light/dark mode - - - - - - - - - - Auto light/dark, in light mode - @@ -180,7 +167,7 @@
    @@ -207,7 +194,7 @@
    - sloyka 0.1.0 documentation + sloyka 0.2.1 documentation @@ -350,9 +337,9 @@
    - + - + \ No newline at end of file diff --git a/docs/build/html/modules/semantic_graph.html b/docs/build/html/modules/semantic_graph.html index d8382c5..cef9a05 100644 --- a/docs/build/html/modules/semantic_graph.html +++ b/docs/build/html/modules/semantic_graph.html @@ -5,11 +5,11 @@ - - Semantic graph - sloyka 0.1.0 documentation + + Semantic graph - sloyka 0.2.1 documentation - - + + @@ -89,19 +89,6 @@
    - - Auto light/dark mode - - - - - - - - - - Auto light/dark, in light mode - @@ -180,7 +167,7 @@
    @@ -207,7 +194,7 @@
    - sloyka 0.1.0 documentation + sloyka 0.2.1 documentation @@ -472,9 +459,9 @@
    - + - + \ No newline at end of file diff --git a/docs/build/html/modules/text_classifier.html b/docs/build/html/modules/text_classifier.html index 1b80282..b1e9e08 100644 --- a/docs/build/html/modules/text_classifier.html +++ b/docs/build/html/modules/text_classifier.html @@ -5,11 +5,11 @@ - - Text Classifier - sloyka 0.1.0 documentation + + Text Classifier - sloyka 0.2.1 documentation - - + + @@ -89,19 +89,6 @@
    - - Auto light/dark mode - - - - - - - - - - Auto light/dark, in light mode - @@ -180,7 +167,7 @@
    @@ -207,7 +194,7 @@
    - sloyka 0.1.0 documentation + sloyka 0.2.1 documentation @@ -416,9 +403,9 @@
    - + - + \ No newline at end of file diff --git a/docs/build/html/modules/topic_modeler.html b/docs/build/html/modules/topic_modeler.html index acad63c..9164883 100644 --- a/docs/build/html/modules/topic_modeler.html +++ b/docs/build/html/modules/topic_modeler.html @@ -5,11 +5,11 @@ - - Topic Modelling - sloyka 0.1.0 documentation + + Topic Modelling - sloyka 0.2.1 documentation - - + + @@ -89,19 +89,6 @@
    - - Auto light/dark mode - - - - - - - - - - Auto light/dark, in light mode - @@ -180,7 +167,7 @@
    @@ -207,7 +194,7 @@
    - sloyka 0.1.0 documentation + sloyka 0.2.1 documentation @@ -356,9 +343,9 @@
    - + - + \ No newline at end of file diff --git a/docs/build/html/modules/visualize_graph.html b/docs/build/html/modules/visualize_graph.html index 649e3ef..4103e68 100644 --- a/docs/build/html/modules/visualize_graph.html +++ b/docs/build/html/modules/visualize_graph.html @@ -5,11 +5,11 @@ - - Graph visualization - sloyka 0.1.0 documentation + + Graph visualization - sloyka 0.2.1 documentation - - + + @@ -89,19 +89,6 @@
    - - Auto light/dark mode - - - - - - - - - - Auto light/dark, in light mode - @@ -180,7 +167,7 @@
    @@ -207,7 +194,7 @@
    - sloyka 0.1.0 documentation + sloyka 0.2.1 documentation @@ -298,15 +285,12 @@

    The function results in an interactive map, which will display graph nodes grouped by clusters. Each cluster will have its own color and name. Node markers will have a random offset to avoid overlapping.

    -
    -sloyka.src.visual.graph_visualization.draw_graph_folium(graph_path, output_file=None)[source]
    +
    +sloyka.src.visual.graph_visualization.visualize_graph(G, zoom: int)[source]

    Visualizes a graph from the given graph_path using Folium and MarkerCluster.

    Parameters:
    -
      -
    • graph_path (str) – The path to the graphml file.

    • -
    • output_file (str, optional) – The file to save the visualization to. Defaults to None.

    • -
    +

    output_file (str, optional) – The file to save the visualization to. Defaults to None.

    Returns:

    The folium map object representing the visualized graph.

    @@ -374,7 +358,7 @@ -
    + - + \ No newline at end of file diff --git a/docs/build/html/objects.inv b/docs/build/html/objects.inv index 03cef36..25f8306 100644 Binary files a/docs/build/html/objects.inv and b/docs/build/html/objects.inv differ diff --git a/docs/build/html/py-modindex.html b/docs/build/html/py-modindex.html index faa2657..5980b36 100644 --- a/docs/build/html/py-modindex.html +++ b/docs/build/html/py-modindex.html @@ -4,10 +4,10 @@ - Python Module Index - sloyka 0.1.0 documentation + Python Module Index - sloyka 0.2.1 documentation - - + + @@ -87,19 +87,6 @@
    - - Auto light/dark mode - - - - - - - - - - Auto light/dark, in light mode - @@ -178,7 +165,7 @@
    @@ -205,7 +192,7 @@
    - sloyka 0.1.0 documentation + sloyka 0.2.1 documentation @@ -372,9 +359,9 @@

    Python Module Index

    - + - + \ No newline at end of file diff --git a/docs/build/html/search.html b/docs/build/html/search.html index 48982df..69c81d3 100644 --- a/docs/build/html/search.html +++ b/docs/build/html/search.html @@ -5,11 +5,11 @@ - + -Search - sloyka 0.1.0 documentation - - +Search - sloyka 0.2.1 documentation + + @@ -89,19 +89,6 @@
    - - Auto light/dark mode - - - - - - - - - - Auto light/dark, in light mode - @@ -180,7 +167,7 @@
    @@ -207,7 +194,7 @@
    - sloyka 0.1.0 documentation + sloyka 0.2.1 documentation @@ -329,10 +316,10 @@
    - + - + diff --git a/docs/build/html/searchindex.js b/docs/build/html/searchindex.js index d8b071a..0ea7dd5 100644 --- a/docs/build/html/searchindex.js +++ b/docs/build/html/searchindex.js @@ -1 +1 @@ -Search.setIndex({"alltitles": {"Advanced geocoding": [[12, null], [15, null]], "Data getter": [[12, "data-getter"]], "Emotion classifier": [[13, "module-sloyka.src.risks.emotion_classifier"]], "Event detector": [[14, "module-sloyka.src.risks.event_detector"]], "Example": [[13, "example"]], "General:": [[4, null]], "GeoDataGetter": [[5, "geodatagetter"]], "Geocoder": [[6, "geocoder"], [15, "id1"]], "Geocoding": [[15, "geocoding"]], "Graph visualization": [[20, "graph-visualization"]], "Indices and tables": [[4, "indices-and-tables"]], "Installation": [[1, "installation"]], "Introduction": [[2, "introduction"]], "Main features": [[2, "main-features"]], "Main pipeline": [[3, "main-pipeline"]], "OtherGeoObjects": [[7, "othergeoobjects"], [15, "othergeoobjects"]], "Page Not Found": [[0, "page-not-found"]], "Regional activity": [[16, "regional-activity"]], "Semantic graph": [[17, "module-sloyka.src.semantic_graph.semantic_graph_builder"]], "Services extraction": [[11, "services-extraction"]], "StreetExtractor": [[8, "streetextractor"], [15, "streetextractor"]], "Streets": [[9, "streets"]], "Text Classifier": [[18, "text-classifier"]], "Topic Modelling": [[19, "topic-modelling"]], "VKparser": [[10, "vkparser"]], "Welcome to Sloyka\u2019s documentation!": [[4, "welcome-to-sloyka-s-documentation"]], "more:": [[12, "more"], [15, "more"]]}, "docnames": ["404", "about/installation", "about/introduction", "about/pipeline", "index", "modules/GeoDataGetter", "modules/Geocoder_special", "modules/OtherGeoObjects", "modules/StreetExtractor", "modules/Streets", "modules/VKParser", "modules/city_services_extract", "modules/data_getter", "modules/emotion_classifier", "modules/event_dynamic_prediction", "modules/geocoder", "modules/regional_activity", "modules/semantic_graph", 
"modules/text_classifier", "modules/topic_modeler", "modules/visualize_graph"], "envversion": {"sphinx": 61, "sphinx.domains.c": 3, "sphinx.domains.changeset": 1, "sphinx.domains.citation": 1, "sphinx.domains.cpp": 9, "sphinx.domains.index": 1, "sphinx.domains.javascript": 3, "sphinx.domains.math": 2, "sphinx.domains.python": 4, "sphinx.domains.rst": 2, "sphinx.domains.std": 2, "sphinx.ext.viewcode": 1}, "filenames": ["404.rst", "about/installation.rst", "about/introduction.rst", "about/pipeline.rst", "index.rst", "modules/GeoDataGetter.rst", "modules/Geocoder_special.rst", "modules/OtherGeoObjects.rst", "modules/StreetExtractor.rst", "modules/Streets.rst", "modules/VKParser.rst", "modules/city_services_extract.rst", "modules/data_getter.rst", "modules/emotion_classifier.rst", "modules/event_dynamic_prediction.rst", "modules/geocoder.rst", "modules/regional_activity.rst", "modules/semantic_graph.rst", "modules/text_classifier.rst", "modules/topic_modeler.rst", "modules/visualize_graph.rst"], "indexentries": {"assign_street() (sloyka.src.geocoder.geocoder.geocoder method)": [[6, "sloyka.src.geocoder.geocoder.Geocoder.assign_street", false]], "build_graph() (sloyka.src.semantic_graph.semantic_graph_builder.semgraph method)": [[17, "sloyka.src.semantic_graph.semantic_graph_builder.Semgraph.build_graph", false]], "calculate_centroid() (sloyka.src.geocoder.city_objects_extractor.othergeoobjects static method)": [[7, "sloyka.src.geocoder.city_objects_extractor.OtherGeoObjects.calculate_centroid", false]], "city_services (class in sloyka.src.utils.data_processing.city_services_extract)": [[11, "sloyka.src.utils.data_processing.city_services_extract.City_services", false]], "classify_text() (sloyka.src.risks.text_classifier.textclassifiers method)": [[18, "sloyka.src.risks.text_classifier.TextClassifiers.classify_text", false]], "convert_df_to_edge_df() (sloyka.src.semantic_graph.semantic_graph_builder.semgraph static method)": [[17, 
"sloyka.src.semantic_graph.semantic_graph_builder.Semgraph.convert_df_to_edge_df", false]], "create_gdf() (sloyka.src.geocoder.geocoder.geocoder method)": [[6, "sloyka.src.geocoder.geocoder.Geocoder.create_gdf", false]], "dir_path (sloyka.src.geocoder.geocoder.geocoder attribute)": [[6, "sloyka.src.geocoder.geocoder.Geocoder.dir_path", false]], "draw_graph_folium() (in module sloyka.src.visual.graph_visualization)": [[20, "sloyka.src.visual.graph_visualization.draw_graph_folium", false]], "emotionrecognizer (class in sloyka.src.risks.emotion_classifier)": [[13, "sloyka.src.risks.emotion_classifier.EmotionRecognizer", false]], "eventdetection (class in sloyka.src.risks.event_detector)": [[14, "sloyka.src.risks.event_detector.EventDetection", false]], "extract_geo_obj() (sloyka.src.geocoder.city_objects_extractor.othergeoobjects static method)": [[7, "sloyka.src.geocoder.city_objects_extractor.OtherGeoObjects.extract_geo_obj", false]], "extract_ner_street() (sloyka.src.geocoder.street_extractor.streetextractor static method)": [[8, "sloyka.src.geocoder.street_extractor.StreetExtractor.extract_ner_street", false]], "extract_toponym() (sloyka.src.geocoder.street_extractor.streetextractor static method)": [[8, "sloyka.src.geocoder.street_extractor.StreetExtractor.extract_toponym", false]], "extraction_services() (sloyka.src.utils.data_processing.city_services_extract.city_services method)": [[11, "sloyka.src.utils.data_processing.city_services_extract.City_services.extraction_services", false]], "extractor (sloyka.src.geocoder.street_extractor.streetextractor attribute)": [[8, "sloyka.src.geocoder.street_extractor.StreetExtractor.extractor", false]], "geocoder (class in sloyka.src.geocoder.geocoder)": [[6, "sloyka.src.geocoder.geocoder.Geocoder", false], [15, "sloyka.src.geocoder.geocoder.Geocoder", false]], "get_and_process_osm_data() (sloyka.src.geocoder.city_objects_extractor.othergeoobjects static method)": [[7, 
"sloyka.src.geocoder.city_objects_extractor.OtherGeoObjects.get_and_process_osm_data", false]], "get_df_areas() (sloyka.src.geocoder.geocoder.geocoder method)": [[6, "sloyka.src.geocoder.geocoder.Geocoder.get_df_areas", false]], "get_level() (sloyka.src.geocoder.geocoder.geocoder static method)": [[6, "sloyka.src.geocoder.geocoder.Geocoder.get_level", false]], "get_stem() (sloyka.src.geocoder.geocoder.geocoder static method)": [[6, "sloyka.src.geocoder.geocoder.Geocoder.get_stem", false]], "global_crs (sloyka.src.geocoder.geocoder.geocoder attribute)": [[6, "sloyka.src.geocoder.geocoder.Geocoder.global_crs", false]], "initialize_classifier() (sloyka.src.risks.text_classifier.textclassifiers method)": [[18, "sloyka.src.risks.text_classifier.TextClassifiers.initialize_classifier", false]], "match_group_to_area() (sloyka.src.geocoder.geocoder.geocoder method)": [[6, "sloyka.src.geocoder.geocoder.Geocoder.match_group_to_area", false]], "merge_to_initial_df() (sloyka.src.geocoder.geocoder.geocoder method)": [[6, "sloyka.src.geocoder.geocoder.Geocoder.merge_to_initial_df", false]], "module": [[11, "module-sloyka.src.utils.data_processing.city_services_extract", false], [13, "module-sloyka.src.risks.emotion_classifier", false], [14, "module-sloyka.src.risks.event_detector", false], [17, "module-sloyka.src.semantic_graph.semantic_graph_builder", false], [18, "module-sloyka.src.risks.text_classifier", false], [20, "module-sloyka.src.visual.graph_visualization", false]], "othergeoobjects (class in sloyka.src.geocoder.city_objects_extractor)": [[15, "sloyka.src.geocoder.city_objects_extractor.OtherGeoObjects", false]], "preprocess_area_names() (sloyka.src.geocoder.geocoder.geocoder method)": [[6, "sloyka.src.geocoder.geocoder.Geocoder.preprocess_area_names", false]], "preprocess_group_name() (sloyka.src.geocoder.geocoder.geocoder method)": [[6, "sloyka.src.geocoder.geocoder.Geocoder.preprocess_group_name", false]], "process_pipeline() 
(sloyka.src.geocoder.street_extractor.streetextractor static method)": [[8, "sloyka.src.geocoder.street_extractor.StreetExtractor.process_pipeline", false]], "recognize_average_emotion_from_multiple_models() (sloyka.src.risks.emotion_classifier.emotionrecognizer method)": [[13, "sloyka.src.risks.emotion_classifier.EmotionRecognizer.recognize_average_emotion_from_multiple_models", false]], "recognize_emotion() (sloyka.src.risks.emotion_classifier.emotionrecognizer method)": [[13, "sloyka.src.risks.emotion_classifier.EmotionRecognizer.recognize_emotion", false]], "remove_last_letter() (sloyka.src.utils.data_processing.city_services_extract.city_services method)": [[11, "sloyka.src.utils.data_processing.city_services_extract.City_services.remove_last_letter", false]], "replace_with_most_similar() (sloyka.src.utils.data_processing.city_services_extract.city_services method)": [[11, "sloyka.src.utils.data_processing.city_services_extract.City_services.replace_with_most_similar", false]], "restoration_of_normal_form() (sloyka.src.geocoder.city_objects_extractor.othergeoobjects static method)": [[7, "sloyka.src.geocoder.city_objects_extractor.OtherGeoObjects.restoration_of_normal_form", false]], "run() (sloyka.src.geocoder.city_objects_extractor.othergeoobjects static method)": [[15, "sloyka.src.geocoder.city_objects_extractor.OtherGeoObjects.run", false]], "run() (sloyka.src.geocoder.geocoder.geocoder method)": [[6, "sloyka.src.geocoder.geocoder.Geocoder.run", false], [15, "sloyka.src.geocoder.geocoder.Geocoder.run", false]], "run() (sloyka.src.risks.event_detector.eventdetection method)": [[14, "sloyka.src.risks.event_detector.EventDetection.run", false]], "run() (sloyka.src.utils.data_processing.city_services_extract.city_services method)": [[11, "sloyka.src.utils.data_processing.city_services_extract.City_services.run", false]], "run_osm_dfs() (sloyka.src.geocoder.city_objects_extractor.othergeoobjects static method)": [[7, 
"sloyka.src.geocoder.city_objects_extractor.OtherGeoObjects.run_osm_dfs", false]], "run_text_classifier() (sloyka.src.risks.text_classifier.textclassifiers method)": [[18, "sloyka.src.risks.text_classifier.TextClassifiers.run_text_classifier", false]], "run_text_classifier_topics() (sloyka.src.risks.text_classifier.textclassifiers method)": [[18, "sloyka.src.risks.text_classifier.TextClassifiers.run_text_classifier_topics", false]], "semgraph (class in sloyka.src.semantic_graph.semantic_graph_builder)": [[17, "sloyka.src.semantic_graph.semantic_graph_builder.Semgraph", false]], "set_global_repr_point() (sloyka.src.geocoder.geocoder.geocoder method)": [[6, "sloyka.src.geocoder.geocoder.Geocoder.set_global_repr_point", false]], "sloyka.src.risks.emotion_classifier": [[13, "module-sloyka.src.risks.emotion_classifier", false]], "sloyka.src.risks.event_detector": [[14, "module-sloyka.src.risks.event_detector", false]], "sloyka.src.risks.text_classifier": [[18, "module-sloyka.src.risks.text_classifier", false]], "sloyka.src.semantic_graph.semantic_graph_builder": [[17, "module-sloyka.src.semantic_graph.semantic_graph_builder", false]], "sloyka.src.utils.data_processing.city_services_extract": [[11, "module-sloyka.src.utils.data_processing.city_services_extract", false]], "sloyka.src.visual.graph_visualization": [[20, "module-sloyka.src.visual.graph_visualization", false]], "streetextractor (class in sloyka.src.geocoder.street_extractor)": [[8, "sloyka.src.geocoder.street_extractor.StreetExtractor", false], [15, "sloyka.src.geocoder.street_extractor.StreetExtractor", false]], "textclassifiers (class in sloyka.src.risks.text_classifier)": [[18, "sloyka.src.risks.text_classifier.TextClassifiers", false]], "update_graph() (sloyka.src.semantic_graph.semantic_graph_builder.semgraph method)": [[17, "sloyka.src.semantic_graph.semantic_graph_builder.Semgraph.update_graph", false]]}, "objects": {"sloyka.src.geocoder.city_objects_extractor": [[15, 0, 1, "", "OtherGeoObjects"]], 
"sloyka.src.geocoder.city_objects_extractor.OtherGeoObjects": [[7, 1, 1, "", "calculate_centroid"], [7, 1, 1, "", "extract_geo_obj"], [7, 1, 1, "", "get_and_process_osm_data"], [7, 1, 1, "", "restoration_of_normal_form"], [15, 1, 1, "", "run"], [7, 1, 1, "", "run_osm_dfs"]], "sloyka.src.geocoder.geocoder": [[15, 0, 1, "", "Geocoder"]], "sloyka.src.geocoder.geocoder.Geocoder": [[6, 1, 1, "", "assign_street"], [6, 1, 1, "", "create_gdf"], [6, 2, 1, "", "dir_path"], [6, 1, 1, "", "get_df_areas"], [6, 1, 1, "", "get_level"], [6, 1, 1, "", "get_stem"], [6, 2, 1, "", "global_crs"], [6, 1, 1, "", "match_group_to_area"], [6, 1, 1, "", "merge_to_initial_df"], [6, 1, 1, "", "preprocess_area_names"], [6, 1, 1, "", "preprocess_group_name"], [15, 1, 1, "", "run"], [6, 1, 1, "", "set_global_repr_point"]], "sloyka.src.geocoder.street_extractor": [[15, 0, 1, "", "StreetExtractor"]], "sloyka.src.geocoder.street_extractor.StreetExtractor": [[8, 1, 1, "", "extract_ner_street"], [8, 1, 1, "", "extract_toponym"], [8, 2, 1, "", "extractor"], [8, 1, 1, "", "process_pipeline"]], "sloyka.src.risks": [[13, 3, 0, "-", "emotion_classifier"], [14, 3, 0, "-", "event_detector"], [18, 3, 0, "-", "text_classifier"]], "sloyka.src.risks.emotion_classifier": [[13, 0, 1, "", "EmotionRecognizer"]], "sloyka.src.risks.emotion_classifier.EmotionRecognizer": [[13, 1, 1, "", "recognize_average_emotion_from_multiple_models"], [13, 1, 1, "", "recognize_emotion"]], "sloyka.src.risks.event_detector": [[14, 0, 1, "", "EventDetection"]], "sloyka.src.risks.event_detector.EventDetection": [[14, 1, 1, "", "run"]], "sloyka.src.risks.text_classifier": [[18, 0, 1, "", "TextClassifiers"]], "sloyka.src.risks.text_classifier.TextClassifiers": [[18, 1, 1, "", "classify_text"], [18, 1, 1, "", "initialize_classifier"], [18, 1, 1, "", "run_text_classifier"], [18, 1, 1, "", "run_text_classifier_topics"]], "sloyka.src.semantic_graph": [[17, 3, 0, "-", "semantic_graph_builder"]], 
"sloyka.src.semantic_graph.semantic_graph_builder": [[17, 0, 1, "", "Semgraph"]], "sloyka.src.semantic_graph.semantic_graph_builder.Semgraph": [[17, 1, 1, "", "build_graph"], [17, 1, 1, "", "convert_df_to_edge_df"], [17, 1, 1, "", "update_graph"]], "sloyka.src.utils.data_processing": [[11, 3, 0, "-", "city_services_extract"]], "sloyka.src.utils.data_processing.city_services_extract": [[11, 0, 1, "", "City_services"]], "sloyka.src.utils.data_processing.city_services_extract.City_services": [[11, 1, 1, "", "extraction_services"], [11, 1, 1, "", "remove_last_letter"], [11, 1, 1, "", "replace_with_most_similar"], [11, 1, 1, "", "run"]], "sloyka.src.visual": [[20, 3, 0, "-", "graph_visualization"]], "sloyka.src.visual.graph_visualization": [[20, 4, 1, "", "draw_graph_folium"]]}, "objnames": {"0": ["py", "class", "Python class"], "1": ["py", "method", "Python method"], "2": ["py", "attribute", "Python attribute"], "3": ["py", "module", "Python module"], "4": ["py", "function", "Python function"]}, "objtypes": {"0": "py:class", "1": "py:method", "2": "py:attribute", "3": "py:module", "4": "py:function"}, "terms": {"": [2, 3], "0": [2, 7, 17], "000": 18, "1": [6, 17, 18, 19], "100": [], "131": [], "15": 19, "2": 6, "2024": 2, "3": 6, "4326": 6, "5": [], "50": [], "6": [2, 17], "7": 7, "70": 7, "75": 17, "90": 18, "A": [6, 8, 9, 12, 17], "And": [], "As": 17, "By": 3, "For": 17, "If": 6, "In": 18, "It": [6, 13, 14, 15, 17, 18, 19], "Its": 2, "Not": [], "The": [2, 3, 6, 7, 8, 11, 13, 14, 15, 16, 17, 18, 19, 20], "To": [1, 15], "_collect_popul": 14, "_geocoder_speci": [], "_get_build": 14, "_get_road": 14, "_preprocess": 14, "about": [3, 16, 19], "access": 18, "access_token": [], "accord": 6, "account": 11, "accsess_token": [], "across": 3, "activ": [2, 3], "actual": 6, "ad": 13, "add": 6, "addit": [2, 19], "address": [6, 8, 15], "admin": [6, 15], "after": [2, 6, 15, 16, 17], "against": 6, "aggreg": [16, 17], "aggregate_data": 17, "aim": [2, 14, 17, 18, 19], "algorithm": 11, 
"all": [3, 6], "alreadi": [2, 3, 6], "also": [2, 6, 15], "amen": 18, "an": [6, 7, 13, 14, 15, 20], "analyz": [4, 13], "ani": [3, 6, 13, 17, 19], "api": [], "api_verison": [], "appdata": [], "appli": [3, 8, 13], "applic": [14, 17], "ar": [3, 6, 15, 17], "area": [3, 6, 15], "area_nam": 6, "area_name_process": 6, "area_stem": 6, "around": 8, "arrai": 7, "assess": [2, 3], "assign": [6, 15], "assign_street": 6, "attribut": [6, 13, 16, 17, 18], "augment": 2, "authent": [], "autoclass": 9, "autom": [], "automat": 13, "automodul": [], "avail": 13, "averag": 13, "avoid": 20, "back": [6, 13], "base": [5, 12, 14, 16, 17, 18], "basic": 16, "being": 18, "belong": 19, "below": 8, "bert": [17, 18], "bert_larg": 13, "bert_nam": 17, "bertop": [14, 19], "best": [6, 15], "between": [14, 17], "bool": 17, "box": 0, "build": [2, 14, 17], "build_graph": 17, "c": [], "cach": 6, "calcul": [7, 13], "calculate_centroid": 7, "call": [16, 18], "can": [2, 3, 13, 16], "care": 18, "carri": 3, "case": [3, 17, 18], "categor": 2, "categori": [13, 18], "categoris": 13, "centroid": [6, 7], "certain": 3, "chang": 11, "charact": 6, "character": 3, "chat": 2, "check": 6, "child": 17, "citi": [2, 4, 6, 11, 14, 18, 19], "citizen": 2, "city_cr": 14, "city_nam": 14, "city_objects_extractor": 15, "city_servic": 11, "city_services_extract": [11, 16], "city_tag": [6, 15], "class": [5, 6, 8, 9, 11, 12, 13, 14, 15, 16, 17, 18, 19], "classif": [15, 18], "classifi": [2, 3, 8, 16], "classify_text": 18, "clean": 17, "clean_from_digit": 17, "clean_from_dubl": 17, "clean_from_toponym": 17, "cluster": [14, 19, 20], "code": 6, "cointegr": 19, "collect": [2, 7, 14], "color": 20, "column": [2, 3, 6, 11, 13, 15, 17], "combin": 7, "comment": 2, "comments_to_datafram": [], "commun": [2, 3], "compar": 7, "comparison": 11, "compat": 13, "comprehens": [], "confid": 8, "connect": 14, "consist": 3, "construct": [2, 17], "contain": [6, 8, 12, 13, 14, 17, 18], "content": 8, "context": 14, "conveni": 17, "convert": 6, 
"convert_df_to_edge_df": 17, "convini": 17, "correctli": 18, "correspond": [6, 15], "cosin": 17, "couldn": 0, "count": [17, 18], "count_item": [], "counts_attribut": 17, "cpu": [6, 13, 15, 17], "creat": [6, 15, 16, 17, 19], "create_gdf": 6, "csv": 13, "cuda": 13, "custom": 8, "cutoff": [], "cutoff_d": [], "d": [], "dai": 19, "data": [2, 3, 4, 5, 6, 7, 9, 13, 14, 15, 16, 17, 19], "data_gett": 9, "data_process": 11, "datafram": [6, 7, 8, 11, 13, 15, 16, 17], "dataset": 3, "date": [2, 6, 15, 19], "deeppavlov": 17, "default": [6, 13, 15, 17, 20], "design": [11, 13, 16, 18, 19], "detector": [], "determin": 17, "devic": [6, 13, 15, 17, 18], "device_typ": 18, "df": [6, 8, 11, 13, 15], "df_area": 6, "dict": [6, 7, 15], "dictionari": 6, "didn": 6, "differ": [6, 7], "digit": [2, 17], "dir_path": 6, "direct": 17, "displai": 20, "distanc": 17, "divid": 3, "do": 15, "document": [2, 6], "domain": [], "don": 6, "done": 19, "draw_graph_folium": 20, "duplic": 17, "dure": 13, "dynam": 19, "each": [3, 6, 14, 15, 17, 19, 20], "effici": 18, "element": [2, 6], "embed": [17, 19], "embedding_model_nam": 19, "emot": [2, 3], "emotion_classifi": [13, 16], "emotionclassifi": 13, "emotionrecogn": 13, "empti": [], "enabl": 13, "end": 11, "end_dat": 19, "enrich": 2, "entiti": [2, 7, 11], "environ": 2, "evalu": 2, "event": [2, 18], "event_detector": 14, "eventdetect": 14, "exampl": 18, "exist": 2, "extract": [2, 3, 7, 8, 15, 17, 19], "extract_building_num": [], "extract_geo_obj": 7, "extract_ner_street": 8, "extract_toponym": 8, "extraction_servic": 11, "extractor": [6, 8, 15], "fail": 8, "fall": 13, "fals": [6, 15, 18], "famili": 18, "featur": [], "fetch": [], "file": 20, "filepath_to_popul": 14, "filter": [6, 15, 17], "final": [2, 6, 15, 17], "find": [0, 6, 7, 15], "finder": [], "first": [6, 8], "flag": 17, "flair": [6, 11, 15], "float": 17, "folium": 20, "follow": [13, 14, 17, 18, 19], "footprint": 2, "forecast": 3, "form": [6, 7, 15], "format": [], "found": [6, 8], "from": [2, 5, 6, 7, 8, 11, 
12, 15, 17, 19, 20], "function": [3, 6, 7, 8, 15, 16, 17, 18, 19, 20], "further": [2, 3], "g": 17, "gdf": [6, 19], "gener": [2, 3, 14, 15], "geo": [], "geocod": [3, 8, 16, 17], "geodatafram": [2, 6, 14, 15, 16, 17, 19], "geodatagett": 12, "geograph": [6, 15], "geoloc": [3, 17], "geometri": [6, 7, 17], "geometry_column": 17, "geor111i": [6, 15], "geospati": [5, 12], "get": [2, 3, 14], "get_and_process_osm_data": 7, "get_com": [], "get_df_area": 6, "get_group_nam": [], "get_group_post_id": [], "get_level": 6, "get_owner_id_by_domain": [], "get_risk": 16, "get_stem": 6, "get_subcom": [], "getter": 3, "given": [5, 6, 12, 13, 15, 17, 20], "global": [6, 19], "global_cr": 6, "go": 0, "gpd": [6, 15, 17], "gpu": 13, "graph": [2, 3], "graph_path": 20, "graph_visu": 20, "graphml": 20, "group": [3, 6, 15, 20], "group_column": [6, 15], "group_nam": [6, 15], "ha": [13, 14, 17, 18, 19], "have": 20, "health": 18, "histgeodatagett": 6, "hold": 13, "homepag": 0, "hous": [6, 18], "howev": 3, "http": 2, "huggingfac": [13, 18], "huggingfacemodel": 13, "hypergraph": 2, "i": [2, 3, 4, 5, 6, 7, 8, 11, 12, 13, 14, 16, 17, 18, 19], "id": [5, 6, 12, 15, 18], "id_column": 17, "ident": 4, "identifi": [8, 17, 18, 19], "import": 3, "includ": 16, "incom": 19, "index": 4, "indic": 17, "inf": [], "infer": [13, 17], "infin": [], "info": [3, 6], "inform": [16, 17], "initi": [6, 16, 18], "initial_df": 6, "initialize_classifi": 18, "input": [6, 8, 13, 15, 17, 18], "instal": 4, "instanti": 13, "int": [6, 7, 14, 15, 17, 18], "interact": 20, "interpret": 2, "introduct": 4, "is_top": 18, "its": 20, "june": 2, "keep": 6, "kei": 17, "key_score_filt": 17, "keybert": 17, "keyword": 17, "know": 6, "label": 3, "languag": 17, "last": 3, "launch": 15, "least": 7, "level": [6, 15, 18], "librari": [2, 3, 4, 7, 11, 17], "like": 13, "limit": 3, "linear": 15, "link": [2, 14], "list": [3, 7, 11], "locat": [2, 7, 17], "location_column": 17, "logistic_regress": [], "low": 18, "lowercas": 6, "m": [], "machin": 2, "mai": 
19, "main": [4, 17, 18, 19], "major": 3, "mani": 18, "map": 20, "mark": 18, "marker": 20, "markerclust": 20, "match": [6, 15, 17], "match_group_to_area": 6, "maximum": [], "me": 2, "media": [2, 4], "member": 9, "mention": [2, 3, 8], "meow": [], "merg": [6, 15, 19], "merge_to_initial_df": 6, "messag": [2, 3, 6, 11, 14, 17, 18, 19], "method": [2, 3, 11, 13, 14, 17, 18, 19], "min_event_s": 14, "min_text": 19, "model": [2, 3, 6, 8, 13, 14, 15, 17, 18], "model_nam": 13, "model_path": [6, 15], "modul": [2, 4, 12, 13, 14, 15, 16, 17, 18, 19], "more": [3, 17], "most": [11, 13], "multipl": 13, "name": [6, 8, 11, 13, 15, 17, 20], "natasha": 7, "natashaextractor": 8, "ner": [6, 8, 15], "network": [2, 3, 12, 14], "new": [2, 11, 17], "node": [2, 17, 20], "none": [6, 8, 13, 15, 17, 18, 20], "normal": 7, "number": [6, 17, 18], "number_of_categori": 18, "number_of_messag": [], "nx": 17, "object": [2, 3, 6, 7, 8, 11, 14, 15, 18, 20], "obtain": [2, 3, 17], "offset": 20, "one": 7, "onli": [3, 6], "onlin": 3, "openstreetmap": [5, 6, 12], "option": [6, 8, 15, 20], "order": [3, 6], "origin": [6, 11], "osm": [5, 6, 7, 12, 15], "osm_combined_df": 7, "osm_id": [6, 7, 15], "other": [6, 13, 15, 16, 18], "other_geo_obj": 7, "otherwis": [3, 6, 8, 13], "out": [3, 6], "output_fil": 20, "overlap": 20, "own": 20, "owner": [], "owner_id": [], "packag": [], "page": 4, "panda": [], "param": 17, "paramet": [6, 8, 15, 17, 20], "parent": 17, "parents_stack_column": 17, "pars": 2, "parser": [], "particular": 2, "path": 20, "pd": [6, 8, 13, 15, 17], "per": 18, "percentag": 7, "perform": 15, "perisiqu": [], "pip": 1, "pipelin": [4, 6, 15, 18], "place": [2, 6, 15, 17], "point": [6, 7], "polygon": 7, "popul": 14, "posit": 8, "possibl": 3, "post": [2, 17, 19], "post_id": [], "post_id_column": 17, "post_num_limit": [], "pre": [8, 18], "predict": [2, 18], "preprocess": [6, 14, 15], "preprocess_area_nam": 6, "preprocess_group_nam": 6, "present": 8, "prevail": 13, "previous": [], "probabl": [11, 18], "process": 
[2, 6, 7, 8, 15, 16, 18, 19], "process_pipelin": 8, "process_top": 19, "processed_geodata": 16, "provid": [2, 6, 15, 16, 17], "proxim": 17, "public": 18, "python": [], "python311": [], "pytorch": 17, "qualiti": 2, "random": 20, "rang": [8, 19], "read_csv": 13, "receiv": [2, 3], "recogn": [6, 13], "recognis": 6, "recognit": [2, 13], "recognize_average_emotion_from_multiple_model": 13, "recognize_emot": 13, "refer": 17, "regard": 2, "region": [2, 3], "regional_act": 16, "regionalact": 16, "relat": [8, 15], "relationship": 17, "remov": [6, 8, 17], "remove_last_lett": 11, "replace_with_most_similar": 11, "repli": 2, "repositori": 18, "repository_id": 18, "repres": [6, 20], "request": [], "requir": 3, "restoration_of_normal_form": 7, "result": [2, 17, 19, 20], "retriev": [5, 6, 7, 12, 15], "return": [6, 7, 8, 13, 14, 15, 16, 17, 18, 20], "risk": [2, 3, 13, 14, 16, 18], "road": 14, "roadmap": 4, "roam": [], "row": 6, "rubert": [17, 18, 19], "run": [1, 3, 6, 11, 14, 15], "run_com": [], "run_osm_df": 7, "run_pars": [], "run_post": [], "run_text_classifi": 18, "run_text_classifier_top": 18, "russian": [17, 18], "save": 20, "scale": 6, "score": [8, 17], "search": [0, 4, 8], "search_for_object": [6, 15], "section": 3, "see": 15, "select": [3, 13], "semant": [2, 3, 14], "semantic_graph": 17, "semantic_graph_build": 17, "semantic_score_filt": 17, "semgraph": 17, "sentenc": 11, "seri": [6, 8], "servic": [2, 3], "set": [6, 13, 17], "set_global_repr_point": 6, "similar": 7, "simpl": [6, 15], "simpli": 6, "site": 3, "size": [], "skip": 3, "sloyka": [1, 2, 3, 6, 8, 9, 11, 12, 13, 14, 15, 17, 18, 20], "sloyka_commun": 2, "so": 18, "social": [2, 3, 4, 12, 16], "some": [3, 19], "sorri": 0, "sourc": [6, 7, 8, 11, 13, 14, 15, 17, 18, 20], "spatial": [2, 7, 14], "special": 6, "specif": 18, "specifi": [6, 8, 17, 18, 19], "src": [6, 8, 9, 11, 13, 14, 15, 17, 18, 20], "stage": 2, "start_dat": 19, "state": [6, 15], "static": [6, 7, 8, 15, 17], "stem": 6, "step": 3, "stop": [], "stopword": 6, 
"store": [11, 16, 19], "str": [6, 7, 8, 14, 15, 17, 18, 20], "street": [6, 8, 12, 14, 15], "street_extractor": [8, 15], "street_nam": 8, "street_names_df": 6, "string": 11, "structur": 18, "subcom": [], "success": [], "synthesi": 3, "synthet": 18, "t": [0, 2, 6], "tag": [2, 3, 5, 6, 7, 12, 15], "take": [11, 18], "target_text": 14, "test": 6, "text": [2, 3, 6, 7, 8, 11, 13, 14, 15, 16, 17, 19], "text_address_extractor_by_rul": 8, "text_classifi": 18, "text_column": [6, 8, 11, 13, 15, 17], "text_type_column": 17, "textclassifi": 18, "textual": 2, "thank": 17, "thei": 3, "them": [2, 6, 7, 11], "theme": 18, "thi": [3, 5, 6, 7, 8, 12, 13, 14, 15, 17, 18], "those": 6, "three": 18, "threshold": [7, 8, 17], "timeout_limit": [], "tini": 18, "tiny2": [18, 19], "token": 18, "too": 18, "top": 17, "top_n": 17, "topic": [2, 3], "topicmodel": 19, "toponym": [8, 17], "toponym_column": 17, "toponym_name_column": 17, "toponym_type_column": 17, "train": [6, 8, 13, 18], "transform": [13, 18], "transport": 18, "true": [13, 17], "try": 0, "tupl": 6, "two": 2, "type": [6, 8, 11, 12, 15, 17, 18, 20], "undoc": 9, "uniqu": 17, "unknown_\u0436\u043a\u0445": 18, "unnecessari": 8, "up": 18, "updat": 17, "update_graph": 17, "upper": 18, "urban": [2, 3, 7, 14, 15, 18], "us": [0, 2, 4, 5, 7, 8, 11, 12, 13, 16, 17, 18, 19, 20], "user": [6, 16], "util": [9, 11, 18], "variou": 12, "veri": 3, "vernacular": 2, "version": 2, "visual": [2, 3], "vk": 12, "vk_data_gett": [], "vkontakt": 12, "vkparser": 12, "wa": 18, "wai": 6, "we": [0, 6], "well": [2, 11, 12], "were": [3, 18], "weren": 6, "where": 6, "which": [2, 3, 13, 14, 16, 17, 18, 19, 20], "whose": [], "wide": 19, "within": [2, 8], "word": [6, 15, 17], "word_info_column": 17, "words_scor": 17, "work": [9, 12], "y": [], "you": 3}, "titles": ["Page Not Found", "Installation", "Introduction", "Main pipeline", "Welcome to Sloyka\u2019s documentation!", "GeoDataGetter", "Geocoder", "OtherGeoObjects", "StreetExtractor", "Streets", "VKparser", "Services 
extraction", "Data getter", "Emotion classifier", "Event detector", "Geocoding", "Regional activity", "Semantic graph", "Text Classifier", "Topic Modelling", "Graph visualization"], "titleterms": {"": 4, "Not": 0, "about": [], "activ": 16, "advanc": [12, 15], "class": [], "classifi": [13, 18], "data": 12, "detector": 14, "document": 4, "emot": 13, "event": 14, "exampl": 13, "extract": 11, "featur": 2, "finder": [], "found": 0, "function": [], "gener": 4, "geo": [], "geocod": [6, 12, 15], "geodatagett": 5, "getter": 12, "graph": [17, 20], "indic": 4, "inner": [], "instal": 1, "introduct": 2, "main": [2, 3], "model": 19, "more": [12, 15], "object": [], "othergeoobject": [7, 15], "page": 0, "pipelin": 3, "receiv": [], "region": 16, "semant": 17, "servic": 11, "sloyka": 4, "street": 9, "streetextractor": [8, 15], "tabl": 4, "tag": [], "text": 18, "topic": 19, "visual": 20, "vkparser": 10, "welcom": 4, "work": []}}) \ No newline at end of file +Search.setIndex({"alltitles": {"Advanced geocoding": [[12, null], [15, null]], "Data getter": [[12, "data-getter"]], "Emotion classifier": [[13, "module-sloyka.src.risks.emotion_classifier"]], "Event detector": [[14, "module-sloyka.src.risks.event_detector"]], "Example": [[13, "example"]], "General:": [[4, null]], "GeoDataGetter": [[5, "geodatagetter"]], "Geocoder": [[6, "geocoder"], [15, "id1"]], "Geocoding": [[15, "geocoding"]], "Graph visualization": [[20, "graph-visualization"]], "Indices and tables": [[4, "indices-and-tables"]], "Installation": [[1, "installation"]], "Introduction": [[2, "introduction"]], "Main features": [[2, "main-features"]], "Main pipeline": [[3, "main-pipeline"]], "OtherGeoObjects": [[7, "othergeoobjects"], [15, "othergeoobjects"]], "Page Not Found": [[0, "page-not-found"]], "Regional activity": [[16, "regional-activity"]], "Semantic graph": [[17, "module-sloyka.src.semantic_graph.semantic_graph_builder"]], "Services extraction": [[11, "services-extraction"]], "StreetExtractor": [[8, "streetextractor"], 
[15, "streetextractor"]], "Streets": [[9, "streets"]], "Text Classifier": [[18, "text-classifier"]], "Topic Modelling": [[19, "topic-modelling"]], "VKparser": [[10, "vkparser"]], "Welcome to Sloyka\u2019s documentation!": [[4, "welcome-to-sloyka-s-documentation"]], "more:": [[12, "more"], [15, "more"]]}, "docnames": ["404", "about/installation", "about/introduction", "about/pipeline", "index", "modules/GeoDataGetter", "modules/Geocoder_special", "modules/OtherGeoObjects", "modules/StreetExtractor", "modules/Streets", "modules/VKParser", "modules/city_services_extract", "modules/data_getter", "modules/emotion_classifier", "modules/event_dynamic_prediction", "modules/geocoder", "modules/regional_activity", "modules/semantic_graph", "modules/text_classifier", "modules/topic_modeler", "modules/visualize_graph"], "envversion": {"sphinx": 61, "sphinx.domains.c": 3, "sphinx.domains.changeset": 1, "sphinx.domains.citation": 1, "sphinx.domains.cpp": 9, "sphinx.domains.index": 1, "sphinx.domains.javascript": 3, "sphinx.domains.math": 2, "sphinx.domains.python": 4, "sphinx.domains.rst": 2, "sphinx.domains.std": 2, "sphinx.ext.viewcode": 1}, "filenames": ["404.rst", "about/installation.rst", "about/introduction.rst", "about/pipeline.rst", "index.rst", "modules/GeoDataGetter.rst", "modules/Geocoder_special.rst", "modules/OtherGeoObjects.rst", "modules/StreetExtractor.rst", "modules/Streets.rst", "modules/VKParser.rst", "modules/city_services_extract.rst", "modules/data_getter.rst", "modules/emotion_classifier.rst", "modules/event_dynamic_prediction.rst", "modules/geocoder.rst", "modules/regional_activity.rst", "modules/semantic_graph.rst", "modules/text_classifier.rst", "modules/topic_modeler.rst", "modules/visualize_graph.rst"], "indexentries": {"api_verison (sloyka.src.utils.data_getter.vk_data_getter.vkparser attribute)": [[10, "sloyka.src.utils.data_getter.vk_data_getter.VKParser.API_VERISON", false]], "assign_street() (sloyka.src.geocoder.geocoder.geocoder method)": [[6, 
"sloyka.src.geocoder.geocoder.Geocoder.assign_street", false]], "build_graph() (sloyka.src.semantic_graph.semantic_graph_builder.semgraph method)": [[17, "sloyka.src.semantic_graph.semantic_graph_builder.Semgraph.build_graph", false]], "calculate_centroid() (sloyka.src.geocoder.city_objects_extractor.othergeoobjects static method)": [[7, "id2", false], [7, "sloyka.src.geocoder.city_objects_extractor.OtherGeoObjects.calculate_centroid", false]], "city_services (class in sloyka.src.utils.data_processing.city_services_extract)": [[11, "sloyka.src.utils.data_processing.city_services_extract.City_services", false]], "classify_text() (sloyka.src.risks.text_classifier.textclassifiers method)": [[18, "sloyka.src.risks.text_classifier.TextClassifiers.classify_text", false]], "clear_names() (sloyka.src.utils.data_getter.street_getter.streets static method)": [[9, "sloyka.src.utils.data_getter.street_getter.Streets.clear_names", false]], "combine_city_obj() (sloyka.src.geocoder.city_objects_extractor.othergeoobjects static method)": [[7, "sloyka.src.geocoder.city_objects_extractor.OtherGeoObjects.combine_city_obj", false]], "comments_to_dataframe() (sloyka.src.utils.data_getter.vk_data_getter.vkparser static method)": [[10, "sloyka.src.utils.data_getter.vk_data_getter.VKParser.comments_to_dataframe", false]], "convert_df_to_edge_df() (sloyka.src.semantic_graph.semantic_graph_builder.semgraph static method)": [[17, "sloyka.src.semantic_graph.semantic_graph_builder.Semgraph.convert_df_to_edge_df", false]], "count_items (sloyka.src.utils.data_getter.vk_data_getter.vkparser attribute)": [[10, "sloyka.src.utils.data_getter.vk_data_getter.VKParser.COUNT_ITEMS", false]], "create_gdf() (sloyka.src.geocoder.geocoder.geocoder method)": [[6, "sloyka.src.geocoder.geocoder.Geocoder.create_gdf", false]], "drop_words_from_name() (sloyka.src.utils.data_getter.street_getter.streets static method)": [[9, "sloyka.src.utils.data_getter.street_getter.Streets.drop_words_from_name", false]], 
"emotionrecognizer (class in sloyka.src.risks.emotion_classifier)": [[13, "sloyka.src.risks.emotion_classifier.EmotionRecognizer", false]], "eventdetection (class in sloyka.src.risks.event_detector)": [[14, "sloyka.src.risks.event_detector.EventDetection", false]], "expand_toponym() (sloyka.src.geocoder.city_objects_extractor.othergeoobjects static method)": [[7, "sloyka.src.geocoder.city_objects_extractor.OtherGeoObjects.expand_toponym", false]], "extract_geo_obj() (sloyka.src.geocoder.city_objects_extractor.othergeoobjects static method)": [[7, "id3", false], [7, "sloyka.src.geocoder.city_objects_extractor.OtherGeoObjects.extract_geo_obj", false]], "extract_ner_street() (sloyka.src.geocoder.street_extractor.streetextractor static method)": [[8, "sloyka.src.geocoder.street_extractor.StreetExtractor.extract_ner_street", false]], "extract_toponym() (sloyka.src.geocoder.street_extractor.streetextractor static method)": [[8, "sloyka.src.geocoder.street_extractor.StreetExtractor.extract_toponym", false]], "extraction_services() (sloyka.src.utils.data_processing.city_services_extract.city_services method)": [[11, "sloyka.src.utils.data_processing.city_services_extract.City_services.extraction_services", false]], "extractor (sloyka.src.geocoder.street_extractor.streetextractor attribute)": [[8, "sloyka.src.geocoder.street_extractor.StreetExtractor.extractor", false]], "find_geo_obj_tag() (sloyka.src.geocoder.city_objects_extractor.othergeoobjects static method)": [[7, "sloyka.src.geocoder.city_objects_extractor.OtherGeoObjects.find_geo_obj_tag", false]], "find_geometry() (sloyka.src.geocoder.city_objects_extractor.othergeoobjects static method)": [[7, "sloyka.src.geocoder.city_objects_extractor.OtherGeoObjects.find_geometry", false]], "find_num_city_obj() (sloyka.src.geocoder.city_objects_extractor.othergeoobjects static method)": [[7, "sloyka.src.geocoder.city_objects_extractor.OtherGeoObjects.find_num_city_obj", false]], "find_toponim_words_from_name() 
(sloyka.src.utils.data_getter.street_getter.streets static method)": [[9, "sloyka.src.utils.data_getter.street_getter.Streets.find_toponim_words_from_name", false]], "geocoder (class in sloyka.src.geocoder.geocoder)": [[6, "sloyka.src.geocoder.geocoder.Geocoder", false], [15, "sloyka.src.geocoder.geocoder.Geocoder", false]], "geodatagetter (class in sloyka.src.utils.data_getter.geo_data_getter)": [[5, "sloyka.src.utils.data_getter.geo_data_getter.GeoDataGetter", false]], "get_and_process_osm_data() (sloyka.src.geocoder.city_objects_extractor.othergeoobjects static method)": [[7, "id0", false], [7, "sloyka.src.geocoder.city_objects_extractor.OtherGeoObjects.get_and_process_osm_data", false]], "get_city_bounds() (sloyka.src.utils.data_getter.geo_data_getter.geodatagetter static method)": [[5, "sloyka.src.utils.data_getter.geo_data_getter.GeoDataGetter.get_city_bounds", false]], "get_comments() (sloyka.src.utils.data_getter.vk_data_getter.vkparser method)": [[10, "sloyka.src.utils.data_getter.vk_data_getter.VKParser.get_comments", false]], "get_df_areas() (sloyka.src.geocoder.geocoder.geocoder method)": [[6, "sloyka.src.geocoder.geocoder.Geocoder.get_df_areas", false]], "get_drive_graph() (sloyka.src.utils.data_getter.geo_data_getter.geodatagetter static method)": [[5, "sloyka.src.utils.data_getter.geo_data_getter.GeoDataGetter.get_drive_graph", false]], "get_features_from_id() (sloyka.src.utils.data_getter.geo_data_getter.geodatagetter static method)": [[5, "sloyka.src.utils.data_getter.geo_data_getter.GeoDataGetter.get_features_from_id", false]], "get_group_name() (sloyka.src.utils.data_getter.vk_data_getter.vkparser static method)": [[10, "sloyka.src.utils.data_getter.vk_data_getter.VKParser.get_group_name", false]], "get_group_post_ids() (sloyka.src.utils.data_getter.vk_data_getter.vkparser static method)": [[10, "sloyka.src.utils.data_getter.vk_data_getter.VKParser.get_group_post_ids", false]], "get_level() (sloyka.src.geocoder.geocoder.geocoder static method)": 
[[6, "sloyka.src.geocoder.geocoder.Geocoder.get_level", false]], "get_osm_data() (sloyka.src.utils.data_getter.geo_data_getter.geodatagetter static method)": [[5, "sloyka.src.utils.data_getter.geo_data_getter.GeoDataGetter.get_osm_data", false]], "get_owner_id_by_domain() (sloyka.src.utils.data_getter.vk_data_getter.vkparser static method)": [[10, "sloyka.src.utils.data_getter.vk_data_getter.VKParser.get_owner_id_by_domain", false]], "get_stem() (sloyka.src.geocoder.geocoder.geocoder static method)": [[6, "sloyka.src.geocoder.geocoder.Geocoder.get_stem", false]], "get_street_names() (sloyka.src.utils.data_getter.street_getter.streets static method)": [[9, "sloyka.src.utils.data_getter.street_getter.Streets.get_street_names", false]], "get_subcomments() (sloyka.src.utils.data_getter.vk_data_getter.vkparser static method)": [[10, "sloyka.src.utils.data_getter.vk_data_getter.VKParser.get_subcomments", false]], "get_unique_part_types() (sloyka.src.geocoder.city_objects_extractor.othergeoobjects static method)": [[7, "sloyka.src.geocoder.city_objects_extractor.OtherGeoObjects.get_unique_part_types", false]], "initialize_classifier() (sloyka.src.risks.text_classifier.textclassifiers method)": [[18, "sloyka.src.risks.text_classifier.TextClassifiers.initialize_classifier", false]], "match_group_to_area() (sloyka.src.geocoder.geocoder.geocoder method)": [[6, "sloyka.src.geocoder.geocoder.Geocoder.match_group_to_area", false]], "merge_to_initial_df() (sloyka.src.geocoder.geocoder.geocoder method)": [[6, "sloyka.src.geocoder.geocoder.Geocoder.merge_to_initial_df", false]], "module": [[11, "module-sloyka.src.utils.data_processing.city_services_extract", false], [13, "module-sloyka.src.risks.emotion_classifier", false], [14, "module-sloyka.src.risks.event_detector", false], [17, "module-sloyka.src.semantic_graph.semantic_graph_builder", false], [18, "module-sloyka.src.risks.text_classifier", false], [20, "module-sloyka.src.visual.graph_visualization", false]], "othergeoobjects 
(class in sloyka.src.geocoder.city_objects_extractor)": [[7, "sloyka.src.geocoder.city_objects_extractor.OtherGeoObjects", false], [15, "sloyka.src.geocoder.city_objects_extractor.OtherGeoObjects", false]], "preprocess_area_names() (sloyka.src.geocoder.geocoder.geocoder method)": [[6, "sloyka.src.geocoder.geocoder.Geocoder.preprocess_area_names", false]], "preprocess_group_name() (sloyka.src.geocoder.geocoder.geocoder method)": [[6, "sloyka.src.geocoder.geocoder.Geocoder.preprocess_group_name", false]], "process_pipeline() (sloyka.src.geocoder.street_extractor.streetextractor static method)": [[8, "sloyka.src.geocoder.street_extractor.StreetExtractor.process_pipeline", false]], "recognize_average_emotion_from_multiple_models() (sloyka.src.risks.emotion_classifier.emotionrecognizer method)": [[13, "sloyka.src.risks.emotion_classifier.EmotionRecognizer.recognize_average_emotion_from_multiple_models", false]], "recognize_emotion() (sloyka.src.risks.emotion_classifier.emotionrecognizer method)": [[13, "sloyka.src.risks.emotion_classifier.EmotionRecognizer.recognize_emotion", false]], "remove_last_letter() (sloyka.src.utils.data_processing.city_services_extract.city_services method)": [[11, "sloyka.src.utils.data_processing.city_services_extract.City_services.remove_last_letter", false]], "replace_with_most_similar() (sloyka.src.utils.data_processing.city_services_extract.city_services method)": [[11, "sloyka.src.utils.data_processing.city_services_extract.City_services.replace_with_most_similar", false]], "restoration_of_normal_form() (sloyka.src.geocoder.city_objects_extractor.othergeoobjects static method)": [[7, "id4", false], [7, "sloyka.src.geocoder.city_objects_extractor.OtherGeoObjects.restoration_of_normal_form", false]], "run() (sloyka.src.geocoder.city_objects_extractor.othergeoobjects static method)": [[7, "sloyka.src.geocoder.city_objects_extractor.OtherGeoObjects.run", false], [15, "sloyka.src.geocoder.city_objects_extractor.OtherGeoObjects.run", false]], 
"run() (sloyka.src.geocoder.geocoder.geocoder method)": [[6, "sloyka.src.geocoder.geocoder.Geocoder.run", false], [15, "sloyka.src.geocoder.geocoder.Geocoder.run", false]], "run() (sloyka.src.risks.event_detector.eventdetection method)": [[14, "sloyka.src.risks.event_detector.EventDetection.run", false]], "run() (sloyka.src.utils.data_getter.street_getter.streets static method)": [[9, "sloyka.src.utils.data_getter.street_getter.Streets.run", false]], "run() (sloyka.src.utils.data_processing.city_services_extract.city_services method)": [[11, "sloyka.src.utils.data_processing.city_services_extract.City_services.run", false]], "run_comments() (sloyka.src.utils.data_getter.vk_data_getter.vkparser static method)": [[10, "sloyka.src.utils.data_getter.vk_data_getter.VKParser.run_comments", false]], "run_osm_dfs() (sloyka.src.geocoder.city_objects_extractor.othergeoobjects static method)": [[7, "id1", false], [7, "sloyka.src.geocoder.city_objects_extractor.OtherGeoObjects.run_osm_dfs", false]], "run_parser() (sloyka.src.utils.data_getter.vk_data_getter.vkparser static method)": [[10, "sloyka.src.utils.data_getter.vk_data_getter.VKParser.run_parser", false]], "run_posts() (sloyka.src.utils.data_getter.vk_data_getter.vkparser static method)": [[10, "sloyka.src.utils.data_getter.vk_data_getter.VKParser.run_posts", false]], "run_text_classifier() (sloyka.src.risks.text_classifier.textclassifiers method)": [[18, "sloyka.src.risks.text_classifier.TextClassifiers.run_text_classifier", false]], "run_text_classifier_topics() (sloyka.src.risks.text_classifier.textclassifiers method)": [[18, "sloyka.src.risks.text_classifier.TextClassifiers.run_text_classifier_topics", false]], "semgraph (class in sloyka.src.semantic_graph.semantic_graph_builder)": [[17, "sloyka.src.semantic_graph.semantic_graph_builder.Semgraph", false]], "set_global_repr_point() (sloyka.src.geocoder.geocoder.geocoder method)": [[6, "sloyka.src.geocoder.geocoder.Geocoder.set_global_repr_point", false]], 
"sloyka.src.risks.emotion_classifier": [[13, "module-sloyka.src.risks.emotion_classifier", false]], "sloyka.src.risks.event_detector": [[14, "module-sloyka.src.risks.event_detector", false]], "sloyka.src.risks.text_classifier": [[18, "module-sloyka.src.risks.text_classifier", false]], "sloyka.src.semantic_graph.semantic_graph_builder": [[17, "module-sloyka.src.semantic_graph.semantic_graph_builder", false]], "sloyka.src.utils.data_processing.city_services_extract": [[11, "module-sloyka.src.utils.data_processing.city_services_extract", false]], "sloyka.src.visual.graph_visualization": [[20, "module-sloyka.src.visual.graph_visualization", false]], "streetextractor (class in sloyka.src.geocoder.street_extractor)": [[8, "sloyka.src.geocoder.street_extractor.StreetExtractor", false], [15, "sloyka.src.geocoder.street_extractor.StreetExtractor", false]], "streets (class in sloyka.src.utils.data_getter.street_getter)": [[9, "sloyka.src.utils.data_getter.street_getter.Streets", false]], "textclassifiers (class in sloyka.src.risks.text_classifier)": [[18, "sloyka.src.risks.text_classifier.TextClassifiers", false]], "timeout_limit (sloyka.src.utils.data_getter.vk_data_getter.vkparser attribute)": [[10, "sloyka.src.utils.data_getter.vk_data_getter.VKParser.TIMEOUT_LIMIT", false]], "update_graph() (sloyka.src.semantic_graph.semantic_graph_builder.semgraph method)": [[17, "sloyka.src.semantic_graph.semantic_graph_builder.Semgraph.update_graph", false]], "visualize_graph() (in module sloyka.src.visual.graph_visualization)": [[20, "sloyka.src.visual.graph_visualization.visualize_graph", false]], "vkparser (class in sloyka.src.utils.data_getter.vk_data_getter)": [[10, "sloyka.src.utils.data_getter.vk_data_getter.VKParser", false]]}, "objects": {"sloyka.src.geocoder.city_objects_extractor": [[15, 0, 1, "", "OtherGeoObjects"]], "sloyka.src.geocoder.city_objects_extractor.OtherGeoObjects": [[7, 1, 1, "id2", "calculate_centroid"], [7, 1, 1, "", "combine_city_obj"], [7, 1, 1, "", 
"expand_toponym"], [7, 1, 1, "id3", "extract_geo_obj"], [7, 1, 1, "", "find_geo_obj_tag"], [7, 1, 1, "", "find_geometry"], [7, 1, 1, "", "find_num_city_obj"], [7, 1, 1, "id0", "get_and_process_osm_data"], [7, 1, 1, "", "get_unique_part_types"], [7, 1, 1, "id4", "restoration_of_normal_form"], [15, 1, 1, "", "run"], [7, 1, 1, "id1", "run_osm_dfs"]], "sloyka.src.geocoder.geocoder": [[15, 0, 1, "", "Geocoder"]], "sloyka.src.geocoder.geocoder.Geocoder": [[6, 1, 1, "", "assign_street"], [6, 1, 1, "", "create_gdf"], [6, 1, 1, "", "get_df_areas"], [6, 1, 1, "", "get_level"], [6, 1, 1, "", "get_stem"], [6, 1, 1, "", "match_group_to_area"], [6, 1, 1, "", "merge_to_initial_df"], [6, 1, 1, "", "preprocess_area_names"], [6, 1, 1, "", "preprocess_group_name"], [15, 1, 1, "", "run"], [6, 1, 1, "", "set_global_repr_point"]], "sloyka.src.geocoder.street_extractor": [[15, 0, 1, "", "StreetExtractor"]], "sloyka.src.geocoder.street_extractor.StreetExtractor": [[8, 1, 1, "", "extract_ner_street"], [8, 1, 1, "", "extract_toponym"], [8, 2, 1, "", "extractor"], [8, 1, 1, "", "process_pipeline"]], "sloyka.src.risks": [[13, 3, 0, "-", "emotion_classifier"], [14, 3, 0, "-", "event_detector"], [18, 3, 0, "-", "text_classifier"]], "sloyka.src.risks.emotion_classifier": [[13, 0, 1, "", "EmotionRecognizer"]], "sloyka.src.risks.emotion_classifier.EmotionRecognizer": [[13, 1, 1, "", "recognize_average_emotion_from_multiple_models"], [13, 1, 1, "", "recognize_emotion"]], "sloyka.src.risks.event_detector": [[14, 0, 1, "", "EventDetection"]], "sloyka.src.risks.event_detector.EventDetection": [[14, 1, 1, "", "run"]], "sloyka.src.risks.text_classifier": [[18, 0, 1, "", "TextClassifiers"]], "sloyka.src.risks.text_classifier.TextClassifiers": [[18, 1, 1, "", "classify_text"], [18, 1, 1, "", "initialize_classifier"], [18, 1, 1, "", "run_text_classifier"], [18, 1, 1, "", "run_text_classifier_topics"]], "sloyka.src.semantic_graph": [[17, 3, 0, "-", "semantic_graph_builder"]], 
"sloyka.src.semantic_graph.semantic_graph_builder": [[17, 0, 1, "", "Semgraph"]], "sloyka.src.semantic_graph.semantic_graph_builder.Semgraph": [[17, 1, 1, "", "build_graph"], [17, 1, 1, "", "convert_df_to_edge_df"], [17, 1, 1, "", "update_graph"]], "sloyka.src.utils.data_getter.geo_data_getter": [[5, 0, 1, "", "GeoDataGetter"]], "sloyka.src.utils.data_getter.geo_data_getter.GeoDataGetter": [[5, 1, 1, "", "get_city_bounds"], [5, 1, 1, "", "get_drive_graph"], [5, 1, 1, "", "get_features_from_id"], [5, 1, 1, "", "get_osm_data"]], "sloyka.src.utils.data_getter.street_getter": [[9, 0, 1, "", "Streets"]], "sloyka.src.utils.data_getter.street_getter.Streets": [[9, 1, 1, "", "clear_names"], [9, 1, 1, "", "drop_words_from_name"], [9, 1, 1, "", "find_toponim_words_from_name"], [9, 1, 1, "", "get_street_names"], [9, 1, 1, "", "run"]], "sloyka.src.utils.data_getter.vk_data_getter": [[10, 0, 1, "", "VKParser"]], "sloyka.src.utils.data_getter.vk_data_getter.VKParser": [[10, 2, 1, "", "API_VERISON"], [10, 2, 1, "", "COUNT_ITEMS"], [10, 2, 1, "", "TIMEOUT_LIMIT"], [10, 1, 1, "", "comments_to_dataframe"], [10, 1, 1, "", "get_comments"], [10, 1, 1, "", "get_group_name"], [10, 1, 1, "", "get_group_post_ids"], [10, 1, 1, "", "get_owner_id_by_domain"], [10, 1, 1, "", "get_subcomments"], [10, 1, 1, "", "run_comments"], [10, 1, 1, "", "run_parser"], [10, 1, 1, "", "run_posts"]], "sloyka.src.utils.data_processing": [[11, 3, 0, "-", "city_services_extract"]], "sloyka.src.utils.data_processing.city_services_extract": [[11, 0, 1, "", "City_services"]], "sloyka.src.utils.data_processing.city_services_extract.City_services": [[11, 1, 1, "", "extraction_services"], [11, 1, 1, "", "remove_last_letter"], [11, 1, 1, "", "replace_with_most_similar"], [11, 1, 1, "", "run"]], "sloyka.src.visual": [[20, 3, 0, "-", "graph_visualization"]], "sloyka.src.visual.graph_visualization": [[20, 4, 1, "", "visualize_graph"]]}, "objnames": {"0": ["py", "class", "Python class"], "1": ["py", "method", "Python 
method"], "2": ["py", "attribute", "Python attribute"], "3": ["py", "module", "Python module"], "4": ["py", "function", "Python function"]}, "objtypes": {"0": "py:class", "1": "py:method", "2": "py:attribute", "3": "py:module", "4": "py:function"}, "terms": {"": [2, 3], "0": [2, 7, 17], "000": 18, "1": [6, 17, 18, 19], "100": 10, "131": 10, "15": [10, 19], "2": 6, "2024": 2, "3": 6, "5": 10, "50": 10, "6": [2, 7, 17], "7": 7, "70": 7, "75": 17, "90": 18, "A": [6, 8, 9, 10, 12, 17], "As": 17, "By": 3, "For": [7, 17], "If": 6, "In": 18, "It": [6, 13, 14, 15, 17, 18, 19], "Its": 2, "No": 7, "The": [2, 3, 5, 6, 7, 8, 9, 10, 11, 13, 14, 15, 16, 17, 18, 19, 20], "To": [1, 15], "_collect_popul": 14, "_get_build": 14, "_get_features_from_plac": 5, "_get_place_from_id": 5, "_get_road": 14, "_handle_error": 5, "_preprocess": 14, "_process_tag": 5, "about": [3, 16, 19], "access": [10, 18], "access_token": 10, "accord": 6, "account": 11, "accsess_token": 10, "across": 3, "activ": [2, 3], "actual": 6, "ad": 13, "add": 6, "addit": [2, 10, 19], "address": [6, 8, 9, 15], "admin": [6, 15], "after": [2, 6, 15, 16, 17], "against": 6, "aggreg": [16, 17], "aggregate_data": 17, "aim": [2, 14, 17, 18, 19], "algorithm": 11, "all": [3, 6], "almost": 9, "alreadi": [2, 3, 6], "also": [2, 6, 15], "amen": 18, "an": [5, 6, 7, 13, 14, 15, 20], "analysi": 7, "analyz": [4, 13], "ani": [3, 5, 6, 13, 17, 19], "api": [5, 10], "api_verison": 10, "appli": [3, 8, 13], "applic": [14, 17], "ar": [3, 6, 7, 9, 10, 15, 17], "area": [3, 6, 15], "area_nam": 6, "area_name_process": 6, "area_stem": 6, "around": 8, "arrai": 7, "assess": [2, 3], "assign": [6, 15], "assign_street": 6, "attribut": [6, 13, 16, 17, 18], "augment": 2, "authent": 10, "automat": 13, "avail": 13, "avenu": 9, "averag": 13, "avoid": 20, "back": [6, 13], "base": [5, 9, 10, 12, 14, 16, 17, 18], "basic": 16, "becaus": 9, "being": [10, 18], "belong": 19, "below": 8, "bert": [17, 18], "bert_larg": 13, "bert_nam": 17, "bert_tini": 13, "bertop": 
[14, 19], "best": [6, 15], "between": [9, 14, 17], "bool": 17, "boundari": 5, "box": 0, "build": [2, 14, 17], "build_graph": 17, "cach": 6, "calcul": [7, 13], "calculate_centroid": 7, "call": [10, 16, 18], "can": [2, 3, 13, 16], "care": 18, "carri": 3, "case": [3, 17, 18], "categor": 2, "categori": [5, 13, 18], "categoris": 13, "centroid": [5, 6, 7], "certain": 3, "chang": 11, "charact": 6, "character": 3, "chat": 2, "check": 6, "child": 17, "citi": [2, 4, 5, 6, 7, 9, 11, 14, 18, 19], "citizen": 2, "city_bound": 5, "city_cr": 14, "city_nam": 14, "city_objects_extractor": [7, 15], "city_servic": 11, "city_services_extract": [11, 16], "city_tag": [6, 15], "class": [5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19], "classif": [15, 18], "classifi": [2, 3, 8, 16], "classify_text": 18, "clean": 17, "clean_from_digit": 17, "clean_from_dubl": 17, "clean_from_toponym": 17, "clear_nam": 9, "cluster": [14, 19, 20], "cointegr": 19, "collect": [2, 7, 14], "color": 20, "column": [2, 3, 5, 6, 10, 11, 13, 15, 17], "combin": [7, 10], "combine_city_obj": 7, "comment": [2, 10], "comments_to_datafram": 10, "commun": [2, 3], "compar": 7, "comparison": 11, "compat": 13, "confid": 8, "connect": 14, "consist": 3, "construct": [2, 17], "contain": [5, 6, 8, 10, 12, 13, 14, 17, 18], "content": 8, "context": 14, "conveni": 17, "convert": [6, 10], "convert_df_to_edge_df": 17, "convini": 17, "coordin": 7, "correctli": 18, "correspond": [6, 15], "cosin": 17, "couldn": 0, "count": [17, 18], "count_item": 10, "counts_attribut": 17, "cpu": [6, 13, 15, 17], "creat": [6, 15, 16, 17, 19], "create_gdf": 6, "csv": 13, "cuda": 13, "custom": 8, "cutoff": 10, "cutoff_d": 10, "d": 10, "dai": 19, "data": [2, 3, 4, 5, 6, 7, 9, 10, 13, 14, 15, 16, 17, 19], "data_gett": [5, 9, 10], "data_process": 11, "databas": 9, "datafram": [5, 6, 7, 8, 9, 10, 11, 13, 15, 16, 17], "dataset": 3, "date": [2, 6, 10, 15, 19], "deeppavlov": 17, "default": [5, 6, 10, 13, 15, 17, 20], "design": [11, 13, 16, 18, 19], 
"determin": 17, "devic": [6, 13, 15, 17, 18], "device_typ": 18, "df": [6, 7, 8, 11, 13, 15], "df_area": 6, "df_obj": 7, "dict": [5, 6, 7, 10, 15], "dictionari": [6, 10], "didn": 6, "differ": [6, 7], "digit": [2, 17], "direct": 17, "displai": 20, "distanc": 17, "divid": 3, "do": [7, 9, 15], "document": 2, "domain": 10, "don": 6, "done": 19, "drop": 9, "drop_words_from_nam": 9, "duplic": 17, "dure": [5, 13], "dynam": 19, "e": 9, "each": [3, 5, 6, 10, 14, 15, 17, 19, 20], "edg": 5, "effici": 18, "element": [2, 6], "element_typ": 5, "embed": [17, 19], "embedding_model_nam": 19, "emot": [2, 3], "emotion_classifi": [13, 16], "emotionclassifi": 13, "emotionrecogn": 13, "enabl": 13, "encapsul": 9, "end": 11, "end_dat": 19, "enrich": 2, "entiti": [2, 7, 11], "environ": 2, "error": 5, "evalu": 2, "event": [2, 18], "event_detector": 14, "eventdetect": 14, "exact": 9, "exampl": [7, 18], "exist": 2, "expand_toponym": 7, "extract": [2, 3, 7, 8, 9, 15, 17, 19], "extract_geo_obj": 7, "extract_ner_street": 8, "extract_toponym": 8, "extraction_servic": 11, "extractor": [6, 8, 15], "fail": 8, "fall": 13, "fals": [6, 15, 18], "famili": 18, "featur": 5, "fetch": 10, "file": 20, "filepath_to_popul": 14, "filter": [5, 6, 15, 17], "final": [2, 6, 15, 17], "find": [0, 6, 7, 9, 15], "find_geo_obj_tag": 7, "find_geometri": 7, "find_num_city_obj": 7, "find_toponim_words_from_nam": 9, "first": [6, 8], "flag": 17, "flair": [6, 11, 15], "float": [10, 17], "folium": 20, "follow": [13, 14, 17, 18, 19], "footprint": 2, "forecast": 3, "form": [6, 7, 15], "format": 10, "found": [6, 7, 8, 9], "from": [2, 5, 6, 7, 8, 9, 10, 11, 12, 15, 17, 19, 20], "function": [3, 6, 7, 8, 9, 10, 15, 16, 17, 18, 19, 20], "further": [2, 3, 7], "g": [9, 17, 20], "gdf": [6, 9, 19], "gener": [2, 3, 14, 15], "geo_data_gett": 5, "geo_obj_tag": 7, "geocod": [3, 7, 8, 9, 16, 17], "geodatafram": [2, 5, 6, 9, 14, 15, 16, 17, 19], "geodatagett": 12, "geograph": [6, 15], "geoloc": [3, 17], "geometri": [5, 6, 7, 17], 
"geometry_column": 17, "geor111i": [6, 15], "geospati": [5, 12], "get": [2, 3, 5, 9, 10, 14], "get_and_process_osm_data": 7, "get_city_bound": 5, "get_com": 10, "get_df_area": 6, "get_drive_graph": 5, "get_features_from_id": 5, "get_group_nam": 10, "get_group_post_id": 10, "get_level": 6, "get_osm_data": 5, "get_owner_id_by_domain": 10, "get_risk": 16, "get_stem": 6, "get_street_nam": 9, "get_subcom": 10, "get_unique_part_typ": 7, "getter": 3, "given": [5, 6, 9, 10, 12, 13, 15, 17, 20], "glebosol": 11, "global": [6, 19], "go": 0, "gpd": [5, 6, 15, 17], "gpu": 13, "graph": [2, 3], "graph_path": 20, "graph_visu": 20, "group": [3, 6, 10, 15, 20], "group_column": [6, 15], "group_nam": [6, 15], "ha": [13, 14, 17, 18, 19], "handl": 5, "have": 20, "health": 18, "histgeodatagett": 6, "hold": 13, "homepag": 0, "hous": [6, 18], "howev": 3, "http": 2, "huggingfac": [13, 18], "huggingfacemodel": 13, "hypergraph": 2, "i": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 16, 17, 18, 19], "id": [5, 6, 9, 10, 12, 15, 18], "id_column": 17, "ident": 4, "identifi": [8, 17, 18, 19], "import": 3, "includ": [5, 16], "incom": 19, "index": 4, "indic": 17, "inf": 10, "infer": [13, 17], "infin": 10, "info": [3, 6], "inform": [9, 10, 16, 17], "initi": [6, 16, 18], "initial_df": 6, "initialize_classifi": 18, "input": [6, 8, 9, 10, 13, 15, 17, 18], "instal": 4, "instanti": 13, "int": [5, 6, 7, 9, 10, 14, 15, 17, 18, 20], "interact": 20, "interpret": 2, "intersect": 5, "introduct": 4, "is_top": 18, "its": [10, 20], "june": 2, "keep": 6, "kei": 17, "key_score_filt": 17, "keybert": 17, "keyword": 17, "know": 6, "label": 3, "languag": 17, "last": 3, "later": 9, "launch": [7, 15], "least": 7, "level": [6, 15, 18], "librari": [2, 3, 4, 5, 7, 11, 17], "like": 13, "limit": [3, 10], "linear": 15, "link": [2, 14], "list": [3, 5, 7, 10, 11], "locat": [2, 7, 17], "location_column": 17, "low": 18, "lowercas": 6, "m": 10, "machin": 2, "mai": 19, "main": [4, 17, 18, 19], "major": 3, "mani": 18, "map": 20, 
"mark": 18, "marker": 20, "markerclust": 20, "match": [6, 9, 15, 17], "match_group_to_area": 6, "maximum": 10, "me": 2, "media": [2, 4, 10], "mention": [2, 3, 8], "merg": [6, 15, 19], "merge_to_initial_df": 6, "messag": [2, 3, 5, 6, 10, 11, 14, 17, 18, 19], "method": [2, 3, 5, 9, 10, 11, 13, 14, 17, 18, 19], "min_event_s": 14, "min_text": 19, "model": [2, 3, 6, 8, 13, 14, 15, 17, 18], "model_nam": [11, 13], "model_path": [6, 15], "modul": [2, 4, 7, 12, 13, 14, 15, 16, 17, 18, 19], "more": [3, 17], "most": [11, 13], "multidigraph": 5, "multipl": 13, "name": [5, 6, 7, 8, 9, 11, 13, 15, 17, 20], "natasha": 7, "natashaextractor": 8, "necessari": 9, "need": 9, "ner": [6, 8, 15], "network": [2, 3, 5, 12, 14], "networkx": 5, "new": [2, 11, 17], "node": [2, 5, 17, 20], "nominatim": 9, "none": [6, 8, 9, 10, 13, 15, 17, 18, 20], "normal": 7, "number": [6, 7, 10, 17, 18], "number_of_categori": 18, "number_of_messag": 10, "nx": 17, "object": [2, 3, 5, 6, 7, 8, 11, 14, 15, 18, 20], "obtain": [2, 3, 17], "occur": 5, "offset": 20, "one": 7, "onli": [3, 6], "onlin": 3, "openstreetmap": [5, 6, 12], "option": [5, 6, 8, 15, 20], "order": [3, 6], "origin": [6, 11], "osm": [5, 6, 7, 9, 12, 15], "osm_combined_df": 7, "osm_id": [5, 6, 7, 9, 15], "osm_typ": 5, "osmid": 5, "osmnx": 5, "other": [6, 13, 15, 16, 18], "other_geo_obj": 7, "otherwis": [3, 6, 8, 13], "out": [3, 6], "output_fil": 20, "overlap": 20, "overpass": 5, "own": 20, "owner": 10, "owner_id": 10, "page": 4, "panda": [9, 10], "param": [10, 17], "paramet": [5, 6, 8, 9, 10, 15, 17, 20], "parent": 17, "parents_stack_column": 17, "pars": 2, "parser": 10, "part": 9, "particular": 2, "pd": [6, 8, 13, 15, 17], "per": 18, "percentag": 7, "perform": 15, "pip": 1, "pipelin": [4, 6, 15, 18], "place": [2, 5, 6, 15, 17], "point": [6, 7], "polygon": [5, 7], "popul": 14, "posit": [8, 10], "possibl": 3, "post": [2, 10, 17, 19], "post_id": 10, "post_id_column": 17, "post_num_limit": 10, "pre": [8, 9, 18], "predict": [2, 18], "preprocess": [6, 
14, 15], "preprocess_area_nam": 6, "preprocess_group_nam": 6, "present": 8, "prevail": 13, "print": 5, "probabl": [11, 18], "process": [2, 5, 6, 7, 8, 9, 15, 16, 18, 19], "process_pipelin": 8, "process_top": 19, "processed_geodata": 16, "provid": [2, 5, 6, 15, 16, 17], "proxim": 17, "public": 18, "purpos": 9, "pytorch": 17, "qualiti": 2, "r": 5, "random": 20, "rang": [8, 19], "read_csv": 13, "receiv": [2, 3], "recogn": [6, 13], "recognis": [6, 9], "recognit": [2, 13], "recognize_average_emotion_from_multiple_model": 13, "recognize_emot": 13, "refer": 17, "regard": 2, "region": [2, 3], "regional_act": 16, "regionalact": 16, "relat": [7, 8, 15], "relationship": 17, "remov": [6, 8, 17], "remove_last_lett": 11, "replace_with_most_similar": 11, "repli": 2, "repositori": 18, "repository_id": 18, "repres": [5, 6, 7, 20], "request": 10, "requir": [3, 9], "restoration_of_normal_form": 7, "result": [2, 5, 17, 19, 20], "retriev": [5, 6, 7, 9, 10, 12, 15], "return": [5, 6, 7, 8, 9, 10, 13, 14, 15, 16, 17, 18, 20], "risk": [2, 3, 13, 14, 16, 18], "road": 14, "roadmap": 4, "row": [6, 7], "rubert": [17, 18, 19], "run": [1, 3, 6, 7, 9, 10, 11, 14, 15], "run_com": 10, "run_osm_df": 7, "run_pars": 10, "run_post": 10, "run_text_classifi": 18, "run_text_classifier_top": 18, "russian": [6, 15, 17, 18], "save": 20, "scale": 6, "school": 7, "score": [8, 17], "search": [0, 4, 7, 8], "search_for_object": [6, 15], "section": 3, "see": 15, "segment": [5, 9], "select": [3, 5, 13], "selected_column": 5, "semant": [2, 3, 14], "semantic_graph": 17, "semantic_graph_build": 17, "semantic_score_filt": 17, "semgraph": 17, "sensit": 9, "sentenc": 11, "seri": [6, 8], "servic": [2, 3], "set": [6, 13, 17], "set_global_repr_point": 6, "similar": 7, "simpl": [6, 15], "simpli": 6, "site": 3, "size": 10, "skip": 3, "sloyka": [1, 2, 3, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 17, 18, 20], "sloyka_commun": 2, "so": 18, "social": [2, 3, 4, 10, 12, 16], "some": [3, 19], "sorri": 0, "sourc": [5, 6, 7, 8, 9, 10, 
11, 13, 14, 15, 17, 18, 20], "spatial": [2, 5, 7, 14], "special": 6, "specif": [5, 10, 18], "specifi": [5, 6, 8, 9, 10, 17, 18, 19], "split": 7, "src": [5, 6, 7, 8, 9, 10, 11, 13, 14, 15, 17, 18, 20], "stage": 2, "start_dat": 19, "state": [6, 15], "static": [5, 6, 7, 8, 9, 10, 15, 17], "stem": 6, "stemmer_lang": [6, 15], "step": [3, 9, 10], "stop": 10, "stopword": 6, "store": [11, 16, 19], "str": [5, 6, 7, 8, 9, 10, 11, 14, 15, 17, 18, 20], "street": [5, 6, 7, 8, 12, 14, 15], "street_extractor": [8, 15], "street_gett": 9, "street_nam": 8, "street_names_df": 6, "streets_df": 9, "string": [9, 11], "structur": 18, "subcom": 10, "success": 10, "synthesi": 3, "synthet": 18, "t": [0, 2, 6], "tag": [2, 3, 5, 6, 7, 12, 15], "take": [11, 18], "target_text": 14, "text": [2, 3, 6, 7, 8, 11, 13, 14, 15, 16, 17, 19], "text_address_extractor_by_rul": 8, "text_classifi": 18, "text_column": [6, 7, 8, 11, 13, 15, 17], "text_column_nam": [6, 15], "text_type_column": 17, "textclassifi": 18, "textual": 2, "thank": 17, "thei": 3, "them": [2, 6, 7, 11], "theme": 18, "thi": [3, 5, 6, 7, 8, 9, 12, 13, 14, 15, 17, 18], "those": 6, "three": 18, "threshold": [7, 8, 17], "timeout_limit": 10, "tini": 18, "tiny2": [18, 19], "token": [10, 18], "too": 18, "top": 17, "top_n": 17, "topic": [2, 3], "topicmodel": 19, "toponim": 9, "toponym": [7, 8, 17], "toponym_column": 17, "toponym_name_column": 17, "toponym_type_column": 17, "train": [6, 8, 13, 18], "transform": [13, 18], "transport": 18, "true": [13, 17], "try": 0, "tupl": 6, "two": 2, "type": [5, 6, 8, 9, 10, 11, 12, 15, 17, 18, 20], "uniqu": [9, 17], "unknown_\u0436\u043a\u0445": 18, "unnecessari": 8, "up": 18, "updat": 17, "update_graph": 17, "upper": 18, "urban": [2, 3, 7, 14, 15, 18], "us": [0, 2, 4, 5, 7, 8, 9, 10, 11, 12, 13, 16, 17, 18, 19, 20], "user": [10, 16], "util": [5, 9, 10, 11, 18], "variou": 12, "veri": [3, 9], "vernacular": 2, "version": 2, "visual": [2, 3], "visualize_graph": 20, "vk": [10, 12], "vk_data_gett": 10, "vkontakt": 
12, "vkparser": 12, "wa": [10, 18], "wai": 6, "we": [0, 6, 9], "well": [2, 11, 12], "were": [3, 18], "weren": 6, "where": [5, 6], "which": [2, 3, 7, 13, 14, 16, 17, 18, 19, 20], "whose": [7, 10], "wide": 19, "within": [2, 8], "word": [6, 9, 15, 17], "word_info_column": 17, "words_scor": 17, "work": [9, 12], "workaround": 6, "x": 9, "y": 10, "you": 3, "zoom": 20}, "titles": ["Page Not Found", "Installation", "Introduction", "Main pipeline", "Welcome to Sloyka\u2019s documentation!", "GeoDataGetter", "Geocoder", "OtherGeoObjects", "StreetExtractor", "Streets", "VKparser", "Services extraction", "Data getter", "Emotion classifier", "Event detector", "Geocoding", "Regional activity", "Semantic graph", "Text Classifier", "Topic Modelling", "Graph visualization"], "titleterms": {"": 4, "Not": 0, "activ": 16, "advanc": [12, 15], "classifi": [13, 18], "data": 12, "detector": 14, "document": 4, "emot": 13, "event": 14, "exampl": 13, "extract": 11, "featur": 2, "found": 0, "gener": 4, "geocod": [6, 12, 15], "geodatagett": 5, "getter": 12, "graph": [17, 20], "indic": 4, "instal": 1, "introduct": 2, "main": [2, 3], "model": 19, "more": [12, 15], "othergeoobject": [7, 15], "page": 0, "pipelin": 3, "region": 16, "semant": 17, "servic": 11, "sloyka": 4, "street": 9, "streetextractor": [8, 15], "tabl": 4, "text": 18, "topic": 19, "visual": 20, "vkparser": 10, "welcom": 4}}) \ No newline at end of file diff --git a/docs/source/about/introduction.rst b/docs/source/about/introduction.rst index a31cce5..95c22a9 100644 --- a/docs/source/about/introduction.rst +++ b/docs/source/about/introduction.rst @@ -14,7 +14,7 @@ or to visualize already existing nodes and links and their further interpretatio SLOYKA also provides methods for modeling social risks regarding the emotional evaluation of mentioned places. 
Main features --------- +-------------- * Social media parsing: getting posts, comments and replys * City services and places extraction * Emotion and text classifiers categorizing diff --git a/docs/source/about/pipeline.rst b/docs/source/about/pipeline.rst index 0b6ee8a..c19fc24 100644 --- a/docs/source/about/pipeline.rst +++ b/docs/source/about/pipeline.rst @@ -1,5 +1,5 @@ Main pipeline -============ +=================== By selecting a limited urbanized area and a list of online communities in a social network, it is possible to run this dataset across all major library functions. However, in some cases, @@ -17,8 +17,7 @@ text data mentioning urban sites, otherwise the steps are very important - :ref: * Data tagging: Characterization of messages and urban objects, which can be carried out in any order: :ref:`emotion_classifier` :ref:`text_classifier` :ref:`city_services` :ref:`topic_modeler` -* Data modelling: Section consists of further synthesis of the obtained data, risk assessment and forecasting. -Each of the methods in this group requires certain labeling columns: :ref:`sem_graph` :ref:`regional_activity` +* Data modelling: Section consists of further synthesis of the obtained data, risk assessment and forecasting. 
Each of the methods in this group requires certain labeling columns: :ref:`sem_graph` :ref:`regional_activity` * Data visualization: The last step is applied to the already generated semantic graph - :ref:`graph_visualization` diff --git a/docs/source/conf.py b/docs/source/conf.py index ffb95b8..7fba588 100644 --- a/docs/source/conf.py +++ b/docs/source/conf.py @@ -7,7 +7,10 @@ # https://www.sphinx-doc.org/en/master/usage/configuration.html#project-information import os import sys -sys.path.insert(0, os.path.abspath('./sloyka')) + +folder = os.getcwd().split("\\docs")[0] +sys.path.append(folder) + project = 'sloyka' copyright = '2024, itmo_idu' diff --git a/docs/source/modules/GeoDataGetter.rst b/docs/source/modules/GeoDataGetter.rst index 359b53a..e007103 100644 --- a/docs/source/modules/GeoDataGetter.rst +++ b/docs/source/modules/GeoDataGetter.rst @@ -4,6 +4,6 @@ GeoDataGetter ========================== This class is used to retrieve geospatial data from OpenStreetMap (OSM) based on given OSM ID and tags. -.. autoclass:: sloyka.src.utils.data_getter.GeoDataGetter +.. autoclass:: sloyka.src.utils.data_getter.geo_data_getter.GeoDataGetter :members: :undoc-members: \ No newline at end of file diff --git a/docs/source/modules/Geocoder_special.rst b/docs/source/modules/Geocoder_special.rst index e2390e1..f8254b7 100644 --- a/docs/source/modules/Geocoder_special.rst +++ b/docs/source/modules/Geocoder_special.rst @@ -1,4 +1,5 @@ .. _Geocoder_special: + Geocoder ================== .. autoclass:: sloyka.src.geocoder.geocoder.Geocoder @@ -6,4 +7,4 @@ Geocoder :undoc-members: :no-members: run -Back to all :ref:`geocoder` \ No newline at end of file +Back to all :ref:`geocoder` diff --git a/docs/source/modules/OtherGeoObjects.rst b/docs/source/modules/OtherGeoObjects.rst index 332483c..061ae50 100644 --- a/docs/source/modules/OtherGeoObjects.rst +++ b/docs/source/modules/OtherGeoObjects.rst @@ -4,7 +4,7 @@ OtherGeoObjects ================== .. 
currentmodule:: sloyka.src.geocoder -.. autoclass:: city_objects_extractor.OtherGeoObjects +.. autoclass:: sloyka.src.geocoder.city_objects_extractor.OtherGeoObjects :members: :undoc-members: diff --git a/docs/source/modules/Streets.rst b/docs/source/modules/Streets.rst index 24453b1..f04cb31 100644 --- a/docs/source/modules/Streets.rst +++ b/docs/source/modules/Streets.rst @@ -3,6 +3,7 @@ Streets ========================== A class for working with street data. -.. autoclass:: sloyka.src.utils.data_getter.Streets + +.. autoclass:: sloyka.src.utils.data_getter.street_getter.Streets :members: - :undoc-members: \ No newline at end of file + :undoc-members: diff --git a/docs/source/modules/VKParser.rst b/docs/source/modules/VKParser.rst index 2e20215..4020c27 100644 --- a/docs/source/modules/VKParser.rst +++ b/docs/source/modules/VKParser.rst @@ -2,6 +2,6 @@ VKparser ========================== -.. autoclass:: sloyka.src.utils.data_getter.VKParser +.. autoclass:: sloyka.src.utils.data_getter.vk_data_getter.VKParser :members: :undoc-members: \ No newline at end of file diff --git a/docs/source/modules/city_services_extract.rst b/docs/source/modules/city_services_extract.rst index 6e1be51..38388fd 100644 --- a/docs/source/modules/city_services_extract.rst +++ b/docs/source/modules/city_services_extract.rst @@ -3,9 +3,10 @@ Services extraction ========================== The City_services class is designed to extract city service names from text using a string comparison algorithm, taking into account - the changing service endings in the text. Using the flair library, the City_services.run() method in messages extracts named entities - from the Sentence object as a list, as well as the most probable service type, and stores them in new columns of the original DataFrame(). +the changing service endings in the text. 
Using the flair library, the City_services.run() method in messages extracts named entities +from the Sentence object as a list, as well as the most probable service type, and stores them in new columns of the original DataFrame(). + .. automodule:: sloyka.src.utils.data_processing.city_services_extract - :members: - :undoc-members: + :members: + :undoc-members: \ No newline at end of file diff --git a/docs/source/modules/geocoder.rst b/docs/source/modules/geocoder.rst index b5a6f00..4ef23d8 100644 --- a/docs/source/modules/geocoder.rst +++ b/docs/source/modules/geocoder.rst @@ -1,14 +1,14 @@ .. _geocoder: ================== + Geocoding ================== -To perform classification with generalized linear models, see -:ref:`Geocoder_special`. +To perform classification with generalized linear models, see :ref:`Geocoder_special`. Geocoder ------------------- .. autoclass:: sloyka.src.geocoder.geocoder.Geocoder - :members: run + :members: run OtherGeoObjects --------------------- diff --git a/docs/source/modules/visualize_graph.rst b/docs/source/modules/visualize_graph.rst index 4db45d1..f5e1255 100644 --- a/docs/source/modules/visualize_graph.rst +++ b/docs/source/modules/visualize_graph.rst @@ -6,4 +6,4 @@ The function results in an interactive map, which will display graph nodes group Each cluster will have its own color and name. Node markers will have a random offset to avoid overlapping. .. automodule:: sloyka.src.visual.graph_visualization - :members: draw_graph_folium \ No newline at end of file + :members: visualize_graph \ No newline at end of file