Discussion: Making Core Search logic more modular #10804

whatisgalen · 2024-04-19T01:26:47Z

As a developer, you can create or customize search components as you like; you just have to set the sortorder correctly if one search filter/component depends on another. Where that customization ends is what I would call the "core search logic". The logic that exists outside of any search component governs the following:

what properties of the document should be included
what exactly should happen when the search query is executed, and whether there should be only one query (for example)
how many results should be returned (which might later be paginated by the paging-filter)
how localized descriptors should be applied
what properties are on the response object

It's conceivable that one or all of those could be implemented differently by a developer, but doing so requires overriding methods like search_results(), export_results() or the entire SearchView. It would be easier, and more modular, to package the core logic as essentially a core search component.
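
A minimal sketch of what "core logic as a search component" could look like. The hook names append_dsl, execute_query and post_search_hook come from this discussion, but their signatures here, the SearchContext container, the run_search helper and the in-memory document list (standing in for a real index) are all assumptions made for illustration, not the Arches API:

```python
from dataclasses import dataclass, field


@dataclass
class SearchContext:
    """Hypothetical container handed to every filter hook."""
    query: dict = field(default_factory=dict)     # query being assembled
    response: dict = field(default_factory=dict)  # response being assembled


class BaseSearchFilter:
    """Hypothetical base class: every hook defaults to a no-op."""
    sortorder = 0

    def append_dsl(self, ctx):
        pass

    def execute_query(self, ctx):
        pass

    def post_search_hook(self, ctx):
        pass


class CoreSearchFilter(BaseSearchFilter):
    """Owns query execution, so it can be swapped like any other filter."""
    sortorder = 0

    def __init__(self, documents):
        self.documents = documents  # stand-in for a search index

    def execute_query(self, ctx):
        term = ctx.query.get("term", "")
        hits = [d for d in self.documents if term in d["displayname"]]
        ctx.response["results"] = {"hits": {"hits": hits}}


class TermFilter(BaseSearchFilter):
    sortorder = 1

    def __init__(self, term):
        self.term = term

    def append_dsl(self, ctx):
        ctx.query["term"] = self.term


class PagingFilter(BaseSearchFilter):
    """Depends on the core filter having populated the results."""
    sortorder = 2

    def __init__(self, page_size):
        self.page_size = page_size

    def post_search_hook(self, ctx):
        hits = ctx.response["results"]["hits"]["hits"]
        ctx.response["results"]["hits"]["hits"] = hits[: self.page_size]


def run_search(filters):
    """Run each phase across all filters, ordered by sortorder."""
    ctx = SearchContext()
    ordered = sorted(filters, key=lambda f: f.sortorder)
    for hook in ("append_dsl", "execute_query", "post_search_hook"):
        for search_filter in ordered:
            getattr(search_filter, hook)(ctx)
    return ctx.response


docs = [
    {"displayname": "Old Mill"},
    {"displayname": "Town Hall"},
    {"displayname": "Mill Pond"},
]
response = run_search([PagingFilter(1), TermFilter("Mill"), CoreSearchFilter(docs)])
print(response["results"]["hits"]["hits"])
```

With this shape, replacing the query execution means registering a different filter in place of CoreSearchFilter rather than overriding a view method.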

Here's some of that logic:

    dsl.include("graph_id")
    dsl.include("root_ontology_class")
    dsl.include("resourceinstanceid")
    dsl.include("points")
    dsl.include("permissions.users_without_read_perm")
    dsl.include("permissions.users_without_edit_perm")
    dsl.include("permissions.users_without_delete_perm")
    dsl.include("permissions.users_with_no_access")
    dsl.include("geometries")
    dsl.include("displayname")
    dsl.include("displaydescription")
    dsl.include("map_popup")
    dsl.include("provisional_resource")
    if load_tiles:
        dsl.include("tiles")
    if for_export or pages:
        results = dsl.search(index=RESOURCES_INDEX, scroll="1m")
        scroll_id = results["_scroll_id"]
        if not pages:
            if total <= settings.SEARCH_EXPORT_LIMIT:
                pages = (total // settings.SEARCH_RESULT_LIMIT) + 1
            if total > settings.SEARCH_EXPORT_LIMIT:
                pages = int(settings.SEARCH_EXPORT_LIMIT // settings.SEARCH_RESULT_LIMIT) - 1
        for page in range(int(pages)):
            results_scrolled = dsl.se.es.scroll(scroll_id=scroll_id, scroll="1m")
            results["hits"]["hits"] += results_scrolled["hits"]["hits"]
    else:
        results = dsl.search(index=RESOURCES_INDEX, id=resourceinstanceid)

    ret = {}
    if results is not None:
        if "hits" not in results:
            if "docs" in results:
                results = {"hits": {"hits": results["docs"]}}
            else:
                results = {"hits": {"hits": [results]}}
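
To make the two less obvious parts of that excerpt concrete, here is a small standalone sketch of the page-count arithmetic and the response-shape normalization. The function names and the limit values are hypothetical (they are not Arches defaults), and returning None for a None input is my own choice since the excerpt does not show that branch:

```python
def export_page_count(total, result_limit, export_limit):
    """Mirror the excerpt's arithmetic for how many scroll requests to make."""
    if total <= export_limit:
        return (total // result_limit) + 1
    return (export_limit // result_limit) - 1


def normalize_hits(results):
    """Coerce the three shapes handled in the excerpt into {"hits": {"hits": [...]}}."""
    if results is None:
        return None
    if "hits" in results:
        return results
    if "docs" in results:
        return {"hits": {"hits": results["docs"]}}
    return {"hits": {"hits": [results]}}


# Hypothetical limits, chosen only to keep the arithmetic easy to follow.
RESULT_LIMIT = 10_000
EXPORT_LIMIT = 100_000

small_export = export_page_count(25_000, RESULT_LIMIT, EXPORT_LIMIT)
large_export = export_page_count(250_000, RESULT_LIMIT, EXPORT_LIMIT)
single_doc = normalize_hits({"_id": "abc"})
```

As I read the excerpt, the initial dsl.search() call already returns the first batch, and the loop then issues one scroll request per page on top of that.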

A few ways to implement what I'm talking about would be to:

let the search filters determine which document properties/mappings to include or exclude
pass the response object ret, instead of just the results object, to each filter's post_search_hook
let the search filters determine how many results to collect from the query, and control other mechanisms such as search result caching
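
The first two ideas could be sketched roughly as follows. Everything here is hypothetical: the included_properties method, the collect_includes and project helpers, the filter classes and the response_object keys are illustrative names, not existing Arches code:

```python
class BaseFilter:
    """Hypothetical base class, not the Arches BaseSearchFilter."""

    def included_properties(self):
        """Document properties this filter needs in each result."""
        return set()

    def post_search_hook(self, response_object):
        """Receives the whole response object, not just the raw results."""


class MapFilter(BaseFilter):
    def included_properties(self):
        return {"points", "geometries", "map_popup"}

    def post_search_hook(self, response_object):
        hits = response_object["results"]["hits"]["hits"]
        # A filter can attach its own key to the response.
        response_object["map-filter"] = {
            "hits_with_points": sum(1 for hit in hits if hit.get("points"))
        }


class DescriptorFilter(BaseFilter):
    def included_properties(self):
        return {"displayname", "displaydescription"}


def collect_includes(filters):
    """Union of the properties requested by every active filter."""
    props = set()
    for search_filter in filters:
        props |= search_filter.included_properties()
    return sorted(props)


def project(doc, props):
    """Keep only the requested properties, mimicking a source include list."""
    return {key: doc[key] for key in props if key in doc}


filters = [MapFilter(), DescriptorFilter()]
props = collect_includes(filters)

docs = [{"displayname": "Old Mill", "points": [[0, 0]], "tiles": ["tile-1"]}]
response_object = {"results": {"hits": {"hits": [project(d, props) for d in docs]}}}
for search_filter in filters:
    search_filter.post_search_hook(response_object)
```

Here "tiles" is dropped because no active filter asked for it, which is the same effect the hard-coded dsl.include() calls have today, only driven by the filters themselves.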

Obviously, if a developer deviates too much in how their custom search component handles the query execution and response, other parts of Arches that use search could break. However, I don't think that's a good argument against customization; it just implies the need for more streamlined guidance on search component development in the Arches documentation.

The other implication of modularizing the core search logic on the backend is that the frontend would also need to be more aware of, and responsive to, which search filters are in play, rather than hard-coding them. For example, the search-results component references specific properties it expects from each result. It would be more modular for it to interrogate the other search-filters (which it already could do, as term-filter and others do) and determine what properties it has access to from each search-filter.

To see how this could be implemented, take a look at my PR.

chiatt added this to pipeline Apr 19, 2024

whatisgalen added a commit that referenced this issue Apr 19, 2024

creates empty method execute_query in base search filter, re #10804

f5b8466

whatisgalen added a commit that referenced this issue Apr 19, 2024

includes request as pos arg on base search filter append_dsl method, re #10804

0cd8638

whatisgalen added a commit that referenced this issue Apr 19, 2024

includes search_res_obj, request, and response_obj as pos args of execute_query in base search filter, re #10804

dfa1372

whatisgalen added a commit that referenced this issue Apr 19, 2024

re-arrange pos args in base filter methods, re #10804

55361d3

whatisgalen added a commit that referenced this issue Apr 19, 2024

removes core search logic out of search_results method, replaced by search_filter.execute_query, re #10804

b922d21

whatisgalen added a commit that referenced this issue Apr 19, 2024

moves core search logic from search_results into own search_component, CoreSearchFilter, re #10804

138e6dd

whatisgalen added a commit that referenced this issue Apr 19, 2024

moves localize_descriptors logic into own search_component, LocalizeResultDescriptors, re #10804

cdc9eec

whatisgalen added a commit that referenced this issue Apr 19, 2024

adds details dict to core_search, re #10804

7550890

whatisgalen added a commit that referenced this issue Apr 19, 2024

adds details dict to localize_result_descriptors, re #10804

cba2d87

whatisgalen added a commit that referenced this issue Apr 19, 2024

commits migration for 2 new core search filters, re #10804

cd11e0f

whatisgalen added a commit that referenced this issue Apr 19, 2024

includes components for core-search filters in migration, re #10804

2f6cf77

whatisgalen added a commit that referenced this issue Apr 19, 2024

commits lightweight ko components for new core-search components, re #10804

7611866

whatisgalen added a commit that referenced this issue Apr 19, 2024

commits empty templates for core-search backend filters, re #10804

cada203

whatisgalen added a commit that referenced this issue Apr 19, 2024

refactors signature for append_dsl() and post_search_hook() to include request pos arg, re #10804

4426e32

whatisgalen mentioned this issue Apr 19, 2024

10804 componentizing core search #10807

Merged

6 tasks

whatisgalen added a commit that referenced this issue Apr 19, 2024

Revert "refactors signature for append_dsl() and post_search_hook() to include request pos arg, re #10804" This reverts commit 4426e32. reverts because already has self.request, re #10804

0c97cd9

whatisgalen added a commit that referenced this issue Apr 19, 2024

rm redundant request pos arg, use self.request in search_filter factory, re #10804

c40a676

whatisgalen added a commit that referenced this issue Apr 19, 2024

minor tweak to migration sortorder update, re #10804

1b9da96

whatisgalen added a commit that referenced this issue Apr 19, 2024

rm unused imports in migration, re #10804

e703699

whatisgalen added a commit that referenced this issue Apr 19, 2024

commits core-search filters as request kwargs where search_results manually called on backend, re #10804

f1a5646

whatisgalen added a commit that referenced this issue Apr 19, 2024

manually set core-search kwargs on request obj, re #10804

b8bc867

whatisgalen added a commit that referenced this issue Apr 19, 2024

conforms references to results to response_obj.results, re #10804

e5b2adc

whatisgalen added a commit that referenced this issue Apr 19, 2024

changes pos arg results -> response_object in base.py search-filter, re #10804

d034f04

whatisgalen added a commit that referenced this issue Apr 19, 2024

fixes bug in declaring response_object dict, re #10804

3f79e8f

whatisgalen added a commit that referenced this issue Apr 19, 2024

type checks response_object, leaves unformatted if already json, re #10804

72bd61d

whatisgalen added a commit that referenced this issue Apr 19, 2024

format core-search query kwargs in search_test to json str, re #10804

d044f3c

whatisgalen added a commit that referenced this issue Apr 19, 2024

rm redundant JSONResponse, re #10804

8b5c9bd

whatisgalen added a commit that referenced this issue Apr 19, 2024

includes componentpath, filter type in core-search filters details objs, re #10804

c2d4aad

whatisgalen added a commit that referenced this issue Apr 19, 2024

reset response_object results equal to None, re #10804

4b2f7a2

whatisgalen added a commit that referenced this issue Jul 30, 2024

refactor this.filters --> this.searchComponentVms for explicit clarity, re #10804

f5e5acb

whatisgalen added a commit that referenced this issue Jul 30, 2024

include search-export in searchComponentVms refactor, re #10804

6ae2293

whatisgalen added a commit that referenced this issue Aug 7, 2024

rename refs to search-logic to search-view for type, re #10804

9658f4d

whatisgalen added a commit that referenced this issue Aug 8, 2024

fix merge conflict with latest from dev/7.6.x, re #10804

0c36502

whatisgalen added a commit that referenced this issue Aug 8, 2024

replace usage of this.getFilter(term-filter) with getFilterByType, re #10804

a2eeb4a

whatisgalen added a commit that referenced this issue Aug 12, 2024

rename get_searchview_components -> get_searchview_filters, re #10804

fc69927

whatisgalen added a commit that referenced this issue Aug 12, 2024

rename required_search_components -> required_search_filters, re #10804

b816e36

whatisgalen added a commit that referenced this issue Aug 12, 2024

refactor search_query_dict methods out of base filter into base searchview; rename _component -> _filter, re #10804

da76681

whatisgalen added a commit that referenced this issue Aug 12, 2024

make search_component.componentpath nullable in postgres to indicate backend only, re #10804

d52e1d5

whatisgalen added a commit that referenced this issue Aug 12, 2024

rename filtersList -> searchFilterConfigs, centralize state mgmt of whether filters loaded, re #10804

3a0d74a

whatisgalen added a commit that referenced this issue Aug 12, 2024

rename get_searchview_component -> get_searchview_instance, re #10804

80c1da1

whatisgalen added a commit that referenced this issue Aug 12, 2024

include layoutType on all filters with ko component, re #10804

2aef800

whatisgalen added a commit that referenced this issue Aug 13, 2024

rename get_searchview_component_name -> get_searchview_name re #10804

df715eb

whatisgalen added a commit that referenced this issue Aug 13, 2024

rename searchComponentVms -> searchFilterVms re #10804

0b013e7

whatisgalen added a commit that referenced this issue Aug 13, 2024

move defaultQuery into constructor of base-search-view, rm from standard, rm console log, re #10804

c6fe910

whatisgalen added a commit that referenced this issue Aug 14, 2024

rm redundant getFilter from base-search-view; add unwrap arg in getFilter re #10804

beaa8f7

whatisgalen added a commit that referenced this issue Aug 14, 2024

refactor search_component types to componentname-type concat, update refs re #10804

a10ed1c

whatisgalen added a commit that referenced this issue Aug 14, 2024

rm stray comma in migration, re #10804

f5e371a

whatisgalen added a commit that referenced this issue Aug 14, 2024

include default int value for filters sans layoutSortorder, re #10804

86eb048

whatisgalen added a commit that referenced this issue Aug 14, 2024

commit commented test logs for ko cmpnt load, re #10804

cbce600

whatisgalen added a commit that referenced this issue Aug 14, 2024

only make prov-filter invisble, not un-rendered if user not reviewer, re #10804

ddf1291

whatisgalen added a commit that referenced this issue Aug 15, 2024

fix migration reversion of old search_component type values, re #10804

fda66ea

whatisgalen added a commit that referenced this issue Aug 15, 2024

rm unused componentName assignment, re #10804

60c1207

whatisgalen added a commit that referenced this issue Aug 20, 2024

move called out global var back into if block, re #10804

811990f

apeters added a commit that referenced this issue Aug 21, 2024

fix migration merge issue, re #10804

185dd09

whatisgalen added a commit that referenced this issue Aug 21, 2024

cleanup migration, re #10804

4deaaa9

whatisgalen added a commit that referenced this issue Aug 21, 2024

cleanup imports, details objects in search component class files, re #10804

d458d12

whatisgalen added a commit that referenced this issue Aug 21, 2024

rm unused imports, consolidate models imports, re #10804

6aa5c2c

whatisgalen closed this as completed Aug 22, 2024

github-project-automation bot moved this to ✅ Done in pipeline Aug 22, 2024
