Item request has been placed! ×
Item request cannot be made. ×
loading  Processing Request

PROVIDING RELEVANT ENTITIES FOR THEMATIC INVESTING USING NATURAL LANGUAGE PROCESSING AND NAMED-ENTITY RECOGNITION

Item request has been placed! ×
Item request cannot be made. ×
loading   Processing Request
  • Publication Date:
    August 1, 2024
  • معلومة اضافية
    • Document Number:
      20240257251
    • Appl. No:
      18/102005
    • Application Filed:
      January 26, 2023
    • نبذة مختصرة :
      Disclosed herein is a system for automatically providing a list of investments (e.g., securities such as individual stocks, Exchange-Traded Funds (ETFs)) to users (e.g., retail investors) using a machine learning model and a named-entity recognition algorithm. The machine learning model is generated and trained to implement natural language processing. Consequently, the system provides an opportunity for a non-sophisticated investor (e.g., a retail investor) to efficiently discover investments related to an investment theme. The system leverages a pipeline to generate a list of investments (e.g., ticker symbols for stocks or ETFs) that are the most relevant to and/or most impacted by the investment theme. The system can then display the list of investments to users.
    • Claim:
      1. A method comprising: receiving a query that identifies an investment theme; implementing, via a search engine, a search based on the investment theme, the search returning a plurality of network resources related to the investment theme; for a network resource of the plurality of network resources: applying, by a processing unit, a machine learning model that implements natural language processing to semantically understand content discussed in the network resource; determining, based on the application of the machine learning model that implements natural language processing, a score representing a degree to which the content discussed in the network resource is relevant to the investment theme; producing a ranked list of network resources by ranking the plurality of network resources based on the score determined for each network resource of the plurality of network resources; identifying a threshold number of top-ranked network resources from the ranked list of network resources; applying a named-entity recognition algorithm to the top-ranked network resources; recognizing, based on the application of the named-entity recognition algorithm, a plurality of entities mentioned in the top-ranked network resources, wherein each entity of the plurality of entities is associated with a tradeable security; and providing at least a portion of the plurality entities for display in association with the investment theme.
    • Claim:
      2. The method of claim 1, wherein the machine learning model that implements natural language processing is trained to semantically understand the investment theme.
    • Claim:
      3. The method of claim 1, wherein the named-entity recognition algorithm is configured to identify ticker symbols that represent the plurality of entities.
    • Claim:
      4. The method of claim 3, wherein providing at least the portion of the plurality entities for display in association with the investment theme comprises providing a portion of the ticker symbols that correspond to the portion of the plurality entities.
    • Claim:
      5. The method of claim 4, further comprising: retrieving metadata associated with the portion of ticker symbols, the metadata including at least one of a current price or a historic performance; and displaying the portion of the ticker symbols and the metadata in a frame via at least one of a new tab page of a browser, an operating system menu, or a side pane.
    • Claim:
      6. The method of claim 1, wherein a number of the plurality of network resources returned based on the search is limited to a threshold number.
    • Claim:
      7. The method of claim 1, further comprising: extracting, based on the application of the named-entity recognition algorithm, at least one parameter associated with each entity of the plurality of entities; and producing a ranked list of entities by ranking the plurality of entities based on the at least one parameter associated with each entity of the plurality of entities, wherein the portion of the plurality entities provided for display in association with the investment theme comprises a threshold number of top-ranked entities from the ranked list of entities.
    • Claim:
      8. The method of claim 7, wherein the at least one parameter comprises a number of times a corresponding entity is mentioned in the top-ranked network resources.
    • Claim:
      9. The method of claim 7, wherein the at least one parameter comprises an average position of a corresponding entity in an order of mentioned entities.
    • Claim:
      10. The method of claim 7, wherein the at least one parameter comprises an average number of units dedicated to a discussing a corresponding entity.
    • Claim:
      11. The method of claim 7, wherein the at least one parameter is a weighted parameter based on the score determined for each network resource of the top-ranked network resources.
    • Claim:
      12. The method of claim 1, further comprising, for a top-ranked network resource, performing content segmentation to identify a first content segment on which to focus the named-entity recognition algorithm and a second content segment which the named-entity recognition algorithm ignores.
    • Claim:
      13. The method of claim 1, wherein the plurality of network resources and the plurality of entities are related to a particular geographic region associated with the query.
    • Claim:
      14. A system comprising: a processing unit; and a computer-readable storage medium having computer-executable instructions stored thereupon, which, when executed by the processing unit, cause the processing unit to perform operations comprising: implementing, via a search engine, a search based on an investment theme, the search returning a plurality of network resources related to the investment theme; for a network resource of the plurality of network resources: applying a machine learning model that implements natural language processing to semantically understand content discussed in the network resource; determining, based on the application of the machine learning model that implements natural language processing, a score representing a degree to which the content discussed in the network resource is relevant to the investment theme; producing a ranked list of network resources by ranking the plurality of network resources based on the score determined for each network resource of the plurality of network resources; identifying a threshold number of top-ranked network resources from the ranked list of network resources; applying a named-entity recognition algorithm to the top-ranked network resources; and recognizing, based on the application of the named-entity recognition algorithm, a plurality of entities mentioned in the top-ranked network resources, wherein each entity of the plurality of entities is associated with a tradeable security.
    • Claim:
      15. The system of claim 14, wherein the machine learning model that implements natural language processing is trained to semantically understand the investment theme.
    • Claim:
      16. The system of claim 14, wherein the named-entity recognition algorithm is configured to identify ticker symbols that represent the plurality of entities.
    • Claim:
      17. The system of claim 14, wherein the operations further comprise: extracting, based on the application of the named-entity recognition algorithm, at least one parameter associated with each entity of the plurality of entities; and producing a ranked list of entities by ranking the plurality of entities based on the at least one parameter associated with each entity of the plurality of entities, wherein the portion of the plurality entities provided for display in association with the investment theme comprises a threshold number of top-ranked entities from the ranked list of entities.
    • Claim:
      18. The system of claim 17, wherein the at least one parameter comprises at least one of: a number of times a corresponding entity is mentioned in the top-ranked network resources; an average position of a corresponding entity in an order of mentioned entities; or an average number of units dedicated to a discussing a corresponding entity.
    • Claim:
      19. The system of claim 17, wherein the at least one parameter is a weighted parameter based on the score determined for each network resource of the top-ranked network resources.
    • Claim:
      20. A computer-readable storage medium having computer-executable instructions stored thereupon, which, when executed by a processing unit, cause the processing unit to perform operations comprising: implementing, via a search engine, a search based on an investment theme, the search returning a plurality of network resources related to the investment theme; for a network resource of the plurality of network resources: applying a machine learning model that implements natural language processing to semantically understand content discussed in the network resource; determining, based on the application of the machine learning model that implements natural language processing, a score representing a degree to which the content discussed in the network resource is relevant to the investment theme; producing a ranked list of network resources by ranking the plurality of network resources based on the score determined for each network resource of the plurality of network resources; identifying a threshold number of top-ranked network resources from the ranked list of network resources; applying a named-entity recognition algorithm to the top-ranked network resources; and recognizing, based on the application of the named-entity recognition algorithm, a plurality of entities mentioned in the top-ranked network resources, wherein each entity of the plurality of entities is associated with a tradeable security.
    • Current International Class:
      06; 06; 06; 06; 06
    • الرقم المعرف:
      edspap.20240257251