Connectome-Implementation-Team/author_name_disambiguation

Latest commit: 5bda9de · Jan 20, 2023

History

4 Commits
Jan 20, 2023
Jan 20, 2023
Jan 20, 2023
Jan 20, 2023
Jan 20, 2023

Repository files navigation

Author Name Disambiguation with Document Representation Learning and Knowledge Graph Embeddings

Prerequisites:

Hardware

Tested on the following configuration:

  • Ubuntu 18.04 and CUDA 10.1
  • 64 GB RAM
  • 500 GB SSD
  • 4x GeForce GTX TITAN X

How to run

  • Place the RESCS files in /input, then run the commands below in order.

# set environment in /disambiguation
cd $project_path/disambiguation
export PYTHONPATH="$project_path/disambiguation:$PYTHONPATH"

# convert files from /input to required format
python rescs_to_disambig.py

# extract entities from texts in BLINK, then store their embeddings using
python integrate_entities.py

# preprocess data: required for training and applying the model
python disambiguation/scripts/preprocessing.py

# train global model: save and reuse
python disambiguation/global_/gen_train_data.py
python disambiguation/global_/global_model.py

# local model: document network as input to GAE representation learning
python disambiguation/global_/prepare_local_data.py
python disambiguation/local/gae/train.py
python disambiguation/cluster_size/count.py
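
The commands above are meant to be run one after another, each depending on the output of the previous one. As a convenience, they could be wrapped in a small driver that stops at the first failing step. The following is only a sketch and not part of the repository: the names `PIPELINE_STEPS`, `build_env`, and `run_pipeline` are hypothetical, the script paths are copied verbatim from the commands above, and it assumes every step is launched from `$project_path/disambiguation` as the `cd` line suggests.

```python
import os
import subprocess
import sys
from pathlib import Path

# Script paths copied from the "How to run" commands, in the same order.
PIPELINE_STEPS = [
    "rescs_to_disambig.py",
    "integrate_entities.py",
    "disambiguation/scripts/preprocessing.py",
    "disambiguation/global_/gen_train_data.py",
    "disambiguation/global_/global_model.py",
    "disambiguation/global_/prepare_local_data.py",
    "disambiguation/local/gae/train.py",
    "disambiguation/cluster_size/count.py",
]


def build_env(project_path, base_env):
    """Copy base_env and prepend <project_path>/disambiguation to PYTHONPATH."""
    env = dict(base_env)
    root = str(Path(project_path) / "disambiguation")
    existing = env.get("PYTHONPATH", "")
    env["PYTHONPATH"] = root + os.pathsep + existing if existing else root
    return env


def run_pipeline(project_path, base_env, runner=subprocess.run):
    """Run every step from <project_path>/disambiguation, stopping on failure."""
    cwd = Path(project_path) / "disambiguation"
    env = build_env(project_path, base_env)
    for script in PIPELINE_STEPS:
        print(f"running {script}")
        # With check=True, subprocess.run raises CalledProcessError on a
        # non-zero exit code, so later steps never see incomplete output.
        runner([sys.executable, script], cwd=cwd, env=env, check=True)
```

Assuming `project_path` is exported in the shell, the driver could be invoked with `run_pipeline(os.environ["project_path"], os.environ)`. The `runner` parameter exists only so the step order can be checked without executing the scripts.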
