Added FewShotAssertionClassifier #279

Merged: 12 commits, Sep 16, 2024

Large diffs are not rendered by default.

2,349 changes: 2,348 additions & 1 deletion examples/colab/Training/named_entity_recognition/NLU_training_NER_demo.ipynb

Large diffs are not rendered by default.

892 changes: 891 additions & 1 deletion examples/colab/Training/part_of_speech/NLU_training_POS_demo.ipynb

Large diffs are not rendered by default.

48 changes: 0 additions & 48 deletions examples/colab/healthcare/assertion/assertion_overview.ipynb
@@ -32,31 +32,6 @@
"| English | [assert.large](https://nlp.johnsnowlabs.com/2020/05/21/assertion_dl_large_en.html) | [assertion_dl_large](https://nlp.johnsnowlabs.com/2020/05/21/assertion_dl_large_en.html) |"
]
},
{
"cell_type": "code",
"execution_count": 1,
"metadata": {
"id": "OMiHjCtQhF_r"
},
"outputs": [],
"source": [
"# Install the johnsnowlabs library\n",
"! pip install -q johnsnowlabs==5.1.0"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"id": "fLmvgQZoRucE"
},
"outputs": [],
"source": [
"from google.colab import files\n",
"print('Please Upload your John Snow Labs License using the button below')\n",
"license_keys = files.upload()"
]
},
{
"cell_type": "code",
"execution_count": null,
@@ -71,29 +46,6 @@
"nlp.install()"
]
},
{
"cell_type": "code",
"execution_count": 4,
"metadata": {
"colab": {
"base_uri": "https://localhost:8080/"
},
"id": "cPT4lI0x8ZOj",
"outputId": "13a9978d-8d0d-4672-b70e-eb574a66f303"
},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"👌 Launched \u001b[92mcpu optimized\u001b[39m session with with: 🚀Spark-NLP==5.1.0, 💊Spark-Healthcare==5.1.0, running on ⚡ PySpark==3.1.2\n"
]
}
],
"source": [
"spark=nlp.start()"
]
},
{
"cell_type": "markdown",
"metadata": {
@@ -0,0 +1,171 @@
{
"cells": [
{
"cell_type": "markdown",
"source": [
"![JohnSnowLabs](https://nlp.johnsnowlabs.com/assets/images/logo.png)\n",
"\n",
"[![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/JohnSnowLabs/nlu/blob/master/examples/colab/healthcare/assertion/few_shot_assertion_classifier.ipynb)\n"
],
"metadata": {
"collapsed": false
}
},
{
"cell_type": "markdown",
"source": [
"### Assign assertion status to clinical entities\n",
"\n",
"Classes are: `absent`, `present`, `conditional`, `associated_with_someone_else`, `hypothetical`, `possible`"
],
"metadata": {
"collapsed": false
}
},
{
"cell_type": "markdown",
"source": [
"## Starting the session"
],
"metadata": {
"collapsed": false
}
},
{
"cell_type": "code",
"execution_count": null,
"outputs": [],
"source": [
"from johnsnowlabs import nlp\n",
"nlp.install()\n",
"nlp.start()"
],
"metadata": {
"collapsed": false
}
},
{
"cell_type": "markdown",
"source": [
"## Loading the data"
],
"metadata": {
"collapsed": false
}
},
{
"cell_type": "code",
"execution_count": 2,
"outputs": [],
"source": [
"data = (\n",
" \"\"\"Includes hypertension and chronic obstructive pulmonary disease.\"\"\"\n",
" )"
],
"metadata": {
"collapsed": false,
"ExecuteTime": {
"end_time": "2024-09-09T14:02:59.655452100Z",
"start_time": "2024-09-09T14:02:59.634164800Z"
}
}
},
{
"cell_type": "markdown",
"source": [
"## [Assign assertion status to clinical entities](https://nlp.johnsnowlabs.com/2024/04/15/assertion_fewshotclassifier_en.html)"
],
"metadata": {
"collapsed": false
}
},
{
"cell_type": "code",
"execution_count": 6,
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"Warning::Spark Session already created, some configs may not take.\n",
"Warning::Spark Session already created, some configs may not take.\n",
"assertion_fewshotclassifier download started this may take some time.\n",
"[OK!]\n",
"sentence_detector_dl download started this may take some time.\n",
"Approximate size to download 354.6 KB\n",
"[OK!]\n",
"pos_anc download started this may take some time.\n",
"Approximate size to download 3.9 MB\n",
"[OK!]\n",
"Warning::Spark Session already created, some configs may not take.\n"
]
}
],
"source": [
"res = nlp.load(\"en.few_assert_shot_classifier\").predict(data, output_level='chunk')"
],
"metadata": {
"collapsed": false,
"ExecuteTime": {
"end_time": "2024-09-09T14:22:02.470834500Z",
"start_time": "2024-09-09T14:21:37.585408700Z"
}
}
},
{
"cell_type": "code",
"execution_count": 7,
"outputs": [
{
"data": {
"text/plain": " assertion assertion_confidence \\\n0 present 1.0 \n0 present 1.0 \n\n document matched_pos \\\n0 Includes hypertension and chronic obstructive ... hypertension \n0 Includes hypertension and chronic obstructive ... disease \n\n pos \n0 [VBZ, NN, CC, JJ, JJ, JJ, NN, .] \n0 [VBZ, NN, CC, JJ, JJ, JJ, NN, .] ",
"text/html": "<div>\n<style scoped>\n .dataframe tbody tr th:only-of-type {\n vertical-align: middle;\n }\n\n .dataframe tbody tr th {\n vertical-align: top;\n }\n\n .dataframe thead th {\n text-align: right;\n }\n</style>\n<table border=\"1\" class=\"dataframe\">\n <thead>\n <tr style=\"text-align: right;\">\n <th></th>\n <th>assertion</th>\n <th>assertion_confidence</th>\n <th>document</th>\n <th>matched_pos</th>\n <th>pos</th>\n </tr>\n </thead>\n <tbody>\n <tr>\n <th>0</th>\n <td>present</td>\n <td>1.0</td>\n <td>Includes hypertension and chronic obstructive ...</td>\n <td>hypertension</td>\n <td>[VBZ, NN, CC, JJ, JJ, JJ, NN, .]</td>\n </tr>\n <tr>\n <th>0</th>\n <td>present</td>\n <td>1.0</td>\n <td>Includes hypertension and chronic obstructive ...</td>\n <td>disease</td>\n <td>[VBZ, NN, CC, JJ, JJ, JJ, NN, .]</td>\n </tr>\n </tbody>\n</table>\n</div>"
},
"execution_count": 7,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"res"
],
"metadata": {
"collapsed": false,
"ExecuteTime": {
"end_time": "2024-09-09T14:22:02.515663Z",
"start_time": "2024-09-09T14:22:02.478142Z"
}
}
},
{
"cell_type": "code",
"execution_count": null,
"outputs": [],
"source": [],
"metadata": {
"collapsed": false
}
}
],
"metadata": {
"kernelspec": {
"display_name": "Python 3",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 2
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython2",
"version": "2.7.6"
}
},
"nbformat": 4,
"nbformat_minor": 0
}
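A possible follow-up to the notebook above, offered as a sketch rather than as part of this PR: once chunk-level predictions are available, they can be grouped by assertion label and checked against the six classes the notebook lists. The rows below copy the `matched_pos`, `assertion`, and `assertion_confidence` values from the rendered output; the helper name `group_chunks_by_assertion` and its `min_confidence` parameter are hypothetical and do not exist in NLU.

```python
# Illustrative only: plain-Python post-processing of chunk-level assertion rows.
# The helper name and the min_confidence parameter are assumptions for this
# sketch; they are not part of the NLU or johnsnowlabs APIs.
from collections import defaultdict

# The six assertion classes listed in the notebook's markdown cell.
ASSERTION_CLASSES = {
    "absent",
    "present",
    "conditional",
    "associated_with_someone_else",
    "hypothetical",
    "possible",
}

# Rows mirroring the values shown in the notebook's output table.
rows = [
    {"matched_pos": "hypertension", "assertion": "present", "assertion_confidence": 1.0},
    {"matched_pos": "disease", "assertion": "present", "assertion_confidence": 1.0},
]


def group_chunks_by_assertion(rows, min_confidence=0.5):
    """Map each assertion label to the chunks predicted with that label."""
    grouped = defaultdict(list)
    for row in rows:
        label = row["assertion"]
        # Fail loudly on a label outside the documented class list.
        if label not in ASSERTION_CLASSES:
            raise ValueError(f"unexpected assertion label: {label!r}")
        # Keep only predictions at or above the confidence threshold.
        if row["assertion_confidence"] >= min_confidence:
            grouped[label].append(row["matched_pos"])
    return dict(grouped)


print(group_chunks_by_assertion(rows))
# → {'present': ['hypertension', 'disease']}
```

If `res` is a pandas DataFrame, as the rendered output suggests, `res.to_dict("records")` should yield rows in this shape, though with the extra `document` and `pos` columns included.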
@@ -54,31 +54,6 @@
"| English | classify.pico | classifierdl_pico_biobert |"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"id": "OMiHjCtQhF_r"
},
"outputs": [],
"source": [
"# Install the johnsnowlabs library\n",
"! pip install -q johnsnowlabs==5.1.0"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"id": "WzQxxFcyBWJ6"
},
"outputs": [],
"source": [
"from google.colab import files\n",
"print('Please Upload your John Snow Labs License using the button below')\n",
"license_keys = files.upload()"
]
},
{
"cell_type": "code",
"execution_count": null,
@@ -93,17 +68,6 @@
"nlp.install()"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"id": "mxj5X9miBZoX"
},
"outputs": [],
"source": [
"spark=nlp.start()"
]
},
{
"cell_type": "markdown",
"metadata": {
36 changes: 0 additions & 36 deletions examples/colab/healthcare/drug_normalization/drug_norm.ipynb
@@ -19,31 +19,6 @@
"See the [install docs](https://nlu.johnsnowlabs.com/docs/en/install#super-quickstart-on-google-colab-or-kaggle)\n"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"id": "M4SGENPXisd7"
},
"outputs": [],
"source": [
"# Install the johnsnowlabs library\n",
"! pip install -q johnsnowlabs==5.1.0"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"id": "DMHSgj8EDlUt"
},
"outputs": [],
"source": [
"from google.colab import files\n",
"print('Please Upload your John Snow Labs License using the button below')\n",
"license_keys = files.upload()"
]
},
{
"cell_type": "code",
"execution_count": null,
@@ -58,17 +33,6 @@
"nlp.install()"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"id": "IxMbCELVD5jj"
},
"outputs": [],
"source": [
"spark=nlp.start()"
]
},
{
"cell_type": "markdown",
"metadata": {
1,083 changes: 1,082 additions & 1 deletion examples/colab/healthcare/entity_resolution/NLU_atc_resolver_pipeline.ipynb

Large diffs are not rendered by default.
