Overview
Brought to you by YData
Dataset statistics
| Number of variables | 22 |
|---|---|
| Number of observations | 6934 |
| Missing cells | 58404 |
| Missing cells (%) | 38.3% |
| Duplicate rows | 383 |
| Duplicate rows (%) | 5.5% |
| Total size in memory | 5.6 MiB |
| Average record size in memory | 854.2 B |
Variable types
| Text | 4 |
|---|---|
| Numeric | 6 |
| Categorical | 11 |
| DateTime | 1 |
| Dataset has 383 (5.5%) duplicate rows | Duplicates |
Age at Which Sequencing was Reported (Years) is highly overall correlated with age_at_diagnosis and 2 other fields | High correlation |
Metastatic Site is highly overall correlated with sample_type and 2 other fields | High correlation |
age_at_diagnosis is highly overall correlated with Age at Which Sequencing was Reported (Years) | High correlation |
mitotic_rate is highly overall correlated with source and 1 other fields | High correlation |
os_status is highly overall correlated with source | High correlation |
primary_site is highly overall correlated with sample_type and 1 other fields | High correlation |
race is highly overall correlated with source | High correlation |
sample_coverage is highly overall correlated with source and 1 other fields | High correlation |
sample_type is highly overall correlated with Metastatic Site and 3 other fields | High correlation |
source is highly overall correlated with Age at Which Sequencing was Reported (Years) and 10 other fields | High correlation |
stage_at_diagnosis is highly overall correlated with source | High correlation |
treatment is highly overall correlated with sample_type and 1 other fields | High correlation |
tumor_grade is highly overall correlated with Age at Which Sequencing was Reported (Years) and 5 other fields | High correlation |
tumor_purity is highly overall correlated with tumor_grade | High correlation |
tumor_size is highly overall correlated with source and 1 other fields | High correlation |
treatment_response is highly imbalanced (55.5%) | Imbalance |
primary_site is highly imbalanced (50.1%) | Imbalance |
os_status is highly imbalanced (51.2%) | Imbalance |
sample_id has 5236 (75.5%) missing values | Missing |
Age at Which Sequencing was Reported (Years) has 6064 (87.5%) missing values | Missing |
tumor_size has 6064 (87.5%) missing values | Missing |
mitotic_rate has 6111 (88.1%) missing values | Missing |
Metastatic Site has 6064 (87.5%) missing values | Missing |
tumor_purity has 6042 (87.1%) missing values | Missing |
sample_coverage has 6064 (87.5%) missing values | Missing |
os_months has 3687 (53.2%) missing values | Missing |
treatment_start has 6306 (90.9%) missing values | Missing |
os_status has 3636 (52.4%) missing values | Missing |
mutated_genes has 3130 (45.1%) missing values | Missing |
Reproduction
| Analysis started | 2025-08-29 21:01:41.494210 |
|---|---|
| Analysis finished | 2025-08-29 21:01:48.916942 |
| Duration | 7.42 seconds |
| Software version | ydata-profiling vv4.16.1 |
| Download configuration | config.json |
Variables
sample_id
Text
Missing 
| Distinct | 675 |
|---|---|
| Distinct (%) | 39.8% |
| Missing | 5236 |
| Missing (%) | 75.5% |
| Memory size | 268.3 KiB |
Length
| Max length | 17 |
|---|---|
| Median length | 17 |
| Mean length | 14.021201 |
| Min length | 10 |
Unique
| Unique | 337 ? |
|---|---|
| Unique (%) | 19.8% |
Sample
| 1st row | P-0000134-T02-IM3 |
|---|---|
| 2nd row | P-0000134-T02-IM3 |
| 3rd row | P-0000306-T01-IM3 |
| 4th row | P-0000501-T02-IM3 |
| 5th row | P-0000501-T02-IM3 |
| Value | Count | Frequency (%) |
| p-0001315-t01-im3 | 66 | 3.9% |
| p-0002672-t01-im3 | 32 | 1.9% |
| p-0013110-t01-im5 | 30 | 1.8% |
| p-0012178-t01-im5 | 24 | 1.4% |
| p-0005066-t01-im5 | 21 | 1.2% |
| p-0012564-t01-im5 | 20 | 1.2% |
| p-0013393-t01-im5 | 20 | 1.2% |
| p-0002409-t01-im3 | 18 | 1.1% |
| p-0005330-t02-im5 | 18 | 1.1% |
| p-0004937-t01-im5 | 18 | 1.1% |
| Other values (665) | 1431 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 4018 | |
| - | 2610 | |
| 1 | 2552 | |
| S | 1656 | 7.0% |
| 5 | 1346 | 5.7% |
| 3 | 1322 | 5.6% |
| 2 | 1278 | 5.4% |
| 4 | 964 | 4.0% |
| P | 870 | 3.7% |
| T | 870 | 3.7% |
| Other values (8) | 6322 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 23808 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 4018 | |
| - | 2610 | |
| 1 | 2552 | |
| S | 1656 | 7.0% |
| 5 | 1346 | 5.7% |
| 3 | 1322 | 5.6% |
| 2 | 1278 | 5.4% |
| 4 | 964 | 4.0% |
| P | 870 | 3.7% |
| T | 870 | 3.7% |
| Other values (8) | 6322 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 23808 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 4018 | |
| - | 2610 | |
| 1 | 2552 | |
| S | 1656 | 7.0% |
| 5 | 1346 | 5.7% |
| 3 | 1322 | 5.6% |
| 2 | 1278 | 5.4% |
| 4 | 964 | 4.0% |
| P | 870 | 3.7% |
| T | 870 | 3.7% |
| Other values (8) | 6322 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 23808 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 4018 | |
| - | 2610 | |
| 1 | 2552 | |
| S | 1656 | 7.0% |
| 5 | 1346 | 5.7% |
| 3 | 1322 | 5.6% |
| 2 | 1278 | 5.4% |
| 4 | 964 | 4.0% |
| P | 870 | 3.7% |
| T | 870 | 3.7% |
| Other values (8) | 6322 |
patient_id
Text
| Distinct | 2862 |
|---|---|
| Distinct (%) | 41.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 381.7 KiB |
Length
| Max length | 36 |
|---|---|
| Median length | 9 |
| Mean length | 7.3442457 |
| Min length | 3 |
Unique
| Unique | 2558 ? |
|---|---|
| Unique (%) | 36.9% |
Sample
| 1st row | P-0000134 |
|---|---|
| 2nd row | P-0000134 |
| 3rd row | P-0000306 |
| 4th row | P-0000501 |
| 5th row | P-0000501 |
| Value | Count | Frequency (%) |
| 111316 | 2106 | |
| 627122 | 200 | 2.9% |
| 429767 | 128 | 1.8% |
| 814656 | 108 | 1.6% |
| 636974 | 98 | 1.4% |
| 949853 | 84 | 1.2% |
| p-0001315 | 72 | 1.0% |
| p-0002672 | 32 | 0.5% |
| p-0013110 | 30 | 0.4% |
| p-0012178 | 24 | 0.3% |
| Other values (2852) | 4052 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 13683 | |
| 6 | 5769 | |
| 3 | 5161 | 10.1% |
| 0 | 5071 | 10.0% |
| 2 | 4434 | 8.7% |
| 9 | 3126 | 6.1% |
| 4 | 3101 | 6.1% |
| 5 | 2788 | 5.5% |
| 7 | 2631 | 5.2% |
| 8 | 2250 | 4.4% |
| Other values (8) | 2911 | 5.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 50925 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 13683 | |
| 6 | 5769 | |
| 3 | 5161 | 10.1% |
| 0 | 5071 | 10.0% |
| 2 | 4434 | 8.7% |
| 9 | 3126 | 6.1% |
| 4 | 3101 | 6.1% |
| 5 | 2788 | 5.5% |
| 7 | 2631 | 5.2% |
| 8 | 2250 | 4.4% |
| Other values (8) | 2911 | 5.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 50925 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 13683 | |
| 6 | 5769 | |
| 3 | 5161 | 10.1% |
| 0 | 5071 | 10.0% |
| 2 | 4434 | 8.7% |
| 9 | 3126 | 6.1% |
| 4 | 3101 | 6.1% |
| 5 | 2788 | 5.5% |
| 7 | 2631 | 5.2% |
| 8 | 2250 | 4.4% |
| Other values (8) | 2911 | 5.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 50925 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 13683 | |
| 6 | 5769 | |
| 3 | 5161 | 10.1% |
| 0 | 5071 | 10.0% |
| 2 | 4434 | 8.7% |
| 9 | 3126 | 6.1% |
| 4 | 3101 | 6.1% |
| 5 | 2788 | 5.5% |
| 7 | 2631 | 5.2% |
| 8 | 2250 | 4.4% |
| Other values (8) | 2911 | 5.7% |
age_at_diagnosis
Real number (ℝ)
High correlation 
| Distinct | 73 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 58.860975 |
| Minimum | 12 |
|---|---|
| Maximum | 90 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 54.3 KiB |
Quantile statistics
| Minimum | 12 |
|---|---|
| 5-th percentile | 39 |
| Q1 | 55 |
| median | 58 |
| Q3 | 66 |
| 95-th percentile | 80 |
| Maximum | 90 |
| Range | 78 |
| Interquartile range (IQR) | 11 |
Descriptive statistics
| Standard deviation | 11.750974 |
|---|---|
| Coefficient of variation (CV) | 0.19963947 |
| Kurtosis | 0.55650135 |
| Mean | 58.860975 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | -0.070998607 |
| Sum | 408142 |
| Variance | 138.08539 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 58 | 2212 | |
| 39 | 319 | 4.6% |
| 66 | 238 | 3.4% |
| 57 | 219 | 3.2% |
| 53 | 208 | 3.0% |
| 59 | 163 | 2.4% |
| 56 | 142 | 2.0% |
| 60 | 137 | 2.0% |
| 67 | 130 | 1.9% |
| 64 | 130 | 1.9% |
| Other values (63) | 3036 |
| Value | Count | Frequency (%) |
| 12 | 1 | < 0.1% |
| 14 | 1 | < 0.1% |
| 18 | 1 | < 0.1% |
| 19 | 6 | |
| 22 | 1 | < 0.1% |
| 23 | 3 | < 0.1% |
| 24 | 3 | < 0.1% |
| 25 | 6 | |
| 26 | 4 | 0.1% |
| 27 | 11 |
| Value | Count | Frequency (%) |
| 90 | 42 | |
| 89 | 11 | 0.2% |
| 88 | 20 | |
| 87 | 22 | |
| 86 | 25 | |
| 85 | 37 | |
| 84 | 43 | |
| 83 | 37 | |
| 82 | 28 | |
| 81 | 40 |
Age at Which Sequencing was Reported (Years)
Real number (ℝ)
High correlation  Missing 
| Distinct | 51 |
|---|---|
| Distinct (%) | 5.9% |
| Missing | 6064 |
| Missing (%) | 87.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 57.331034 |
| Minimum | 28 |
|---|---|
| Maximum | 90 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 54.3 KiB |
Quantile statistics
| Minimum | 28 |
|---|---|
| 5-th percentile | 36 |
| Q1 | 49 |
| median | 58 |
| Q3 | 65 |
| 95-th percentile | 77 |
| Maximum | 90 |
| Range | 62 |
| Interquartile range (IQR) | 16 |
Descriptive statistics
| Standard deviation | 12.167732 |
|---|---|
| Coefficient of variation (CV) | 0.21223639 |
| Kurtosis | -0.35045886 |
| Mean | 57.331034 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | -0.02569774 |
| Sum | 49878 |
| Variance | 148.0537 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 36 | 66 | 1.0% |
| 60 | 54 | 0.8% |
| 61 | 52 | 0.7% |
| 53 | 52 | 0.7% |
| 54 | 41 | 0.6% |
| 65 | 39 | 0.6% |
| 49 | 39 | 0.6% |
| 64 | 37 | 0.5% |
| 58 | 33 | 0.5% |
| 50 | 31 | 0.4% |
| Other values (41) | 426 | 6.1% |
| (Missing) | 6064 |
| Value | Count | Frequency (%) |
| 28 | 3 | < 0.1% |
| 29 | 1 | < 0.1% |
| 31 | 6 | 0.1% |
| 32 | 1 | < 0.1% |
| 33 | 4 | 0.1% |
| 34 | 3 | < 0.1% |
| 36 | 66 | |
| 39 | 7 | 0.1% |
| 42 | 14 | 0.2% |
| 43 | 6 | 0.1% |
| Value | Count | Frequency (%) |
| 90 | 3 | < 0.1% |
| 88 | 3 | < 0.1% |
| 84 | 3 | < 0.1% |
| 83 | 5 | 0.1% |
| 81 | 3 | < 0.1% |
| 80 | 3 | < 0.1% |
| 79 | 10 | 0.1% |
| 78 | 6 | 0.1% |
| 77 | 31 | |
| 76 | 2 | < 0.1% |
stage_at_diagnosis
Categorical
High correlation 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 392.0 KiB |
| Metastatic | |
|---|---|
| Unknown | |
| Localized | |
| Regional |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 8.863715 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Metastatic |
|---|---|
| 2nd row | Metastatic |
| 3rd row | Localized |
| 4th row | Localized |
| 5th row | Localized |
Common Values
| Value | Count | Frequency (%) |
| Metastatic | 3279 | |
| Unknown | 1925 | |
| Localized | 1356 | |
| Regional | 374 | 5.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| metastatic | 3279 | |
| unknown | 1925 | |
| localized | 1356 | |
| regional | 374 | 5.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 9837 | |
| a | 8288 | |
| n | 6149 | |
| e | 5009 | |
| i | 5009 | |
| c | 4635 | |
| o | 3655 | 5.9% |
| M | 3279 | 5.3% |
| s | 3279 | 5.3% |
| U | 1925 | 3.1% |
| Other values (8) | 10396 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 61461 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| t | 9837 | |
| a | 8288 | |
| n | 6149 | |
| e | 5009 | |
| i | 5009 | |
| c | 4635 | |
| o | 3655 | 5.9% |
| M | 3279 | 5.3% |
| s | 3279 | 5.3% |
| U | 1925 | 3.1% |
| Other values (8) | 10396 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 61461 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| t | 9837 | |
| a | 8288 | |
| n | 6149 | |
| e | 5009 | |
| i | 5009 | |
| c | 4635 | |
| o | 3655 | 5.9% |
| M | 3279 | 5.3% |
| s | 3279 | 5.3% |
| U | 1925 | 3.1% |
| Other values (8) | 10396 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 61461 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| t | 9837 | |
| a | 8288 | |
| n | 6149 | |
| e | 5009 | |
| i | 5009 | |
| c | 4635 | |
| o | 3655 | 5.9% |
| M | 3279 | 5.3% |
| s | 3279 | 5.3% |
| U | 1925 | 3.1% |
| Other values (8) | 10396 |
tumor_size
Real number (ℝ)
High correlation  Missing 
| Distinct | 53 |
|---|---|
| Distinct (%) | 6.1% |
| Missing | 6064 |
| Missing (%) | 87.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.036897 |
| Minimum | 1.4 |
|---|---|
| Maximum | 26 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 54.3 KiB |
Quantile statistics
| Minimum | 1.4 |
|---|---|
| 5-th percentile | 2.8 |
| Q1 | 6.9 |
| median | 10 |
| Q3 | 14.6 |
| 95-th percentile | 24 |
| Maximum | 26 |
| Range | 24.6 |
| Interquartile range (IQR) | 7.7 |
Descriptive statistics
| Standard deviation | 5.9330815 |
|---|---|
| Coefficient of variation (CV) | 0.53756792 |
| Kurtosis | -0.19983541 |
| Mean | 11.036897 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 0.66844217 |
| Sum | 9602.1 |
| Variance | 35.201456 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8 | 94 | 1.4% |
| 15 | 51 | 0.7% |
| 4 | 44 | 0.6% |
| 10 | 42 | 0.6% |
| 14 | 36 | 0.5% |
| 12 | 35 | 0.5% |
| 11 | 30 | 0.4% |
| 6 | 30 | 0.4% |
| 14.6 | 30 | 0.4% |
| 20 | 25 | 0.4% |
| Other values (43) | 453 | 6.5% |
| (Missing) | 6064 |
| Value | Count | Frequency (%) |
| 1.4 | 4 | 0.1% |
| 2.1 | 2 | < 0.1% |
| 2.3 | 12 | |
| 2.4 | 20 | |
| 2.8 | 12 | |
| 3 | 4 | 0.1% |
| 3.4 | 2 | < 0.1% |
| 3.5 | 14 | |
| 3.6 | 4 | 0.1% |
| 3.8 | 4 | 0.1% |
| Value | Count | Frequency (%) |
| 26 | 8 | 0.1% |
| 25 | 24 | |
| 24 | 23 | |
| 21 | 13 | |
| 20 | 25 | |
| 19.5 | 10 | 0.1% |
| 18.5 | 12 | |
| 18 | 6 | 0.1% |
| 17.9 | 11 | |
| 17 | 15 |
mitotic_rate
Real number (ℝ)
High correlation  Missing 
| Distinct | 39 |
|---|---|
| Distinct (%) | 4.7% |
| Missing | 6111 |
| Missing (%) | 88.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 27.641555 |
| Minimum | 0 |
|---|---|
| Maximum | 112 |
| Zeros | 44 |
| Zeros (%) | 0.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 54.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 5 |
| median | 20 |
| Q3 | 48 |
| 95-th percentile | 90 |
| Maximum | 112 |
| Range | 112 |
| Interquartile range (IQR) | 43 |
Descriptive statistics
| Standard deviation | 27.142842 |
|---|---|
| Coefficient of variation (CV) | 0.98195786 |
| Kurtosis | 1.784581 |
| Mean | 27.641555 |
| Median Absolute Deviation (MAD) | 18 |
| Skewness | 1.3771716 |
| Sum | 22749 |
| Variance | 736.73389 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 48 | 72 | 1.0% |
| 50 | 71 | 1.0% |
| 5 | 54 | 0.8% |
| 2 | 53 | 0.8% |
| 15 | 44 | 0.6% |
| 0 | 44 | 0.6% |
| 10 | 42 | 0.6% |
| 20 | 41 | 0.6% |
| 12 | 32 | 0.5% |
| 112 | 32 | 0.5% |
| Other values (29) | 338 | 4.9% |
| (Missing) | 6111 |
| Value | Count | Frequency (%) |
| 0 | 44 | |
| 1 | 27 | |
| 2 | 53 | |
| 3 | 12 | 0.2% |
| 4 | 24 | |
| 5 | 54 | |
| 6 | 8 | 0.1% |
| 7 | 17 | 0.2% |
| 8 | 18 | 0.3% |
| 10 | 42 |
| Value | Count | Frequency (%) |
| 112 | 32 | |
| 104 | 2 | < 0.1% |
| 90 | 16 | 0.2% |
| 75 | 9 | 0.1% |
| 55 | 20 | 0.3% |
| 50 | 71 | |
| 48 | 72 | |
| 47 | 6 | 0.1% |
| 46 | 4 | 0.1% |
| 45 | 2 | < 0.1% |
treatment
Categorical
High correlation 
| Distinct | 28 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 390.0 KiB |
| SURGERY | |
|---|---|
| IMATINIB | |
| RIPRETINIB | |
| TREATMENT_NAIVE | |
| OTHER | |
| Other values (23) |
Length
| Max length | 31 |
|---|---|
| Median length | 21 |
| Mean length | 8.5754254 |
| Min length | 4 |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | OTHER |
|---|---|
| 2nd row | OTHER |
| 3rd row | OTHER |
| 4th row | OTHER |
| 5th row | OTHER |
Common Values
| Value | Count | Frequency (%) |
| SURGERY | 2429 | |
| IMATINIB | 2206 | |
| RIPRETINIB | 1053 | |
| TREATMENT_NAIVE | 286 | 4.1% |
| OTHER | 269 | 3.9% |
| NO_CURRENT_THERAPY | 116 | 1.7% |
| SUNITINIB | 112 | 1.6% |
| UNKNOWN | 94 | 1.4% |
| CLINICAL_TRIAL | 84 | 1.2% |
| IMATINIB + SUNITINIB | 75 | 1.1% |
| Other values (18) | 210 | 3.0% |
Length
| Value | Count | Frequency (%) |
| surgery | 2429 | |
| imatinib | 2302 | |
| ripretinib | 1053 | |
| treatment_naive | 286 | 4.0% |
| other | 269 | 3.8% |
| sunitinib | 188 | 2.6% |
| no_current_therapy | 116 | 1.6% |
| unknown | 113 | 1.6% |
| 99 | 1.4% | |
| clinical_trial | 84 | 1.2% |
| Other values (14) | 193 | 2.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 11424 | |
| R | 8151 | |
| N | 5147 | |
| E | 5048 | |
| T | 5016 | |
| B | 3708 | 6.2% |
| A | 3333 | 5.6% |
| U | 2867 | 4.8% |
| S | 2680 | 4.5% |
| M | 2622 | 4.4% |
| Other values (17) | 9466 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 59462 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| I | 11424 | |
| R | 8151 | |
| N | 5147 | |
| E | 5048 | |
| T | 5016 | |
| B | 3708 | 6.2% |
| A | 3333 | 5.6% |
| U | 2867 | 4.8% |
| S | 2680 | 4.5% |
| M | 2622 | 4.4% |
| Other values (17) | 9466 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 59462 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| I | 11424 | |
| R | 8151 | |
| N | 5147 | |
| E | 5048 | |
| T | 5016 | |
| B | 3708 | 6.2% |
| A | 3333 | 5.6% |
| U | 2867 | 4.8% |
| S | 2680 | 4.5% |
| M | 2622 | 4.4% |
| Other values (17) | 9466 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 59462 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| I | 11424 | |
| R | 8151 | |
| N | 5147 | |
| E | 5048 | |
| T | 5016 | |
| B | 3708 | 6.2% |
| A | 3333 | 5.6% |
| U | 2867 | 4.8% |
| S | 2680 | 4.5% |
| M | 2622 | 4.4% |
| Other values (17) | 9466 |
treatment_response
Categorical
Imbalance 
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 372.0 KiB |
| UNKNOWN | |
|---|---|
| NR | |
| SD | 226 |
| PR | 180 |
| CR | 162 |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 5.9176521 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | UNKNOWN |
|---|---|
| 2nd row | UNKNOWN |
| 3rd row | UNKNOWN |
| 4th row | UNKNOWN |
| 5th row | UNKNOWN |
Common Values
| Value | Count | Frequency (%) |
| UNKNOWN | 5433 | |
| NR | 842 | 12.1% |
| SD | 226 | 3.3% |
| PR | 180 | 2.6% |
| CR | 162 | 2.3% |
| NE | 91 | 1.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| unknown | 5433 | |
| nr | 842 | 12.1% |
| sd | 226 | 3.3% |
| pr | 180 | 2.6% |
| cr | 162 | 2.3% |
| ne | 91 | 1.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 17232 | |
| U | 5433 | 13.2% |
| K | 5433 | 13.2% |
| O | 5433 | 13.2% |
| W | 5433 | 13.2% |
| R | 1184 | 2.9% |
| S | 226 | 0.6% |
| D | 226 | 0.6% |
| P | 180 | 0.4% |
| C | 162 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 41033 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| N | 17232 | |
| U | 5433 | 13.2% |
| K | 5433 | 13.2% |
| O | 5433 | 13.2% |
| W | 5433 | 13.2% |
| R | 1184 | 2.9% |
| S | 226 | 0.6% |
| D | 226 | 0.6% |
| P | 180 | 0.4% |
| C | 162 | 0.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 41033 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| N | 17232 | |
| U | 5433 | 13.2% |
| K | 5433 | 13.2% |
| O | 5433 | 13.2% |
| W | 5433 | 13.2% |
| R | 1184 | 2.9% |
| S | 226 | 0.6% |
| D | 226 | 0.6% |
| P | 180 | 0.4% |
| C | 162 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 41033 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| N | 17232 | |
| U | 5433 | 13.2% |
| K | 5433 | 13.2% |
| O | 5433 | 13.2% |
| W | 5433 | 13.2% |
| R | 1184 | 2.9% |
| S | 226 | 0.6% |
| D | 226 | 0.6% |
| P | 180 | 0.4% |
| C | 162 | 0.4% |
primary_site
Categorical
High correlation  Imbalance 
| Distinct | 24 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 397.9 KiB |
| Stomach | |
|---|---|
| Liver | |
| Small Intestine | |
| Abdomen/Intraabdominal | |
| GI Tract (Indeterminate) | 101 |
| Other values (19) |
Length
| Max length | 37 |
|---|---|
| Median length | 30 |
| Mean length | 9.7487742 |
| Min length | 4 |
Unique
| Unique | 6 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Stomach |
|---|---|
| 2nd row | Stomach |
| 3rd row | Stomach |
| 4th row | Stomach |
| 5th row | Stomach |
Common Values
| Value | Count | Frequency (%) |
| Stomach | 2621 | |
| Liver | 2114 | |
| Small Intestine | 1261 | |
| Abdomen/Intraabdominal | 373 | 5.4% |
| GI Tract (Indeterminate) | 101 | 1.5% |
| Colon And Rectum (Excluding Appendix) | 98 | 1.4% |
| Retroperitoneum | 92 | 1.3% |
| Digestive Other | 76 | 1.1% |
| Soft Tissue | 60 | 0.9% |
| Colon/Rectum | 43 | 0.6% |
| Other values (14) | 95 | 1.4% |
Length
| Value | Count | Frequency (%) |
| stomach | 2621 | |
| liver | 2114 | |
| small | 1261 | |
| intestine | 1261 | |
| abdomen/intraabdominal | 373 | 4.1% |
| and | 129 | 1.4% |
| retroperitoneum | 121 | 1.3% |
| appendix | 103 | 1.1% |
| gi | 101 | 1.1% |
| tract | 101 | 1.1% |
| Other values (29) | 822 | 9.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 6489 | 9.6% |
| e | 6332 | 9.4% |
| a | 5304 | 7.8% |
| m | 5034 | 7.4% |
| n | 4526 | 6.7% |
| i | 4455 | 6.6% |
| o | 4049 | 6.0% |
| S | 3942 | 5.8% |
| l | 3188 | 4.7% |
| r | 3066 | 4.5% |
| Other values (35) | 21213 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 67598 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| t | 6489 | 9.6% |
| e | 6332 | 9.4% |
| a | 5304 | 7.8% |
| m | 5034 | 7.4% |
| n | 4526 | 6.7% |
| i | 4455 | 6.6% |
| o | 4049 | 6.0% |
| S | 3942 | 5.8% |
| l | 3188 | 4.7% |
| r | 3066 | 4.5% |
| Other values (35) | 21213 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 67598 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| t | 6489 | 9.6% |
| e | 6332 | 9.4% |
| a | 5304 | 7.8% |
| m | 5034 | 7.4% |
| n | 4526 | 6.7% |
| i | 4455 | 6.6% |
| o | 4049 | 6.0% |
| S | 3942 | 5.8% |
| l | 3188 | 4.7% |
| r | 3066 | 4.5% |
| Other values (35) | 21213 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 67598 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| t | 6489 | 9.6% |
| e | 6332 | 9.4% |
| a | 5304 | 7.8% |
| m | 5034 | 7.4% |
| n | 4526 | 6.7% |
| i | 4455 | 6.6% |
| o | 4049 | 6.0% |
| S | 3942 | 5.8% |
| l | 3188 | 4.7% |
| r | 3066 | 4.5% |
| Other values (35) | 21213 |
sample_type
Categorical
High correlation 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 390.2 KiB |
| Metastasis | |
|---|---|
| Unknown | |
| Primary | |
| Local Recurrence | 209 |
Length
| Max length | 16 |
|---|---|
| Median length | 7 |
| Mean length | 8.6111912 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Metastasis |
|---|---|
| 2nd row | Metastasis |
| 3rd row | Primary |
| 4th row | Metastasis |
| 5th row | Metastasis |
Common Values
| Value | Count | Frequency (%) |
| Metastasis | 3097 | |
| Unknown | 2580 | |
| Primary | 1048 | 15.1% |
| Local Recurrence | 209 | 3.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| metastasis | 3097 | |
| unknown | 2580 | |
| primary | 1048 | 14.7% |
| local | 209 | 2.9% |
| recurrence | 209 | 2.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 9291 | |
| n | 7949 | |
| a | 7451 | |
| t | 6194 | |
| i | 4145 | |
| e | 3724 | 6.2% |
| M | 3097 | 5.2% |
| o | 2789 | 4.7% |
| U | 2580 | 4.3% |
| k | 2580 | 4.3% |
| Other values (11) | 9910 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 59710 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| s | 9291 | |
| n | 7949 | |
| a | 7451 | |
| t | 6194 | |
| i | 4145 | |
| e | 3724 | 6.2% |
| M | 3097 | 5.2% |
| o | 2789 | 4.7% |
| U | 2580 | 4.3% |
| k | 2580 | 4.3% |
| Other values (11) | 9910 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 59710 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| s | 9291 | |
| n | 7949 | |
| a | 7451 | |
| t | 6194 | |
| i | 4145 | |
| e | 3724 | 6.2% |
| M | 3097 | 5.2% |
| o | 2789 | 4.7% |
| U | 2580 | 4.3% |
| k | 2580 | 4.3% |
| Other values (11) | 9910 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 59710 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| s | 9291 | |
| n | 7949 | |
| a | 7451 | |
| t | 6194 | |
| i | 4145 | |
| e | 3724 | 6.2% |
| M | 3097 | 5.2% |
| o | 2789 | 4.7% |
| U | 2580 | 4.3% |
| k | 2580 | 4.3% |
| Other values (11) | 9910 |
race
Categorical
High correlation 
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 397.3 KiB |
| White | |
|---|---|
| Unknown | |
| Black | |
| Other (American Indian/AK Native, Asian/Pacific Islander) | 443 |
| Black or African American | 363 |
| Other values (3) | 67 |
Length
| Max length | 57 |
|---|---|
| Median length | 5 |
| Mean length | 9.6592155 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | White |
|---|---|
| 2nd row | White |
| 3rd row | White |
| 4th row | Black or African American |
| 5th row | Black or African American |
Common Values
| Value | Count | Frequency (%) |
| White | 4601 | |
| Unknown | 995 | 14.3% |
| Black | 465 | 6.7% |
| Other (American Indian/AK Native, Asian/Pacific Islander) | 443 | 6.4% |
| Black or African American | 363 | 5.2% |
| Asian | 48 | 0.7% |
| Other | 16 | 0.2% |
| Not Provided | 3 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| white | 4601 | |
| unknown | 995 | 9.7% |
| black | 828 | 8.1% |
| american | 806 | 7.9% |
| other | 459 | 4.5% |
| indian/ak | 443 | 4.3% |
| native | 443 | 4.3% |
| asian/pacific | 443 | 4.3% |
| islander | 443 | 4.3% |
| or | 363 | 3.5% |
| Other values (4) | 417 | 4.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 8036 | |
| e | 6755 | 10.1% |
| n | 5974 | 8.9% |
| t | 5506 | 8.2% |
| h | 5060 | 7.6% |
| W | 4601 | 6.9% |
| a | 4260 | 6.4% |
| 3307 | 4.9% | |
| c | 2883 | 4.3% |
| r | 2437 | 3.6% |
| Other values (21) | 18158 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 66977 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| i | 8036 | |
| e | 6755 | 10.1% |
| n | 5974 | 8.9% |
| t | 5506 | 8.2% |
| h | 5060 | 7.6% |
| W | 4601 | 6.9% |
| a | 4260 | 6.4% |
| 3307 | 4.9% | |
| c | 2883 | 4.3% |
| r | 2437 | 3.6% |
| Other values (21) | 18158 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 66977 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| i | 8036 | |
| e | 6755 | 10.1% |
| n | 5974 | 8.9% |
| t | 5506 | 8.2% |
| h | 5060 | 7.6% |
| W | 4601 | 6.9% |
| a | 4260 | 6.4% |
| 3307 | 4.9% | |
| c | 2883 | 4.3% |
| r | 2437 | 3.6% |
| Other values (21) | 18158 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 66977 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| i | 8036 | |
| e | 6755 | 10.1% |
| n | 5974 | 8.9% |
| t | 5506 | 8.2% |
| h | 5060 | 7.6% |
| W | 4601 | 6.9% |
| a | 4260 | 6.4% |
| 3307 | 4.9% | |
| c | 2883 | 4.3% |
| r | 2437 | 3.6% |
| Other values (21) | 18158 |
Length
| Max length | 6 |
|---|---|
| Median length | 4 |
| Mean length | 4.6140756 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Female |
|---|---|
| 2nd row | Female |
| 3rd row | Male |
| 4th row | Male |
| 5th row | Male |
Common Values
| Value | Count | Frequency (%) |
| Male | 4805 | |
| Female | 2129 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| male | 4805 | |
| female | 2129 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 9063 | |
| a | 6934 | |
| l | 6934 | |
| M | 4805 | |
| F | 2129 | 6.7% |
| m | 2129 | 6.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 31994 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 9063 | |
| a | 6934 | |
| l | 6934 | |
| M | 4805 | |
| F | 2129 | 6.7% |
| m | 2129 | 6.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 31994 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 9063 | |
| a | 6934 | |
| l | 6934 | |
| M | 4805 | |
| F | 2129 | 6.7% |
| m | 2129 | 6.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 31994 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 9063 | |
| a | 6934 | |
| l | 6934 | |
| M | 4805 | |
| F | 2129 | 6.7% |
| m | 2129 | 6.7% |
Metastatic Site
Categorical
High correlation  Missing 
| Distinct | 17 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 6064 |
| Missing (%) | 87.5% |
| Memory size | 381.7 KiB |
| Not Applicable | |
|---|---|
| Liver | |
| Pelvis | 33 |
| Mesentery | 29 |
| Spleen | 26 |
| Other values (12) |
Length
| Max length | 14 |
|---|---|
| Median length | 13 |
| Mean length | 9.7954023 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Liver |
|---|---|
| 2nd row | Liver |
| 3rd row | Not Applicable |
| 4th row | Liver |
| 5th row | Liver |
Common Values
| Value | Count | Frequency (%) |
| Not Applicable | 375 | 5.4% |
| Liver | 282 | 4.1% |
| Pelvis | 33 | 0.5% |
| Mesentery | 29 | 0.4% |
| Spleen | 26 | 0.4% |
| Small Bowel | 21 | 0.3% |
| Peritoneum | 18 | 0.3% |
| Abdomen | 18 | 0.3% |
| Abdominal Wall | 15 | 0.2% |
| Pleura | 12 | 0.2% |
| Other values (7) | 41 | 0.6% |
| (Missing) | 6064 |
Length
| Value | Count | Frequency (%) |
| not | 379 | |
| applicable | 375 | |
| liver | 282 | |
| pelvis | 36 | 2.8% |
| mesentery | 29 | 2.2% |
| spleen | 26 | 2.0% |
| small | 21 | 1.6% |
| bowel | 21 | 1.6% |
| abdominal | 21 | 1.6% |
| peritoneum | 18 | 1.4% |
| Other values (11) | 98 | 7.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 973 | |
| e | 960 | 11.3% |
| p | 776 | 9.1% |
| i | 754 | 8.8% |
| a | 477 | 5.6% |
| o | 469 | 5.5% |
| 436 | 5.1% | |
| t | 430 | 5.0% |
| b | 418 | 4.9% |
| A | 414 | 4.9% |
| Other values (20) | 2415 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 8522 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| l | 973 | |
| e | 960 | 11.3% |
| p | 776 | 9.1% |
| i | 754 | 8.8% |
| a | 477 | 5.6% |
| o | 469 | 5.5% |
| 436 | 5.1% | |
| t | 430 | 5.0% |
| b | 418 | 4.9% |
| A | 414 | 4.9% |
| Other values (20) | 2415 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 8522 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| l | 973 | |
| e | 960 | 11.3% |
| p | 776 | 9.1% |
| i | 754 | 8.8% |
| a | 477 | 5.6% |
| o | 469 | 5.5% |
| 436 | 5.1% | |
| t | 430 | 5.0% |
| b | 418 | 4.9% |
| A | 414 | 4.9% |
| Other values (20) | 2415 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 8522 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| l | 973 | |
| e | 960 | 11.3% |
| p | 776 | 9.1% |
| i | 754 | 8.8% |
| a | 477 | 5.6% |
| o | 469 | 5.5% |
| 436 | 5.1% | |
| t | 430 | 5.0% |
| b | 418 | 4.9% |
| A | 414 | 4.9% |
| Other values (20) | 2415 |
tumor_purity
Real number (ℝ)
High correlation  Missing 
| Distinct | 13 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 6042 |
| Missing (%) | 87.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 65.942825 |
| Minimum | 10 |
|---|---|
| Maximum | 90 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 54.3 KiB |
Quantile statistics
| Minimum | 10 |
|---|---|
| 5-th percentile | 30 |
| Q1 | 50 |
| median | 70 |
| Q3 | 80 |
| 95-th percentile | 90 |
| Maximum | 90 |
| Range | 80 |
| Interquartile range (IQR) | 30 |
Descriptive statistics
| Standard deviation | 20.441262 |
|---|---|
| Coefficient of variation (CV) | 0.30998463 |
| Kurtosis | -0.77062301 |
| Mean | 65.942825 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | -0.57372366 |
| Sum | 58821 |
| Variance | 417.84521 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 80 | 206 | 3.0% |
| 90 | 174 | 2.5% |
| 60 | 157 | 2.3% |
| 40 | 94 | 1.4% |
| 70 | 92 | 1.3% |
| 30 | 80 | 1.2% |
| 50 | 54 | 0.8% |
| 85 | 19 | 0.3% |
| 15 | 6 | 0.1% |
| 10 | 4 | 0.1% |
| Other values (3) | 6 | 0.1% |
| (Missing) | 6042 |
| Value | Count | Frequency (%) |
| 10 | 4 | 0.1% |
| 15 | 6 | 0.1% |
| 20 | 4 | 0.1% |
| 30 | 80 | |
| 40 | 94 | |
| 50 | 54 | 0.8% |
| 60 | 157 | |
| 63 | 1 | < 0.1% |
| 70 | 92 | |
| 73 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 90 | 174 | |
| 85 | 19 | 0.3% |
| 80 | 206 | |
| 73 | 1 | < 0.1% |
| 70 | 92 | |
| 63 | 1 | < 0.1% |
| 60 | 157 | |
| 50 | 54 | 0.8% |
| 40 | 94 | |
| 30 | 80 | 1.2% |
sample_coverage
Real number (ℝ)
High correlation  Missing 
| Distinct | 101 |
|---|---|
| Distinct (%) | 11.6% |
| Missing | 6064 |
| Missing (%) | 87.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 771.92299 |
| Minimum | 106 |
|---|---|
| Maximum | 1135 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 54.3 KiB |
Quantile statistics
| Minimum | 106 |
|---|---|
| 5-th percentile | 425 |
| Q1 | 657 |
| median | 808 |
| Q3 | 903 |
| 95-th percentile | 1132 |
| Maximum | 1135 |
| Range | 1029 |
| Interquartile range (IQR) | 246 |
Descriptive statistics
| Standard deviation | 209.96053 |
|---|---|
| Coefficient of variation (CV) | 0.27199673 |
| Kurtosis | 0.17939174 |
| Mean | 771.92299 |
| Median Absolute Deviation (MAD) | 119 |
| Skewness | -0.38881445 |
| Sum | 671573 |
| Variance | 44083.422 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1132 | 72 | 1.0% |
| 821 | 36 | 0.5% |
| 597 | 32 | 0.5% |
| 920 | 24 | 0.3% |
| 952 | 23 | 0.3% |
| 780 | 21 | 0.3% |
| 808 | 20 | 0.3% |
| 470 | 18 | 0.3% |
| 794 | 18 | 0.3% |
| 682 | 18 | 0.3% |
| Other values (91) | 588 | 8.5% |
| (Missing) | 6064 |
| Value | Count | Frequency (%) |
| 106 | 4 | |
| 182 | 8 | |
| 205 | 3 | < 0.1% |
| 212 | 2 | < 0.1% |
| 294 | 2 | < 0.1% |
| 359 | 4 | |
| 372 | 2 | < 0.1% |
| 384 | 4 | |
| 391 | 1 | < 0.1% |
| 392 | 6 |
| Value | Count | Frequency (%) |
| 1135 | 2 | < 0.1% |
| 1132 | 72 | |
| 1107 | 4 | 0.1% |
| 1085 | 5 | 0.1% |
| 1079 | 3 | < 0.1% |
| 1072 | 2 | < 0.1% |
| 1050 | 4 | 0.1% |
| 1031 | 8 | 0.1% |
| 1023 | 6 | 0.1% |
| 1022 | 2 | < 0.1% |
os_months
Text
Missing 
| Distinct | 170 |
|---|---|
| Distinct (%) | 5.2% |
| Missing | 3687 |
| Missing (%) | 53.2% |
| Memory size | 284.8 KiB |
Length
| Max length | 7 |
|---|---|
| Median length | 4 |
| Mean length | 4.4490299 |
| Min length | 4 |
Unique
| Unique | 19 ? |
|---|---|
| Unique (%) | 0.6% |
Sample
| 1st row | 11.079 |
|---|---|
| 2nd row | 11.079 |
| 3rd row | 92.351 |
| 4th row | 27.518 |
| 5th row | 27.518 |
| Value | Count | Frequency (%) |
| 0000 | 126 | 3.9% |
| 0001 | 89 | 2.7% |
| 0003 | 85 | 2.6% |
| 0004 | 80 | 2.5% |
| 0006 | 78 | 2.4% |
| 0002 | 77 | 2.4% |
| 0009 | 76 | 2.3% |
| 0005 | 76 | 2.3% |
| 0010 | 74 | 2.3% |
| 0019 | 73 | 2.2% |
| Other values (160) | 2413 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 6205 | |
| 1 | 1440 | 10.0% |
| 2 | 1033 | 7.2% |
| 4 | 838 | 5.8% |
| . | 818 | 5.7% |
| 7 | 767 | 5.3% |
| 3 | 761 | 5.3% |
| 9 | 664 | 4.6% |
| 8 | 658 | 4.6% |
| 5 | 619 | 4.3% |
| Other values (6) | 643 | 4.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 14446 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 6205 | |
| 1 | 1440 | 10.0% |
| 2 | 1033 | 7.2% |
| 4 | 838 | 5.8% |
| . | 818 | 5.7% |
| 7 | 767 | 5.3% |
| 3 | 761 | 5.3% |
| 9 | 664 | 4.6% |
| 8 | 658 | 4.6% |
| 5 | 619 | 4.3% |
| Other values (6) | 643 | 4.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 14446 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 6205 | |
| 1 | 1440 | 10.0% |
| 2 | 1033 | 7.2% |
| 4 | 838 | 5.8% |
| . | 818 | 5.7% |
| 7 | 767 | 5.3% |
| 3 | 761 | 5.3% |
| 9 | 664 | 4.6% |
| 8 | 658 | 4.6% |
| 5 | 619 | 4.3% |
| Other values (6) | 643 | 4.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 14446 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 6205 | |
| 1 | 1440 | 10.0% |
| 2 | 1033 | 7.2% |
| 4 | 838 | 5.8% |
| . | 818 | 5.7% |
| 7 | 767 | 5.3% |
| 3 | 761 | 5.3% |
| 9 | 664 | 4.6% |
| 8 | 658 | 4.6% |
| 5 | 619 | 4.3% |
| Other values (6) | 643 | 4.5% |
treatment_start
Date
Missing 
| Distinct | 211 |
|---|---|
| Distinct (%) | 33.6% |
| Missing | 6306 |
| Missing (%) | 90.9% |
| Memory size | 54.3 KiB |
| Minimum | 1897-05-07 00:00:00 |
|---|---|
| Maximum | 1914-03-26 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
os_status
Categorical
High correlation  Imbalance  Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 3636 |
| Missing (%) | 52.4% |
| Memory size | 382.8 KiB |
| DECEASED | |
|---|---|
| ALIVE | |
| DECEASED_NON_CANCER | 134 |
Length
| Max length | 19 |
|---|---|
| Median length | 8 |
| Mean length | 8.0794421 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | DECEASED |
|---|---|
| 2nd row | DECEASED |
| 3rd row | ALIVE |
| 4th row | ALIVE |
| 5th row | ALIVE |
Common Values
| Value | Count | Frequency (%) |
| DECEASED | 2760 | |
| ALIVE | 404 | 5.8% |
| DECEASED_NON_CANCER | 134 | 1.9% |
| (Missing) | 3636 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| deceased | 2760 | |
| alive | 404 | 12.2% |
| deceased_non_cancer | 134 | 4.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 9220 | |
| D | 5788 | |
| A | 3432 | 12.9% |
| C | 3162 | 11.9% |
| S | 2894 | 10.9% |
| L | 404 | 1.5% |
| I | 404 | 1.5% |
| V | 404 | 1.5% |
| N | 402 | 1.5% |
| _ | 268 | 1.0% |
| Other values (2) | 268 | 1.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 26646 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| E | 9220 | |
| D | 5788 | |
| A | 3432 | 12.9% |
| C | 3162 | 11.9% |
| S | 2894 | 10.9% |
| L | 404 | 1.5% |
| I | 404 | 1.5% |
| V | 404 | 1.5% |
| N | 402 | 1.5% |
| _ | 268 | 1.0% |
| Other values (2) | 268 | 1.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 26646 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| E | 9220 | |
| D | 5788 | |
| A | 3432 | 12.9% |
| C | 3162 | 11.9% |
| S | 2894 | 10.9% |
| L | 404 | 1.5% |
| I | 404 | 1.5% |
| V | 404 | 1.5% |
| N | 402 | 1.5% |
| _ | 268 | 1.0% |
| Other values (2) | 268 | 1.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 26646 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| E | 9220 | |
| D | 5788 | |
| A | 3432 | 12.9% |
| C | 3162 | 11.9% |
| S | 2894 | 10.9% |
| L | 404 | 1.5% |
| I | 404 | 1.5% |
| V | 404 | 1.5% |
| N | 402 | 1.5% |
| _ | 268 | 1.0% |
| Other values (2) | 268 | 1.0% |
mutated_genes
Text
Missing 
| Distinct | 88 |
|---|---|
| Distinct (%) | 2.3% |
| Missing | 3130 |
| Missing (%) | 45.1% |
| Memory size | 292.2 KiB |
Length
| Max length | 7 |
|---|---|
| Median length | 3 |
| Mean length | 3.2936383 |
| Min length | 2 |
Unique
| Unique | 14 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | TP53 |
|---|---|
| 2nd row | RB1 |
| 3rd row | KIT |
| 4th row | MTOR |
| 5th row | SDHB |
| Value | Count | Frequency (%) |
| kit | 3123 | |
| kmt2c | 170 | 4.5% |
| pdgfra | 52 | 1.4% |
| rb1 | 30 | 0.8% |
| nf1 | 29 | 0.8% |
| max | 28 | 0.7% |
| braf | 23 | 0.6% |
| setd2 | 17 | 0.4% |
| tp53 | 17 | 0.4% |
| mga | 16 | 0.4% |
| Other values (78) | 299 | 7.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| T | 3434 | |
| K | 3360 | |
| I | 3166 | |
| 2 | 261 | 2.1% |
| M | 257 | 2.1% |
| C | 246 | 2.0% |
| A | 218 | 1.7% |
| R | 210 | 1.7% |
| P | 195 | 1.6% |
| F | 149 | 1.2% |
| Other values (22) | 1033 | 8.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 12529 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| T | 3434 | |
| K | 3360 | |
| I | 3166 | |
| 2 | 261 | 2.1% |
| M | 257 | 2.1% |
| C | 246 | 2.0% |
| A | 218 | 1.7% |
| R | 210 | 1.7% |
| P | 195 | 1.6% |
| F | 149 | 1.2% |
| Other values (22) | 1033 | 8.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 12529 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| T | 3434 | |
| K | 3360 | |
| I | 3166 | |
| 2 | 261 | 2.1% |
| M | 257 | 2.1% |
| C | 246 | 2.0% |
| A | 218 | 1.7% |
| R | 210 | 1.7% |
| P | 195 | 1.6% |
| F | 149 | 1.2% |
| Other values (22) | 1033 | 8.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 12529 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| T | 3434 | |
| K | 3360 | |
| I | 3166 | |
| 2 | 261 | 2.1% |
| M | 257 | 2.1% |
| C | 246 | 2.0% |
| A | 218 | 1.7% |
| R | 210 | 1.7% |
| P | 195 | 1.6% |
| F | 149 | 1.2% |
| Other values (22) | 1033 | 8.2% |
source
Categorical
High correlation 
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 365.7 KiB |
| PDMR | |
|---|---|
| SEER | |
| CBioPortal | |
| COSMIC | |
| GDC | 74 |
Length
| Max length | 10 |
|---|---|
| Median length | 4 |
| Mean length | 4.9809634 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | CBioPortal |
|---|---|
| 2nd row | CBioPortal |
| 3rd row | CBioPortal |
| 4th row | CBioPortal |
| 5th row | CBioPortal |
Common Values
| Value | Count | Frequency (%) |
| PDMR | 2733 | |
| SEER | 2429 | |
| CBioPortal | 870 | 12.5% |
| COSMIC | 828 | 11.9% |
| GDC | 74 | 1.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| pdmr | 2733 | |
| seer | 2429 | |
| cbioportal | 870 | 12.5% |
| cosmic | 828 | 11.9% |
| gdc | 74 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 5162 | |
| E | 4858 | |
| P | 3603 | |
| M | 3561 | |
| S | 3257 | |
| D | 2807 | |
| C | 2600 | |
| o | 1740 | 5.0% |
| B | 870 | 2.5% |
| i | 870 | 2.5% |
| Other values (7) | 5210 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 34538 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| R | 5162 | |
| E | 4858 | |
| P | 3603 | |
| M | 3561 | |
| S | 3257 | |
| D | 2807 | |
| C | 2600 | |
| o | 1740 | 5.0% |
| B | 870 | 2.5% |
| i | 870 | 2.5% |
| Other values (7) | 5210 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 34538 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| R | 5162 | |
| E | 4858 | |
| P | 3603 | |
| M | 3561 | |
| S | 3257 | |
| D | 2807 | |
| C | 2600 | |
| o | 1740 | 5.0% |
| B | 870 | 2.5% |
| i | 870 | 2.5% |
| Other values (7) | 5210 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 34538 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| R | 5162 | |
| E | 4858 | |
| P | 3603 | |
| M | 3561 | |
| S | 3257 | |
| D | 2807 | |
| C | 2600 | |
| o | 1740 | 5.0% |
| B | 870 | 2.5% |
| i | 870 | 2.5% |
| Other values (7) | 5210 |
tumor_grade
Categorical
High correlation 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 395.0 KiB |
| High grade | |
|---|---|
| Unknown | |
| Intermediate grade | 349 |
| Low grade | 277 |
Length
| Max length | 18 |
|---|---|
| Median length | 10 |
| Mean length | 9.3087684 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Unknown |
|---|---|
| 2nd row | Unknown |
| 3rd row | Unknown |
| 4th row | Unknown |
| 5th row | Unknown |
Common Values
| Value | Count | Frequency (%) |
| High grade | 3872 | |
| Unknown | 2436 | |
| Intermediate grade | 349 | 5.0% |
| Low grade | 277 | 4.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| grade | 4498 | |
| high | 3872 | |
| unknown | 2436 | |
| intermediate | 349 | 3.1% |
| low | 277 | 2.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| g | 8370 | |
| n | 7657 | |
| e | 5545 | |
| r | 4847 | |
| d | 4847 | |
| a | 4847 | |
| 4498 | 7.0% | |
| i | 4221 | 6.5% |
| H | 3872 | 6.0% |
| h | 3872 | 6.0% |
| Other values (8) | 11971 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 64547 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| g | 8370 | |
| n | 7657 | |
| e | 5545 | |
| r | 4847 | |
| d | 4847 | |
| a | 4847 | |
| 4498 | 7.0% | |
| i | 4221 | 6.5% |
| H | 3872 | 6.0% |
| h | 3872 | 6.0% |
| Other values (8) | 11971 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 64547 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| g | 8370 | |
| n | 7657 | |
| e | 5545 | |
| r | 4847 | |
| d | 4847 | |
| a | 4847 | |
| 4498 | 7.0% | |
| i | 4221 | 6.5% |
| H | 3872 | 6.0% |
| h | 3872 | 6.0% |
| Other values (8) | 11971 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 64547 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| g | 8370 | |
| n | 7657 | |
| e | 5545 | |
| r | 4847 | |
| d | 4847 | |
| a | 4847 | |
| 4498 | 7.0% | |
| i | 4221 | 6.5% |
| H | 3872 | 6.0% |
| h | 3872 | 6.0% |
| Other values (8) | 11971 |
Interactions
Correlations
| Age at Which Sequencing was Reported (Years) | Metastatic Site | age_at_diagnosis | gender | mitotic_rate | os_status | primary_site | race | sample_coverage | sample_type | source | stage_at_diagnosis | treatment | treatment_response | tumor_grade | tumor_purity | tumor_size | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Age at Which Sequencing was Reported (Years) | 1.000 | 0.364 | 0.901 | 0.278 | 0.076 | 0.445 | 0.359 | 0.232 | -0.176 | 0.384 | 1.000 | 0.348 | 0.228 | 0.246 | 1.000 | -0.099 | 0.243 |
| Metastatic Site | 0.364 | 1.000 | 0.317 | 0.480 | 0.378 | 0.378 | 0.447 | 0.386 | 0.343 | 0.865 | 1.000 | 0.473 | 0.171 | 0.244 | 1.000 | 0.362 | 0.419 |
| age_at_diagnosis | 0.901 | 0.317 | 1.000 | 0.307 | 0.206 | 0.291 | 0.281 | 0.181 | -0.212 | 0.426 | 0.419 | 0.332 | 0.370 | 0.155 | 0.289 | -0.126 | 0.185 |
| gender | 0.278 | 0.480 | 0.307 | 1.000 | 0.375 | 0.136 | 0.450 | 0.371 | 0.342 | 0.407 | 0.409 | 0.353 | 0.437 | 0.114 | 0.302 | 0.190 | 0.341 |
| mitotic_rate | 0.076 | 0.378 | 0.206 | 0.375 | 1.000 | 0.383 | 0.302 | 0.303 | 0.053 | 0.338 | 1.000 | 0.464 | 0.173 | 0.220 | 1.000 | 0.187 | -0.089 |
| os_status | 0.445 | 0.378 | 0.291 | 0.136 | 0.383 | 1.000 | 0.201 | 0.210 | 0.386 | 0.455 | 0.629 | 0.215 | 0.477 | 0.273 | 0.370 | 0.310 | 0.352 |
| primary_site | 0.359 | 0.447 | 0.281 | 0.450 | 0.302 | 0.201 | 1.000 | 0.310 | 0.334 | 0.525 | 0.554 | 0.483 | 0.282 | 0.324 | 0.402 | 0.165 | 0.345 |
| race | 0.232 | 0.386 | 0.181 | 0.371 | 0.303 | 0.210 | 0.310 | 1.000 | 0.293 | 0.453 | 0.559 | 0.374 | 0.331 | 0.386 | 0.397 | 0.316 | 0.403 |
| sample_coverage | -0.176 | 0.343 | -0.212 | 0.342 | 0.053 | 0.386 | 0.334 | 0.293 | 1.000 | 0.420 | 1.000 | 0.333 | 0.192 | 0.259 | 1.000 | 0.248 | 0.003 |
| sample_type | 0.384 | 0.865 | 0.426 | 0.407 | 0.338 | 0.455 | 0.525 | 0.453 | 0.420 | 1.000 | 0.629 | 0.429 | 0.693 | 0.359 | 0.351 | 0.299 | 0.292 |
| source | 1.000 | 1.000 | 0.419 | 0.409 | 1.000 | 0.629 | 0.554 | 0.559 | 1.000 | 0.629 | 1.000 | 0.553 | 0.823 | 0.498 | 0.487 | 0.085 | 1.000 |
| stage_at_diagnosis | 0.348 | 0.473 | 0.332 | 0.353 | 0.464 | 0.215 | 0.483 | 0.374 | 0.333 | 0.429 | 0.553 | 1.000 | 0.457 | 0.302 | 0.330 | 0.203 | 0.347 |
| treatment | 0.228 | 0.171 | 0.370 | 0.437 | 0.173 | 0.477 | 0.282 | 0.331 | 0.192 | 0.693 | 0.823 | 0.457 | 1.000 | 0.412 | 0.437 | 0.131 | 0.159 |
| treatment_response | 0.246 | 0.244 | 0.155 | 0.114 | 0.220 | 0.273 | 0.324 | 0.386 | 0.259 | 0.359 | 0.498 | 0.302 | 0.412 | 1.000 | 0.384 | 0.228 | 0.189 |
| tumor_grade | 1.000 | 1.000 | 0.289 | 0.302 | 1.000 | 0.370 | 0.402 | 0.397 | 1.000 | 0.351 | 0.487 | 0.330 | 0.437 | 0.384 | 1.000 | 1.000 | 1.000 |
| tumor_purity | -0.099 | 0.362 | -0.126 | 0.190 | 0.187 | 0.310 | 0.165 | 0.316 | 0.248 | 0.299 | 0.085 | 0.203 | 0.131 | 0.228 | 1.000 | 1.000 | 0.048 |
| tumor_size | 0.243 | 0.419 | 0.185 | 0.341 | -0.089 | 0.352 | 0.345 | 0.403 | 0.003 | 0.292 | 1.000 | 0.347 | 0.159 | 0.189 | 1.000 | 0.048 | 1.000 |
Missing values
Sample
| sample_id | patient_id | age_at_diagnosis | Age at Which Sequencing was Reported (Years) | stage_at_diagnosis | tumor_size | mitotic_rate | treatment | treatment_response | primary_site | sample_type | race | gender | Metastatic Site | tumor_purity | sample_coverage | os_months | treatment_start | os_status | mutated_genes | source | tumor_grade | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | P-0000134-T02-IM3 | P-0000134 | 68.0 | 70.0 | Metastatic | 13.6 | 50.0 | OTHER | UNKNOWN | Stomach | Metastasis | White | Female | Liver | 90.0 | 661.0 | 11.079 | NaN | DECEASED | TP53 | CBioPortal | Unknown |
| 1 | P-0000134-T02-IM3 | P-0000134 | 68.0 | 70.0 | Metastatic | 13.6 | 50.0 | OTHER | UNKNOWN | Stomach | Metastasis | White | Female | Liver | 90.0 | 661.0 | 11.079 | NaN | DECEASED | RB1 | CBioPortal | Unknown |
| 2 | P-0000306-T01-IM3 | P-0000306 | 48.0 | 57.0 | Localized | 13.0 | 5.0 | OTHER | UNKNOWN | Stomach | Primary | White | Male | Not Applicable | 90.0 | 212.0 | 92.351 | NaN | ALIVE | KIT | CBioPortal | Unknown |
| 3 | P-0000501-T02-IM3 | P-0000501 | 37.0 | 42.0 | Localized | 10.2 | 2.0 | OTHER | UNKNOWN | Stomach | Metastasis | Black or African American | Male | Liver | 60.0 | 811.0 | 27.518 | NaN | ALIVE | MTOR | CBioPortal | Unknown |
| 4 | P-0000501-T02-IM3 | P-0000501 | 37.0 | 42.0 | Localized | 10.2 | 2.0 | OTHER | UNKNOWN | Stomach | Metastasis | Black or African American | Male | Liver | 60.0 | 811.0 | 27.518 | NaN | ALIVE | SDHB | CBioPortal | Unknown |
| 5 | P-0001315-T02-IM5 | P-0001315 | 31.0 | 39.0 | Metastatic | 8.0 | 48.0 | OTHER | UNKNOWN | Small Intestine | Metastasis | White | Male | Skin | 90.0 | 1023.0 | 97.348 | NaN | ALIVE | BRAF | CBioPortal | Unknown |
| 6 | P-0001315-T02-IM5 | P-0001315 | 31.0 | 39.0 | Metastatic | 8.0 | 48.0 | OTHER | UNKNOWN | Small Intestine | Metastasis | White | Male | Skin | 90.0 | 1023.0 | 97.348 | NaN | ALIVE | EP300 | CBioPortal | Unknown |
| 7 | P-0001315-T02-IM5 | P-0001315 | 31.0 | 39.0 | Metastatic | 8.0 | 48.0 | OTHER | UNKNOWN | Small Intestine | Metastasis | White | Male | Skin | 90.0 | 1023.0 | 97.348 | NaN | ALIVE | RB1 | CBioPortal | Unknown |
| 8 | P-0001315-T02-IM5 | P-0001315 | 31.0 | 39.0 | Metastatic | 8.0 | 48.0 | OTHER | UNKNOWN | Small Intestine | Metastasis | White | Male | Skin | 90.0 | 1023.0 | 97.348 | NaN | ALIVE | TSC2 | CBioPortal | Unknown |
| 9 | P-0001315-T02-IM5 | P-0001315 | 31.0 | 39.0 | Metastatic | 8.0 | 48.0 | OTHER | UNKNOWN | Small Intestine | Metastasis | White | Male | Skin | 90.0 | 1023.0 | 97.348 | NaN | ALIVE | NF1 | CBioPortal | Unknown |
| sample_id | patient_id | age_at_diagnosis | Age at Which Sequencing was Reported (Years) | stage_at_diagnosis | tumor_size | mitotic_rate | treatment | treatment_response | primary_site | sample_type | race | gender | Metastatic Site | tumor_purity | sample_coverage | os_months | treatment_start | os_status | mutated_genes | source | tumor_grade | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 6924 | COSS1477588 | 1351159 | 64.0 | NaN | Unknown | NaN | NaN | IMATINIB | UNKNOWN | GI Tract (Indeterminate) | Unknown | Unknown | Male | NaN | NaN | NaN | NaN | NaN | NaN | KIT | COSMIC | Unknown |
| 6925 | COSS1477581 | 1351152 | 54.0 | NaN | Unknown | NaN | NaN | IMATINIB | UNKNOWN | GI Tract (Indeterminate) | Unknown | Unknown | Male | NaN | NaN | NaN | NaN | NaN | NaN | KIT | COSMIC | Unknown |
| 6926 | COSS1477586 | 1351157 | 34.0 | NaN | Unknown | NaN | NaN | IMATINIB | UNKNOWN | GI Tract (Indeterminate) | Unknown | Unknown | Male | NaN | NaN | NaN | NaN | NaN | NaN | KIT | COSMIC | Unknown |
| 6927 | COSS1477589 | 1351160 | 51.0 | NaN | Unknown | NaN | NaN | IMATINIB | UNKNOWN | GI Tract (Indeterminate) | Unknown | Unknown | Male | NaN | NaN | NaN | NaN | NaN | NaN | KIT | COSMIC | Unknown |
| 6928 | COSS1477584 | 1351155 | 55.0 | NaN | Unknown | NaN | NaN | IMATINIB | UNKNOWN | GI Tract (Indeterminate) | Unknown | Unknown | Male | NaN | NaN | NaN | NaN | NaN | NaN | KIT | COSMIC | Unknown |
| 6929 | COSS909212 | 808927 | 40.0 | NaN | Unknown | NaN | NaN | IMATINIB | PR | Abdomen/Intraabdominal | Metastasis | Unknown | Male | NaN | NaN | NaN | NaN | NaN | NaN | KIT | COSMIC | Unknown |
| 6930 | COSS909213 | 808927 | 40.0 | NaN | Unknown | NaN | NaN | IMATINIB | NR | Abdomen/Intraabdominal | Local Recurrence | Unknown | Male | NaN | NaN | NaN | NaN | NaN | NaN | KIT | COSMIC | Unknown |
| 6931 | COSS909213 | 808927 | 40.0 | NaN | Unknown | NaN | NaN | IMATINIB | NR | Abdomen/Intraabdominal | Local Recurrence | Unknown | Male | NaN | NaN | NaN | NaN | NaN | NaN | KIT | COSMIC | Unknown |
| 6932 | COSS2479667 | 2191917 | 58.0 | NaN | Unknown | NaN | NaN | IMATINIB | NR | Colon/Rectum | Unknown | Unknown | Male | NaN | NaN | NaN | NaN | NaN | NaN | KIT | COSMIC | Unknown |
| 6933 | COSS2479666 | 2191917 | 58.0 | NaN | Unknown | NaN | NaN | IMATINIB | NR | Colon/Rectum | Unknown | Unknown | Male | NaN | NaN | NaN | NaN | NaN | NaN | KIT | COSMIC | Unknown |
Duplicate rows
Most frequently occurring
| sample_id | patient_id | age_at_diagnosis | Age at Which Sequencing was Reported (Years) | stage_at_diagnosis | tumor_size | mitotic_rate | treatment | treatment_response | primary_site | sample_type | race | gender | Metastatic Site | tumor_purity | sample_coverage | os_months | treatment_start | os_status | mutated_genes | source | tumor_grade | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 369 | NaN | 111316 | 58.0 | NaN | Metastatic | NaN | NaN | IMATINIB | UNKNOWN | Liver | Metastasis | White | Male | NaN | NaN | NaN | NaN | NaN | NaN | KIT | PDMR | High grade | 972 |
| 371 | NaN | 111316 | 58.0 | NaN | Metastatic | NaN | NaN | RIPRETINIB | UNKNOWN | Liver | Metastasis | White | Male | NaN | NaN | NaN | NaN | NaN | NaN | KIT | PDMR | High grade | 972 |
| 376 | NaN | 627122 | 39.0 | NaN | Unknown | NaN | NaN | TREATMENT_NAIVE | UNKNOWN | Stomach | Primary | White | Male | NaN | NaN | NaN | NaN | NaN | NaN | NaN | PDMR | Unknown | 200 |
| 382 | NaN | 949853 | 39.0 | NaN | Metastatic | NaN | NaN | TREATMENT_NAIVE | UNKNOWN | Stomach | Primary | White | Male | NaN | NaN | NaN | NaN | NaN | NaN | NaN | PDMR | Unknown | 84 |
| 370 | NaN | 111316 | 58.0 | NaN | Metastatic | NaN | NaN | IMATINIB | UNKNOWN | Liver | Metastasis | White | Male | NaN | NaN | NaN | NaN | NaN | NaN | KMT2C | PDMR | High grade | 81 |
| 372 | NaN | 111316 | 58.0 | NaN | Metastatic | NaN | NaN | RIPRETINIB | UNKNOWN | Liver | Metastasis | White | Male | NaN | NaN | NaN | NaN | NaN | NaN | KMT2C | PDMR | High grade | 81 |
| 374 | NaN | 429767 | 53.0 | NaN | Metastatic | NaN | NaN | IMATINIB | SD | Abdomen/Intraabdominal | Primary | Black or African American | Female | NaN | NaN | NaN | NaN | NaN | NaN | NaN | PDMR | Unknown | 64 |
| 375 | NaN | 429767 | 53.0 | NaN | Metastatic | NaN | NaN | NO_CURRENT_THERAPY | UNKNOWN | Abdomen/Intraabdominal | Primary | Black or African American | Female | NaN | NaN | NaN | NaN | NaN | NaN | NaN | PDMR | Unknown | 64 |
| 377 | NaN | 636974 | 66.0 | NaN | Metastatic | NaN | NaN | IMATINIB | SD | Abdomen/Intraabdominal | Primary | White | Male | NaN | NaN | NaN | NaN | NaN | NaN | NaN | PDMR | High grade | 49 |
| 378 | NaN | 636974 | 66.0 | NaN | Metastatic | NaN | NaN | NO_CURRENT_THERAPY | UNKNOWN | Abdomen/Intraabdominal | Primary | White | Male | NaN | NaN | NaN | NaN | NaN | NaN | NaN | PDMR | High grade | 49 |