Overview

Dataset statistics

Number of variables7
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.2 KiB
Average record size in memory63.3 B

Variable types

Numeric5
Categorical2

Alerts

tco_btc_re is highly overall correlated with tco_btc_nmHigh correlation
tco_btc_u_ct is highly overall correlated with tco_btc_u_amHigh correlation
tco_btc_u_am is highly overall correlated with tco_btc_u_ctHigh correlation
tco_btc_nm is highly overall correlated with tco_btc_reHigh correlation
tco_btc_u_am has unique valuesUnique
agegrp_dc has 2 (2.0%) zerosZeros

Reproduction

Analysis started2023-12-11 22:34:32.170722
Analysis finished2023-12-11 22:34:34.082916
Duration1.91 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

crym
Real number (ℝ)

Distinct32
Distinct (%)32.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean202008.86
Minimum201901
Maximum202109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-12T07:34:34.152094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum201901
5-th percentile201902.95
Q1201910.75
median202006
Q3202102
95-th percentile202108
Maximum202109
Range208
Interquartile range (IQR)191.25

Descriptive statistics

Standard deviation76.741205
Coefficient of variation (CV)0.00037989029
Kurtosis-1.3010005
Mean202008.86
Median Absolute Deviation (MAD)96
Skewness-0.06393294
Sum20200886
Variance5889.2125
MonotonicityNot monotonic
2023-12-12T07:34:34.257297image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=32)
ValueCountFrequency (%)
202002 7
 
7.0%
202005 6
 
6.0%
202011 6
 
6.0%
202108 6
 
6.0%
202107 5
 
5.0%
202009 5
 
5.0%
201908 5
 
5.0%
202101 4
 
4.0%
202010 4
 
4.0%
201904 4
 
4.0%
Other values (22) 48
48.0%
ValueCountFrequency (%)
201901 2
 
2.0%
201902 3
3.0%
201903 2
 
2.0%
201904 4
4.0%
201905 1
 
1.0%
201906 3
3.0%
201907 1
 
1.0%
201908 5
5.0%
201909 2
 
2.0%
201910 2
 
2.0%
ValueCountFrequency (%)
202109 1
 
1.0%
202108 6
6.0%
202107 5
5.0%
202106 4
4.0%
202105 2
 
2.0%
202104 2
 
2.0%
202103 3
3.0%
202102 4
4.0%
202101 4
4.0%
202012 1
 
1.0%

tco_btc_re
Real number (ℝ)

HIGH CORRELATION 

Distinct22
Distinct (%)22.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11.91
Minimum1
Maximum22
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-12T07:34:34.369746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q16.75
median12
Q319
95-th percentile22
Maximum22
Range21
Interquartile range (IQR)12.25

Descriptive statistics

Standard deviation6.7615894
Coefficient of variation (CV)0.56772371
Kurtosis-1.2829046
Mean11.91
Median Absolute Deviation (MAD)6
Skewness0.022243958
Sum1191
Variance45.719091
MonotonicityNot monotonic
2023-12-12T07:34:34.491442image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=22)
ValueCountFrequency (%)
22 9
 
9.0%
20 8
 
8.0%
12 6
 
6.0%
9 6
 
6.0%
7 6
 
6.0%
2 6
 
6.0%
4 6
 
6.0%
10 5
 
5.0%
21 5
 
5.0%
6 4
 
4.0%
Other values (12) 39
39.0%
ValueCountFrequency (%)
1 4
4.0%
2 6
6.0%
3 4
4.0%
4 6
6.0%
5 1
 
1.0%
6 4
4.0%
7 6
6.0%
8 4
4.0%
9 6
6.0%
10 5
5.0%
ValueCountFrequency (%)
22 9
9.0%
21 5
5.0%
20 8
8.0%
19 4
4.0%
18 1
 
1.0%
17 4
4.0%
16 4
4.0%
15 2
 
2.0%
14 4
4.0%
13 4
4.0%

tco_btc_nm
Categorical

HIGH CORRELATION 

Distinct22
Distinct (%)22.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
한의원/한방병원
마취통증의학
방사선과/영상진단의학
 
6
안과
 
6
외과
 
6
Other values (17)
65 

Length

Max length11
Median length8
Mean length4.73
Min length2

Unique

Unique2 ?
Unique (%)2.0%

Sample

1st row동물(가축)병원
2nd row종합병원
3rd row안과
4th row기타의료기관
5th row안과

Common Values

ValueCountFrequency (%)
한의원/한방병원 9
 
9.0%
마취통증의학 8
 
8.0%
방사선과/영상진단의학 6
 
6.0%
안과 6
 
6.0%
외과 6
 
6.0%
병원 6
 
6.0%
치과 6
 
6.0%
비뇨기과 5
 
5.0%
요양병원 5
 
5.0%
피부과 4
 
4.0%
Other values (12) 39
39.0%

Length

2023-12-12T07:34:34.620428image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
한의원/한방병원 9
 
9.0%
마취통증의학 8
 
8.0%
방사선과/영상진단의학 6
 
6.0%
안과 6
 
6.0%
외과 6
 
6.0%
병원 6
 
6.0%
치과 6
 
6.0%
비뇨기과 5
 
5.0%
요양병원 5
 
5.0%
종합병원 4
 
4.0%
Other values (12) 39
39.0%

ma_fem_dc
Categorical

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2
58 
1
42 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row2
3rd row1
4th row1
5th row2

Common Values

ValueCountFrequency (%)
2 58
58.0%
1 42
42.0%

Length

2023-12-12T07:34:34.712942image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T07:34:34.785489image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2 58
58.0%
1 42
42.0%

agegrp_dc
Real number (ℝ)

ZEROS 

Distinct10
Distinct (%)10.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean49.1
Minimum0
Maximum90
Zeros2
Zeros (%)2.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-12T07:34:34.873128image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile10
Q130
median50
Q370
95-th percentile90
Maximum90
Range90
Interquartile range (IQR)40

Descriptive statistics

Standard deviation26.708112
Coefficient of variation (CV)0.5439534
Kurtosis-1.1941491
Mean49.1
Median Absolute Deviation (MAD)20
Skewness0.039896706
Sum4910
Variance713.32323
MonotonicityNot monotonic
2023-12-12T07:34:34.959621image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
40 14
14.0%
30 13
13.0%
60 13
13.0%
90 12
12.0%
80 12
12.0%
20 10
10.0%
10 10
10.0%
50 7
7.0%
70 7
7.0%
0 2
 
2.0%
ValueCountFrequency (%)
0 2
 
2.0%
10 10
10.0%
20 10
10.0%
30 13
13.0%
40 14
14.0%
50 7
7.0%
60 13
13.0%
70 7
7.0%
80 12
12.0%
90 12
12.0%
ValueCountFrequency (%)
90 12
12.0%
80 12
12.0%
70 7
7.0%
60 13
13.0%
50 7
7.0%
40 14
14.0%
30 13
13.0%
20 10
10.0%
10 10
10.0%
0 2
 
2.0%

tco_btc_u_ct
Real number (ℝ)

HIGH CORRELATION 

Distinct90
Distinct (%)90.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7040.13
Minimum1
Maximum58373
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-12T07:34:35.059469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3
Q134.75
median1748
Q36927
95-th percentile40314.9
Maximum58373
Range58372
Interquartile range (IQR)6892.25

Descriptive statistics

Standard deviation12457.283
Coefficient of variation (CV)1.7694678
Kurtosis5.8328174
Mean7040.13
Median Absolute Deviation (MAD)1737.5
Skewness2.5003821
Sum704013
Variance1.551839 × 108
MonotonicityNot monotonic
2023-12-12T07:34:35.166397image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5 3
 
3.0%
2 3
 
3.0%
23 3
 
3.0%
11 2
 
2.0%
3 2
 
2.0%
26 2
 
2.0%
15 2
 
2.0%
1697 1
 
1.0%
5628 1
 
1.0%
78 1
 
1.0%
Other values (80) 80
80.0%
ValueCountFrequency (%)
1 1
 
1.0%
2 3
3.0%
3 2
2.0%
4 1
 
1.0%
5 3
3.0%
9 1
 
1.0%
10 1
 
1.0%
11 2
2.0%
14 1
 
1.0%
15 2
2.0%
ValueCountFrequency (%)
58373 1
1.0%
50520 1
1.0%
47636 1
1.0%
47393 1
1.0%
41985 1
1.0%
40227 1
1.0%
36094 1
1.0%
31284 1
1.0%
26433 1
1.0%
23222 1
1.0%

tco_btc_u_am
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.0014164 × 108
Minimum11500
Maximum5.6364975 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-12T07:34:35.276168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum11500
5-th percentile140195
Q13024400
median1.1418963 × 108
Q34.8244698 × 108
95-th percentile2.6125042 × 109
Maximum5.6364975 × 109
Range5.636486 × 109
Interquartile range (IQR)4.7942258 × 108

Descriptive statistics

Standard deviation1.0341352 × 109
Coefficient of variation (CV)2.0676847
Kurtosis13.171069
Mean5.0014164 × 108
Median Absolute Deviation (MAD)1.138723 × 108
Skewness3.5081862
Sum5.0014164 × 1010
Variance1.0694357 × 1018
MonotonicityNot monotonic
2023-12-12T07:34:35.418835image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
146206235 1
 
1.0%
475716830 1
 
1.0%
1173680 1
 
1.0%
68833600 1
 
1.0%
1779505069 1
 
1.0%
1227730 1
 
1.0%
972400 1
 
1.0%
492772537 1
 
1.0%
847505135 1
 
1.0%
287865061 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
11500 1
1.0%
45000 1
1.0%
56000 1
1.0%
77770 1
1.0%
134400 1
1.0%
140500 1
1.0%
141940 1
1.0%
143500 1
1.0%
206800 1
1.0%
247700 1
1.0%
ValueCountFrequency (%)
5636497540 1
1.0%
5577191232 1
1.0%
4307202009 1
1.0%
3399969040 1
1.0%
3351032995 1
1.0%
2573634265 1
1.0%
1779505069 1
1.0%
1586411988 1
1.0%
1096087694 1
1.0%
1050035128 1
1.0%

Interactions

2023-12-12T07:34:33.606855image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:34:32.397855image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:34:32.699203image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:34:32.998681image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:34:33.296953image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:34:33.676661image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:34:32.455237image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:34:32.753243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:34:33.058747image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:34:33.362957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:34:33.735204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:34:32.511974image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:34:32.812043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:34:33.114495image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:34:33.423164image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:34:33.797471image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:34:32.569930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:34:32.868854image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:34:33.168680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:34:33.482661image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:34:33.870032image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:34:32.634712image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:34:32.934710image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:34:33.232721image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:34:33.546405image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T07:34:35.495245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
crymtco_btc_retco_btc_nmma_fem_dcagegrp_dctco_btc_u_cttco_btc_u_am
crym1.0000.0000.0000.0330.0000.0000.037
tco_btc_re0.0001.0001.0000.4750.0000.3420.000
tco_btc_nm0.0001.0001.0000.4550.3330.3520.000
ma_fem_dc0.0330.4750.4551.0000.0000.0000.000
agegrp_dc0.0000.0000.3330.0001.0000.3920.354
tco_btc_u_ct0.0000.3420.3520.0000.3921.0000.880
tco_btc_u_am0.0370.0000.0000.0000.3540.8801.000
2023-12-12T07:34:35.579703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ma_fem_dctco_btc_nm
ma_fem_dc1.0000.319
tco_btc_nm0.3191.000
2023-12-12T07:34:35.649179image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
crymtco_btc_reagegrp_dctco_btc_u_cttco_btc_u_amtco_btc_nmma_fem_dc
crym1.000-0.0370.179-0.029-0.0270.0000.000
tco_btc_re-0.0371.0000.088-0.044-0.0880.9310.349
agegrp_dc0.1790.0881.000-0.199-0.2450.1110.000
tco_btc_u_ct-0.029-0.044-0.1991.0000.9390.1190.000
tco_btc_u_am-0.027-0.088-0.2450.9391.0000.0000.000
tco_btc_nm0.0000.9310.1110.1190.0001.0000.319
ma_fem_dc0.0000.3490.0000.0000.0000.3191.000

Missing values

2023-12-12T07:34:33.956575image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T07:34:34.044670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

crymtco_btc_retco_btc_nmma_fem_dcagegrp_dctco_btc_u_cttco_btc_u_am
020200213동물(가축)병원1201697146206235
12019121종합병원203141940
22019069안과1309084294050180
320191014기타의료기관1302357135819694
42021029안과250178413399969040
520210620마취통증의학22095070690280
620210822한의원/한방병원15013837817165422
720190816정신과19017206800
82019048피부과1202511157941630
92021052병원28081067545210
crymtco_btc_retco_btc_nmma_fem_dcagegrp_dctco_btc_u_cttco_btc_u_am
9020200522한의원/한방병원2703743151306160
912020016성형외과180153077700
922021018피부과1203123226337526
9320200210비뇨기과210211500
9420190114기타의료기관27035425891950
952021096성형외과210117109000
962020096성형외과2601102443388890
972019126성형외과24065902573634265
9820190419소아청소년과160219330853180
9920210313동물(가축)병원210223346500