Overview

Dataset statistics

Number of variables9
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory7.6 KiB
Average record size in memory78.3 B

Variable types

Categorical5
Text2
Numeric2

Dataset

Description샘플 데이터
Author지디에스컨설팅그룹
URLhttps://www.bigdata-environment.kr/user/data_market/detail.do?id=153a4740-2e00-11ea-9713-eb3e5186fb38

Alerts

화력발전소 고유id has constant value ""Constant
화력발전소 명 has constant value ""Constant
화력발전소 주소 has constant value ""Constant
화력발전소 x 위치 has constant value ""Constant
화력발전소 y 위치 has constant value ""Constant
진료비 is highly overall correlated with 전국화력발전소 진료비 총액High correlation
전국화력발전소 진료비 총액 is highly overall correlated with 진료비High correlation
질병코드 has unique valuesUnique
질병명칭 has unique valuesUnique
진료비 has unique valuesUnique
전국화력발전소 진료비 총액 has unique valuesUnique

Reproduction

Analysis started2023-12-10 11:55:06.980881
Analysis finished2023-12-10 11:55:08.321211
Duration1.34 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

화력발전소 고유id
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
1
100 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 100
100.0%

Length

2023-12-10T20:55:08.414421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T20:55:08.573578image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 100
100.0%

화력발전소 명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
광양복합화력발전소
100 

Length

Max length9
Median length9
Mean length9
Min length9

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row광양복합화력발전소
2nd row광양복합화력발전소
3rd row광양복합화력발전소
4th row광양복합화력발전소
5th row광양복합화력발전소

Common Values

ValueCountFrequency (%)
광양복합화력발전소 100
100.0%

Length

2023-12-10T20:55:08.741187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T20:55:08.865400image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
광양복합화력발전소 100
100.0%

화력발전소 주소
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
전라남도 광양시 제철로 2148-567
100 

Length

Max length21
Median length21
Mean length21
Min length21

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row전라남도 광양시 제철로 2148-567
2nd row전라남도 광양시 제철로 2148-567
3rd row전라남도 광양시 제철로 2148-567
4th row전라남도 광양시 제철로 2148-567
5th row전라남도 광양시 제철로 2148-567

Common Values

ValueCountFrequency (%)
전라남도 광양시 제철로 2148-567 100
100.0%

Length

2023-12-10T20:55:09.022332image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T20:55:09.179883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
전라남도 100
25.0%
광양시 100
25.0%
제철로 100
25.0%
2148-567 100
25.0%

화력발전소 x 위치
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
34.88715
100 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row34.88715
2nd row34.88715
3rd row34.88715
4th row34.88715
5th row34.88715

Common Values

ValueCountFrequency (%)
34.88715 100
100.0%

Length

2023-12-10T20:55:09.334192image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T20:55:09.464581image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
34.88715 100
100.0%

화력발전소 y 위치
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
127.77604
100 

Length

Max length9
Median length9
Mean length9
Min length9

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row127.77604
2nd row127.77604
3rd row127.77604
4th row127.77604
5th row127.77604

Common Values

ValueCountFrequency (%)
127.77604 100
100.0%

Length

2023-12-10T20:55:09.594304image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T20:55:09.724929image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
127.77604 100
100.0%

질병코드
Text

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-10T20:55:10.248698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters300
Distinct characters12
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique100 ?
Unique (%)100.0%

Sample

1st rowI00
2nd rowI01
3rd rowI05
4th rowI06
5th rowI07
ValueCountFrequency (%)
i00 1
 
1.0%
i82 1
 
1.0%
i99 1
 
1.0%
i98 1
 
1.0%
i97 1
 
1.0%
i95 1
 
1.0%
i89 1
 
1.0%
i88 1
 
1.0%
i87 1
 
1.0%
i86 1
 
1.0%
Other values (90) 90
90.0%
2023-12-10T20:55:10.945055image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
I 74
24.7%
0 27
 
9.0%
1 27
 
9.0%
J 26
 
8.7%
3 25
 
8.3%
2 23
 
7.7%
6 19
 
6.3%
8 19
 
6.3%
4 18
 
6.0%
7 17
 
5.7%
Other values (2) 25
 
8.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 200
66.7%
Uppercase Letter 100
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 27
13.5%
1 27
13.5%
3 25
12.5%
2 23
11.5%
6 19
9.5%
8 19
9.5%
4 18
9.0%
7 17
8.5%
5 14
7.0%
9 11
5.5%
Uppercase Letter
ValueCountFrequency (%)
I 74
74.0%
J 26
 
26.0%

Most occurring scripts

ValueCountFrequency (%)
Common 200
66.7%
Latin 100
33.3%

Most frequent character per script

Common
ValueCountFrequency (%)
0 27
13.5%
1 27
13.5%
3 25
12.5%
2 23
11.5%
6 19
9.5%
8 19
9.5%
4 18
9.0%
7 17
8.5%
5 14
7.0%
9 11
5.5%
Latin
ValueCountFrequency (%)
I 74
74.0%
J 26
 
26.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 300
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
I 74
24.7%
0 27
 
9.0%
1 27
 
9.0%
J 26
 
8.7%
3 25
 
8.3%
2 23
 
7.7%
6 19
 
6.3%
8 19
 
6.3%
4 18
 
6.0%
7 17
 
5.7%
Other values (2) 25
 
8.3%

질병명칭
Text

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-10T20:55:11.797698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length20
Mean length12.5
Min length2

Characters and Unicode

Total characters1250
Distinct characters163
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique100 ?
Unique (%)100.0%

Sample

1st row심장 침습이 없는 류마티스열
2nd row심장 침습이 있는 류마티스 열
3rd row류마티스성 승모판 질환
4th row류마티스성 대동맥판 질환
5th row류마티스성 삼첨판 질환
ValueCountFrequency (%)
23
 
6.7%
기타 21
 
6.1%
질환 17
 
4.9%
장애 16
 
4.6%
급성 16
 
4.6%
심장 12
 
3.5%
달리 11
 
3.2%
않은 8
 
2.3%
질환에서의 7
 
2.0%
분류된 7
 
2.0%
Other values (144) 207
60.0%
2023-12-10T20:55:12.495561image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
245
 
19.6%
50
 
4.0%
35
 
2.8%
32
 
2.6%
31
 
2.5%
29
 
2.3%
28
 
2.2%
28
 
2.2%
26
 
2.1%
25
 
2.0%
Other values (153) 721
57.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 991
79.3%
Space Separator 245
 
19.6%
Close Punctuation 6
 
0.5%
Open Punctuation 6
 
0.5%
Other Punctuation 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
50
 
5.0%
35
 
3.5%
32
 
3.2%
31
 
3.1%
29
 
2.9%
28
 
2.8%
28
 
2.8%
26
 
2.6%
25
 
2.5%
24
 
2.4%
Other values (147) 683
68.9%
Close Punctuation
ValueCountFrequency (%)
) 4
66.7%
] 2
33.3%
Open Punctuation
ValueCountFrequency (%)
( 4
66.7%
[ 2
33.3%
Space Separator
ValueCountFrequency (%)
245
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 991
79.3%
Common 259
 
20.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
50
 
5.0%
35
 
3.5%
32
 
3.2%
31
 
3.1%
29
 
2.9%
28
 
2.8%
28
 
2.8%
26
 
2.6%
25
 
2.5%
24
 
2.4%
Other values (147) 683
68.9%
Common
ValueCountFrequency (%)
245
94.6%
) 4
 
1.5%
( 4
 
1.5%
[ 2
 
0.8%
] 2
 
0.8%
, 2
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 991
79.3%
ASCII 259
 
20.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
245
94.6%
) 4
 
1.5%
( 4
 
1.5%
[ 2
 
0.8%
] 2
 
0.8%
, 2
 
0.8%
Hangul
ValueCountFrequency (%)
50
 
5.0%
35
 
3.5%
32
 
3.2%
31
 
3.1%
29
 
2.9%
28
 
2.8%
28
 
2.8%
26
 
2.6%
25
 
2.5%
24
 
2.4%
Other values (147) 683
68.9%

진료비
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.7620392 × 108
Minimum10600
Maximum6.9102895 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T20:55:12.678998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10600
5-th percentile218230
Q15580970
median55376010
Q32.9796094 × 108
95-th percentile1.7528709 × 109
Maximum6.9102895 × 109
Range6.9102789 × 109
Interquartile range (IQR)2.9237997 × 108

Descriptive statistics

Standard deviation9.1921357 × 108
Coefficient of variation (CV)2.4433918
Kurtosis28.928282
Mean3.7620392 × 108
Median Absolute Deviation (MAD)54677985
Skewness4.8839123
Sum3.7620392 × 1010
Variance8.4495359 × 1017
MonotonicityNot monotonic
2023-12-10T20:55:12.858871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
497480 1
 
1.0%
717814440 1
 
1.0%
587771160 1
 
1.0%
559250 1
 
1.0%
158950 1
 
1.0%
221350 1
 
1.0%
23959180 1
 
1.0%
19013620 1
 
1.0%
38480980 1
 
1.0%
20239570 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
10600 1
1.0%
39760 1
1.0%
76930 1
1.0%
127360 1
1.0%
158950 1
1.0%
221350 1
1.0%
228320 1
1.0%
429150 1
1.0%
497480 1
1.0%
559250 1
1.0%
ValueCountFrequency (%)
6910289520 1
1.0%
4313225920 1
1.0%
3046465630 1
1.0%
1858161490 1
1.0%
1814681340 1
1.0%
1749617770 1
1.0%
1383637170 1
1.0%
1366091730 1
1.0%
1169616100 1
1.0%
909777030 1
1.0%

전국화력발전소 진료비 총액
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.2499077 × 108
Minimum127420
Maximum1.3301036 × 1010
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T20:55:13.044293image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum127420
5-th percentile2749277.9
Q118692694
median82892182
Q35.1264738 × 108
95-th percentile2.2429669 × 109
Maximum1.3301036 × 1010
Range1.3300909 × 1010
Interquartile range (IQR)4.9395469 × 108

Descriptive statistics

Standard deviation1.6387864 × 109
Coefficient of variation (CV)2.622097
Kurtosis38.720386
Mean6.2499077 × 108
Median Absolute Deviation (MAD)79244122
Skewness5.67735
Sum6.2499077 × 1010
Variance2.685621 × 1018
MonotonicityNot monotonic
2023-12-10T20:55:13.283884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5308830 1
 
1.0%
1145125293 1
 
1.0%
1095585940 1
 
1.0%
6112204 1
 
1.0%
19455481 1
 
1.0%
2298482 1
 
1.0%
22977200 1
 
1.0%
21985814 1
 
1.0%
65953812 1
 
1.0%
33787479 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
127420 1
1.0%
201316 1
1.0%
283152 1
1.0%
1126160 1
1.0%
2298482 1
1.0%
2773004 1
1.0%
3540037 1
1.0%
3756084 1
1.0%
3913507 1
1.0%
4063330 1
1.0%
ValueCountFrequency (%)
13301036376 1
1.0%
7180681515 1
1.0%
4974559794 1
1.0%
2929218036 1
1.0%
2885045378 1
1.0%
2209173258 1
1.0%
2013268585 1
1.0%
1868808142 1
1.0%
1671938068 1
1.0%
1483711556 1
1.0%

Interactions

2023-12-10T20:55:07.646336image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:55:07.338756image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:55:07.806023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:55:07.479903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T20:55:13.432246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
질병코드질병명칭진료비전국화력발전소 진료비 총액
질병코드1.0001.0001.0001.000
질병명칭1.0001.0001.0001.000
진료비1.0001.0001.0000.994
전국화력발전소 진료비 총액1.0001.0000.9941.000
2023-12-10T20:55:13.576017image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
진료비전국화력발전소 진료비 총액
진료비1.0000.939
전국화력발전소 진료비 총액0.9391.000

Missing values

2023-12-10T20:55:08.012117image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T20:55:08.237831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

화력발전소 고유id화력발전소 명화력발전소 주소화력발전소 x 위치화력발전소 y 위치질병코드질병명칭진료비전국화력발전소 진료비 총액
01광양복합화력발전소전라남도 광양시 제철로 2148-56734.88715127.77604I00심장 침습이 없는 류마티스열4974805308830
11광양복합화력발전소전라남도 광양시 제철로 2148-56734.88715127.77604I01심장 침습이 있는 류마티스 열801610283152
21광양복합화력발전소전라남도 광양시 제철로 2148-56734.88715127.77604I05류마티스성 승모판 질환1729446082704388
31광양복합화력발전소전라남도 광양시 제철로 2148-56734.88715127.77604I06류마티스성 대동맥판 질환595986014438474
41광양복합화력발전소전라남도 광양시 제철로 2148-56734.88715127.77604I07류마티스성 삼첨판 질환444430024888469
51광양복합화력발전소전라남도 광양시 제철로 2148-56734.88715127.77604I08다발성 판막 질환6455049030874421
61광양복합화력발전소전라남도 광양시 제철로 2148-56734.88715127.77604I09기타 류마티스성 심장 질환7106805973399
71광양복합화력발전소전라남도 광양시 제철로 2148-56734.88715127.77604I10본태성(원발성) 고혈압691028952013301036376
81광양복합화력발전소전라남도 광양시 제철로 2148-56734.88715127.77604I11고혈압성 심장 질환180424650726123800
91광양복합화력발전소전라남도 광양시 제철로 2148-56734.88715127.77604I12고혈압성 신장 질환1422659072106700
화력발전소 고유id화력발전소 명화력발전소 주소화력발전소 x 위치화력발전소 y 위치질병코드질병명칭진료비전국화력발전소 진료비 총액
901광양복합화력발전소전라남도 광양시 제철로 2148-56734.88715127.77604J20급성 기관지염43132259207180681515
911광양복합화력발전소전라남도 광양시 제철로 2148-56734.88715127.77604J21급성 세기관지염1169616100787304641
921광양복합화력발전소전라남도 광양시 제철로 2148-56734.88715127.77604J22상세불명의 급성 하기도 감염292778730229010373
931광양복합화력발전소전라남도 광양시 제철로 2148-56734.88715127.77604J30혈관운동성 및 알레르기성 비염18146813402209173258
941광양복합화력발전소전라남도 광양시 제철로 2148-56734.88715127.77604J31만성 비염,비인두염 및 인두염53145210317158249
951광양복합화력발전소전라남도 광양시 제철로 2148-56734.88715127.77604J32만성 부비동염4834629901181221254
961광양복합화력발전소전라남도 광양시 제철로 2148-56734.88715127.77604J33비용종1732579047444932
971광양복합화력발전소전라남도 광양시 제철로 2148-56734.88715127.77604J34코 및 비동의 기타 장애234362650482614565
981광양복합화력발전소전라남도 광양시 제철로 2148-56734.88715127.77604J35편도 및 아데노이드의 만성 질환183767180331869470
991광양복합화력발전소전라남도 광양시 제철로 2148-56734.88715127.77604J36편도주위 농양261114050296829543