Overview

Dataset statistics

Number of variables11
Number of observations100
Missing cells18
Missing cells (%)1.6%
Duplicate rows2
Duplicate rows (%)2.0%
Total size in memory9.1 KiB
Average record size in memory93.3 B

Variable types

Categorical4
Text4
Numeric3

Alerts

se_nm has constant value ""Constant
base_ymd has constant value ""Constant
Dataset has 2 (2.0%) duplicate rowsDuplicates
area_nm is highly overall correlated with city_gn_gu_cd and 3 other fieldsHigh correlation
city_do_cd is highly overall correlated with city_gn_gu_cd and 3 other fieldsHigh correlation
city_gn_gu_cd is highly overall correlated with xpos_lo and 2 other fieldsHigh correlation
xpos_lo is highly overall correlated with city_gn_gu_cd and 2 other fieldsHigh correlation
ypos_la is highly overall correlated with city_do_cd and 1 other fieldsHigh correlation
city_do_cd is highly imbalanced (54.9%)Imbalance
area_nm is highly imbalanced (56.5%)Imbalance
homepage_url has 17 (17.0%) missing valuesMissing

Reproduction

Analysis started2023-12-10 10:02:26.432171
Analysis finished2023-12-10 10:02:30.229849
Duration3.8 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

se_nm
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
외국인의료시설
100 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row외국인의료시설
2nd row외국인의료시설
3rd row외국인의료시설
4th row외국인의료시설
5th row외국인의료시설

Common Values

ValueCountFrequency (%)
외국인의료시설 100
100.0%

Length

2023-12-10T19:02:30.348848image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:02:30.510512image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
외국인의료시설 100
100.0%
Distinct97
Distinct (%)97.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-10T19:02:30.871636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length13
Mean length7.76
Min length4

Characters and Unicode

Total characters776
Distinct characters184
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique94 ?
Unique (%)94.0%

Sample

1st row연합정형외과병원
2nd row고은몸매의원
3rd row더한의원
4th row강릉동인병원
5th row이해욱성형외과의원
ValueCountFrequency (%)
스타필의원 2
 
1.9%
센트럴성형외과의원 2
 
1.9%
모아한의원 2
 
1.9%
라앤미의원 1
 
1.0%
부천자생한방병원 1
 
1.0%
한양대학교구리병원 1
 
1.0%
의료법인효산의료재단지샘병원 1
 
1.0%
이경숙산부인과 1
 
1.0%
연세항맥외과의원 1
 
1.0%
엔젤산부인과 1
 
1.0%
Other values (91) 91
87.5%
2023-12-10T19:02:31.659233image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
94
 
12.1%
60
 
7.7%
52
 
6.7%
37
 
4.8%
20
 
2.6%
16
 
2.1%
16
 
2.1%
15
 
1.9%
13
 
1.7%
13
 
1.7%
Other values (174) 440
56.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 761
98.1%
Decimal Number 9
 
1.2%
Space Separator 4
 
0.5%
Close Punctuation 1
 
0.1%
Open Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
94
 
12.4%
60
 
7.9%
52
 
6.8%
37
 
4.9%
20
 
2.6%
16
 
2.1%
16
 
2.1%
15
 
2.0%
13
 
1.7%
13
 
1.7%
Other values (165) 425
55.8%
Decimal Number
ValueCountFrequency (%)
2 3
33.3%
1 2
22.2%
8 1
 
11.1%
5 1
 
11.1%
6 1
 
11.1%
3 1
 
11.1%
Space Separator
ValueCountFrequency (%)
4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 761
98.1%
Common 15
 
1.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
94
 
12.4%
60
 
7.9%
52
 
6.8%
37
 
4.9%
20
 
2.6%
16
 
2.1%
16
 
2.1%
15
 
2.0%
13
 
1.7%
13
 
1.7%
Other values (165) 425
55.8%
Common
ValueCountFrequency (%)
4
26.7%
2 3
20.0%
1 2
13.3%
) 1
 
6.7%
( 1
 
6.7%
8 1
 
6.7%
5 1
 
6.7%
6 1
 
6.7%
3 1
 
6.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 761
98.1%
ASCII 15
 
1.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
94
 
12.4%
60
 
7.9%
52
 
6.8%
37
 
4.9%
20
 
2.6%
16
 
2.1%
16
 
2.1%
15
 
2.0%
13
 
1.7%
13
 
1.7%
Other values (165) 425
55.8%
ASCII
ValueCountFrequency (%)
4
26.7%
2 3
20.0%
1 2
13.3%
) 1
 
6.7%
( 1
 
6.7%
8 1
 
6.7%
5 1
 
6.7%
6 1
 
6.7%
3 1
 
6.7%
Distinct91
Distinct (%)91.9%
Missing1
Missing (%)1.0%
Memory size932.0 B
2023-12-10T19:02:32.192802image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length53
Median length34
Mean length22.777778
Min length10

Characters and Unicode

Total characters2255
Distinct characters195
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique84 ?
Unique (%)84.8%

Sample

1st row대전광역시 서구 계룡로 619
2nd row충청북도 청주시 흥덕구 풍산로 42 대형빌딩 3층5층 302호
3rd row서울특별시 강서구 화곡로 153 4층 157924
4th row강원도 강릉시 강릉대로419번길 42
5th row강원도 강릉시 경강로 2109
ValueCountFrequency (%)
경기도 76
 
14.5%
고양시 34
 
6.5%
일산동구 19
 
3.6%
강원도 19
 
3.6%
부천시 15
 
2.9%
중앙로 11
 
2.1%
덕양구 8
 
1.5%
원주시 7
 
1.3%
일산서구 7
 
1.3%
광명시 6
 
1.1%
Other values (237) 323
61.5%
2023-12-10T19:02:33.013865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
432
 
19.2%
100
 
4.4%
98
 
4.3%
88
 
3.9%
84
 
3.7%
76
 
3.4%
1 69
 
3.1%
2 55
 
2.4%
54
 
2.4%
45
 
2.0%
Other values (185) 1154
51.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1433
63.5%
Space Separator 432
 
19.2%
Decimal Number 355
 
15.7%
Dash Punctuation 12
 
0.5%
Close Punctuation 8
 
0.4%
Open Punctuation 8
 
0.4%
Uppercase Letter 4
 
0.2%
Math Symbol 1
 
< 0.1%
Other Punctuation 1
 
< 0.1%
Other Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
100
 
7.0%
98
 
6.8%
88
 
6.1%
84
 
5.9%
76
 
5.3%
54
 
3.8%
45
 
3.1%
45
 
3.1%
43
 
3.0%
42
 
2.9%
Other values (164) 758
52.9%
Decimal Number
ValueCountFrequency (%)
1 69
19.4%
2 55
15.5%
7 36
10.1%
3 34
9.6%
4 32
9.0%
0 31
8.7%
8 29
8.2%
5 27
 
7.6%
6 25
 
7.0%
9 17
 
4.8%
Uppercase Letter
ValueCountFrequency (%)
G 1
25.0%
D 1
25.0%
S 1
25.0%
I 1
25.0%
Space Separator
ValueCountFrequency (%)
432
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1434
63.6%
Common 817
36.2%
Latin 4
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
100
 
7.0%
98
 
6.8%
88
 
6.1%
84
 
5.9%
76
 
5.3%
54
 
3.8%
45
 
3.1%
45
 
3.1%
43
 
3.0%
42
 
2.9%
Other values (165) 759
52.9%
Common
ValueCountFrequency (%)
432
52.9%
1 69
 
8.4%
2 55
 
6.7%
7 36
 
4.4%
3 34
 
4.2%
4 32
 
3.9%
0 31
 
3.8%
8 29
 
3.5%
5 27
 
3.3%
6 25
 
3.1%
Other values (6) 47
 
5.8%
Latin
ValueCountFrequency (%)
G 1
25.0%
D 1
25.0%
S 1
25.0%
I 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1433
63.5%
ASCII 821
36.4%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
432
52.6%
1 69
 
8.4%
2 55
 
6.7%
7 36
 
4.4%
3 34
 
4.1%
4 32
 
3.9%
0 31
 
3.8%
8 29
 
3.5%
5 27
 
3.3%
6 25
 
3.0%
Other values (10) 51
 
6.2%
Hangul
ValueCountFrequency (%)
100
 
7.0%
98
 
6.8%
88
 
6.1%
84
 
5.9%
76
 
5.3%
54
 
3.8%
45
 
3.1%
45
 
3.1%
43
 
3.0%
42
 
2.9%
Other values (164) 758
52.9%
None
ValueCountFrequency (%)
1
100.0%

city_do_cd
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct5
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
41
76 
42
19 
43
 
2
11
 
2
30
 
1

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row30
2nd row43
3rd row11
4th row42
5th row42

Common Values

ValueCountFrequency (%)
41 76
76.0%
42 19
 
19.0%
43 2
 
2.0%
11 2
 
2.0%
30 1
 
1.0%

Length

2023-12-10T19:02:33.318332image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:02:33.572747image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
41 76
76.0%
42 19
 
19.0%
43 2
 
2.0%
11 2
 
2.0%
30 1
 
1.0%

city_gn_gu_cd
Real number (ℝ)

HIGH CORRELATION 

Distinct19
Distinct (%)19.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean41098.75
Minimum11500
Maximum43130
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T19:02:33.831723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum11500
5-th percentile41190
Q141281
median41286
Q341570
95-th percentile42172
Maximum43130
Range31630
Interquartile range (IQR)289

Descriptive statistics

Standard deviation3228.5736
Coefficient of variation (CV)0.078556491
Kurtosis74.135404
Mean41098.75
Median Absolute Deviation (MAD)76
Skewness-8.32717
Sum4109875
Variance10423687
MonotonicityNot monotonic
2023-12-10T19:02:34.063245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=19)
ValueCountFrequency (%)
41285 19
19.0%
41190 15
15.0%
41281 8
8.0%
42130 7
 
7.0%
41287 7
 
7.0%
41410 6
 
6.0%
41210 6
 
6.0%
42150 5
 
5.0%
41360 5
 
5.0%
42110 4
 
4.0%
Other values (9) 18
18.0%
ValueCountFrequency (%)
11500 1
 
1.0%
30170 1
 
1.0%
41190 15
15.0%
41210 6
 
6.0%
41281 8
8.0%
41285 19
19.0%
41287 7
 
7.0%
41290 2
 
2.0%
41310 4
 
4.0%
41360 5
 
5.0%
ValueCountFrequency (%)
43130 2
 
2.0%
43113 1
 
1.0%
42210 2
 
2.0%
42170 1
 
1.0%
42150 5
5.0%
42130 7
7.0%
42110 4
4.0%
41570 4
4.0%
41410 6
6.0%
41360 5
5.0%

xpos_lo
Real number (ℝ)

HIGH CORRELATION 

Distinct88
Distinct (%)88.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean127.08027
Minimum123.74177
Maximum129.11525
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T19:02:34.418149image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum123.74177
5-th percentile126.71541
Q1126.77236
median126.83139
Q3127.18531
95-th percentile128.85844
Maximum129.11525
Range5.373483
Interquartile range (IQR)0.4129435

Descriptive statistics

Standard deviation0.78258624
Coefficient of variation (CV)0.0061582043
Kurtosis6.4816865
Mean127.08027
Median Absolute Deviation (MAD)0.0789785
Skewness-0.5396981
Sum12708.027
Variance0.61244122
MonotonicityNot monotonic
2023-12-10T19:02:35.043951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
126.773396 4
 
4.0%
126.752416 3
 
3.0%
126.836091 2
 
2.0%
127.132433 2
 
2.0%
123.741771 2
 
2.0%
126.747023 2
 
2.0%
126.761922 2
 
2.0%
126.77839 2
 
2.0%
126.944102 2
 
2.0%
126.933616 1
 
1.0%
Other values (78) 78
78.0%
ValueCountFrequency (%)
123.741771 2
2.0%
126.660175 1
 
1.0%
126.700105 1
 
1.0%
126.710698 1
 
1.0%
126.715659 1
 
1.0%
126.747023 2
2.0%
126.750373 1
 
1.0%
126.751548 1
 
1.0%
126.752416 3
3.0%
126.753158 1
 
1.0%
ValueCountFrequency (%)
129.115254 1
1.0%
128.906885 1
1.0%
128.897679 1
1.0%
128.897185 1
1.0%
128.870009 1
1.0%
128.857827 1
1.0%
128.588612 1
1.0%
128.578348 1
1.0%
127.954286 1
1.0%
127.95144 1
1.0%

ypos_la
Real number (ℝ)

HIGH CORRELATION 

Distinct88
Distinct (%)88.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean37.46612
Minimum32.522025
Maximum38.216488
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T19:02:35.307618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum32.522025
5-th percentile37.325666
Q137.482562
median37.625203
Q337.660936
95-th percentile37.857086
Maximum38.216488
Range5.694463
Interquartile range (IQR)0.1783735

Descriptive statistics

Standard deviation0.746725
Coefficient of variation (CV)0.019930673
Kurtosis38.260026
Mean37.46612
Median Absolute Deviation (MAD)0.1199725
Skewness-5.9979084
Sum3746.612
Variance0.55759823
MonotonicityNot monotonic
2023-12-10T19:02:35.620659image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
37.658407 4
 
4.0%
37.505231 3
 
3.0%
37.656997 2
 
2.0%
37.60145 2
 
2.0%
32.522025 2
 
2.0%
37.675961 2
 
2.0%
37.670262 2
 
2.0%
37.651737 2
 
2.0%
37.34819 2
 
2.0%
37.359413 1
 
1.0%
Other values (78) 78
78.0%
ValueCountFrequency (%)
32.522025 2
2.0%
36.340046 1
1.0%
36.626769 1
1.0%
36.981307 1
1.0%
37.34379 1
1.0%
37.344231 1
1.0%
37.345522 1
1.0%
37.34596 1
1.0%
37.347531 1
1.0%
37.34819 2
2.0%
ValueCountFrequency (%)
38.216488 1
1.0%
38.198042 1
1.0%
37.884298 1
1.0%
37.880262 1
1.0%
37.874892 1
1.0%
37.856149 1
1.0%
37.818416 1
1.0%
37.774061 1
1.0%
37.771174 1
1.0%
37.769519 1
1.0%

area_nm
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct6
Distinct (%)6.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
경기
75 
강원
19 
충북
 
2
서울
 
2
대전
 
1

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique2 ?
Unique (%)2.0%

Sample

1st row대전
2nd row충북
3rd row서울
4th row강원
5th row강원

Common Values

ValueCountFrequency (%)
경기 75
75.0%
강원 19
 
19.0%
충북 2
 
2.0%
서울 2
 
2.0%
대전 1
 
1.0%
기타 1
 
1.0%

Length

2023-12-10T19:02:35.890227image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:02:36.086272image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경기 75
75.0%
강원 19
 
19.0%
충북 2
 
2.0%
서울 2
 
2.0%
대전 1
 
1.0%
기타 1
 
1.0%

tel_no
Text

Distinct96
Distinct (%)96.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-10T19:02:36.488669image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length11.76
Min length9

Characters and Unicode

Total characters1176
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique92 ?
Unique (%)92.0%

Sample

1st row042-536-0911
2nd row043-235-6111
3rd row02-2605-0550
4th row033-651-6161
5th row033-648-4353
ValueCountFrequency (%)
031-965-2255 2
 
2.0%
033-765-7533 2
 
2.0%
1644-9118 2
 
2.0%
031-901-5575 2
 
2.0%
1661-6888 1
 
1.0%
02-897-1075 1
 
1.0%
031-441-9020 1
 
1.0%
031-393-5534 1
 
1.0%
031-393-3305 1
 
1.0%
031-394-0075 1
 
1.0%
Other values (86) 86
86.0%
2023-12-10T19:02:37.161844image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 201
17.1%
- 193
16.4%
3 168
14.3%
1 142
12.1%
2 94
8.0%
5 86
7.3%
7 73
 
6.2%
9 72
 
6.1%
8 52
 
4.4%
4 49
 
4.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 983
83.6%
Dash Punctuation 193
 
16.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 201
20.4%
3 168
17.1%
1 142
14.4%
2 94
9.6%
5 86
8.7%
7 73
 
7.4%
9 72
 
7.3%
8 52
 
5.3%
4 49
 
5.0%
6 46
 
4.7%
Dash Punctuation
ValueCountFrequency (%)
- 193
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1176
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 201
17.1%
- 193
16.4%
3 168
14.3%
1 142
12.1%
2 94
8.0%
5 86
7.3%
7 73
 
6.2%
9 72
 
6.1%
8 52
 
4.4%
4 49
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1176
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 201
17.1%
- 193
16.4%
3 168
14.3%
1 142
12.1%
2 94
8.0%
5 86
7.3%
7 73
 
6.2%
9 72
 
6.1%
8 52
 
4.4%
4 49
 
4.2%

homepage_url
Text

MISSING 

Distinct80
Distinct (%)96.4%
Missing17
Missing (%)17.0%
Memory size932.0 B
2023-12-10T19:02:37.576357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length79
Median length47
Mean length28.445783
Min length18

Characters and Unicode

Total characters2361
Distinct characters44
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique77 ?
Unique (%)92.8%

Sample

1st rowhttp://www.yhbw.co.kr/
2nd rowhttp://www.vline-sline.kr/
3rd rowhttp://www.thehani.co.kr/
4th rowhttp://www.dong-in.or.kr/
5th rowhttp://www.psdoctor.co.kr/
ValueCountFrequency (%)
http://www.centralps.co.kr 2
 
2.4%
http://starfeel.kr 2
 
2.4%
http://www.moadoctor.com 2
 
2.4%
http://www.whiteapple.kr 1
 
1.2%
https://www.gwhospital.co.kr/main 1
 
1.2%
http://www.ncmedical.co.kr 1
 
1.2%
http://www.uroguri.com 1
 
1.2%
http://guri.hyumc.com 1
 
1.2%
https://guri.hyumc.com 1
 
1.2%
http://www.gillmedi.com/main/ko/index.html 1
 
1.2%
Other values (70) 70
84.3%
2023-12-10T19:02:38.319287image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 264
 
11.2%
t 211
 
8.9%
. 205
 
8.7%
w 188
 
8.0%
h 130
 
5.5%
p 128
 
5.4%
o 128
 
5.4%
c 108
 
4.6%
e 101
 
4.3%
r 90
 
3.8%
Other values (34) 808
34.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1733
73.4%
Other Punctuation 565
 
23.9%
Decimal Number 41
 
1.7%
Uppercase Letter 10
 
0.4%
Dash Punctuation 6
 
0.3%
Math Symbol 3
 
0.1%
Connector Punctuation 3
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
t 211
12.2%
w 188
 
10.8%
h 130
 
7.5%
p 128
 
7.4%
o 128
 
7.4%
c 108
 
6.2%
e 101
 
5.8%
r 90
 
5.2%
n 84
 
4.8%
m 80
 
4.6%
Other values (14) 485
28.0%
Decimal Number
ValueCountFrequency (%)
2 13
31.7%
1 6
14.6%
0 4
 
9.8%
8 4
 
9.8%
5 4
 
9.8%
6 3
 
7.3%
3 3
 
7.3%
7 2
 
4.9%
4 2
 
4.9%
Other Punctuation
ValueCountFrequency (%)
/ 264
46.7%
. 205
36.3%
: 85
 
15.0%
% 8
 
1.4%
? 3
 
0.5%
Uppercase Letter
ValueCountFrequency (%)
F 6
60.0%
R 2
 
20.0%
A 2
 
20.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%
Math Symbol
ValueCountFrequency (%)
= 3
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1743
73.8%
Common 618
 
26.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
t 211
12.1%
w 188
 
10.8%
h 130
 
7.5%
p 128
 
7.3%
o 128
 
7.3%
c 108
 
6.2%
e 101
 
5.8%
r 90
 
5.2%
n 84
 
4.8%
m 80
 
4.6%
Other values (17) 495
28.4%
Common
ValueCountFrequency (%)
/ 264
42.7%
. 205
33.2%
: 85
 
13.8%
2 13
 
2.1%
% 8
 
1.3%
- 6
 
1.0%
1 6
 
1.0%
0 4
 
0.6%
8 4
 
0.6%
5 4
 
0.6%
Other values (7) 19
 
3.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2361
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 264
 
11.2%
t 211
 
8.9%
. 205
 
8.7%
w 188
 
8.0%
h 130
 
5.5%
p 128
 
5.4%
o 128
 
5.4%
c 108
 
4.6%
e 101
 
4.3%
r 90
 
3.8%
Other values (34) 808
34.2%

base_ymd
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2020-12-31
100 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020-12-31
2nd row2020-12-31
3rd row2020-12-31
4th row2020-12-31
5th row2020-12-31

Common Values

ValueCountFrequency (%)
2020-12-31 100
100.0%

Length

2023-12-10T19:02:38.767414image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:02:38.938934image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020-12-31 100
100.0%

Interactions

2023-12-10T19:02:29.124591image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:02:28.065013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:02:28.686472image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:02:29.302538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:02:28.249631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:02:28.843352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:02:29.453324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:02:28.463398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:02:28.992353image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T19:02:39.054436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
entrp_nmload_addrcity_do_cdcity_gn_gu_cdxpos_loypos_laarea_nmtel_nohomepage_url
entrp_nm1.0000.9991.0001.0001.0001.0001.0001.0001.000
load_addr0.9991.0001.0001.0001.0001.0001.0000.9990.999
city_do_cd1.0001.0001.0001.0000.9580.9911.0001.0001.000
city_gn_gu_cd1.0001.0001.0001.0000.1371.0001.0001.0001.000
xpos_lo1.0001.0000.9580.1371.0000.8740.8121.0001.000
ypos_la1.0001.0000.9911.0000.8741.0000.9121.0001.000
area_nm1.0001.0001.0001.0000.8120.9121.0001.0001.000
tel_no1.0000.9991.0001.0001.0001.0001.0001.0001.000
homepage_url1.0000.9991.0001.0001.0001.0001.0001.0001.000
2023-12-10T19:02:39.255526image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
area_nmcity_do_cd
area_nm1.0000.995
city_do_cd0.9951.000
2023-12-10T19:02:39.397766image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
city_gn_gu_cdxpos_loypos_lacity_do_cdarea_nm
city_gn_gu_cd1.0000.5800.1680.8490.843
xpos_lo0.5801.000-0.0140.7080.701
ypos_la0.168-0.0141.0000.8640.859
city_do_cd0.8490.7080.8641.0000.995
area_nm0.8430.7010.8590.9951.000

Missing values

2023-12-10T19:02:29.662138image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T19:02:29.942774image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-10T19:02:30.135620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

se_nmentrp_nmload_addrcity_do_cdcity_gn_gu_cdxpos_loypos_laarea_nmtel_nohomepage_urlbase_ymd
0외국인의료시설연합정형외과병원대전광역시 서구 계룡로 6193030170127.39164936.340046대전042-536-0911http://www.yhbw.co.kr/2020-12-31
1외국인의료시설고은몸매의원충청북도 청주시 흥덕구 풍산로 42 대형빌딩 3층5층 302호4343113127.43534636.626769충북043-235-6111http://www.vline-sline.kr/2020-12-31
2외국인의료시설더한의원서울특별시 강서구 화곡로 153 4층 1579241111500123.74177132.522025서울02-2605-0550http://www.thehani.co.kr/2020-12-31
3외국인의료시설강릉동인병원강원도 강릉시 강릉대로419번길 424242150128.90688537.774061강원033-651-6161http://www.dong-in.or.kr/2020-12-31
4외국인의료시설이해욱성형외과의원강원도 강릉시 경강로 21094242150128.89718537.755683강원033-648-4353http://www.psdoctor.co.kr/2020-12-31
5외국인의료시설최정준성형외과의원강원도 강릉시 경강로 21104242150128.89767937.755265강원033-643-7033http://www.choips.co.kr/2020-12-31
6외국인의료시설강릉아산병원강원도 강릉시 사천면 방동길 38 강릉아산병원4242150128.85782737.818416강원033-610-4111http://www.gnah.co.kr/2020-12-31
7외국인의료시설다솜연합치과의원충청북도 충주시 봉계1길 56 2 3 4층4343130127.91578536.981307충북043-848-7528<NA>2020-12-31
8외국인의료시설강릉원주대학교치과병원강원도 강릉시 죽헌길 74242150128.87000937.769519강원033-640-3114https://www.gwnudh.or.kr/2020-12-31
9외국인의료시설누가이비인후과강원도 동해시 한섬로 111-7 현진빌딩 3층4242170129.11525437.52225강원033-535-3600http://luke-ent.co.kr/2020-12-31
se_nmentrp_nmload_addrcity_do_cdcity_gn_gu_cdxpos_loypos_laarea_nmtel_nohomepage_urlbase_ymd
90외국인의료시설사과나무치과의원경기도 부천시 부천로 204141190126.78354737.486039경기032-667-2800https://appledental.kr:5016/apple2015/2020-12-31
91외국인의료시설부천삼성의원경기도 부천시 부천로90번길 84141190126.78480637.492053경기032-651-7939<NA>2020-12-31
92외국인의료시설서울삼성치과의원경기도 부천시 상동로 784141190126.75315837.504094경기032-328-2825http://seoulsamsung.net/2020-12-31
93외국인의료시설예쁜세상성형외과의원경기도 부천시 상동로 874141190126.75241637.505231경기032-324-2255http://beautifulworld.co.kr/2020-12-31
94외국인의료시설우제영내과경기도 부천시 상동로 874141190126.75241637.505231경기032-322-8119<NA>2020-12-31
95외국인의료시설하이맨비뇨기과의원 부천점경기도 부천시 상동로 874141190126.75241637.505231경기032-329-7580http://www.highmanbch.com/2020-12-31
96외국인의료시설엘드림병원경기도 부천시 소사구 경인로 174141190126.75683437.483634경기032-612-8575http://www.eldreamhospital.com/2020-12-31
97외국인의료시설베스티안의료재단경기도 부천시 소사구 송내동 577-2번지4141190126.77220337.483453기타032-611-9933<NA>2020-12-31
98외국인의료시설가톨릭대학교부천성모병원경기도 부천시 소사로 3274141190126.79286937.487153경기1577-0675https://www.cmcbucheon.or.kr/page/main2020-12-31
99외국인의료시설양연모성형외과의원경기도 부천시 신흥로 2284141190126.77634437.504093경기032-325-3536http://prsyang.com/2020-12-31

Duplicate rows

Most frequently occurring

se_nmentrp_nmload_addrcity_do_cdcity_gn_gu_cdxpos_loypos_laarea_nmtel_nohomepage_urlbase_ymd# duplicates
0외국인의료시설센트럴성형외과의원경기도 고양시 일산동구 정발산로 43 -20 센트럴프라자4141285126.77339637.658407경기031-901-5575http://www.centralps.co.kr/2020-12-312
1외국인의료시설스타필의원경기도 고양시 덕양구 호국로789번길 64141281126.83609137.656997경기031-965-2255http://starfeel.kr/2020-12-312