Overview

Dataset statistics

Number of variables: 4
Number of observations: 54
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.9 KiB
Average record size in memory: 36.4 B

Variable types

Numeric: 2
Text: 2

Dataset

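Description: Licensing and permit information for bathhouse businesses within Yongin City (용인시).
Author: Yongin City, Gyeonggi Province (경기도 용인시)
URL: https://www.data.go.kr/data/3072096/fileData.do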

Alerts

번호 (serial number) has unique values (Unique)
업소명 (business name) has unique values (Unique)
업소소재지(도로명) (business location, road-name address) has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 09:47:16.669179
Analysis finished: 2023-12-12 09:47:17.444621
Duration: 0.78 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
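
A report like this one can in principle be regenerated from the source CSV. The following is a minimal sketch, not the configuration actually used here (that lives in config.json). The function name `build_report`, the default title, the file name in the usage note, and the `cp949` encoding are all assumptions for illustration; only `pandas.read_csv`, `ProfileReport`, and `to_file` are real library calls.

```python
def build_report(csv_path, title="Yongin bathhouse licensing profile", encoding="cp949"):
    """Build a ydata-profiling report for a CSV file.

    The imports are local so that defining this helper does not require
    pandas or ydata-profiling to be installed.
    """
    import pandas as pd
    from ydata_profiling import ProfileReport

    # The encoding is an assumption; try "utf-8" if decoding fails.
    df = pd.read_csv(csv_path, encoding=encoding)
    return ProfileReport(df, title=title)
```

Usage, assuming the file downloaded from the URL above was saved under the placeholder name `yongin_bathhouses.csv`: `build_report("yongin_bathhouses.csv").to_file("report.html")`.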

Variables

번호 (serial number)
Real number (ℝ)

UNIQUE 

Distinct: 54
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 27.5
Minimum: 1
Maximum: 54
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 618.0 B

Quantile statistics

Minimum: 1
5-th percentile: 3.65
Q1: 14.25
Median: 27.5
Q3: 40.75
95-th percentile: 51.35
Maximum: 54
Range: 53
Interquartile range (IQR): 26.5

Descriptive statistics

Standard deviation: 15.732133
Coefficient of variation (CV): 0.57207755
Kurtosis: -1.2
Mean: 27.5
Median Absolute Deviation (MAD): 13.5
Skewness: 0
Sum: 1485
Variance: 247.5
Monotonicity: Strictly increasing
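
The figures above are what one would expect if 번호 is simply the sequence 1, 2, ..., 54 (54 distinct values, minimum 1, maximum 54, sum 1485, strictly increasing). The sketch below recomputes them with the Python standard library under that assumption. The `quantile` helper is illustrative and uses linear interpolation between order statistics, which is the NumPy and pandas default.

```python
import math
import statistics

# Assumption: 번호 is exactly the integers 1..54.
values = list(range(1, 55))


def quantile(sorted_values, q):
    """Quantile by linear interpolation between neighbouring order statistics."""
    pos = (len(sorted_values) - 1) * q
    lower = math.floor(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    return sorted_values[lower] + (pos - lower) * (sorted_values[upper] - sorted_values[lower])


total = sum(values)
mean = statistics.mean(values)
median = statistics.median(values)
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
std = math.sqrt(variance)
cv = std / mean
mad = statistics.median(abs(v - median) for v in values)

print("sum:", total, "mean:", mean, "variance:", variance)
print("std:", round(std, 6), "cv:", round(cv, 8), "mad:", mad)
for q in (0.05, 0.25, 0.75, 0.95):
    print(f"quantile {q}:", round(quantile(values, q), 2))
```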
Histogram with fixed size bins (bins=50)
Common values

Value | Count | Frequency (%)
1 | 1 | 1.9%
42 | 1 | 1.9%
31 | 1 | 1.9%
32 | 1 | 1.9%
33 | 1 | 1.9%
34 | 1 | 1.9%
35 | 1 | 1.9%
36 | 1 | 1.9%
37 | 1 | 1.9%
38 | 1 | 1.9%
Other values (44) | 44 | 81.5%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 1.9%
2 | 1 | 1.9%
3 | 1 | 1.9%
4 | 1 | 1.9%
5 | 1 | 1.9%
6 | 1 | 1.9%
7 | 1 | 1.9%
8 | 1 | 1.9%
9 | 1 | 1.9%
10 | 1 | 1.9%

Maximum 10 values

Value | Count | Frequency (%)
54 | 1 | 1.9%
53 | 1 | 1.9%
52 | 1 | 1.9%
51 | 1 | 1.9%
50 | 1 | 1.9%
49 | 1 | 1.9%
48 | 1 | 1.9%
47 | 1 | 1.9%
46 | 1 | 1.9%
45 | 1 | 1.9%

업소명 (business name)
Text

UNIQUE 

Distinct: 54
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 564.0 B

Length

Max length: 15
Median length: 13
Mean length: 6.8703704
Min length: 3

Characters and Unicode

Total characters: 371
Distinct characters: 125
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
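
As an illustration of the character properties mentioned above, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. It is not the profiler's own implementation. The `CATEGORY_NAMES` mapping and the `category_counts` helper are illustrative names, and the input strings are business names that appear in this report. Script and block properties are not exposed by `unicodedata`, so only categories are shown.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that appear in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
}


def category_counts(texts):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for text in texts:
        for ch in text:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


sample_names = ["용인탕", "용인한증막", "백암온천탕", "삼성탕", "황토보석사우나"]
sample_counts = category_counts(sample_names)
mixed_counts = category_counts(["주)동백사우나24"])
print(dict(sample_counts))
print(dict(mixed_counts))
```

Hangul syllables fall under Other Letter, which is why that category dominates the tables for this variable.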

Unique

Unique: 54
Unique (%): 100.0%

Sample

1st row: 용인탕
2nd row: 용인한증막
3rd row: 백암온천탕
4th row: 삼성탕
5th row: 황토보석사우나
Value | Count | Frequency (%)
사우나 | 2 | 3.3%
용인탕 | 1 | 1.6%
월드사우나 | 1 | 1.6%
스타스파 | 1 | 1.6%
신봉스파랜드 | 1 | 1.6%
주)케이아이엔통상-용인랜드 | 1 | 1.6%
민속건강목욕탕 | 1 | 1.6%
죽전대중탕 | 1 | 1.6%
주)동백사우나24 | 1 | 1.6%
오로라스파랜드 | 1 | 1.6%
Other values (50) | 50 | 82.0%

Most occurring characters

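Value | Count | Frequency (%)
(not preserved) | 22 | 5.9%
(not preserved) | 19 | 5.1%
(not preserved) | 18 | 4.9%
(not preserved) | 18 | 4.9%
(not preserved) | 15 | 4.0%
(not preserved) | 9 | 2.4%
(not preserved) | 9 | 2.4%
(not preserved) | 8 | 2.2%
(not preserved) | 7 | 1.9%
(not preserved) | 7 | 1.9%
Other values (115) | 239 | 64.4%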

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 348 | 93.8%
Decimal Number | 8 | 2.2%
Space Separator | 7 | 1.9%
Close Punctuation | 2 | 0.5%
Open Punctuation | 2 | 0.5%
Uppercase Letter | 2 | 0.5%
Other Punctuation | 1 | 0.3%
Dash Punctuation | 1 | 0.3%

Most frequent character per category

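Other Letter
Value | Count | Frequency (%)
(not preserved) | 22 | 6.3%
(not preserved) | 19 | 5.5%
(not preserved) | 18 | 5.2%
(not preserved) | 18 | 5.2%
(not preserved) | 15 | 4.3%
(not preserved) | 9 | 2.6%
(not preserved) | 9 | 2.6%
(not preserved) | 8 | 2.3%
(not preserved) | 7 | 2.0%
(not preserved) | 6 | 1.7%
Other values (106) | 217 | 62.4%

Decimal Number
Value | Count | Frequency (%)
2 | 4 | 50.0%
4 | 4 | 50.0%

Uppercase Letter
Value | Count | Frequency (%)
G | 1 | 50.0%
B | 1 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 7 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 2 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 2 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
& | 1 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 1 | 100.0%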

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 348 | 93.8%
Common | 21 | 5.7%
Latin | 2 | 0.5%

Most frequent character per script

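Hangul
Value | Count | Frequency (%)
(not preserved) | 22 | 6.3%
(not preserved) | 19 | 5.5%
(not preserved) | 18 | 5.2%
(not preserved) | 18 | 5.2%
(not preserved) | 15 | 4.3%
(not preserved) | 9 | 2.6%
(not preserved) | 9 | 2.6%
(not preserved) | 8 | 2.3%
(not preserved) | 7 | 2.0%
(not preserved) | 6 | 1.7%
Other values (106) | 217 | 62.4%

Common
Value | Count | Frequency (%)
(space) | 7 | 33.3%
2 | 4 | 19.0%
4 | 4 | 19.0%
) | 2 | 9.5%
( | 2 | 9.5%
& | 1 | 4.8%
- | 1 | 4.8%

Latin
Value | Count | Frequency (%)
G | 1 | 50.0%
B | 1 | 50.0%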

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 348 | 93.8%
ASCII | 23 | 6.2%

Most frequent character per block

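Hangul
Value | Count | Frequency (%)
(not preserved) | 22 | 6.3%
(not preserved) | 19 | 5.5%
(not preserved) | 18 | 5.2%
(not preserved) | 18 | 5.2%
(not preserved) | 15 | 4.3%
(not preserved) | 9 | 2.6%
(not preserved) | 9 | 2.6%
(not preserved) | 8 | 2.3%
(not preserved) | 7 | 2.0%
(not preserved) | 6 | 1.7%
Other values (106) | 217 | 62.4%

ASCII
Value | Count | Frequency (%)
(space) | 7 | 30.4%
2 | 4 | 17.4%
4 | 4 | 17.4%
) | 2 | 8.7%
( | 2 | 8.7%
G | 1 | 4.3%
B | 1 | 4.3%
& | 1 | 4.3%
- | 1 | 4.3%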

업소소재지(도로명) (business location, road-name address)
Text

UNIQUE

Distinct: 54
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 564.0 B

Length

Max length: 54
Median length: 43.5
Mean length: 33.203704
Min length: 23

Characters and Unicode

Total characters: 1793
Distinct characters: 136
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 54
Unique (%): 100.0%

Sample

1st row: 경기도 용인시 처인구 금령로99번길 6-3 (김량장동)
2nd row: 경기도 용인시 처인구 동부로6번길 3 (마평동)
3rd row: 경기도 용인시 처인구 백암면 백암로 219-3
4th row: 경기도 용인시 처인구 금령로 9 (김량장동)
5th row: 경기도 용인시 기흥구 신갈로 79 (신갈동)
Value | Count | Frequency (%)
경기도 | 54 | 14.8%
용인시 | 54 | 14.8%
처인구 | 22 | 6.0%
기흥구 | 16 | 4.4%
수지구 | 16 | 4.4%
김량장동 | 5 | 1.4%
양지면 | 4 | 1.1%
8 | 3 | 0.8%
2 | 3 | 0.8%
구갈동 | 3 | 0.8%
Other values (165) | 186 | 50.8%
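
The word counts above look like whitespace tokens with surrounding punctuation stripped (김량장동 appears without its parentheses). The sketch below applies that rule to the five sample addresses from this section. The exact tokenisation used by the profiler is an assumption, and `tokenize` is an illustrative helper.

```python
from collections import Counter

# The five sample rows listed for this variable.
addresses = [
    "경기도 용인시 처인구 금령로99번길 6-3 (김량장동)",
    "경기도 용인시 처인구 동부로6번길 3 (마평동)",
    "경기도 용인시 처인구 백암면 백암로 219-3",
    "경기도 용인시 처인구 금령로 9 (김량장동)",
    "경기도 용인시 기흥구 신갈로 79 (신갈동)",
]


def tokenize(text):
    """Split on whitespace and strip parentheses and commas from each token."""
    tokens = (token.strip("(),") for token in text.split())
    return [token for token in tokens if token]


token_counts = Counter(token for address in addresses for token in tokenize(address))
print(token_counts.most_common(5))
```

Because every address starts with the province, city, and district, the third token of each row gives the district (처인구, 기흥구, or 수지구), whose counts of 22, 16, and 16 in the table add up to the 54 rows.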

Most occurring characters

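Value | Count | Frequency (%)
(space) | 325 | 18.1%
(not preserved) | 77 | 4.3%
1 | 72 | 4.0%
(not preserved) | 71 | 4.0%
(not preserved) | 63 | 3.5%
(not preserved) | 57 | 3.2%
(not preserved) | 56 | 3.1%
(not preserved) | 55 | 3.1%
(not preserved) | 54 | 3.0%
(not preserved) | 54 | 3.0%
Other values (126) | 909 | 50.7%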

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1014 | 56.6%
Space Separator | 325 | 18.1%
Decimal Number | 278 | 15.5%
Close Punctuation | 50 | 2.8%
Open Punctuation | 50 | 2.8%
Other Punctuation | 42 | 2.3%
Dash Punctuation | 18 | 1.0%
Uppercase Letter | 16 | 0.9%
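
The percentages in this table are each category count divided by the variable's total of 1793 characters, and the mean length is that same total divided by the 54 rows. A short check using only numbers from this report (the variable names are illustrative):

```python
# Category counts copied from the table above.
category_counts = {
    "Other Letter": 1014,
    "Space Separator": 325,
    "Decimal Number": 278,
    "Close Punctuation": 50,
    "Open Punctuation": 50,
    "Other Punctuation": 42,
    "Dash Punctuation": 18,
    "Uppercase Letter": 16,
}
n_rows = 54

total_characters = sum(category_counts.values())
shares = {name: round(100 * count / total_characters, 1)
          for name, count in category_counts.items()}
mean_length = total_characters / n_rows

print("total characters:", total_characters)
print("shares (%):", shares)
print("mean length:", round(mean_length, 6))
```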

Most frequent character per category

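Other Letter
Value | Count | Frequency (%)
(not preserved) | 77 | 7.6%
(not preserved) | 71 | 7.0%
(not preserved) | 63 | 6.2%
(not preserved) | 57 | 5.6%
(not preserved) | 56 | 5.5%
(not preserved) | 55 | 5.4%
(not preserved) | 54 | 5.3%
(not preserved) | 54 | 5.3%
(not preserved) | 48 | 4.7%
(not preserved) | 47 | 4.6%
Other values (107) | 432 | 42.6%

Decimal Number
Value | Count | Frequency (%)
1 | 72 | 25.9%
2 | 40 | 14.4%
0 | 37 | 13.3%
3 | 25 | 9.0%
7 | 24 | 8.6%
8 | 18 | 6.5%
5 | 18 | 6.5%
6 | 15 | 5.4%
4 | 15 | 5.4%
9 | 14 | 5.0%

Uppercase Letter
Value | Count | Frequency (%)
B | 14 | 87.5%
Y | 1 | 6.2%
K | 1 | 6.2%

Other Punctuation
Value | Count | Frequency (%)
, | 41 | 97.6%
. | 1 | 2.4%

Space Separator
Value | Count | Frequency (%)
(space) | 325 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 50 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 50 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 18 | 100.0%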

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1014 | 56.6%
Common | 763 | 42.6%
Latin | 16 | 0.9%

Most frequent character per script

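Hangul
Value | Count | Frequency (%)
(not preserved) | 77 | 7.6%
(not preserved) | 71 | 7.0%
(not preserved) | 63 | 6.2%
(not preserved) | 57 | 5.6%
(not preserved) | 56 | 5.5%
(not preserved) | 55 | 5.4%
(not preserved) | 54 | 5.3%
(not preserved) | 54 | 5.3%
(not preserved) | 48 | 4.7%
(not preserved) | 47 | 4.6%
Other values (107) | 432 | 42.6%

Common
Value | Count | Frequency (%)
(space) | 325 | 42.6%
1 | 72 | 9.4%
) | 50 | 6.6%
( | 50 | 6.6%
, | 41 | 5.4%
2 | 40 | 5.2%
0 | 37 | 4.8%
3 | 25 | 3.3%
7 | 24 | 3.1%
- | 18 | 2.4%
Other values (6) | 81 | 10.6%

Latin
Value | Count | Frequency (%)
B | 14 | 87.5%
Y | 1 | 6.2%
K | 1 | 6.2%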

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1014 | 56.6%
ASCII | 779 | 43.4%

Most frequent character per block

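ASCII
Value | Count | Frequency (%)
(space) | 325 | 41.7%
1 | 72 | 9.2%
) | 50 | 6.4%
( | 50 | 6.4%
, | 41 | 5.3%
2 | 40 | 5.1%
0 | 37 | 4.7%
3 | 25 | 3.2%
7 | 24 | 3.1%
- | 18 | 2.3%
Other values (9) | 97 | 12.5%

Hangul
Value | Count | Frequency (%)
(not preserved) | 77 | 7.6%
(not preserved) | 71 | 7.0%
(not preserved) | 63 | 6.2%
(not preserved) | 57 | 5.6%
(not preserved) | 56 | 5.5%
(not preserved) | 55 | 5.4%
(not preserved) | 54 | 5.3%
(not preserved) | 54 | 5.3%
(not preserved) | 48 | 4.7%
(not preserved) | 47 | 4.6%
Other values (107) | 432 | 42.6%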

영업장면적 (business floor area)
Real number (ℝ)

Distinct: 53
Distinct (%): 98.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1731.7185
Minimum: 131
Maximum: 6809.47
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 618.0 B

Quantile statistics

Minimum: 131
5-th percentile: 241.25
Q1: 637.645
Median: 1434.03
Q3: 2466.2325
95-th percentile: 4119.9495
Maximum: 6809.47
Range: 6678.47
Interquartile range (IQR): 1828.5875
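
The range and IQR follow directly from the quantiles above. As a further illustration, the sketch below applies the common 1.5 × IQR rule (Tukey fences) to the ten largest values listed for this variable. The report itself does not flag outliers, so the rule and the names `lower_fence` and `upper_fence` are assumptions made for the example.

```python
import math

# Quantiles copied from the report.
minimum, q1, q3, maximum = 131.0, 637.645, 2466.2325, 6809.47

value_range = maximum - minimum
iqr = q3 - q1

# Tukey fences: values beyond 1.5 * IQR from the quartiles are candidate outliers.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

# The ten largest values listed for this variable in the report.
largest_values = [6809.47, 5548.79, 4843.12, 3730.55, 3571.9,
                  3471.0, 3407.03, 3380.41, 2788.15, 2768.93]
high_outliers = [v for v in largest_values if v > upper_fence]

print("range:", round(value_range, 2), "IQR:", round(iqr, 4))
print("fences:", round(lower_fence, 5), round(upper_fence, 5))
print("values above the upper fence:", high_outliers)
```

The lower fence is negative, so no value can fall below it: the minimum is 131.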

Descriptive statistics

Standard deviation: 1415.7948
Coefficient of variation (CV): 0.81756634
Kurtosis: 2.5107129
Mean: 1731.7185
Median Absolute Deviation (MAD): 861.44
Skewness: 1.4422377
Sum: 93512.8
Variance: 2004474.8
Monotonicity: Not monotonic
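
Several of these statistics are tied together by simple identities: the mean is the sum divided by the 54 observations, the CV is the standard deviation divided by the mean, and the variance is the square of the standard deviation. A quick consistency check on the reported values, allowing for their rounding:

```python
import math

# Values copied from the report (rounded as displayed).
n = 54
total = 93512.8
mean = 1731.7185
std = 1415.7948
cv = 0.81756634
variance = 2004474.8

mean_from_sum = total / n
cv_from_parts = std / mean
variance_from_std = std ** 2

print("sum / n:", round(mean_from_sum, 4))
print("std / mean:", round(cv_from_parts, 6))
print("std ** 2:", round(variance_from_std, 1))
```

The small differences in the last digits come from the report displaying each figure to about eight significant digits.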
Histogram with fixed size bins (bins=50)
Common values

Value | Count | Frequency (%)
990.0 | 2 | 3.7%
314.0 | 1 | 1.9%
389.42 | 1 | 1.9%
864.9 | 1 | 1.9%
1860.4 | 1 | 1.9%
6809.47 | 1 | 1.9%
2329.08 | 1 | 1.9%
3730.55 | 1 | 1.9%
799.57 | 1 | 1.9%
715.39 | 1 | 1.9%
Other values (43) | 43 | 79.6%

Minimum 10 values

Value | Count | Frequency (%)
131.0 | 1 | 1.9%
198.0 | 1 | 1.9%
199.0 | 1 | 1.9%
264.0 | 1 | 1.9%
314.0 | 1 | 1.9%
330.0 | 1 | 1.9%
389.42 | 1 | 1.9%
474.04 | 1 | 1.9%
504.86 | 1 | 1.9%
514.0 | 1 | 1.9%

Maximum 10 values

Value | Count | Frequency (%)
6809.47 | 1 | 1.9%
5548.79 | 1 | 1.9%
4843.12 | 1 | 1.9%
3730.55 | 1 | 1.9%
3571.9 | 1 | 1.9%
3471.0 | 1 | 1.9%
3407.03 | 1 | 1.9%
3380.41 | 1 | 1.9%
2788.15 | 1 | 1.9%
2768.93 | 1 | 1.9%

Interactions

[Figures: pairwise interaction plots of the numeric variables 번호 and 영업장면적; images not preserved]

Correlations

Variable | 번호 | 업소명 | 업소소재지(도로명) | 영업장면적
번호 | 1.000 | 1.000 | 1.000 | 0.301
업소명 | 1.000 | 1.000 | 1.000 | 1.000
업소소재지(도로명) | 1.000 | 1.000 | 1.000 | 1.000
영업장면적 | 0.301 | 1.000 | 1.000 | 1.000
Variable | 번호 | 영업장면적
번호 | 1.000 | 0.295
영업장면적 | 0.295 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index | 번호 | 업소명 | 업소소재지(도로명) | 영업장면적
0 | 1 | 용인탕 | 경기도 용인시 처인구 금령로99번길 6-3 (김량장동) | 314.0
1 | 2 | 용인한증막 | 경기도 용인시 처인구 동부로6번길 3 (마평동) | 198.0
2 | 3 | 백암온천탕 | 경기도 용인시 처인구 백암면 백암로 219-3 | 330.0
3 | 4 | 삼성탕 | 경기도 용인시 처인구 금령로 9 (김량장동) | 514.0
4 | 5 | 황토보석사우나 | 경기도 용인시 기흥구 신갈로 79 (신갈동) | 264.0
5 | 6 | 인정대중탕 | 경기도 용인시 처인구 금어로 2 (유방동) | 770.88
6 | 7 | 신온천탕 | 경기도 용인시 기흥구 신구로22번길 28 (구갈동) | 924.0
7 | 8 | 파인스포렉스탕 | 경기도 용인시 처인구 양지면 남곡리 18번지 1호 | 2024.88
8 | 9 | 용평목욕탕 | 경기도 용인시 처인구 금령로140번길 5 (마평동) | 474.04
9 | 10 | 삼보탕 | 경기도 용인시 처인구 금령로27번길 16-2 (김량장동) | 990.0

Last rows

Index | 번호 | 업소명 | 업소소재지(도로명) | 영업장면적
44 | 45 | 용인시민체육센터 | 경기도 용인시 처인구 포곡읍 금어로 317 (시민체육시설 지하 1층) | 564.18
45 | 46 | 신봉카스사우나 | 경기도 용인시 수지구 신봉1로 84 (신봉동,신봉종합상가 지하 1층) | 2511.95
46 | 47 | 언남전통불한증막사우나 | 경기도 용인시 기흥구 구성로 103 (언남동,멀티프라자 B101,B102,B103호) | 1996.44
47 | 48 | 24시그린대중탕 | 경기도 용인시 처인구 금령로 116 (김량장동,용인타워 지하1,2,3호) | 715.0
48 | 49 | 럭키불가마사우나 | 경기도 용인시 처인구 중부대로1388번길 20-5 (김량장동) | 1944.04
49 | 50 | 에코사우나 | 경기도 용인시 기흥구 흥덕2로87번길 18 (영덕동) | 1371.91
50 | 51 | 백암다래참숯가마 | 경기도 용인시 처인구 백암면 원설로270번길 58-39 | 988.44
51 | 52 | 현대드림불한증막 | 경기도 용인시 수지구 성복2로76번길 26-3 (성복동, 드림타워 (B209, B301)) | 1496.15
52 | 53 | 어정참숯불가마사우나 | 경기도 용인시 기흥구 어정로 134-28 (중동) | 2788.15
53 | 54 | 황토한옥힐링타운 | 경기도 용인시 처인구 운학로115번길 22-9 (운학동) | 131.0