Overview

Dataset statistics

Number of variables5
Number of observations45
Missing cells20
Missing cells (%)8.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.9 KiB
Average record size in memory43.9 B

Variable types

Text3
Numeric1
Categorical1

Dataset

Description반주에 맞추어 노래를 부를 수 있는 반주장치 등의 시설을 갖추어 제공하는 업소인 노래연습장 현황, 업소명, 주소, 허가일에 대한 정보를 제공합니다.
Author경기도 화성시
URLhttps://www.data.go.kr/data/15045454/fileData.do

Alerts

상호 has 1 (2.2%) missing valuesMissing
우편번호 has 15 (33.3%) missing valuesMissing
영업소소재지(지번) has 1 (2.2%) missing valuesMissing
영업소소재지(도로명) has 3 (6.7%) missing valuesMissing

Reproduction

Analysis started2023-12-11 23:26:52.001011
Analysis finished2023-12-11 23:26:52.711387
Duration0.71 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

상호
Text

MISSING 

Distinct43
Distinct (%)97.7%
Missing1
Missing (%)2.2%
Memory size492.0 B
2023-12-12T08:26:52.852904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length9
Mean length6.7272727
Min length3

Characters and Unicode

Total characters296
Distinct characters101
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique42 ?
Unique (%)95.5%

Sample

1st row병점게임장
2nd row삼성게임장
3rd row아라비안성인게임장
4th row황금포커성골드게임장
5th row성인센타게임장
ValueCountFrequency (%)
게임랜드 5
 
9.8%
롯데게임랜드 2
 
3.9%
에이스 1
 
2.0%
jo게임랜드 1
 
2.0%
조이폴리스게임센터 1
 
2.0%
시즌아이pc방화성본점 1
 
2.0%
캡틴게임장 1
 
2.0%
발안게임장 1
 
2.0%
황금돼지게임랜드 1
 
2.0%
플러스게임장 1
 
2.0%
Other values (36) 36
70.6%
2023-12-12T08:26:53.184775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
38
 
12.8%
38
 
12.8%
26
 
8.8%
23
 
7.8%
13
 
4.4%
9
 
3.0%
7
 
2.4%
7
 
2.4%
5
 
1.7%
5
 
1.7%
Other values (91) 125
42.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 277
93.6%
Space Separator 7
 
2.4%
Lowercase Letter 6
 
2.0%
Uppercase Letter 5
 
1.7%
Decimal Number 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
38
 
13.7%
38
 
13.7%
26
 
9.4%
23
 
8.3%
13
 
4.7%
9
 
3.2%
7
 
2.5%
5
 
1.8%
5
 
1.8%
3
 
1.1%
Other values (80) 110
39.7%
Uppercase Letter
ValueCountFrequency (%)
M 1
20.0%
C 1
20.0%
P 1
20.0%
J 1
20.0%
O 1
20.0%
Lowercase Letter
ValueCountFrequency (%)
e 2
33.3%
t 2
33.3%
a 1
16.7%
n 1
16.7%
Space Separator
ValueCountFrequency (%)
7
100.0%
Decimal Number
ValueCountFrequency (%)
2 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 277
93.6%
Latin 11
 
3.7%
Common 8
 
2.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
38
 
13.7%
38
 
13.7%
26
 
9.4%
23
 
8.3%
13
 
4.7%
9
 
3.2%
7
 
2.5%
5
 
1.8%
5
 
1.8%
3
 
1.1%
Other values (80) 110
39.7%
Latin
ValueCountFrequency (%)
e 2
18.2%
t 2
18.2%
M 1
9.1%
C 1
9.1%
P 1
9.1%
a 1
9.1%
n 1
9.1%
J 1
9.1%
O 1
9.1%
Common
ValueCountFrequency (%)
7
87.5%
2 1
 
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 277
93.6%
ASCII 19
 
6.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
38
 
13.7%
38
 
13.7%
26
 
9.4%
23
 
8.3%
13
 
4.7%
9
 
3.2%
7
 
2.5%
5
 
1.8%
5
 
1.8%
3
 
1.1%
Other values (80) 110
39.7%
ASCII
ValueCountFrequency (%)
7
36.8%
e 2
 
10.5%
t 2
 
10.5%
2 1
 
5.3%
M 1
 
5.3%
C 1
 
5.3%
P 1
 
5.3%
a 1
 
5.3%
n 1
 
5.3%
J 1
 
5.3%

우편번호
Real number (ℝ)

MISSING 

Distinct16
Distinct (%)53.3%
Missing15
Missing (%)33.3%
Infinite0
Infinite (%)0.0%
Mean18448.933
Minimum18260
Maximum18594
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size537.0 B
2023-12-12T08:26:53.305048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum18260
5-th percentile18271
Q118405
median18417.5
Q318588
95-th percentile18593
Maximum18594
Range334
Interquartile range (IQR)183

Descriptive statistics

Standard deviation115.15834
Coefficient of variation (CV)0.0062420054
Kurtosis-1.115646
Mean18448.933
Median Absolute Deviation (MAD)110
Skewness-0.077084827
Sum553468
Variance13261.444
MonotonicityNot monotonic
2023-12-12T08:26:53.428473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=16)
ValueCountFrequency (%)
18593 7
15.6%
18412 4
 
8.9%
18405 3
 
6.7%
18271 3
 
6.7%
18455 2
 
4.4%
18423 1
 
2.2%
18567 1
 
2.2%
18398 1
 
2.2%
18453 1
 
2.2%
18442 1
 
2.2%
Other values (6) 6
 
13.3%
(Missing) 15
33.3%
ValueCountFrequency (%)
18260 1
 
2.2%
18271 3
6.7%
18278 1
 
2.2%
18337 1
 
2.2%
18398 1
 
2.2%
18405 3
6.7%
18406 1
 
2.2%
18412 4
8.9%
18423 1
 
2.2%
18442 1
 
2.2%
ValueCountFrequency (%)
18594 1
 
2.2%
18593 7
15.6%
18573 1
 
2.2%
18567 1
 
2.2%
18455 2
 
4.4%
18453 1
 
2.2%
18442 1
 
2.2%
18423 1
 
2.2%
18412 4
8.9%
18406 1
 
2.2%
Distinct42
Distinct (%)95.5%
Missing1
Missing (%)2.2%
Memory size492.0 B
2023-12-12T08:26:53.692626image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length37
Median length32
Mean length27.909091
Min length12

Characters and Unicode

Total characters1228
Distinct characters91
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique40 ?
Unique (%)90.9%

Sample

1st row경기도 화성시 병점동 386-12번지 101,102호
2nd row경기도 화성시 병점동 381-9번지 외 3필지 201호
3rd row경기도 화성시 남양읍 남양리 2076-13번지 1층
4th row경기도 화성시 향남읍 평리 86-1번지 2층
5th row경기도 화성시 반송동 107-1번지 센타프라자 207,208호
ValueCountFrequency (%)
경기도 44
 
17.1%
화성시 44
 
17.1%
병점동 11
 
4.3%
향남읍 8
 
3.1%
반송동 7
 
2.7%
평리 7
 
2.7%
남양읍 5
 
1.9%
남양리 4
 
1.6%
4
 
1.6%
844-1번지 4
 
1.6%
Other values (94) 119
46.3%
2023-12-12T08:26:54.099961image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
235
19.1%
48
 
3.9%
1 47
 
3.8%
46
 
3.7%
46
 
3.7%
2 46
 
3.7%
46
 
3.7%
44
 
3.6%
44
 
3.6%
44
 
3.6%
Other values (81) 582
47.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 682
55.5%
Decimal Number 260
 
21.2%
Space Separator 235
 
19.1%
Dash Punctuation 37
 
3.0%
Uppercase Letter 8
 
0.7%
Other Punctuation 6
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
48
 
7.0%
46
 
6.7%
46
 
6.7%
46
 
6.7%
44
 
6.5%
44
 
6.5%
44
 
6.5%
44
 
6.5%
31
 
4.5%
23
 
3.4%
Other values (63) 266
39.0%
Decimal Number
ValueCountFrequency (%)
1 47
18.1%
2 46
17.7%
0 34
13.1%
8 25
9.6%
3 25
9.6%
4 23
8.8%
6 19
7.3%
7 18
 
6.9%
5 17
 
6.5%
9 6
 
2.3%
Uppercase Letter
ValueCountFrequency (%)
B 3
37.5%
A 2
25.0%
J 1
 
12.5%
O 1
 
12.5%
S 1
 
12.5%
Space Separator
ValueCountFrequency (%)
235
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 37
100.0%
Other Punctuation
ValueCountFrequency (%)
, 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 682
55.5%
Common 538
43.8%
Latin 8
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
48
 
7.0%
46
 
6.7%
46
 
6.7%
46
 
6.7%
44
 
6.5%
44
 
6.5%
44
 
6.5%
44
 
6.5%
31
 
4.5%
23
 
3.4%
Other values (63) 266
39.0%
Common
ValueCountFrequency (%)
235
43.7%
1 47
 
8.7%
2 46
 
8.6%
- 37
 
6.9%
0 34
 
6.3%
8 25
 
4.6%
3 25
 
4.6%
4 23
 
4.3%
6 19
 
3.5%
7 18
 
3.3%
Other values (3) 29
 
5.4%
Latin
ValueCountFrequency (%)
B 3
37.5%
A 2
25.0%
J 1
 
12.5%
O 1
 
12.5%
S 1
 
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 682
55.5%
ASCII 546
44.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
235
43.0%
1 47
 
8.6%
2 46
 
8.4%
- 37
 
6.8%
0 34
 
6.2%
8 25
 
4.6%
3 25
 
4.6%
4 23
 
4.2%
6 19
 
3.5%
7 18
 
3.3%
Other values (8) 37
 
6.8%
Hangul
ValueCountFrequency (%)
48
 
7.0%
46
 
6.7%
46
 
6.7%
46
 
6.7%
44
 
6.5%
44
 
6.5%
44
 
6.5%
44
 
6.5%
31
 
4.5%
23
 
3.4%
Other values (63) 266
39.0%
Distinct41
Distinct (%)97.6%
Missing3
Missing (%)6.7%
Memory size492.0 B
2023-12-12T08:26:54.375446image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length41
Median length36.5
Mean length29.857143
Min length19

Characters and Unicode

Total characters1254
Distinct characters117
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique40 ?
Unique (%)95.2%

Sample

1st row경기도 화성시 떡전골로 82 (병점동)
2nd row경기도 화성시 떡전골로 86, 201호 (병점동, 병점하나로빌딩)
3rd row경기도 화성시 남양읍 역골로 9-14 (1층)
4th row경기도 화성시 향남읍 3.1만세로 1114, 2층
5th row경기도 화성시 동탄반석로 124 (반송동, 센타프라자 207,208호)
ValueCountFrequency (%)
경기도 42
 
16.5%
화성시 42
 
16.5%
향남읍 8
 
3.1%
효행로 8
 
3.1%
병점동 8
 
3.1%
떡전골로 5
 
2.0%
3.1만세로 5
 
2.0%
남양읍 5
 
2.0%
2층 4
 
1.6%
반송동 4
 
1.6%
Other values (104) 124
48.6%
2023-12-12T08:26:54.923558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
220
 
17.5%
1 55
 
4.4%
45
 
3.6%
45
 
3.6%
, 43
 
3.4%
43
 
3.4%
42
 
3.3%
42
 
3.3%
42
 
3.3%
2 41
 
3.3%
Other values (107) 636
50.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 672
53.6%
Decimal Number 237
 
18.9%
Space Separator 220
 
17.5%
Other Punctuation 48
 
3.8%
Close Punctuation 26
 
2.1%
Open Punctuation 26
 
2.1%
Dash Punctuation 18
 
1.4%
Uppercase Letter 7
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
45
 
6.7%
45
 
6.7%
43
 
6.4%
42
 
6.2%
42
 
6.2%
42
 
6.2%
37
 
5.5%
35
 
5.2%
21
 
3.1%
18
 
2.7%
Other values (86) 302
44.9%
Decimal Number
ValueCountFrequency (%)
1 55
23.2%
2 41
17.3%
0 32
13.5%
3 24
10.1%
5 20
 
8.4%
4 19
 
8.0%
6 15
 
6.3%
9 13
 
5.5%
7 10
 
4.2%
8 8
 
3.4%
Uppercase Letter
ValueCountFrequency (%)
A 2
28.6%
B 2
28.6%
J 1
14.3%
O 1
14.3%
S 1
14.3%
Other Punctuation
ValueCountFrequency (%)
, 43
89.6%
. 5
 
10.4%
Space Separator
ValueCountFrequency (%)
220
100.0%
Close Punctuation
ValueCountFrequency (%)
) 26
100.0%
Open Punctuation
ValueCountFrequency (%)
( 26
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 18
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 672
53.6%
Common 575
45.9%
Latin 7
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
45
 
6.7%
45
 
6.7%
43
 
6.4%
42
 
6.2%
42
 
6.2%
42
 
6.2%
37
 
5.5%
35
 
5.2%
21
 
3.1%
18
 
2.7%
Other values (86) 302
44.9%
Common
ValueCountFrequency (%)
220
38.3%
1 55
 
9.6%
, 43
 
7.5%
2 41
 
7.1%
0 32
 
5.6%
) 26
 
4.5%
( 26
 
4.5%
3 24
 
4.2%
5 20
 
3.5%
4 19
 
3.3%
Other values (6) 69
 
12.0%
Latin
ValueCountFrequency (%)
A 2
28.6%
B 2
28.6%
J 1
14.3%
O 1
14.3%
S 1
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 672
53.6%
ASCII 582
46.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
220
37.8%
1 55
 
9.5%
, 43
 
7.4%
2 41
 
7.0%
0 32
 
5.5%
) 26
 
4.5%
( 26
 
4.5%
3 24
 
4.1%
5 20
 
3.4%
4 19
 
3.3%
Other values (11) 76
 
13.1%
Hangul
ValueCountFrequency (%)
45
 
6.7%
45
 
6.7%
43
 
6.4%
42
 
6.2%
42
 
6.2%
42
 
6.2%
37
 
5.5%
35
 
5.2%
21
 
3.1%
18
 
2.7%
Other values (86) 302
44.9%

업종명
Categorical

Distinct3
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Memory size492.0 B
청소년게임제공업
32 
일반게임제공업
12 
<NA>
 
1

Length

Max length8
Median length8
Mean length7.6444444
Min length4

Unique

Unique1 ?
Unique (%)2.2%

Sample

1st row일반게임제공업
2nd row일반게임제공업
3rd row일반게임제공업
4th row일반게임제공업
5th row일반게임제공업

Common Values

ValueCountFrequency (%)
청소년게임제공업 32
71.1%
일반게임제공업 12
 
26.7%
<NA> 1
 
2.2%

Length

2023-12-12T08:26:55.122093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:26:55.276282image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
청소년게임제공업 32
71.1%
일반게임제공업 12
 
26.7%
na 1
 
2.2%

Interactions

2023-12-12T08:26:52.351387image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T08:26:55.353135image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
상호우편번호영업소소재지(지번)영업소소재지(도로명)업종명
상호1.0001.0000.9860.9950.000
우편번호1.0001.0001.0001.0000.000
영업소소재지(지번)0.9861.0001.0001.0000.000
영업소소재지(도로명)0.9951.0001.0001.0000.000
업종명0.0000.0000.0000.0001.000
2023-12-12T08:26:55.536159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
우편번호업종명
우편번호1.0000.000
업종명0.0001.000

Missing values

2023-12-12T08:26:52.463327image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T08:26:52.557564image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T08:26:52.646333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

상호우편번호영업소소재지(지번)영업소소재지(도로명)업종명
0병점게임장18412경기도 화성시 병점동 386-12번지 101,102호경기도 화성시 떡전골로 82 (병점동)일반게임제공업
1삼성게임장18412경기도 화성시 병점동 381-9번지 외 3필지 201호경기도 화성시 떡전골로 86, 201호 (병점동, 병점하나로빌딩)일반게임제공업
2아라비안성인게임장18271경기도 화성시 남양읍 남양리 2076-13번지 1층경기도 화성시 남양읍 역골로 9-14 (1층)일반게임제공업
3황금포커성골드게임장18593경기도 화성시 향남읍 평리 86-1번지 2층경기도 화성시 향남읍 3.1만세로 1114, 2층일반게임제공업
4성인센타게임장18455경기도 화성시 반송동 107-1번지 센타프라자 207,208호경기도 화성시 동탄반석로 124 (반송동, 센타프라자 207,208호)일반게임제공업
5왕대박게임랜드18593경기도 화성시 향남읍 평리 85-5번지경기도 화성시 향남읍 3.1만세로 1112-5일반게임제공업
6용게임랜드18271경기도 화성시 남양읍 남양리 2077-4번지 외 1번지 103호경기도 화성시 남양읍 역골로 9-13일반게임제공업
7역전월드게임장18412경기도 화성시 병점동 382-2번지 미라클프라자 205호경기도 화성시 떡전골로 96-4, 205호 (병점동, 미라클프라자)일반게임제공업
8롯데게임랜드18405경기도 화성시 병점동 844-1번지 씨네샤르망 B동 207호경기도 화성시 효행로 1052, B동 207호 (병점동, 씨네샤르망)일반게임제공업
9서울랜드 성인게임장18423경기도 화성시 능동 1064-5번지 209호경기도 화성시 동탄원천로 354-28, 209호 (능동, 이너매스)일반게임제공업
상호우편번호영업소소재지(지번)영업소소재지(도로명)업종명
35행운게임랜드18573경기도 화성시 우정읍 이화리 436-6번지 외 2필지경기도 화성시 우정읍 남양만로 650청소년게임제공업
36원게임장18593경기도 화성시 향남읍 평리 115-5번지 2층경기도 화성시 향남읍 평6길 62, 2층청소년게임제공업
37신세계게임장18406경기도 화성시 병점동 847번지경기도 화성시 효행로 1076-7, 305호 (병점동, 한마음프라자)청소년게임제공업
38게임바18278경기도 화성시 남양읍 활초리 4-21번지경기도 화성시 남양읍 시청로 290, 지하1층청소년게임제공업
39콜롬버스 게임랜드18337경기도 화성시 기안동 338-3번지 122호경기도 화성시 효행로 241, 1층 122호 (기안동)청소년게임제공업
40코인싱어18405경기도 화성시 병점동 844-1번지 시네마샤르망A동 306호경기도 화성시 효행로 1052, A동 306호 (병점동, 시네마샤르망)청소년게임제공업
41케이팝코인노래방18455경기도 화성시 반송동 104-2번지 한솔프라자 602호경기도 화성시 동탄중심상가1길 35, 602호 (반송동, 한솔프라자)청소년게임제공업
42골든드래곤게임랜드18593경기도 화성시 향남읍 평리 90-7번지 외 1필지, 2층경기도 화성시 향남읍 3.1만세로 1109-9, 2층청소년게임제공업
43중국오락실18594경기도 화성시 향남읍 발안리 118-3번지경기도 화성시 향남읍 발안서로42번길 11-2청소년게임제공업
44<NA><NA><NA><NA><NA>