Overview

Dataset statistics

Number of variables: 5
Number of observations: 288
Missing cells: 251
Missing cells (%): 17.4%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 11.7 KiB
Average record size in memory: 41.5 B
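
The missing-cell percentage follows from the counts above: 251 missing cells out of 5 × 288 = 1,440 cells in total. A minimal Python sketch of that arithmetic, with variable names that are illustrative and not part of the report:

```python
# Figures copied from the dataset statistics above.
n_variables = 5
n_observations = 288
missing_cells = 251

total_cells = n_variables * n_observations        # every cell in the table
missing_pct = 100 * missing_cells / total_cells   # share of empty cells

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
```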

Variable types

Numeric: 1
Categorical: 1
Text: 3

Dataset

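Description: Status of game-provision businesses within Bucheon-si, Gyeonggi-do, providing the serial number, business type, business name, location (road-name address), location telephone number, and related information.
URL: https://www.data.go.kr/data/3074195/fileData.do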

Alerts

번호 is highly overall correlated with 업종명 (High correlation)
업종명 is highly overall correlated with 번호 (High correlation)
업소전화번호 has 251 (87.2%) missing values (Missing)
번호 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 05:12:21.654608
Analysis finished: 2023-12-12 05:12:22.419531
Duration: 0.76 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 288
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 144.5
Minimum: 1
Maximum: 288
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.7 KiB

Quantile statistics

Minimum: 1
5-th percentile: 15.35
Q1: 72.75
Median: 144.5
Q3: 216.25
95-th percentile: 273.65
Maximum: 288
Range: 287
Interquartile range (IQR): 143.5

Descriptive statistics

Standard deviation: 83.282651
Coefficient of variation (CV): 0.57635053
Kurtosis: -1.2
Mean: 144.5
Median Absolute Deviation (MAD): 72
Skewness: 0
Sum: 41616
Variance: 6936
Monotonicity: Strictly increasing
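
Because 번호 runs from 1 to 288 with no gaps (288 distinct values, strictly increasing), most of the figures above can be re-derived from the integer range alone. The sketch below uses only Python's standard `statistics` module. Treating the column as exactly `range(1, 289)` is an assumption drawn from the reported minimum, maximum and distinct count, and the "inclusive" quantile method is assumed to be the one that matches the report's linear interpolation.

```python
import statistics

# Assumption: 번호 is the serial number 1..288 with no gaps.
values = list(range(1, 289))

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance (n - 1 denominator)
std_dev = statistics.stdev(values)       # square root of the sample variance
cv = std_dev / mean                      # coefficient of variation

# Quartiles by linear interpolation between the closest ranks.
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")

# Median absolute deviation around the median.
mad = statistics.median(abs(v - median) for v in values)

print(total, mean, variance, std_dev, cv)
print(q1, median, q3, q3 - q1, mad)
```

Skewness and kurtosis are left out because the standard library has no function for them.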
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
1 | 1 | 0.3%
146 | 1 | 0.3%
198 | 1 | 0.3%
197 | 1 | 0.3%
196 | 1 | 0.3%
195 | 1 | 0.3%
194 | 1 | 0.3%
193 | 1 | 0.3%
192 | 1 | 0.3%
191 | 1 | 0.3%
Other values (278) | 278 | 96.5%

Value | Count | Frequency (%)
1 | 1 | 0.3%
2 | 1 | 0.3%
3 | 1 | 0.3%
4 | 1 | 0.3%
5 | 1 | 0.3%
6 | 1 | 0.3%
7 | 1 | 0.3%
8 | 1 | 0.3%
9 | 1 | 0.3%
10 | 1 | 0.3%

Value | Count | Frequency (%)
288 | 1 | 0.3%
287 | 1 | 0.3%
286 | 1 | 0.3%
285 | 1 | 0.3%
284 | 1 | 0.3%
283 | 1 | 0.3%
282 | 1 | 0.3%
281 | 1 | 0.3%
280 | 1 | 0.3%
279 | 1 | 0.3%

업종명
Categorical

HIGH CORRELATION 

Distinct: 5
Distinct (%): 1.7%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB
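인터넷컴퓨터게임시설제공업 (Internet computer game facility provision business): 151
청소년게임제공업 (youth game provision business): 55
노래연습장업 (singing practice room business): 50
일반게임제공업 (general game provision business): 18
복합유통게임제공업 (combined distribution game provision business): 14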

Length

Max length: 13
Median length: 13
Mean length: 10.260417
Min length: 6
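
The length statistics can be cross-checked against the category counts listed for 업종명. This is a minimal sketch that weights each label's character count by its frequency. The counts are copied from the report, while the dictionary layout and variable names are illustrative.

```python
# Category counts copied from the 업종명 frequency listing.
counts = {
    "인터넷컴퓨터게임시설제공업": 151,
    "청소년게임제공업": 55,
    "노래연습장업": 50,
    "일반게임제공업": 18,
    "복합유통게임제공업": 14,
}

n_rows = sum(counts.values())

# Length of each label in characters (Unicode code points).
lengths = {name: len(name) for name in counts}

# Mean length over all rows: each label's length weighted by its count.
mean_length = sum(lengths[name] * c for name, c in counts.items()) / n_rows

print(n_rows, min(lengths.values()), max(lengths.values()), round(mean_length, 6))
```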

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 인터넷컴퓨터게임시설제공업
2nd row: 인터넷컴퓨터게임시설제공업
3rd row: 인터넷컴퓨터게임시설제공업
4th row: 인터넷컴퓨터게임시설제공업
5th row: 인터넷컴퓨터게임시설제공업

Common Values

Value | Count | Frequency (%)
인터넷컴퓨터게임시설제공업 | 151 | 52.4%
청소년게임제공업 | 55 | 19.1%
노래연습장업 | 50 | 17.4%
일반게임제공업 | 18 | 6.2%
복합유통게임제공업 | 14 | 4.9%

Length

Histogram of lengths of the category

Common Values (Plot)


업소명
Text

Distinct: 270
Distinct (%): 93.8%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB

Length

Max length: 18
Median length: 16
Mean length: 6.4548611
Min length: 2

Characters and Unicode

Total characters: 1859
Distinct characters: 314
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
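
As a rough illustration of those character properties, the sketch below classifies the characters of one business name from the report's sample rows with Python's standard `unicodedata` module. It covers only the general category: the standard library does not expose scripts or blocks, so those parts of the report are not reproduced here.

```python
import unicodedata
from collections import Counter

# One business name taken from the report's sample rows.
name = "갤러리PC방"

# General category per code point: "Lo" is Other Letter (Hangul syllables
# fall here), "Lu" is Uppercase Letter.
categories = Counter(unicodedata.category(ch) for ch in name)

for ch in name:
    print(ch, unicodedata.category(ch), unicodedata.name(ch))
print(dict(categories))
```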

Unique

Unique: 254
Unique (%): 88.2%

Sample

1st row: 갤러리PC방
2nd row: 돼지PC방
3rd row: 바빌론PC방
4th row: 겜노리PC방
5th row: 엔플러스PC방
Value | Count | Frequency (%)
pc | 21 | 5.6%
pc방 | 13 | 3.4%
노래연습장 | 5 | 1.3%
대박pc | 4 | 1.1%
db | 4 | 1.1%
인형뽑기 | 4 | 1.1%
게임랜드 | 3 | 0.8%
cafe | 3 | 0.8%
제노pc방 | 3 | 0.8%
(blank) | 2 | 0.5%
Other values (292) | 315 | 83.6%

Most occurring characters

ValueCountFrequency (%)
P 158
 
8.5%
C 155
 
8.3%
89
 
4.8%
77
 
4.1%
59
 
3.2%
57
 
3.1%
54
 
2.9%
51
 
2.7%
50
 
2.7%
39
 
2.1%
Other values (304) 1070
57.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1303 | 70.1%
Uppercase Letter | 408 | 21.9%
Space Separator | 89 | 4.8%
Lowercase Letter | 25 | 1.3%
Decimal Number | 21 | 1.1%
Close Punctuation | 5 | 0.3%
Open Punctuation | 5 | 0.3%
Other Punctuation | 2 | 0.1%
Dash Punctuation | 1 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
77
 
5.9%
59
 
4.5%
57
 
4.4%
54
 
4.1%
51
 
3.9%
50
 
3.8%
39
 
3.0%
30
 
2.3%
28
 
2.1%
28
 
2.1%
Other values (255) 830
63.7%
Uppercase Letter
Value | Count | Frequency (%)
P | 158 | 38.7%
C | 155 | 38.0%
E | 14 | 3.4%
B | 10 | 2.5%
O | 10 | 2.5%
N | 7 | 1.7%
S | 5 | 1.2%
A | 5 | 1.2%
F | 5 | 1.2%
I | 4 | 1.0%
Other values (14) | 35 | 8.6%
Lowercase Letter
Value | Count | Frequency (%)
c | 4 | 16.0%
p | 4 | 16.0%
t | 3 | 12.0%
a | 2 | 8.0%
s | 2 | 8.0%
o | 2 | 8.0%
r | 2 | 8.0%
e | 1 | 4.0%
f | 1 | 4.0%
u | 1 | 4.0%
Other values (3) | 3 | 12.0%
Decimal Number
Value | Count | Frequency (%)
2 | 8 | 38.1%
3 | 5 | 23.8%
1 | 3 | 14.3%
4 | 3 | 14.3%
6 | 1 | 4.8%
5 | 1 | 4.8%
Other Punctuation
Value | Count | Frequency (%)
' | 1 | 50.0%
& | 1 | 50.0%
Space Separator
Value | Count | Frequency (%)
(space) | 89 | 100.0%
Close Punctuation
Value | Count | Frequency (%)
) | 5 | 100.0%
Open Punctuation
Value | Count | Frequency (%)
( | 5 | 100.0%
Dash Punctuation
Value | Count | Frequency (%)
- | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1303 | 70.1%
Latin | 433 | 23.3%
Common | 123 | 6.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
77
 
5.9%
59
 
4.5%
57
 
4.4%
54
 
4.1%
51
 
3.9%
50
 
3.8%
39
 
3.0%
30
 
2.3%
28
 
2.1%
28
 
2.1%
Other values (255) 830
63.7%
Latin
Value | Count | Frequency (%)
P | 158 | 36.5%
C | 155 | 35.8%
E | 14 | 3.2%
B | 10 | 2.3%
O | 10 | 2.3%
N | 7 | 1.6%
S | 5 | 1.2%
A | 5 | 1.2%
F | 5 | 1.2%
I | 4 | 0.9%
Other values (27) | 60 | 13.9%
Common
Value | Count | Frequency (%)
(space) | 89 | 72.4%
2 | 8 | 6.5%
) | 5 | 4.1%
( | 5 | 4.1%
3 | 5 | 4.1%
1 | 3 | 2.4%
4 | 3 | 2.4%
6 | 1 | 0.8%
' | 1 | 0.8%
& | 1 | 0.8%
Other values (2) | 2 | 1.6%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1303 | 70.1%
ASCII | 556 | 29.9%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
P | 158 | 28.4%
C | 155 | 27.9%
(space) | 89 | 16.0%
E | 14 | 2.5%
B | 10 | 1.8%
O | 10 | 1.8%
2 | 8 | 1.4%
N | 7 | 1.3%
) | 5 | 0.9%
( | 5 | 0.9%
Other values (39) | 95 | 17.1%
Hangul
ValueCountFrequency (%)
77
 
5.9%
59
 
4.5%
57
 
4.4%
54
 
4.1%
51
 
3.9%
50
 
3.8%
39
 
3.0%
30
 
2.3%
28
 
2.1%
28
 
2.1%
Other values (255) 830
63.7%

업소도로명주소
Text

Distinct: 285
Distinct (%): 99.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.4 KiB

Length

Max length: 57
Median length: 40
Mean length: 29.611111
Min length: 20

Characters and Unicode

Total characters: 8528
Distinct characters: 188
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 282
Unique (%): 97.9%

Sample

1st row: 경기도 부천시 길주로77번길 61 404~406호 (상동 부건프라자 )
2nd row: 경기도 부천시 송내대로265번길 53 201호 (상동)
3rd row: 경기도 부천시 상동로 113 401호 (상동 승재프라자)
4th row: 경기도 부천시 길주로 125 다승프라자 125호 (상동)
5th row: 경기도 부천시 상동로 105 703호 (상동 현해프라자)
Value | Count | Frequency (%)
경기도 | 288 | 15.7%
부천시 | 288 | 15.7%
심곡동 | 70 | 3.8%
1층 | 63 | 3.4%
2층 | 57 | 3.1%
중동 | 39 | 2.1%
고강동 | 36 | 2.0%
부일로 | 31 | 1.7%
상동 | 30 | 1.6%
심곡본동 | 24 | 1.3%
Other values (456) | 905 | 49.4%

Most occurring characters

ValueCountFrequency (%)
1822
21.4%
382
 
4.5%
1 340
 
4.0%
336
 
3.9%
314
 
3.7%
301
 
3.5%
297
 
3.5%
296
 
3.5%
293
 
3.4%
) 289
 
3.4%
Other values (178) 3858
45.2%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 4540 | 53.2%
Space Separator | 1822 | 21.4%
Decimal Number | 1544 | 18.1%
Close Punctuation | 289 | 3.4%
Open Punctuation | 289 | 3.4%
Dash Punctuation | 24 | 0.3%
Uppercase Letter | 11 | 0.1%
Math Symbol | 9 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
382
 
8.4%
336
 
7.4%
314
 
6.9%
301
 
6.6%
297
 
6.5%
296
 
6.5%
293
 
6.5%
288
 
6.3%
167
 
3.7%
156
 
3.4%
Other values (161) 1710
37.7%
Decimal Number
Value | Count | Frequency (%)
1 | 340 | 22.0%
2 | 244 | 15.8%
3 | 190 | 12.3%
0 | 179 | 11.6%
4 | 140 | 9.1%
5 | 116 | 7.5%
7 | 95 | 6.2%
6 | 89 | 5.8%
9 | 78 | 5.1%
8 | 73 | 4.7%
Uppercase Letter
Value | Count | Frequency (%)
B | 9 | 81.8%
F | 2 | 18.2%
Space Separator
Value | Count | Frequency (%)
(space) | 1822 | 100.0%
Close Punctuation
Value | Count | Frequency (%)
) | 289 | 100.0%
Open Punctuation
Value | Count | Frequency (%)
( | 289 | 100.0%
Dash Punctuation
Value | Count | Frequency (%)
- | 24 | 100.0%
Math Symbol
Value | Count | Frequency (%)
~ | 9 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 4540 | 53.2%
Common | 3977 | 46.6%
Latin | 11 | 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
382
 
8.4%
336
 
7.4%
314
 
6.9%
301
 
6.6%
297
 
6.5%
296
 
6.5%
293
 
6.5%
288
 
6.3%
167
 
3.7%
156
 
3.4%
Other values (161) 1710
37.7%
Common
Value | Count | Frequency (%)
(space) | 1822 | 45.8%
1 | 340 | 8.5%
) | 289 | 7.3%
( | 289 | 7.3%
2 | 244 | 6.1%
3 | 190 | 4.8%
0 | 179 | 4.5%
4 | 140 | 3.5%
5 | 116 | 2.9%
7 | 95 | 2.4%
Other values (5) | 273 | 6.9%
Latin
Value | Count | Frequency (%)
B | 9 | 81.8%
F | 2 | 18.2%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 4540 | 53.2%
ASCII | 3988 | 46.8%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 1822 | 45.7%
1 | 340 | 8.5%
) | 289 | 7.2%
( | 289 | 7.2%
2 | 244 | 6.1%
3 | 190 | 4.8%
0 | 179 | 4.5%
4 | 140 | 3.5%
5 | 116 | 2.9%
7 | 95 | 2.4%
Other values (7) | 284 | 7.1%
Hangul
ValueCountFrequency (%)
382
 
8.4%
336
 
7.4%
314
 
6.9%
301
 
6.6%
297
 
6.5%
296
 
6.5%
293
 
6.5%
288
 
6.3%
167
 
3.7%
156
 
3.4%
Other values (161) 1710
37.7%

업소전화번호
Text

MISSING 

Distinct: 37
Distinct (%): 100.0%
Missing: 251
Missing (%): 87.2%
Memory size: 2.4 KiB

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Characters and Unicode

Total characters: 444
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 37
Unique (%): 100.0%

Sample

1st row: 032-322-3245
2nd row: 032-623-1780
3rd row: 032-666-3010
4th row: 032-656-9322
5th row: 032-665-3545
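
The reported figures for 업소전화번호 are consistent with a single fixed format: 37 values × 12 characters = 444 characters, of which 74 are dashes and 370 are digits, so each value has 2 dashes and 10 digits. The sketch below checks the five sample values against such a pattern. The regular expression is an assumption inferred from the samples, not something the report states, and the other 32 non-missing values are not verified here.

```python
import re
import unicodedata
from collections import Counter

# The five sample values from the report.
samples = [
    "032-322-3245",
    "032-623-1780",
    "032-666-3010",
    "032-656-9322",
    "032-665-3545",
]

# Assumed pattern: area code 032, then a 3-digit and a 4-digit group.
PHONE_RE = re.compile(r"032-\d{3}-\d{4}")

all_match = all(PHONE_RE.fullmatch(s) is not None for s in samples)
lengths = {len(s) for s in samples}

# "Nd" is Decimal Number, "Pd" is Dash Punctuation.
categories = Counter(unicodedata.category(ch) for s in samples for ch in s)

print(all_match, lengths, dict(categories))
```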
Value | Count | Frequency (%)
032-678-9239 | 1 | 2.7%
032-681-1724 | 1 | 2.7%
032-673-2633 | 1 | 2.7%
032-681-1087 | 1 | 2.7%
032-677-2114 | 1 | 2.7%
032-674-5039 | 1 | 2.7%
032-697-3464 | 1 | 2.7%
032-671-1671 | 1 | 2.7%
032-675-4130 | 1 | 2.7%
032-675-0355 | 1 | 2.7%
Other values (27) | 27 | 73.0%

Most occurring characters

Value | Count | Frequency (%)
- | 74 | 16.7%
3 | 59 | 13.3%
2 | 58 | 13.1%
0 | 56 | 12.6%
6 | 55 | 12.4%
7 | 29 | 6.5%
8 | 26 | 5.9%
1 | 23 | 5.2%
9 | 22 | 5.0%
4 | 22 | 5.0%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 370 | 83.3%
Dash Punctuation | 74 | 16.7%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
3 | 59 | 15.9%
2 | 58 | 15.7%
0 | 56 | 15.1%
6 | 55 | 14.9%
7 | 29 | 7.8%
8 | 26 | 7.0%
1 | 23 | 6.2%
9 | 22 | 5.9%
4 | 22 | 5.9%
5 | 20 | 5.4%
Dash Punctuation
Value | Count | Frequency (%)
- | 74 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 444 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
- | 74 | 16.7%
3 | 59 | 13.3%
2 | 58 | 13.1%
0 | 56 | 12.6%
6 | 55 | 12.4%
7 | 29 | 6.5%
8 | 26 | 5.9%
1 | 23 | 5.2%
9 | 22 | 5.0%
4 | 22 | 5.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 444 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
- | 74 | 16.7%
3 | 59 | 13.3%
2 | 58 | 13.1%
0 | 56 | 12.6%
6 | 55 | 12.4%
7 | 29 | 6.5%
8 | 26 | 5.9%
1 | 23 | 5.2%
9 | 22 | 5.0%
4 | 22 | 5.0%

Interactions


Correlations

Variable | 번호 | 업종명 | 업소전화번호
번호 | 1.000 | 0.854 | 1.000
업종명 | 0.854 | 1.000 | 1.000
업소전화번호 | 1.000 | 1.000 | 1.000

Variable | 번호 | 업종명
번호 | 1.000 | 0.516
업종명 | 0.516 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Index | 번호 | 업종명 | 업소명 | 업소도로명주소 | 업소전화번호
0 | 1 | 인터넷컴퓨터게임시설제공업 | 갤러리PC방 | 경기도 부천시 길주로77번길 61 404~406호 (상동 부건프라자 ) | <NA>
1 | 2 | 인터넷컴퓨터게임시설제공업 | 돼지PC방 | 경기도 부천시 송내대로265번길 53 201호 (상동) | <NA>
2 | 3 | 인터넷컴퓨터게임시설제공업 | 바빌론PC방 | 경기도 부천시 상동로 113 401호 (상동 승재프라자) | <NA>
3 | 4 | 인터넷컴퓨터게임시설제공업 | 겜노리PC방 | 경기도 부천시 길주로 125 다승프라자 125호 (상동) | <NA>
4 | 5 | 인터넷컴퓨터게임시설제공업 | 엔플러스PC방 | 경기도 부천시 상동로 105 703호 (상동 현해프라자) | <NA>
5 | 6 | 인터넷컴퓨터게임시설제공업 | 예림이 | 경기도 부천시 길주로77번길 55-25 대야복합타워 105호 (상동) | <NA>
6 | 7 | 인터넷컴퓨터게임시설제공업 | DB PC카페 | 경기도 부천시 송내대로73번길 21 (상동 정원빌딩) | <NA>
7 | 8 | 인터넷컴퓨터게임시설제공업 | 아이센스리그PC방 송내로데오점 | 경기도 부천시 상일로94번길 16 2층 전체호 (상동) | <NA>
8 | 9 | 인터넷컴퓨터게임시설제공업 | 씨아이PC방 | 경기도 부천시 길주로 86 해피플러스 306~308호 (상동) | <NA>
9 | 10 | 인터넷컴퓨터게임시설제공업 | 아이비스PC | 경기도 부천시 송내대로265번길 43 크라운빌딩 301 302호 (상동) | <NA>

Index | 번호 | 업종명 | 업소명 | 업소도로명주소 | 업소전화번호
278 | 279 | 인터넷컴퓨터게임시설제공업 | 만수르PC | 경기도 부천시 고리울로51번길 56 1층 (고강동) | <NA>
279 | 280 | 인터넷컴퓨터게임시설제공업 | 연PC방 | 경기도 부천시 삼작로 370 1층 (원종동) | <NA>
280 | 281 | 인터넷컴퓨터게임시설제공업 | 대박PC | 경기도 부천시 성곡로 96 1층 (여월동) | <NA>
281 | 282 | 인터넷컴퓨터게임시설제공업 | 골드PC | 경기도 부천시 삼작로380번길 8 1층 (원종동) | <NA>
282 | 283 | 인터넷컴퓨터게임시설제공업 | 놀자PC방 | 경기도 부천시 역곡로482번길 124 1층 (고강동) | <NA>
283 | 284 | 인터넷컴퓨터게임시설제공업 | 고강PC | 경기도 부천시 역곡로504번길 97 101호 (고강동 영재리치빌) | <NA>
284 | 285 | 일반게임제공업 | 황금게임랜드 | 경기도 부천시 역곡로 473 지하층 (고강동) | <NA>
285 | 286 | 청소년게임제공업 | 뽑기2번지 | 경기도 부천시 삼작로380번길 6 1층 (원종동) | <NA>
286 | 287 | 청소년게임제공업 | 꿈꾸는 인형 | 경기도 부천시 원종로 105 1층 (고강동) | <NA>
287 | 288 | 청소년게임제공업 | 여기서 뽑어 | 경기도 부천시 고강로72번길 67 1층 (고강동) | <NA>