Overview

Dataset statistics

Number of variables7
Number of observations176
Missing cells24
Missing cells (%)1.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory9.9 KiB
Average record size in memory57.8 B

Variable types

Categorical3
Text3
Numeric1

Dataset

Description경기도 김포시의 실내공기질 관리법 적용대상 다중이용시설 현황 입니다.(시설구분, 시설명, 주소, 전화번호, 연면적, 위도, 경도)
Author경기도 김포시
URLhttps://www.data.go.kr/data/15037681/fileData.do

Alerts

시군명 has constant value ""Constant
데이터기준일자 has constant value ""Constant
전화번호 has 24 (13.6%) missing valuesMissing

Reproduction

Analysis started2024-03-30 07:54:09.825309
Analysis finished2024-03-30 07:54:12.106617
Duration2.28 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
김포시
176 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row김포시
2nd row김포시
3rd row김포시
4th row김포시
5th row김포시

Common Values

ValueCountFrequency (%)
김포시 176
100.0%

Length

2024-03-30T07:54:12.454638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-30T07:54:12.948771image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
김포시 176
100.0%

시설구분
Categorical

Distinct14
Distinct (%)8.0%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
어린이집
38 
노인요양시설
28 
실내주차장
27 
의료기관
24 
PC영업시설
13 
Other values (9)
46 

Length

Max length9
Median length6
Mean length4.6534091
Min length2

Unique

Unique2 ?
Unique (%)1.1%

Sample

1st rowPC영업시설
2nd rowPC영업시설
3rd rowPC영업시설
4th rowPC영업시설
5th rowPC영업시설

Common Values

ValueCountFrequency (%)
어린이집 38
21.6%
노인요양시설 28
15.9%
실내주차장 27
15.3%
의료기관 24
13.6%
PC영업시설 13
 
7.4%
목욕장 11
 
6.2%
대규모점포 8
 
4.5%
지하역사 8
 
4.5%
산후조리원 5
 
2.8%
영화상영관 5
 
2.8%
Other values (4) 9
 
5.1%

Length

2024-03-30T07:54:13.365901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
어린이집 38
21.6%
노인요양시설 28
15.9%
실내주차장 27
15.3%
의료기관 24
13.6%
pc영업시설 13
 
7.4%
목욕장 11
 
6.2%
대규모점포 8
 
4.5%
지하역사 8
 
4.5%
산후조리원 5
 
2.8%
영화상영관 5
 
2.8%
Other values (4) 9
 
5.1%
Distinct172
Distinct (%)97.7%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2024-03-30T07:54:14.039693image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length17
Mean length8.2954545
Min length3

Characters and Unicode

Total characters1460
Distinct characters287
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique168 ?
Unique (%)95.5%

Sample

1st rowFLEX PC
2nd rowNIC PC방 장기점
3rd rowplay pc zone
4th rowVRIZ
5th row긱스타
ValueCountFrequency (%)
김포점 9
 
3.8%
김포도시철도 8
 
3.4%
현대프리미엄아울렛 4
 
1.7%
김포한강점 3
 
1.3%
별관 2
 
0.9%
요양원 2
 
0.9%
산후조리원 2
 
0.9%
운양역 2
 
0.9%
김포한강 2
 
0.9%
김포한강어린이집 2
 
0.9%
Other values (191) 198
84.6%
2024-03-30T07:54:15.231014image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
67
 
4.6%
63
 
4.3%
62
 
4.2%
59
 
4.0%
58
 
4.0%
42
 
2.9%
40
 
2.7%
39
 
2.7%
37
 
2.5%
32
 
2.2%
Other values (277) 961
65.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1314
90.0%
Space Separator 58
 
4.0%
Uppercase Letter 50
 
3.4%
Decimal Number 11
 
0.8%
Lowercase Letter 10
 
0.7%
Open Punctuation 7
 
0.5%
Close Punctuation 7
 
0.5%
Other Symbol 2
 
0.1%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
67
 
5.1%
63
 
4.8%
62
 
4.7%
59
 
4.5%
42
 
3.2%
40
 
3.0%
39
 
3.0%
37
 
2.8%
32
 
2.4%
31
 
2.4%
Other values (240) 842
64.1%
Uppercase Letter
ValueCountFrequency (%)
C 14
28.0%
P 9
18.0%
V 4
 
8.0%
G 3
 
6.0%
I 3
 
6.0%
M 3
 
6.0%
E 2
 
4.0%
Y 2
 
4.0%
K 1
 
2.0%
U 1
 
2.0%
Other values (8) 8
16.0%
Lowercase Letter
ValueCountFrequency (%)
p 2
20.0%
e 1
10.0%
n 1
10.0%
o 1
10.0%
z 1
10.0%
c 1
10.0%
y 1
10.0%
a 1
10.0%
l 1
10.0%
Decimal Number
ValueCountFrequency (%)
2 5
45.5%
4 3
27.3%
1 1
 
9.1%
3 1
 
9.1%
5 1
 
9.1%
Space Separator
ValueCountFrequency (%)
58
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1316
90.1%
Common 84
 
5.8%
Latin 60
 
4.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
67
 
5.1%
63
 
4.8%
62
 
4.7%
59
 
4.5%
42
 
3.2%
40
 
3.0%
39
 
3.0%
37
 
2.8%
32
 
2.4%
31
 
2.4%
Other values (241) 844
64.1%
Latin
ValueCountFrequency (%)
C 14
23.3%
P 9
15.0%
V 4
 
6.7%
G 3
 
5.0%
I 3
 
5.0%
M 3
 
5.0%
p 2
 
3.3%
E 2
 
3.3%
Y 2
 
3.3%
K 1
 
1.7%
Other values (17) 17
28.3%
Common
ValueCountFrequency (%)
58
69.0%
( 7
 
8.3%
) 7
 
8.3%
2 5
 
6.0%
4 3
 
3.6%
- 1
 
1.2%
1 1
 
1.2%
3 1
 
1.2%
5 1
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1314
90.0%
ASCII 144
 
9.9%
None 2
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
67
 
5.1%
63
 
4.8%
62
 
4.7%
59
 
4.5%
42
 
3.2%
40
 
3.0%
39
 
3.0%
37
 
2.8%
32
 
2.4%
31
 
2.4%
Other values (240) 842
64.1%
ASCII
ValueCountFrequency (%)
58
40.3%
C 14
 
9.7%
P 9
 
6.2%
( 7
 
4.9%
) 7
 
4.9%
2 5
 
3.5%
V 4
 
2.8%
G 3
 
2.1%
4 3
 
2.1%
I 3
 
2.1%
Other values (26) 31
21.5%
None
ValueCountFrequency (%)
2
100.0%
Distinct171
Distinct (%)97.2%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2024-03-30T07:54:15.876065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length49
Median length35
Mean length22.375
Min length9

Characters and Unicode

Total characters3938
Distinct characters186
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique167 ?
Unique (%)94.9%

Sample

1st row경기도 김포시 김포한강11로 43, 비호뉴팰리스Ⅱ 2층 전체호 (운양동)
2nd row경기도 김포시 김포한강1로78번길 6-6, 연경프라자 3층 (장기동)
3rd row경기도 김포시 김포한강9로 95, 407,408,409호 (구래동)
4th row경기도 김포시 김포한강9로 95, 404,405,406호 (구래동)
5th row경기도 김포시 김포한강1로 57, MJ프라자 601,602호 (장기동)
ValueCountFrequency (%)
김포시 174
 
21.9%
경기도 63
 
7.9%
통진읍 23
 
2.9%
김포대로 19
 
2.4%
고촌읍 13
 
1.6%
구래동 10
 
1.3%
양촌읍 9
 
1.1%
김포한강8로 8
 
1.0%
장기동 7
 
0.9%
김포한강9로 7
 
0.9%
Other values (340) 460
58.0%
2024-03-30T07:54:17.164637image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
649
 
16.5%
263
 
6.7%
255
 
6.5%
177
 
4.5%
1 177
 
4.5%
175
 
4.4%
2 133
 
3.4%
0 86
 
2.2%
3 85
 
2.2%
, 81
 
2.1%
Other values (176) 1857
47.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2189
55.6%
Decimal Number 870
 
22.1%
Space Separator 649
 
16.5%
Other Punctuation 81
 
2.1%
Open Punctuation 47
 
1.2%
Close Punctuation 47
 
1.2%
Dash Punctuation 36
 
0.9%
Math Symbol 12
 
0.3%
Uppercase Letter 6
 
0.2%
Letter Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
263
 
12.0%
255
 
11.6%
177
 
8.1%
175
 
8.0%
79
 
3.6%
70
 
3.2%
69
 
3.2%
60
 
2.7%
57
 
2.6%
54
 
2.5%
Other values (155) 930
42.5%
Decimal Number
ValueCountFrequency (%)
1 177
20.3%
2 133
15.3%
0 86
9.9%
3 85
9.8%
7 81
9.3%
4 77
8.9%
5 76
8.7%
6 54
 
6.2%
9 53
 
6.1%
8 48
 
5.5%
Uppercase Letter
ValueCountFrequency (%)
B 3
50.0%
G 1
 
16.7%
M 1
 
16.7%
J 1
 
16.7%
Space Separator
ValueCountFrequency (%)
649
100.0%
Other Punctuation
ValueCountFrequency (%)
, 81
100.0%
Open Punctuation
ValueCountFrequency (%)
( 47
100.0%
Close Punctuation
ValueCountFrequency (%)
) 47
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 36
100.0%
Math Symbol
ValueCountFrequency (%)
~ 12
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2189
55.6%
Common 1742
44.2%
Latin 7
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
263
 
12.0%
255
 
11.6%
177
 
8.1%
175
 
8.0%
79
 
3.6%
70
 
3.2%
69
 
3.2%
60
 
2.7%
57
 
2.6%
54
 
2.5%
Other values (155) 930
42.5%
Common
ValueCountFrequency (%)
649
37.3%
1 177
 
10.2%
2 133
 
7.6%
0 86
 
4.9%
3 85
 
4.9%
, 81
 
4.6%
7 81
 
4.6%
4 77
 
4.4%
5 76
 
4.4%
6 54
 
3.1%
Other values (6) 243
 
13.9%
Latin
ValueCountFrequency (%)
B 3
42.9%
1
 
14.3%
G 1
 
14.3%
M 1
 
14.3%
J 1
 
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2189
55.6%
ASCII 1748
44.4%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
649
37.1%
1 177
 
10.1%
2 133
 
7.6%
0 86
 
4.9%
3 85
 
4.9%
, 81
 
4.6%
7 81
 
4.6%
4 77
 
4.4%
5 76
 
4.3%
6 54
 
3.1%
Other values (10) 249
 
14.2%
Hangul
ValueCountFrequency (%)
263
 
12.0%
255
 
11.6%
177
 
8.1%
175
 
8.0%
79
 
3.6%
70
 
3.2%
69
 
3.2%
60
 
2.7%
57
 
2.6%
54
 
2.5%
Other values (155) 930
42.5%
Number Forms
ValueCountFrequency (%)
1
100.0%

전화번호
Text

MISSING 

Distinct147
Distinct (%)96.7%
Missing24
Missing (%)13.6%
Memory size1.5 KiB
2024-03-30T07:54:17.922159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length12.157895
Min length9

Characters and Unicode

Total characters1848
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique144 ?
Unique (%)94.7%

Sample

1st row070-8715-8500
2nd row031-987-1360
3rd row031-988-0303
4th row031-360-1530
5th row031-985-8004
ValueCountFrequency (%)
031-8048-2623 4
 
2.6%
031-8049-2580 2
 
1.3%
031-982-5700 2
 
1.3%
031-8048-1740 1
 
0.7%
031-981-5550 1
 
0.7%
031-989-2388 1
 
0.7%
031-986-9164 1
 
0.7%
070-8715-8500 1
 
0.7%
031-997-6509 1
 
0.7%
031-981-5116 1
 
0.7%
Other values (137) 137
90.1%
2024-03-30T07:54:19.374433image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 302
16.3%
0 284
15.4%
9 233
12.6%
1 217
11.7%
8 206
11.1%
3 205
11.1%
7 93
 
5.0%
6 90
 
4.9%
5 78
 
4.2%
2 73
 
4.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1546
83.7%
Dash Punctuation 302
 
16.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 284
18.4%
9 233
15.1%
1 217
14.0%
8 206
13.3%
3 205
13.3%
7 93
 
6.0%
6 90
 
5.8%
5 78
 
5.0%
2 73
 
4.7%
4 67
 
4.3%
Dash Punctuation
ValueCountFrequency (%)
- 302
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1848
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 302
16.3%
0 284
15.4%
9 233
12.6%
1 217
11.7%
8 206
11.1%
3 205
11.1%
7 93
 
5.0%
6 90
 
4.9%
5 78
 
4.2%
2 73
 
4.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1848
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 302
16.3%
0 284
15.4%
9 233
12.6%
1 217
11.7%
8 206
11.1%
3 205
11.1%
7 93
 
5.0%
6 90
 
4.9%
5 78
 
4.2%
2 73
 
4.0%

연면적
Real number (ℝ)

Distinct169
Distinct (%)96.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7096.9773
Minimum297
Maximum153900
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2024-03-30T07:54:19.893350image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum297
5-th percentile409
Q1771.5
median2227
Q34182.75
95-th percentile26854.5
Maximum153900
Range153603
Interquartile range (IQR)3411.25

Descriptive statistics

Standard deviation18744.202
Coefficient of variation (CV)2.6411529
Kurtosis42.603305
Mean7096.9773
Median Absolute Deviation (MAD)1590
Skewness6.0322594
Sum1249068
Variance3.513451 × 108
MonotonicityNot monotonic
2024-03-30T07:54:20.508813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1709 2
 
1.1%
644 2
 
1.1%
1136 2
 
1.1%
464 2
 
1.1%
2338 2
 
1.1%
14798 2
 
1.1%
465 2
 
1.1%
4159 1
 
0.6%
441 1
 
0.6%
1342 1
 
0.6%
Other values (159) 159
90.3%
ValueCountFrequency (%)
297 1
0.6%
312 1
0.6%
324 1
0.6%
332 1
0.6%
339 1
0.6%
340 1
0.6%
343 1
0.6%
395 1
0.6%
403 1
0.6%
411 1
0.6%
ValueCountFrequency (%)
153900 1
0.6%
153810 1
0.6%
55648 1
0.6%
55600 1
0.6%
52960 1
0.6%
51141 1
0.6%
48769 1
0.6%
45834 1
0.6%
28302 1
0.6%
26372 1
0.6%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size1.5 KiB
2024-03-15
176 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2024-03-15
2nd row2024-03-15
3rd row2024-03-15
4th row2024-03-15
5th row2024-03-15

Common Values

ValueCountFrequency (%)
2024-03-15 176
100.0%

Length

2024-03-30T07:54:21.024560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-30T07:54:21.761652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2024-03-15 176
100.0%

Interactions

2024-03-30T07:54:10.754366image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-30T07:54:21.996341image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시설구분연면적
시설구분1.0000.464
연면적0.4641.000
2024-03-30T07:54:22.274947image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연면적시설구분
연면적1.0000.254
시설구분0.2541.000

Missing values

2024-03-30T07:54:11.234454image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-30T07:54:11.779520image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시군명시설구분시설명소재지도로명주소전화번호연면적데이터기준일자
0김포시PC영업시설FLEX PC경기도 김포시 김포한강11로 43, 비호뉴팰리스Ⅱ 2층 전체호 (운양동)070-8715-85003392024-03-15
1김포시PC영업시설NIC PC방 장기점경기도 김포시 김포한강1로78번길 6-6, 연경프라자 3층 (장기동)031-987-13604652024-03-15
2김포시PC영업시설play pc zone경기도 김포시 김포한강9로 95, 407,408,409호 (구래동)<NA>4112024-03-15
3김포시PC영업시설VRIZ경기도 김포시 김포한강9로 95, 404,405,406호 (구래동)<NA>3952024-03-15
4김포시PC영업시설긱스타경기도 김포시 김포한강1로 57, MJ프라자 601,602호 (장기동)<NA>3432024-03-15
5김포시PC영업시설몬스터PC경기도 김포시 김포한강9로 77, 702~705호 (구래동)<NA>4032024-03-15
6김포시PC영업시설볼트PC까페김포시 통진읍 김포대로2244번길 20, 마송프라자 701호<NA>5572024-03-15
7김포시PC영업시설브리즈김포시 김포한강9로 95, 센터프라자 401,402,403호<NA>3402024-03-15
8김포시PC영업시설스페이스PC방경기도 김포시 돌문로 49, 지하1층031-988-03033322024-03-15
9김포시PC영업시설엠투피씨(M2PC)경기도 김포시 김포한강9로76번길 63, 5층 504,505,506,507호 (구래동)<NA>4922024-03-15
시군명시설구분시설명소재지도로명주소전화번호연면적데이터기준일자
166김포시장례식장아너스힐병원장례식장경기도 김포시 통진읍 흥신로320-10 나동 1층031-989-440414522024-03-15
167김포시지하역사김포도시철도 걸포북변역김포시 김포대로 1040031-8048-175042542024-03-15
168김포시지하역사김포도시철도 고촌역김포시 고촌읍 김포대로 350031-8048-178043722024-03-15
169김포시지하역사김포도시철도 구래역김포시 김포한강7로 87031-8048-171032712024-03-15
170김포시지하역사김포도시철도 마산역김포시 김포한강3로 442031-8048-172030622024-03-15
171김포시지하역사김포도시철도 사우(김포시청)역김포시 김포대로 852031-8048-176041422024-03-15
172김포시지하역사김포도시철도 운양역김포시 김포한강1로 235031-8048-174036632024-03-15
173김포시지하역사김포도시철도 장기역김포시 김포한강1로 59031-8048-173039962024-03-15
174김포시지하역사김포도시철도 풍무역김포시 김포대로 710031-8048-177037292024-03-15
175김포시학원리틀아메리카어학원경기도 김포시 김포한강11로 139-44, 2층~5층 전부(운양동, 리틀아메리카)031-986-058414192024-03-15