Overview

Dataset statistics

Number of variables: 4
Number of observations: 9148
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 294.9 KiB
Average record size in memory: 33.0 B
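
The average record size follows from the total memory size and the number of observations. A minimal sketch of that arithmetic, assuming 1 KiB = 1024 bytes and that the report divides total memory by the row count (the variable names are illustrative):

```python
# Figures copied from the dataset statistics above.
total_kib = 294.9
n_observations = 9148

# Assumption: 1 KiB = 1024 bytes; average = total bytes / number of rows.
total_bytes = total_kib * 1024
avg_record_bytes = total_bytes / n_observations

print(f"{avg_record_bytes:.1f} B")
```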

Variable types

Numeric: 1
Categorical: 1
Text: 2

Dataset

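Description: List of designated meal service providers for undernourished children (결식아동 지정 급식소) in Bucheon-si. It gives the business name, location, and other details for each provider, classified as welfare center, community child center, or electronic meal card merchant.
URL: https://www.data.go.kr/data/3079417/fileData.do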

Alerts

구분 is highly imbalanced (96.4%) [Imbalance]
연번 has unique values [Unique]
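
The 96.4% figure is consistent with an entropy-based imbalance score: one minus the Shannon entropy of the class frequencies, divided by the maximum entropy for that number of classes. The sketch below recomputes it from the 구분 value counts reported later in this document (9089, 57, and 2). That ydata-profiling uses exactly this formula is an assumption here, and `imbalance_score` is an illustrative name.

```python
import math

def imbalance_score(counts):
    """Return 1 - normalized Shannon entropy (0 = balanced, near 1 = one class dominates)."""
    n_classes = len(counts)
    if n_classes < 2:
        # Convention chosen for this sketch: a single class has no imbalance to measure.
        return 0.0
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    return 1 - entropy / math.log2(n_classes)

score = imbalance_score([9089, 57, 2])
print(f"{score:.1%}")
```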

Reproduction

Analysis started: 2023-12-12 01:48:17.924106
Analysis finished: 2023-12-12 01:48:19.399900
Duration: 1.48 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번 (serial number)
Real number (ℝ)

UNIQUE 

Distinct: 9148
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4583.442
Minimum: 1
Maximum: 9157
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 80.5 KiB

Quantile statistics

Minimum: 1
5-th percentile: 467.35
Q1: 2296.75
Median: 4583.5
Q3: 6870.25
95-th percentile: 8699.65
Maximum: 9157
Range: 9156
Interquartile range (IQR): 4573.5

Descriptive statistics

Standard deviation: 2641.0445
Coefficient of variation (CV): 0.57621423
Kurtosis: -1.1998231
Mean: 4583.442
Median Absolute Deviation (MAD): 2287
Skewness: -0.00012972936
Sum: 41929327
Variance: 6975115.8
Monotonicity: Strictly increasing
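
Several of the figures for 연번 are derived from others. A short sketch, using only values printed in this section, that recomputes the range, IQR, coefficient of variation, variance, and sum. Small differences from the reported values are expected because the inputs are already rounded.

```python
# Values copied from the 연번 statistics in this section.
n = 9148
minimum, maximum = 1, 9157
q1, q3 = 2296.75, 6870.25
mean, std = 4583.442, 2641.0445

value_range = maximum - minimum   # reported Range: 9156
iqr = q3 - q1                     # reported IQR: 4573.5
cv = std / mean                   # reported CV: 0.57621423
variance = std ** 2               # reported Variance: 6975115.8
total = mean * n                  # reported Sum: 41929327

print(value_range, iqr)
print(f"cv={cv:.5f} variance={variance:.1f} sum={total:.0f}")
```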
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
1 | 1 | < 0.1%
6112 | 1 | < 0.1%
6106 | 1 | < 0.1%
6107 | 1 | < 0.1%
6108 | 1 | < 0.1%
6109 | 1 | < 0.1%
6110 | 1 | < 0.1%
6111 | 1 | < 0.1%
6113 | 1 | < 0.1%
6104 | 1 | < 0.1%
Other values (9138) | 9138 | 99.9%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | < 0.1%
2 | 1 | < 0.1%
3 | 1 | < 0.1%
4 | 1 | < 0.1%
5 | 1 | < 0.1%
6 | 1 | < 0.1%
7 | 1 | < 0.1%
8 | 1 | < 0.1%
9 | 1 | < 0.1%
10 | 1 | < 0.1%

Maximum 10 values
Value | Count | Frequency (%)
9157 | 1 | < 0.1%
9156 | 1 | < 0.1%
9155 | 1 | < 0.1%
9154 | 1 | < 0.1%
9153 | 1 | < 0.1%
9152 | 1 | < 0.1%
9151 | 1 | < 0.1%
9150 | 1 | < 0.1%
9149 | 1 | < 0.1%
9148 | 1 | < 0.1%

구분 (category)
Categorical

IMBALANCE 

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 71.6 KiB
일반음식점 (general restaurant): 9089
지역아동센터 (community child center): 57
도시락업체 (lunch box provider): 2

Length

Max length: 6
Median length: 5
Mean length: 5.0062309
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 지역아동센터
2nd row: 지역아동센터
3rd row: 지역아동센터
4th row: 지역아동센터
5th row: 지역아동센터

Common Values

Value | Count | Frequency (%)
일반음식점 | 9089 | 99.4%
지역아동센터 | 57 | 0.6%
도시락업체 | 2 | < 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Bar chart of the 구분 value counts (same data as the Common Values table).

상호명 (business name)
Text

Distinct: 8521
Distinct (%): 93.1%
Missing: 0
Missing (%): 0.0%
Memory size: 71.6 KiB

Length

Max length: 32
Median length: 26
Mean length: 7.3130739
Min length: 1

Characters and Unicode

Total characters: 66900
Distinct characters: 1063
Distinct categories: 11
Distinct scripts: 4
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
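
As an illustration of these character properties, the sketch below counts Unicode general categories for the five sample values listed for this variable, using Python's standard `unicodedata` module. It covers categories only; scripts and blocks are not exposed by the standard library, so they are left out.

```python
import unicodedata
from collections import Counter

# The five sample values shown for this variable in the report.
samples = ["서부", "도깨비", "심곡", "한울", "1318happyzone우리"]

# unicodedata.category returns the two-letter general category of a character,
# e.g. "Lo" (Other Letter), "Nd" (Decimal Number), "Ll" (Lowercase Letter).
category_counts = Counter(
    unicodedata.category(ch) for value in samples for ch in value
)

for category, count in category_counts.most_common():
    print(category, count)
```

Hangul syllables fall under "Lo" (Other Letter), which is why that category dominates the report's category table.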

Unique

Unique: 8139
Unique (%): 89.0%

Sample

1st row: 서부
2nd row: 도깨비
3rd row: 심곡
4th row: 한울
5th row: 1318happyzone우리
Value | Count | Frequency (%)
이마트24 | 138 | 1.2%
부천점 | 94 | 0.8%
지에스25 | 70 | 0.6%
세븐일레븐 | 67 | 0.6%
부천중동점 | 57 | 0.5%
중동점 | 55 | 0.5%
씨유(cu | 50 | 0.4%
부천역점 | 48 | 0.4%
부천옥길점 | 47 | 0.4%
부천상동점 | 39 | 0.3%
Other values (8758) | 10935 | 94.3%
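
The counts in this table sum to 11,600, more than the 9,148 rows, which suggests that each value is split into whitespace-separated words before counting. A minimal sketch of that kind of count follows. The whitespace tokenization is an inference from the totals, not a confirmed description of the tool, and the names below are hypothetical examples rather than rows from the dataset.

```python
from collections import Counter

# Hypothetical business names, for illustration only.
names = ["이마트24 부천점", "이마트24 중동점", "세븐일레븐 부천점", "도깨비"]

# Split each value on whitespace and count the resulting words.
word_counts = Counter(word for name in names for word in name.split())
total_words = sum(word_counts.values())

# Frequencies are relative to the number of words, not the number of rows.
for word, count in word_counts.most_common(3):
    print(word, count, f"{count / total_words:.1%}")
```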

Most occurring characters

ValueCountFrequency (%)
2791
 
4.2%
2453
 
3.7%
1766
 
2.6%
1596
 
2.4%
1238
 
1.9%
1066
 
1.6%
991
 
1.5%
( 895
 
1.3%
) 893
 
1.3%
865
 
1.3%
Other values (1053) 52346
78.2%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 57816 | 86.4%
Space Separator | 2453 | 3.7%
Uppercase Letter | 1822 | 2.7%
Decimal Number | 1427 | 2.1%
Lowercase Letter | 1315 | 2.0%
Open Punctuation | 895 | 1.3%
Close Punctuation | 893 | 1.3%
Other Punctuation | 265 | 0.4%
Dash Punctuation | 10 | < 0.1%
Connector Punctuation | 3 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2791
 
4.8%
1766
 
3.1%
1596
 
2.8%
1238
 
2.1%
1066
 
1.8%
991
 
1.7%
865
 
1.5%
754
 
1.3%
658
 
1.1%
577
 
1.0%
Other values (974) 45514
78.7%
Uppercase Letter
ValueCountFrequency (%)
S 254
13.9%
G 196
 
10.8%
C 181
 
9.9%
E 124
 
6.8%
U 108
 
5.9%
A 105
 
5.8%
O 98
 
5.4%
B 81
 
4.4%
F 71
 
3.9%
T 69
 
3.8%
Other values (16) 535
29.4%
Lowercase Letter
ValueCountFrequency (%)
e 225
17.1%
a 152
11.6%
o 109
 
8.3%
r 78
 
5.9%
n 76
 
5.8%
f 73
 
5.6%
i 71
 
5.4%
c 69
 
5.2%
t 65
 
4.9%
s 56
 
4.3%
Other values (16) 341
25.9%
Other Punctuation
ValueCountFrequency (%)
& 122
46.0%
/ 58
21.9%
. 42
 
15.8%
, 15
 
5.7%
' 12
 
4.5%
? 7
 
2.6%
! 5
 
1.9%
: 1
 
0.4%
" 1
 
0.4%
# 1
 
0.4%
Decimal Number
ValueCountFrequency (%)
2 515
36.1%
5 303
21.2%
4 186
 
13.0%
1 97
 
6.8%
3 90
 
6.3%
0 67
 
4.7%
9 60
 
4.2%
8 44
 
3.1%
6 39
 
2.7%
7 26
 
1.8%
Space Separator
ValueCountFrequency (%)
2453
100.0%
Open Punctuation
ValueCountFrequency (%)
( 895
100.0%
Close Punctuation
ValueCountFrequency (%)
) 893
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 10
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 3
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 57790 | 86.4%
Common | 5947 | 8.9%
Latin | 3137 | 4.7%
Han | 26 | < 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2791
 
4.8%
1766
 
3.1%
1596
 
2.8%
1238
 
2.1%
1066
 
1.8%
991
 
1.7%
865
 
1.5%
754
 
1.3%
658
 
1.1%
577
 
1.0%
Other values (958) 45488
78.7%
Latin
ValueCountFrequency (%)
S 254
 
8.1%
e 225
 
7.2%
G 196
 
6.2%
C 181
 
5.8%
a 152
 
4.8%
E 124
 
4.0%
o 109
 
3.5%
U 108
 
3.4%
A 105
 
3.3%
O 98
 
3.1%
Other values (42) 1585
50.5%
Common
ValueCountFrequency (%)
2453
41.2%
( 895
 
15.0%
) 893
 
15.0%
2 515
 
8.7%
5 303
 
5.1%
4 186
 
3.1%
& 122
 
2.1%
1 97
 
1.6%
3 90
 
1.5%
0 67
 
1.1%
Other values (17) 326
 
5.5%
Han
ValueCountFrequency (%)
7
26.9%
3
11.5%
3
11.5%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
Other values (6) 6
23.1%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 57789 | 86.4%
ASCII | 9084 | 13.6%
CJK | 26 | < 0.1%
Compat Jamo | 1 | < 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2791
 
4.8%
1766
 
3.1%
1596
 
2.8%
1238
 
2.1%
1066
 
1.8%
991
 
1.7%
865
 
1.5%
754
 
1.3%
658
 
1.1%
577
 
1.0%
Other values (957) 45487
78.7%
ASCII
ValueCountFrequency (%)
2453
27.0%
( 895
 
9.9%
) 893
 
9.8%
2 515
 
5.7%
5 303
 
3.3%
S 254
 
2.8%
e 225
 
2.5%
G 196
 
2.2%
4 186
 
2.0%
C 181
 
2.0%
Other values (69) 2983
32.8%
CJK
ValueCountFrequency (%)
7
26.9%
3
11.5%
3
11.5%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
Other values (6) 6
23.1%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

위 치 (location)
Text

Distinct: 8977
Distinct (%): 98.1%
Missing: 0
Missing (%): 0.0%
Memory size: 71.6 KiB

Length

Max length: 61
Median length: 54
Mean length: 31.20387
Min length: 11

Characters and Unicode

Total characters: 285453
Distinct characters: 500
Distinct categories: 11
Distinct scripts: 4
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 8816
Unique (%): 96.4%

Sample

1st row: 경기도 부천시 조마루로372번길 38
2nd row: 경기도 부천시 부흥로 424 하나리아벨 206호
3rd row: 경기도 부천시 신흥로 61
4th row: 경기도 부천시 장말로278번길 22 3층
5th row: 경기도 부천시 역곡로45번길 43 3층
Value | Count | Frequency (%)
부천시 | 9149 | 15.9%
경기 | 8956 | 15.6%
1층 | 2936 | 5.1%
원미구 | 1324 | 2.3%
중동 | 919 | 1.6%
상동 | 709 | 1.2%
일부 | 623 | 1.1%
심곡동 | 613 | 1.1%
오정구 | 543 | 0.9%
소사구 | 519 | 0.9%
Other values (7257) | 31264 | 54.3%

Most occurring characters

ValueCountFrequency (%)
48416
 
17.0%
1 17096
 
6.0%
12533
 
4.4%
10920
 
3.8%
, 10869
 
3.8%
10555
 
3.7%
9691
 
3.4%
9288
 
3.3%
9267
 
3.2%
9157
 
3.2%
Other values (490) 137661
48.2%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 151131 | 52.9%
Decimal Number | 54735 | 19.2%
Space Separator | 48416 | 17.0%
Other Punctuation | 10905 | 3.8%
Open Punctuation | 9153 | 3.2%
Close Punctuation | 9151 | 3.2%
Dash Punctuation | 1245 | 0.4%
Uppercase Letter | 623 | 0.2%
Lowercase Letter | 51 | < 0.1%
Math Symbol | 42 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12533
 
8.3%
10920
 
7.2%
10555
 
7.0%
9691
 
6.4%
9288
 
6.1%
9267
 
6.1%
9157
 
6.1%
5557
 
3.7%
5335
 
3.5%
4932
 
3.3%
Other values (428) 63896
42.3%
Uppercase Letter
ValueCountFrequency (%)
B 187
30.0%
A 108
17.3%
I 51
 
8.2%
C 45
 
7.2%
S 29
 
4.7%
E 24
 
3.9%
F 17
 
2.7%
T 17
 
2.7%
D 15
 
2.4%
U 14
 
2.2%
Other values (15) 116
18.6%
Lowercase Letter
ValueCountFrequency (%)
e 15
29.4%
n 6
 
11.8%
t 5
 
9.8%
c 5
 
9.8%
r 4
 
7.8%
l 3
 
5.9%
o 3
 
5.9%
a 3
 
5.9%
g 1
 
2.0%
m 1
 
2.0%
Other values (5) 5
 
9.8%
Decimal Number
ValueCountFrequency (%)
1 17096
31.2%
2 7134
13.0%
0 6095
 
11.1%
3 4881
 
8.9%
4 4413
 
8.1%
5 3419
 
6.2%
7 3382
 
6.2%
6 2900
 
5.3%
9 2730
 
5.0%
8 2685
 
4.9%
Other Punctuation
ValueCountFrequency (%)
, 10869
99.7%
. 26
 
0.2%
& 9
 
0.1%
/ 1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 9152
> 99.9%
[ 1
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 9150
> 99.9%
] 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
48416
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1245
100.0%
Math Symbol
ValueCountFrequency (%)
~ 42
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 151127 | 52.9%
Common | 133647 | 46.8%
Latin | 675 | 0.2%
Han | 4 | < 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12533
 
8.3%
10920
 
7.2%
10555
 
7.0%
9691
 
6.4%
9288
 
6.1%
9267
 
6.1%
9157
 
6.1%
5557
 
3.7%
5335
 
3.5%
4932
 
3.3%
Other values (427) 63892
42.3%
Latin
ValueCountFrequency (%)
B 187
27.7%
A 108
16.0%
I 51
 
7.6%
C 45
 
6.7%
S 29
 
4.3%
E 24
 
3.6%
F 17
 
2.5%
T 17
 
2.5%
D 15
 
2.2%
e 15
 
2.2%
Other values (31) 167
24.7%
Common
ValueCountFrequency (%)
48416
36.2%
1 17096
 
12.8%
, 10869
 
8.1%
( 9152
 
6.8%
) 9150
 
6.8%
2 7134
 
5.3%
0 6095
 
4.6%
3 4881
 
3.7%
4 4413
 
3.3%
5 3419
 
2.6%
Other values (11) 13022
 
9.7%
Han
ValueCountFrequency (%)
4
100.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 151127 | 52.9%
ASCII | 134321 | 47.1%
CJK | 4 | < 0.1%
Number Forms | 1 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
48416
36.0%
1 17096
 
12.7%
, 10869
 
8.1%
( 9152
 
6.8%
) 9150
 
6.8%
2 7134
 
5.3%
0 6095
 
4.5%
3 4881
 
3.6%
4 4413
 
3.3%
5 3419
 
2.5%
Other values (51) 13696
 
10.2%
Hangul
ValueCountFrequency (%)
12533
 
8.3%
10920
 
7.2%
10555
 
7.0%
9691
 
6.4%
9288
 
6.1%
9267
 
6.1%
9157
 
6.1%
5557
 
3.7%
5335
 
3.5%
4932
 
3.3%
Other values (427) 63892
42.3%
CJK
ValueCountFrequency (%)
4
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%

Interactions


Correlations

 | 연번 | 구분
연번 | 1.000 | 0.272
구분 | 0.272 | 1.000

 | 연번 | 구분
연번 | 1.000 | 0.169
구분 | 0.169 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows
Index | 연번 | 구분 | 상호명 | 위 치
0 | 1 | 지역아동센터 | 서부 | 경기도 부천시 조마루로372번길 38
1 | 2 | 지역아동센터 | 도깨비 | 경기도 부천시 부흥로 424 하나리아벨 206호
2 | 3 | 지역아동센터 | 심곡 | 경기도 부천시 신흥로 61
3 | 4 | 지역아동센터 | 한울 | 경기도 부천시 장말로278번길 22 3층
4 | 5 | 지역아동센터 | 1318happyzone우리 | 경기도 부천시 역곡로45번길 43 3층
5 | 6 | 지역아동센터 | 새날 | 경기도 부천시 옥산로168번길 31-5
6 | 7 | 지역아동센터 | 원미 | 경기도 부천시 원미로124번길 33-19 (원미동)
7 | 8 | 지역아동센터 | 역곡 | 경기도 부천시 역곡로20번길 49 (역곡동,은하빌딩 3층)
8 | 9 | 지역아동센터 | 다정한 | 경기도 부천시 부일로 689-2 (역곡동)
9 | 10 | 지역아동센터 | 원미산 | 경기도 부천시 부일로 640, 2층 (역곡동)

Last rows
Index | 연번 | 구분 | 상호명 | 위 치
9138 | 9148 | 일반음식점 | 제주연탄구이 | 경기도 부천시 중동로254번길 15,102호,106호(중동,동아프라자)
9139 | 9149 | 일반음식점 | 춤추는웍 | 경기도 부천시 중동로254번길 19,103,106호(중동)
9140 | 9150 | 일반음식점 | 유리즉석떡볶이 | 경기도 부천시 중동로254번길 50,105호,106호 (중동,센트럴프라움)
9141 | 9151 | 일반음식점 | 엄마손두루치기 | 경기도 부천시 중동로254번길 69,106,107호(중동)
9142 | 9152 | 일반음식점 | 미담참숯불구이 | 경기도 부천시 중동로254번길 70, 2층,3층 (중동)
9143 | 9153 | 일반음식점 | 국밥이가 | 경기도 부천시 중동로254번길 78, 103호, 104호
9144 | 9154 | 일반음식점 | 택이네조개전골 (신중동점) | 경기도 부천시 중동로254번길 78,101,102호 (중동,필타운)
9145 | 9155 | 일반음식점 | 함경면옥 | 경기도 부천시 중동로254번길 78,109,110호(중동)
9146 | 9156 | 일반음식점 | 남경중화요리 | 경기도 부천시 지봉로 52,1층 101,102호(역곡동)
9147 | 9157 | 일반음식점 | 소새마을기획단 마을관리 사회적 협동조합 | 경기도 부천시 호현로 457, 1층(소사본동)