Overview

Dataset statistics

Number of variables5
Number of observations1907
Missing cells461
Missing cells (%)4.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory76.5 KiB
Average record size in memory41.1 B

Variable types

Numeric1
Categorical1
Text3

Dataset

Description인천광역시 관내 행정구역별 평생교육기관명, 전화번호, 주소에 대한 항목값에 대한 정보로 구성되어 있는 데이터 입니다.
Author인천광역시
URLhttps://www.data.go.kr/data/15055880/fileData.do

Alerts

전화번호 has 222 (11.6%) missing valuesMissing
주소 has 239 (12.5%) missing valuesMissing
번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 07:53:01.135837
Analysis finished2023-12-12 07:53:02.173039
Duration1.04 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct1907
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean954
Minimum1
Maximum1907
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size16.9 KiB
2023-12-12T16:53:02.280972image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile96.3
Q1477.5
median954
Q31430.5
95-th percentile1811.7
Maximum1907
Range1906
Interquartile range (IQR)953

Descriptive statistics

Standard deviation550.6478
Coefficient of variation (CV)0.57719895
Kurtosis-1.2
Mean954
Median Absolute Deviation (MAD)477
Skewness0
Sum1819278
Variance303213
MonotonicityStrictly increasing
2023-12-12T16:53:02.460244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
1269 1
 
0.1%
1281 1
 
0.1%
1280 1
 
0.1%
1279 1
 
0.1%
1278 1
 
0.1%
1277 1
 
0.1%
1276 1
 
0.1%
1275 1
 
0.1%
1274 1
 
0.1%
Other values (1897) 1897
99.5%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1907 1
0.1%
1906 1
0.1%
1905 1
0.1%
1904 1
0.1%
1903 1
0.1%
1902 1
0.1%
1901 1
0.1%
1900 1
0.1%
1899 1
0.1%
1898 1
0.1%

행정구역
Categorical

Distinct10
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size15.0 KiB
남동구
408 
부평구
317 
서구
258 
연수구
215 
미추홀구
209 
Other values (5)
500 

Length

Max length4
Median length3
Mean length2.8673309
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row부평구
2nd row부평구
3rd row남동구
4th row연수구
5th row부평구

Common Values

ValueCountFrequency (%)
남동구 408
21.4%
부평구 317
16.6%
서구 258
13.5%
연수구 215
11.3%
미추홀구 209
11.0%
계양구 166
8.7%
중구 114
 
6.0%
강화군 102
 
5.3%
동구 90
 
4.7%
옹진군 28
 
1.5%

Length

2023-12-12T16:53:02.591972image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:53:02.722951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
남동구 408
21.4%
부평구 317
16.6%
서구 258
13.5%
연수구 215
11.3%
미추홀구 209
11.0%
계양구 166
8.7%
중구 114
 
6.0%
강화군 102
 
5.3%
동구 90
 
4.7%
옹진군 28
 
1.5%
Distinct1896
Distinct (%)99.4%
Missing0
Missing (%)0.0%
Memory size15.0 KiB
2023-12-12T16:53:02.995204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length23
Mean length8.6801259
Min length2

Characters and Unicode

Total characters16553
Distinct characters549
Distinct categories14 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1888 ?
Unique (%)99.0%

Sample

1st row부평의료복지요양원
2nd row함께걷기참사랑지역아동센터
3rd row남동구청
4th row연수구가족센터
5th row부평구학교밖청소년지원센터
ValueCountFrequency (%)
남촌도림어울림센터 5
 
0.3%
예일종합예술원부설예일음악예술원 2
 
0.1%
전통예술법인다락 2
 
0.1%
북센터 2
 
0.1%
구월3동 2
 
0.1%
대한노인회인천시연합회노인지도자사회교육원 2
 
0.1%
꿈나무지역아동센터 2
 
0.1%
에듀인천평생교육원 2
 
0.1%
주안2동주민자치센터 1
 
0.1%
주안3동주민자치센터 1
 
0.1%
Other values (1893) 1893
98.9%
2023-12-12T16:53:03.365289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
785
 
4.7%
654
 
4.0%
651
 
3.9%
522
 
3.2%
520
 
3.1%
498
 
3.0%
488
 
2.9%
413
 
2.5%
378
 
2.3%
361
 
2.2%
Other values (539) 11283
68.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 16102
97.3%
Decimal Number 161
 
1.0%
Uppercase Letter 84
 
0.5%
Close Punctuation 76
 
0.5%
Open Punctuation 73
 
0.4%
Lowercase Letter 23
 
0.1%
Other Punctuation 15
 
0.1%
Space Separator 7
 
< 0.1%
Dash Punctuation 5
 
< 0.1%
Other Symbol 2
 
< 0.1%
Other values (4) 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
785
 
4.9%
654
 
4.1%
651
 
4.0%
522
 
3.2%
520
 
3.2%
498
 
3.1%
488
 
3.0%
413
 
2.6%
378
 
2.3%
361
 
2.2%
Other values (482) 10832
67.3%
Uppercase Letter
ValueCountFrequency (%)
C 13
15.5%
M 10
11.9%
A 8
9.5%
E 6
 
7.1%
Y 6
 
7.1%
B 5
 
6.0%
S 5
 
6.0%
J 5
 
6.0%
I 4
 
4.8%
W 3
 
3.6%
Other values (11) 19
22.6%
Lowercase Letter
ValueCountFrequency (%)
e 5
21.7%
p 3
13.0%
n 2
 
8.7%
o 2
 
8.7%
a 2
 
8.7%
b 1
 
4.3%
t 1
 
4.3%
c 1
 
4.3%
y 1
 
4.3%
s 1
 
4.3%
Other values (4) 4
17.4%
Decimal Number
ValueCountFrequency (%)
1 47
29.2%
2 45
28.0%
3 33
20.5%
4 18
 
11.2%
5 10
 
6.2%
6 5
 
3.1%
8 2
 
1.2%
7 1
 
0.6%
Other Punctuation
ValueCountFrequency (%)
· 6
40.0%
, 3
20.0%
& 3
20.0%
. 2
 
13.3%
; 1
 
6.7%
Close Punctuation
ValueCountFrequency (%)
) 76
100.0%
Open Punctuation
ValueCountFrequency (%)
( 73
100.0%
Space Separator
ValueCountFrequency (%)
7
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%
Letter Number
ValueCountFrequency (%)
2
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 16104
97.3%
Common 340
 
2.1%
Latin 109
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
785
 
4.9%
654
 
4.1%
651
 
4.0%
522
 
3.2%
520
 
3.2%
498
 
3.1%
488
 
3.0%
413
 
2.6%
378
 
2.3%
361
 
2.2%
Other values (483) 10834
67.3%
Latin
ValueCountFrequency (%)
C 13
 
11.9%
M 10
 
9.2%
A 8
 
7.3%
E 6
 
5.5%
Y 6
 
5.5%
e 5
 
4.6%
B 5
 
4.6%
S 5
 
4.6%
J 5
 
4.6%
I 4
 
3.7%
Other values (26) 42
38.5%
Common
ValueCountFrequency (%)
) 76
22.4%
( 73
21.5%
1 47
13.8%
2 45
13.2%
3 33
9.7%
4 18
 
5.3%
5 10
 
2.9%
7
 
2.1%
· 6
 
1.8%
- 5
 
1.5%
Other values (10) 20
 
5.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 16102
97.3%
ASCII 439
 
2.7%
None 8
 
< 0.1%
Number Forms 2
 
< 0.1%
Punctuation 2
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
785
 
4.9%
654
 
4.1%
651
 
4.0%
522
 
3.2%
520
 
3.2%
498
 
3.1%
488
 
3.0%
413
 
2.6%
378
 
2.3%
361
 
2.2%
Other values (482) 10832
67.3%
ASCII
ValueCountFrequency (%)
) 76
17.3%
( 73
16.6%
1 47
10.7%
2 45
10.3%
3 33
 
7.5%
4 18
 
4.1%
C 13
 
3.0%
M 10
 
2.3%
5 10
 
2.3%
A 8
 
1.8%
Other values (42) 106
24.1%
None
ValueCountFrequency (%)
· 6
75.0%
2
 
25.0%
Number Forms
ValueCountFrequency (%)
2
100.0%
Punctuation
ValueCountFrequency (%)
1
50.0%
1
50.0%

전화번호
Text

MISSING 

Distinct1575
Distinct (%)93.5%
Missing222
Missing (%)11.6%
Memory size15.0 KiB
2023-12-12T16:53:03.623895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length12.023739
Min length7

Characters and Unicode

Total characters20260
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1485 ?
Unique (%)88.1%

Sample

1st row032-522-7520
2nd row032-851-2730
3rd row032-509-8918
4th row032-425-0123
5th row032-887-5270~2
ValueCountFrequency (%)
032-432-2226 9
 
0.5%
032-509-6169 5
 
0.3%
032-509-6444 5
 
0.3%
032-433-0733 3
 
0.2%
1600-0691 3
 
0.2%
070-8680-6171 3
 
0.2%
032-851-2730 3
 
0.2%
032-765-3677 3
 
0.2%
032-509-6440 3
 
0.2%
032-580-8399 3
 
0.2%
Other values (1565) 1645
97.6%
2023-12-12T16:53:04.000459image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 3364
16.6%
0 3171
15.7%
2 2860
14.1%
3 2705
13.4%
5 1427
7.0%
4 1334
 
6.6%
7 1198
 
5.9%
6 1144
 
5.6%
8 1118
 
5.5%
1 1108
 
5.5%
Other values (2) 831
 
4.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 16895
83.4%
Dash Punctuation 3364
 
16.6%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 3171
18.8%
2 2860
16.9%
3 2705
16.0%
5 1427
8.4%
4 1334
7.9%
7 1198
 
7.1%
6 1144
 
6.8%
8 1118
 
6.6%
1 1108
 
6.6%
9 830
 
4.9%
Dash Punctuation
ValueCountFrequency (%)
- 3364
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 20260
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 3364
16.6%
0 3171
15.7%
2 2860
14.1%
3 2705
13.4%
5 1427
7.0%
4 1334
 
6.6%
7 1198
 
5.9%
6 1144
 
5.6%
8 1118
 
5.5%
1 1108
 
5.5%
Other values (2) 831
 
4.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 20260
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 3364
16.6%
0 3171
15.7%
2 2860
14.1%
3 2705
13.4%
5 1427
7.0%
4 1334
 
6.6%
7 1198
 
5.9%
6 1144
 
5.6%
8 1118
 
5.5%
1 1108
 
5.5%
Other values (2) 831
 
4.1%

주소
Text

MISSING 

Distinct1644
Distinct (%)98.6%
Missing239
Missing (%)12.5%
Memory size15.0 KiB
2023-12-12T16:53:04.381119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length62
Median length48
Mean length30.601918
Min length14

Characters and Unicode

Total characters51044
Distinct characters462
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1620 ?
Unique (%)97.1%

Sample

1st row(21388) 인천 부평구 부평대로51번길 7 (부평동) 9 10층
2nd row(21360) 인천 부평구 부흥로315번길 18-5 (부평동 현대빌리지)
3rd row(21589) 인천 남동구 소래로 645 (만수동)
4th row(21927) 인천 연수구 청능대로 109 (연수동)
5th row(21387) 인천 부평구 부평문화로37번길 1 (부평동)
ValueCountFrequency (%)
인천광역시 338
 
6.9%
남동구 93
 
1.9%
인천 78
 
1.6%
부평구 73
 
1.5%
서구 65
 
1.3%
미추홀구 48
 
1.0%
연수구 48
 
1.0%
계양구 36
 
0.7%
2층 34
 
0.7%
3층 25
 
0.5%
Other values (2930) 4033
82.8%
2023-12-12T16:53:04.941628image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6377
 
12.5%
2 2852
 
5.6%
1 2387
 
4.7%
1971
 
3.9%
1862
 
3.6%
0 1673
 
3.3%
1662
 
3.3%
1633
 
3.2%
1619
 
3.2%
1606
 
3.1%
Other values (452) 27402
53.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 26527
52.0%
Decimal Number 14375
28.2%
Space Separator 6377
 
12.5%
Open Punctuation 1595
 
3.1%
Close Punctuation 1595
 
3.1%
Dash Punctuation 429
 
0.8%
Other Punctuation 77
 
0.2%
Uppercase Letter 53
 
0.1%
Lowercase Letter 8
 
< 0.1%
Math Symbol 7
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1971
 
7.4%
1862
 
7.0%
1662
 
6.3%
1633
 
6.2%
1619
 
6.1%
1606
 
6.1%
1319
 
5.0%
1154
 
4.4%
563
 
2.1%
554
 
2.1%
Other values (406) 12584
47.4%
Uppercase Letter
ValueCountFrequency (%)
A 11
20.8%
B 6
11.3%
C 6
11.3%
Y 5
9.4%
M 5
9.4%
I 3
 
5.7%
T 3
 
5.7%
W 2
 
3.8%
S 2
 
3.8%
F 2
 
3.8%
Other values (7) 8
15.1%
Decimal Number
ValueCountFrequency (%)
2 2852
19.8%
1 2387
16.6%
0 1673
11.6%
4 1563
10.9%
3 1516
10.5%
5 1151
8.0%
6 867
 
6.0%
8 864
 
6.0%
7 812
 
5.6%
9 690
 
4.8%
Lowercase Letter
ValueCountFrequency (%)
l 2
25.0%
p 1
12.5%
m 1
12.5%
a 1
12.5%
t 1
12.5%
b 1
12.5%
e 1
12.5%
Other Punctuation
ValueCountFrequency (%)
, 63
81.8%
. 8
 
10.4%
/ 2
 
2.6%
@ 2
 
2.6%
; 1
 
1.3%
& 1
 
1.3%
Space Separator
ValueCountFrequency (%)
6377
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1595
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1595
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 429
100.0%
Math Symbol
ValueCountFrequency (%)
~ 7
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 26527
52.0%
Common 24456
47.9%
Latin 61
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1971
 
7.4%
1862
 
7.0%
1662
 
6.3%
1633
 
6.2%
1619
 
6.1%
1606
 
6.1%
1319
 
5.0%
1154
 
4.4%
563
 
2.1%
554
 
2.1%
Other values (406) 12584
47.4%
Latin
ValueCountFrequency (%)
A 11
18.0%
B 6
 
9.8%
C 6
 
9.8%
Y 5
 
8.2%
M 5
 
8.2%
I 3
 
4.9%
T 3
 
4.9%
W 2
 
3.3%
l 2
 
3.3%
S 2
 
3.3%
Other values (14) 16
26.2%
Common
ValueCountFrequency (%)
6377
26.1%
2 2852
11.7%
1 2387
 
9.8%
0 1673
 
6.8%
( 1595
 
6.5%
) 1595
 
6.5%
4 1563
 
6.4%
3 1516
 
6.2%
5 1151
 
4.7%
6 867
 
3.5%
Other values (12) 2880
11.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 26527
52.0%
ASCII 24517
48.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6377
26.0%
2 2852
11.6%
1 2387
 
9.7%
0 1673
 
6.8%
( 1595
 
6.5%
) 1595
 
6.5%
4 1563
 
6.4%
3 1516
 
6.2%
5 1151
 
4.7%
6 867
 
3.5%
Other values (36) 2941
12.0%
Hangul
ValueCountFrequency (%)
1971
 
7.4%
1862
 
7.0%
1662
 
6.3%
1633
 
6.2%
1619
 
6.1%
1606
 
6.1%
1319
 
5.0%
1154
 
4.4%
563
 
2.1%
554
 
2.1%
Other values (406) 12584
47.4%

Interactions

2023-12-12T16:53:01.724080image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T16:53:05.038287image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호행정구역
번호1.0000.703
행정구역0.7031.000
2023-12-12T16:53:05.129502image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호행정구역
번호1.0000.285
행정구역0.2851.000

Missing values

2023-12-12T16:53:01.889042image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T16:53:02.008374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T16:53:02.112186image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

번호행정구역기관이름전화번호주소
01부평구부평의료복지요양원<NA>(21388) 인천 부평구 부평대로51번길 7 (부평동) 9 10층
12부평구함께걷기참사랑지역아동센터032-522-7520(21360) 인천 부평구 부흥로315번길 18-5 (부평동 현대빌리지)
23남동구남동구청<NA>(21589) 인천 남동구 소래로 645 (만수동)
34연수구연수구가족센터032-851-2730(21927) 인천 연수구 청능대로 109 (연수동)
45부평구부평구학교밖청소년지원센터032-509-8918(21387) 인천 부평구 부평문화로37번길 1 (부평동)
56계양구예원플라워아카데미<NA>(21056) 인천 계양구 경명대로 1138 (계산동) 502호
67부평구동암마을공동체네트워크 동고동락032-425-0123(21437) 인천 부평구 아트센터로44번길 15-12 (십정동)
78미추홀구미추홀구청소년수련관032-887-5270~2(22169) 인천 미추홀구 경인로42번길 23 (숭의동)
89부평구쥬다르<NA><NA>
910계양구한마음다문화통합센터<NA>(21116) 인천 계양구 봉오대로698번길 15-14 (작전동 삼우빌라)
번호행정구역기관이름전화번호주소
18971898중구인천중구문화원<NA>(22340) 인천광역시중구축항대로296번길81중구문화회관내1층
18981899중구인천중구다문화가족지원센터032-891-1094(22321) 인천광역시 중구 답동10-4 답동신협 신협빌딩 4층
18991900중구꿈벗도서관032-764-6111(22315) 인천광역시중구홍예문로32
19001901중구인천중구국민체육센터032-763-8145(22340) 인천광역시중구축항대로296번길813층중구시설관리공단
19011902중구남부교육지원청032-770-0116(22315) 인천광역시중구차이나타운로51번길45
19021903중구한국문화센터동인천교육원032-766-3546(400180) 인천광역시중구큰우물로28-6세븐프라자5층
19031904미추홀구인천승학초등학교032-432-9775(22241) 인천광역시미추홀구관교동13-4승학초등학교
19041905중구인천중구여성회관032-772-7345(22340) 인천광역시중구축항대로296번길81중구문화회관1층중구여성회관
19051906중구대한노인회중구지회부설노인대학032-772-2579인천광역시 중구 제물량로80번길 3-24 (신흥동2가)
19061907부평구부평역사박물관032-515-6471(21327) 인천 부평구 굴포로 151