Overview

Dataset statistics

Number of variables5
Number of observations504
Missing cells571
Missing cells (%)22.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory20.8 KiB
Average record size in memory42.3 B

Variable types

Numeric1
Categorical1
Text2
Unsupported1

Dataset

Description서울특별시강남서초교육지원청에서 관리하는 평생교육시설(시설구분,시설명칭,시설전화번호)
Author서울특별시교육청 서울특별시강남서초교육지원청
URLhttps://www.data.go.kr/data/15053608/fileData.do

Alerts

시설전화번호 has 67 (13.3%) missing valuesMissing
비고 has 504 (100.0%) missing valuesMissing
연번 has unique valuesUnique
시설명칭 has unique valuesUnique
비고 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 08:14:48.180987
Analysis finished2023-12-12 08:14:48.768424
Duration0.59 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct504
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean252.5
Minimum1
Maximum504
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.6 KiB
2023-12-12T17:14:48.860181image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile26.15
Q1126.75
median252.5
Q3378.25
95-th percentile478.85
Maximum504
Range503
Interquartile range (IQR)251.5

Descriptive statistics

Standard deviation145.63653
Coefficient of variation (CV)0.57677835
Kurtosis-1.2
Mean252.5
Median Absolute Deviation (MAD)126
Skewness0
Sum127260
Variance21210
MonotonicityStrictly increasing
2023-12-12T17:14:49.051574image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
333 1
 
0.2%
346 1
 
0.2%
345 1
 
0.2%
344 1
 
0.2%
343 1
 
0.2%
342 1
 
0.2%
341 1
 
0.2%
340 1
 
0.2%
339 1
 
0.2%
Other values (494) 494
98.0%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
504 1
0.2%
503 1
0.2%
502 1
0.2%
501 1
0.2%
500 1
0.2%
499 1
0.2%
498 1
0.2%
497 1
0.2%
496 1
0.2%
495 1
0.2%

시설구분
Categorical

Distinct6
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size4.1 KiB
원격평생교육시설
250 
지식.인력개발사업관련 평생교육시설
128 
언론기관부설 평생교육시설
87 
시민사회단체부설 평생교육시설
30 
사업장부설 평생교육시설
 
8

Length

Max length18
Median length15
Mean length11.888889
Min length8

Unique

Unique1 ?
Unique (%)0.2%

Sample

1st row사업장부설 평생교육시설
2nd row사업장부설 평생교육시설
3rd row사업장부설 평생교육시설
4th row언론기관부설 평생교육시설
5th row원격평생교육시설

Common Values

ValueCountFrequency (%)
원격평생교육시설 250
49.6%
지식.인력개발사업관련 평생교육시설 128
25.4%
언론기관부설 평생교육시설 87
 
17.3%
시민사회단체부설 평생교육시설 30
 
6.0%
사업장부설 평생교육시설 8
 
1.6%
학교부설 평생교육시설 1
 
0.2%

Length

2023-12-12T17:14:49.505425image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:14:49.614886image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
평생교육시설 254
33.5%
원격평생교육시설 250
33.0%
지식.인력개발사업관련 128
16.9%
언론기관부설 87
 
11.5%
시민사회단체부설 30
 
4.0%
사업장부설 8
 
1.1%
학교부설 1
 
0.1%

시설명칭
Text

UNIQUE 

Distinct504
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size4.1 KiB
2023-12-12T17:14:49.875264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length24
Mean length11.666667
Min length3

Characters and Unicode

Total characters5880
Distinct characters438
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique504 ?
Unique (%)100.0%

Sample

1st row현대백화점문화센터
2nd row무역센터점현대일반문화센터
3rd row삼성일반문화센터
4th row케이에이지이교육학술원
5th row배움사이버원격평생교육원
ValueCountFrequency (%)
평생교육원 55
 
8.7%
원격평생교육원 21
 
3.3%
평생교육시설 13
 
2.1%
원격평생교육시설 7
 
1.1%
한국이러닝산업 2
 
0.3%
위포트 2
 
0.3%
토커비 2
 
0.3%
아카데미 2
 
0.3%
티엠디평생교육원 1
 
0.2%
티엠디원격평생교육원 1
 
0.2%
Other values (523) 523
83.1%
2023-12-12T17:14:50.252557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
514
 
8.7%
469
 
8.0%
465
 
7.9%
423
 
7.2%
421
 
7.2%
159
 
2.7%
144
 
2.4%
131
 
2.2%
125
 
2.1%
100
 
1.7%
Other values (428) 2929
49.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5624
95.6%
Space Separator 125
 
2.1%
Uppercase Letter 57
 
1.0%
Lowercase Letter 36
 
0.6%
Open Punctuation 15
 
0.3%
Close Punctuation 15
 
0.3%
Other Punctuation 5
 
0.1%
Decimal Number 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
514
 
9.1%
469
 
8.3%
465
 
8.3%
423
 
7.5%
421
 
7.5%
159
 
2.8%
144
 
2.6%
131
 
2.3%
100
 
1.8%
91
 
1.6%
Other values (382) 2707
48.1%
Uppercase Letter
ValueCountFrequency (%)
R 6
 
10.5%
O 5
 
8.8%
A 5
 
8.8%
B 5
 
8.8%
S 4
 
7.0%
C 4
 
7.0%
M 4
 
7.0%
T 3
 
5.3%
K 3
 
5.3%
E 3
 
5.3%
Other values (10) 15
26.3%
Lowercase Letter
ValueCountFrequency (%)
w 5
13.9%
d 4
11.1%
a 3
 
8.3%
k 3
 
8.3%
o 3
 
8.3%
r 2
 
5.6%
t 2
 
5.6%
i 2
 
5.6%
n 2
 
5.6%
e 2
 
5.6%
Other values (7) 8
22.2%
Other Punctuation
ValueCountFrequency (%)
. 3
60.0%
& 1
 
20.0%
· 1
 
20.0%
Decimal Number
ValueCountFrequency (%)
2 1
33.3%
9 1
33.3%
7 1
33.3%
Space Separator
ValueCountFrequency (%)
125
100.0%
Open Punctuation
ValueCountFrequency (%)
( 15
100.0%
Close Punctuation
ValueCountFrequency (%)
) 15
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5624
95.6%
Common 163
 
2.8%
Latin 93
 
1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
514
 
9.1%
469
 
8.3%
465
 
8.3%
423
 
7.5%
421
 
7.5%
159
 
2.8%
144
 
2.6%
131
 
2.3%
100
 
1.8%
91
 
1.6%
Other values (382) 2707
48.1%
Latin
ValueCountFrequency (%)
R 6
 
6.5%
O 5
 
5.4%
A 5
 
5.4%
B 5
 
5.4%
w 5
 
5.4%
d 4
 
4.3%
S 4
 
4.3%
C 4
 
4.3%
M 4
 
4.3%
T 3
 
3.2%
Other values (27) 48
51.6%
Common
ValueCountFrequency (%)
125
76.7%
( 15
 
9.2%
) 15
 
9.2%
. 3
 
1.8%
& 1
 
0.6%
2 1
 
0.6%
· 1
 
0.6%
9 1
 
0.6%
7 1
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5624
95.6%
ASCII 255
 
4.3%
None 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
514
 
9.1%
469
 
8.3%
465
 
8.3%
423
 
7.5%
421
 
7.5%
159
 
2.8%
144
 
2.6%
131
 
2.3%
100
 
1.8%
91
 
1.6%
Other values (382) 2707
48.1%
ASCII
ValueCountFrequency (%)
125
49.0%
( 15
 
5.9%
) 15
 
5.9%
R 6
 
2.4%
O 5
 
2.0%
A 5
 
2.0%
B 5
 
2.0%
w 5
 
2.0%
d 4
 
1.6%
S 4
 
1.6%
Other values (35) 66
25.9%
None
ValueCountFrequency (%)
· 1
100.0%

시설전화번호
Text

MISSING 

Distinct425
Distinct (%)97.3%
Missing67
Missing (%)13.3%
Memory size4.1 KiB
2023-12-12T17:14:50.545158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length11
Mean length11.519451
Min length11

Characters and Unicode

Total characters5034
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique415 ?
Unique (%)95.0%

Sample

1st row02-3449-5503
2nd row02-539-4560
3rd row02-3470-0511
4th row02-573-7236
5th row02-2149-2512
ValueCountFrequency (%)
02-557-0802 4
 
0.9%
02-6000-5323 2
 
0.5%
02-3675-6772 2
 
0.5%
02-518-5468 2
 
0.5%
02-553-0177 2
 
0.5%
02-508-0702 2
 
0.5%
02-3453-0353 2
 
0.5%
02-540-8857 2
 
0.5%
070-8733-0505 2
 
0.5%
02-362-1326 2
 
0.5%
Other values (415) 415
95.0%
2023-12-12T17:14:50.996507image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 875
17.4%
0 872
17.3%
2 737
14.6%
5 536
10.6%
3 340
 
6.8%
7 338
 
6.7%
4 299
 
5.9%
1 284
 
5.6%
8 279
 
5.5%
6 262
 
5.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 4159
82.6%
Dash Punctuation 875
 
17.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 872
21.0%
2 737
17.7%
5 536
12.9%
3 340
 
8.2%
7 338
 
8.1%
4 299
 
7.2%
1 284
 
6.8%
8 279
 
6.7%
6 262
 
6.3%
9 212
 
5.1%
Dash Punctuation
ValueCountFrequency (%)
- 875
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 5034
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 875
17.4%
0 872
17.3%
2 737
14.6%
5 536
10.6%
3 340
 
6.8%
7 338
 
6.7%
4 299
 
5.9%
1 284
 
5.6%
8 279
 
5.5%
6 262
 
5.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5034
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 875
17.4%
0 872
17.3%
2 737
14.6%
5 536
10.6%
3 340
 
6.8%
7 338
 
6.7%
4 299
 
5.9%
1 284
 
5.6%
8 279
 
5.5%
6 262
 
5.2%

비고
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing504
Missing (%)100.0%
Memory size4.6 KiB

Interactions

2023-12-12T17:14:48.467198image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T17:14:51.130254image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번시설구분
연번1.0000.279
시설구분0.2791.000
2023-12-12T17:14:51.227694image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번시설구분
연번1.0000.150
시설구분0.1501.000

Missing values

2023-12-12T17:14:48.616058image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:14:48.726874image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번시설구분시설명칭시설전화번호비고
01사업장부설 평생교육시설현대백화점문화센터02-3449-5503<NA>
12사업장부설 평생교육시설무역센터점현대일반문화센터02-539-4560<NA>
23사업장부설 평생교육시설삼성일반문화센터02-3470-0511<NA>
34언론기관부설 평생교육시설케이에이지이교육학술원02-573-7236<NA>
45원격평생교육시설배움사이버원격평생교육원02-2149-2512<NA>
56원격평생교육시설케이티이노에듀(kt innoedu) 평생교육원02-2179-5260<NA>
67원격평생교육시설이스터디뱅크<NA><NA>
78원격평생교육시설윈글리쉬02-546-1590<NA>
89원격평생교육시설매니저소사이어티02-582-1487<NA>
910원격평생교육시설박문각에듀스파02-3489-9500<NA>
연번시설구분시설명칭시설전화번호비고
494495지식.인력개발사업관련 평생교육시설명지평생교육원02-592-9020<NA>
495496지식.인력개발사업관련 평생교육시설위포트토커비2별관평생교육시설<NA><NA>
496497원격평생교육시설이디엠평생교육원<NA><NA>
497498원격평생교육시설천지인중국어원격평생교육원<NA><NA>
498499원격평생교육시설랜드삼원격평생교육원<NA><NA>
499500원격평생교육시설모카골드원격평생교육원<NA><NA>
500501원격평생교육시설미라클리온평생교육시설02-3444-4669<NA>
501502시민사회단체부설 평생교육시설우면평생교육원02-922-3388<NA>
502503지식.인력개발사업관련 평생교육시설옥스비어학원평생교육원02-508-0702<NA>
503504원격평생교육시설아이패스코리아평생교육원<NA><NA>