Overview

Dataset statistics

Number of variables: 7
Number of observations: 3662
Missing cells: 607
Missing cells (%): 2.4%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 204.0 KiB
Average record size in memory: 57.0 B
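
The percentages in this block follow from the raw counts. The sketch below recomputes them from the figures printed above. It assumes that "missing cells (%)" is taken over all cells (rows times variables) and that the average record size is the total size divided by the row count. Both are readings of the report, not confirmed formulas.

```python
# Figures copied from the "Dataset statistics" block.
n_vars = 7
n_obs = 3662
missing_cells = 607
total_size_kib = 204.0  # rounded in the report, so the result is approximate

total_cells = n_vars * n_obs                      # every row/column position
missing_pct = 100 * missing_cells / total_cells   # share of empty cells
avg_record_bytes = total_size_kib * 1024 / n_obs  # KiB converted to bytes per row

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

Under these assumptions the recomputed values agree with the reported 2.4% and 57.0 B.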

Variable types

Numeric: 1
Text: 4
Categorical: 2

Dataset

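Description: Status of private academies (hagwon) managed by the Seoul Gangnam-Seocho District Office of Education (academy name, academy type, courses taught, academy address, phone number)
Author: Seoul Metropolitan Office of Education, Gangnam-Seocho District Office of Education (서울특별시교육청 서울특별시강남서초교육지원청)
URL: https://www.data.go.kr/data/15053609/fileData.do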

Alerts

학원종류 is highly overall correlated with 분야구분 (High correlation)
분야구분 is highly overall correlated with 학원종류 (High correlation)
전화번호 has 607 (16.6%) missing values (Missing)
연번 has unique values (Unique)
학원명 has unique values (Unique)
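
The missing-value and uniqueness alerts can be traced back to the per-column counts in the variable sections. The sketch below uses two deliberately simple rules: flag any column with missing values, and flag a column whose distinct count equals the row count. The function `simple_alerts` and its rules are illustrative assumptions, not the thresholds ydata-profiling actually applies.

```python
n_obs = 3662

# (distinct, missing) per column, copied from the variable sections of this report.
columns = {
    "연번": (3662, 0),
    "학원명": (3662, 0),
    "학원종류": (2, 0),
    "교습과정": (83, 0),
    "분야구분": (9, 0),
    "학원주소": (3631, 0),
    "전화번호": (2872, 607),
}


def simple_alerts(distinct, missing, n):
    """Return alert labels under two simplified, assumed rules."""
    alerts = []
    if missing > 0:
        alerts.append(f"Missing ({100 * missing / n:.1f}%)")
    if missing == 0 and distinct == n:
        alerts.append("Unique")
    return alerts


report = {name: simple_alerts(d, m, n_obs) for name, (d, m) in columns.items()}
for name, alerts in report.items():
    print(name, alerts)
```

With these rules, 전화번호 is the only column flagged as missing, which also shows that all 607 missing cells in the dataset belong to that column.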

Reproduction

Analysis started: 2023-12-12 22:31:23.146615
Analysis finished: 2023-12-12 22:31:24.759736
Duration: 1.61 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
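
As a rough guide to repeating this run, the sketch below first checks the reported duration against the two timestamps, then outlines how a comparable report could be generated. The helper `build_report`, the report title, and the `cp949` encoding are assumptions for illustration. The original configuration (config.json) is not reproduced here.

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" block.
started = datetime.fromisoformat("2023-12-12 22:31:23.146615")
finished = datetime.fromisoformat("2023-12-12 22:31:24.759736")
duration = (finished - started).total_seconds()
print(f"Duration: {duration:.2f} seconds")


def build_report(csv_path, html_path="report.html", encoding="cp949"):
    """Profile a CSV file and write an HTML report (hypothetical helper).

    Third-party imports stay inside the function so the timing check
    above runs without pandas or ydata-profiling installed.
    """
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path, encoding=encoding)  # encoding is a guess
    report = ProfileReport(df, title="Academy status profile")
    report.to_file(html_path)
    return html_path
```

Calling `build_report` with the CSV downloaded from the URL above should produce a report with the same sections, although defaults may differ between library versions.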

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct: 3662
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1835.3951
Minimum: 1
Maximum: 3666
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 32.3 KiB

Quantile statistics

Minimum: 1
5th percentile: 188.05
Q1: 920.25
Median: 1835.5
Q3: 2750.75
95th percentile: 3482.95
Maximum: 3666
Range: 3665
Interquartile range (IQR): 1830.5

Descriptive statistics

Standard deviation: 1057.4495
Coefficient of variation (CV): 0.57614272
Kurtosis: -1.199299
Mean: 1835.3951
Median Absolute Deviation (MAD): 915.5
Skewness: -0.00054910252
Sum: 6721217
Variance: 1118199.5
Monotonicity: Strictly increasing
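
Several of these statistics are derived from one another, so they can be cross-checked. The sketch below recomputes the range, IQR, mean, CV and variance from the reported figures. It also uses the sum to reason about gaps in the serial numbers, assuming 연번 holds distinct integers between the reported minimum and maximum.

```python
# Figures copied from the 연번 statistics.
n = 3662
minimum, maximum = 1, 3666
q1, q3 = 920.25, 2750.75
mean, std, total = 1835.3951, 1057.4495, 6721217

derived = {
    "range": maximum - minimum,
    "iqr": q3 - q1,
    "mean": total / n,
    "cv": std / mean,      # coefficient of variation
    "variance": std ** 2,  # approximate, because std is rounded
}

# If the IDs were the full run 1..3666 their sum would be the triangular number.
# The shortfall tells us how many IDs are skipped and what they add up to.
full_sum = maximum * (maximum + 1) // 2
skipped_count = (maximum - minimum + 1) - n
skipped_sum = full_sum - total

for key, value in derived.items():
    print(key, round(value, 6))
print("skipped IDs:", skipped_count, "summing to", skipped_sum)
```

Under the integer assumption, four serial numbers are absent from the sequence and together they sum to 394, so the gaps lie among the low-numbered records.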
Histogram with fixed size bins (bins=50)
Common values

Value | Count | Frequency (%)
1 | 1 | < 0.1%
2438 | 1 | < 0.1%
2440 | 1 | < 0.1%
2441 | 1 | < 0.1%
2442 | 1 | < 0.1%
2443 | 1 | < 0.1%
2444 | 1 | < 0.1%
2445 | 1 | < 0.1%
2446 | 1 | < 0.1%
2447 | 1 | < 0.1%
Other values (3652) | 3652 | 99.7%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | < 0.1%
2 | 1 | < 0.1%
3 | 1 | < 0.1%
4 | 1 | < 0.1%
5 | 1 | < 0.1%
6 | 1 | < 0.1%
7 | 1 | < 0.1%
8 | 1 | < 0.1%
9 | 1 | < 0.1%
10 | 1 | < 0.1%

Maximum 10 values

Value | Count | Frequency (%)
3666 | 1 | < 0.1%
3665 | 1 | < 0.1%
3664 | 1 | < 0.1%
3663 | 1 | < 0.1%
3662 | 1 | < 0.1%
3661 | 1 | < 0.1%
3660 | 1 | < 0.1%
3659 | 1 | < 0.1%
3658 | 1 | < 0.1%
3657 | 1 | < 0.1%

학원명
Text

UNIQUE 

Distinct: 3662
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 28.7 KiB

Length

Max length: 53
Median length: 36
Mean length: 9.2048061
Min length: 3

Characters and Unicode

Total characters: 33708
Distinct characters: 781
Distinct categories: 10
Distinct scripts: 4
Distinct blocks: 5
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
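
To make the category counts concrete, the sketch below classifies the characters of one academy name taken from the sample rows of this report, using Python's standard `unicodedata` module. The mapping from two-letter general-category codes to the long names shown in the report is written out by hand and covers only the categories that appear here. Script and block lookups are not part of `unicodedata`, so they are left out.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that occur in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
    "Sm": "Math Symbol",
}


def category_counts(text):
    """Count characters per Unicode general category (NFC-normalized)."""
    text = unicodedata.normalize("NFC", text)  # one code point per Hangul syllable
    codes = (unicodedata.category(ch) for ch in text)
    return Counter(CATEGORY_NAMES.get(code, code) for code in codes)


name = "제이엠케이에듀케이션(JMK EDUCATION)학원"
counts = category_counts(name)
for category, count in counts.most_common():
    print(category, count)
```

Hangul syllables fall under "Other Letter", which is why that category dominates the totals for this column.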

Unique

Unique: 3662
Unique (%): 100.0%

Sample

1st row: 한국사의달인원격교습학원
2nd row: 평정수학학원
3rd row: 프라임스템주니어학원
4th row: 서초무용학원
5th row: 백석대학교평생교육신학원

Value | Count | Frequency (%)
academy)학원 | 6 | 0.2%
prep)학원 | 3 | 0.1%
english)학원 | 3 | 0.1%
academy | 2 | 0.1%
art)학원 | 2 | 0.1%
math)학원 | 2 | 0.1%
이맥스영어(emax | 2 | 0.1%
에이셰프요리학원 | 1 | < 0.1%
에이셰프컬리너리학원 | 1 | < 0.1%
듀크어학원 | 1 | < 0.1%
Other values (3723) | 3723 | 99.4%

Most occurring characters

ValueCountFrequency (%)
3973
 
11.8%
3684
 
10.9%
1000
 
3.0%
875
 
2.6%
671
 
2.0%
654
 
1.9%
613
 
1.8%
453
 
1.3%
453
 
1.3%
418
 
1.2%
Other values (771) 20914
62.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 30760
91.3%
Uppercase Letter 982
 
2.9%
Lowercase Letter 839
 
2.5%
Open Punctuation 366
 
1.1%
Close Punctuation 366
 
1.1%
Decimal Number 244
 
0.7%
Space Separator 86
 
0.3%
Other Punctuation 49
 
0.1%
Dash Punctuation 12
 
< 0.1%
Math Symbol 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3973
 
12.9%
3684
 
12.0%
1000
 
3.3%
875
 
2.8%
671
 
2.2%
654
 
2.1%
613
 
2.0%
453
 
1.5%
453
 
1.5%
418
 
1.4%
Other values (695) 17966
58.4%
Uppercase Letter
ValueCountFrequency (%)
S 99
 
10.1%
E 86
 
8.8%
M 79
 
8.0%
A 78
 
7.9%
T 75
 
7.6%
C 63
 
6.4%
I 53
 
5.4%
P 48
 
4.9%
G 44
 
4.5%
L 39
 
4.0%
Other values (16) 318
32.4%
Lowercase Letter
ValueCountFrequency (%)
e 97
11.6%
a 73
 
8.7%
n 66
 
7.9%
i 65
 
7.7%
s 63
 
7.5%
t 60
 
7.2%
r 57
 
6.8%
o 43
 
5.1%
u 39
 
4.6%
d 37
 
4.4%
Other values (15) 239
28.5%
Decimal Number
ValueCountFrequency (%)
2 84
34.4%
1 70
28.7%
3 25
 
10.2%
0 17
 
7.0%
5 10
 
4.1%
4 10
 
4.1%
6 9
 
3.7%
7 9
 
3.7%
8 5
 
2.0%
9 5
 
2.0%
Other Punctuation
ValueCountFrequency (%)
. 21
42.9%
& 13
26.5%
' 7
 
14.3%
% 2
 
4.1%
, 2
 
4.1%
/ 1
 
2.0%
: 1
 
2.0%
! 1
 
2.0%
· 1
 
2.0%
Math Symbol
ValueCountFrequency (%)
+ 3
75.0%
1
 
25.0%
Open Punctuation
ValueCountFrequency (%)
( 366
100.0%
Close Punctuation
ValueCountFrequency (%)
) 366
100.0%
Space Separator
ValueCountFrequency (%)
86
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 30745
91.2%
Latin 1821
 
5.4%
Common 1127
 
3.3%
Han 15
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3973
 
12.9%
3684
 
12.0%
1000
 
3.3%
875
 
2.8%
671
 
2.2%
654
 
2.1%
613
 
2.0%
453
 
1.5%
453
 
1.5%
418
 
1.4%
Other values (681) 17951
58.4%
Latin
ValueCountFrequency (%)
S 99
 
5.4%
e 97
 
5.3%
E 86
 
4.7%
M 79
 
4.3%
A 78
 
4.3%
T 75
 
4.1%
a 73
 
4.0%
n 66
 
3.6%
i 65
 
3.6%
s 63
 
3.5%
Other values (41) 1040
57.1%
Common
ValueCountFrequency (%)
( 366
32.5%
) 366
32.5%
86
 
7.6%
2 84
 
7.5%
1 70
 
6.2%
3 25
 
2.2%
. 21
 
1.9%
0 17
 
1.5%
& 13
 
1.2%
- 12
 
1.1%
Other values (15) 67
 
5.9%
Han
ValueCountFrequency (%)
2
13.3%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
Other values (4) 4
26.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 30745
91.2%
ASCII 2946
 
8.7%
CJK 15
 
< 0.1%
None 1
 
< 0.1%
Math Operators 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3973
 
12.9%
3684
 
12.0%
1000
 
3.3%
875
 
2.8%
671
 
2.2%
654
 
2.1%
613
 
2.0%
453
 
1.5%
453
 
1.5%
418
 
1.4%
Other values (681) 17951
58.4%
ASCII
ValueCountFrequency (%)
( 366
 
12.4%
) 366
 
12.4%
S 99
 
3.4%
e 97
 
3.3%
E 86
 
2.9%
86
 
2.9%
2 84
 
2.9%
M 79
 
2.7%
A 78
 
2.6%
T 75
 
2.5%
Other values (64) 1530
51.9%
CJK
ValueCountFrequency (%)
2
13.3%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
Other values (4) 4
26.7%
None
ValueCountFrequency (%)
· 1
100.0%
Math Operators
ValueCountFrequency (%)
1
100.0%

학원종류
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 28.7 KiB
학교교과교습학원: 2858
평생직업교육학원: 804

Length

Max length: 8
Median length: 8
Mean length: 8
Min length: 8

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 학교교과교습학원
2nd row: 학교교과교습학원
3rd row: 학교교과교습학원
4th row: 학교교과교습학원
5th row: 평생직업교육학원

Common Values

Value | Count | Frequency (%)
학교교과교습학원 | 2858 | 78.0%
평생직업교육학원 | 804 | 22.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
학교교과교습학원 | 2858 | 78.0%
평생직업교육학원 | 804 | 22.0%

교습과정
Text

Distinct: 83
Distinct (%): 2.3%
Missing: 0
Missing (%): 0.0%
Memory size: 28.7 KiB

Length

Max length: 24
Median length: 16
Mean length: 5.2864555
Min length: 2

Characters and Unicode

Total characters: 19359
Distinct characters: 161
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 17
Unique (%): 0.5%

Sample

1st row: 보습
2nd row: 보습
3rd row: 종합
4th row: 무용
5th row: 종합

Value | Count | Frequency (%)
보습 | 959 | 26.2%
보습·논술 | 696 | 19.0%
실용외국어(유아/초·중·고 | 288 | 7.9%
미술 | 249 | 6.8%
독서실(유아/초·중·고 | 180 | 4.9%
종합 | 145 | 4.0%
음악 | 131 | 3.6%
어학(성인 | 93 | 2.5%
연극 | 88 | 2.4%
이·미용 | 83 | 2.3%
Other values (73) | 750 | 20.5%

Most occurring characters

ValueCountFrequency (%)
· 1716
 
8.9%
1697
 
8.8%
1655
 
8.5%
977
 
5.0%
) 897
 
4.6%
( 897
 
4.6%
697
 
3.6%
541
 
2.8%
519
 
2.7%
490
 
2.5%
Other values (151) 9273
47.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 15065
77.8%
Other Punctuation 2500
 
12.9%
Close Punctuation 897
 
4.6%
Open Punctuation 897
 
4.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1697
 
11.3%
1655
 
11.0%
977
 
6.5%
697
 
4.6%
541
 
3.6%
519
 
3.4%
490
 
3.3%
469
 
3.1%
469
 
3.1%
468
 
3.1%
Other values (146) 7083
47.0%
Other Punctuation
ValueCountFrequency (%)
· 1716
68.6%
/ 468
 
18.7%
, 316
 
12.6%
Close Punctuation
ValueCountFrequency (%)
) 897
100.0%
Open Punctuation
ValueCountFrequency (%)
( 897
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 15065
77.8%
Common 4294
 
22.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1697
 
11.3%
1655
 
11.0%
977
 
6.5%
697
 
4.6%
541
 
3.6%
519
 
3.4%
490
 
3.3%
469
 
3.1%
469
 
3.1%
468
 
3.1%
Other values (146) 7083
47.0%
Common
ValueCountFrequency (%)
· 1716
40.0%
) 897
20.9%
( 897
20.9%
/ 468
 
10.9%
, 316
 
7.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 15065
77.8%
ASCII 2578
 
13.3%
None 1716
 
8.9%

Most frequent character per block

None
ValueCountFrequency (%)
· 1716
100.0%
Hangul
ValueCountFrequency (%)
1697
 
11.3%
1655
 
11.0%
977
 
6.5%
697
 
4.6%
541
 
3.6%
519
 
3.4%
490
 
3.3%
469
 
3.1%
469
 
3.1%
468
 
3.1%
Other values (146) 7083
47.0%
ASCII
ValueCountFrequency (%)
) 897
34.8%
( 897
34.8%
/ 468
18.2%
, 316
 
12.3%

분야구분
Categorical

HIGH CORRELATION 

Distinct: 9
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 28.7 KiB
입시.검정 및 보습: 1691
국제화: 406
예능(대): 389
직업기술: 335
기예(대): 215
Other values (4): 626

Length

Max length: 10
Median length: 7
Mean length: 6.9273621
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 입시.검정 및 보습
2nd row: 입시.검정 및 보습
3rd row: 종합(대)
4th row: 예능(대)
5th row: 종합(대)

Common Values

Value | Count | Frequency (%)
입시.검정 및 보습 | 1691 | 46.2%
국제화 | 406 | 11.1%
예능(대) | 389 | 10.6%
직업기술 | 335 | 9.1%
기예(대) | 215 | 5.9%
기타(대) | 206 | 5.6%
독서실 | 199 | 5.4%
종합(대) | 147 | 4.0%
인문사회(대) | 74 | 2.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
입시.검정 | 1691 | 24.0%
및 | 1691 | 24.0%
보습 | 1691 | 24.0%
국제화 | 406 | 5.8%
예능(대 | 389 | 5.5%
직업기술 | 335 | 4.8%
기예(대 | 215 | 3.1%
기타(대 | 206 | 2.9%
독서실 | 199 | 2.8%
종합(대 | 147 | 2.1%

학원주소
Text

Distinct: 3631
Distinct (%): 99.2%
Missing: 0
Missing (%): 0.0%
Memory size: 28.7 KiB

Length

Max length: 88
Median length: 67
Mean length: 34.516931
Min length: 21

Characters and Unicode

Total characters: 126401
Distinct characters: 420
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 3600
Unique (%): 98.3%

Sample

1st row: 서울특별시 강남구 강남대로 340 , 8층일부 (역삼동,경원빌딩)
2nd row: 서울특별시 서초구 명달로 52-9 , 3층 (서초동)
3rd row: 서울특별시 강남구 역삼로 448 , 황우거빌딩 1층 (대치동)
4th row: 서울특별시 서초구 사임당로 151 대한무지개종합상가 3층 (서초동)
5th row: 서울특별시 서초구 방배로 69 , 1층,3층~5층 (방배동, 진리동백석대학교)

Value | Count | Frequency (%)
서울특별시 | 3661 | 13.9%
(blank) | 2851 | 10.8%
강남구 | 2433 | 9.2%
서초구 | 1229 | 4.7%
대치동 | 799 | 3.0%
2층 | 626 | 2.4%
3층 | 564 | 2.1%
4층 | 395 | 1.5%
역삼동 | 327 | 1.2%
신사동 | 301 | 1.1%
Other values (3135) | 13123 | 49.9%
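
The token counts above show 2433 occurrences of 강남구 and 1229 of 서초구, which together equal the 3662 rows, so each address appears to name exactly one of the two districts. A minimal sketch of pulling the district out of an address follows. It assumes the district is the second whitespace-separated token and ends in 구, which holds for the five sample rows but is not verified for every record. The helper name `district_of` is hypothetical.

```python
from collections import Counter


def district_of(address):
    """Return the district token (for example '강남구'), or None if absent."""
    tokens = address.split()
    if len(tokens) >= 2 and tokens[1].endswith("구"):
        return tokens[1]
    return None


# The five sample addresses shown in this section.
samples = [
    "서울특별시 강남구 강남대로 340 , 8층일부 (역삼동,경원빌딩)",
    "서울특별시 서초구 명달로 52-9 , 3층 (서초동)",
    "서울특별시 강남구 역삼로 448 , 황우거빌딩 1층 (대치동)",
    "서울특별시 서초구 사임당로 151 대한무지개종합상가 3층 (서초동)",
    "서울특별시 서초구 방배로 69 , 1층,3층~5층 (방배동, 진리동백석대학교)",
]

counts = Counter(district_of(address) for address in samples)
print(counts)
```

A derived district column of this kind would be a two-valued categorical, similar in shape to 학원종류.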

Most occurring characters

ValueCountFrequency (%)
23938
 
18.9%
5672
 
4.5%
, 4682
 
3.7%
4094
 
3.2%
3878
 
3.1%
3701
 
2.9%
( 3691
 
2.9%
) 3681
 
2.9%
3675
 
2.9%
3670
 
2.9%
Other values (410) 65719
52.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 68278
54.0%
Space Separator 23938
 
18.9%
Decimal Number 21210
 
16.8%
Other Punctuation 4751
 
3.8%
Open Punctuation 3691
 
2.9%
Close Punctuation 3681
 
2.9%
Dash Punctuation 370
 
0.3%
Uppercase Letter 315
 
0.2%
Math Symbol 140
 
0.1%
Lowercase Letter 26
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5672
 
8.3%
4094
 
6.0%
3878
 
5.7%
3701
 
5.4%
3675
 
5.4%
3670
 
5.4%
3662
 
5.4%
3662
 
5.4%
3254
 
4.8%
2916
 
4.3%
Other values (348) 30094
44.1%
Uppercase Letter
ValueCountFrequency (%)
B 30
 
9.5%
H 25
 
7.9%
E 25
 
7.9%
O 23
 
7.3%
S 22
 
7.0%
A 20
 
6.3%
K 19
 
6.0%
M 19
 
6.0%
L 17
 
5.4%
R 15
 
4.8%
Other values (16) 100
31.7%
Lowercase Letter
ValueCountFrequency (%)
t 3
11.5%
r 3
11.5%
e 3
11.5%
a 3
11.5%
x 2
 
7.7%
l 2
 
7.7%
w 2
 
7.7%
h 1
 
3.8%
c 1
 
3.8%
m 1
 
3.8%
Other values (5) 5
19.2%
Decimal Number
ValueCountFrequency (%)
2 3666
17.3%
1 3505
16.5%
3 3133
14.8%
4 2401
11.3%
0 2189
10.3%
5 1928
9.1%
6 1395
 
6.6%
7 1113
 
5.2%
8 1011
 
4.8%
9 869
 
4.1%
Other Punctuation
ValueCountFrequency (%)
, 4682
98.5%
? 51
 
1.1%
· 11
 
0.2%
. 6
 
0.1%
* 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
23938
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3691
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3681
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 370
100.0%
Math Symbol
ValueCountFrequency (%)
~ 140
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 68278
54.0%
Common 57781
45.7%
Latin 342
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5672
 
8.3%
4094
 
6.0%
3878
 
5.7%
3701
 
5.4%
3675
 
5.4%
3670
 
5.4%
3662
 
5.4%
3662
 
5.4%
3254
 
4.8%
2916
 
4.3%
Other values (348) 30094
44.1%
Latin
ValueCountFrequency (%)
B 30
 
8.8%
H 25
 
7.3%
E 25
 
7.3%
O 23
 
6.7%
S 22
 
6.4%
A 20
 
5.8%
K 19
 
5.6%
M 19
 
5.6%
L 17
 
5.0%
R 15
 
4.4%
Other values (32) 127
37.1%
Common
ValueCountFrequency (%)
23938
41.4%
, 4682
 
8.1%
( 3691
 
6.4%
) 3681
 
6.4%
2 3666
 
6.3%
1 3505
 
6.1%
3 3133
 
5.4%
4 2401
 
4.2%
0 2189
 
3.8%
5 1928
 
3.3%
Other values (10) 4967
 
8.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 68278
54.0%
ASCII 58111
46.0%
None 11
 
< 0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
23938
41.2%
, 4682
 
8.1%
( 3691
 
6.4%
) 3681
 
6.3%
2 3666
 
6.3%
1 3505
 
6.0%
3 3133
 
5.4%
4 2401
 
4.1%
0 2189
 
3.8%
5 1928
 
3.3%
Other values (50) 5297
 
9.1%
Hangul
ValueCountFrequency (%)
5672
 
8.3%
4094
 
6.0%
3878
 
5.7%
3701
 
5.4%
3675
 
5.4%
3670
 
5.4%
3662
 
5.4%
3662
 
5.4%
3254
 
4.8%
2916
 
4.3%
Other values (348) 30094
44.1%
None
ValueCountFrequency (%)
· 11
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%

전화번호
Text

MISSING 

Distinct: 2872
Distinct (%): 94.0%
Missing: 607
Missing (%): 16.6%
Memory size: 28.7 KiB

Length

Max length: 13
Median length: 11
Mean length: 11.260884
Min length: 11

Characters and Unicode

Total characters: 34402
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 2731
Unique (%): 89.4%
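
For a column with missing values it matters which denominator each percentage uses. The arithmetic below suggests that the missing share is taken over all rows, while the distinct share, unique share and mean length are taken over the non-missing rows only. This is inferred from the numbers in this section rather than from the tool's documentation.

```python
# Figures copied from the 전화번호 section.
n_obs = 3662
missing = 607
distinct = 2872
unique_once = 2731   # values that occur exactly once
total_chars = 34402

present = n_obs - missing                     # rows that carry a phone number
missing_pct = 100 * missing / n_obs           # denominator: all rows
distinct_pct = 100 * distinct / present       # denominator: non-missing rows
unique_pct = 100 * unique_once / present      # denominator: non-missing rows
mean_length = total_chars / present           # characters per non-missing value

print(present, round(missing_pct, 1), round(distinct_pct, 1),
      round(unique_pct, 1), round(mean_length, 6))
```

All four recomputed values match the report, which supports the reading that 3055 rows carry a phone number.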

Sample

1st row: 070-8733-0505
2nd row: 02-597-6998
3rd row: 02-6207-8020
4th row: 02-3474-9102
5th row: 02-520-0764

Value | Count | Frequency (%)
02-552-2373 | 8 | 0.3%
02-569-7467 | 7 | 0.2%
02-538-3372 | 6 | 0.2%
02-534-0064 | 4 | 0.1%
02-557-9864 | 4 | 0.1%
02-541-8280 | 4 | 0.1%
02-548-1774 | 4 | 0.1%
02-3454-1162 | 4 | 0.1%
02-3486-2000 | 4 | 0.1%
02-522-8277 | 3 | 0.1%
Other values (2862) | 3007 | 98.4%
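
The sample rows and the length range (11 to 13 characters, digits and hyphens only) suggest a regular shape for the phone numbers: a prefix of two or three digits starting with 0, a middle group of three or four digits, and a final group of four. The pattern below is inferred from the values shown in this section and is an assumption. The full column may contain other shapes.

```python
import re

# Assumed pattern: 0XX-XXX(X)-XXXX, inferred from the values shown in this section.
PHONE_RE = re.compile(r"0\d{1,2}-\d{3,4}-\d{4}")


def is_valid_phone(value):
    """True if value is a string matching the assumed phone pattern."""
    return isinstance(value, str) and PHONE_RE.fullmatch(value) is not None


samples = [
    "070-8733-0505",
    "02-597-6998",
    "02-6207-8020",
    "02-3474-9102",
    "02-520-0764",
]

for number in samples:
    print(number, is_valid_phone(number), len(number))
```

Missing entries (shown as <NA> in the sample tables) are not strings, so the helper returns False for them instead of raising an error.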

Most occurring characters

ValueCountFrequency (%)
- 6110
17.8%
0 5346
15.5%
2 4899
14.2%
5 4650
13.5%
3 2231
 
6.5%
4 2091
 
6.1%
7 2040
 
5.9%
1 1920
 
5.6%
6 1841
 
5.4%
8 1685
 
4.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 28292
82.2%
Dash Punctuation 6110
 
17.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 5346
18.9%
2 4899
17.3%
5 4650
16.4%
3 2231
7.9%
4 2091
 
7.4%
7 2040
 
7.2%
1 1920
 
6.8%
6 1841
 
6.5%
8 1685
 
6.0%
9 1589
 
5.6%
Dash Punctuation
ValueCountFrequency (%)
- 6110
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 34402
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 6110
17.8%
0 5346
15.5%
2 4899
14.2%
5 4650
13.5%
3 2231
 
6.5%
4 2091
 
6.1%
7 2040
 
5.9%
1 1920
 
5.6%
6 1841
 
5.4%
8 1685
 
4.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 34402
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 6110
17.8%
0 5346
15.5%
2 4899
14.2%
5 4650
13.5%
3 2231
 
6.5%
4 2091
 
6.1%
7 2040
 
5.9%
1 1920
 
5.6%
6 1841
 
5.4%
8 1685
 
4.9%

Interactions


Correlations

Variable | 연번 | 학원종류 | 교습과정 | 분야구분
연번 | 1.000 | 0.110 | 0.417 | 0.192
학원종류 | 0.110 | 1.000 | 0.984 | 0.857
교습과정 | 0.417 | 0.984 | 1.000 | 0.998
분야구분 | 0.192 | 0.857 | 0.998 | 1.000

Variable | 분야구분 | 학원종류
분야구분 | 1.000 | 0.889
학원종류 | 0.889 | 1.000

Variable | 연번 | 학원종류 | 분야구분
연번 | 1.000 | 0.084 | 0.088
학원종류 | 0.084 | 1.000 | 0.889
분야구분 | 0.088 | 0.889 | 1.000
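
The report does not state which coefficient each matrix uses. One common measure of association between two categorical columns such as 학원종류 and 분야구분 is Cramér's V, computed from the chi-squared statistic of their contingency table. The sketch below implements the uncorrected form in plain Python. Whether this report used that measure, or a bias-corrected variant, is an assumption, and the two small tables are made-up toy counts, not data from this dataset.

```python
import math


def cramers_v(table):
    """Uncorrected Cramér's V for a contingency table (list of rows of counts)."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]

    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            if expected > 0:
                chi2 += (observed - expected) ** 2 / expected

    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k)) if n and k else 0.0


perfect = [[10, 0], [0, 10]]      # each row category maps to one column category
independent = [[5, 5], [5, 5]]    # no association at all

print(cramers_v(perfect), cramers_v(independent))
```

Values close to 1, such as the 0.889 reported for 학원종류 and 분야구분, indicate that one column largely determines the other.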

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

(index) | 연번 | 학원명 | 학원종류 | 교습과정 | 분야구분 | 학원주소 | 전화번호
0 | 1 | 한국사의달인원격교습학원 | 학교교과교습학원 | 보습 | 입시.검정 및 보습 | 서울특별시 강남구 강남대로 340 , 8층일부 (역삼동,경원빌딩) | 070-8733-0505
1 | 2 | 평정수학학원 | 학교교과교습학원 | 보습 | 입시.검정 및 보습 | 서울특별시 서초구 명달로 52-9 , 3층 (서초동) | 02-597-6998
2 | 3 | 프라임스템주니어학원 | 학교교과교습학원 | 종합 | 종합(대) | 서울특별시 강남구 역삼로 448 , 황우거빌딩 1층 (대치동) | 02-6207-8020
3 | 4 | 서초무용학원 | 학교교과교습학원 | 무용 | 예능(대) | 서울특별시 서초구 사임당로 151 대한무지개종합상가 3층 (서초동) | 02-3474-9102
4 | 5 | 백석대학교평생교육신학원 | 평생직업교육학원 | 종합 | 종합(대) | 서울특별시 서초구 방배로 69 , 1층,3층~5층 (방배동, 진리동백석대학교) | 02-520-0764
5 | 6 | 강남청솔학원 | 학교교과교습학원 | 보습·논술 | 입시.검정 및 보습 | 서울특별시 강남구 강남대로94길 36 , 지하2층~7층 (역삼동) | 02-556-9001
6 | 7 | 개혁신학원 | 학교교과교습학원 | 행정 | 기타(대) | 서울특별시 강남구 테헤란로43길 17 (역삼동) | <NA>
7 | 8 | 강남삼육외국어학원 | 평생직업교육학원 | 어학(성인) | 국제화 | 서울특별시 강남구 학동로47길 15 , 2층,3층 (논현동) | 02-512-3605
8 | 9 | 현대음악학원 | 학교교과교습학원 | 음악 | 예능(대) | 서울특별시 강남구 압구정로29길 68 , 주구센타 304호,305호,306호 (압구정동) | 02-543-0017
9 | 10 | 샘밭미술학원 | 학교교과교습학원 | 미술 | 예능(대) | 서울특별시 서초구 방배로16길 11-3 , 2층 (방배동) | 02-584-8561

Last rows

(index) | 연번 | 학원명 | 학원종류 | 교습과정 | 분야구분 | 학원주소 | 전화번호
3652 | 3657 | 수플러스수학학원 | 학교교과교습학원 | 보습·논술 | 입시.검정 및 보습 | 서울특별시 강남구 선릉로64길 18 , 3층 (대치동) | <NA>
3653 | 3658 | 아트브루학원 | 평생직업교육학원 | 미술 | 기예(대) | 서울특별시 강남구 논현로 409 , 301호 (역삼동) | <NA>
3654 | 3659 | 골든캠퍼스학원 | 평생직업교육학원 | 부동산 | 직업기술 | 서울특별시 서초구 사임당로17길 60 , 1층 (서초동) | <NA>
3655 | 3660 | 차이나탄역삼캠프중국어학원 | 평생직업교육학원 | 어학(성인) | 국제화 | 서울특별시 강남구 테헤란로20길 10 , 2층 (역삼동) | <NA>
3656 | 3661 | 릴리프아카데미학원 | 학교교과교습학원 | 연극 | 기타(대) | 서울특별시 강남구 봉은사로37길 28 , 4층 (논현동) | 02-511-2340
3657 | 3662 | 하우스터디강남개포센터독서실 | 학교교과교습학원 | 독서실(유아/초·중·고) | 독서실 | 서울특별시 강남구 개포로 508 , 501호,503호,504호,505호일부 (개포동) | 070-4015-2605
3658 | 3663 | 엠앤에스랩학원 | 학교교과교습학원 | 보습·논술 | 입시.검정 및 보습 | 서울특별시 강남구 역삼로 452 , 2층 일부 (대치동) | <NA>
3659 | 3664 | 강남창조의아침미술학원 | 학교교과교습학원 | 미술 | 예능(대) | 서울특별시 강남구 선릉로 527, 201호(역삼동) (역삼동) | 02-569-6608
3660 | 3665 | 제이엠케이에듀케이션(JMK EDUCATION)학원 | 학교교과교습학원 | 보습·논술 | 입시.검정 및 보습 | 서울특별시 강남구 역삼로 443 , 2층 (대치동) | <NA>
3661 | 3666 | 쩡수학학원 | 학교교과교습학원 | 보습·논술 | 입시.검정 및 보습 | 서울특별시 서초구 신반포로 31 , 제아이동 304호 (반포동) | <NA>