Overview

Dataset statistics

Number of variables: 8
Number of observations: 2784
Missing cells: 62
Missing cells (%): 0.3%
Duplicate rows: 63
Duplicate rows (%): 2.3%
Total size in memory: 174.1 KiB
Average record size in memory: 64.0 B
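
The percentages and the average record size follow directly from the raw counts. As a quick cross-check, here is a sketch that uses only the figures stated in this report (the variable names are illustrative):

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 8
n_observations = 2784
missing_cells = 62
duplicate_rows = 63
total_size_kib = 174.1

total_cells = n_variables * n_observations  # 22272 cells in total
missing_pct = 100 * missing_cells / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")          # 0.3%
print(f"Duplicate rows (%): {duplicate_pct:.1f}%")       # 2.3%
print(f"Average record size: {avg_record_bytes:.1f} B")  # 64.0 B
```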

Variable types

Text: 5
Categorical: 3

Dataset

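Description: Status of teaching centers (교습소) under the jurisdiction of the Incheon Dongbu Office of Education (인천동부교육지원청), Incheon Metropolitan City Office of Education, as of June 30, 2022 (교습소현황_20220630). Covers teaching-center data such as center name, instructor name, center address, and subjects taught.
Author: Incheon Metropolitan City Office of Education (인천광역시교육청)
URL: https://www.data.go.kr/data/15053960/fileData.do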

Alerts

Dataset has 63 (2.3%) duplicate rows [Duplicates]
분야구분 is highly overall correlated with 교습계열 and 1 other field [High correlation]
교습과정 is highly overall correlated with 분야구분 and 1 other field [High correlation]
교습계열 is highly overall correlated with 분야구분 and 1 other field [High correlation]
교습과목(반) has 62 (2.2%) missing values [Missing]

Reproduction

Analysis started: 2023-12-12 13:39:45.076018
Analysis finished: 2023-12-12 13:39:45.995675
Duration: 0.92 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
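
A report of this kind is typically produced by passing a pandas DataFrame to `ProfileReport`. The sketch below is an assumption about how that might look, not the configuration actually used here. The function name, the `cp949` default encoding, the report title, and the file names in the usage comment are all hypothetical.

```python
def build_report(csv_path, html_path, encoding="cp949"):
    """Profile a CSV file and write an HTML report (illustrative sketch)."""
    # Imported lazily so this module loads even where the libraries are absent.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # The encoding is an assumption; adjust it to match the downloaded file.
    df = pd.read_csv(csv_path, encoding=encoding)
    profile = ProfileReport(df, title="Teaching centers profile")
    profile.to_file(html_path)
    return profile

# Hypothetical usage:
# build_report("teaching_centers_20220630.csv", "report.html")
```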

Variables

교습소명
Text

Distinct: 605
Distinct (%): 21.7%
Missing: 0
Missing (%): 0.0%
Memory size: 21.9 KiB

Length

Max length: 33
Median length: 28
Mean length: 9.9019397
Min length: 4
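
The mean length can be cross-checked against the totals reported for this variable (27567 characters over 2784 observations). The same figures also give a simple bound: at least half of the values are no shorter than the median and none are shorter than the minimum, so the mean cannot be below (min + median) / 2. The sketch below applies both checks; the helper name is illustrative. The reported median of 28 does not satisfy the bound, so that figure may be worth re-checking against the raw data.

```python
def mean_lower_bound(min_len, median_len):
    """Smallest mean that is consistent with a given minimum and median."""
    # At least half of the values are >= the median; all values are >= the minimum.
    return (min_len + median_len) / 2

total_characters = 27567
n_observations = 2784
mean_len = total_characters / n_observations
bound = mean_lower_bound(min_len=4, median_len=28)

print(f"mean length = {mean_len:.7f}")
print(f"lower bound = {bound}")
print(f"consistent  = {mean_len >= bound}")
```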

Characters and Unicode

Total characters: 27567
Distinct characters: 462
Distinct categories: 9
Distinct scripts: 4
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
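
As a rough illustration of those properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below counts categories for a few values taken from this report. The mapping from two-letter codes to readable names covers only the categories listed in this section, and the function name is illustrative.

```python
import unicodedata
from collections import Counter

# Readable names for the two-letter general-category codes seen in this section.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "Pd": "Dash Punctuation",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Po": "Other Punctuation",
    "Zs": "Space Separator",
}

def category_counts(values):
    """Count characters per Unicode general category across all values."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts

sample = ["일어-1", "피아노-7", "아트튜브(ART TUBE)미술교습소"]
for name, count in category_counts(sample).most_common():
    print(name, count)
```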

Unique

Unique: 127
Unique (%): 4.6%

Sample

1st row: 일어-1
2nd row: 색샘미술과외교습소
3rd row: 난초피아노교습소
4th row: 피아노-7
5th row: 피아노-41

Value | Count | Frequency (%)
지담수학교습소 | 30 | 1.0%
뮤엠영어송도롯데캐슬점영어교습소 | 29 | 1.0%
아트테라스미술교습소 | 26 | 0.9%
뮤엠영어송도파크자이영어교습소 | 23 | 0.8%
베리타스음악교습소 | 23 | 0.8%
테누토뮤직클래스음악교습소 | 22 | 0.8%
연수미술교습소 | 21 | 0.7%
아트튜브(art | 20 | 0.7%
tube)미술교습소 | 20 | 0.7%
뮤엠영어송도그린에비뉴영어교습소 | 20 | 0.7%
Other values (606) | 2624 | 91.8%
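
The counts above are for whitespace-separated, lowercased tokens rather than whole values. This is why `아트튜브(ART TUBE)미술교습소`, as it appears in the duplicate-rows table, shows up as the two entries `아트튜브(art` and `tube)미술교습소`. Below is a minimal sketch of that kind of counting, assuming a plain lowercase-and-split rule; the profiler's exact tokenizer may differ.

```python
from collections import Counter

def word_counts(values):
    """Count lowercased, whitespace-separated tokens across all values."""
    counts = Counter()
    for value in values:
        counts.update(value.lower().split())
    return counts

names = [
    "아트튜브(ART TUBE)미술교습소",
    "아트튜브(ART TUBE)미술교습소",
    "지담수학교습소",
]
counts = word_counts(names)
total = sum(counts.values())
for word, count in counts.most_common():
    print(f"{word} | {count} | {100 * count / total:.1f}%")
```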

Most occurring characters

ValueCountFrequency (%)
2826
 
10.3%
2813
 
10.2%
2765
 
10.0%
1022
 
3.7%
924
 
3.4%
758
 
2.7%
706
 
2.6%
661
 
2.4%
646
 
2.3%
579
 
2.1%
Other values (452) 13867
50.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 25765
93.5%
Uppercase Letter 947
 
3.4%
Lowercase Letter 321
 
1.2%
Close Punctuation 131
 
0.5%
Decimal Number 126
 
0.5%
Open Punctuation 125
 
0.5%
Space Separator 78
 
0.3%
Other Punctuation 55
 
0.2%
Dash Punctuation 19
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2826
 
11.0%
2813
 
10.9%
2765
 
10.7%
1022
 
4.0%
924
 
3.6%
758
 
2.9%
706
 
2.7%
661
 
2.6%
646
 
2.5%
579
 
2.2%
Other values (396) 12065
46.8%
Uppercase Letter
ValueCountFrequency (%)
S 88
 
9.3%
T 86
 
9.1%
A 79
 
8.3%
E 78
 
8.2%
U 69
 
7.3%
L 67
 
7.1%
P 48
 
5.1%
R 42
 
4.4%
C 42
 
4.4%
M 41
 
4.3%
Other values (13) 307
32.4%
Lowercase Letter
ValueCountFrequency (%)
e 47
14.6%
i 34
10.6%
a 31
9.7%
r 27
8.4%
n 24
7.5%
m 23
 
7.2%
s 23
 
7.2%
u 22
 
6.9%
l 16
 
5.0%
h 15
 
4.7%
Other values (7) 59
18.4%
Decimal Number
ValueCountFrequency (%)
3 33
26.2%
1 32
25.4%
2 26
20.6%
0 22
17.5%
6 5
 
4.0%
7 5
 
4.0%
5 2
 
1.6%
4 1
 
0.8%
Other Punctuation
ValueCountFrequency (%)
& 41
74.5%
: 8
 
14.5%
. 5
 
9.1%
! 1
 
1.8%
Close Punctuation
ValueCountFrequency (%)
) 131
100.0%
Open Punctuation
ValueCountFrequency (%)
( 125
100.0%
Space Separator
ValueCountFrequency (%)
78
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 19
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 25753
93.4%
Latin 1268
 
4.6%
Common 534
 
1.9%
Han 12
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2826
 
11.0%
2813
 
10.9%
2765
 
10.7%
1022
 
4.0%
924
 
3.6%
758
 
2.9%
706
 
2.7%
661
 
2.6%
646
 
2.5%
579
 
2.2%
Other values (395) 12053
46.8%
Latin
ValueCountFrequency (%)
S 88
 
6.9%
T 86
 
6.8%
A 79
 
6.2%
E 78
 
6.2%
U 69
 
5.4%
L 67
 
5.3%
P 48
 
3.8%
e 47
 
3.7%
R 42
 
3.3%
C 42
 
3.3%
Other values (30) 622
49.1%
Common
ValueCountFrequency (%)
) 131
24.5%
( 125
23.4%
78
14.6%
& 41
 
7.7%
3 33
 
6.2%
1 32
 
6.0%
2 26
 
4.9%
0 22
 
4.1%
- 19
 
3.6%
: 8
 
1.5%
Other values (6) 19
 
3.6%
Han
ValueCountFrequency (%)
12
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 25753
93.4%
ASCII 1802
 
6.5%
CJK 12
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2826
 
11.0%
2813
 
10.9%
2765
 
10.7%
1022
 
4.0%
924
 
3.6%
758
 
2.9%
706
 
2.7%
661
 
2.6%
646
 
2.5%
579
 
2.2%
Other values (395) 12053
46.8%
ASCII
ValueCountFrequency (%)
) 131
 
7.3%
( 125
 
6.9%
S 88
 
4.9%
T 86
 
4.8%
A 79
 
4.4%
E 78
 
4.3%
78
 
4.3%
U 69
 
3.8%
L 67
 
3.7%
P 48
 
2.7%
Other values (46) 953
52.9%
CJK
ValueCountFrequency (%)
12
100.0%

교습자-성명
Text

Distinct: 572
Distinct (%): 20.5%
Missing: 0
Missing (%): 0.0%
Memory size: 21.9 KiB

Length

Max length: 22
Median length: 3
Mean length: 3.1282328
Min length: 2

Characters and Unicode

Total characters: 8709
Distinct characters: 196
Distinct categories: 3
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 116
Unique (%): 4.2%

Sample

1st row: 간수웅
2nd row: 노인자
3rd row: 박순란
4th row: 안정숙
5th row: 장정선

Value | Count | Frequency (%)
이성준 | 30 | 1.1%
김효정 | 29 | 1.0%
정민지 | 26 | 0.9%
박은자 | 23 | 0.8%
이문옥 | 23 | 0.8%
김한나 | 23 | 0.8%
서정미 | 21 | 0.7%
박선경 | 20 | 0.7%
윤희범 | 20 | 0.7%
나현경 | 20 | 0.7%
Other values (573) | 2603 | 91.7%

Most occurring characters

ValueCountFrequency (%)
501
 
5.8%
469
 
5.4%
449
 
5.2%
351
 
4.0%
311
 
3.6%
290
 
3.3%
256
 
2.9%
227
 
2.6%
211
 
2.4%
209
 
2.4%
Other values (186) 5435
62.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8245
94.7%
Uppercase Letter 410
 
4.7%
Space Separator 54
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
501
 
6.1%
469
 
5.7%
449
 
5.4%
351
 
4.3%
311
 
3.8%
290
 
3.5%
256
 
3.1%
227
 
2.8%
211
 
2.6%
209
 
2.5%
Other values (165) 4971
60.3%
Uppercase Letter
ValueCountFrequency (%)
A 45
11.0%
L 44
10.7%
E 44
10.7%
I 42
10.2%
N 40
9.8%
U 25
 
6.1%
Y 22
 
5.4%
S 22
 
5.4%
K 19
 
4.6%
T 16
 
3.9%
Other values (10) 91
22.2%
Space Separator
ValueCountFrequency (%)
54
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8245
94.7%
Latin 410
 
4.7%
Common 54
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
501
 
6.1%
469
 
5.7%
449
 
5.4%
351
 
4.3%
311
 
3.8%
290
 
3.5%
256
 
3.1%
227
 
2.8%
211
 
2.6%
209
 
2.5%
Other values (165) 4971
60.3%
Latin
ValueCountFrequency (%)
A 45
11.0%
L 44
10.7%
E 44
10.7%
I 42
10.2%
N 40
9.8%
U 25
 
6.1%
Y 22
 
5.4%
S 22
 
5.4%
K 19
 
4.6%
T 16
 
3.9%
Other values (10) 91
22.2%
Common
ValueCountFrequency (%)
54
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8245
94.7%
ASCII 464
 
5.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
501
 
6.1%
469
 
5.7%
449
 
5.4%
351
 
4.3%
311
 
3.8%
290
 
3.5%
256
 
3.1%
227
 
2.8%
211
 
2.6%
209
 
2.5%
Other values (165) 4971
60.3%
ASCII
ValueCountFrequency (%)
54
11.6%
A 45
9.7%
L 44
9.5%
E 44
9.5%
I 42
 
9.1%
N 40
 
8.6%
U 25
 
5.4%
Y 22
 
4.7%
S 22
 
4.7%
K 19
 
4.1%
Other values (11) 107
23.1%

전화번호
Text

Distinct: 200
Distinct (%): 7.2%
Missing: 0
Missing (%): 0.0%
Memory size: 21.9 KiB

Length

Max length: 13
Median length: 4
Mean length: 6.308908
Min length: 4

Characters and Unicode

Total characters: 17564
Distinct characters: 17
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 46
Unique (%): 1.7%

Sample

1st row: 032-500-0006
2nd row: 032-461-8963
3rd row: 032-422-0576
4th row: 032-200-0017
5th row: 032-300-0008

Value | Count | Frequency (%)
제공불가 | 1983 | 71.2%
032-323-9254 | 21 | 0.8%
032-818-7942 | 17 | 0.6%
032-822-1840 | 16 | 0.6%
032-813-7953 | 14 | 0.5%
070-8861-6616 | 13 | 0.5%
032-834-9981 | 12 | 0.4%
032-822-0913 | 11 | 0.4%
032-427-1101 | 11 | 0.4%
032-813-8022 | 11 | 0.4%
Other values (190) | 675 | 24.2%
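
Note that `제공불가` (roughly "cannot be provided") is counted as an ordinary value. The variable therefore reports 0 missing even though 71.2% of the entries carry no usable number. The address column uses `확인불가` ("cannot be verified") in a similar way. Below is a minimal sketch of mapping such placeholders to missing before profiling; the placeholder set and the helper name are assumptions, not part of the report.

```python
# Placeholder strings treated as missing. This set is an assumption based on
# the values visible in this report, not an official code list.
PLACEHOLDERS = {"제공불가", "확인불가"}

def to_missing(value):
    """Return None for placeholder or empty strings, otherwise the stripped value."""
    if value is None:
        return None
    text = value.strip()
    if text == "" or text in PLACEHOLDERS:
        return None
    return text

phones = ["032-500-0006", "제공불가", "032-461-8963", "제공불가 "]
cleaned = [to_missing(p) for p in phones]
missing = sum(v is None for v in cleaned)
print(f"missing: {missing} of {len(cleaned)}")
```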

Most occurring characters

ValueCountFrequency (%)
1984
11.3%
1984
11.3%
1983
11.3%
1983
11.3%
- 1601
9.1%
2 1475
8.4%
3 1352
7.7%
0 1340
7.6%
4 753
 
4.3%
1 736
 
4.2%
Other values (7) 2373
13.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 8027
45.7%
Other Letter 7936
45.2%
Dash Punctuation 1601
 
9.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 1475
18.4%
3 1352
16.8%
0 1340
16.7%
4 753
9.4%
1 736
9.2%
8 689
8.6%
5 445
 
5.5%
7 429
 
5.3%
6 420
 
5.2%
9 388
 
4.8%
Other Letter
ValueCountFrequency (%)
1984
25.0%
1984
25.0%
1983
25.0%
1983
25.0%
1
 
< 0.1%
1
 
< 0.1%
Dash Punctuation
ValueCountFrequency (%)
- 1601
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 9628
54.8%
Hangul 7936
45.2%

Most frequent character per script

Common
ValueCountFrequency (%)
- 1601
16.6%
2 1475
15.3%
3 1352
14.0%
0 1340
13.9%
4 753
7.8%
1 736
7.6%
8 689
7.2%
5 445
 
4.6%
7 429
 
4.5%
6 420
 
4.4%
Hangul
ValueCountFrequency (%)
1984
25.0%
1984
25.0%
1983
25.0%
1983
25.0%
1
 
< 0.1%
1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 9628
54.8%
Hangul 7936
45.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1984
25.0%
1984
25.0%
1983
25.0%
1983
25.0%
1
 
< 0.1%
1
 
< 0.1%
ASCII
ValueCountFrequency (%)
- 1601
16.6%
2 1475
15.3%
3 1352
14.0%
0 1340
13.9%
4 753
7.8%
1 736
7.6%
8 689
7.2%
5 445
 
4.6%
7 429
 
4.5%
6 420
 
4.4%

교습소주소
Text

Distinct: 589
Distinct (%): 21.2%
Missing: 0
Missing (%): 0.0%
Memory size: 21.9 KiB

Length

Max length: 74
Median length: 56
Mean length: 43.153376
Min length: 1

Characters and Unicode

Total characters: 120139
Distinct characters: 292
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 108
Unique (%): 3.9%

Sample

1st row: 인천광역시 남동구 용천로97번길 28 (구월동 , 해창아파트)
2nd row: 인천광역시 남동구 만수서로105번길 25 ()
3rd row: 확인불가
4th row: 확인불가
5th row: 확인불가

Value | Count | Frequency (%)
인천광역시 | 2770 | 11.4%
 | 2631 | 10.9%
연수구 | 1452 | 6.0%
남동구 | 1318 | 5.4%
송도동 | 686 | 2.8%
상가동 | 537 | 2.2%
동춘동 | 375 | 1.5%
일부 | 365 | 1.5%
구월동 | 306 | 1.3%
논현동 | 275 | 1.1%
Other values (972) | 13485 | 55.7%

Most occurring characters

ValueCountFrequency (%)
21497
 
17.9%
5934
 
4.9%
, 4671
 
3.9%
2 4167
 
3.5%
1 3862
 
3.2%
3336
 
2.8%
3153
 
2.6%
3118
 
2.6%
2943
 
2.4%
( 2909
 
2.4%
Other values (282) 64549
53.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 68670
57.2%
Space Separator 21497
 
17.9%
Decimal Number 18860
 
15.7%
Other Punctuation 4684
 
3.9%
Open Punctuation 2909
 
2.4%
Close Punctuation 2909
 
2.4%
Dash Punctuation 342
 
0.3%
Uppercase Letter 246
 
0.2%
Lowercase Letter 15
 
< 0.1%
Math Symbol 7
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5934
 
8.6%
3336
 
4.9%
3153
 
4.6%
3118
 
4.5%
2943
 
4.3%
2896
 
4.2%
2847
 
4.1%
2791
 
4.1%
2490
 
3.6%
2239
 
3.3%
Other values (248) 36923
53.8%
Uppercase Letter
ValueCountFrequency (%)
A 77
31.3%
H 42
17.1%
B 23
 
9.3%
S 22
 
8.9%
D 16
 
6.5%
C 14
 
5.7%
L 13
 
5.3%
E 12
 
4.9%
J 7
 
2.8%
K 4
 
1.6%
Other values (5) 16
 
6.5%
Decimal Number
ValueCountFrequency (%)
2 4167
22.1%
1 3862
20.5%
0 2877
15.3%
3 1709
9.1%
4 1453
 
7.7%
5 1241
 
6.6%
6 1011
 
5.4%
8 974
 
5.2%
7 855
 
4.5%
9 711
 
3.8%
Other Punctuation
ValueCountFrequency (%)
, 4671
99.7%
/ 7
 
0.1%
. 6
 
0.1%
Space Separator
ValueCountFrequency (%)
21497
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2909
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2909
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 342
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 15
100.0%
Math Symbol
ValueCountFrequency (%)
~ 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 68670
57.2%
Common 51208
42.6%
Latin 261
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5934
 
8.6%
3336
 
4.9%
3153
 
4.6%
3118
 
4.5%
2943
 
4.3%
2896
 
4.2%
2847
 
4.1%
2791
 
4.1%
2490
 
3.6%
2239
 
3.3%
Other values (248) 36923
53.8%
Common
ValueCountFrequency (%)
21497
42.0%
, 4671
 
9.1%
2 4167
 
8.1%
1 3862
 
7.5%
( 2909
 
5.7%
) 2909
 
5.7%
0 2877
 
5.6%
3 1709
 
3.3%
4 1453
 
2.8%
5 1241
 
2.4%
Other values (8) 3913
 
7.6%
Latin
ValueCountFrequency (%)
A 77
29.5%
H 42
16.1%
B 23
 
8.8%
S 22
 
8.4%
D 16
 
6.1%
e 15
 
5.7%
C 14
 
5.4%
L 13
 
5.0%
E 12
 
4.6%
J 7
 
2.7%
Other values (6) 20
 
7.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 68670
57.2%
ASCII 51469
42.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
21497
41.8%
, 4671
 
9.1%
2 4167
 
8.1%
1 3862
 
7.5%
( 2909
 
5.7%
) 2909
 
5.7%
0 2877
 
5.6%
3 1709
 
3.3%
4 1453
 
2.8%
5 1241
 
2.4%
Other values (24) 4174
 
8.1%
Hangul
ValueCountFrequency (%)
5934
 
8.6%
3336
 
4.9%
3153
 
4.6%
3118
 
4.5%
2943
 
4.3%
2896
 
4.2%
2847
 
4.1%
2791
 
4.1%
2490
 
3.6%
2239
 
3.3%
Other values (248) 36923
53.8%

분야구분
Categorical

HIGH CORRELATION 

Distinct: 4
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 21.9 KiB

Length

Max length: 10
Median length: 10
Mean length: 7.4877874
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 입시.검정 및 보습
2nd row: 예능(대)
3rd row: 예능(대)
4th row: 예능(대)
5th row: 예능(대)

Common Values

Value | Count | Frequency (%)
입시.검정 및 보습 | 1442 | 51.8%
예능(대) | 1151 | 41.3%
국제화 | 142 | 5.1%
기타(대) | 49 | 1.8%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Value | Count | Frequency (%)
입시.검정 | 1442 | 25.4%
 | 1442 | 25.4%
보습 | 1442 | 25.4%
예능(대 | 1151 | 20.3%
국제화 | 142 | 2.5%
기타(대 | 49 | 0.9%

교습계열
Categorical

HIGH CORRELATION 

Distinct: 5
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 21.9 KiB

Length

Max length: 5
Median length: 4
Mean length: 4.3796695
Min length: 3

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: 외국어
2nd row: 예능(중)
3rd row: 예능(중)
4th row: 예능(중)
5th row: 예능(중)

Common Values

Value | Count | Frequency (%)
보통교과 | 1440 | 51.7%
예능(중) | 1151 | 41.3%
외국어 | 143 | 5.1%
기타(중) | 49 | 1.8%
진학지도 | 1 | < 0.1%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Value | Count | Frequency (%)
보통교과 | 1440 | 51.7%
예능(중 | 1151 | 41.3%
외국어 | 143 | 5.1%
기타(중 | 49 | 1.8%
진학지도 | 1 | < 0.1%

교습과정
Categorical

HIGH CORRELATION 

Distinct: 16
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Memory size: 21.9 KiB

Length

Max length: 15
Median length: 2
Mean length: 2.8534483
Min length: 2

Unique

Unique: 3
Unique (%): 0.1%

Sample

1st row: 실용외국어(유아/초·중·고)
2nd row: 미술
3rd row: 음악
4th row: 음악
5th row: 음악

Common Values

Value | Count | Frequency (%)
보습 | 1303 | 46.8%
음악 | 717 | 25.8%
미술 | 433 | 15.6%
실용외국어(유아/초·중·고) | 143 | 5.1%
보습·논술 | 100 | 3.6%
입시·논술 | 37 | 1.3%
기타(소) | 13 | 0.5%
컴퓨터(소) | 13 | 0.5%
서예 | 9 | 0.3%
광업자원 | 5 | 0.2%
Other values (6) | 11 | 0.4%

Length

[Figure: Histogram of lengths of the category]

Value | Count | Frequency (%)
보습 | 1303 | 46.8%
음악 | 717 | 25.8%
미술 | 433 | 15.6%
실용외국어(유아/초·중·고 | 143 | 5.1%
보습·논술 | 100 | 3.6%
입시·논술 | 37 | 1.3%
기타(소 | 13 | 0.5%
컴퓨터(소 | 13 | 0.5%
서예 | 9 | 0.3%
광업자원 | 5 | 0.2%
Other values (6) | 11 | 0.4%

교습과목(반)
Text

MISSING 

Distinct: 1394
Distinct (%): 51.2%
Missing: 62
Missing (%): 2.2%
Memory size: 21.9 KiB

Length

Max length: 31
Median length: 25
Mean length: 4.8758266
Min length: 2

Characters and Unicode

Total characters: 13272
Distinct characters: 289
Distinct categories: 12
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1125
Unique (%): 41.3%

Sample

1st row: 미술
2nd row: 피아노
3rd row: 피아노
4th row: 피아노
5th row: 피아노

Value | Count | Frequency (%)
중등 | 53 | 1.9%
피아노 | 49 | 1.7%
중급 | 49 | 1.7%
초등 | 47 | 1.7%
초급 | 46 | 1.6%
고급 | 40 | 1.4%
초등수학 | 36 | 1.3%
중등수학 | 35 | 1.2%
고등 | 30 | 1.1%
중등영어 | 24 | 0.9%
Other values (1350) | 2410 | 85.5%

Most occurring characters

ValueCountFrequency (%)
1636
 
12.3%
1095
 
8.3%
699
 
5.3%
501
 
3.8%
448
 
3.4%
359
 
2.7%
319
 
2.4%
( 311
 
2.3%
) 311
 
2.3%
1 310
 
2.3%
Other values (279) 7283
54.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9710
73.2%
Decimal Number 1037
 
7.8%
Uppercase Letter 921
 
6.9%
Lowercase Letter 697
 
5.3%
Open Punctuation 311
 
2.3%
Close Punctuation 311
 
2.3%
Space Separator 99
 
0.7%
Other Punctuation 77
 
0.6%
Dash Punctuation 55
 
0.4%
Math Symbol 45
 
0.3%
Other values (2) 9
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1636
16.8%
1095
 
11.3%
699
 
7.2%
501
 
5.2%
448
 
4.6%
359
 
3.7%
319
 
3.3%
280
 
2.9%
260
 
2.7%
255
 
2.6%
Other values (209) 3858
39.7%
Uppercase Letter
ValueCountFrequency (%)
B 241
26.2%
A 234
25.4%
C 136
14.8%
E 55
 
6.0%
D 53
 
5.8%
S 20
 
2.2%
P 19
 
2.1%
G 18
 
2.0%
H 17
 
1.8%
K 16
 
1.7%
Other values (15) 112
12.2%
Lowercase Letter
ValueCountFrequency (%)
e 109
15.6%
i 60
 
8.6%
l 56
 
8.0%
n 56
 
8.0%
s 46
 
6.6%
o 41
 
5.9%
a 39
 
5.6%
t 38
 
5.5%
c 35
 
5.0%
r 34
 
4.9%
Other values (12) 183
26.3%
Decimal Number
ValueCountFrequency (%)
1 310
29.9%
2 253
24.4%
3 167
16.1%
0 123
 
11.9%
4 90
 
8.7%
5 47
 
4.5%
6 37
 
3.6%
7 5
 
0.5%
8 3
 
0.3%
9 2
 
0.2%
Other Punctuation
ValueCountFrequency (%)
, 57
74.0%
/ 17
 
22.1%
. 3
 
3.9%
Letter Number
ValueCountFrequency (%)
3
37.5%
3
37.5%
2
25.0%
Math Symbol
ValueCountFrequency (%)
~ 35
77.8%
+ 10
 
22.2%
Open Punctuation
ValueCountFrequency (%)
( 311
100.0%
Close Punctuation
ValueCountFrequency (%)
) 311
100.0%
Space Separator
ValueCountFrequency (%)
99
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 55
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9710
73.2%
Common 1936
 
14.6%
Latin 1626
 
12.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1636
16.8%
1095
 
11.3%
699
 
7.2%
501
 
5.2%
448
 
4.6%
359
 
3.7%
319
 
3.3%
280
 
2.9%
260
 
2.7%
255
 
2.6%
Other values (209) 3858
39.7%
Latin
ValueCountFrequency (%)
B 241
14.8%
A 234
14.4%
C 136
 
8.4%
e 109
 
6.7%
i 60
 
3.7%
l 56
 
3.4%
n 56
 
3.4%
E 55
 
3.4%
D 53
 
3.3%
s 46
 
2.8%
Other values (40) 580
35.7%
Common
ValueCountFrequency (%)
( 311
16.1%
) 311
16.1%
1 310
16.0%
2 253
13.1%
3 167
8.6%
0 123
 
6.4%
99
 
5.1%
4 90
 
4.6%
, 57
 
2.9%
- 55
 
2.8%
Other values (10) 160
8.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9710
73.2%
ASCII 3554
 
26.8%
Number Forms 8
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1636
16.8%
1095
 
11.3%
699
 
7.2%
501
 
5.2%
448
 
4.6%
359
 
3.7%
319
 
3.3%
280
 
2.9%
260
 
2.7%
255
 
2.6%
Other values (209) 3858
39.7%
ASCII
ValueCountFrequency (%)
( 311
 
8.8%
) 311
 
8.8%
1 310
 
8.7%
2 253
 
7.1%
B 241
 
6.8%
A 234
 
6.6%
3 167
 
4.7%
C 136
 
3.8%
0 123
 
3.5%
e 109
 
3.1%
Other values (57) 1359
38.2%
Number Forms
ValueCountFrequency (%)
3
37.5%
3
37.5%
2
25.0%

Correlations

Variable | 분야구분 | 교습계열 | 교습과정
분야구분 | 1.000 | 0.981 | 1.000
교습계열 | 0.981 | 1.000 | 1.000
교습과정 | 1.000 | 1.000 | 1.000

Variable | 분야구분 | 교습과정 | 교습계열
분야구분 | 1.000 | 0.997 | 0.999
교습과정 | 0.997 | 1.000 | 0.998
교습계열 | 0.999 | 0.998 | 1.000

Variable | 분야구분 | 교습계열 | 교습과정
분야구분 | 1.000 | 0.999 | 0.997
교습계열 | 0.999 | 1.000 | 0.998
교습과정 | 0.997 | 0.998 | 1.000
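
The report does not state which association measure each matrix uses. For pairs of categorical variables, one common choice is Cramér's V, which is 0 for independent variables and approaches 1 as the association gets stronger. The sketch below implements the plain (uncorrected) statistic on hypothetical rows. It is not claimed to reproduce the figures above.

```python
from collections import Counter
from math import sqrt

def cramers_v(xs, ys):
    """Plain (uncorrected) Cramér's V for two equally long categorical sequences."""
    n = len(xs)
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    joint = Counter(zip(xs, ys))
    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for x, rx in row_totals.items():
        for y, cy in col_totals.items():
            expected = rx * cy / n
            chi2 += (joint[(x, y)] - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    return sqrt(chi2 / (n * k)) if k > 0 else 0.0

# Hypothetical values, for illustration only.
field = ["예능(대)", "예능(대)", "국제화", "국제화"]
series = ["예능(중)", "예능(중)", "외국어", "외국어"]
unrelated = ["예능(중)", "외국어", "예능(중)", "외국어"]

print(cramers_v(field, series))     # one variable determines the other
print(cramers_v(field, unrelated))  # no association in this toy sample
```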

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display which lets you quickly visually pick out patterns in data completion.]
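
The per-column nullity that these figures visualize is just a count of missing cells in each column. Below is a minimal standard-library sketch. It uses three rows and three columns taken from the sample table, with `<NA>` represented as `None`; the function name is illustrative.

```python
def missing_by_column(rows, columns):
    """Count None cells per column for a list of dict rows."""
    counts = {col: 0 for col in columns}
    for row in rows:
        for col in columns:
            if row.get(col) is None:
                counts[col] += 1
    return counts

columns = ["교습소명", "교습과정", "교습과목(반)"]
rows = [
    {"교습소명": "일어-1", "교습과정": "실용외국어(유아/초·중·고)", "교습과목(반)": None},
    {"교습소명": "색샘미술과외교습소", "교습과정": "미술", "교습과목(반)": "미술"},
    {"교습소명": "난초피아노교습소", "교습과정": "음악", "교습과목(반)": "피아노"},
]
nullity = missing_by_column(rows, columns)
for col, n in nullity.items():
    print(f"{col}: {n} missing ({100 * n / len(rows):.1f}%)")
```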

Sample

Index | 교습소명 | 교습자-성명 | 전화번호 | 교습소주소 | 분야구분 | 교습계열 | 교습과정 | 교습과목(반)
0 | 일어-1 | 간수웅 | 032-500-0006 | 인천광역시 남동구 용천로97번길 28 (구월동 , 해창아파트) | 입시.검정 및 보습 | 외국어 | 실용외국어(유아/초·중·고) | <NA>
1 | 색샘미술과외교습소 | 노인자 | 032-461-8963 | 인천광역시 남동구 만수서로105번길 25 () | 예능(대) | 예능(중) | 미술 | 미술
2 | 난초피아노교습소 | 박순란 | 032-422-0576 | 확인불가 | 예능(대) | 예능(중) | 음악 | 피아노
3 | 피아노-7 | 안정숙 | 032-200-0017 | 확인불가 | 예능(대) | 예능(중) | 음악 | 피아노
4 | 피아노-41 | 장정선 | 032-300-0008 | 확인불가 | 예능(대) | 예능(중) | 음악 | 피아노
5 | 피아노-20 | 장은자 | 032-300-0017 | 인천광역시 남동구 남동대로916번길 36-1 31/3 (간석동) | 예능(대) | 예능(중) | 음악 | 피아노
6 | 피노키오피아노과외교습소 | 소영임 | 032-833-3585 | 인천광역시 연수구 옥련로 33 단지내상가 302호 (옥련동 , 현대1차아파트) | 예능(대) | 예능(중) | 음악 | 초급반
7 | 피노키오피아노과외교습소 | 소영임 | 032-833-3585 | 인천광역시 연수구 옥련로 33 단지내상가 302호 (옥련동 , 현대1차아파트) | 예능(대) | 예능(중) | 음악 | 중급반
8 | 피노키오피아노과외교습소 | 소영임 | 032-833-3585 | 인천광역시 연수구 옥련로 33 단지내상가 302호 (옥련동 , 현대1차아파트) | 예능(대) | 예능(중) | 음악 | 고급반
9 | 피노키오피아노과외교습소 | 소영임 | 032-833-3585 | 인천광역시 연수구 옥련로 33 단지내상가 302호 (옥련동 , 현대1차아파트) | 예능(대) | 예능(중) | 음악 | 초급

Index | 교습소명 | 교습자-성명 | 전화번호 | 교습소주소 | 분야구분 | 교습계열 | 교습과정 | 교습과목(반)
2774 | 프레이즈음악교습소 | 허춘희 | 제공불가 | 인천광역시 남동구 장도로 4-2 , 1층일부 (논현동) | 예능(대) | 예능(중) | 음악 | 초급
2775 | 프레이즈음악교습소 | 허춘희 | 제공불가 | 인천광역시 남동구 장도로 4-2 , 1층일부 (논현동) | 예능(대) | 예능(중) | 음악 | 중급A
2776 | 프레이즈음악교습소 | 허춘희 | 제공불가 | 인천광역시 남동구 장도로 4-2 , 1층일부 (논현동) | 예능(대) | 예능(중) | 음악 | 중급B
2777 | 프레이즈음악교습소 | 허춘희 | 제공불가 | 인천광역시 남동구 장도로 4-2 , 1층일부 (논현동) | 예능(대) | 예능(중) | 음악 | 고급A
2778 | 프레이즈음악교습소 | 허춘희 | 제공불가 | 인천광역시 남동구 장도로 4-2 , 1층일부 (논현동) | 예능(대) | 예능(중) | 음악 | 고급B
2779 | 프레이즈음악교습소 | 허춘희 | 제공불가 | 인천광역시 남동구 장도로 4-2 , 1층일부 (논현동) | 예능(대) | 예능(중) | 음악 | 입시
2780 | 프레이즈음악교습소 | 허춘희 | 제공불가 | 인천광역시 남동구 장도로 4-2 , 1층일부 (논현동) | 예능(대) | 예능(중) | 음악 | 우쿨렐레
2781 | 프레이즈음악교습소 | 허춘희 | 제공불가 | 인천광역시 남동구 장도로 4-2 , 1층일부 (논현동) | 예능(대) | 예능(중) | 음악 | 플룻
2782 | 송도맥수학교습소 | 이정휘 | 032-858-5010 | 인천광역시 연수구 컨벤시아대로 81 , 드림시티 501호 (송도동,드림시티) | 입시.검정 및 보습 | 보통교과 | 보습 | 중등단과
2783 | 송도맥수학교습소 | 이정휘 | 032-858-5010 | 인천광역시 연수구 컨벤시아대로 81 , 드림시티 501호 (송도동,드림시티) | 입시.검정 및 보습 | 보통교과 | 보습 | 고등단과

Duplicate rows

Most frequently occurring

Index | 교습소명 | 교습자-성명 | 전화번호 | 교습소주소 | 분야구분 | 교습계열 | 교습과정 | 교습과목(반) | # duplicates
31 | 아트랩미술교습소 | 황인홍 | 제공불가 | 인천광역시 연수구 센트럴로 194 , 204호 (송도동, 더샵센트럴파크2) | 예능(대) | 예능(중) | 미술 | 유아초등 | 10
37 | 아트튜브(ART TUBE)미술교습소 | 한혜성 | 제공불가 | 인천광역시 연수구 센트럴로 232 , 212호 (송도동, 더샵센트럴파크1) | 예능(대) | 예능(중) | 미술 | 초등영재 | 8
13 | 더클립수학교습소 | 김민서 | 제공불가 | 인천광역시 남동구 담방로 105 , 201호,202호 (만수동, 만수주공7,8단지아파트) | 입시.검정 및 보습 | 보통교과 | 보습 | 초등수학 | 5
32 | 아트랩미술교습소 | 황인홍 | 제공불가 | 인천광역시 연수구 센트럴로 194 , 204호 (송도동, 더샵센트럴파크2) | 예능(대) | 예능(중) | 미술 | 입시반 | 5
14 | 뮤엠구월서초영어교습소 | 남미경 | 제공불가 | 인천광역시 남동구 문화로169번길 44 , 1층 102호 일부 (간석동) | 입시.검정 및 보습 | 보통교과 | 보습 | 보습단과고등 | 4
15 | 뮤엠구월서초영어교습소 | 남미경 | 제공불가 | 인천광역시 남동구 문화로169번길 44 , 1층 102호 일부 (간석동) | 입시.검정 및 보습 | 보통교과 | 보습 | 보습단과중등 | 4
16 | 뮤엠구월서초영어교습소 | 남미경 | 제공불가 | 인천광역시 남동구 문화로169번길 44 , 1층 102호 일부 (간석동) | 입시.검정 및 보습 | 보통교과 | 보습 | 보습단과초등 | 4
35 | 아트튜브(ART TUBE)미술교습소 | 한혜성 | 제공불가 | 인천광역시 연수구 센트럴로 232 , 212호 (송도동, 더샵센트럴파크1) | 예능(대) | 예능(중) | 미술 | 중등입시 | 4
45 | 우노미술교습소 | 우은오 | 제공불가 | 인천광역시 연수구 컨벤시아대로130번길 58 , 205호,상가 비동 (송도동, 송도자이하버뷰1단지아파트) | 예능(대) | 예능(중) | 미술 | 입시초급 | 4
0 | 가람중국어교습소 | 남해령 | 제공불가 | 인천광역시 연수구 원인재로 180 , 상가동 지층 008호 (연수동, 우성아파트) | 입시.검정 및 보습 | 보통교과 | 보습 | 고등(단과) | 3
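
Each entry above is a fully identical row together with the number of times it occurs. Below is a minimal sketch of that grouping, plus an order-preserving de-duplication, using only the standard library. The rows are shortened, hypothetical two-column tuples; a real check would compare all eight columns.

```python
from collections import Counter

def duplicate_groups(rows):
    """Return (row, count) pairs for rows occurring more than once, most frequent first."""
    counts = Counter(rows)
    groups = [(row, n) for row, n in counts.items() if n > 1]
    return sorted(groups, key=lambda item: item[1], reverse=True)

# Hypothetical (교습소명, 교습과목(반)) tuples, for illustration only.
rows = [
    ("아트랩미술교습소", "유아초등"),
    ("아트랩미술교습소", "유아초등"),
    ("아트랩미술교습소", "유아초등"),
    ("아트랩미술교습소", "입시반"),
    ("아트랩미술교습소", "입시반"),
    ("송도맥수학교습소", "중등단과"),
]
groups = duplicate_groups(rows)
for row, n in groups:
    print(" | ".join(row), "|", n)

# Keep the first occurrence of each row, preserving the original order.
deduplicated = list(dict.fromkeys(rows))
print(len(deduplicated), "rows after de-duplication")
```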