Overview

Dataset statistics

Number of variables7
Number of observations1193
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory67.7 KiB
Average record size in memory58.1 B

Variable types

Numeric1
Categorical2
Text4

Dataset

Description한국산업인력공단 외국인근로자가 자주 쓰는 외국어 정보(키르기스스탄어)로 외국인근로자가 자주 사용하는 키르기스스탄어 문장을 제공합니다.
URLhttps://www.data.go.kr/data/15050963/fileData.do

Alerts

대분류코드 has constant value ""Constant
연번 is highly overall correlated with 대분류High correlation
대분류 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 15:05:54.567959
Analysis finished2023-12-12 15:05:55.754499
Duration1.19 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct1193
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean597.46186
Minimum1
Maximum1194
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size10.6 KiB
2023-12-13T00:05:55.847679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile60.6
Q1299
median597
Q3896
95-th percentile1134.4
Maximum1194
Range1193
Interquartile range (IQR)597

Descriptive statistics

Standard deviation344.96451
Coefficient of variation (CV)0.57738331
Kurtosis-1.2014613
Mean597.46186
Median Absolute Deviation (MAD)299
Skewness0.00033030759
Sum712772
Variance119000.51
MonotonicityStrictly increasing
2023-12-13T00:05:56.028571image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
803 1
 
0.1%
801 1
 
0.1%
800 1
 
0.1%
799 1
 
0.1%
798 1
 
0.1%
797 1
 
0.1%
796 1
 
0.1%
795 1
 
0.1%
794 1
 
0.1%
Other values (1183) 1183
99.2%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1194 1
0.1%
1193 1
0.1%
1192 1
0.1%
1191 1
0.1%
1190 1
0.1%
1189 1
0.1%
1188 1
0.1%
1187 1
0.1%
1186 1
0.1%
1185 1
0.1%

대분류코드
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size9.4 KiB
4
1193 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row4
2nd row4
3rd row4
4th row4
5th row4

Common Values

ValueCountFrequency (%)
4 1193
100.0%

Length

2023-12-13T00:05:56.171047image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:05:56.286412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
4 1193
100.0%

대분류
Categorical

HIGH CORRELATION 

Distinct9
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size9.4 KiB
일상생활
608 
작업지시
208 
근로관련
185 
기숙사및식당
120 
고용관련신고
68 
Other values (4)
 
4

Length

Max length33
Median length4
Mean length4.3671417
Min length4

Unique

Unique4 ?
Unique (%)0.3%

Sample

1st row일상생활
2nd row일상생활
3rd row일상생활
4th row일상생활
5th row일상생활

Common Values

ValueCountFrequency (%)
일상생활 608
51.0%
작업지시 208
 
17.4%
근로관련 185
 
15.5%
기숙사및식당 120
 
10.1%
고용관련신고 68
 
5.7%
Иш буйругу 1
 
0.1%
Жатакана жана ашкана 1
 
0.1%
Жумуш жонундо 1
 
0.1%
Жумушка орношууда билдируу жасоо 1
 
0.1%

Length

2023-12-13T00:05:56.424456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:05:56.565692image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일상생활 608
50.7%
작업지시 208
 
17.3%
근로관련 185
 
15.4%
기숙사및식당 120
 
10.0%
고용관련신고 68
 
5.7%
иш 1
 
0.1%
буйругу 1
 
0.1%
жатакана 1
 
0.1%
жана 1
 
0.1%
ашкана 1
 
0.1%
Other values (6) 6
 
0.5%
Distinct55
Distinct (%)4.6%
Missing0
Missing (%)0.0%
Memory size9.4 KiB
2023-12-13T00:05:56.782052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length32
Mean length4.5163453
Min length2

Characters and Unicode

Total characters5388
Distinct characters110
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique27 ?
Unique (%)2.3%

Sample

1st row인사,소개
2nd row인사,소개
3rd row인사,소개
4th row인사,소개
5th row인사,소개
ValueCountFrequency (%)
기타 332
26.8%
근무태도 83
 
6.7%
건강,병원 82
 
6.6%
급여,수당 65
 
5.2%
기숙사규칙 57
 
4.6%
시장,교통 46
 
3.7%
음식,식생활 45
 
3.6%
안전규칙 45
 
3.6%
작업규칙등기타 44
 
3.5%
인사,소개 43
 
3.5%
Other values (77) 398
32.1%
2023-12-13T00:05:57.156443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
522
 
9.7%
, 429
 
8.0%
419
 
7.8%
220
 
4.1%
179
 
3.3%
179
 
3.3%
136
 
2.5%
134
 
2.5%
103
 
1.9%
103
 
1.9%
Other values (100) 2964
55.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4376
81.2%
Lowercase Letter 481
 
8.9%
Other Punctuation 431
 
8.0%
Space Separator 67
 
1.2%
Uppercase Letter 29
 
0.5%
Dash Punctuation 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
522
 
11.9%
419
 
9.6%
220
 
5.0%
179
 
4.1%
179
 
4.1%
136
 
3.1%
134
 
3.1%
103
 
2.4%
103
 
2.4%
98
 
2.2%
Other values (60) 2283
52.2%
Lowercase Letter
ValueCountFrequency (%)
а 82
17.0%
у 64
13.3%
р 34
 
7.1%
о 33
 
6.9%
н 31
 
6.4%
т 28
 
5.8%
к 27
 
5.6%
м 22
 
4.6%
е 21
 
4.4%
ш 18
 
3.7%
Other values (17) 121
25.2%
Uppercase Letter
ValueCountFrequency (%)
Ж 11
37.9%
А 5
17.2%
К 4
 
13.8%
М 2
 
6.9%
И 2
 
6.9%
Б 2
 
6.9%
Т 1
 
3.4%
Д 1
 
3.4%
Э 1
 
3.4%
Other Punctuation
ValueCountFrequency (%)
, 429
99.5%
. 2
 
0.5%
Space Separator
ValueCountFrequency (%)
67
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4376
81.2%
Cyrillic 510
 
9.5%
Common 502
 
9.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
522
 
11.9%
419
 
9.6%
220
 
5.0%
179
 
4.1%
179
 
4.1%
136
 
3.1%
134
 
3.1%
103
 
2.4%
103
 
2.4%
98
 
2.2%
Other values (60) 2283
52.2%
Cyrillic
ValueCountFrequency (%)
а 82
16.1%
у 64
12.5%
р 34
 
6.7%
о 33
 
6.5%
н 31
 
6.1%
т 28
 
5.5%
к 27
 
5.3%
м 22
 
4.3%
е 21
 
4.1%
ш 18
 
3.5%
Other values (26) 150
29.4%
Common
ValueCountFrequency (%)
, 429
85.5%
67
 
13.3%
- 4
 
0.8%
. 2
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4376
81.2%
Cyrillic 510
 
9.5%
ASCII 502
 
9.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
522
 
11.9%
419
 
9.6%
220
 
5.0%
179
 
4.1%
179
 
4.1%
136
 
3.1%
134
 
3.1%
103
 
2.4%
103
 
2.4%
98
 
2.2%
Other values (60) 2283
52.2%
ASCII
ValueCountFrequency (%)
, 429
85.5%
67
 
13.3%
- 4
 
0.8%
. 2
 
0.4%
Cyrillic
ValueCountFrequency (%)
а 82
16.1%
у 64
12.5%
р 34
 
6.7%
о 33
 
6.5%
н 31
 
6.1%
т 28
 
5.5%
к 27
 
5.3%
м 22
 
4.3%
е 21
 
4.1%
ш 18
 
3.5%
Other values (26) 150
29.4%
Distinct1190
Distinct (%)99.7%
Missing0
Missing (%)0.0%
Memory size9.4 KiB
2023-12-13T00:05:57.573431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length66
Median length34
Mean length14.005868
Min length1

Characters and Unicode

Total characters16709
Distinct characters614
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1187 ?
Unique (%)99.5%

Sample

1st row고맙습니다.
2nd row그동안 수고하셨습니다
3rd row다음에 또 뵈요
4th row다음에 오겠습니다.
5th row당신을 잊지 못할 것 입니다
ValueCountFrequency (%)
합니다 36
 
0.9%
있습니다 34
 
0.9%
32
 
0.8%
하세요 28
 
0.7%
마세요 28
 
0.7%
입니다 26
 
0.7%
21
 
0.5%
안됩니다 19
 
0.5%
18
 
0.5%
반드시 17
 
0.4%
Other values (2406) 3589
93.3%
2023-12-13T00:05:58.170873image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2975
 
17.8%
572
 
3.4%
523
 
3.1%
. 473
 
2.8%
463
 
2.8%
359
 
2.1%
314
 
1.9%
293
 
1.8%
262
 
1.6%
251
 
1.5%
Other values (604) 10224
61.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 13030
78.0%
Space Separator 2975
 
17.8%
Other Punctuation 507
 
3.0%
Decimal Number 138
 
0.8%
Uppercase Letter 27
 
0.2%
Math Symbol 11
 
0.1%
Close Punctuation 7
 
< 0.1%
Open Punctuation 7
 
< 0.1%
Dash Punctuation 4
 
< 0.1%
Lowercase Letter 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
572
 
4.4%
523
 
4.0%
463
 
3.6%
359
 
2.8%
314
 
2.4%
293
 
2.2%
262
 
2.0%
251
 
1.9%
225
 
1.7%
215
 
1.7%
Other values (573) 9553
73.3%
Decimal Number
ValueCountFrequency (%)
0 45
32.6%
1 26
18.8%
2 17
 
12.3%
3 10
 
7.2%
9 10
 
7.2%
4 8
 
5.8%
5 7
 
5.1%
6 6
 
4.3%
8 6
 
4.3%
7 3
 
2.2%
Other Punctuation
ValueCountFrequency (%)
. 473
93.3%
, 23
 
4.5%
/ 6
 
1.2%
1
 
0.2%
* 1
 
0.2%
% 1
 
0.2%
' 1
 
0.2%
! 1
 
0.2%
Uppercase Letter
ValueCountFrequency (%)
O 22
81.5%
C 2
 
7.4%
E 1
 
3.7%
S 1
 
3.7%
A 1
 
3.7%
Lowercase Letter
ValueCountFrequency (%)
d 1
33.3%
t 1
33.3%
v 1
33.3%
Space Separator
ValueCountFrequency (%)
2975
100.0%
Math Symbol
ValueCountFrequency (%)
~ 11
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 13030
78.0%
Common 3649
 
21.8%
Latin 30
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
572
 
4.4%
523
 
4.0%
463
 
3.6%
359
 
2.8%
314
 
2.4%
293
 
2.2%
262
 
2.0%
251
 
1.9%
225
 
1.7%
215
 
1.7%
Other values (573) 9553
73.3%
Common
ValueCountFrequency (%)
2975
81.5%
. 473
 
13.0%
0 45
 
1.2%
1 26
 
0.7%
, 23
 
0.6%
2 17
 
0.5%
~ 11
 
0.3%
3 10
 
0.3%
9 10
 
0.3%
4 8
 
0.2%
Other values (13) 51
 
1.4%
Latin
ValueCountFrequency (%)
O 22
73.3%
C 2
 
6.7%
E 1
 
3.3%
d 1
 
3.3%
t 1
 
3.3%
v 1
 
3.3%
S 1
 
3.3%
A 1
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 12891
77.2%
ASCII 3678
 
22.0%
Compat Jamo 139
 
0.8%
Punctuation 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2975
80.9%
. 473
 
12.9%
0 45
 
1.2%
1 26
 
0.7%
, 23
 
0.6%
O 22
 
0.6%
2 17
 
0.5%
~ 11
 
0.3%
3 10
 
0.3%
9 10
 
0.3%
Other values (20) 66
 
1.8%
Hangul
ValueCountFrequency (%)
572
 
4.4%
523
 
4.1%
463
 
3.6%
359
 
2.8%
314
 
2.4%
293
 
2.3%
262
 
2.0%
251
 
1.9%
225
 
1.7%
215
 
1.7%
Other values (572) 9414
73.0%
Compat Jamo
ValueCountFrequency (%)
139
100.0%
Punctuation
ValueCountFrequency (%)
1
100.0%
Distinct1176
Distinct (%)98.6%
Missing0
Missing (%)0.0%
Memory size9.4 KiB
2023-12-13T00:05:58.504863image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length205
Median length84
Mean length32.15088
Min length1

Characters and Unicode

Total characters38356
Distinct characters86
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1159 ?
Unique (%)97.2%

Sample

1st rowРахмат
2nd rowУшул убакыт бою жакшы иштедииз
3rd rowЭмкиде дагы жолугалы
4th rowЭмкиде келем
5th rowСизди унута албайм
ValueCountFrequency (%)
бул 79
 
1.5%
керек 61
 
1.1%
менен 58
 
1.1%
жакшы 57
 
1.1%
коопсуздук 36
 
0.7%
бир 33
 
0.6%
турган 30
 
0.6%
29
 
0.5%
айлык 28
 
0.5%
болот 28
 
0.5%
Other values (2299) 4984
91.9%
2023-12-13T00:05:59.052809image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4591
 
12.0%
а 3870
 
10.1%
н 2373
 
6.2%
у 2265
 
5.9%
к 2124
 
5.5%
ы 1993
 
5.2%
е 1889
 
4.9%
т 1829
 
4.8%
и 1601
 
4.2%
р 1589
 
4.1%
Other values (76) 14232
37.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 31678
82.6%
Space Separator 4591
 
12.0%
Uppercase Letter 1194
 
3.1%
Other Punctuation 355
 
0.9%
Connector Punctuation 297
 
0.8%
Decimal Number 136
 
0.4%
Other Letter 37
 
0.1%
Dash Punctuation 33
 
0.1%
Open Punctuation 13
 
< 0.1%
Close Punctuation 13
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
а 3870
 
12.2%
н 2373
 
7.5%
у 2265
 
7.2%
к 2124
 
6.7%
ы 1993
 
6.3%
е 1889
 
6.0%
т 1829
 
5.8%
и 1601
 
5.1%
р 1589
 
5.0%
о 1507
 
4.8%
Other values (22) 10638
33.6%
Uppercase Letter
ValueCountFrequency (%)
К 186
15.6%
Б 178
14.9%
Ж 147
12.3%
А 121
10.1%
С 90
7.5%
Т 76
 
6.4%
Э 59
 
4.9%
М 59
 
4.9%
И 46
 
3.9%
О 36
 
3.0%
Other values (22) 196
16.4%
Decimal Number
ValueCountFrequency (%)
0 74
54.4%
1 17
 
12.5%
2 13
 
9.6%
3 8
 
5.9%
9 8
 
5.9%
4 5
 
3.7%
5 4
 
2.9%
8 3
 
2.2%
6 3
 
2.2%
7 1
 
0.7%
Other Punctuation
ValueCountFrequency (%)
. 304
85.6%
, 44
 
12.4%
/ 5
 
1.4%
% 1
 
0.3%
* 1
 
0.3%
Space Separator
ValueCountFrequency (%)
4591
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 297
100.0%
Other Letter
ValueCountFrequency (%)
37
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 33
100.0%
Open Punctuation
ValueCountFrequency (%)
( 13
100.0%
Close Punctuation
ValueCountFrequency (%)
) 13
100.0%
Math Symbol
ValueCountFrequency (%)
~ 9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Cyrillic 32819
85.6%
Common 5447
 
14.2%
Latin 53
 
0.1%
Hangul 37
 
0.1%

Most frequent character per script

Cyrillic
ValueCountFrequency (%)
а 3870
 
11.8%
н 2373
 
7.2%
у 2265
 
6.9%
к 2124
 
6.5%
ы 1993
 
6.1%
е 1889
 
5.8%
т 1829
 
5.6%
и 1601
 
4.9%
р 1589
 
4.8%
о 1507
 
4.6%
Other values (48) 11779
35.9%
Common
ValueCountFrequency (%)
4591
84.3%
. 304
 
5.6%
_ 297
 
5.5%
0 74
 
1.4%
, 44
 
0.8%
- 33
 
0.6%
1 17
 
0.3%
( 13
 
0.2%
2 13
 
0.2%
) 13
 
0.2%
Other values (11) 48
 
0.9%
Latin
ValueCountFrequency (%)
D 15
28.3%
I 15
28.3%
C 11
20.8%
c 6
 
11.3%
O 5
 
9.4%
E 1
 
1.9%
Hangul
ValueCountFrequency (%)
37
100.0%

Most occurring blocks

ValueCountFrequency (%)
Cyrillic 32819
85.6%
ASCII 5500
 
14.3%
Compat Jamo 37
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4591
83.5%
. 304
 
5.5%
_ 297
 
5.4%
0 74
 
1.3%
, 44
 
0.8%
- 33
 
0.6%
1 17
 
0.3%
D 15
 
0.3%
I 15
 
0.3%
( 13
 
0.2%
Other values (17) 97
 
1.8%
Cyrillic
ValueCountFrequency (%)
а 3870
 
11.8%
н 2373
 
7.2%
у 2265
 
6.9%
к 2124
 
6.5%
ы 1993
 
6.1%
е 1889
 
5.8%
т 1829
 
5.6%
и 1601
 
4.9%
р 1589
 
4.8%
о 1507
 
4.6%
Other values (48) 11779
35.9%
Compat Jamo
ValueCountFrequency (%)
37
100.0%

발음
Text

Distinct1181
Distinct (%)99.0%
Missing0
Missing (%)0.0%
Memory size9.4 KiB
2023-12-13T00:05:59.641276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length118
Median length56
Mean length18.744342
Min length1

Characters and Unicode

Total characters22362
Distinct characters553
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1169 ?
Unique (%)98.0%

Sample

1st row라흐맛
2nd row우쉴 우바큿 보유 자크쉬 이쉬테디니즈
3rd row엠키데 다그 졸루갈르
4th row엠키데 켈렘
5th row시즈디 우누타 알바임
ValueCountFrequency (%)
79
 
1.5%
케렉 59
 
1.1%
메넨 55
 
1.0%
작쉬 44
 
0.8%
우춘 43
 
0.8%
코옵수수둑 32
 
0.6%
볼롯 30
 
0.6%
비르 27
 
0.5%
바르 26
 
0.5%
아일륵 26
 
0.5%
Other values (2539) 4954
92.2%
2023-12-13T00:06:00.092026image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4257
 
19.0%
746
 
3.3%
598
 
2.7%
458
 
2.0%
430
 
1.9%
408
 
1.8%
402
 
1.8%
_ 381
 
1.7%
374
 
1.7%
340
 
1.5%
Other values (543) 13968
62.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 17461
78.1%
Space Separator 4257
 
19.0%
Connector Punctuation 381
 
1.7%
Decimal Number 113
 
0.5%
Other Punctuation 80
 
0.4%
Dash Punctuation 30
 
0.1%
Open Punctuation 15
 
0.1%
Close Punctuation 13
 
0.1%
Math Symbol 9
 
< 0.1%
Uppercase Letter 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
746
 
4.3%
598
 
3.4%
458
 
2.6%
430
 
2.5%
408
 
2.3%
402
 
2.3%
374
 
2.1%
340
 
1.9%
336
 
1.9%
328
 
1.9%
Other values (519) 13041
74.7%
Decimal Number
ValueCountFrequency (%)
0 53
46.9%
1 16
 
14.2%
2 13
 
11.5%
3 8
 
7.1%
9 8
 
7.1%
4 5
 
4.4%
8 3
 
2.7%
6 3
 
2.7%
5 3
 
2.7%
7 1
 
0.9%
Other Punctuation
ValueCountFrequency (%)
, 43
53.8%
. 30
37.5%
/ 5
 
6.2%
% 1
 
1.2%
* 1
 
1.2%
Open Punctuation
ValueCountFrequency (%)
( 13
86.7%
[ 2
 
13.3%
Uppercase Letter
ValueCountFrequency (%)
C 2
66.7%
E 1
33.3%
Space Separator
ValueCountFrequency (%)
4257
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 381
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 30
100.0%
Close Punctuation
ValueCountFrequency (%)
) 13
100.0%
Math Symbol
ValueCountFrequency (%)
~ 9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 17461
78.1%
Common 4898
 
21.9%
Latin 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
746
 
4.3%
598
 
3.4%
458
 
2.6%
430
 
2.5%
408
 
2.3%
402
 
2.3%
374
 
2.1%
340
 
1.9%
336
 
1.9%
328
 
1.9%
Other values (519) 13041
74.7%
Common
ValueCountFrequency (%)
4257
86.9%
_ 381
 
7.8%
0 53
 
1.1%
, 43
 
0.9%
. 30
 
0.6%
- 30
 
0.6%
1 16
 
0.3%
2 13
 
0.3%
) 13
 
0.3%
( 13
 
0.3%
Other values (12) 49
 
1.0%
Latin
ValueCountFrequency (%)
C 2
66.7%
E 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 17455
78.1%
ASCII 4901
 
21.9%
Compat Jamo 6
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4257
86.9%
_ 381
 
7.8%
0 53
 
1.1%
, 43
 
0.9%
. 30
 
0.6%
- 30
 
0.6%
1 16
 
0.3%
2 13
 
0.3%
) 13
 
0.3%
( 13
 
0.3%
Other values (14) 52
 
1.1%
Hangul
ValueCountFrequency (%)
746
 
4.3%
598
 
3.4%
458
 
2.6%
430
 
2.5%
408
 
2.3%
402
 
2.3%
374
 
2.1%
340
 
1.9%
336
 
1.9%
328
 
1.9%
Other values (515) 13035
74.7%
Compat Jamo
ValueCountFrequency (%)
3
50.0%
1
 
16.7%
1
 
16.7%
1
 
16.7%

Interactions

2023-12-13T00:05:55.415757image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T00:06:00.180224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번대분류소분류
연번1.0000.8520.981
대분류0.8521.0000.986
소분류0.9810.9861.000
2023-12-13T00:06:00.253733image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번대분류
연번1.0000.607
대분류0.6071.000

Missing values

2023-12-13T00:05:55.579325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T00:05:55.698308image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번대분류코드대분류소분류한국어키르키즈스탄어발음
014일상생활인사,소개고맙습니다.Рахмат라흐맛
124일상생활인사,소개그동안 수고하셨습니다Ушул убакыт бою жакшы иштедииз우쉴 우바큿 보유 자크쉬 이쉬테디니즈
234일상생활인사,소개다음에 또 뵈요Эмкиде дагы жолугалы엠키데 다그 졸루갈르
344일상생활인사,소개다음에 오겠습니다.Эмкиде келем엠키데 켈렘
454일상생활인사,소개당신을 잊지 못할 것 입니다Сизди унута албайм시즈디 우누타 알바임
564일상생활인사,소개당신의 상사는 ㅇㅇㅇ 입니다.Бул киши _______ сиздин шефииз불 키쉬 ____ 시즈딘 쉬에비니즈
674일상생활인사,소개당신의 성공을 기원합니다Сизге ийгилик каалайм시즈게 이이길릭 카알라임
784일상생활인사,소개동료Коллега콜레가
894일상생활인사,소개만나서 반갑습니다.Жолукканыбызга кубанычтамын졸루카느브즈가 쿠바느치타믄
9104일상생활인사,소개맛있게 드세요Тамагыыз таттуу болсун타마그느즈 타투 볼순
연번대분류코드대분류소분류한국어키르키즈스탄어발음
118311854고용관련신고기타사항이곳에 도장을 찍어주세요Бул жерге мрду басыныз.불 제르게 모오르두 바스느즈
118411864고용관련신고기타사항이곳에 서류를 접수하세요Бул жерге документтерди тапшырыныз.불 제르게 다쿠멘테르디 답쉬르느즈
118511874고용관련신고기타사항재고용근로자 안내문을 잘읽어보세요.Кайрадан жумушка алынган жумушчу учун арналган маалымат китепчени окуп чыгыныз.카이라단 주무쉬카 알린간 주무쉬추 우춘 아르날간 말리맛 키텝체니 오쿱 치기니즈.
118611884고용관련신고기타사항재발급을 원합니다.Кайрадан жасаткым келет.카이라단 자삿큼 켈렛
118711894고용관련신고기타사항체류기간 연장을 해야 합니다.Визаны узартуу керек.비자느 우자르투 케렉
118811904고용관련신고기타사항체류기간이 만료되었습니다.Визаныздын мнту тугнду.비자느즈든 모노투 투곤두
118911914고용관련신고기타사항체류지 변경시 반드시 전입신고를 해야 합니다Жашаган жеринизди згрткнд сзсуз згргндугу тууралуу маалымат беришиниз керек자샤간 제리니즈디 오주고르트곤도 소추주 오주고르곤두구 투랄루 말리맛 베리쉬니즈 켈렉
119011924고용관련신고기타사항출국시 외국인등록증은 공항 출입국관리사무소에 반납해야 합니다.Кореядан чыгып кетип жатканда аэропорттогу миграция кызматына ID карточканы таштап кетуу керек.카레야단 츠급 케팁 잣간다 아에라포르토구 미그라씨야 크즈마트나 아이디 카르토츠카느 타쉬탑 케투 케렉
119111934고용관련신고기타사항출국예정일 변경 또는 재입국을 포기하고자할때는 한국산업인력공단 해외주재사무소로반드시 연락하여야합니다.Кореяга кайра кирип келуучу кунду згртснр же келбну кааласанар Корея жумушчулар жана ндуруш агенттигине айтып коюнуздар카레야가 카이라 키립 켈류추 쿤두 오즈고르초노르 제 켈보누 칼라사나르 카레야 주무쉬츨라르 자나 온두루쉬 아겐티키네 아이팁 코유누즈다르
119211944고용관련신고기타사항한국체류기간이 얼마나 남았습니까Корея визаныздын мнту канча калды카레야 비자느즈든 모노투 칸차 칼드