Overview

Dataset statistics

Number of variables22
Number of observations10000
Missing cells44843
Missing cells (%)20.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.9 MiB
Average record size in memory195.0 B

Variable types

Text7
DateTime4
Categorical8
Unsupported3

Dataset

Description통계표에 활용되는 표준코드 정보 테이블로, 표준분류 및 표준분류값의 정보들이 있음. 단위코드는 표준코드와 자료등록 구조가 달라 포함하지않음
Author통계청
URLhttps://www.data.go.kr/data/15072662/fileData.do

Alerts

조직번호 has constant value ""Constant
코드타입구분 has constant value ""Constant
코드색인구분 has constant value ""Constant
상위코드타입번호 is highly imbalanced (95.6%)Imbalance
상위코드색인번호 is highly imbalanced (95.6%)Imbalance
유효시작일 is highly imbalanced (99.7%)Imbalance
유효종료일 is highly imbalanced (99.7%)Imbalance
영문코드명 has 928 (9.3%) missing valuesMissing
한글코드약어명 has 10000 (100.0%) missing valuesMissing
영문코드약어명 has 10000 (100.0%) missing valuesMissing
상위상세코드 has 865 (8.6%) missing valuesMissing
함수값여부 has 10000 (100.0%) missing valuesMissing
코드값최초등록일 has 4061 (40.6%) missing valuesMissing
코드값최종변경일 has 4063 (40.6%) missing valuesMissing
코드번호 has 4925 (49.2%) missing valuesMissing
한글코드약어명 is an unsupported type, check if it needs cleaning or further analysisUnsupported
영문코드약어명 is an unsupported type, check if it needs cleaning or further analysisUnsupported
함수값여부 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 00:31:07.660241
Analysis finished2023-12-12 00:31:09.107372
Duration1.45 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct222
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T09:31:09.386595image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length3
Mean length5.842
Min length3

Characters and Unicode

Total characters58420
Distinct characters32
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique16 ?
Unique (%)0.2%

Sample

1st rowSGG_201804
2nd rowSWS
3rd rowSYJ
4th rowSGG_201804
5th row00B
ValueCountFrequency (%)
sgg_201807 1360
13.6%
sgg_201804 1350
13.5%
sgg_202007 1350
13.5%
hjg 1261
12.6%
00b 664
 
6.6%
ind 663
 
6.6%
skt 580
 
5.8%
gsh 302
 
3.0%
gbj 262
 
2.6%
sge 103
 
1.0%
Other values (212) 2105
21.1%
2023-12-12T09:31:09.914769image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
G 10882
18.6%
0 10798
18.5%
S 5855
10.0%
2 5410
9.3%
_ 4060
 
6.9%
1 2710
 
4.6%
8 2710
 
4.6%
7 2710
 
4.6%
J 2159
 
3.7%
H 1844
 
3.2%
Other values (22) 9282
15.9%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 28672
49.1%
Decimal Number 25688
44.0%
Connector Punctuation 4060
 
6.9%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
G 10882
38.0%
S 5855
20.4%
J 2159
 
7.5%
H 1844
 
6.4%
B 1714
 
6.0%
N 877
 
3.1%
K 832
 
2.9%
D 781
 
2.7%
I 764
 
2.7%
T 650
 
2.3%
Other values (15) 2314
 
8.1%
Decimal Number
ValueCountFrequency (%)
0 10798
42.0%
2 5410
21.1%
1 2710
 
10.5%
8 2710
 
10.5%
7 2710
 
10.5%
4 1350
 
5.3%
Connector Punctuation
ValueCountFrequency (%)
_ 4060
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 29748
50.9%
Latin 28672
49.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
G 10882
38.0%
S 5855
20.4%
J 2159
 
7.5%
H 1844
 
6.4%
B 1714
 
6.0%
N 877
 
3.1%
K 832
 
2.9%
D 781
 
2.7%
I 764
 
2.7%
T 650
 
2.3%
Other values (15) 2314
 
8.1%
Common
ValueCountFrequency (%)
0 10798
36.3%
2 5410
18.2%
_ 4060
 
13.6%
1 2710
 
9.1%
8 2710
 
9.1%
7 2710
 
9.1%
4 1350
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 58420
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
G 10882
18.6%
0 10798
18.5%
S 5855
10.0%
2 5410
9.3%
_ 4060
 
6.9%
1 2710
 
4.6%
8 2710
 
4.6%
7 2710
 
4.6%
J 2159
 
3.7%
H 1844
 
3.2%
Other values (22) 9282
15.9%
Distinct222
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T09:31:10.108462image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length13
Mean length12.0684
Min length2

Characters and Unicode

Total characters120684
Distinct characters211
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique16 ?
Unique (%)0.2%

Sample

1st row행정구역별 (ver 2018. 4.)
2nd row사망원인별(103항목)
3rd row수요자별
4th row행정구역별 (ver 2018. 4.)
5th row산업별
ValueCountFrequency (%)
행정구역별 4157
18.7%
ver 4060
18.3%
2018 2710
12.2%
7 2710
12.2%
4 1350
 
6.1%
2020 1350
 
6.1%
행정구역별(구 1261
 
5.7%
산업별 664
 
3.0%
산업분류별(9차 663
 
3.0%
sktc별 580
 
2.6%
Other values (214) 2675
12.1%
2023-12-12T09:31:10.506399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
12180
 
10.1%
9982
 
8.3%
. 8120
 
6.7%
6854
 
5.7%
( 6209
 
5.1%
) 6209
 
5.1%
5625
 
4.7%
5607
 
4.6%
5537
 
4.6%
2 5495
 
4.6%
Other values (201) 48866
40.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 51796
42.9%
Decimal Number 21336
17.7%
Lowercase Letter 12283
 
10.2%
Space Separator 12180
 
10.1%
Other Punctuation 8128
 
6.7%
Open Punctuation 6209
 
5.1%
Close Punctuation 6209
 
5.1%
Uppercase Letter 2440
 
2.0%
Dash Punctuation 103
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9982
19.3%
6854
13.2%
5625
10.9%
5607
10.8%
5537
10.7%
1606
 
3.1%
1530
 
3.0%
1238
 
2.4%
1107
 
2.1%
1052
 
2.0%
Other values (176) 11658
22.5%
Decimal Number
ValueCountFrequency (%)
2 5495
25.8%
0 5450
25.5%
1 2750
12.9%
8 2710
12.7%
7 2710
12.7%
4 1350
 
6.3%
9 663
 
3.1%
3 122
 
0.6%
6 85
 
0.4%
5 1
 
< 0.1%
Uppercase Letter
ValueCountFrequency (%)
T 620
25.4%
C 620
25.4%
K 580
23.8%
S 580
23.8%
I 40
 
1.6%
Other Punctuation
ValueCountFrequency (%)
. 8120
99.9%
· 5
 
0.1%
, 3
 
< 0.1%
Lowercase Letter
ValueCountFrequency (%)
e 4163
33.9%
r 4060
33.1%
v 4060
33.1%
Space Separator
ValueCountFrequency (%)
12180
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6209
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6209
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 103
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 54165
44.9%
Hangul 51796
42.9%
Latin 14723
 
12.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9982
19.3%
6854
13.2%
5625
10.9%
5607
10.8%
5537
10.7%
1606
 
3.1%
1530
 
3.0%
1238
 
2.4%
1107
 
2.1%
1052
 
2.0%
Other values (176) 11658
22.5%
Common
ValueCountFrequency (%)
12180
22.5%
. 8120
15.0%
( 6209
11.5%
) 6209
11.5%
2 5495
10.1%
0 5450
10.1%
1 2750
 
5.1%
8 2710
 
5.0%
7 2710
 
5.0%
4 1350
 
2.5%
Other values (7) 982
 
1.8%
Latin
ValueCountFrequency (%)
e 4163
28.3%
r 4060
27.6%
v 4060
27.6%
T 620
 
4.2%
C 620
 
4.2%
K 580
 
3.9%
S 580
 
3.9%
I 40
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 68883
57.1%
Hangul 51796
42.9%
None 5
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
12180
17.7%
. 8120
11.8%
( 6209
9.0%
) 6209
9.0%
2 5495
8.0%
0 5450
7.9%
e 4163
 
6.0%
r 4060
 
5.9%
v 4060
 
5.9%
1 2750
 
4.0%
Other values (14) 10187
14.8%
Hangul
ValueCountFrequency (%)
9982
19.3%
6854
13.2%
5625
10.9%
5607
10.8%
5537
10.7%
1606
 
3.1%
1530
 
3.0%
1238
 
2.4%
1107
 
2.1%
1052
 
2.0%
Other values (176) 11658
22.5%
None
ValueCountFrequency (%)
· 5
100.0%
Distinct8
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2010-05-13 00:00:00
Maximum2020-07-01 00:00:00
2023-12-12T09:31:10.630426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:31:10.757308image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
Distinct8
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2010-05-13 00:00:00
Maximum2020-07-01 00:00:00
2023-12-12T09:31:10.905361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:31:11.011530image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)

조직번호
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
101
10000 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row101
2nd row101
3rd row101
4th row101
5th row101

Common Values

ValueCountFrequency (%)
101 10000
100.0%

Length

2023-12-12T09:31:11.116380image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:31:11.199270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
101 10000
100.0%
Distinct6816
Distinct (%)68.2%
Missing1
Missing (%)< 0.1%
Memory size156.2 KiB
2023-12-12T09:31:11.492029image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length7
Mean length5.7320732
Min length1

Characters and Unicode

Total characters57315
Distinct characters35
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4882 ?
Unique (%)48.8%

Sample

1st row3709058
2nd rowC
3rd row1020204
4th row3833037
5th rowR93991
ValueCountFrequency (%)
00 45
 
0.5%
2 41
 
0.4%
10 32
 
0.3%
1 31
 
0.3%
0 28
 
0.3%
30 26
 
0.3%
40 24
 
0.2%
20 24
 
0.2%
3 23
 
0.2%
4 18
 
0.2%
Other values (6806) 9707
97.1%
2023-12-12T09:31:11.948853image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 11775
20.5%
3 9800
17.1%
1 9223
16.1%
2 6451
11.3%
5 4122
 
7.2%
4 3367
 
5.9%
6 3308
 
5.8%
7 2496
 
4.4%
8 1887
 
3.3%
9 1799
 
3.1%
Other values (25) 3087
 
5.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 54228
94.6%
Uppercase Letter 3087
 
5.4%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A 622
20.1%
D 427
13.8%
C 326
10.6%
N 307
9.9%
G 256
8.3%
H 139
 
4.5%
P 125
 
4.0%
J 110
 
3.6%
B 100
 
3.2%
I 87
 
2.8%
Other values (15) 588
19.0%
Decimal Number
ValueCountFrequency (%)
0 11775
21.7%
3 9800
18.1%
1 9223
17.0%
2 6451
11.9%
5 4122
 
7.6%
4 3367
 
6.2%
6 3308
 
6.1%
7 2496
 
4.6%
8 1887
 
3.5%
9 1799
 
3.3%

Most occurring scripts

ValueCountFrequency (%)
Common 54228
94.6%
Latin 3087
 
5.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
A 622
20.1%
D 427
13.8%
C 326
10.6%
N 307
9.9%
G 256
8.3%
H 139
 
4.5%
P 125
 
4.0%
J 110
 
3.6%
B 100
 
3.2%
I 87
 
2.8%
Other values (15) 588
19.0%
Common
ValueCountFrequency (%)
0 11775
21.7%
3 9800
18.1%
1 9223
17.0%
2 6451
11.9%
5 4122
 
7.6%
4 3367
 
6.2%
6 3308
 
6.1%
7 2496
 
4.6%
8 1887
 
3.5%
9 1799
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 57315
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 11775
20.5%
3 9800
17.1%
1 9223
16.1%
2 6451
11.3%
5 4122
 
7.2%
4 3367
 
5.9%
6 3308
 
5.8%
7 2496
 
4.4%
8 1887
 
3.3%
9 1799
 
3.1%
Other values (25) 3087
 
5.4%

코드타입구분
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
11
10000 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row11
2nd row11
3rd row11
4th row11
5th row11

Common Values

ValueCountFrequency (%)
11 10000
100.0%

Length

2023-12-12T09:31:12.072470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:31:12.166576image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
11 10000
100.0%

코드색인구분
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
101
10000 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row101
2nd row101
3rd row101
4th row101
5th row101

Common Values

ValueCountFrequency (%)
101 10000
100.0%

Length

2023-12-12T09:31:12.262691image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:31:12.408919image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
101 10000
100.0%
Distinct6885
Distinct (%)68.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T09:31:12.750634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length46
Median length42
Mean length5.6824
Min length1

Characters and Unicode

Total characters56824
Distinct characters787
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4873 ?
Unique (%)48.7%

Sample

1st row점촌2동
2nd row피부 및 피부밑조직의 질환 (L00-L98)
3rd row도소매업
4th row계성면
5th row예식장업
ValueCountFrequency (%)
876
 
5.7%
제조업 475
 
3.1%
기타 353
 
2.3%
도매업 92
 
0.6%
서비스업 90
 
0.6%
소매업 78
 
0.5%
또는 67
 
0.4%
합계 59
 
0.4%
별개의 58
 
0.4%
그외 54
 
0.4%
Other values (7458) 13171
85.7%
2023-12-12T09:31:13.328437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5375
 
9.5%
3438
 
6.1%
1748
 
3.1%
1469
 
2.6%
1385
 
2.4%
1002
 
1.8%
930
 
1.6%
719
 
1.3%
718
 
1.3%
1 677
 
1.2%
Other values (777) 39363
69.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 47260
83.2%
Space Separator 5376
 
9.5%
Decimal Number 2337
 
4.1%
Other Punctuation 433
 
0.8%
Uppercase Letter 366
 
0.6%
Open Punctuation 338
 
0.6%
Close Punctuation 335
 
0.6%
Lowercase Letter 208
 
0.4%
Dash Punctuation 136
 
0.2%
Currency Symbol 19
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3438
 
7.3%
1748
 
3.7%
1469
 
3.1%
1385
 
2.9%
1002
 
2.1%
930
 
2.0%
719
 
1.5%
718
 
1.5%
625
 
1.3%
519
 
1.1%
Other values (689) 34707
73.4%
Uppercase Letter
ValueCountFrequency (%)
C 58
15.8%
A 30
 
8.2%
S 24
 
6.6%
U 22
 
6.0%
R 21
 
5.7%
I 17
 
4.6%
V 15
 
4.1%
K 14
 
3.8%
F 13
 
3.6%
J 13
 
3.6%
Other values (24) 139
38.0%
Lowercase Letter
ValueCountFrequency (%)
e 36
17.3%
t 32
15.4%
r 24
11.5%
m 20
9.6%
c 20
9.6%
n 17
8.2%
i 16
7.7%
o 16
7.7%
a 6
 
2.9%
l 5
 
2.4%
Other values (7) 16
7.7%
Decimal Number
ValueCountFrequency (%)
1 677
29.0%
2 650
27.8%
3 284
12.2%
4 173
 
7.4%
0 136
 
5.8%
5 125
 
5.3%
9 86
 
3.7%
6 76
 
3.3%
8 63
 
2.7%
7 62
 
2.7%
Other values (5) 5
 
0.2%
Other Punctuation
ValueCountFrequency (%)
, 172
39.7%
. 126
29.1%
· 87
20.1%
/ 23
 
5.3%
; 7
 
1.6%
7
 
1.6%
: 5
 
1.2%
% 2
 
0.5%
# 2
 
0.5%
2
 
0.5%
Math Symbol
ValueCountFrequency (%)
+ 11
68.8%
~ 3
 
18.8%
> 1
 
6.2%
< 1
 
6.2%
Space Separator
ValueCountFrequency (%)
5375
> 99.9%
  1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 337
99.7%
1
 
0.3%
Close Punctuation
ValueCountFrequency (%)
) 334
99.7%
1
 
0.3%
Dash Punctuation
ValueCountFrequency (%)
- 136
100.0%
Currency Symbol
ValueCountFrequency (%)
$ 19
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 47260
83.2%
Common 8990
 
15.8%
Latin 574
 
1.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3438
 
7.3%
1748
 
3.7%
1469
 
3.1%
1385
 
2.9%
1002
 
2.1%
930
 
2.0%
719
 
1.5%
718
 
1.5%
625
 
1.3%
519
 
1.1%
Other values (689) 34707
73.4%
Latin
ValueCountFrequency (%)
C 58
 
10.1%
e 36
 
6.3%
t 32
 
5.6%
A 30
 
5.2%
r 24
 
4.2%
S 24
 
4.2%
U 22
 
3.8%
R 21
 
3.7%
m 20
 
3.5%
c 20
 
3.5%
Other values (41) 287
50.0%
Common
ValueCountFrequency (%)
5375
59.8%
1 677
 
7.5%
2 650
 
7.2%
( 337
 
3.7%
) 334
 
3.7%
3 284
 
3.2%
4 173
 
1.9%
, 172
 
1.9%
- 136
 
1.5%
0 136
 
1.5%
Other values (27) 716
 
8.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 47260
83.2%
ASCII 9437
 
16.6%
None 127
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5375
57.0%
1 677
 
7.2%
2 650
 
6.9%
( 337
 
3.6%
) 334
 
3.5%
3 284
 
3.0%
4 173
 
1.8%
, 172
 
1.8%
- 136
 
1.4%
0 136
 
1.4%
Other values (58) 1163
 
12.3%
Hangul
ValueCountFrequency (%)
3438
 
7.3%
1748
 
3.7%
1469
 
3.1%
1385
 
2.9%
1002
 
2.1%
930
 
2.0%
719
 
1.5%
718
 
1.5%
625
 
1.3%
519
 
1.1%
Other values (689) 34707
73.4%
None
ValueCountFrequency (%)
· 87
68.5%
7
 
5.5%
5
 
3.9%
4
 
3.1%
3
 
2.4%
3
 
2.4%
3
 
2.4%
2
 
1.6%
2
 
1.6%
  1
 
0.8%
Other values (10) 10
 
7.9%

영문코드명
Text

MISSING 

Distinct6084
Distinct (%)67.1%
Missing928
Missing (%)9.3%
Memory size156.2 KiB
2023-12-12T09:31:13.679833image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length146
Median length89
Mean length19.243607
Min length2

Characters and Unicode

Total characters174578
Distinct characters81
Distinct categories12 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4144 ?
Unique (%)45.7%

Sample

1st rowJeomchon 2(i)-dong
2nd rowDiseases of the skin and subcutaneous tissue (L00-L98)
3rd rowWhole sale retail sale
4th rowGyeseong-myeon
5th rowWedding Chapel Services
ValueCountFrequency (%)
of 1015
 
4.9%
and 827
 
4.0%
2(i)-dong 533
 
2.6%
1(il)-dong 508
 
2.5%
manufacture 452
 
2.2%
other 346
 
1.7%
3(sam)-dong 215
 
1.0%
products 166
 
0.8%
services 123
 
0.6%
equipment 105
 
0.5%
Other values (5792) 16366
79.2%
2023-12-12T09:31:14.353772image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
n 16640
 
9.5%
o 14990
 
8.6%
e 14441
 
8.3%
12083
 
6.9%
a 11403
 
6.5%
g 8543
 
4.9%
i 8516
 
4.9%
s 6882
 
3.9%
r 6719
 
3.8%
d 6235
 
3.6%
Other values (71) 68126
39.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 135786
77.8%
Uppercase Letter 14131
 
8.1%
Space Separator 12083
 
6.9%
Dash Punctuation 5932
 
3.4%
Decimal Number 2241
 
1.3%
Open Punctuation 1821
 
1.0%
Close Punctuation 1813
 
1.0%
Other Punctuation 746
 
0.4%
Currency Symbol 19
 
< 0.1%
Connector Punctuation 3
 
< 0.1%
Other values (2) 3
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
n 16640
12.3%
o 14990
11.0%
e 14441
10.6%
a 11403
 
8.4%
g 8543
 
6.3%
i 8516
 
6.3%
s 6882
 
5.1%
r 6719
 
4.9%
d 6235
 
4.6%
t 6160
 
4.5%
Other values (16) 35257
26.0%
Uppercase Letter
ValueCountFrequency (%)
S 1915
13.6%
M 1270
 
9.0%
G 985
 
7.0%
C 933
 
6.6%
P 835
 
5.9%
B 755
 
5.3%
D 744
 
5.3%
O 689
 
4.9%
A 683
 
4.8%
H 609
 
4.3%
Other values (16) 4713
33.4%
Decimal Number
ValueCountFrequency (%)
1 651
29.0%
2 637
28.4%
3 272
12.1%
4 165
 
7.4%
0 143
 
6.4%
5 111
 
5.0%
9 83
 
3.7%
6 72
 
3.2%
7 54
 
2.4%
8 53
 
2.4%
Other Punctuation
ValueCountFrequency (%)
. 485
65.0%
, 135
 
18.1%
; 61
 
8.2%
· 35
 
4.7%
/ 22
 
2.9%
& 5
 
0.7%
: 2
 
0.3%
' 1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 1820
99.9%
[ 1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 1812
99.9%
] 1
 
0.1%
Letter Number
ValueCountFrequency (%)
1
50.0%
1
50.0%
Space Separator
ValueCountFrequency (%)
12083
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5932
100.0%
Currency Symbol
ValueCountFrequency (%)
$ 19
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 3
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 149919
85.9%
Common 24659
 
14.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
n 16640
 
11.1%
o 14990
 
10.0%
e 14441
 
9.6%
a 11403
 
7.6%
g 8543
 
5.7%
i 8516
 
5.7%
s 6882
 
4.6%
r 6719
 
4.5%
d 6235
 
4.2%
t 6160
 
4.1%
Other values (44) 49390
32.9%
Common
ValueCountFrequency (%)
12083
49.0%
- 5932
24.1%
( 1820
 
7.4%
) 1812
 
7.3%
1 651
 
2.6%
2 637
 
2.6%
. 485
 
2.0%
3 272
 
1.1%
4 165
 
0.7%
0 143
 
0.6%
Other values (17) 659
 
2.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 174541
> 99.9%
None 35
 
< 0.1%
Number Forms 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
n 16640
 
9.5%
o 14990
 
8.6%
e 14441
 
8.3%
12083
 
6.9%
a 11403
 
6.5%
g 8543
 
4.9%
i 8516
 
4.9%
s 6882
 
3.9%
r 6719
 
3.8%
d 6235
 
3.6%
Other values (68) 68089
39.0%
None
ValueCountFrequency (%)
· 35
100.0%
Number Forms
ValueCountFrequency (%)
1
50.0%
1
50.0%

한글코드약어명
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

영문코드약어명
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

상위코드타입번호
Categorical

IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
11
9918 
21
 
80
<NA>
 
2

Length

Max length4
Median length2
Mean length2.0004
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row11
2nd row11
3rd row11
4th row11
5th row11

Common Values

ValueCountFrequency (%)
11 9918
99.2%
21 80
 
0.8%
<NA> 2
 
< 0.1%

Length

2023-12-12T09:31:14.550502image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:31:14.683569image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
11 9918
99.2%
21 80
 
0.8%
na 2
 
< 0.1%

상위코드색인번호
Categorical

IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
101
9918 
100
 
80
<NA>
 
2

Length

Max length4
Median length3
Mean length3.0002
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row101
2nd row101
3rd row101
4th row101
5th row101

Common Values

ValueCountFrequency (%)
101 9918
99.2%
100 80
 
0.8%
<NA> 2
 
< 0.1%

Length

2023-12-12T09:31:14.815490image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:31:14.939653image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
101 9918
99.2%
100 80
 
0.8%
na 2
 
< 0.1%

상위상세코드
Text

MISSING 

Distinct1848
Distinct (%)20.2%
Missing865
Missing (%)8.6%
Memory size156.2 KiB
2023-12-12T09:31:15.338860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length5
Mean length4.2302135
Min length1

Characters and Unicode

Total characters38643
Distinct characters31
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique833 ?
Unique (%)9.1%

Sample

1st row37090
2nd row10202
3rd row38330
4th rowR9399
5th row1105
ValueCountFrequency (%)
31 98
 
1.1%
1 96
 
1.1%
2 86
 
0.9%
11 68
 
0.7%
3 63
 
0.7%
pa 53
 
0.6%
21 49
 
0.5%
35020 46
 
0.5%
35030 46
 
0.5%
39010 43
 
0.5%
Other values (1838) 8487
92.9%
2023-12-12T09:31:15.900679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 8947
23.2%
3 6848
17.7%
1 6513
16.9%
2 4524
11.7%
4 2081
 
5.4%
5 1810
 
4.7%
6 1527
 
4.0%
7 1468
 
3.8%
8 1256
 
3.3%
9 1016
 
2.6%
Other values (21) 2653
 
6.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 35990
93.1%
Uppercase Letter 2653
 
6.9%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A 574
21.6%
D 398
15.0%
N 295
11.1%
C 290
10.9%
G 232
8.7%
P 116
 
4.4%
H 111
 
4.2%
J 85
 
3.2%
F 74
 
2.8%
I 73
 
2.8%
Other values (11) 405
15.3%
Decimal Number
ValueCountFrequency (%)
0 8947
24.9%
3 6848
19.0%
1 6513
18.1%
2 4524
12.6%
4 2081
 
5.8%
5 1810
 
5.0%
6 1527
 
4.2%
7 1468
 
4.1%
8 1256
 
3.5%
9 1016
 
2.8%

Most occurring scripts

ValueCountFrequency (%)
Common 35990
93.1%
Latin 2653
 
6.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
A 574
21.6%
D 398
15.0%
N 295
11.1%
C 290
10.9%
G 232
8.7%
P 116
 
4.4%
H 111
 
4.2%
J 85
 
3.2%
F 74
 
2.8%
I 73
 
2.8%
Other values (11) 405
15.3%
Common
ValueCountFrequency (%)
0 8947
24.9%
3 6848
19.0%
1 6513
18.1%
2 4524
12.6%
4 2081
 
5.8%
5 1810
 
5.0%
6 1527
 
4.2%
7 1468
 
4.1%
8 1256
 
3.5%
9 1016
 
2.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 38643
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 8947
23.2%
3 6848
17.7%
1 6513
16.9%
2 4524
11.7%
4 2081
 
5.4%
5 1810
 
4.7%
6 1527
 
4.0%
7 1468
 
3.8%
8 1256
 
3.3%
9 1016
 
2.6%
Other values (21) 2653
 
6.9%

유효시작일
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9998 
19700101
 
2

Length

Max length8
Median length4
Mean length4.0008
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9998
> 99.9%
19700101 2
 
< 0.1%

Length

2023-12-12T09:31:16.056556image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:31:16.179638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9998
> 99.9%
19700101 2
 
< 0.1%

유효종료일
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
9998 
99991231
 
2

Length

Max length8
Median length4
Mean length4.0008
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 9998
> 99.9%
99991231 2
 
< 0.1%

Length

2023-12-12T09:31:16.306656image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:31:16.413311image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 9998
> 99.9%
99991231 2
 
< 0.1%

함수값여부
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB
Distinct8
Distinct (%)0.1%
Missing4061
Missing (%)40.6%
Memory size156.2 KiB
Minimum2010-05-13 00:00:00
Maximum2017-07-04 00:00:00
2023-12-12T09:31:16.509955image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:31:16.629967image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
Distinct7
Distinct (%)0.1%
Missing4063
Missing (%)40.6%
Memory size156.2 KiB
Minimum2010-05-13 00:00:00
Maximum2017-07-04 00:00:00
2023-12-12T09:31:16.749750image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:31:16.841436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
101
5937 
<NA>
4063 

Length

Max length4
Median length3
Mean length3.4063
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row101
3rd row101
4th row<NA>
5th row101

Common Values

ValueCountFrequency (%)
101 5937
59.4%
<NA> 4063
40.6%

Length

2023-12-12T09:31:16.939679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:31:17.028681image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
101 5937
59.4%
na 4063
40.6%

코드번호
Text

MISSING 

Distinct5075
Distinct (%)100.0%
Missing4925
Missing (%)49.2%
Memory size156.2 KiB
2023-12-12T09:31:17.219787image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length15
Mean length12.89064
Min length8

Characters and Unicode

Total characters65420
Distinct characters35
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5075 ?
Unique (%)100.0%

Sample

1st row11101SWSC
2nd row11101SYJ1020204
3rd row1110100BR93991
4th row11101GHD110501
5th row11101HJG3603039
ValueCountFrequency (%)
11101yre440 1
 
< 0.1%
11101hak03 1
 
< 0.1%
11101hjg34012 1
 
< 0.1%
11101hjg1102057 1
 
< 0.1%
11101hjg3536037 1
 
< 0.1%
11101gbja09202 1
 
< 0.1%
11101hjg2112053 1
 
< 0.1%
11101skt66214 1
 
< 0.1%
11101hjg3105154 1
 
< 0.1%
11101skt1a19 1
 
< 0.1%
Other values (5065) 5065
99.8%
2023-12-12T09:31:17.590996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 24605
37.6%
0 11441
17.5%
3 3355
 
5.1%
2 3044
 
4.7%
G 2617
 
4.0%
J 2221
 
3.4%
H 1932
 
3.0%
B 1768
 
2.7%
5 1617
 
2.5%
S 1596
 
2.4%
Other values (25) 11224
17.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 49335
75.4%
Uppercase Letter 16085
 
24.6%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
G 2617
16.3%
J 2221
13.8%
H 1932
12.0%
B 1768
11.0%
S 1596
9.9%
K 852
 
5.3%
T 655
 
4.1%
A 639
 
4.0%
D 535
 
3.3%
N 496
 
3.1%
Other values (15) 2774
17.2%
Decimal Number
ValueCountFrequency (%)
1 24605
49.9%
0 11441
23.2%
3 3355
 
6.8%
2 3044
 
6.2%
5 1617
 
3.3%
4 1339
 
2.7%
6 1182
 
2.4%
7 1036
 
2.1%
9 912
 
1.8%
8 804
 
1.6%

Most occurring scripts

ValueCountFrequency (%)
Common 49335
75.4%
Latin 16085
 
24.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
G 2617
16.3%
J 2221
13.8%
H 1932
12.0%
B 1768
11.0%
S 1596
9.9%
K 852
 
5.3%
T 655
 
4.1%
A 639
 
4.0%
D 535
 
3.3%
N 496
 
3.1%
Other values (15) 2774
17.2%
Common
ValueCountFrequency (%)
1 24605
49.9%
0 11441
23.2%
3 3355
 
6.8%
2 3044
 
6.2%
5 1617
 
3.3%
4 1339
 
2.7%
6 1182
 
2.4%
7 1036
 
2.1%
9 912
 
1.8%
8 804
 
1.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 65420
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 24605
37.6%
0 11441
17.5%
3 3355
 
5.1%
2 3044
 
4.7%
G 2617
 
4.0%
J 2221
 
3.4%
H 1932
 
3.0%
B 1768
 
2.7%
5 1617
 
2.5%
S 1596
 
2.4%
Other values (25) 11224
17.2%

Sample

그룹코드한글그룹코드명최초등록일최종변경일조직번호상세코드코드타입구분코드색인구분한글코드명영문코드명한글코드약어명영문코드약어명상위코드타입번호상위코드색인번호상위상세코드유효시작일유효종료일함수값여부코드값최초등록일코드값최종변경일코드값 조직번호코드번호
16928SGG_201804행정구역별 (ver 2018. 4.)2018-04-112018-04-11101370905811101점촌2동Jeomchon 2(i)-dong<NA><NA>1110137090<NA><NA><NA><NA><NA><NA><NA>
27762SWS사망원인별(103항목)2010-05-132010-05-13101C11101피부 및 피부밑조직의 질환 (L00-L98)Diseases of the skin and subcutaneous tissue (L00-L98)<NA><NA>11101<NA><NA><NA><NA>2010-05-132010-05-1310111101SWSC
27832SYJ수요자별2010-05-132010-05-13101102020411101도소매업Whole sale retail sale<NA><NA>1110110202<NA><NA><NA>2010-05-132010-05-1310111101SYJ1020204
17311SGG_201804행정구역별 (ver 2018. 4.)2018-04-112018-04-11101383303711101계성면Gyeseong-myeon<NA><NA>1110138330<NA><NA><NA><NA><NA><NA><NA>
183100B산업별2010-05-132010-05-13101R9399111101예식장업Wedding Chapel Services<NA><NA>11101R9399<NA><NA><NA>2010-05-132010-05-131011110100BR93991
3864GHD경제활동별2010-05-132010-05-1310111050111101정보처리 및 기타 컴퓨터 운영 관련업(산업)Industry<NA><NA>111011105<NA><NA><NA>2010-05-132010-05-1310111101GHD110501
17600SGG_201807행정구역별 (ver 2018. 7.)2018-08-212018-08-21101110906111101번2동Beon 2(i)-dong<NA><NA>1110111090<NA><NA><NA><NA><NA><NA><NA>
15945SGG_201804행정구역별 (ver 2018. 4.)2018-04-112018-04-11101340125211101성정2동Seongjeong 2(i)-dong<NA><NA>1110134012<NA><NA><NA><NA><NA><NA><NA>
21258SGG_202007행정구역별 (ver 2020. 7.)2020-07-012020-07-01101110106911101창신3동Changsin 3(sam)-dong<NA><NA>1110111010<NA><NA><NA><NA><NA><NA><NA>
21050SGG_201807행정구역별 (ver 2018. 7.)2018-08-212018-08-21101381155411101여좌동Yeojwa-dong<NA><NA>1110138115<NA><NA><NA><NA><NA><NA><NA>
그룹코드한글그룹코드명최초등록일최종변경일조직번호상세코드코드타입구분코드색인구분한글코드명영문코드명한글코드약어명영문코드약어명상위코드타입번호상위코드색인번호상위상세코드유효시작일유효종료일함수값여부코드값최초등록일코드값최종변경일코드값 조직번호코드번호
23000B산업별2010-05-132010-05-13101D1740911101기타 섬유 염색 및 정리업Other Dyeing and Finishing Textiles<NA><NA>11101D1740<NA><NA><NA>2010-05-132010-05-131011110100BD17409
12815PGM폐기물종류별2010-05-132010-05-13101011101합계<NA><NA><NA>11101<NA><NA><NA><NA>2010-05-132010-05-1310111101PGM0
1887BIH법인형태별2010-05-132010-05-13101BIH11101법인분류별<NA><NA><NA>21100<NA><NA><NA><NA>2010-05-132010-05-1310111101BIH
10057IND산업분류별(9차)2012-11-072012-11-07101C2711101의료, 정밀, 광학기기 및 시계 제조업Manufacture of Medical, Precision and Optical Instruments, Watches and Clocks<NA><NA>11101C<NA><NA><NA>2012-11-072012-11-07101<NA>
7551HJG행정구역별(구)2010-05-132012-10-23101324003311101토성면Toseong-myeon<NA><NA>1110132400<NA><NA><NA>2014-04-302014-04-3010111101HJG3240033
20842SGG_201807행정구역별 (ver 2018. 7.)2018-08-212018-08-21101374103211101봉성면Bongseong-myeon<NA><NA>1110137410<NA><NA><NA><NA><NA><NA><NA>
13891SGG_201804행정구역별 (ver 2018. 4.)2018-04-112018-04-11101111307211101북가좌2동Bukgajwa 2(i)-dong<NA><NA>1110111130<NA><NA><NA><NA><NA><NA><NA>
21131SGG_201807행정구역별 (ver 2018. 7.)2018-08-212018-08-21101383603211101악양면Agyang-myeon<NA><NA>1110138360<NA><NA><NA><NA><NA><NA><NA>
10936IND산업분류별(9차)2012-11-072012-11-07101L6811211101비주거용 건물 임대업Renting of Non-Residential Buildings<NA><NA>11101L6811<NA><NA><NA>2012-11-072012-11-07101<NA>
4525GSH가계수지항목별2010-05-132010-05-13101I2911101기타부채증가Increasing other debts<NA><NA>11101I2<NA><NA><NA>2010-05-132010-05-1310111101GSHI29