Overview

Dataset statistics

Number of variables9
Number of observations1191
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory85.0 KiB
Average record size in memory73.1 B

Variable types

Numeric1
Text7
Categorical1

Dataset

Description국토교통R&D를 수행하고 있는 수행기관에 대한 통합정보(기관명, 기관유형, 설립일, 주소, 대표자, 연락처) 제공
Author국토교통과학기술진흥원
URLhttps://www.data.go.kr/data/15060797/fileData.do

Alerts

기관유형 is highly imbalanced (53.5%)Imbalance
순번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 12:33:05.735533
Analysis finished2023-12-12 12:33:07.341265
Duration1.61 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct1191
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean596
Minimum1
Maximum1191
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size10.6 KiB
2023-12-12T21:33:07.459540image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile60.5
Q1298.5
median596
Q3893.5
95-th percentile1131.5
Maximum1191
Range1190
Interquartile range (IQR)595

Descriptive statistics

Standard deviation343.95639
Coefficient of variation (CV)0.57710804
Kurtosis-1.2
Mean596
Median Absolute Deviation (MAD)298
Skewness0
Sum709836
Variance118306
MonotonicityStrictly increasing
2023-12-12T21:33:07.691152image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
801 1
 
0.1%
799 1
 
0.1%
798 1
 
0.1%
797 1
 
0.1%
796 1
 
0.1%
795 1
 
0.1%
794 1
 
0.1%
793 1
 
0.1%
792 1
 
0.1%
Other values (1181) 1181
99.2%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1191 1
0.1%
1190 1
0.1%
1189 1
0.1%
1188 1
0.1%
1187 1
0.1%
1186 1
0.1%
1185 1
0.1%
1184 1
0.1%
1183 1
0.1%
1182 1
0.1%
Distinct1188
Distinct (%)99.7%
Missing0
Missing (%)0.0%
Memory size9.4 KiB
2023-12-12T21:33:08.030240image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length45
Median length32
Mean length9.6523929
Min length3

Characters and Unicode

Total characters11496
Distinct characters476
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1185 ?
Unique (%)99.5%

Sample

1st row(사) 한국금속포장산업협회
2nd row(사)대한건축학회
3rd row(사)한국공간정보연구조합
4th row(사)한국도로협회
5th row(사)한국콘크리트학회
ValueCountFrequency (%)
주식회사 178
 
11.3%
59
 
3.7%
산학협력단 37
 
2.3%
university 15
 
1.0%
사단법인 10
 
0.6%
of 9
 
0.6%
technology 4
 
0.3%
institute 3
 
0.2%
건축사사무소 3
 
0.2%
주)우진산전 3
 
0.2%
Other values (1241) 1257
79.7%
2023-12-12T21:33:08.564526image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
916
 
8.0%
) 753
 
6.6%
( 753
 
6.6%
387
 
3.4%
368
 
3.2%
301
 
2.6%
301
 
2.6%
256
 
2.2%
223
 
1.9%
197
 
1.7%
Other values (466) 7041
61.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8995
78.2%
Close Punctuation 753
 
6.6%
Open Punctuation 753
 
6.6%
Lowercase Letter 432
 
3.8%
Space Separator 387
 
3.4%
Uppercase Letter 170
 
1.5%
Other Punctuation 5
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
916
 
10.2%
368
 
4.1%
301
 
3.3%
301
 
3.3%
256
 
2.8%
223
 
2.5%
197
 
2.2%
174
 
1.9%
173
 
1.9%
172
 
1.9%
Other values (412) 5914
65.7%
Uppercase Letter
ValueCountFrequency (%)
T 17
 
10.0%
I 16
 
9.4%
S 16
 
9.4%
U 15
 
8.8%
E 13
 
7.6%
N 10
 
5.9%
A 8
 
4.7%
B 7
 
4.1%
K 7
 
4.1%
C 7
 
4.1%
Other values (15) 54
31.8%
Lowercase Letter
ValueCountFrequency (%)
e 51
11.8%
i 51
11.8%
n 42
9.7%
t 39
 
9.0%
o 33
 
7.6%
s 29
 
6.7%
r 29
 
6.7%
y 22
 
5.1%
a 19
 
4.4%
h 17
 
3.9%
Other values (12) 100
23.1%
Other Punctuation
ValueCountFrequency (%)
, 2
40.0%
. 2
40.0%
1
20.0%
Close Punctuation
ValueCountFrequency (%)
) 753
100.0%
Open Punctuation
ValueCountFrequency (%)
( 753
100.0%
Space Separator
ValueCountFrequency (%)
387
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8995
78.2%
Common 1899
 
16.5%
Latin 602
 
5.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
916
 
10.2%
368
 
4.1%
301
 
3.3%
301
 
3.3%
256
 
2.8%
223
 
2.5%
197
 
2.2%
174
 
1.9%
173
 
1.9%
172
 
1.9%
Other values (412) 5914
65.7%
Latin
ValueCountFrequency (%)
e 51
 
8.5%
i 51
 
8.5%
n 42
 
7.0%
t 39
 
6.5%
o 33
 
5.5%
s 29
 
4.8%
r 29
 
4.8%
y 22
 
3.7%
a 19
 
3.2%
T 17
 
2.8%
Other values (37) 270
44.9%
Common
ValueCountFrequency (%)
) 753
39.7%
( 753
39.7%
387
20.4%
, 2
 
0.1%
. 2
 
0.1%
1
 
0.1%
- 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8995
78.2%
ASCII 2500
 
21.7%
None 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
916
 
10.2%
368
 
4.1%
301
 
3.3%
301
 
3.3%
256
 
2.8%
223
 
2.5%
197
 
2.2%
174
 
1.9%
173
 
1.9%
172
 
1.9%
Other values (412) 5914
65.7%
ASCII
ValueCountFrequency (%)
) 753
30.1%
( 753
30.1%
387
15.5%
e 51
 
2.0%
i 51
 
2.0%
n 42
 
1.7%
t 39
 
1.6%
o 33
 
1.3%
s 29
 
1.2%
r 29
 
1.2%
Other values (43) 333
13.3%
None
ValueCountFrequency (%)
1
100.0%

기관유형
Categorical

IMBALANCE 

Distinct14
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size9.4 KiB
중소기업(연)
821 
대학교
131 
중견기업(연)
 
60
대기업(연)
 
53
정부출연(연)
 
34
Other values (9)
92 

Length

Max length15
Median length7
Mean length6.4130982
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row협회
2nd row학회
3rd row연구조합
4th row협회
5th row학회

Common Values

ValueCountFrequency (%)
중소기업(연) 821
68.9%
대학교 131
 
11.0%
중견기업(연) 60
 
5.0%
대기업(연) 53
 
4.5%
정부출연(연) 34
 
2.9%
기타 25
 
2.1%
준정부기관(비영리기관)(연) 23
 
1.9%
협회 20
 
1.7%
지자체 7
 
0.6%
학회 5
 
0.4%
Other values (4) 12
 
1.0%

Length

2023-12-12T21:33:08.758054image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
중소기업(연 821
68.9%
대학교 131
 
11.0%
중견기업(연 60
 
5.0%
대기업(연 53
 
4.5%
정부출연(연 34
 
2.9%
기타 25
 
2.1%
준정부기관(비영리기관)(연 23
 
1.9%
협회 20
 
1.7%
지자체 7
 
0.6%
학회 5
 
0.4%
Other values (4) 12
 
1.0%
Distinct1166
Distinct (%)97.9%
Missing0
Missing (%)0.0%
Memory size9.4 KiB
2023-12-12T21:33:09.079314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters14292
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1160 ?
Unique (%)97.4%

Sample

1st row138-82-01708
2nd row108-82-32137
3rd row113-82-06266
4th row116-82-04821
5th row220-82-06453
ValueCountFrequency (%)
999-99-99999 18
 
1.5%
000-00-00000 5
 
0.4%
206-82-07306 2
 
0.2%
135-82-10789 2
 
0.2%
101-82-12009 2
 
0.2%
305-82-13385 2
 
0.2%
123-82-12111 1
 
0.1%
120-81-86581 1
 
0.1%
258-86-00132 1
 
0.1%
107-88-21212 1
 
0.1%
Other values (1156) 1156
97.1%
2023-12-12T21:33:09.664152image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 2382
16.7%
1 2090
14.6%
8 1870
13.1%
0 1586
11.1%
2 1343
9.4%
6 980
6.9%
3 885
 
6.2%
4 840
 
5.9%
9 794
 
5.6%
7 768
 
5.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 11910
83.3%
Dash Punctuation 2382
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 2090
17.5%
8 1870
15.7%
0 1586
13.3%
2 1343
11.3%
6 980
8.2%
3 885
7.4%
4 840
7.1%
9 794
 
6.7%
7 768
 
6.4%
5 754
 
6.3%
Dash Punctuation
ValueCountFrequency (%)
- 2382
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 14292
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 2382
16.7%
1 2090
14.6%
8 1870
13.1%
0 1586
11.1%
2 1343
9.4%
6 980
6.9%
3 885
 
6.2%
4 840
 
5.9%
9 794
 
5.6%
7 768
 
5.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 14292
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 2382
16.7%
1 2090
14.6%
8 1870
13.1%
0 1586
11.1%
2 1343
9.4%
6 980
6.9%
3 885
 
6.2%
4 840
 
5.9%
9 794
 
5.6%
7 768
 
5.4%
Distinct452
Distinct (%)38.0%
Missing0
Missing (%)0.0%
Memory size9.4 KiB
2023-12-12T21:33:10.050962image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length7
Mean length8.2493703
Min length7

Characters and Unicode

Total characters9825
Distinct characters18
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique425 ?
Unique (%)35.7%

Sample

1st row데이터 미집계
2nd row1971-06-01
3rd row2013-12-24
4th row1966-04-27
5th row1992-09-02
ValueCountFrequency (%)
데이터 695
36.9%
미집계 695
36.9%
2004-03-01 6
 
0.3%
2004-04-01 6
 
0.3%
1999-02-01 4
 
0.2%
2004-03-26 4
 
0.2%
2004-05-03 3
 
0.2%
2010-07-08 3
 
0.2%
2003-09-23 3
 
0.2%
2016-01-01 3
 
0.2%
Other values (443) 464
24.6%
2023-12-12T21:33:10.620949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 1331
13.5%
- 992
10.1%
1 753
 
7.7%
695
 
7.1%
695
 
7.1%
695
 
7.1%
695
 
7.1%
695
 
7.1%
695
 
7.1%
695
 
7.1%
Other values (8) 1884
19.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4170
42.4%
Decimal Number 3968
40.4%
Dash Punctuation 992
 
10.1%
Space Separator 695
 
7.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1331
33.5%
1 753
19.0%
2 601
15.1%
9 392
 
9.9%
4 174
 
4.4%
3 153
 
3.9%
7 148
 
3.7%
5 145
 
3.7%
6 136
 
3.4%
8 135
 
3.4%
Other Letter
ValueCountFrequency (%)
695
16.7%
695
16.7%
695
16.7%
695
16.7%
695
16.7%
695
16.7%
Dash Punctuation
ValueCountFrequency (%)
- 992
100.0%
Space Separator
ValueCountFrequency (%)
695
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 5655
57.6%
Hangul 4170
42.4%

Most frequent character per script

Common
ValueCountFrequency (%)
0 1331
23.5%
- 992
17.5%
1 753
13.3%
695
12.3%
2 601
10.6%
9 392
 
6.9%
4 174
 
3.1%
3 153
 
2.7%
7 148
 
2.6%
5 145
 
2.6%
Other values (2) 271
 
4.8%
Hangul
ValueCountFrequency (%)
695
16.7%
695
16.7%
695
16.7%
695
16.7%
695
16.7%
695
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5655
57.6%
Hangul 4170
42.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 1331
23.5%
- 992
17.5%
1 753
13.3%
695
12.3%
2 601
10.6%
9 392
 
6.9%
4 174
 
3.1%
3 153
 
2.7%
7 148
 
2.6%
5 145
 
2.6%
Other values (2) 271
 
4.8%
Hangul
ValueCountFrequency (%)
695
16.7%
695
16.7%
695
16.7%
695
16.7%
695
16.7%
695
16.7%
Distinct866
Distinct (%)72.7%
Missing0
Missing (%)0.0%
Memory size9.4 KiB
2023-12-12T21:33:11.075966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length5
Mean length4.768262
Min length1

Characters and Unicode

Total characters5679
Distinct characters17
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique715 ?
Unique (%)60.0%

Sample

1st row16679
2nd row6687
3rd row8389
4th row13647
5th row6130
ValueCountFrequency (%)
데이터 27
 
2.2%
미집계 27
 
2.2%
5836 16
 
1.3%
14056 16
 
1.3%
13449 11
 
0.9%
16006 8
 
0.7%
8507 8
 
0.7%
14057 8
 
0.7%
8511 7
 
0.6%
8390 6
 
0.5%
Other values (857) 1084
89.0%
2023-12-12T21:33:11.867482image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 789
13.9%
4 642
11.3%
5 620
10.9%
3 605
10.7%
2 565
9.9%
6 531
9.4%
0 493
8.7%
8 484
8.5%
7 404
7.1%
9 357
6.3%
Other values (7) 189
 
3.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 5490
96.7%
Other Letter 162
 
2.9%
Space Separator 27
 
0.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 789
14.4%
4 642
11.7%
5 620
11.3%
3 605
11.0%
2 565
10.3%
6 531
9.7%
0 493
9.0%
8 484
8.8%
7 404
7.4%
9 357
6.5%
Other Letter
ValueCountFrequency (%)
27
16.7%
27
16.7%
27
16.7%
27
16.7%
27
16.7%
27
16.7%
Space Separator
ValueCountFrequency (%)
27
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 5517
97.1%
Hangul 162
 
2.9%

Most frequent character per script

Common
ValueCountFrequency (%)
1 789
14.3%
4 642
11.6%
5 620
11.2%
3 605
11.0%
2 565
10.2%
6 531
9.6%
0 493
8.9%
8 484
8.8%
7 404
7.3%
9 357
6.5%
Hangul
ValueCountFrequency (%)
27
16.7%
27
16.7%
27
16.7%
27
16.7%
27
16.7%
27
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5517
97.1%
Hangul 162
 
2.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 789
14.3%
4 642
11.6%
5 620
11.2%
3 605
11.0%
2 565
10.2%
6 531
9.6%
0 493
8.9%
8 484
8.8%
7 404
7.3%
9 357
6.5%
Hangul
ValueCountFrequency (%)
27
16.7%
27
16.7%
27
16.7%
27
16.7%
27
16.7%
27
16.7%

주소
Text

Distinct1166
Distinct (%)97.9%
Missing0
Missing (%)0.0%
Memory size9.4 KiB
2023-12-12T21:33:12.369105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length153
Median length60
Mean length33.4089
Min length7

Characters and Unicode

Total characters39790
Distinct characters542
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1161 ?
Unique (%)97.5%

Sample

1st row경기도 수원시 영통구 영통로241번길 12-29 (신동) 아람빌딩 6층 한국금속포장산업협회
2nd row서울특별시 서초구 효령로 87 (방배동) 대한건축학회 건축센터
3rd row서울 구로구 디지털로26길 5 308호 (구로동,에이스하이엔타워)
4th row경기 성남시 수정구 위례서일로 26, 8층 (창곡동)
5th row서울 강남구 테헤란로7길 22 (역삼동)
ValueCountFrequency (%)
서울 222
 
2.9%
서울특별시 180
 
2.3%
경기 167
 
2.2%
경기도 158
 
2.1%
성남시 87
 
1.1%
강남구 64
 
0.8%
분당구 53
 
0.7%
안양시 52
 
0.7%
금천구 50
 
0.6%
송파구 47
 
0.6%
Other values (3236) 6624
86.0%
2023-12-12T21:33:13.143480image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6593
 
16.6%
1 1442
 
3.6%
1414
 
3.6%
( 1160
 
2.9%
) 1160
 
2.9%
1153
 
2.9%
1044
 
2.6%
2 914
 
2.3%
851
 
2.1%
, 847
 
2.1%
Other values (532) 23212
58.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 22551
56.7%
Space Separator 6593
 
16.6%
Decimal Number 6520
 
16.4%
Close Punctuation 1162
 
2.9%
Open Punctuation 1161
 
2.9%
Other Punctuation 856
 
2.2%
Lowercase Letter 423
 
1.1%
Uppercase Letter 330
 
0.8%
Dash Punctuation 184
 
0.5%
Math Symbol 10
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1414
 
6.3%
1153
 
5.1%
1044
 
4.6%
851
 
3.8%
641
 
2.8%
562
 
2.5%
556
 
2.5%
461
 
2.0%
442
 
2.0%
430
 
1.9%
Other values (464) 14997
66.5%
Lowercase Letter
ValueCountFrequency (%)
a 52
12.3%
n 48
11.3%
e 39
9.2%
r 39
9.2%
o 38
9.0%
i 33
 
7.8%
d 25
 
5.9%
t 24
 
5.7%
g 21
 
5.0%
l 13
 
3.1%
Other values (14) 91
21.5%
Uppercase Letter
ValueCountFrequency (%)
B 39
11.8%
A 32
 
9.7%
C 31
 
9.4%
T 30
 
9.1%
I 25
 
7.6%
D 19
 
5.8%
H 17
 
5.2%
K 16
 
4.8%
S 16
 
4.8%
L 15
 
4.5%
Other values (14) 90
27.3%
Decimal Number
ValueCountFrequency (%)
1 1442
22.1%
2 914
14.0%
0 822
12.6%
3 612
9.4%
4 564
 
8.7%
5 549
 
8.4%
6 493
 
7.6%
7 418
 
6.4%
8 389
 
6.0%
9 317
 
4.9%
Other Punctuation
ValueCountFrequency (%)
, 847
98.9%
. 8
 
0.9%
/ 1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 1160
99.9%
[ 1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 1160
99.8%
] 2
 
0.2%
Space Separator
ValueCountFrequency (%)
6593
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 184
100.0%
Math Symbol
ValueCountFrequency (%)
~ 10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 22551
56.7%
Common 16486
41.4%
Latin 753
 
1.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1414
 
6.3%
1153
 
5.1%
1044
 
4.6%
851
 
3.8%
641
 
2.8%
562
 
2.5%
556
 
2.5%
461
 
2.0%
442
 
2.0%
430
 
1.9%
Other values (464) 14997
66.5%
Latin
ValueCountFrequency (%)
a 52
 
6.9%
n 48
 
6.4%
B 39
 
5.2%
e 39
 
5.2%
r 39
 
5.2%
o 38
 
5.0%
i 33
 
4.4%
A 32
 
4.2%
C 31
 
4.1%
T 30
 
4.0%
Other values (38) 372
49.4%
Common
ValueCountFrequency (%)
6593
40.0%
1 1442
 
8.7%
( 1160
 
7.0%
) 1160
 
7.0%
2 914
 
5.5%
, 847
 
5.1%
0 822
 
5.0%
3 612
 
3.7%
4 564
 
3.4%
5 549
 
3.3%
Other values (10) 1823
 
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 22551
56.7%
ASCII 17239
43.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6593
38.2%
1 1442
 
8.4%
( 1160
 
6.7%
) 1160
 
6.7%
2 914
 
5.3%
, 847
 
4.9%
0 822
 
4.8%
3 612
 
3.6%
4 564
 
3.3%
5 549
 
3.2%
Other values (58) 2576
 
14.9%
Hangul
ValueCountFrequency (%)
1414
 
6.3%
1153
 
5.1%
1044
 
4.6%
851
 
3.8%
641
 
2.8%
562
 
2.5%
556
 
2.5%
461
 
2.0%
442
 
2.0%
430
 
1.9%
Other values (464) 14997
66.5%
Distinct1121
Distinct (%)94.1%
Missing0
Missing (%)0.0%
Memory size9.4 KiB
2023-12-12T21:33:13.575045image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length3
Mean length3.5541562
Min length2

Characters and Unicode

Total characters4233
Distinct characters269
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1088 ?
Unique (%)91.4%

Sample

1st row윤창우
2nd row이현수
3rd row김기영
4th row김진숙
5th row박홍근
ValueCountFrequency (%)
데이터 30
 
2.3%
미집계 30
 
2.3%
1명 21
 
1.6%
김진숙 4
 
0.3%
김기영 4
 
0.3%
이동현 3
 
0.2%
김성진 3
 
0.2%
조영태 3
 
0.2%
김영신 3
 
0.2%
김영진 3
 
0.2%
Other values (1178) 1209
92.1%
2023-12-12T21:33:14.182847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
263
 
6.2%
208
 
4.9%
129
 
3.0%
115
 
2.7%
106
 
2.5%
98
 
2.3%
77
 
1.8%
75
 
1.8%
68
 
1.6%
68
 
1.6%
Other values (259) 3026
71.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3786
89.4%
Lowercase Letter 147
 
3.5%
Space Separator 129
 
3.0%
Uppercase Letter 98
 
2.3%
Other Punctuation 50
 
1.2%
Decimal Number 22
 
0.5%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
263
 
6.9%
208
 
5.5%
115
 
3.0%
106
 
2.8%
98
 
2.6%
77
 
2.0%
75
 
2.0%
68
 
1.8%
68
 
1.8%
63
 
1.7%
Other values (207) 2645
69.9%
Uppercase Letter
ValueCountFrequency (%)
N 12
 
12.2%
H 8
 
8.2%
G 7
 
7.1%
K 7
 
7.1%
Y 6
 
6.1%
U 6
 
6.1%
A 6
 
6.1%
J 5
 
5.1%
I 5
 
5.1%
M 4
 
4.1%
Other values (14) 32
32.7%
Lowercase Letter
ValueCountFrequency (%)
n 20
13.6%
a 15
10.2%
e 15
10.2%
o 14
9.5%
i 13
8.8%
r 12
 
8.2%
g 8
 
5.4%
u 7
 
4.8%
t 6
 
4.1%
h 6
 
4.1%
Other values (12) 31
21.1%
Other Punctuation
ValueCountFrequency (%)
, 49
98.0%
. 1
 
2.0%
Decimal Number
ValueCountFrequency (%)
1 21
95.5%
2 1
 
4.5%
Space Separator
ValueCountFrequency (%)
129
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3786
89.4%
Latin 245
 
5.8%
Common 202
 
4.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
263
 
6.9%
208
 
5.5%
115
 
3.0%
106
 
2.8%
98
 
2.6%
77
 
2.0%
75
 
2.0%
68
 
1.8%
68
 
1.8%
63
 
1.7%
Other values (207) 2645
69.9%
Latin
ValueCountFrequency (%)
n 20
 
8.2%
a 15
 
6.1%
e 15
 
6.1%
o 14
 
5.7%
i 13
 
5.3%
r 12
 
4.9%
N 12
 
4.9%
H 8
 
3.3%
g 8
 
3.3%
G 7
 
2.9%
Other values (36) 121
49.4%
Common
ValueCountFrequency (%)
129
63.9%
, 49
 
24.3%
1 21
 
10.4%
. 1
 
0.5%
2 1
 
0.5%
- 1
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3786
89.4%
ASCII 447
 
10.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
263
 
6.9%
208
 
5.5%
115
 
3.0%
106
 
2.8%
98
 
2.6%
77
 
2.0%
75
 
2.0%
68
 
1.8%
68
 
1.8%
63
 
1.7%
Other values (207) 2645
69.9%
ASCII
ValueCountFrequency (%)
129
28.9%
, 49
 
11.0%
1 21
 
4.7%
n 20
 
4.5%
a 15
 
3.4%
e 15
 
3.4%
o 14
 
3.1%
i 13
 
2.9%
r 12
 
2.7%
N 12
 
2.7%
Other values (42) 147
32.9%
Distinct1038
Distinct (%)87.2%
Missing0
Missing (%)0.0%
Memory size9.4 KiB
2023-12-12T21:33:14.576752image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length12
Mean length11.349286
Min length7

Characters and Unicode

Total characters13517
Distinct characters19
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1032 ?
Unique (%)86.6%

Sample

1st row031-202-8267
2nd row02-525-1843
3rd row데이터 미집계
4th row데이터 미집계
5th row02-539-5983
ValueCountFrequency (%)
데이터 149
 
11.1%
미집계 149
 
11.1%
031-969-6952 2
 
0.1%
043-820-4258 2
 
0.1%
042-615-4656 2
 
0.1%
02-552-1012 2
 
0.1%
042-824-1142 2
 
0.1%
070-4495-7482 1
 
0.1%
02-527-6400 1
 
0.1%
02-2125-4026 1
 
0.1%
Other values (1029) 1029
76.8%
2023-12-12T21:33:15.605485image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 2246
16.6%
- 2089
15.5%
2 1347
10.0%
3 1119
8.3%
1 1095
8.1%
5 906
6.7%
4 842
 
6.2%
7 833
 
6.2%
6 779
 
5.8%
8 699
 
5.2%
Other values (9) 1562
11.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 10383
76.8%
Dash Punctuation 2089
 
15.5%
Other Letter 894
 
6.6%
Space Separator 149
 
1.1%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 2246
21.6%
2 1347
13.0%
3 1119
10.8%
1 1095
10.5%
5 906
8.7%
4 842
 
8.1%
7 833
 
8.0%
6 779
 
7.5%
8 699
 
6.7%
9 517
 
5.0%
Other Letter
ValueCountFrequency (%)
149
16.7%
149
16.7%
149
16.7%
149
16.7%
149
16.7%
149
16.7%
Dash Punctuation
ValueCountFrequency (%)
- 2089
100.0%
Space Separator
ValueCountFrequency (%)
149
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 12623
93.4%
Hangul 894
 
6.6%

Most frequent character per script

Common
ValueCountFrequency (%)
0 2246
17.8%
- 2089
16.5%
2 1347
10.7%
3 1119
8.9%
1 1095
8.7%
5 906
7.2%
4 842
 
6.7%
7 833
 
6.6%
6 779
 
6.2%
8 699
 
5.5%
Other values (3) 668
 
5.3%
Hangul
ValueCountFrequency (%)
149
16.7%
149
16.7%
149
16.7%
149
16.7%
149
16.7%
149
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 12623
93.4%
Hangul 894
 
6.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 2246
17.8%
- 2089
16.5%
2 1347
10.7%
3 1119
8.9%
1 1095
8.7%
5 906
7.2%
4 842
 
6.7%
7 833
 
6.6%
6 779
 
6.2%
8 699
 
5.5%
Other values (3) 668
 
5.3%
Hangul
ValueCountFrequency (%)
149
16.7%
149
16.7%
149
16.7%
149
16.7%
149
16.7%
149
16.7%

Interactions

2023-12-12T21:33:06.876013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T21:33:15.750936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번기관유형
순번1.0000.535
기관유형0.5351.000
2023-12-12T21:33:15.854028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번기관유형
순번1.0000.248
기관유형0.2481.000

Missing values

2023-12-12T21:33:07.049183image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:33:07.259695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번기관명기관유형사업자번호설립일우편번호주소대표자연락처
01(사) 한국금속포장산업협회협회138-82-01708데이터 미집계16679경기도 수원시 영통구 영통로241번길 12-29 (신동) 아람빌딩 6층 한국금속포장산업협회윤창우031-202-8267
12(사)대한건축학회학회108-82-321371971-06-016687서울특별시 서초구 효령로 87 (방배동) 대한건축학회 건축센터이현수02-525-1843
23(사)한국공간정보연구조합연구조합113-82-062662013-12-248389서울 구로구 디지털로26길 5 308호 (구로동,에이스하이엔타워)김기영데이터 미집계
34(사)한국도로협회협회116-82-048211966-04-2713647경기 성남시 수정구 위례서일로 26, 8층 (창곡동)김진숙데이터 미집계
45(사)한국콘크리트학회학회220-82-064531992-09-026130서울 강남구 테헤란로7길 22 (역삼동)박홍근02-539-5983
56(사)한국패시브건축협회기타220-82-101132009-03-075520서울특별시 송파구 올림픽로 577, 3층 (풍납동,에이동)최정만070-7603-6621
67(사)한국화재보험협회협회116-82-01300데이터 미집계7328서울 영등포구 국제금융로6길 38 (여의도동,한국화재보험협회빌딩)이윤배데이터 미집계
78(사단)대한산업안전협회협회130-82-070091964-07-068289서울 구로구 공원로 70 (구로동)박종선02-860-7100-
89(사단)빌딩스마트협회협회101-82-178862008-04-256600서울 서초구 서초중앙로 188, 423호 (서초동,아크로비스타 사무동엘)허인070-7012-0409
910(사단)생활환경디자인연구소기타101-82-168542008-05-276017서울 강남구 언주로168길 37, 3층 (신사동)변혜령02-548-2058
순번기관명기관유형사업자번호설립일우편번호주소대표자연락처
11811182현대오토에버(주)대기업(연)104-81-53190데이터 미집계6179서울 강남구 테헤란로 510 (대치동)서정식02-6296-4762
11821183현대자동차(주)대기업(연)101-81-09147데이터 미집계6797서울 서초구 헌릉로 12 (양재동)하언태02-3464-1114
11831184현우시스템중소기업(연)123-36-204962011-08-08437757경기도 의왕시 철도박물관로176 (월암동 한국철도기술연구원) 기술실용화센터기술사업화연구실 214남학기데이터 미집계
11841185호서대학교산학협력단대학교312-82-102561978-09-2831499충남 아산시 배방읍 호서로79번길 20, 내 (세출리,호서대학교)김병삼041-540-5721
11851186홍익대학교 세종캠퍼스산학협력단대학교307-82-08705데이터 미집계30016세종특별자치시 조치원읍 세종로 2639 홍익대학교 세종산학협력단김기수044-860-2801
11861187홍익대학교과학기술연구소대학교105-82-06945데이터 미집계4066서울 마포구 와우산로 94 (상수동)박구현02-320-1421
11871188홍익대학교산학협력단대학교105-82-136172012-01-084066서울특별시 마포구 와우산로 94 (상수동)추상호044-860-2875
11881189화인정밀(주)중소기업(연)606-86-353871989-01-0149487부산 사하구 다산로175번길 62 (다대동)홍기영051-301-1213
11891190효창엔지니어링 (주)중소기업(연)223-81-07264데이터 미집계200959강원도 춘천시 춘천로380 (후평동 ) 742-2박건033-251-4797
11901191휴센텍(주)중소기업(연)107-88-41260데이터 미집계14056경기 안양시 동안구 학의로 268 410호 (관양동,메가밸리)강시철070-7844-9315