Overview

Dataset statistics

Number of variables: 7
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 625.0 KiB
Average record size in memory: 64.0 B

Variable types

Text: 5
Categorical: 1
Boolean: 1

Dataset

Description: Gyeonggi Data Dream (경기데이터드림) OpenAPI response specification
Author: Gyeonggi Province (경기도)
URL: https://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=V3FXHJNFCLAVBMQTBRRI29844183&infSeq=1

Alerts

요청인자여부 is highly imbalanced (57.6%)
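
The 57.6% in this alert is consistent with an entropy-based imbalance score, 1 - H / log2(k), applied to the counts this report gives for 요청인자여부 (False 9136, True 864). The sketch below assumes that definition. The function name `imbalance_score` and the handling of single-class input are illustrative choices, not taken from the report.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k): 0 for a uniform split, 1 when one class holds everything."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # assumed convention: a single class has no split to measure
    # Shannon entropy in bits; empty classes contribute nothing.
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


# Counts reported for 요청인자여부 in this report.
score = imbalance_score([9136, 864])
print(f"{score:.1%}")
```

A perfectly even split such as `[5000, 5000]` scores 0, so the score grows as one class dominates.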

Reproduction

Analysis started: 2023-12-10 21:03:09.585321
Analysis finished: 2023-12-10 21:03:10.870083
Duration: 1.28 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
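
A report like this one can be regenerated with ydata-profiling roughly as follows. This is a minimal sketch, not the configuration actually used (that is in config.json). The report title, the helper name `build_profile`, and the input file in the usage note are assumptions. The metadata values are copied from the Dataset section above.

```python
# Dataset metadata as shown in the "Dataset" section of this report.
DATASET_METADATA = {
    "description": "경기데이터드림 OpenApi 응답 명세",
    "author": "경기도",
    "url": (
        "https://data.gg.go.kr/portal/data/service/selectServicePage.do"
        "?&infId=V3FXHJNFCLAVBMQTBRRI29844183&infSeq=1"
    ),
}


def build_profile(df, title="OpenAPI response specification profile"):
    """Build a ProfileReport for a pandas DataFrame (title is a placeholder)."""
    # Imported lazily so the module loads even where ydata-profiling is absent.
    from ydata_profiling import ProfileReport

    return ProfileReport(df, title=title, dataset=DATASET_METADATA)
```

For example, `build_profile(pd.read_csv("spec.csv")).to_file("report.html")` would write the HTML report, where `spec.csv` is a hypothetical export of the specification table.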

Variables

공공데이터ID (public data ID)
Text

Distinct: 1642
Distinct (%): 16.4%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 28
Median length: 28
Mean length: 27.7118
Min length: 25

Characters and Unicode

Total characters: 277118
Distinct characters: 36
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
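
As an illustration of these character properties, the sketch below counts the Unicode general categories in one value using Python's standard `unicodedata` module. The helper name `category_counts` is illustrative. The input string is the first sample value listed for this variable.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count characters by Unicode general category (e.g. 'Lu', 'Nd')."""
    return Counter(unicodedata.category(ch) for ch in text)


# First sample value listed for this variable.
counts = category_counts("98MPA0H1EMHUXPEAEG1533349416")
print(counts)
```

Here `Lu` is Uppercase Letter and `Nd` is Decimal Number, the two categories the report lists for this variable.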

Unique

Unique: 157
Unique (%): 1.6%
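
Distinct counts the different values in a column, while Unique counts only the values that occur exactly once. That is why 1642 distinct values can coexist with only 157 unique ones. A minimal sketch on a hypothetical toy column (the helper name and data are illustrative):

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of distinct values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique


# Hypothetical toy column: "a" twice, "b" once, "c" three times.
toy = ["a", "a", "b", "c", "c", "c"]
distinct, unique = distinct_and_unique(toy)
print(distinct, unique)
```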

Sample

1st row: 98MPA0H1EMHUXPEAEG1533349416
2nd row: AB7AFI48HJH7CY7P4USS25178695
3rd row: JBTU5JH49NBV11K1CFVL24468331
4th row: F32ZDZ5AAZWMDY363OMC33625778
5th row: 05GJA39MCJY5F4O0U542201248
Value                          Count  Frequency (%)
mpqr5kwez6yv0labw5t529117513   40     0.4%
yh0j17es44c9una8xqcq29697412   37     0.4%
n8nru1s8r380pfhrup1i29101003   35     0.4%
kvvbn6skkyvrt5q5ba8y33296771   33     0.3%
i1mvrr82lb9h1mxf2m7r29688846   32     0.3%
gaf1pg59dty81aww6lby33315118   31     0.3%
dylq1w7o4vli1a843nrg29153306   31     0.3%
cts65ln22mrppsrk8nem32765057   29     0.3%
x5tk0e41y1y6o3h3ebck23323506   29     0.3%
qss34vws7uu7x0eoq8ns28029024   28     0.3%
Other values (1632)            9675   96.8%

Most occurring characters

ValueCountFrequency (%)
2 18071
 
6.5%
1 16560
 
6.0%
3 16535
 
6.0%
4 14183
 
5.1%
8 13998
 
5.1%
9 13852
 
5.0%
5 13478
 
4.9%
7 13475
 
4.9%
6 13029
 
4.7%
0 12584
 
4.5%
Other values (26) 131353
47.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 145765
52.6%
Uppercase Letter 131353
47.4%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
U 5493
 
4.2%
Q 5448
 
4.1%
T 5346
 
4.1%
S 5339
 
4.1%
N 5286
 
4.0%
A 5248
 
4.0%
X 5190
 
4.0%
P 5172
 
3.9%
Z 5130
 
3.9%
E 5121
 
3.9%
Other values (16) 78580
59.8%
Decimal Number
ValueCountFrequency (%)
2 18071
12.4%
1 16560
11.4%
3 16535
11.3%
4 14183
9.7%
8 13998
9.6%
9 13852
9.5%
5 13478
9.2%
7 13475
9.2%
6 13029
8.9%
0 12584
8.6%

Most occurring scripts

ValueCountFrequency (%)
Common 145765
52.6%
Latin 131353
47.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
U 5493
 
4.2%
Q 5448
 
4.1%
T 5346
 
4.1%
S 5339
 
4.1%
N 5286
 
4.0%
A 5248
 
4.0%
X 5190
 
4.0%
P 5172
 
3.9%
Z 5130
 
3.9%
E 5121
 
3.9%
Other values (16) 78580
59.8%
Common
ValueCountFrequency (%)
2 18071
12.4%
1 16560
11.4%
3 16535
11.3%
4 14183
9.7%
8 13998
9.6%
9 13852
9.5%
5 13478
9.2%
7 13475
9.2%
6 13029
8.9%
0 12584
8.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 277118
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 18071
 
6.5%
1 16560
 
6.0%
3 16535
 
6.0%
4 14183
 
5.1%
8 13998
 
5.1%
9 13852
 
5.0%
5 13478
 
4.9%
7 13475
 
4.9%
6 13029
 
4.7%
0 12584
 
4.5%
Other values (26) 131353
47.4%
공공데이터명 (public data name)
Text

Distinct: 1640
Distinct (%): 16.4%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 49
Median length: 32
Mean length: 14.439
Min length: 4

Characters and Unicode

Total characters: 144390
Distinct characters: 541
Distinct categories: 11
Distinct scripts: 4
Distinct blocks: 5
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 156
Unique (%): 1.6%

Sample

1st row: 경기도_BMS 예비차 정보
2nd row: 특수학급 순회교육 학생수 집계현황
3rd row: 도시계획 개발행위 허가 정보 현황
4th row: 경기도 대기환경정보 월평균자료
5th row: 의료기기 판매(임대)업체 현황
ValueCountFrequency (%)
현황 5573
 
19.3%
경기도 701
 
2.4%
현황(제공표준 690
 
2.4%
정보 632
 
2.2%
현황_인허가 379
 
1.3%
집계현황 343
 
1.2%
공공체육시설 219
 
0.8%
217
 
0.7%
집계 205
 
0.7%
경기도_bms 194
 
0.7%
Other values (2251) 19790
68.4%

Most occurring characters

ValueCountFrequency (%)
18949
 
13.1%
8050
 
5.6%
8033
 
5.6%
3563
 
2.5%
3327
 
2.3%
( 2974
 
2.1%
) 2974
 
2.1%
2568
 
1.8%
2317
 
1.6%
2303
 
1.6%
Other values (531) 89332
61.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 114969
79.6%
Space Separator 18949
 
13.1%
Open Punctuation 2989
 
2.1%
Close Punctuation 2989
 
2.1%
Connector Punctuation 1953
 
1.4%
Uppercase Letter 1379
 
1.0%
Other Punctuation 722
 
0.5%
Lowercase Letter 233
 
0.2%
Decimal Number 102
 
0.1%
Dash Punctuation 97
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8050
 
7.0%
8033
 
7.0%
3563
 
3.1%
3327
 
2.9%
2568
 
2.2%
2317
 
2.0%
2303
 
2.0%
2079
 
1.8%
1948
 
1.7%
1809
 
1.6%
Other values (475) 78972
68.7%
Uppercase Letter
ValueCountFrequency (%)
S 295
21.4%
M 261
18.9%
B 233
16.9%
C 89
 
6.5%
I 69
 
5.0%
A 61
 
4.4%
G 49
 
3.6%
T 49
 
3.6%
E 46
 
3.3%
D 39
 
2.8%
Other values (11) 188
13.6%
Lowercase Letter
ValueCountFrequency (%)
i 36
15.5%
n 32
13.7%
e 31
13.3%
l 26
11.2%
a 20
8.6%
o 14
 
6.0%
m 14
 
6.0%
c 13
 
5.6%
y 10
 
4.3%
g 8
 
3.4%
Other values (6) 29
12.4%
Decimal Number
ValueCountFrequency (%)
1 58
56.9%
9 20
 
19.6%
8 10
 
9.8%
2 8
 
7.8%
5 3
 
2.9%
0 3
 
2.9%
Other Punctuation
ValueCountFrequency (%)
, 427
59.1%
· 149
 
20.6%
/ 102
 
14.1%
. 37
 
5.1%
? 7
 
1.0%
Open Punctuation
ValueCountFrequency (%)
( 2974
99.5%
[ 15
 
0.5%
Close Punctuation
ValueCountFrequency (%)
) 2974
99.5%
] 15
 
0.5%
Space Separator
ValueCountFrequency (%)
18949
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1953
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 97
100.0%
Math Symbol
ValueCountFrequency (%)
~ 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 114953
79.6%
Common 27809
 
19.3%
Latin 1612
 
1.1%
Han 16
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8050
 
7.0%
8033
 
7.0%
3563
 
3.1%
3327
 
2.9%
2568
 
2.2%
2317
 
2.0%
2303
 
2.0%
2079
 
1.8%
1948
 
1.7%
1809
 
1.6%
Other values (473) 78956
68.7%
Latin
ValueCountFrequency (%)
S 295
18.3%
M 261
16.2%
B 233
14.5%
C 89
 
5.5%
I 69
 
4.3%
A 61
 
3.8%
G 49
 
3.0%
T 49
 
3.0%
E 46
 
2.9%
D 39
 
2.4%
Other values (27) 421
26.1%
Common
ValueCountFrequency (%)
18949
68.1%
( 2974
 
10.7%
) 2974
 
10.7%
_ 1953
 
7.0%
, 427
 
1.5%
· 149
 
0.5%
/ 102
 
0.4%
- 97
 
0.3%
1 58
 
0.2%
. 37
 
0.1%
Other values (9) 89
 
0.3%
Han
ValueCountFrequency (%)
8
50.0%
8
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 114930
79.6%
ASCII 29272
 
20.3%
None 149
 
0.1%
Compat Jamo 23
 
< 0.1%
CJK 16
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
18949
64.7%
( 2974
 
10.2%
) 2974
 
10.2%
_ 1953
 
6.7%
, 427
 
1.5%
S 295
 
1.0%
M 261
 
0.9%
B 233
 
0.8%
/ 102
 
0.3%
- 97
 
0.3%
Other values (45) 1007
 
3.4%
Hangul
ValueCountFrequency (%)
8050
 
7.0%
8033
 
7.0%
3563
 
3.1%
3327
 
2.9%
2568
 
2.2%
2317
 
2.0%
2303
 
2.0%
2079
 
1.8%
1948
 
1.7%
1809
 
1.6%
Other values (472) 78933
68.7%
None
ValueCountFrequency (%)
· 149
100.0%
Compat Jamo
ValueCountFrequency (%)
23
100.0%
CJK
ValueCountFrequency (%)
8
50.0%
8
50.0%
호출URL (call URL)
Text

Distinct: 1642
Distinct (%): 16.4%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 55
Median length: 49
Mean length: 41.9739
Min length: 28

Characters and Unicode

Total characters: 419739
Distinct characters: 60
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 157
Unique (%): 1.6%

Sample

1st row: https://openapi.gg.go.kr/TBBMSVEHSPAREM
2nd row: https://openapi.gg.go.kr/Ggspeclclasrdstdnt
3rd row: https://openapi.gg.go.kr/Cityplndevloppermisn
4th row: https://openapi.gg.go.kr/TBGGAIRMONTHAVGM
5th row: https://openapi.gg.go.kr/MedicalCareInstrumentSale
Value                                             Count  Frequency (%)
https://openapi.gg.go.kr/bidpblancserviceinq      40     0.4%
https://openapi.gg.go.kr/distrbprodwtrqult        37     0.4%
https://openapi.gg.go.kr/bidpblancconstwkinq      35     0.4%
https://openapi.gg.go.kr/tbbmsvehinfom            33     0.3%
https://openapi.gg.go.kr/manufentrorigwtrinspect  32     0.3%
https://openapi.gg.go.kr/ggeducotdstus            31     0.3%
https://openapi.gg.go.kr/bidpblancthnginq         31     0.3%
https://openapi.gg.go.kr/purpspasngdstn           29     0.3%
https://openapi.gg.go.kr/ofcpstechr               29     0.3%
https://openapi.gg.go.kr/buldngsanittnmanage      28     0.3%
Other values (1630)                               9675   96.8%

Most occurring characters

ValueCountFrequency (%)
p 34751
 
8.3%
g 33707
 
8.0%
t 32365
 
7.7%
/ 30000
 
7.1%
. 30000
 
7.1%
o 27177
 
6.5%
e 21677
 
5.2%
n 20936
 
5.0%
r 19990
 
4.8%
s 19898
 
4.7%
Other values (50) 149238
35.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 306999
73.1%
Other Punctuation 70000
 
16.7%
Uppercase Letter 42690
 
10.2%
Decimal Number 50
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
p 34751
11.3%
g 33707
11.0%
t 32365
10.5%
o 27177
8.9%
e 21677
 
7.1%
n 20936
 
6.8%
r 19990
 
6.5%
s 19898
 
6.5%
a 18314
 
6.0%
i 18159
 
5.9%
Other values (16) 60025
19.6%
Uppercase Letter
ValueCountFrequency (%)
S 4333
 
10.1%
T 4229
 
9.9%
G 3332
 
7.8%
M 2796
 
6.5%
E 2697
 
6.3%
P 2691
 
6.3%
R 2595
 
6.1%
C 2268
 
5.3%
I 2041
 
4.8%
A 2010
 
4.7%
Other values (16) 13698
32.1%
Decimal Number
ValueCountFrequency (%)
1 27
54.0%
2 13
26.0%
0 6
 
12.0%
8 2
 
4.0%
9 2
 
4.0%
Other Punctuation
ValueCountFrequency (%)
/ 30000
42.9%
. 30000
42.9%
: 10000
 
14.3%

Most occurring scripts

ValueCountFrequency (%)
Latin 349689
83.3%
Common 70050
 
16.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
p 34751
 
9.9%
g 33707
 
9.6%
t 32365
 
9.3%
o 27177
 
7.8%
e 21677
 
6.2%
n 20936
 
6.0%
r 19990
 
5.7%
s 19898
 
5.7%
a 18314
 
5.2%
i 18159
 
5.2%
Other values (42) 102715
29.4%
Common
ValueCountFrequency (%)
/ 30000
42.8%
. 30000
42.8%
: 10000
 
14.3%
1 27
 
< 0.1%
2 13
 
< 0.1%
0 6
 
< 0.1%
8 2
 
< 0.1%
9 2
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 419739
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
p 34751
 
8.3%
g 33707
 
8.0%
t 32365
 
7.7%
/ 30000
 
7.1%
. 30000
 
7.1%
o 27177
 
6.5%
e 21677
 
5.2%
n 20936
 
5.0%
r 19990
 
4.8%
s 19898
 
4.7%
Other values (50) 149238
35.6%
항목ID (item ID)
Text

Distinct: 3898
Distinct (%): 39.0%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 30
Median length: 24
Mean length: 13.6261
Min length: 2

Characters and Unicode

Total characters: 136261
Distinct characters: 43
Distinct categories: 4
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 3106
Unique (%): 31.1%

Sample

1st row: WORK_CD_NM
2nd row: ADDPOST_REDU_FACLT_STDNT_CNT
3rd row: SIGUN_NM
4th row: CITY_CODE
5th row: X_CRDNT_VL
Value                Count  Frequency (%)
sigun_nm             380    3.8%
sigun_cd             374    3.7%
refine_wgs84_lat     291    2.9%
refine_roadnm_addr   275    2.8%
refine_wgs84_logt    263    2.6%
refine_zip_cd        230    2.3%
refine_lotno_addr    224    2.2%
bsn_state_nm         126    1.3%
licensg_de           117    1.2%
clsbiz_de            110    1.1%
Other values (3888)  7610   76.1%

Most occurring characters

ValueCountFrequency (%)
_ 18199
13.4%
N 13601
 
10.0%
T 10794
 
7.9%
E 9171
 
6.7%
R 7415
 
5.4%
C 7329
 
5.4%
D 7193
 
5.3%
S 7180
 
5.3%
I 6747
 
5.0%
L 6122
 
4.5%
Other values (33) 42510
31.2%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 115970
85.1%
Connector Punctuation 18199
 
13.4%
Decimal Number 2086
 
1.5%
Other Letter 6
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
N 13601
 
11.7%
T 10794
 
9.3%
E 9171
 
7.9%
R 7415
 
6.4%
C 7329
 
6.3%
D 7193
 
6.2%
S 7180
 
6.2%
I 6747
 
5.8%
L 6122
 
5.3%
A 6047
 
5.2%
Other values (16) 34371
29.6%
Decimal Number
ValueCountFrequency (%)
4 660
31.6%
8 587
28.1%
1 188
 
9.0%
0 180
 
8.6%
2 142
 
6.8%
5 112
 
5.4%
3 104
 
5.0%
6 53
 
2.5%
9 36
 
1.7%
7 24
 
1.2%
Other Letter
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
Connector Punctuation
ValueCountFrequency (%)
_ 18199
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 115970
85.1%
Common 20285
 
14.9%
Hangul 6
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
N 13601
 
11.7%
T 10794
 
9.3%
E 9171
 
7.9%
R 7415
 
6.4%
C 7329
 
6.3%
D 7193
 
6.2%
S 7180
 
6.2%
I 6747
 
5.8%
L 6122
 
5.3%
A 6047
 
5.2%
Other values (16) 34371
29.6%
Common
ValueCountFrequency (%)
_ 18199
89.7%
4 660
 
3.3%
8 587
 
2.9%
1 188
 
0.9%
0 180
 
0.9%
2 142
 
0.7%
5 112
 
0.6%
3 104
 
0.5%
6 53
 
0.3%
9 36
 
0.2%
Hangul
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 136255
> 99.9%
Hangul 6
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
_ 18199
13.4%
N 13601
 
10.0%
T 10794
 
7.9%
E 9171
 
6.7%
R 7415
 
5.4%
C 7329
 
5.4%
D 7193
 
5.3%
S 7180
 
5.3%
I 6747
 
5.0%
L 6122
 
4.5%
Other values (27) 42504
31.2%
Hangul
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
항목명 (item name)
Text

Distinct: 4001
Distinct (%): 40.0%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 30
Median length: 28
Mean length: 5.8456
Min length: 1

Characters and Unicode

Total characters: 58456
Distinct characters: 585
Distinct categories: 13
Distinct scripts: 3
Distinct blocks: 6
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 3230
Unique (%): 32.3%

Sample

1st row: 업무코드명
2nd row: 겸임순회교육시설학생수
3rd row: 시군명
4th row: 도시코드
5th row: X좌표값
Value                Count  Frequency (%)
시군명                376    3.7%
시군코드              374    3.6%
소재지도로명주소      230    2.2%
wgs84위도            229    2.2%
소재지우편번호        222    2.2%
wgs84경도            189    1.8%
소재지지번주소        188    1.8%
영업상태명            118    1.1%
인허가일자            117    1.1%
폐업일자              111    1.1%
Other values (4017)  8144   79.1%

Most occurring characters

ValueCountFrequency (%)
2439
 
4.2%
1867
 
3.2%
1823
 
3.1%
1692
 
2.9%
1648
 
2.8%
1498
 
2.6%
1216
 
2.1%
1103
 
1.9%
919
 
1.6%
888
 
1.5%
Other values (575) 43363
74.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 52654
90.1%
Decimal Number 2199
 
3.8%
Uppercase Letter 2096
 
3.6%
Space Separator 298
 
0.5%
Lowercase Letter 283
 
0.5%
Open Punctuation 268
 
0.5%
Close Punctuation 267
 
0.5%
Dash Punctuation 126
 
0.2%
Math Symbol 125
 
0.2%
Other Punctuation 78
 
0.1%
Other values (3) 62
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2439
 
4.6%
1867
 
3.5%
1823
 
3.5%
1692
 
3.2%
1648
 
3.1%
1498
 
2.8%
1216
 
2.3%
1103
 
2.1%
919
 
1.7%
888
 
1.7%
Other values (495) 37561
71.3%
Uppercase Letter
ValueCountFrequency (%)
S 495
23.6%
W 487
23.2%
G 481
22.9%
X 64
 
3.1%
L 63
 
3.0%
R 63
 
3.0%
Y 61
 
2.9%
D 60
 
2.9%
U 57
 
2.7%
I 53
 
2.5%
Other values (15) 212
10.1%
Lowercase Letter
ValueCountFrequency (%)
m 71
25.1%
k 55
19.4%
o 21
 
7.4%
19
 
6.7%
n 16
 
5.7%
e 16
 
5.7%
y 15
 
5.3%
a 13
 
4.6%
h 10
 
3.5%
s 8
 
2.8%
Other values (13) 39
13.8%
Decimal Number
ValueCountFrequency (%)
4 581
26.4%
8 500
22.7%
1 280
12.7%
0 205
 
9.3%
2 172
 
7.8%
5 150
 
6.8%
3 138
 
6.3%
9 69
 
3.1%
6 66
 
3.0%
7 38
 
1.7%
Other Punctuation
ValueCountFrequency (%)
/ 31
39.7%
? 19
24.4%
, 9
 
11.5%
: 9
 
11.5%
. 4
 
5.1%
· 3
 
3.8%
% 3
 
3.8%
Math Symbol
ValueCountFrequency (%)
~ 103
82.4%
+ 19
 
15.2%
× 2
 
1.6%
= 1
 
0.8%
Open Punctuation
ValueCountFrequency (%)
( 240
89.6%
[ 27
 
10.1%
{ 1
 
0.4%
Close Punctuation
ValueCountFrequency (%)
) 240
89.9%
] 27
 
10.1%
Other Symbol
ValueCountFrequency (%)
15
53.6%
13
46.4%
Space Separator
ValueCountFrequency (%)
298
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 126
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 28
100.0%
Modifier Symbol
ValueCountFrequency (%)
^ 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 52654
90.1%
Common 3442
 
5.9%
Latin 2360
 
4.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2439
 
4.6%
1867
 
3.5%
1823
 
3.5%
1692
 
3.2%
1648
 
3.1%
1498
 
2.8%
1216
 
2.3%
1103
 
2.1%
919
 
1.7%
888
 
1.7%
Other values (495) 37561
71.3%
Latin
ValueCountFrequency (%)
S 495
21.0%
W 487
20.6%
G 481
20.4%
m 71
 
3.0%
X 64
 
2.7%
L 63
 
2.7%
R 63
 
2.7%
Y 61
 
2.6%
D 60
 
2.5%
U 57
 
2.4%
Other values (37) 458
19.4%
Common
ValueCountFrequency (%)
4 581
16.9%
8 500
14.5%
298
8.7%
1 280
8.1%
( 240
 
7.0%
) 240
 
7.0%
0 205
 
6.0%
2 172
 
5.0%
5 150
 
4.4%
3 138
 
4.0%
Other values (23) 638
18.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 52651
90.1%
ASCII 5750
 
9.8%
CJK Compat 28
 
< 0.1%
Letterlike Symbols 19
 
< 0.1%
None 5
 
< 0.1%
Compat Jamo 3
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2439
 
4.6%
1867
 
3.5%
1823
 
3.5%
1692
 
3.2%
1648
 
3.1%
1498
 
2.8%
1216
 
2.3%
1103
 
2.1%
919
 
1.7%
888
 
1.7%
Other values (494) 37558
71.3%
ASCII
ValueCountFrequency (%)
4 581
 
10.1%
8 500
 
8.7%
S 495
 
8.6%
W 487
 
8.5%
G 481
 
8.4%
298
 
5.2%
1 280
 
4.9%
( 240
 
4.2%
) 240
 
4.2%
0 205
 
3.6%
Other values (65) 1943
33.8%
Letterlike Symbols
ValueCountFrequency (%)
19
100.0%
CJK Compat
ValueCountFrequency (%)
15
53.6%
13
46.4%
None
ValueCountFrequency (%)
· 3
60.0%
× 2
40.0%
Compat Jamo
ValueCountFrequency (%)
3
100.0%

항목타입 (item type)
Categorical

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
TEXT: 7477
NUMBER: 2523

Length

Max length: 6
Median length: 4
Mean length: 4.5046
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: TEXT
2nd row: NUMBER
3rd row: TEXT
4th row: TEXT
5th row: TEXT

Common Values

Value   Count  Frequency (%)
TEXT    7477   74.8%
NUMBER  2523   25.2%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: common values; text 7477 (74.8%), number 2523 (25.2%)]

요청인자여부 (whether the item is a request parameter)
Boolean

IMBALANCE

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 87.9 KiB

Value  Count  Frequency (%)
False  9136   91.4%
True   864    8.6%

Correlations

               항목타입  요청인자여부
항목타입        1.000     0.275
요청인자여부    0.275     1.000

               요청인자여부  항목타입
요청인자여부    1.000         0.177
항목타입        0.177         1.000

               항목타입  요청인자여부
항목타입        1.000     0.177
요청인자여부    0.177     1.000

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion by eye.

Sample

공공데이터ID  공공데이터명  호출URL  항목ID  항목명  항목타입  요청인자여부
352998MPA0H1EMHUXPEAEG1533349416경기도_BMS 예비차 정보https://openapi.gg.go.kr/TBBMSVEHSPAREMWORK_CD_NM업무코드명TEXTN
26544AB7AFI48HJH7CY7P4USS25178695특수학급 순회교육 학생수 집계현황https://openapi.gg.go.kr/GgspeclclasrdstdntADDPOST_REDU_FACLT_STDNT_CNT겸임순회교육시설학생수NUMBERN
10407JBTU5JH49NBV11K1CFVL24468331도시계획 개발행위 허가 정보 현황https://openapi.gg.go.kr/CityplndevloppermisnSIGUN_NM시군명TEXTY
1872F32ZDZ5AAZWMDY363OMC33625778경기도 대기환경정보 월평균자료https://openapi.gg.go.kr/TBGGAIRMONTHAVGMCITY_CODE도시코드TEXTN
1909805GJA39MCJY5F4O0U542201248의료기기 판매(임대)업체 현황https://openapi.gg.go.kr/MedicalCareInstrumentSaleX_CRDNT_VLX좌표값TEXTN
11141YSRHA8I0AFKA3PG6LHNF14653338목재수입유통업 현황https://openapi.gg.go.kr/TimberiBSN_STATE_DIV_CD영업상태구분코드TEXTN
79493ZC9N7M6839J9WKS0461168232관광극장 유흥업체 현황_인허가https://openapi.gg.go.kr/THEATPLESRSIGUN_NM시군명TEXTY
8795FRF1VBDTJUJCA204NT5922371545기능별 재원별 세출예산 현황https://openapi.gg.go.kr/SkllfnrscanexptrbudgetFIELD_TYPE_CD분야유형코드TEXTN
121736T98794V0223GQQ9O1P42484027병원 현황(병원급)https://openapi.gg.go.kr/HospitalREFINE_LOTNO_ADDR소재지지번주소TEXTN
19266FQBYDQWUXT7QI8CQ3XC420335463의원급 산부인과 현황https://openapi.gg.go.kr/HsptlObgynBSN_STATE_NM영업상태명TEXTN
공공데이터ID  공공데이터명  호출URL  항목ID  항목명  항목타입  요청인자여부
19186TRBQS6B01VJFXLP1POK228813182의료유사업 현황_인허가https://openapi.gg.go.kr/MedcareSimlrtyEntrpsLOCPLC_AR_INFO소재지면적정보TEXTN
777QSS34VWS7UU7X0EOQ8NS28029024건물위생관리업_인허가https://openapi.gg.go.kr/buldngSanittnManageLABOTRY_BAN_INFO실험실반정보TEXTN
18365H1E6WL7XXWGR9D9IQHEU13995091유통전문판매업 현황https://openapi.gg.go.kr/DistrbspecltysalebizREFINE_ZIP_CD소재지우편번호TEXTN
26347Y7G89G6QNH3MRKZKT2FO14096681특수장소의약품취급업소(휴게소등) 현황https://openapi.gg.go.kr/SpeclmedcintrtbiztblCLSBIZ_DE폐업일자TEXTN
24097WYF7FTKH3KNUGF2NR9G320532392중학교 현황https://openapi.gg.go.kr/MskulMSCHOOL_DIV_NM학교구분명TEXTN
9278EBAUVR4I7C0FHIYCANY128983763노인장애인보호구역(제공표준)https://openapi.gg.go.kr/OldpsnDspsnProtectZoneSIGUNGU_CD시군구코드TEXTN
7912030L4KK9N4DI72P1X023548558관광 펜션업체 현황https://openapi.gg.go.kr/TourismPensionY_CRDNT_VLY좌표값TEXTN
1711689TMHP3IR8EAY3W38CE51355109영화 배급업체 현황https://openapi.gg.go.kr/MovieDistributionCIRCUMFR_ENVRN_NM주변환경명TEXTN
726083H2783QDS94TM9KJ3941899884공공체육시설 현황(야구)https://openapi.gg.go.kr/PublicTrainingFacilityBasebalAR면적NUMBERN
780QSS34VWS7UU7X0EOQ8NS28029024건물위생관리업_인허가https://openapi.gg.go.kr/buldngSanittnManageLABOTRY_SPECL_ISSNO_ADDR실험실특수호주소TEXTN
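
Rows whose 요청인자여부 is Y, such as SIGUN_NM for https://openapi.gg.go.kr/Cityplndevloppermisn, appear to mark items that can be sent as request parameters. The sketch below only builds such a URL and sends nothing. It assumes the parameters travel in the query string, which this report does not state. The helper name `build_call_url` and the value 수원시 are illustrative, and any authentication key or paging parameters the service may require are not covered here.

```python
from urllib.parse import urlencode


def build_call_url(base_url, params):
    """Append request parameters to a 호출URL as a percent-encoded query string."""
    return f"{base_url}?{urlencode(params)}"


# SIGUN_NM is flagged Y for this endpoint in the sample rows above.
url = build_call_url(
    "https://openapi.gg.go.kr/Cityplndevloppermisn",
    {"SIGUN_NM": "수원시"},  # hypothetical parameter value
)
print(url)
```

`urlencode` percent-encodes the Korean value as UTF-8, so the resulting URL contains only ASCII characters.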