Overview

Dataset statistics

Number of variables15
Number of observations7929
Missing cells3230
Missing cells (%)2.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory929.3 KiB
Average record size in memory120.0 B

Variable types

Text9
Categorical5
DateTime1

Dataset

Description서비스 ID,서비스명,대분류,중분류,제공기관,제공부서명,소분류,시스템명,담당자명,담당자연락처,갱신주기,최종갱신일자,제공사이트,제공형식,서비스URL
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-1263/S/1/datasetView.do

Alerts

갱신주기 is highly overall correlated with 대분류 and 1 other fieldsHigh correlation
대분류 is highly overall correlated with 갱신주기 and 1 other fieldsHigh correlation
제공형식 is highly overall correlated with 대분류 and 1 other fieldsHigh correlation
중분류 is highly imbalanced (53.6%)Imbalance
제공형식 is highly imbalanced (63.9%)Imbalance
시스템명 has 1795 (22.6%) missing valuesMissing
제공사이트 has 1359 (17.1%) missing valuesMissing
서비스 ID has unique valuesUnique
서비스URL has unique valuesUnique

Reproduction

Analysis started2024-05-11 04:20:16.966011
Analysis finished2024-05-11 04:20:25.289425
Duration8.32 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

서비스 ID
Text

UNIQUE 

Distinct7929
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size62.1 KiB
2024-05-11T04:20:26.040020image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length8
Mean length7.8427292
Min length6

Characters and Unicode

Total characters62185
Distinct characters13
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7929 ?
Unique (%)100.0%

Sample

1st rowOA-22208
2nd rowOA-12611
3rd rowOA-12621
4th rowOA-12615
5th rowOA-22224
ValueCountFrequency (%)
oa-22208 1
 
< 0.1%
oa-20757 1
 
< 0.1%
oa-20796 1
 
< 0.1%
oa-20860 1
 
< 0.1%
oa-20872 1
 
< 0.1%
oa-20785 1
 
< 0.1%
oa-20591 1
 
< 0.1%
oa-20642 1
 
< 0.1%
oa-20618 1
 
< 0.1%
oa-20878 1
 
< 0.1%
Other values (7919) 7919
99.9%
2024-05-11T04:20:27.223577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 8841
14.2%
O 7929
12.8%
A 7929
12.8%
- 7929
12.8%
2 5680
9.1%
0 3323
 
5.3%
6 3224
 
5.2%
7 3200
 
5.1%
8 2921
 
4.7%
5 2918
 
4.7%
Other values (3) 8291
13.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 38398
61.7%
Uppercase Letter 15858
25.5%
Dash Punctuation 7929
 
12.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 8841
23.0%
2 5680
14.8%
0 3323
 
8.7%
6 3224
 
8.4%
7 3200
 
8.3%
8 2921
 
7.6%
5 2918
 
7.6%
3 2916
 
7.6%
9 2856
 
7.4%
4 2519
 
6.6%
Uppercase Letter
ValueCountFrequency (%)
O 7929
50.0%
A 7929
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 7929
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 46327
74.5%
Latin 15858
 
25.5%

Most frequent character per script

Common
ValueCountFrequency (%)
1 8841
19.1%
- 7929
17.1%
2 5680
12.3%
0 3323
 
7.2%
6 3224
 
7.0%
7 3200
 
6.9%
8 2921
 
6.3%
5 2918
 
6.3%
3 2916
 
6.3%
9 2856
 
6.2%
Latin
ValueCountFrequency (%)
O 7929
50.0%
A 7929
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 62185
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 8841
14.2%
O 7929
12.8%
A 7929
12.8%
- 7929
12.8%
2 5680
9.1%
0 3323
 
5.3%
6 3224
 
5.2%
7 3200
 
5.1%
8 2921
 
4.7%
5 2918
 
4.7%
Other values (3) 8291
13.3%
Distinct7928
Distinct (%)> 99.9%
Missing0
Missing (%)0.0%
Memory size62.1 KiB
2024-05-11T04:20:27.975931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length49
Median length44
Mean length20.871232
Min length5

Characters and Unicode

Total characters165488
Distinct characters616
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7927 ?
Unique (%)> 99.9%

Sample

1st row서울시 고시원 정보(2020년)
2nd row서울시 강동구 길동 주민센터 새소식 정보
3rd row서울시 강동구 암사2동 주민센터 새소식 정보
4th row서울시 강동구 명일2동 주민센터 새소식 정보
5th row서울시 하천수위 관측 데이터(2018~2020년)
ValueCountFrequency (%)
서울시 6956
 
18.4%
정보 3951
 
10.5%
인허가 3016
 
8.0%
현황 1294
 
3.4%
통계 1228
 
3.3%
유치원 339
 
0.9%
양천구 247
 
0.7%
강서구 227
 
0.6%
강북구 206
 
0.5%
206
 
0.5%
Other values (4558) 20078
53.2%
2024-05-11T04:20:29.036633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
29830
 
18.0%
8354
 
5.0%
8237
 
5.0%
7402
 
4.5%
5372
 
3.2%
5224
 
3.2%
4823
 
2.9%
3801
 
2.3%
3702
 
2.2%
3361
 
2.0%
Other values (606) 85382
51.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 126808
76.6%
Space Separator 29830
 
18.0%
Decimal Number 3301
 
2.0%
Open Punctuation 1691
 
1.0%
Close Punctuation 1691
 
1.0%
Uppercase Letter 1066
 
0.6%
Other Punctuation 538
 
0.3%
Dash Punctuation 219
 
0.1%
Math Symbol 158
 
0.1%
Lowercase Letter 98
 
0.1%
Other values (2) 88
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8354
 
6.6%
8237
 
6.5%
7402
 
5.8%
5372
 
4.2%
5224
 
4.1%
4823
 
3.8%
3801
 
3.0%
3702
 
2.9%
3361
 
2.7%
3070
 
2.4%
Other values (541) 73462
57.9%
Uppercase Letter
ValueCountFrequency (%)
T 183
17.2%
C 165
15.5%
I 104
9.8%
S 99
9.3%
V 97
9.1%
F 76
7.1%
R 75
7.0%
G 67
 
6.3%
W 64
 
6.0%
O 22
 
2.1%
Other values (11) 114
10.7%
Lowercase Letter
ValueCountFrequency (%)
o 25
25.5%
e 12
12.2%
t 10
 
10.2%
n 9
 
9.2%
a 8
 
8.2%
p 6
 
6.1%
i 5
 
5.1%
s 4
 
4.1%
c 3
 
3.1%
r 3
 
3.1%
Other values (8) 13
13.3%
Decimal Number
ValueCountFrequency (%)
0 1099
33.3%
2 724
21.9%
1 567
17.2%
9 263
 
8.0%
8 161
 
4.9%
5 121
 
3.7%
4 120
 
3.6%
6 96
 
2.9%
7 92
 
2.8%
3 58
 
1.8%
Other Punctuation
ValueCountFrequency (%)
/ 153
28.4%
: 139
25.8%
? 111
20.6%
. 82
15.2%
, 52
 
9.7%
& 1
 
0.2%
Math Symbol
ValueCountFrequency (%)
~ 153
96.8%
2
 
1.3%
+ 2
 
1.3%
1
 
0.6%
Space Separator
ValueCountFrequency (%)
29830
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1691
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1691
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 219
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 85
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 126804
76.6%
Common 37516
 
22.7%
Latin 1164
 
0.7%
Han 4
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8354
 
6.6%
8237
 
6.5%
7402
 
5.8%
5372
 
4.2%
5224
 
4.1%
4823
 
3.8%
3801
 
3.0%
3702
 
2.9%
3361
 
2.7%
3070
 
2.4%
Other values (539) 73458
57.9%
Latin
ValueCountFrequency (%)
T 183
15.7%
C 165
14.2%
I 104
8.9%
S 99
8.5%
V 97
8.3%
F 76
 
6.5%
R 75
 
6.4%
G 67
 
5.8%
W 64
 
5.5%
o 25
 
2.1%
Other values (29) 209
18.0%
Common
ValueCountFrequency (%)
29830
79.5%
( 1691
 
4.5%
) 1691
 
4.5%
0 1099
 
2.9%
2 724
 
1.9%
1 567
 
1.5%
9 263
 
0.7%
- 219
 
0.6%
8 161
 
0.4%
~ 153
 
0.4%
Other values (16) 1118
 
3.0%
Han
ValueCountFrequency (%)
3
75.0%
1
 
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 126799
76.6%
ASCII 38674
 
23.4%
Compat Jamo 5
 
< 0.1%
CJK 4
 
< 0.1%
CJK Compat 3
 
< 0.1%
Arrows 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
29830
77.1%
( 1691
 
4.4%
) 1691
 
4.4%
0 1099
 
2.8%
2 724
 
1.9%
1 567
 
1.5%
9 263
 
0.7%
- 219
 
0.6%
T 183
 
0.5%
C 165
 
0.4%
Other values (52) 2242
 
5.8%
Hangul
ValueCountFrequency (%)
8354
 
6.6%
8237
 
6.5%
7402
 
5.8%
5372
 
4.2%
5224
 
4.1%
4823
 
3.8%
3801
 
3.0%
3702
 
2.9%
3361
 
2.7%
3070
 
2.4%
Other values (538) 73453
57.9%
Compat Jamo
ValueCountFrequency (%)
5
100.0%
CJK Compat
ValueCountFrequency (%)
3
100.0%
CJK
ValueCountFrequency (%)
3
75.0%
1
 
25.0%
Arrows
ValueCountFrequency (%)
2
66.7%
1
33.3%

대분류
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size62.1 KiB
공공데이터
6168 
통계
1761 

Length

Max length5
Median length5
Mean length4.3337117
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공공데이터
2nd row공공데이터
3rd row공공데이터
4th row공공데이터
5th row공공데이터

Common Values

ValueCountFrequency (%)
공공데이터 6168
77.8%
통계 1761
 
22.2%

Length

2024-05-11T04:20:29.482635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T04:20:29.822951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공공데이터 6168
77.8%
통계 1761
 
22.2%

중분류
Categorical

IMBALANCE 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size62.1 KiB
서울시(본청)
5617 
자치구 및 자치구산하
1698 
공공기관(외부)
 
395
서울시(산하기관)
 
199
서울시(사업소)
 
18

Length

Max length11
Median length7
Mean length7.9586329
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울시(본청)
2nd row자치구 및 자치구산하
3rd row자치구 및 자치구산하
4th row자치구 및 자치구산하
5th row서울시(본청)

Common Values

ValueCountFrequency (%)
서울시(본청) 5617
70.8%
자치구 및 자치구산하 1698
 
21.4%
공공기관(외부) 395
 
5.0%
서울시(산하기관) 199
 
2.5%
서울시(사업소) 18
 
0.2%
민간(기업) 2
 
< 0.1%

Length

2024-05-11T04:20:30.164394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T04:20:30.505485image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서울시(본청 5617
49.6%
자치구 1698
 
15.0%
1698
 
15.0%
자치구산하 1698
 
15.0%
공공기관(외부 395
 
3.5%
서울시(산하기관 199
 
1.8%
서울시(사업소 18
 
0.2%
민간(기업 2
 
< 0.1%
Distinct60
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size62.1 KiB
2024-05-11T04:20:30.940111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length14
Mean length4.0373313
Min length2

Characters and Unicode

Total characters32012
Distinct characters140
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique11 ?
Unique (%)0.1%

Sample

1st row서울특별시
2nd row강동구
3rd row강동구
4th row강동구
5th row서울특별시
ValueCountFrequency (%)
서울특별시 3218
40.5%
양천구 244
 
3.1%
강서구 227
 
2.9%
강북구 206
 
2.6%
강동구 192
 
2.4%
도봉구 191
 
2.4%
구로구 184
 
2.3%
강남구 183
 
2.3%
성동구 180
 
2.3%
중구 178
 
2.2%
Other values (53) 2940
37.0%
2024-05-11T04:20:31.882882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4605
14.4%
4030
12.6%
3472
10.8%
3337
10.4%
3263
10.2%
3263
10.2%
819
 
2.6%
709
 
2.2%
409
 
1.3%
367
 
1.1%
Other values (130) 7738
24.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 31955
99.8%
Uppercase Letter 23
 
0.1%
Space Separator 15
 
< 0.1%
Decimal Number 15
 
< 0.1%
Close Punctuation 2
 
< 0.1%
Open Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4605
14.4%
4030
12.6%
3472
10.9%
3337
10.4%
3263
10.2%
3263
10.2%
819
 
2.6%
709
 
2.2%
409
 
1.3%
367
 
1.1%
Other values (119) 7681
24.0%
Uppercase Letter
ValueCountFrequency (%)
S 9
39.1%
H 6
26.1%
B 4
17.4%
T 3
 
13.0%
C 1
 
4.3%
Decimal Number
ValueCountFrequency (%)
0 5
33.3%
2 5
33.3%
1 5
33.3%
Space Separator
ValueCountFrequency (%)
15
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 31955
99.8%
Common 34
 
0.1%
Latin 23
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4605
14.4%
4030
12.6%
3472
10.9%
3337
10.4%
3263
10.2%
3263
10.2%
819
 
2.6%
709
 
2.2%
409
 
1.3%
367
 
1.1%
Other values (119) 7681
24.0%
Common
ValueCountFrequency (%)
15
44.1%
0 5
 
14.7%
2 5
 
14.7%
1 5
 
14.7%
) 2
 
5.9%
( 2
 
5.9%
Latin
ValueCountFrequency (%)
S 9
39.1%
H 6
26.1%
B 4
17.4%
T 3
 
13.0%
C 1
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 31955
99.8%
ASCII 57
 
0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4605
14.4%
4030
12.6%
3472
10.9%
3337
10.4%
3263
10.2%
3263
10.2%
819
 
2.6%
709
 
2.2%
409
 
1.3%
367
 
1.1%
Other values (119) 7681
24.0%
ASCII
ValueCountFrequency (%)
15
26.3%
S 9
15.8%
H 6
 
10.5%
0 5
 
8.8%
2 5
 
8.8%
1 5
 
8.8%
B 4
 
7.0%
T 3
 
5.3%
) 2
 
3.5%
( 2
 
3.5%
Distinct534
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Memory size62.1 KiB
2024-05-11T04:20:32.504889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length24
Mean length13.833523
Min length3

Characters and Unicode

Total characters109686
Distinct characters254
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique227 ?
Unique (%)2.9%

Sample

1st row소방재난본부 예방과
2nd row강동구 길동
3rd row강동구 암사2동
4th row강동구 명일2동
5th row도시교통실 교통기획관 교통운영과
ValueCountFrequency (%)
빅데이터담당관 3631
21.1%
디지털정책관 1930
 
11.2%
스마트도시정책관 1911
 
11.1%
보건소 616
 
3.6%
한국교육학술정보원 379
 
2.2%
보건위생과 248
 
1.4%
도시교통실 242
 
1.4%
교통기획관 213
 
1.2%
강서구 207
 
1.2%
행정국 167
 
1.0%
Other values (494) 7705
44.7%
2024-05-11T04:20:33.548722image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9322
 
8.5%
9200
 
8.4%
6428
 
5.9%
4784
 
4.4%
4040
 
3.7%
4040
 
3.7%
3804
 
3.5%
3719
 
3.4%
3703
 
3.4%
3694
 
3.4%
Other values (244) 56952
51.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 100283
91.4%
Space Separator 9322
 
8.5%
Uppercase Letter 31
 
< 0.1%
Decimal Number 27
 
< 0.1%
Math Symbol 15
 
< 0.1%
Other Punctuation 6
 
< 0.1%
Close Punctuation 1
 
< 0.1%
Open Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9200
 
9.2%
6428
 
6.4%
4784
 
4.8%
4040
 
4.0%
4040
 
4.0%
3804
 
3.8%
3719
 
3.7%
3703
 
3.7%
3694
 
3.7%
3001
 
3.0%
Other values (228) 53870
53.7%
Uppercase Letter
ValueCountFrequency (%)
I 6
19.4%
M 5
16.1%
E 5
16.1%
C 5
16.1%
D 4
12.9%
X 4
12.9%
O 1
 
3.2%
T 1
 
3.2%
Decimal Number
ValueCountFrequency (%)
1 15
55.6%
2 8
29.6%
3 4
 
14.8%
Space Separator
ValueCountFrequency (%)
9322
100.0%
Math Symbol
ValueCountFrequency (%)
> 15
100.0%
Other Punctuation
ValueCountFrequency (%)
? 6
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 100283
91.4%
Common 9372
 
8.5%
Latin 31
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9200
 
9.2%
6428
 
6.4%
4784
 
4.8%
4040
 
4.0%
4040
 
4.0%
3804
 
3.8%
3719
 
3.7%
3703
 
3.7%
3694
 
3.7%
3001
 
3.0%
Other values (228) 53870
53.7%
Common
ValueCountFrequency (%)
9322
99.5%
> 15
 
0.2%
1 15
 
0.2%
2 8
 
0.1%
? 6
 
0.1%
3 4
 
< 0.1%
) 1
 
< 0.1%
( 1
 
< 0.1%
Latin
ValueCountFrequency (%)
I 6
19.4%
M 5
16.1%
E 5
16.1%
C 5
16.1%
D 4
12.9%
X 4
12.9%
O 1
 
3.2%
T 1
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 100283
91.4%
ASCII 9403
 
8.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9322
99.1%
> 15
 
0.2%
1 15
 
0.2%
2 8
 
0.1%
? 6
 
0.1%
I 6
 
0.1%
M 5
 
0.1%
E 5
 
0.1%
C 5
 
0.1%
3 4
 
< 0.1%
Other values (6) 12
 
0.1%
Hangul
ValueCountFrequency (%)
9200
 
9.2%
6428
 
6.4%
4784
 
4.8%
4040
 
4.0%
4040
 
4.0%
3804
 
3.8%
3719
 
3.7%
3703
 
3.7%
3694
 
3.7%
3001
 
3.0%
Other values (228) 53870
53.7%

소분류
Categorical

Distinct13
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size62.1 KiB
보건
1712 
문화/관광
1617 
산업/경제
857 
환경
590 
<NA>
554 
Other values (8)
2599 

Length

Max length5
Median length4
Mean length3.4107706
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row주택/건설
2nd row일반행정
3rd row일반행정
4th row일반행정
5th row도시관리

Common Values

ValueCountFrequency (%)
보건 1712
21.6%
문화/관광 1617
20.4%
산업/경제 857
10.8%
환경 590
 
7.4%
<NA> 554
 
7.0%
교육 540
 
6.8%
일반행정 509
 
6.4%
교통 413
 
5.2%
복지 302
 
3.8%
인구/가구 238
 
3.0%
Other values (3) 597
 
7.5%

Length

2024-05-11T04:20:33.991735image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
보건 1712
21.6%
문화/관광 1617
20.4%
산업/경제 857
10.8%
환경 590
 
7.4%
na 554
 
7.0%
교육 540
 
6.8%
일반행정 509
 
6.4%
교통 413
 
5.2%
복지 302
 
3.8%
인구/가구 238
 
3.0%
Other values (3) 597
 
7.5%

시스템명
Text

MISSING 

Distinct461
Distinct (%)7.5%
Missing1795
Missing (%)22.6%
Memory size62.1 KiB
2024-05-11T04:20:34.705381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length38
Median length14
Mean length11.922563
Min length3

Characters and Unicode

Total characters73133
Distinct characters369
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique228 ?
Unique (%)3.7%

Sample

1st row공공데이터포탈
2nd row강동구 홈페이지
3rd row강동구 홈페이지
4th row강동구 홈페이지
5th row공공데이터포탈
ValueCountFrequency (%)
지방행정 3020
21.1%
데이터개방 3020
21.1%
인허가 3020
21.1%
시도행정정보시스템 477
 
3.3%
홈페이지 367
 
2.6%
유치원 338
 
2.4%
알리미 338
 
2.4%
서울시 289
 
2.0%
kosis 209
 
1.5%
국가통계포털 209
 
1.5%
Other values (551) 3017
21.1%
2024-05-11T04:20:35.963855image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8304
 
11.4%
6097
 
8.3%
4436
 
6.1%
3702
 
5.1%
3694
 
5.1%
3519
 
4.8%
3265
 
4.5%
3256
 
4.5%
3148
 
4.3%
3104
 
4.2%
Other values (359) 30608
41.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 62446
85.4%
Space Separator 8304
 
11.4%
Uppercase Letter 1511
 
2.1%
Decimal Number 355
 
0.5%
Open Punctuation 129
 
0.2%
Close Punctuation 129
 
0.2%
Other Punctuation 83
 
0.1%
Connector Punctuation 81
 
0.1%
Lowercase Letter 80
 
0.1%
Dash Punctuation 11
 
< 0.1%
Other values (2) 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6097
 
9.8%
4436
 
7.1%
3702
 
5.9%
3694
 
5.9%
3519
 
5.6%
3265
 
5.2%
3256
 
5.2%
3148
 
5.0%
3104
 
5.0%
3100
 
5.0%
Other values (311) 25125
40.2%
Uppercase Letter
ValueCountFrequency (%)
S 521
34.5%
I 249
16.5%
O 245
16.2%
K 210
13.9%
A 73
 
4.8%
T 54
 
3.6%
P 50
 
3.3%
N 21
 
1.4%
D 17
 
1.1%
W 14
 
0.9%
Other values (9) 57
 
3.8%
Decimal Number
ValueCountFrequency (%)
0 167
47.0%
1 93
26.2%
2 87
24.5%
9 2
 
0.6%
6 2
 
0.6%
3 1
 
0.3%
5 1
 
0.3%
4 1
 
0.3%
8 1
 
0.3%
Lowercase Letter
ValueCountFrequency (%)
p 56
70.0%
e 7
 
8.8%
n 7
 
8.8%
s 4
 
5.0%
t 3
 
3.8%
b 3
 
3.8%
Open Punctuation
ValueCountFrequency (%)
( 125
96.9%
[ 3
 
2.3%
1
 
0.8%
Close Punctuation
ValueCountFrequency (%)
) 125
96.9%
] 3
 
2.3%
1
 
0.8%
Other Punctuation
ValueCountFrequency (%)
. 81
97.6%
/ 2
 
2.4%
Math Symbol
ValueCountFrequency (%)
1
50.0%
+ 1
50.0%
Space Separator
ValueCountFrequency (%)
8304
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 81
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 11
100.0%
Control
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 62446
85.4%
Common 9096
 
12.4%
Latin 1591
 
2.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6097
 
9.8%
4436
 
7.1%
3702
 
5.9%
3694
 
5.9%
3519
 
5.6%
3265
 
5.2%
3256
 
5.2%
3148
 
5.0%
3104
 
5.0%
3100
 
5.0%
Other values (311) 25125
40.2%
Latin
ValueCountFrequency (%)
S 521
32.7%
I 249
15.7%
O 245
15.4%
K 210
13.2%
A 73
 
4.6%
p 56
 
3.5%
T 54
 
3.4%
P 50
 
3.1%
N 21
 
1.3%
D 17
 
1.1%
Other values (15) 95
 
6.0%
Common
ValueCountFrequency (%)
8304
91.3%
0 167
 
1.8%
( 125
 
1.4%
) 125
 
1.4%
1 93
 
1.0%
2 87
 
1.0%
. 81
 
0.9%
_ 81
 
0.9%
- 11
 
0.1%
[ 3
 
< 0.1%
Other values (13) 19
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 62446
85.4%
ASCII 10684
 
14.6%
None 2
 
< 0.1%
Arrows 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8304
77.7%
S 521
 
4.9%
I 249
 
2.3%
O 245
 
2.3%
K 210
 
2.0%
0 167
 
1.6%
( 125
 
1.2%
) 125
 
1.2%
1 93
 
0.9%
2 87
 
0.8%
Other values (35) 558
 
5.2%
Hangul
ValueCountFrequency (%)
6097
 
9.8%
4436
 
7.1%
3702
 
5.9%
3694
 
5.9%
3519
 
5.6%
3265
 
5.2%
3256
 
5.2%
3148
 
5.0%
3104
 
5.0%
3100
 
5.0%
Other values (311) 25125
40.2%
Arrows
ValueCountFrequency (%)
1
100.0%
None
ValueCountFrequency (%)
1
50.0%
1
50.0%
Distinct965
Distinct (%)12.2%
Missing7
Missing (%)0.1%
Memory size62.1 KiB
2024-05-11T04:20:36.720050image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length3
Mean length4.7931078
Min length1

Characters and Unicode

Total characters37971
Distinct characters220
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique451 ?
Unique (%)5.7%

Sample

1st row고강섭
2nd row강동구 대표전화
3rd row강동구 대표전화
4th row강동구 대표전화
5th row박성수
ValueCountFrequency (%)
대표전화 2475
23.8%
원은묵 555
 
5.3%
김지연 447
 
4.3%
최성용 420
 
4.0%
전연욱 236
 
2.3%
강북구 191
 
1.8%
강동구 177
 
1.7%
도봉구 175
 
1.7%
강남구 169
 
1.6%
성동구 165
 
1.6%
Other values (961) 5389
51.8%
2024-05-11T04:20:38.032460image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2731
 
7.2%
2630
 
6.9%
2554
 
6.7%
2485
 
6.5%
2484
 
6.5%
2477
 
6.5%
1302
 
3.4%
1122
 
3.0%
943
 
2.5%
931
 
2.5%
Other values (210) 18312
48.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 35477
93.4%
Space Separator 2485
 
6.5%
Other Punctuation 3
 
< 0.1%
Open Punctuation 3
 
< 0.1%
Close Punctuation 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2731
 
7.7%
2630
 
7.4%
2554
 
7.2%
2484
 
7.0%
2477
 
7.0%
1302
 
3.7%
1122
 
3.2%
943
 
2.7%
931
 
2.6%
849
 
2.4%
Other values (206) 17454
49.2%
Space Separator
ValueCountFrequency (%)
2485
100.0%
Other Punctuation
ValueCountFrequency (%)
, 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 35477
93.4%
Common 2494
 
6.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2731
 
7.7%
2630
 
7.4%
2554
 
7.2%
2484
 
7.0%
2477
 
7.0%
1302
 
3.7%
1122
 
3.2%
943
 
2.7%
931
 
2.6%
849
 
2.4%
Other values (206) 17454
49.2%
Common
ValueCountFrequency (%)
2485
99.6%
, 3
 
0.1%
( 3
 
0.1%
) 3
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 35477
93.4%
ASCII 2494
 
6.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2731
 
7.7%
2630
 
7.4%
2554
 
7.2%
2484
 
7.0%
2477
 
7.0%
1302
 
3.7%
1122
 
3.2%
943
 
2.7%
931
 
2.6%
849
 
2.4%
Other values (206) 17454
49.2%
ASCII
ValueCountFrequency (%)
2485
99.6%
, 3
 
0.1%
( 3
 
0.1%
) 3
 
0.1%
Distinct1060
Distinct (%)13.5%
Missing63
Missing (%)0.8%
Memory size62.1 KiB
2024-05-11T04:20:38.747175image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length40
Median length12
Mean length11.812611
Min length6

Characters and Unicode

Total characters92918
Distinct characters13
Distinct categories4 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique557 ?
Unique (%)7.1%

Sample

1st row02-1577-1188
2nd row02-1577-1188
3rd row02-1577-1188
4th row02-2133-4438
5th row02-2286-5114
ValueCountFrequency (%)
02-2133-4366 556
 
7.1%
02-2133-4364 415
 
5.3%
02-2133-4365 334
 
4.2%
02-2133-4368 236
 
3.0%
02-901-6114 191
 
2.4%
02-1577-1188 177
 
2.2%
02-2091-2120 175
 
2.2%
02-2286-5114 165
 
2.1%
02-3396-4114 163
 
2.1%
02-879-5000 152
 
1.9%
Other values (1049) 5305
67.4%
2024-05-11T04:20:39.961112image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 17064
18.4%
- 15693
16.9%
0 12718
13.7%
3 11993
12.9%
1 10868
11.7%
4 7106
7.6%
6 5593
 
6.0%
5 3468
 
3.7%
8 2993
 
3.2%
7 2783
 
3.0%
Other values (3) 2639
 
2.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 77212
83.1%
Dash Punctuation 15693
 
16.9%
Space Separator 10
 
< 0.1%
Other Punctuation 3
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 17064
22.1%
0 12718
16.5%
3 11993
15.5%
1 10868
14.1%
4 7106
9.2%
6 5593
 
7.2%
5 3468
 
4.5%
8 2993
 
3.9%
7 2783
 
3.6%
9 2626
 
3.4%
Dash Punctuation
ValueCountFrequency (%)
- 15693
100.0%
Space Separator
ValueCountFrequency (%)
10
100.0%
Other Punctuation
ValueCountFrequency (%)
, 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 92918
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 17064
18.4%
- 15693
16.9%
0 12718
13.7%
3 11993
12.9%
1 10868
11.7%
4 7106
7.6%
6 5593
 
6.0%
5 3468
 
3.7%
8 2993
 
3.2%
7 2783
 
3.0%
Other values (3) 2639
 
2.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 92918
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 17064
18.4%
- 15693
16.9%
0 12718
13.7%
3 11993
12.9%
1 10868
11.7%
4 7106
7.6%
6 5593
 
6.0%
5 3468
 
3.7%
8 2993
 
3.2%
7 2783
 
3.0%
Other values (3) 2639
 
2.8%

갱신주기
Categorical

HIGH CORRELATION 

Distinct8
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size62.1 KiB
수시
4305 
년간
1694 
월간
655 
주기없음
565 
일간
498 
Other values (3)
 
212

Length

Max length4
Median length2
Mean length2.1425148
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row주기없음
2nd row일간
3rd row일간
4th row일간
5th row주기없음

Common Values

ValueCountFrequency (%)
수시 4305
54.3%
년간 1694
 
21.4%
월간 655
 
8.3%
주기없음 565
 
7.1%
일간 498
 
6.3%
분기 132
 
1.7%
주간 59
 
0.7%
반기 21
 
0.3%

Length

2024-05-11T04:20:40.484095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T04:20:40.904868image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
수시 4305
54.3%
년간 1694
 
21.4%
월간 655
 
8.3%
주기없음 565
 
7.1%
일간 498
 
6.3%
분기 132
 
1.7%
주간 59
 
0.7%
반기 21
 
0.3%
Distinct444
Distinct (%)5.6%
Missing6
Missing (%)0.1%
Memory size62.1 KiB
Minimum2011-01-06 00:00:00
Maximum2024-05-11 00:00:00
2024-05-11T04:20:41.336852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T04:20:41.795089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

제공사이트
Text

MISSING 

Distinct633
Distinct (%)9.6%
Missing1359
Missing (%)17.1%
Memory size62.1 KiB
2024-05-11T04:20:42.643417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length309
Median length229
Mean length34.973668
Min length4

Characters and Unicode

Total characters229777
Distinct characters400
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique380 ?
Unique (%)5.8%

Sample

1st row공공데이터포탈(https://www.data.go.kr/data15030030/fileData.do)
2nd row강동구 홈페이지(https://www.gangdong.go.kr/web/dongrenew/gil/main)
3rd row강동구 홈페이지(https://www.gangdong.go.kr/web/dongrenew/amsa2/main)
4th row강동구 홈페이지(https://www.gangdong.go.kr/web/dongrenew/myeongil2/main)
5th row공공데이터포탈(https://www.data.go.kr/data15084568/fileData.do)
ValueCountFrequency (%)
지방행정 3020
20.1%
데이터개방(http://localdata.kr 3020
20.1%
인허가 3020
20.1%
시도행정정보시스템 477
 
3.2%
알리미(https://e-childschoolinfo.moe.go.kr 338
 
2.3%
유치원 338
 
2.3%
https://news.seoul.go.kr/welfare/?p=530461 306
 
2.0%
서울시 289
 
1.9%
kosis 209
 
1.4%
공공서비스예약(http://yeyak.seoul.go.kr 158
 
1.1%
Other values (828) 3818
25.5%
2024-05-11T04:20:43.928600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
t 15636
 
6.8%
/ 13992
 
6.1%
a 11611
 
5.1%
. 11169
 
4.9%
o 10151
 
4.4%
l 9310
 
4.1%
8445
 
3.7%
r 6819
 
3.0%
p 6707
 
2.9%
h 6694
 
2.9%
Other values (390) 129243
56.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 109040
47.5%
Other Letter 62473
27.2%
Other Punctuation 31407
 
13.7%
Space Separator 8445
 
3.7%
Close Punctuation 5829
 
2.5%
Open Punctuation 5829
 
2.5%
Decimal Number 3389
 
1.5%
Uppercase Letter 2220
 
1.0%
Math Symbol 549
 
0.2%
Dash Punctuation 442
 
0.2%
Other values (2) 154
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6097
 
9.8%
4436
 
7.1%
3702
 
5.9%
3697
 
5.9%
3519
 
5.6%
3265
 
5.2%
3259
 
5.2%
3151
 
5.0%
3104
 
5.0%
3100
 
5.0%
Other values (311) 25143
40.2%
Lowercase Letter
ValueCountFrequency (%)
t 15636
14.3%
a 11611
10.6%
o 10151
 
9.3%
l 9310
 
8.5%
r 6819
 
6.3%
p 6707
 
6.2%
h 6694
 
6.1%
k 6377
 
5.8%
s 5236
 
4.8%
e 5059
 
4.6%
Other values (16) 25440
23.3%
Uppercase Letter
ValueCountFrequency (%)
S 568
25.6%
I 313
14.1%
O 259
11.7%
K 222
 
10.0%
A 132
 
5.9%
P 120
 
5.4%
D 116
 
5.2%
M 86
 
3.9%
C 77
 
3.5%
N 63
 
2.8%
Other values (13) 264
11.9%
Decimal Number
ValueCountFrequency (%)
0 766
22.6%
1 669
19.7%
5 400
11.8%
3 388
11.4%
6 376
11.1%
4 372
11.0%
2 187
 
5.5%
9 91
 
2.7%
7 80
 
2.4%
8 60
 
1.8%
Other Punctuation
ValueCountFrequency (%)
/ 13992
44.6%
. 11169
35.6%
: 5696
18.1%
? 378
 
1.2%
& 169
 
0.5%
# 2
 
< 0.1%
% 1
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 5825
99.9%
] 3
 
0.1%
1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 5825
99.9%
[ 3
 
0.1%
1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
= 547
99.6%
+ 1
 
0.2%
1
 
0.2%
Space Separator
ValueCountFrequency (%)
8445
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 442
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 151
100.0%
Control
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 111260
48.4%
Hangul 62473
27.2%
Common 56044
24.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6097
 
9.8%
4436
 
7.1%
3702
 
5.9%
3697
 
5.9%
3519
 
5.6%
3265
 
5.2%
3259
 
5.2%
3151
 
5.0%
3104
 
5.0%
3100
 
5.0%
Other values (311) 25143
40.2%
Latin
ValueCountFrequency (%)
t 15636
14.1%
a 11611
10.4%
o 10151
 
9.1%
l 9310
 
8.4%
r 6819
 
6.1%
p 6707
 
6.0%
h 6694
 
6.0%
k 6377
 
5.7%
s 5236
 
4.7%
e 5059
 
4.5%
Other values (39) 27660
24.9%
Common
ValueCountFrequency (%)
/ 13992
25.0%
. 11169
19.9%
8445
15.1%
) 5825
10.4%
( 5825
10.4%
: 5696
10.2%
0 766
 
1.4%
1 669
 
1.2%
= 547
 
1.0%
- 442
 
0.8%
Other values (20) 2668
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 167301
72.8%
Hangul 62473
 
27.2%
None 2
 
< 0.1%
Arrows 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
t 15636
 
9.3%
/ 13992
 
8.4%
a 11611
 
6.9%
. 11169
 
6.7%
o 10151
 
6.1%
l 9310
 
5.6%
8445
 
5.0%
r 6819
 
4.1%
p 6707
 
4.0%
h 6694
 
4.0%
Other values (66) 66767
39.9%
Hangul
ValueCountFrequency (%)
6097
 
9.8%
4436
 
7.1%
3702
 
5.9%
3697
 
5.9%
3519
 
5.6%
3265
 
5.2%
3259
 
5.2%
3151
 
5.0%
3104
 
5.0%
3100
 
5.0%
Other values (311) 25143
40.2%
None
ValueCountFrequency (%)
1
50.0%
1
50.0%
Arrows
ValueCountFrequency (%)
1
100.0%

제공형식
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct42
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size62.1 KiB
Sheet,Api
4853 
Sheet,Chart
1761 
File
 
409
Api,Sheet
 
227
Sheet,Api,File
 
131
Other values (37)
548 

Length

Max length24
Median length9
Mean length9.509522
Min length3

Unique

Unique8 ?
Unique (%)0.1%

Sample

1st rowFile
2nd rowSheet,Api
3rd rowSheet,Api
4th rowSheet,Api
5th rowFile

Common Values

ValueCountFrequency (%)
Sheet,Api 4853
61.2%
Sheet,Chart 1761
 
22.2%
File 409
 
5.2%
Api,Sheet 227
 
2.9%
Sheet,Api,File 131
 
1.7%
Link 63
 
0.8%
Api 60
 
0.8%
Sheet,File,Api 60
 
0.8%
File,Sheet,Api 57
 
0.7%
Chart,Sheet,Api 49
 
0.6%
Other values (32) 259
 
3.3%

Length

2024-05-11T04:20:44.529131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
sheet,api 4853
61.2%
sheet,chart 1761
 
22.2%
file 409
 
5.2%
api,sheet 227
 
2.9%
sheet,api,file 131
 
1.7%
link 63
 
0.8%
api 60
 
0.8%
sheet,file,api 60
 
0.8%
file,sheet,api 57
 
0.7%
chart,sheet,api 49
 
0.6%
Other values (32) 259
 
3.3%

서비스URL
Text

UNIQUE 

Distinct7929
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size62.1 KiB
2024-05-11T04:20:45.335530image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length71
Median length60
Mean length59.974019
Min length53

Characters and Unicode

Total characters475534
Distinct characters56
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7929 ?
Unique (%)100.0%

Sample

1st rowhttp://data.seoul.go.kr/dataList/OA-22208/F/1/datasetView.do
2nd rowhttp://data.gd.go.kr/openinf/sheetview.jsp?infId=OA-12611
3rd rowhttp://data.gd.go.kr/openinf/sheetview.jsp?infId=OA-12621
4th rowhttp://data.gd.go.kr/openinf/sheetview.jsp?infId=OA-12615
5th rowhttp://data.seoul.go.kr/dataList/OA-22224/F/1/datasetView.do
ValueCountFrequency (%)
http://data.seoul.go.kr/datalist/oa-22208/f/1/datasetview.do 1
 
< 0.1%
http://data.seoul.go.kr/datalist/oa-20757/s/1/datasetview.do 1
 
< 0.1%
http://data.seoul.go.kr/datalist/oa-20796/s/1/datasetview.do 1
 
< 0.1%
http://data.seoul.go.kr/datalist/oa-20860/s/1/datasetview.do 1
 
< 0.1%
http://data.seoul.go.kr/datalist/oa-20872/s/1/datasetview.do 1
 
< 0.1%
http://data.seoul.go.kr/datalist/oa-20785/s/1/datasetview.do 1
 
< 0.1%
http://data.seoul.go.kr/datalist/oa-20591/s/1/datasetview.do 1
 
< 0.1%
http://data.seoul.go.kr/datalist/oa-20642/s/1/datasetview.do 1
 
< 0.1%
http://data.seoul.go.kr/datalist/oa-20618/s/1/datasetview.do 1
 
< 0.1%
http://data.seoul.go.kr/datalist/oa-20878/s/1/datasetview.do 1
 
< 0.1%
Other values (7919) 7919
99.9%
2024-05-11T04:20:46.751705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 50409
 
10.6%
t 50318
 
10.6%
a 41796
 
8.8%
. 31716
 
6.7%
d 29001
 
6.1%
e 25890
 
5.4%
o 23243
 
4.9%
s 22640
 
4.8%
i 17760
 
3.7%
1 13957
 
2.9%
Other values (46) 168804
35.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 293659
61.8%
Other Punctuation 91752
 
19.3%
Decimal Number 47885
 
10.1%
Uppercase Letter 34372
 
7.2%
Dash Punctuation 6168
 
1.3%
Math Symbol 1698
 
0.4%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
t 50318
17.1%
a 41796
14.2%
d 29001
9.9%
e 25890
8.8%
o 23243
7.9%
s 22640
7.7%
i 17760
 
6.0%
p 11641
 
4.0%
h 9643
 
3.3%
g 9310
 
3.2%
Other values (13) 52417
17.8%
Uppercase Letter
ValueCountFrequency (%)
L 6317
18.4%
O 6299
18.3%
A 6250
18.2%
V 6231
18.1%
S 5742
16.7%
I 1701
 
4.9%
D 612
 
1.8%
T 554
 
1.6%
F 445
 
1.3%
E 118
 
0.3%
Other values (7) 103
 
0.3%
Decimal Number
ValueCountFrequency (%)
1 13957
29.1%
2 7455
15.6%
0 6519
13.6%
6 3167
 
6.6%
7 3118
 
6.5%
3 2988
 
6.2%
8 2815
 
5.9%
9 2758
 
5.8%
5 2667
 
5.6%
4 2441
 
5.1%
Other Punctuation
ValueCountFrequency (%)
/ 50409
54.9%
. 31716
34.6%
: 7929
 
8.6%
? 1698
 
1.9%
Dash Punctuation
ValueCountFrequency (%)
- 6168
100.0%
Math Symbol
ValueCountFrequency (%)
= 1698
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 328031
69.0%
Common 147503
31.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
t 50318
15.3%
a 41796
12.7%
d 29001
 
8.8%
e 25890
 
7.9%
o 23243
 
7.1%
s 22640
 
6.9%
i 17760
 
5.4%
p 11641
 
3.5%
h 9643
 
2.9%
g 9310
 
2.8%
Other values (30) 86789
26.5%
Common
ValueCountFrequency (%)
/ 50409
34.2%
. 31716
21.5%
1 13957
 
9.5%
: 7929
 
5.4%
2 7455
 
5.1%
0 6519
 
4.4%
- 6168
 
4.2%
6 3167
 
2.1%
7 3118
 
2.1%
3 2988
 
2.0%
Other values (6) 14077
 
9.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 475534
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 50409
 
10.6%
t 50318
 
10.6%
a 41796
 
8.8%
. 31716
 
6.7%
d 29001
 
6.1%
e 25890
 
5.4%
o 23243
 
4.9%
s 22640
 
4.8%
i 17760
 
3.7%
1 13957
 
2.9%
Other values (46) 168804
35.5%

Correlations

2024-05-11T04:20:47.062420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
대분류중분류제공기관소분류갱신주기제공형식
대분류1.0000.4740.7790.6220.9921.000
중분류0.4741.0000.9840.7660.5830.780
제공기관0.7790.9841.0000.6570.7490.812
소분류0.6220.7660.6571.0000.6730.674
갱신주기0.9920.5830.7490.6731.0000.843
제공형식1.0000.7800.8120.6740.8431.000
2024-05-11T04:20:47.423631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
갱신주기소분류대분류중분류제공형식
갱신주기1.0000.3610.9220.3720.518
소분류0.3611.0000.4880.4090.276
대분류0.9220.4881.0000.3420.997
중분류0.3720.4090.3421.0000.442
제공형식0.5180.2760.9970.4421.000
2024-05-11T04:20:47.941042image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
대분류중분류소분류갱신주기제공형식
대분류1.0000.3420.4880.9220.997
중분류0.3421.0000.4090.3720.442
소분류0.4880.4091.0000.3610.276
갱신주기0.9220.3720.3611.0000.518
제공형식0.9970.4420.2760.5181.000

Missing values

2024-05-11T04:20:23.470049image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-11T04:20:24.507616image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-05-11T04:20:25.020075image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

서비스 ID서비스명대분류중분류제공기관제공부서명소분류시스템명담당자명담당자연락처갱신주기최종갱신일자제공사이트제공형식서비스URL
0OA-22208서울시 고시원 정보(2020년)공공데이터서울시(본청)서울특별시소방재난본부 예방과주택/건설공공데이터포탈고강섭<NA>주기없음<NA>공공데이터포탈(https://www.data.go.kr/data15030030/fileData.do)Filehttp://data.seoul.go.kr/dataList/OA-22208/F/1/datasetView.do
1OA-12611서울시 강동구 길동 주민센터 새소식 정보공공데이터자치구 및 자치구산하강동구강동구 길동일반행정강동구 홈페이지강동구 대표전화02-1577-1188일간<NA>강동구 홈페이지(https://www.gangdong.go.kr/web/dongrenew/gil/main)Sheet,Apihttp://data.gd.go.kr/openinf/sheetview.jsp?infId=OA-12611
2OA-12621서울시 강동구 암사2동 주민센터 새소식 정보공공데이터자치구 및 자치구산하강동구강동구 암사2동일반행정강동구 홈페이지강동구 대표전화02-1577-1188일간<NA>강동구 홈페이지(https://www.gangdong.go.kr/web/dongrenew/amsa2/main)Sheet,Apihttp://data.gd.go.kr/openinf/sheetview.jsp?infId=OA-12621
3OA-12615서울시 강동구 명일2동 주민센터 새소식 정보공공데이터자치구 및 자치구산하강동구강동구 명일2동일반행정강동구 홈페이지강동구 대표전화02-1577-1188일간<NA>강동구 홈페이지(https://www.gangdong.go.kr/web/dongrenew/myeongil2/main)Sheet,Apihttp://data.gd.go.kr/openinf/sheetview.jsp?infId=OA-12615
4OA-22224서울시 하천수위 관측 데이터(2018~2020년)공공데이터서울시(본청)서울특별시도시교통실 교통기획관 교통운영과도시관리공공데이터포탈박성수<NA>주기없음<NA>공공데이터포탈(https://www.data.go.kr/data15084568/fileData.do)Filehttp://data.seoul.go.kr/dataList/OA-22224/F/1/datasetView.do
5OA-22237서울시 대기오염물질 일별 배출량(2019~2021년)공공데이터서울시(본청)서울특별시기후환경본부 대기정책과환경공공데이터포털공재원02-2133-4438주기없음<NA>공공데이터포털(https://www.data.go.kr/data15101604/fileData.do)Filehttp://data.seoul.go.kr/dataList/OA-22237/F/1/datasetView.do
6OA-12662서울시 성동구 금호4가동 소식 정보공공데이터자치구 및 자치구산하성동구성동구 스마트포용도시국 정보통신과일반행정성동구 홈페이지성동구 대표전화02-2286-5114일간2024-05-11성동구 홈페이지(http://www.sd.go.kr)Sheet,Apihttp://data.sd.go.kr/openinf/sheetview.jsp?infId=OA-12662
7OA-17021서울시 광진구 시내순환관광업 인허가 정보공공데이터서울시(본청)광진구광진구 행정국 문화예술과문화/관광지방행정 인허가 데이터개방문연희02-450-7582수시2024-05-11지방행정 인허가 데이터개방(http://localdata.kr)Sheet,Apihttp://data.seoul.go.kr/dataList/OA-17021/S/1/datasetView.do
8OA-10916서울시 강북구 식품위생업소 식품수거검사 현황공공데이터자치구 및 자치구산하강북구강북구 행정관리국 디지털정보과보건시도행정정보시스템강북구 대표전화02-901-6114수시2024-05-11시도행정정보시스템Api,Sheethttp://data.gangbuk.go.kr/openinf/sheetview.jsp?infId=OA-10916
9OA-16453서울시 송파구 의료기기판매(임대)업 인허가 정보공공데이터서울시(본청)송파구스마트도시정책관 빅데이터담당관보건지방행정 인허가 데이터개방송파구 대표전화02-2147-2000수시2024-05-11지방행정 인허가 데이터개방(http://localdata.kr)Sheet,Apihttp://data.seoul.go.kr/dataList/OA-16453/S/1/datasetView.do
서비스 ID서비스명대분류중분류제공기관제공부서명소분류시스템명담당자명담당자연락처갱신주기최종갱신일자제공사이트제공형식서비스URL
7919OA-351서울시 연평균최고기온 2006년 위치정보 (좌표계: ITRF2000)공공데이터서울시(본청)서울특별시기후환경본부 대기정책과환경서울시 기후.에너지 지도(2010)이경옥02-2133-3597주기없음2012-01-13서울시 기후.에너지 지도(2010)MAP,File,Api,Sheet,Charthttp://data.seoul.go.kr/dataList/OA-351/S/1/datasetView.do
7920OA-353서울시 연평균최고기온 2008년 위치정보 (좌표계: ITRF2000)공공데이터서울시(본청)서울특별시기후환경본부 대기정책과환경서울시 기후.에너지 지도(2010)이경옥02-2133-3597주기없음2012-01-13서울시 기후.에너지 지도(2010)MAP,File,Api,Sheet,Charthttp://data.seoul.go.kr/dataList/OA-353/S/1/datasetView.do
7921OA-357서울시 연평균최저기온 2008년 위치정보 (좌표계: ITRF2000)공공데이터서울시(본청)서울특별시기후환경본부 대기정책과환경서울시 기후.에너지 지도(2010)이경옥02-2133-3597주기없음2012-01-13서울시 기후.에너지 지도(2010)MAP,File,Sheet,Api,Charthttp://data.seoul.go.kr/dataList/OA-357/S/1/datasetView.do
7922OA-329서울시 행정동별 전력 사용량 2007년 위치정보 (좌표계: ITRF2000)공공데이터서울시(본청)서울특별시기후환경본부 대기정책과산업/경제서울시 기후.에너지 지도(2010)이경옥02-2133-3597주기없음2012-01-13서울시 기후.에너지 지도(2010)MAP,File,Sheet,Apihttp://data.seoul.go.kr/dataList/OA-329/S/1/datasetView.do
7923OA-334서울시 행정동별 가스 사용량 2008년 위치정보 (좌표계: ITRF2000)공공데이터서울시(본청)서울특별시기후환경본부 대기정책과산업/경제서울시 기후.에너지 지도(2010)이경옥02-2133-3597주기없음2012-01-13서울시 기후.에너지 지도(2010)MAP,File,Sheet,Apihttp://data.seoul.go.kr/dataList/OA-334/S/1/datasetView.do
7924OA-359서울시 여름철 평균기온 위치정보 (1998~2009년) (좌표계: ITRF2000)공공데이터서울시(본청)서울특별시기후환경본부 대기정책과환경서울시 기후.에너지 지도(2010)이경옥02-2133-3597주기없음2012-01-13서울시 기후.에너지 지도(2010)MAP,File,Api,Sheet,Charthttp://data.seoul.go.kr/dataList/OA-359/S/1/datasetView.do
7925OA-324서울시 행정동별 상수도 사용량 2006년 위치정보 (좌표계: ITRF2000)공공데이터서울시(본청)서울특별시기후환경본부 대기정책과산업/경제서울시 기후.에너지 지도(2010)이경옥02-2133-3597주기없음2012-01-13서울시 기후.에너지 지도(2010)MAP,File,Sheet,Apihttp://data.seoul.go.kr/dataList/OA-324/S/1/datasetView.do
7926OA-330서울시 행정동별 전력 사용량 2008년 위치정보 (좌표계: ITRF2000)공공데이터서울시(본청)서울특별시기후환경본부 대기정책과산업/경제서울시 기후.에너지 지도(2010)이경옥02-2133-3597주기없음2012-01-13서울시 기후.에너지 지도(2010)MAP,File,Sheet,Apihttp://data.seoul.go.kr/dataList/OA-330/S/1/datasetView.do
7927OA-349서울시 연평균기온 2008년 위치정보 (좌표계: ITRF2000)공공데이터서울시(본청)서울특별시기후환경본부 대기정책과환경서울시 기후.에너지 지도(2010)이경옥02-2133-3597주기없음2012-01-13서울시 기후.에너지 지도(2010)MAP,File,Api,Sheet,Charthttp://data.seoul.go.kr/dataList/OA-349/S/1/datasetView.do
7928OA-2532서울시 사회적기업 지원 기관 목록 정보공공데이터서울시(본청)서울특별시노동공정상생정책관 소상공인담당관산업/경제서울 사회적경제 포털이경훈02-2133-5491일간2011-01-06서울 사회적경제 포털(http://sehub.net/)Sheet,Apihttp://data.seoul.go.kr/dataList/OA-2532/S/1/datasetView.do