Dataset statistics
Number of variables | 4 |
---|---|
Number of observations | 2933 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 91.8 KiB |
Average record size in memory | 32.0 B |
Variable types
Text | 4 |
---|
Dataset
Description | 비점오염원관리 정보시스템의 자료를 관리하기 위한 기초 데이터로서 사업종류, 기상상태, 업종 및 하천 수계에 대한 세부 코드 정보를 제공합니다. |
---|---|
Author | 한국환경공단 |
URL | https://www.data.go.kr/data/15070122/fileData.do |
Reproduction
Analysis started | 2024-03-16 04:25:22.673549 |
---|---|
Analysis finished | 2024-03-16 04:25:23.893195 |
Duration | 1.22 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
코드종류
Text
Distinct | 96 |
---|---|
Distinct (%) | 3.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 23.0 KiB |
Value | Count | Frequency (%) |
fc10005 | 2021 | |
ms10003 | 179 | 6.1% |
fc10029 | 59 | 2.0% |
ms10004 | 42 | 1.4% |
fc10008 | 40 | 1.4% |
fc10048 | 36 | 1.2% |
fc10003 | 32 | 1.1% |
cd10007 | 26 | 0.9% |
ms10006 | 25 | 0.9% |
fc10039 | 23 | 0.8% |
Other values (86) | 450 | 15.3% |
Most occurring characters
Value | Count | Frequency (%) |
0 | 8381 | |
1 | 3151 | 15.3% |
C | 2535 | 12.3% |
F | 2414 | 11.8% |
5 | 2061 | 10.0% |
3 | 358 | 1.7% |
M | 348 | 1.7% |
S | 348 | 1.7% |
2 | 188 | 0.9% |
4 | 167 | 0.8% |
Other values (10) | 577 | 2.8% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 14660 | |
Uppercase Letter | 5868 |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 8381 | |
1 | 3151 | 21.5% |
5 | 2061 | 14.1% |
3 | 358 | 2.4% |
2 | 188 | 1.3% |
4 | 167 | 1.1% |
9 | 123 | 0.8% |
8 | 108 | 0.7% |
6 | 68 | 0.5% |
7 | 55 | 0.4% |
Uppercase Letter
Value | Count | Frequency (%) |
C | 2535 | |
F | 2414 | |
M | 348 | 5.9% |
S | 348 | 5.9% |
D | 104 | 1.8% |
R | 50 | 0.9% |
V | 49 | 0.8% |
N | 17 | 0.3% |
O | 2 | < 0.1% |
T | 1 | < 0.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 14660 | |
Latin | 5868 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 8381 | |
1 | 3151 | 21.5% |
5 | 2061 | 14.1% |
3 | 358 | 2.4% |
2 | 188 | 1.3% |
4 | 167 | 1.1% |
9 | 123 | 0.8% |
8 | 108 | 0.7% |
6 | 68 | 0.5% |
7 | 55 | 0.4% |
Latin
Value | Count | Frequency (%) |
C | 2535 | |
F | 2414 | |
M | 348 | 5.9% |
S | 348 | 5.9% |
D | 104 | 1.8% |
R | 50 | 0.9% |
V | 49 | 0.8% |
N | 17 | 0.3% |
O | 2 | < 0.1% |
T | 1 | < 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 20528 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 8381 | |
1 | 3151 | 15.3% |
C | 2535 | 12.3% |
F | 2414 | 11.8% |
5 | 2061 | 10.0% |
3 | 358 | 1.7% |
M | 348 | 1.7% |
S | 348 | 1.7% |
2 | 188 | 0.9% |
4 | 167 | 0.8% |
Other values (10) | 577 | 2.8% |
코드종류명
Text
Distinct | 94 |
---|---|
Distinct (%) | 3.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 23.0 KiB |
Length
Max length | 27 |
---|---|
Median length | 20 |
Mean length | 17.890215 |
Min length | 5 |
Characters and Unicode
Total characters | 52472 |
---|---|
Distinct characters | 158 |
Distinct categories | 6 ? |
Distinct scripts | 3 ? |
Distinct blocks | 2 ? |
Unique
Unique | 2 ? |
---|---|
Unique (%) | 0.1% |
Sample
1st row | 측정소_운영점검일지_점검_항목_코드 |
---|---|
2nd row | 비용_구분_코드 |
3rd row | 사업전후_구분_코드 |
4th row | 사업전후_구분_코드 |
5th row | 사업전후_구분_코드 |
Value | Count | Frequency (%) |
사업_종류_코드(표준산업_분류_코드 | 2021 | |
측정소_운영점검일지_점검_항목_코드 | 179 | 6.0% |
장비_종류_코드 | 67 | 2.3% |
주요관리_대상_물질_코드 | 59 | 2.0% |
저감_시설_종류_코드 | 40 | 1.4% |
저감_시설_종류_코드(국고보조 | 36 | 1.2% |
개발사업_종류_코드 | 32 | 1.1% |
첨부파일_업무_구분_코드 | 26 | 0.9% |
국고보조_유지관리_재해_민원_폐기물처리_항목_코드 | 23 | 0.8% |
변경신고_이력_변경대상코드 | 22 | 0.7% |
Other values (88) | 456 | 15.4% |
Most occurring characters
Value | Count | Frequency (%) |
_ | 10555 | |
코 | 4940 | 9.4% |
드 | 4940 | 9.4% |
류 | 4265 | 8.1% |
업 | 4174 | 8.0% |
종 | 2241 | 4.3% |
분 | 2171 | 4.1% |
사 | 2129 | 4.1% |
) | 2083 | 4.0% |
( | 2083 | 4.0% |
Other values (148) | 12891 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 37617 | |
Connector Punctuation | 10555 | 20.1% |
Close Punctuation | 2083 | 4.0% |
Open Punctuation | 2083 | 4.0% |
Uppercase Letter | 84 | 0.2% |
Space Separator | 50 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
코 | 4940 | |
드 | 4940 | |
류 | 4265 | |
업 | 4174 | |
종 | 2241 | 6.0% |
분 | 2171 | 5.8% |
사 | 2129 | 5.7% |
산 | 2028 | 5.4% |
표 | 2024 | 5.4% |
준 | 2021 | 5.4% |
Other values (139) | 6684 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 52 | |
M | 26 | |
U | 2 | 2.4% |
R | 2 | 2.4% |
L | 2 | 2.4% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 10555 |
Close Punctuation
Value | Count | Frequency (%) |
) | 2083 |
Open Punctuation
Value | Count | Frequency (%) |
( | 2083 |
Space Separator
Value | Count | Frequency (%) |
50 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 37617 | |
Common | 14771 | 28.2% |
Latin | 84 | 0.2% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
코 | 4940 | |
드 | 4940 | |
류 | 4265 | |
업 | 4174 | |
종 | 2241 | 6.0% |
분 | 2171 | 5.8% |
사 | 2129 | 5.7% |
산 | 2028 | 5.4% |
표 | 2024 | 5.4% |
준 | 2021 | 5.4% |
Other values (139) | 6684 |
Latin
Value | Count | Frequency (%) |
S | 52 | |
M | 26 | |
U | 2 | 2.4% |
R | 2 | 2.4% |
L | 2 | 2.4% |
Common
Value | Count | Frequency (%) |
_ | 10555 | |
) | 2083 | 14.1% |
( | 2083 | 14.1% |
50 | 0.3% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 37617 | |
ASCII | 14855 | 28.3% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
_ | 10555 | |
) | 2083 | 14.0% |
( | 2083 | 14.0% |
S | 52 | 0.4% |
50 | 0.3% | |
M | 26 | 0.2% |
U | 2 | < 0.1% |
R | 2 | < 0.1% |
L | 2 | < 0.1% |
Hangul
Value | Count | Frequency (%) |
코 | 4940 | |
드 | 4940 | |
류 | 4265 | |
업 | 4174 | |
종 | 2241 | 6.0% |
분 | 2171 | 5.8% |
사 | 2129 | 5.7% |
산 | 2028 | 5.4% |
표 | 2024 | 5.4% |
준 | 2021 | 5.4% |
Other values (139) | 6684 |
세부코드
Text
Distinct | 2502 |
---|---|
Distinct (%) | 85.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 23.0 KiB |
Value | Count | Frequency (%) |
1 | 29 | 1.0% |
2 | 25 | 0.9% |
3 | 15 | 0.5% |
11 | 13 | 0.4% |
13 | 11 | 0.4% |
4 | 11 | 0.4% |
5 | 11 | 0.4% |
12 | 11 | 0.4% |
0 | 11 | 0.4% |
6 | 10 | 0.3% |
Other values (2491) | 2786 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 2206 | |
2 | 1985 | |
0 | 1332 | |
9 | 971 | |
3 | 965 | |
4 | 929 | |
6 | 618 | 5.3% |
5 | 598 | 5.1% |
7 | 502 | 4.3% |
8 | 349 | 3.0% |
Other values (35) | 1293 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 10455 | |
Uppercase Letter | 1272 | 10.8% |
Lowercase Letter | 21 | 0.2% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
D | 175 | |
R | 139 | 10.9% |
E | 98 | 7.7% |
C | 89 | 7.0% |
T | 78 | 6.1% |
I | 76 | 6.0% |
A | 75 | 5.9% |
B | 73 | 5.7% |
M | 56 | 4.4% |
F | 52 | 4.1% |
Other values (16) | 361 |
Decimal Number
Value | Count | Frequency (%) |
1 | 2206 | |
2 | 1985 | |
0 | 1332 | |
9 | 971 | |
3 | 965 | |
4 | 929 | |
6 | 618 | 5.9% |
5 | 598 | 5.7% |
7 | 502 | 4.8% |
8 | 349 | 3.3% |
Lowercase Letter
Value | Count | Frequency (%) |
t | 5 | |
m | 5 | |
h | 4 | |
p | 2 | 9.5% |
s | 1 | 4.8% |
e | 1 | 4.8% |
c | 1 | 4.8% |
r | 1 | 4.8% |
a | 1 | 4.8% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 10455 | |
Latin | 1293 | 11.0% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
D | 175 | |
R | 139 | 10.8% |
E | 98 | 7.6% |
C | 89 | 6.9% |
T | 78 | 6.0% |
I | 76 | 5.9% |
A | 75 | 5.8% |
B | 73 | 5.6% |
M | 56 | 4.3% |
F | 52 | 4.0% |
Other values (25) | 382 |
Common
Value | Count | Frequency (%) |
1 | 2206 | |
2 | 1985 | |
0 | 1332 | |
9 | 971 | |
3 | 965 | |
4 | 929 | |
6 | 618 | 5.9% |
5 | 598 | 5.7% |
7 | 502 | 4.8% |
8 | 349 | 3.3% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 11748 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 2206 | |
2 | 1985 | |
0 | 1332 | |
9 | 971 | |
3 | 965 | |
4 | 929 | |
6 | 618 | 5.3% |
5 | 598 | 5.1% |
7 | 502 | 4.3% |
8 | 349 | 3.0% |
Other values (35) | 1293 |
세부코드명
Text
Distinct | 2460 |
---|---|
Distinct (%) | 83.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 23.0 KiB |
Length
Max length | 60 |
---|---|
Median length | 40 |
Mean length | 11.591544 |
Min length | 1 |
Characters and Unicode
Total characters | 33998 |
---|---|
Distinct characters | 594 |
Distinct categories | 11 ? |
Distinct scripts | 3 ? |
Distinct blocks | 5 ? |
Unique
Unique | 2100 ? |
---|---|
Unique (%) | 71.6% |
Sample
1st row | 채수장비(펌프, 맨홀, 켄틸러버) 정상작동 확인 |
---|---|
2nd row | 기타(부대비용 등) |
3rd row | 사업전 |
4th row | 공사중 |
5th row | 사업후 |
Value | Count | Frequency (%) |
및 | 978 | 9.9% |
제조업 | 679 | 6.9% |
기타 | 389 | 4.0% |
서비스업 | 155 | 1.6% |
확인 | 127 | 1.3% |
도매업 | 121 | 1.2% |
소매업 | 107 | 1.1% |
그 | 99 | 1.0% |
외 | 89 | 0.9% |
운영업 | 76 | 0.8% |
Other values (2532) | 7028 |
Most occurring characters
Value | Count | Frequency (%) |
6915 | 20.3% | |
업 | 2016 | 5.9% |
제 | 1037 | 3.1% |
및 | 978 | 2.9% |
기 | 932 | 2.7% |
조 | 878 | 2.6% |
품 | 424 | 1.2% |
타 | 413 | 1.2% |
비 | 407 | 1.2% |
, | 396 | 1.2% |
Other values (584) | 19602 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 25802 | |
Space Separator | 6915 | 20.3% |
Other Punctuation | 444 | 1.3% |
Lowercase Letter | 299 | 0.9% |
Uppercase Letter | 234 | 0.7% |
Decimal Number | 97 | 0.3% |
Close Punctuation | 70 | 0.2% |
Open Punctuation | 70 | 0.2% |
Connector Punctuation | 49 | 0.1% |
Dash Punctuation | 17 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
업 | 2016 | 7.8% |
제 | 1037 | 4.0% |
및 | 978 | 3.8% |
기 | 932 | 3.6% |
조 | 878 | 3.4% |
품 | 424 | 1.6% |
타 | 413 | 1.6% |
비 | 407 | 1.6% |
용 | 375 | 1.5% |
물 | 361 | 1.4% |
Other values (518) | 17981 |
Uppercase Letter
Value | Count | Frequency (%) |
P | 27 | |
O | 25 | |
T | 22 | 9.4% |
S | 20 | 8.5% |
N | 19 | 8.1% |
D | 17 | 7.3% |
C | 12 | 5.1% |
L | 11 | 4.7% |
B | 10 | 4.3% |
H | 10 | 4.3% |
Other values (12) | 61 |
Lowercase Letter
Value | Count | Frequency (%) |
a | 43 | |
r | 41 | |
e | 29 | 9.7% |
s | 22 | 7.4% |
i | 18 | 6.0% |
w | 15 | 5.0% |
m | 14 | 4.7% |
t | 14 | 4.7% |
l | 13 | 4.3% |
o | 13 | 4.3% |
Other values (11) | 77 |
Decimal Number
Value | Count | Frequency (%) |
1 | 32 | |
0 | 19 | |
2 | 12 | 12.4% |
3 | 10 | 10.3% |
5 | 6 | 6.2% |
6 | 5 | 5.2% |
4 | 5 | 5.2% |
7 | 4 | 4.1% |
9 | 2 | 2.1% |
8 | 2 | 2.1% |
Other Punctuation
Value | Count | Frequency (%) |
, | 396 | |
/ | 11 | 2.5% |
· | 10 | 2.3% |
; | 10 | 2.3% |
: | 8 | 1.8% |
. | 7 | 1.6% |
& | 2 | 0.5% |
Space Separator
Value | Count | Frequency (%) |
6915 |
Close Punctuation
Value | Count | Frequency (%) |
) | 70 |
Open Punctuation
Value | Count | Frequency (%) |
( | 70 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 49 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 17 |
Other Symbol
Value | Count | Frequency (%) |
㎡ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 25802 | |
Common | 7663 | 22.5% |
Latin | 533 | 1.6% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
업 | 2016 | 7.8% |
제 | 1037 | 4.0% |
및 | 978 | 3.8% |
기 | 932 | 3.6% |
조 | 878 | 3.4% |
품 | 424 | 1.6% |
타 | 413 | 1.6% |
비 | 407 | 1.6% |
용 | 375 | 1.5% |
물 | 361 | 1.4% |
Other values (518) | 17981 |
Latin
Value | Count | Frequency (%) |
a | 43 | 8.1% |
r | 41 | 7.7% |
e | 29 | 5.4% |
P | 27 | 5.1% |
O | 25 | 4.7% |
T | 22 | 4.1% |
s | 22 | 4.1% |
S | 20 | 3.8% |
N | 19 | 3.6% |
i | 18 | 3.4% |
Other values (33) | 267 |
Common
Value | Count | Frequency (%) |
6915 | ||
, | 396 | 5.2% |
) | 70 | 0.9% |
( | 70 | 0.9% |
_ | 49 | 0.6% |
1 | 32 | 0.4% |
0 | 19 | 0.2% |
- | 17 | 0.2% |
2 | 12 | 0.2% |
/ | 11 | 0.1% |
Other values (13) | 72 | 0.9% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 25742 | |
ASCII | 8185 | 24.1% |
Compat Jamo | 60 | 0.2% |
None | 10 | < 0.1% |
CJK Compat | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
6915 | ||
, | 396 | 4.8% |
) | 70 | 0.9% |
( | 70 | 0.9% |
_ | 49 | 0.6% |
a | 43 | 0.5% |
r | 41 | 0.5% |
1 | 32 | 0.4% |
e | 29 | 0.4% |
P | 27 | 0.3% |
Other values (54) | 513 | 6.3% |
Hangul
Value | Count | Frequency (%) |
업 | 2016 | 7.8% |
제 | 1037 | 4.0% |
및 | 978 | 3.8% |
기 | 932 | 3.6% |
조 | 878 | 3.4% |
품 | 424 | 1.6% |
타 | 413 | 1.6% |
비 | 407 | 1.6% |
용 | 375 | 1.5% |
물 | 361 | 1.4% |
Other values (517) | 17921 |
Compat Jamo
Value | Count | Frequency (%) |
ㆍ | 60 |
None
Value | Count | Frequency (%) |
· | 10 |
CJK Compat
Value | Count | Frequency (%) |
㎡ | 1 |
코드종류 | 코드종류명 | |
---|---|---|
코드종류 | 1.000 | 1.000 |
코드종류명 | 1.000 | 1.000 |
코드종류 | 코드종류명 | 세부코드 | 세부코드명 | |
---|---|---|---|---|
0 | MS10003 | 측정소_운영점검일지_점검_항목_코드 | B6 | 채수장비(펌프, 맨홀, 켄틸러버) 정상작동 확인 |
1 | RV10013 | 비용_구분_코드 | 99 | 기타(부대비용 등) |
2 | RV10016 | 사업전후_구분_코드 | 1 | 사업전 |
3 | RV10016 | 사업전후_구분_코드 | 2 | 공사중 |
4 | RV10016 | 사업전후_구분_코드 | 3 | 사업후 |
5 | CD10010 | 주소별_좌표_주소구분코드 | 1 | 주소별좌표_시도 |
6 | CD10010 | 주소별_좌표_주소구분코드 | 2 | 주소별좌표_시군구 |
7 | CD10010 | 주소별_좌표_주소구분코드 | 3 | 주소별좌표_읍면동 |
8 | MS10001 | 측정망 SMS발송코드 | 1 | 강우개시 |
9 | MS10001 | 측정망 SMS발송코드 | 2 | 자동채수시작 |
코드종류 | 코드종류명 | 세부코드 | 세부코드명 | |
---|---|---|---|---|
2923 | NC10010 | 유관기관_협력기관_회원구분 | GB04 | 유관기관 |
2924 | MS10004 | 장비_종류_코드 | EQ06 | 시료여과필터 |
2925 | MS10004 | 장비_종류_코드 | EQ08 | 강우설량계 |
2926 | MS10004 | 장비_종류_코드 | EQ09 | 데이터로거 |
2927 | MS10004 | 장비_종류_코드 | EQ22 | 저류수조 |
2928 | MS10004 | 장비_종류_코드 | EQ23 | 의자 |
2929 | MS10004 | 장비_종류_코드 | EQ14 | 채수펌프B |
2930 | MS10004 | 장비_종류_코드 | EQ12 | VPN |
2931 | MS10004 | 장비_종류_코드 | EQ13 | 채수펌프A |
2932 | MS10004 | 장비_종류_코드 | EQ15 | 조명 및 전열설비 |