Overview

Dataset statistics

Number of variables6
Number of observations1241
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory60.7 KiB
Average record size in memory50.1 B

Variable types

Numeric2
Text1
DateTime1
Categorical1
Boolean1

Dataset

Description환경경영정보포털에서 제공하는 환경경영 등 환경분야 관련 컨설팅 정보(전문 컨설팅 주제, 조회수, 등록일자, 분류명, 삭제여부 등)
Author환경부
URLhttps://www.data.go.kr/data/15039231/fileData.do

Alerts

삭제여부 has constant value ""Constant
연번 is highly overall correlated with 분류High correlation
분류 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique
조회수 has 51 (4.1%) zerosZeros

Reproduction

Analysis started2023-12-12 20:30:22.893393
Analysis finished2023-12-12 20:30:23.827591
Duration0.93 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct1241
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2895.2973
Minimum2271
Maximum3578
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size11.0 KiB
2023-12-13T05:30:23.899695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2271
5-th percentile2333
Q12581
median2891
Q33201
95-th percentile3487
Maximum3578
Range1307
Interquartile range (IQR)620

Descriptive statistics

Standard deviation365.44878
Coefficient of variation (CV)0.1262215
Kurtosis-1.1303784
Mean2895.2973
Median Absolute Deviation (MAD)310
Skewness0.059313282
Sum3593064
Variance133552.81
MonotonicityNot monotonic
2023-12-13T05:30:24.037720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2312 1
 
0.1%
3138 1
 
0.1%
3145 1
 
0.1%
3144 1
 
0.1%
3143 1
 
0.1%
3142 1
 
0.1%
3141 1
 
0.1%
3140 1
 
0.1%
3139 1
 
0.1%
3137 1
 
0.1%
Other values (1231) 1231
99.2%
ValueCountFrequency (%)
2271 1
0.1%
2272 1
0.1%
2273 1
0.1%
2274 1
0.1%
2275 1
0.1%
2276 1
0.1%
2277 1
0.1%
2278 1
0.1%
2279 1
0.1%
2280 1
0.1%
ValueCountFrequency (%)
3578 1
0.1%
3577 1
0.1%
3576 1
0.1%
3575 1
0.1%
3574 1
0.1%
3573 1
0.1%
3572 1
0.1%
3571 1
0.1%
3570 1
0.1%
3569 1
0.1%

제목
Text

Distinct1227
Distinct (%)98.9%
Missing0
Missing (%)0.0%
Memory size9.8 KiB
2023-12-13T05:30:24.304885image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length79
Median length55
Mean length24.377115
Min length7

Characters and Unicode

Total characters30252
Distinct characters573
Distinct categories15 ?
Distinct scripts4 ?
Distinct blocks8 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1213 ?
Unique (%)97.7%

Sample

1st row폐기물의 에너지화에 대한 환경컨설팅(1)
2nd row폐기물의 에너지화에 대한 환경컨설팅(2)
3rd row환경피해 분쟁조정 예방에 대한 컨설팅 기법(1)
4th row환경피해 분쟁조정 예방에 대한 컨설팅 기법(2)
5th row친환경상품과 환경마크인증 획득에 필요한 환경컨설팅 기법
ValueCountFrequency (%)
311
 
5.0%
사례 128
 
2.1%
컨설팅 80
 
1.3%
위한 79
 
1.3%
환경컨설팅 65
 
1.1%
매뉴얼 47
 
0.8%
관리 42
 
0.7%
관한 39
 
0.6%
개발 39
 
0.6%
온실가스 35
 
0.6%
Other values (2887) 5317
86.0%
2023-12-13T05:30:24.761429image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4942
 
16.3%
536
 
1.8%
501
 
1.7%
494
 
1.6%
489
 
1.6%
473
 
1.6%
445
 
1.5%
) 405
 
1.3%
( 405
 
1.3%
396
 
1.3%
Other values (563) 21166
70.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 21644
71.5%
Space Separator 4942
 
16.3%
Decimal Number 1186
 
3.9%
Lowercase Letter 627
 
2.1%
Uppercase Letter 539
 
1.8%
Close Punctuation 421
 
1.4%
Open Punctuation 421
 
1.4%
Other Punctuation 376
 
1.2%
Dash Punctuation 64
 
0.2%
Math Symbol 14
 
< 0.1%
Other values (5) 18
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
536
 
2.5%
501
 
2.3%
494
 
2.3%
489
 
2.3%
473
 
2.2%
445
 
2.1%
396
 
1.8%
390
 
1.8%
359
 
1.7%
358
 
1.7%
Other values (474) 17203
79.5%
Lowercase Letter
ValueCountFrequency (%)
e 72
11.5%
a 69
11.0%
t 57
9.1%
o 51
8.1%
r 50
8.0%
i 49
 
7.8%
n 45
 
7.2%
m 40
 
6.4%
p 36
 
5.7%
s 34
 
5.4%
Other values (14) 124
19.8%
Uppercase Letter
ValueCountFrequency (%)
E 75
13.9%
C 57
10.6%
S 51
9.5%
M 44
 
8.2%
A 44
 
8.2%
I 35
 
6.5%
P 32
 
5.9%
B 29
 
5.4%
D 24
 
4.5%
R 22
 
4.1%
Other values (12) 126
23.4%
Other Punctuation
ValueCountFrequency (%)
. 151
40.2%
, 51
 
13.6%
· 43
 
11.4%
/ 30
 
8.0%
& 25
 
6.6%
; 25
 
6.6%
# 20
 
5.3%
! 15
 
4.0%
' 12
 
3.2%
: 2
 
0.5%
Decimal Number
ValueCountFrequency (%)
2 343
28.9%
1 281
23.7%
0 266
22.4%
8 63
 
5.3%
3 51
 
4.3%
6 47
 
4.0%
9 44
 
3.7%
4 38
 
3.2%
5 28
 
2.4%
7 25
 
2.1%
Close Punctuation
ValueCountFrequency (%)
) 405
96.2%
7
 
1.7%
] 5
 
1.2%
2
 
0.5%
2
 
0.5%
Open Punctuation
ValueCountFrequency (%)
( 405
96.2%
7
 
1.7%
[ 5
 
1.2%
2
 
0.5%
2
 
0.5%
Letter Number
ValueCountFrequency (%)
3
50.0%
1
 
16.7%
1
 
16.7%
1
 
16.7%
Math Symbol
ValueCountFrequency (%)
~ 13
92.9%
1
 
7.1%
Space Separator
ValueCountFrequency (%)
4942
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 64
100.0%
Control
ValueCountFrequency (%)
6
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 4
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 21641
71.5%
Common 7436
 
24.6%
Latin 1172
 
3.9%
Han 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
536
 
2.5%
501
 
2.3%
494
 
2.3%
489
 
2.3%
473
 
2.2%
445
 
2.1%
396
 
1.8%
390
 
1.8%
359
 
1.7%
358
 
1.7%
Other values (471) 17200
79.5%
Latin
ValueCountFrequency (%)
E 75
 
6.4%
e 72
 
6.1%
a 69
 
5.9%
C 57
 
4.9%
t 57
 
4.9%
S 51
 
4.4%
o 51
 
4.4%
r 50
 
4.3%
i 49
 
4.2%
n 45
 
3.8%
Other values (40) 596
50.9%
Common
ValueCountFrequency (%)
4942
66.5%
) 405
 
5.4%
( 405
 
5.4%
2 343
 
4.6%
1 281
 
3.8%
0 266
 
3.6%
. 151
 
2.0%
- 64
 
0.9%
8 63
 
0.8%
3 51
 
0.7%
Other values (29) 465
 
6.3%
Han
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 21638
71.5%
ASCII 8534
 
28.2%
None 65
 
0.2%
Number Forms 6
 
< 0.1%
Compat Jamo 3
 
< 0.1%
CJK 3
 
< 0.1%
Punctuation 2
 
< 0.1%
Arrows 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4942
57.9%
) 405
 
4.7%
( 405
 
4.7%
2 343
 
4.0%
1 281
 
3.3%
0 266
 
3.1%
. 151
 
1.8%
E 75
 
0.9%
e 72
 
0.8%
a 69
 
0.8%
Other values (65) 1525
 
17.9%
Hangul
ValueCountFrequency (%)
536
 
2.5%
501
 
2.3%
494
 
2.3%
489
 
2.3%
473
 
2.2%
445
 
2.1%
396
 
1.8%
390
 
1.8%
359
 
1.7%
358
 
1.7%
Other values (470) 17197
79.5%
None
ValueCountFrequency (%)
· 43
66.2%
7
 
10.8%
7
 
10.8%
2
 
3.1%
2
 
3.1%
2
 
3.1%
2
 
3.1%
Number Forms
ValueCountFrequency (%)
3
50.0%
1
 
16.7%
1
 
16.7%
1
 
16.7%
Compat Jamo
ValueCountFrequency (%)
3
100.0%
Arrows
ValueCountFrequency (%)
1
100.0%
Punctuation
ValueCountFrequency (%)
1
50.0%
1
50.0%
CJK
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

조회수
Real number (ℝ)

ZEROS 

Distinct433
Distinct (%)34.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean197.40371
Minimum0
Maximum6931
Zeros51
Zeros (%)4.1%
Negative0
Negative (%)0.0%
Memory size11.0 KiB
2023-12-13T05:30:24.921172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q111
median97
Q3355
95-th percentile633
Maximum6931
Range6931
Interquartile range (IQR)344

Descriptive statistics

Standard deviation297.84015
Coefficient of variation (CV)1.508787
Kurtosis214.59746
Mean197.40371
Median Absolute Deviation (MAD)93
Skewness10.275189
Sum244978
Variance88708.757
MonotonicityNot monotonic
2023-12-13T05:30:25.081541image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 51
 
4.1%
6 38
 
3.1%
5 32
 
2.6%
2 32
 
2.6%
7 29
 
2.3%
4 29
 
2.3%
3 23
 
1.9%
1 21
 
1.7%
12 18
 
1.5%
8 17
 
1.4%
Other values (423) 951
76.6%
ValueCountFrequency (%)
0 51
4.1%
1 21
1.7%
2 32
2.6%
3 23
1.9%
4 29
2.3%
5 32
2.6%
6 38
3.1%
7 29
2.3%
8 17
 
1.4%
9 13
 
1.0%
ValueCountFrequency (%)
6931 1
0.1%
2829 1
0.1%
1143 1
0.1%
1085 1
0.1%
1083 1
0.1%
1001 1
0.1%
996 1
0.1%
853 1
0.1%
829 1
0.1%
823 1
0.1%
Distinct450
Distinct (%)36.3%
Missing0
Missing (%)0.0%
Memory size9.8 KiB
Minimum2009-01-30 00:00:00
Maximum2020-10-15 00:00:00
2023-12-13T05:30:25.256622image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:30:25.427643image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

분류
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size9.8 KiB
메뉴얼
563 
이론및리포트
447 
우수사례
231 

Length

Max length6
Median length4
Mean length4.2667204
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row이론및리포트
2nd row이론및리포트
3rd row이론및리포트
4th row이론및리포트
5th row이론및리포트

Common Values

ValueCountFrequency (%)
메뉴얼 563
45.4%
이론및리포트 447
36.0%
우수사례 231
18.6%

Length

2023-12-13T05:30:25.602836image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:30:25.705153image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
메뉴얼 563
45.4%
이론및리포트 447
36.0%
우수사례 231
18.6%

삭제여부
Boolean

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
False
1241 
ValueCountFrequency (%)
False 1241
100.0%
2023-12-13T05:30:25.798618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Interactions

2023-12-13T05:30:23.449905image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:30:23.252968image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:30:23.561756image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:30:23.361341image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T05:30:26.220617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번조회수분류
연번1.0000.2270.910
조회수0.2271.0000.171
분류0.9100.1711.000
2023-12-13T05:30:26.369897image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번조회수분류
연번1.0000.1730.868
조회수0.1731.0000.162
분류0.8680.1621.000

Missing values

2023-12-13T05:30:23.692366image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T05:30:23.789248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번제목조회수등록일분류삭제여부
02312폐기물의 에너지화에 대한 환경컨설팅(1)202010-03-24이론및리포트N
12313폐기물의 에너지화에 대한 환경컨설팅(2)142010-03-24이론및리포트N
22314환경피해 분쟁조정 예방에 대한 컨설팅 기법(1)92010-03-29이론및리포트N
32315환경피해 분쟁조정 예방에 대한 컨설팅 기법(2)72010-03-29이론및리포트N
42316친환경상품과 환경마크인증 획득에 필요한 환경컨설팅 기법92010-03-24이론및리포트N
52317환경정화용 유전자 변형 생물체(LMO)의 안전 관리 방안 연구62010-03-24이론및리포트N
62318수질오염방지 엔지니어링 컨설팅 방법202010-03-24이론및리포트N
72319기후변화 및 대응전략에 대한 환경컨설팅152010-03-24이론및리포트N
82320환경오염 생체지표 개발 및 독성연구 표준화22010-03-24이론및리포트N
92321실내환경질오염 조사 및 평가 방법 컨설팅102010-03-24이론및리포트N
연번제목조회수등록일분류삭제여부
12313561폐기물관리법 개정 시행 홍보 팜플렛(배출자, 처리자)_'20.5.27 시행02020-10-05메뉴얼N
12323570에너지 자립마을 만들기 교육자료(2012)02020-10-12메뉴얼N
12333571음식물류폐기물 감량기 설치·운영 가이드라인(20.03.)02020-10-12메뉴얼N
12343572음식물쓰레기자원화 홍보책자(2017.04)12020-10-12메뉴얼N
12353573공공부문 온실가스에너지 목표관리제 9월 온라인 교육자료(2020.09.25.)02020-10-12메뉴얼N
12363574녹색·기후기술 백서 201912020-10-12메뉴얼N
12373575상수도 공급망 연계체계 구축 기본계획 수립을 위한 조사보고서(2015)02020-10-12메뉴얼N
12383576폐수배출시설 인허가 가이드북(1권~4권)(19.10.24. 작성 기준)82020-10-12메뉴얼N
12393577유기물 측정지표 전환(COD→TOC) 설명자료 (19.12.17. 작성 기준)12020-10-12메뉴얼N
12403578음식물류폐기물 바이오가스화시설 기술지침(2015.12 개정)02020-10-15메뉴얼N