Overview

Dataset statistics

Number of variables7
Number of observations228
Missing cells62
Missing cells (%)3.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory13.5 KiB
Average record size in memory60.6 B

Variable types

Numeric2
Text2
Categorical3

Dataset

Description지적재조사 바른땅 시스템 인터넷망(대국민, 토지소유자, 대행자, 책임수행기관), 행정망(공무원)의 메뉴 대분류, 중분류, 소분류 코드 및 이름 파일입니다.
URLhttps://www.data.go.kr/data/15118385/fileData.do

Alerts

비고 is highly overall correlated with 메뉴ID and 3 other fieldsHigh correlation
업무 구분 코드 is highly overall correlated with 링크코드 and 1 other fieldsHigh correlation
링크코드 is highly overall correlated with 메뉴ID and 3 other fieldsHigh correlation
메뉴ID is highly overall correlated with 상위메뉴ID and 2 other fieldsHigh correlation
상위메뉴ID is highly overall correlated with 메뉴ID and 2 other fieldsHigh correlation
비고 is highly imbalanced (71.1%)Imbalance
상위메뉴ID has 30 (13.2%) missing valuesMissing
메뉴URL has 31 (13.6%) missing valuesMissing
상위메뉴ID has 13 (5.7%) zerosZeros

Reproduction

Analysis started2023-12-12 00:12:59.076183
Analysis finished2023-12-12 00:13:00.088377
Duration1.01 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

메뉴ID
Real number (ℝ)

HIGH CORRELATION 

Distinct206
Distinct (%)90.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2937.1711
Minimum0
Maximum9999
Zeros1
Zeros (%)0.4%
Negative0
Negative (%)0.0%
Memory size2.1 KiB
2023-12-12T09:13:00.169863image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile91.35
Q11217.5
median2155
Q33302.25
95-th percentile9658
Maximum9999
Range9999
Interquartile range (IQR)2084.75

Descriptive statistics

Standard deviation2844.1806
Coefficient of variation (CV)0.96834012
Kurtosis0.88470747
Mean2937.1711
Median Absolute Deviation (MAD)1055
Skewness1.431793
Sum669675
Variance8089363.2
MonotonicityIncreasing
2023-12-12T09:13:00.312330image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1210 2
 
0.9%
9010 2
 
0.9%
8040 2
 
0.9%
8030 2
 
0.9%
8020 2
 
0.9%
8000 2
 
0.9%
3100 2
 
0.9%
3110 2
 
0.9%
3120 2
 
0.9%
3200 2
 
0.9%
Other values (196) 208
91.2%
ValueCountFrequency (%)
0 1
0.4%
10 1
0.4%
20 1
0.4%
30 1
0.4%
40 1
0.4%
50 1
0.4%
60 1
0.4%
70 1
0.4%
71 1
0.4%
80 1
0.4%
ValueCountFrequency (%)
9999 1
0.4%
9998 1
0.4%
9995 1
0.4%
9994 1
0.4%
9992 1
0.4%
9991 1
0.4%
9990 1
0.4%
9987 1
0.4%
9984 1
0.4%
9983 1
0.4%
Distinct157
Distinct (%)69.2%
Missing1
Missing (%)0.4%
Memory size1.9 KiB
2023-12-12T09:13:00.581501image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length68
Median length14
Mean length7.8722467
Min length2

Characters and Unicode

Total characters1787
Distinct characters207
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique112 ?
Unique (%)49.3%

Sample

1st row커뮤니티
2nd row공지사항
3rd row자료실
4th row사용자 매뉴얼
5th rowFAQ
ValueCountFrequency (%)
사업지구 14
 
3.8%
관리 11
 
3.0%
확인 9
 
2.5%
대행자 7
 
1.9%
권한관리 6
 
1.6%
변환사업 6
 
1.6%
선정 6
 
1.6%
모니터링 6
 
1.6%
등록 5
 
1.4%
현황 5
 
1.4%
Other values (180) 290
79.5%
2023-12-12T09:13:01.184214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
139
 
7.8%
74
 
4.1%
73
 
4.1%
61
 
3.4%
58
 
3.2%
54
 
3.0%
52
 
2.9%
46
 
2.6%
41
 
2.3%
39
 
2.2%
Other values (197) 1150
64.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1549
86.7%
Space Separator 139
 
7.8%
Lowercase Letter 40
 
2.2%
Uppercase Letter 34
 
1.9%
Other Punctuation 25
 
1.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
74
 
4.8%
73
 
4.7%
61
 
3.9%
58
 
3.7%
54
 
3.5%
52
 
3.4%
46
 
3.0%
41
 
2.6%
39
 
2.5%
36
 
2.3%
Other values (176) 1015
65.5%
Uppercase Letter
ValueCountFrequency (%)
E 8
23.5%
R 4
11.8%
W 4
11.8%
V 4
11.8%
I 4
11.8%
D 4
11.8%
S 2
 
5.9%
F 1
 
2.9%
A 1
 
2.9%
Q 1
 
2.9%
Lowercase Letter
ValueCountFrequency (%)
s 10
25.0%
p 10
25.0%
n 6
15.0%
b 6
15.0%
e 4
 
10.0%
m 4
 
10.0%
Other Punctuation
ValueCountFrequency (%)
; 10
40.0%
& 10
40.0%
/ 5
20.0%
Space Separator
ValueCountFrequency (%)
139
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1549
86.7%
Common 164
 
9.2%
Latin 74
 
4.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
74
 
4.8%
73
 
4.7%
61
 
3.9%
58
 
3.7%
54
 
3.5%
52
 
3.4%
46
 
3.0%
41
 
2.6%
39
 
2.5%
36
 
2.3%
Other values (176) 1015
65.5%
Latin
ValueCountFrequency (%)
s 10
13.5%
p 10
13.5%
E 8
10.8%
n 6
8.1%
b 6
8.1%
R 4
 
5.4%
W 4
 
5.4%
V 4
 
5.4%
I 4
 
5.4%
D 4
 
5.4%
Other values (7) 14
18.9%
Common
ValueCountFrequency (%)
139
84.8%
; 10
 
6.1%
& 10
 
6.1%
/ 5
 
3.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1549
86.7%
ASCII 238
 
13.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
139
58.4%
; 10
 
4.2%
& 10
 
4.2%
s 10
 
4.2%
p 10
 
4.2%
E 8
 
3.4%
n 6
 
2.5%
b 6
 
2.5%
/ 5
 
2.1%
R 4
 
1.7%
Other values (11) 30
 
12.6%
Hangul
ValueCountFrequency (%)
74
 
4.8%
73
 
4.7%
61
 
3.9%
58
 
3.7%
54
 
3.5%
52
 
3.4%
46
 
3.0%
41
 
2.6%
39
 
2.5%
36
 
2.3%
Other values (176) 1015
65.5%

상위메뉴ID
Real number (ℝ)

HIGH CORRELATION  MISSING  ZEROS 

Distinct24
Distinct (%)12.1%
Missing30
Missing (%)13.2%
Infinite0
Infinite (%)0.0%
Mean2875.2525
Minimum0
Maximum9000
Zeros13
Zeros (%)5.7%
Negative0
Negative (%)0.0%
Memory size2.1 KiB
2023-12-12T09:13:01.363057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11200
median2100
Q33300
95-th percentile9000
Maximum9000
Range9000
Interquartile range (IQR)2100

Descriptive statistics

Standard deviation2761.5827
Coefficient of variation (CV)0.96046613
Kurtosis0.56148525
Mean2875.2525
Median Absolute Deviation (MAD)1100
Skewness1.3344087
Sum569300
Variance7626338.8
MonotonicityIncreasing
2023-12-12T09:13:01.486509image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
9000 22
 
9.6%
1200 20
 
8.8%
3200 13
 
5.7%
0 13
 
5.7%
3300 12
 
5.3%
2100 11
 
4.8%
8000 10
 
4.4%
1300 10
 
4.4%
3100 9
 
3.9%
2300 9
 
3.9%
Other values (14) 69
30.3%
(Missing) 30
13.2%
ValueCountFrequency (%)
0 13
5.7%
100 8
 
3.5%
300 6
 
2.6%
500 4
 
1.8%
700 4
 
1.8%
900 2
 
0.9%
1100 8
 
3.5%
1200 20
8.8%
1300 10
4.4%
1400 7
 
3.1%
ValueCountFrequency (%)
9000 22
9.6%
8000 10
4.4%
3400 8
 
3.5%
3300 12
5.3%
3200 13
5.7%
3100 9
3.9%
3000 2
 
0.9%
2900 3
 
1.3%
2300 9
3.9%
2200 8
 
3.5%

메뉴URL
Text

MISSING 

Distinct125
Distinct (%)63.5%
Missing31
Missing (%)13.6%
Memory size1.9 KiB
2023-12-12T09:13:01.753168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length42
Median length33
Mean length26.015228
Min length7

Characters and Unicode

Total characters5125
Distinct characters59
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique77 ?
Unique (%)39.1%

Sample

1st rownotice/list.do?classCd=101
2nd rowfileReference/list.do?boardClsCd=101
3rd rowboard/manual.do
4th rowfaq/list.do?boardClsCd=103
5th rowjob/listJobPage.do
ValueCountFrequency (%)
demand/listdemandmain.do 6
 
3.0%
bzzonecmpt/initrspsbbzzoneprog.do 5
 
2.5%
bzzonereg/initrspsbauthreg.do 4
 
2.0%
bzzonemonitor/initbzzonemonitortab.do 3
 
1.5%
cttfenfrcplan/listtfenfrcplanpage.do 3
 
1.5%
bounddcsonagree/initbounddcsonagree.do 3
 
1.5%
bpcommon/listopiansw.do 3
 
1.5%
bpevaluation/listbzzoneevaluation.do 3
 
1.5%
stats/mainstatus.do 3
 
1.5%
stats/inityearstate.do 3
 
1.5%
Other values (113) 161
81.7%
2023-12-12T09:13:02.244065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
t 490
 
9.6%
o 428
 
8.4%
i 404
 
7.9%
n 397
 
7.7%
d 326
 
6.4%
e 288
 
5.6%
s 259
 
5.1%
a 235
 
4.6%
. 197
 
3.8%
/ 197
 
3.8%
Other values (49) 1904
37.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 4084
79.7%
Uppercase Letter 611
 
11.9%
Other Punctuation 401
 
7.8%
Decimal Number 21
 
0.4%
Math Symbol 7
 
0.1%
Control 1
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
t 490
12.0%
o 428
10.5%
i 404
9.9%
n 397
9.7%
d 326
 
8.0%
e 288
 
7.1%
s 259
 
6.3%
a 235
 
5.8%
l 165
 
4.0%
r 149
 
3.6%
Other values (16) 943
23.1%
Uppercase Letter
ValueCountFrequency (%)
P 83
13.6%
S 74
12.1%
M 70
11.5%
C 64
10.5%
B 49
8.0%
T 48
7.9%
A 48
7.9%
R 47
7.7%
D 31
 
5.1%
E 29
 
4.7%
Other values (13) 68
11.1%
Decimal Number
ValueCountFrequency (%)
0 10
47.6%
1 7
33.3%
3 2
 
9.5%
2 1
 
4.8%
5 1
 
4.8%
Other Punctuation
ValueCountFrequency (%)
. 197
49.1%
/ 197
49.1%
? 7
 
1.7%
Math Symbol
ValueCountFrequency (%)
= 7
100.0%
Control
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 4695
91.6%
Common 430
 
8.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
t 490
 
10.4%
o 428
 
9.1%
i 404
 
8.6%
n 397
 
8.5%
d 326
 
6.9%
e 288
 
6.1%
s 259
 
5.5%
a 235
 
5.0%
l 165
 
3.5%
r 149
 
3.2%
Other values (39) 1554
33.1%
Common
ValueCountFrequency (%)
. 197
45.8%
/ 197
45.8%
0 10
 
2.3%
1 7
 
1.6%
? 7
 
1.6%
= 7
 
1.6%
3 2
 
0.5%
2 1
 
0.2%
1
 
0.2%
5 1
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5125
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
t 490
 
9.6%
o 428
 
8.4%
i 404
 
7.9%
n 397
 
7.7%
d 326
 
6.4%
e 288
 
5.6%
s 259
 
5.1%
a 235
 
4.6%
. 197
 
3.8%
/ 197
 
3.8%
Other values (49) 1904
37.2%

링크코드
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
1
198 
<NA>
30 

Length

Max length4
Median length1
Mean length1.3947368
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 198
86.8%
<NA> 30
 
13.2%

Length

2023-12-12T09:13:02.423504image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:13:02.548365image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 198
86.8%
na 30
 
13.2%

업무 구분 코드
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
1
147 
2
81 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 147
64.5%
2 81
35.5%

Length

2023-12-12T09:13:02.695706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:13:02.799035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 147
64.5%
2 81
35.5%

비고
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct7
Distinct (%)3.1%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
<NA>
199 
책임수행기관
 
13
기존
 
6
대행자
 
6
본사
 
2
Other values (2)
 
2

Length

Max length6
Median length4
Mean length4.004386
Min length2

Unique

Unique2 ?
Unique (%)0.9%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 199
87.3%
책임수행기관 13
 
5.7%
기존 6
 
2.6%
대행자 6
 
2.6%
본사 2
 
0.9%
전담팀 1
 
0.4%
본부 1
 
0.4%

Length

2023-12-12T09:13:02.933759image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:13:03.046940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 199
87.3%
책임수행기관 13
 
5.7%
기존 6
 
2.6%
대행자 6
 
2.6%
본사 2
 
0.9%
전담팀 1
 
0.4%
본부 1
 
0.4%

Interactions

2023-12-12T09:12:59.573758image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:12:59.395568image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:12:59.649644image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:12:59.484010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T09:13:03.133444image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
메뉴ID상위메뉴ID업무 구분 코드비고
메뉴ID1.0001.0000.6320.765
상위메뉴ID1.0001.0000.6760.765
업무 구분 코드0.6320.6761.000NaN
비고0.7650.765NaN1.000
2023-12-12T09:13:03.229106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
비고업무 구분 코드링크코드
비고1.0001.0001.000
업무 구분 코드1.0001.0001.000
링크코드1.0001.0001.000
2023-12-12T09:13:03.310570image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
메뉴ID상위메뉴ID링크코드업무 구분 코드비고
메뉴ID1.0000.9981.0000.4570.523
상위메뉴ID0.9981.0001.0000.4900.523
링크코드1.0001.0001.0001.0001.000
업무 구분 코드0.4570.4901.0001.0001.000
비고0.5230.5231.0001.0001.000

Missing values

2023-12-12T09:12:59.776727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T09:12:59.885703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T09:12:59.989995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

메뉴ID메뉴명상위메뉴ID메뉴URL링크코드업무 구분 코드비고
00커뮤니티<NA><NA><NA>1<NA>
110공지사항0notice/list.do?classCd=10111<NA>
220자료실0fileReference/list.do?boardClsCd=10111<NA>
330사용자 매뉴얼0board/manual.do11<NA>
440FAQ0faq/list.do?boardClsCd=10311<NA>
550작업요청 게시판0job/listJobPage.do11<NA>
660기능/개선문의0board/list.do?boardClsCd=10211<NA>
770갑질부조리신고0report/initReport.do11<NA>
871갑질부조리신고 내역0report/listReportMain.do11<NA>
980DVIEWER 인증키 요청0dviewerKey/view.do11<NA>
메뉴ID메뉴명상위메뉴ID메뉴URL링크코드업무 구분 코드비고
2189983메시지 발송 대상 관리9000mmsSendCondition/msgSendList.do11<NA>
2199984월별통계 생성이력 관리9000monthStats/init.do11<NA>
2209987알림톡 현황9000alrimTalkApi/AlrimTalkApiStats.do11<NA>
2219990화면디자인9000etc/css.do11<NA>
2229991접속 이력 관리9000/logHistory/listLogHistoryMg.do11<NA>
2239992인증키 관리9000dviewerKey/list.do11<NA>
2249994DVIEWER 클라이언트 목록9000dviewerClientVsn/list.do11<NA>
2259995타지역발령 이력 관리9000auth/listUsrAreaHist.do11<NA>
2269998주민전산자료 연계 현황 통계9000landOwnConn/initLandOwnConn.do11<NA>
2279999연계기관 및 시스템관리9000connOfcr/init.do11<NA>