Overview

Dataset statistics

Number of variables6
Number of observations608
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory29.8 KiB
Average record size in memory50.2 B

Variable types

Numeric1
DateTime2
Categorical3

Dataset

Description지적재조사 사업시 새로 생성한 기준점에 관한 측량 성과 검사 일자 및 등록일에 관한 데이터 파일입니다.사업지구 정보와 연관지어 사용하시면 됩니다.
Author국토교통부
URLhttps://www.data.go.kr/data/15123032/fileData.do

Alerts

검사결과코드 is highly overall correlated with 검사결과코드명High correlation
검사결과코드명 is highly overall correlated with 검사결과코드High correlation
검사결과코드 is highly imbalanced (96.8%)Imbalance
검사결과코드명 is highly imbalanced (96.8%)Imbalance
검사일련번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 07:55:55.004055
Analysis finished2023-12-12 07:55:55.598332
Duration0.59 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

검사일련번호
Real number (ℝ)

UNIQUE 

Distinct608
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean14981.082
Minimum13410
Maximum16370
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.5 KiB
2023-12-12T16:55:55.699652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum13410
5-th percentile13609.35
Q114650.5
median15031
Q315475.25
95-th percentile16274.9
Maximum16370
Range2960
Interquartile range (IQR)824.75

Descriptive statistics

Standard deviation756.72703
Coefficient of variation (CV)0.050512174
Kurtosis-0.54423501
Mean14981.082
Median Absolute Deviation (MAD)431.5
Skewness-0.29317271
Sum9108498
Variance572635.81
MonotonicityNot monotonic
2023-12-12T16:55:55.861162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
13414 1
 
0.2%
15225 1
 
0.2%
16078 1
 
0.2%
16108 1
 
0.2%
15079 1
 
0.2%
15094 1
 
0.2%
15104 1
 
0.2%
15105 1
 
0.2%
15222 1
 
0.2%
15228 1
 
0.2%
Other values (598) 598
98.4%
ValueCountFrequency (%)
13410 1
0.2%
13412 1
0.2%
13413 1
0.2%
13414 1
0.2%
13416 1
0.2%
13420 1
0.2%
13421 1
0.2%
13441 1
0.2%
13446 1
0.2%
13447 1
0.2%
ValueCountFrequency (%)
16370 1
0.2%
16369 1
0.2%
16368 1
0.2%
16358 1
0.2%
16357 1
0.2%
16356 1
0.2%
16355 1
0.2%
16354 1
0.2%
16324 1
0.2%
16322 1
0.2%
Distinct134
Distinct (%)22.0%
Missing0
Missing (%)0.0%
Memory size4.9 KiB
Minimum2018-11-14 00:00:00
Maximum2023-09-06 00:00:00
2023-12-12T16:55:56.384195image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:55:56.552384image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

검사결과코드
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size4.9 KiB
1
606 
4
 
2

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 606
99.7%
4 2
 
0.3%

Length

2023-12-12T16:55:56.703666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:55:56.793737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 606
99.7%
4 2
 
0.3%

검사결과코드명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size4.9 KiB
검사완료
606 
보완
 
2

Length

Max length4
Median length4
Mean length3.9934211
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row검사완료
2nd row검사완료
3rd row검사완료
4th row검사완료
5th row검사완료

Common Values

ValueCountFrequency (%)
검사완료 606
99.7%
보완 2
 
0.3%

Length

2023-12-12T16:55:56.915891image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:55:57.046894image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
검사완료 606
99.7%
보완 2
 
0.3%

검사담당자
Categorical

Distinct30
Distinct (%)4.9%
Missing0
Missing (%)0.0%
Memory size4.9 KiB
김**
161 
조**
110 
최**
40 
박**
36 
이**
34 
Other values (25)
227 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique5 ?
Unique (%)0.8%

Sample

1st row장**
2nd row장**
3rd row조**
4th row조**
5th row장**

Common Values

ValueCountFrequency (%)
김** 161
26.5%
조** 110
18.1%
최** 40
 
6.6%
박** 36
 
5.9%
이** 34
 
5.6%
송** 31
 
5.1%
손** 26
 
4.3%
강** 26
 
4.3%
신** 25
 
4.1%
정** 18
 
3.0%
Other values (20) 101
16.6%

Length

2023-12-12T16:55:57.142944image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
161
26.5%
110
18.1%
40
 
6.6%
36
 
5.9%
34
 
5.6%
31
 
5.1%
26
 
4.3%
26
 
4.3%
25
 
4.1%
18
 
3.0%
Other values (20) 101
16.6%
Distinct113
Distinct (%)18.6%
Missing0
Missing (%)0.0%
Memory size4.9 KiB
Minimum2022-10-17 00:00:00
Maximum2023-09-15 00:00:00
2023-12-12T16:55:57.265317image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:55:57.401374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2023-12-12T16:55:55.209837image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T16:55:57.485446image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
검사일련번호검사결과코드검사결과코드명검사담당자
검사일련번호1.0000.0000.0000.836
검사결과코드0.0001.0000.9230.089
검사결과코드명0.0000.9231.0000.089
검사담당자0.8360.0890.0891.000
2023-12-12T16:55:57.565885image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
검사담당자검사결과코드검사결과코드명
검사담당자1.0000.0680.068
검사결과코드0.0681.0000.749
검사결과코드명0.0680.7491.000
2023-12-12T16:55:57.642955image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
검사일련번호검사결과코드검사결과코드명검사담당자
검사일련번호1.0000.0000.0000.430
검사결과코드0.0001.0000.7490.068
검사결과코드명0.0000.7491.0000.068
검사담당자0.4300.0680.0681.000

Missing values

2023-12-12T16:55:55.347186image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T16:55:55.545410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

검사일련번호검사일자검사결과코드검사결과코드명검사담당자등록일자
0134142022-05-111검사완료장**2022-10-17
1134162022-06-291검사완료장**2022-10-17
2134462022-06-071검사완료조**2022-10-21
3134482022-06-071검사완료조**2022-10-21
4134102022-07-251검사완료장**2022-10-17
5134122022-07-251검사완료장**2022-10-17
6134202022-06-291검사완료장**2022-10-17
7134212022-06-291검사완료장**2022-10-17
8134412022-02-281검사완료옥**2022-10-20
9134132022-06-291검사완료장**2022-10-17
검사일련번호검사일자검사결과코드검사결과코드명검사담당자등록일자
598163222023-04-051검사완료김**2023-09-06
599163242022-03-181검사완료강**2023-09-08
600163542023-03-061검사완료김**2023-09-14
601163552023-03-061검사완료김**2023-09-14
602163702023-02-201검사완료김**2023-09-15
603163562023-03-061검사완료김**2023-09-14
604163572023-03-061검사완료김**2023-09-14
605163582023-03-061검사완료김**2023-09-14
606163682023-02-201검사완료김**2023-09-15
607163692023-02-201검사완료김**2023-09-15