Overview

Dataset statistics

Number of variables14
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory11.3 KiB
Average record size in memory115.3 B

Variable types

Categorical2
Numeric2
Boolean8
DateTime2

Alerts

생성일 has constant value ""Constant
수정일 has constant value ""Constant
사용여부 has constant value ""Constant
조사시작일 is highly overall correlated with 조사종료일 and 1 other fieldsHigh correlation
조사종료일 is highly overall correlated with 조사시작일 and 1 other fieldsHigh correlation
사업코드 is highly overall correlated with 조사시작일 and 2 other fieldsHigh correlation
포유류조사유무 is highly overall correlated with 조류조사유무 and 2 other fieldsHigh correlation
조류조사유무 is highly overall correlated with 사업코드 and 3 other fieldsHigh correlation
양서파충류조사유무 is highly overall correlated with 포유류조사유무 and 2 other fieldsHigh correlation
곤충류조사유무 is highly overall correlated with 포유류조사유무 and 2 other fieldsHigh correlation
어류조사유무 is highly overall correlated with 저서생물(동물)유무High correlation
저서생물(동물)유무 is highly overall correlated with 어류조사유무High correlation

Reproduction

Analysis started2023-12-10 10:39:29.102828
Analysis finished2023-12-10 10:39:31.821051
Duration2.72 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

사업코드
Categorical

HIGH CORRELATION 

Distinct21
Distinct (%)21.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
DG2006Q002
10 
DG2007B002
10 
DG2005Q001
DG2007E005
DG2007C001
Other values (16)
56 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique4 ?
Unique (%)4.0%

Sample

1st rowDG2005E013
2nd rowDG2005E013
3rd rowDG2005E013
4th rowDG2005Q001
5th rowDG2005Q001

Common Values

ValueCountFrequency (%)
DG2006Q002 10
 
10.0%
DG2007B002 10
 
10.0%
DG2005Q001 9
 
9.0%
DG2007E005 8
 
8.0%
DG2007C001 7
 
7.0%
DG2006N005 6
 
6.0%
DG2006C001 5
 
5.0%
DG2007B001 5
 
5.0%
DG2006A001 5
 
5.0%
DG2007B004 4
 
4.0%
Other values (11) 31
31.0%

Length

2023-12-10T19:39:31.948819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
dg2006q002 10
 
10.0%
dg2007b002 10
 
10.0%
dg2005q001 9
 
9.0%
dg2007e005 8
 
8.0%
dg2007c001 7
 
7.0%
dg2006n005 6
 
6.0%
dg2006c001 5
 
5.0%
dg2007b001 5
 
5.0%
dg2006a001 5
 
5.0%
dg2006e009 4
 
4.0%
Other values (11) 31
31.0%

조사차수
Categorical

Distinct32
Distinct (%)32.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
1차
17 
3차
16 
2차
16 
4차
14 
5차
Other values (27)
32 

Length

Max length17
Median length2
Mean length4.32
Min length2

Unique

Unique25 ?
Unique (%)25.0%

Sample

1st row2차
2nd row1차
3rd row3차
4th row5차
5th row4차

Common Values

ValueCountFrequency (%)
1차 17
17.0%
3차 16
16.0%
2차 16
16.0%
4차 14
14.0%
5차 5
 
5.0%
0차 5
 
5.0%
2차_어류저서생물 2
 
2.0%
2차_2 1
 
1.0%
2차_식물상및식생 1
 
1.0%
1차_식물상및식생 1
 
1.0%
Other values (22) 22
22.0%

Length

2023-12-10T19:39:32.255685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
1차 17
17.0%
2차 16
16.0%
3차 16
16.0%
4차 14
14.0%
5차 5
 
5.0%
0차 5
 
5.0%
2차_어류저서생물 2
 
2.0%
6차 1
 
1.0%
2017_1분기(사후공사시 1
 
1.0%
2016_3분기(사후공사시 1
 
1.0%
Other values (22) 22
22.0%

조사시작일
Real number (ℝ)

HIGH CORRELATION 

Distinct88
Distinct (%)88.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20070051
Minimum20030601
Maximum20170321
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T19:39:32.518249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20030601
5-th percentile20040416
Q120050821
median20060662
Q320070406
95-th percentile20161214
Maximum20170321
Range139720
Interquartile range (IQR)19584.5

Descriptive statistics

Standard deviation35721.681
Coefficient of variation (CV)0.0017798501
Kurtosis2.6503526
Mean20070051
Median Absolute Deviation (MAD)9750.5
Skewness1.9162234
Sum2.0070051 × 109
Variance1.2760385 × 109
MonotonicityNot monotonic
2023-12-10T19:39:32.767900image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20060615 3
 
3.0%
20070407 2
 
2.0%
20060223 2
 
2.0%
20070516 2
 
2.0%
20061009 2
 
2.0%
20060417 2
 
2.0%
20060823 2
 
2.0%
20170320 2
 
2.0%
20060720 2
 
2.0%
20060217 2
 
2.0%
Other values (78) 79
79.0%
ValueCountFrequency (%)
20030601 1
1.0%
20030920 1
1.0%
20031220 1
1.0%
20040226 1
1.0%
20040320 1
1.0%
20040421 1
1.0%
20040524 1
1.0%
20040628 1
1.0%
20040701 1
1.0%
20040723 1
1.0%
ValueCountFrequency (%)
20170321 1
1.0%
20170320 2
2.0%
20161220 1
1.0%
20161219 1
1.0%
20161214 1
1.0%
20160818 1
1.0%
20160625 1
1.0%
20160512 1
1.0%
20160511 1
1.0%
20160321 1
1.0%

조사종료일
Real number (ℝ)

HIGH CORRELATION 

Distinct91
Distinct (%)91.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20070053
Minimum20030615
Maximum20170321
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T19:39:33.005494image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20030615
5-th percentile20040418
Q120050822
median20060664
Q320070408
95-th percentile20161215
Maximum20170321
Range139706
Interquartile range (IQR)19585.75

Descriptive statistics

Standard deviation35720.683
Coefficient of variation (CV)0.0017798001
Kurtosis2.6504384
Mean20070053
Median Absolute Deviation (MAD)9751.5
Skewness1.9162618
Sum2.0070053 × 109
Variance1.2759672 × 109
MonotonicityNot monotonic
2023-12-10T19:39:33.255936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20060617 3
 
3.0%
20060218 2
 
2.0%
20060721 2
 
2.0%
20070413 2
 
2.0%
20060826 2
 
2.0%
20050531 2
 
2.0%
20170321 2
 
2.0%
20060224 2
 
2.0%
20070529 1
 
1.0%
20160625 1
 
1.0%
Other values (81) 81
81.0%
ValueCountFrequency (%)
20030615 1
1.0%
20030930 1
1.0%
20031225 1
1.0%
20040229 1
1.0%
20040330 1
1.0%
20040423 1
1.0%
20040527 1
1.0%
20040629 1
1.0%
20040702 1
1.0%
20040724 1
1.0%
ValueCountFrequency (%)
20170321 2
2.0%
20170320 1
1.0%
20161220 1
1.0%
20161219 1
1.0%
20161215 1
1.0%
20160819 1
1.0%
20160625 1
1.0%
20160512 1
1.0%
20160511 1
1.0%
20160322 1
1.0%
Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size232.0 B
True
76 
False
24 
ValueCountFrequency (%)
True 76
76.0%
False 24
 
24.0%
2023-12-10T19:39:33.464432image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

포유류조사유무
Boolean

HIGH CORRELATION 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size232.0 B
True
70 
False
30 
ValueCountFrequency (%)
True 70
70.0%
False 30
30.0%
2023-12-10T19:39:33.618579image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

조류조사유무
Boolean

HIGH CORRELATION 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size232.0 B
True
76 
False
24 
ValueCountFrequency (%)
True 76
76.0%
False 24
 
24.0%
2023-12-10T19:39:33.761394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

양서파충류조사유무
Boolean

HIGH CORRELATION 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size232.0 B
True
60 
False
40 
ValueCountFrequency (%)
True 60
60.0%
False 40
40.0%
2023-12-10T19:39:33.992950image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

곤충류조사유무
Boolean

HIGH CORRELATION 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size232.0 B
True
53 
False
47 
ValueCountFrequency (%)
True 53
53.0%
False 47
47.0%
2023-12-10T19:39:34.144699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

어류조사유무
Boolean

HIGH CORRELATION 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size232.0 B
False
53 
True
47 
ValueCountFrequency (%)
False 53
53.0%
True 47
47.0%
2023-12-10T19:39:34.301064image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

저서생물(동물)유무
Boolean

HIGH CORRELATION 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size232.0 B
False
55 
True
45 
ValueCountFrequency (%)
False 55
55.0%
True 45
45.0%
2023-12-10T19:39:34.460094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

생성일
Date

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
Minimum2019-01-07 00:00:00
Maximum2019-01-07 00:00:00
2023-12-10T19:39:34.702045image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:39:34.927871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

수정일
Date

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
Minimum2019-12-03 00:00:00
Maximum2019-12-03 00:00:00
2023-12-10T19:39:35.127428image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:39:35.313729image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

사용여부
Boolean

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size232.0 B
True
100 
ValueCountFrequency (%)
True 100
100.0%
2023-12-10T19:39:35.477217image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Interactions

2023-12-10T19:39:30.796312image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:39:30.434445image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:39:30.957445image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:39:30.621636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T19:39:35.610431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업코드조사차수조사시작일조사종료일식물상조사유무포유류조사유무조류조사유무양서파충류조사유무곤충류조사유무어류조사유무저서생물(동물)유무
사업코드1.0000.0000.9080.9080.4240.6240.6740.5750.4940.4170.535
조사차수0.0001.0000.0000.0000.0000.4580.5410.1950.2250.0530.258
조사시작일0.9080.0001.0001.0000.1790.1810.1830.2660.3090.0000.000
조사종료일0.9080.0001.0001.0000.1790.1810.1830.2660.3090.0000.000
식물상조사유무0.4240.0000.1790.1791.0000.4640.0000.4760.5550.2290.187
포유류조사유무0.6240.4580.1810.1810.4641.0000.9100.9130.8680.3460.303
조류조사유무0.6740.5410.1830.1830.0000.9101.0000.8190.7780.1320.065
양서파충류조사유무0.5750.1950.2660.2660.4760.9130.8191.0000.9530.3660.380
곤충류조사유무0.4940.2250.3090.3090.5550.8680.7780.9531.0000.4390.444
어류조사유무0.4170.0530.0000.0000.2290.3460.1320.3660.4391.0000.959
저서생물(동물)유무0.5350.2580.0000.0000.1870.3030.0650.3800.4440.9591.000
2023-12-10T19:39:35.875062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
조사차수식물상조사유무사업코드포유류조사유무조류조사유무저서생물(동물)유무양서파충류조사유무어류조사유무곤충류조사유무
조사차수1.0000.0000.0000.2990.3570.1620.1170.0000.138
식물상조사유무0.0001.0000.3320.3070.0000.1190.3160.1470.374
사업코드0.0000.3321.0000.4960.5390.4230.4550.3260.389
포유류조사유무0.2990.3070.4961.0000.7270.1960.7320.2240.669
조류조사유무0.3570.0000.5390.7271.0000.0400.6110.0840.567
저서생물(동물)유무0.1620.1190.4230.1960.0401.0000.2480.8180.293
양서파충류조사유무0.1170.3160.4550.7320.6110.2481.0000.2380.803
어류조사유무0.0000.1470.3260.2240.0840.8180.2381.0000.289
곤충류조사유무0.1380.3740.3890.6690.5670.2930.8030.2891.000
2023-12-10T19:39:36.113475image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
조사시작일조사종료일사업코드조사차수식물상조사유무포유류조사유무조류조사유무양서파충류조사유무곤충류조사유무어류조사유무저서생물(동물)유무
조사시작일1.0001.0000.5570.0000.2060.1950.1880.2770.3240.0000.000
조사종료일1.0001.0000.5570.0000.2060.1950.1880.2770.3240.0000.000
사업코드0.5570.5571.0000.0000.3320.4960.5390.4550.3890.3260.423
조사차수0.0000.0000.0001.0000.0000.2990.3570.1170.1380.0000.162
식물상조사유무0.2060.2060.3320.0001.0000.3070.0000.3160.3740.1470.119
포유류조사유무0.1950.1950.4960.2990.3071.0000.7270.7320.6690.2240.196
조류조사유무0.1880.1880.5390.3570.0000.7271.0000.6110.5670.0840.040
양서파충류조사유무0.2770.2770.4550.1170.3160.7320.6111.0000.8030.2380.248
곤충류조사유무0.3240.3240.3890.1380.3740.6690.5670.8031.0000.2890.293
어류조사유무0.0000.0000.3260.0000.1470.2240.0840.2380.2891.0000.818
저서생물(동물)유무0.0000.0000.4230.1620.1190.1960.0400.2480.2930.8181.000

Missing values

2023-12-10T19:39:31.201945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T19:39:31.661104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

사업코드조사차수조사시작일조사종료일식물상조사유무포유류조사유무조류조사유무양서파충류조사유무곤충류조사유무어류조사유무저서생물(동물)유무생성일수정일사용여부
0DG2005E0132차2005072620050728YNNNNNN2019-01-072019-12-03Y
1DG2005E0131차2005041320050415YNNNNNN2019-01-072019-12-03Y
2DG2005E0133차2008050720080507YNNNNNN2019-01-072019-12-03Y
3DG2005Q0015차2004110620041110NNNNNYY2019-01-072019-12-03Y
4DG2005Q0014차2004032020040330NNNNNYY2019-01-072019-12-03Y
5DG2005Q0012차_어류저서생물2003092020030930NNNNNYY2019-01-072019-12-03Y
6DG2005Q0011차2004072320040724YYYYYNN2019-01-072019-12-03Y
7DG2005Q0013차2004120420041205YYYYYNN2019-01-072019-12-03Y
8DG2005Q0012차2004101620041017YYYYYNN2019-01-072019-12-03Y
9DG2005Q0011차_조류2004110620041106NNYNNNN2019-01-072019-12-03Y
사업코드조사차수조사시작일조사종료일식물상조사유무포유류조사유무조류조사유무양서파충류조사유무곤충류조사유무어류조사유무저서생물(동물)유무생성일수정일사용여부
90DG2007E0043차2010080920100810YYYYYNY2019-01-072019-12-03Y
91DG2007E0044차2011042220110422YYYYYNY2019-01-072019-12-03Y
92DG2007E0054차2007052620070527NYYYNNN2019-01-072019-12-03Y
93DG2007E0050차2006083020060831YNNNNNN2019-01-072019-12-03Y
94DG2007E0052016_3분기(사후공사시)2016081820160819YYYYYYY2019-01-072019-12-03Y
95DG2007E0052016_4분기(사후공사시)2016121420161215YYYYYYY2019-01-072019-12-03Y
96DG2007E0052017_1분기(사후공사시)2017032020170321YYYYYYY2019-01-072019-12-03Y
97DG2007E0051차2006082520060826NYYYNNN2019-01-072019-12-03Y
98DG2007E0052016_1분기(사후공사시)2016032120160322YYYYYYY2019-01-072019-12-03Y
99DG2007E0052차2006112920061130NYYYNNN2019-01-072019-12-03Y