Overview

Dataset statistics

Number of variables: 6
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 566.4 KiB
Average record size in memory: 58.0 B
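
The average record size appears to be the total memory footprint divided by the number of observations. A minimal sketch of that arithmetic, using only the two figures reported above (the variable names are illustrative):

```python
# Figures copied from the "Dataset statistics" block of the report.
total_memory_kib = 566.4
n_observations = 10_000

# 1 KiB = 1024 bytes; dividing by the row count gives the per-record average.
avg_record_bytes = total_memory_kib * 1024 / n_observations

print(f"{avg_record_bytes:.1f} B")  # prints "58.0 B"
```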

Variable types

Categorical: 3
DateTime: 1
Numeric: 2

Dataset

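Description: Network traffic status information for Korea South-East Power Co. (한국남동발전). For each date, the data reports the site (사업소), input/output direction (입출력구분), average usage (평균사용량), and maximum usage (최대사용량).
URL: https://www.data.go.kr/data/15113991/fileData.do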

Alerts

High correlation: 평균값 is highly overall correlated with 최대값
High correlation: 최대값 is highly overall correlated with 평균값
Imbalance: 단위 is highly imbalanced (98.9%)
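
The 98.9% imbalance figure is consistent with a score of one minus the normalized Shannon entropy of the class shares. The sketch below assumes that definition (the function name `imbalance_score` is illustrative, not taken from the report) and applies it to the 단위 counts listed later in this report (Mbps 9990, Gbps 10):

```python
import math


def imbalance_score(counts):
    """Return 1 - H / H_max, where H is the Shannon entropy of the class shares."""
    total = sum(counts)
    shares = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in shares)
    max_entropy = math.log2(len(counts))
    # With a single class the normalization is undefined; this sketch returns 0.0.
    if max_entropy == 0:
        return 0.0
    return 1.0 - entropy / max_entropy


# Counts of the 단위 column as reported under "Common Values".
unit_counts = {"Mbps": 9990, "Gbps": 10}
score = imbalance_score(list(unit_counts.values()))

print(f"{score:.1%}")  # prints "98.9%"
```

A score near 100% means almost all rows share one category, which is the case here: only 10 of 10000 rows use Gbps.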

Reproduction

Analysis started: 2023-12-12 16:05:21.714078
Analysis finished: 2023-12-12 16:05:23.129000
Duration: 1.41 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
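
A report of this kind can be regenerated along the following lines. This is a sketch under stated assumptions: the CSV path, the file encoding, and the report title are hypothetical, and the settings in `config.json` are not reproduced here. It uses `pandas.read_csv` and the `ProfileReport` class from ydata-profiling.

```python
def build_report(csv_path, output_html="report.html", encoding="utf-8"):
    """Profile a CSV export of the dataset and write an HTML report.

    csv_path, output_html and encoding are placeholders; adjust them to the
    file actually downloaded from the URL given in the Dataset section.
    """
    # Imported lazily so this module still loads where the packages are absent.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Parse the 일자 column as dates so it is profiled as a date variable.
    df = pd.read_csv(csv_path, encoding=encoding, parse_dates=["일자"])

    # The title is an arbitrary label chosen for this sketch.
    report = ProfileReport(df, title="Network traffic profile")
    report.to_file(output_html)
    return output_html
```

Calling `build_report("traffic.csv")` with a real file path would write `report.html` to the working directory. Timings and memory figures will differ from those listed above.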

Variables

사업소
Categorical

Distinct: 7
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

분당: 1444
삼천포: 1441
본사_ERP센터: 1435
영흥: 1427
본사: 1418
Other values (2): 2835

Length

Max length: 8
Median length: 2
Mean length: 3.0051
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 본사
2nd row: 영흥
3rd row: 여수
4th row: 여수
5th row: 삼천포

Common Values

Value | Count | Frequency (%)
분당 | 1444 | 14.4%
삼천포 | 1441 | 14.4%
본사_ERP센터 | 1435 | 14.3%
영흥 | 1427 | 14.3%
본사 | 1418 | 14.2%
영동 | 1418 | 14.2%
여수 | 1417 | 14.2%
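
The counts above can be cross-checked against the rest of the report: they should sum to the number of observations, and the two smallest should add up to the "Other values (2)" entry in the variable summary. A small sketch of those checks, with the counts copied from the table (variable names are illustrative):

```python
# Counts per site, copied from the Common Values table.
site_counts = {
    "분당": 1444,
    "삼천포": 1441,
    "본사_ERP센터": 1435,
    "영흥": 1427,
    "본사": 1418,
    "영동": 1418,
    "여수": 1417,
}

n_observations = sum(site_counts.values())

# Share of each site as a percentage of all rows.
shares = {site: 100 * count / n_observations for site, count in site_counts.items()}

# The summary lists the five largest counts and groups the rest as "Other values".
other_values = sum(sorted(site_counts.values(), reverse=True)[5:])

print(n_observations, other_values)
for site, share in shares.items():
    print(f"{site}: {share:.2f}%")
```

The seven sites each hold roughly one seventh of the rows, so 사업소 is close to uniformly distributed.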

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]

Value | Count | Frequency (%)
분당 | 1444 | 14.4%
삼천포 | 1441 | 14.4%
본사_erp센터 | 1435 | 14.3%
영흥 | 1427 | 14.3%
본사 | 1418 | 14.2%
영동 | 1418 | 14.2%
여수 | 1417 | 14.2%

입출력구분
Categorical

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

출력 사용량: 5008
입력 사용량: 4992

Length

Max length: 6
Median length: 6
Mean length: 6
Min length: 6

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 출력 사용량
2nd row: 입력 사용량
3rd row: 입력 사용량
4th row: 출력 사용량
5th row: 입력 사용량

Common Values

Value | Count | Frequency (%)
출력 사용량 | 5008 | 50.1%
입력 사용량 | 4992 | 49.9%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]

Value | Count | Frequency (%)
사용량 | 10000 | 50.0%
출력 | 5008 | 25.0%
입력 | 4992 | 25.0%

일자
Date

Distinct: 761
Distinct (%): 7.6%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Minimum: 2021-04-20 00:00:00
Maximum: 2023-05-20 00:00:00

[Figure: Histogram with fixed size bins (bins=50)]
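
The distinct count can be compared with the length of the date range to see whether any calendar day is missing. A short sketch using only the minimum, maximum, and distinct count reported above (variable names are illustrative):

```python
from datetime import date

# Minimum and maximum of the 일자 column, as reported above.
first_day = date(2021, 4, 20)
last_day = date(2023, 5, 20)
distinct_dates = 761

# Number of calendar days in the range, counting both endpoints.
calendar_days = (last_day - first_day).days + 1

print(calendar_days, calendar_days == distinct_dates)  # prints "761 True"
```

Since the two numbers match, every day between the minimum and maximum appears at least once in the sample.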

평균값
Real number (ℝ)

HIGH CORRELATION 

Distinct: 5660
Distinct (%): 56.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 56.112947
Minimum: 0.6
Maximum: 989.29
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0.6
5-th percentile: 1.4795
Q1: 6.32
Median: 16.97
Q3: 82.5625
95-th percentile: 191.1115
Maximum: 989.29
Range: 988.69
Interquartile range (IQR): 76.2425

Descriptive statistics

Standard deviation: 91.012384
Coefficient of variation (CV): 1.6219498
Kurtosis: 35.184403
Mean: 56.112947
Median Absolute Deviation (MAD): 14.095
Skewness: 4.7494988
Sum: 561129.47
Variance: 8283.2541
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=50)]
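
Several of the 평균값 statistics above are derived from others by standard definitions: range is maximum minus minimum, IQR is Q3 minus Q1, CV is standard deviation divided by mean, variance is the squared standard deviation, and sum is mean times the number of observations. The sketch below recomputes them from the reported inputs as a consistency check (variable names are illustrative):

```python
import math

# Inputs copied from the 평균값 statistics above.
minimum, maximum = 0.6, 989.29
q1, q3 = 6.32, 82.5625
mean, std = 56.112947, 91.012384
n_observations = 10_000

value_range = maximum - minimum   # reported Range: 988.69
iqr = q3 - q1                     # reported IQR: 76.2425
cv = std / mean                   # reported CV: 1.6219498
variance = std ** 2               # reported Variance: 8283.2541
total = mean * n_observations     # reported Sum: 561129.47

for name, value in [("range", value_range), ("iqr", iqr), ("cv", cv),
                    ("variance", variance), ("sum", total)]:
    print(f"{name}: {value:.4f}")
```

A CV above 1 together with a median (16.97) far below the mean (56.11) indicates a strongly right-skewed distribution, which agrees with the reported skewness of 4.75.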
Value | Count | Frequency (%)
0.9 | 35 | 0.4%
1.0 | 29 | 0.3%
1.11 | 18 | 0.2%
1.13 | 18 | 0.2%
1.07 | 17 | 0.2%
1.14 | 15 | 0.1%
1.16 | 15 | 0.1%
0.8 | 15 | 0.1%
1.17 | 14 | 0.1%
1.02 | 13 | 0.1%
Other values (5650) | 9811 | 98.1%

Value | Count | Frequency (%)
0.6 | 1 | < 0.1%
0.7 | 3 | < 0.1%
0.8 | 15 | 0.1%
0.82 | 1 | < 0.1%
0.89 | 1 | < 0.1%
0.9 | 35 | 0.4%
0.93 | 1 | < 0.1%
0.94 | 1 | < 0.1%
0.95 | 1 | < 0.1%
0.98 | 1 | < 0.1%

Value | Count | Frequency (%)
989.29 | 1 | < 0.1%
988.91 | 1 | < 0.1%
988.32 | 1 | < 0.1%
985.2 | 1 | < 0.1%
976.7 | 1 | < 0.1%
974.13 | 1 | < 0.1%
971.15 | 1 | < 0.1%
967.06 | 1 | < 0.1%
965.28 | 1 | < 0.1%
964.53 | 1 | < 0.1%

최대값
Real number (ℝ)

HIGH CORRELATION 

Distinct: 8823
Distinct (%): 88.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 384.43596
Minimum: 1
Maximum: 18700
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 14.208
Q1: 90.3475
Median: 216.25
Q3: 432.485
95-th percentile: 1720
Maximum: 18700
Range: 18699
Interquartile range (IQR): 342.1375

Descriptive statistics

Standard deviation: 505.67829
Coefficient of variation (CV): 1.3153772
Kurtosis: 174.98747
Mean: 384.43596
Median Absolute Deviation (MAD): 159.98
Skewness: 6.5416533
Sum: 3844359.6
Variance: 255710.54
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=50)]
Value | Count | Frequency (%)
1720.0 | 29 | 0.3%
1760.0 | 23 | 0.2%
1690.0 | 23 | 0.2%
1710.0 | 22 | 0.2%
1770.0 | 18 | 0.2%
1670.0 | 17 | 0.2%
1730.0 | 16 | 0.2%
1740.0 | 16 | 0.2%
1780.0 | 16 | 0.2%
1700.0 | 16 | 0.2%
Other values (8813) | 9804 | 98.0%

Value | Count | Frequency (%)
1.0 | 1 | < 0.1%
1.01 | 1 | < 0.1%
1.03 | 2 | < 0.1%
1.05 | 1 | < 0.1%
1.09 | 1 | < 0.1%
1.1 | 1 | < 0.1%
1.11 | 1 | < 0.1%
1.15 | 1 | < 0.1%
1.33 | 1 | < 0.1%
1.88 | 1 | < 0.1%

Value | Count | Frequency (%)
18700.0 | 1 | < 0.1%
2400.06 | 1 | < 0.1%
2342.85 | 1 | < 0.1%
2325.84 | 1 | < 0.1%
2325.56 | 1 | < 0.1%
2325.49 | 1 | < 0.1%
2300.45 | 1 | < 0.1%
2296.45 | 1 | < 0.1%
2291.85 | 1 | < 0.1%
2286.37 | 1 | < 0.1%

단위
Categorical

IMBALANCE 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Mbps: 9990
Gbps: 10

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Mbps
2nd row: Mbps
3rd row: Mbps
4th row: Mbps
5th row: Mbps

Common Values

Value | Count | Frequency (%)
Mbps | 9990 | 99.9%
Gbps | 10 | 0.1%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot]

Value | Count | Frequency (%)
mbps | 9990 | 99.9%
gbps | 10 | 0.1%

Interactions

[Figures: four interaction plots]

Correlations

[Figure: correlation plot]

Variable | 사업소 | 입출력구분 | 평균값 | 최대값 | 단위
사업소 | 1.000 | 0.000 | 0.499 | 0.260 | 0.042
입출력구분 | 0.000 | 1.000 | 0.127 | 0.000 | 0.000
평균값 | 0.499 | 0.127 | 1.000 | 0.443 | 0.079
최대값 | 0.260 | 0.000 | 0.443 | 1.000 | 0.000
단위 | 0.042 | 0.000 | 0.079 | 0.000 | 1.000

Variable | 입출력구분 | 사업소 | 단위
입출력구분 | 1.000 | 0.000 | 0.000
사업소 | 0.000 | 1.000 | 0.045
단위 | 0.000 | 0.045 | 1.000

Variable | 평균값 | 최대값 | 사업소 | 입출력구분 | 단위
평균값 | 1.000 | 0.848 | 0.280 | 0.098 | 0.061
최대값 | 0.848 | 1.000 | 0.181 | 0.000 | 0.000
사업소 | 0.280 | 0.181 | 1.000 | 0.000 | 0.045
입출력구분 | 0.098 | 0.000 | 0.000 | 1.000 | 0.000
단위 | 0.061 | 0.000 | 0.045 | 0.000 | 1.000
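
The High correlation alerts for 평균값 and 최대값 can be related to the last matrix in this section by scanning it for off-diagonal entries above a cut-off. The sketch below assumes a cut-off of 0.5, which is an assumption rather than a setting confirmed by this report, and copies the matrix values from above:

```python
# Last correlation matrix of this section, rows and columns in the same order.
names = ["평균값", "최대값", "사업소", "입출력구분", "단위"]
matrix = [
    [1.000, 0.848, 0.280, 0.098, 0.061],
    [0.848, 1.000, 0.181, 0.000, 0.000],
    [0.280, 0.181, 1.000, 0.000, 0.045],
    [0.098, 0.000, 0.000, 1.000, 0.000],
    [0.061, 0.000, 0.045, 0.000, 1.000],
]

THRESHOLD = 0.5  # assumed cut-off for a "high" correlation

# Visit each unordered pair once (upper triangle, diagonal excluded).
high_pairs = [
    (names[i], names[j], matrix[i][j])
    for i in range(len(names))
    for j in range(i + 1, len(names))
    if abs(matrix[i][j]) >= THRESHOLD
]

print(high_pairs)  # prints [('평균값', '최대값', 0.848)]
```

Under that assumption, the only pair flagged is 평균값 and 최대값, matching the two alerts listed at the top of the report.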

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display which lets you quickly visually pick out patterns in data completion.]

Sample

Index | 사업소 | 입출력구분 | 일자 | 평균값최대값 | 단위
3655 | 본사 | 출력 사용량 | 2022-02-05 | 45.9103.17 | Mbps
4916 | 영흥 | 입력 사용량 | 2022-10-29 | 18.92116.29 | Mbps
7366 | 여수 | 입력 사용량 | 2022-03-08 | 13.85210.16 | Mbps
7579 | 여수 | 출력 사용량 | 2022-06-22 | 81.04409.53 | Mbps
5276 | 삼천포 | 입력 사용량 | 2022-04-27 | 14.14314.39 | Mbps
6525 | 영동 | 출력 사용량 | 2022-01-11 | 5.4201.81 | Mbps
6770 | 영동 | 입력 사용량 | 2022-05-14 | 1.2428.78 | Mbps
4587 | 영흥 | 출력 사용량 | 2022-05-17 | 21.45152.23 | Mbps
6082 | 분당 | 입력 사용량 | 2022-06-04 | 1.066.18 | Mbps
10549 | 본사_ERP센터 | 출력 사용량 | 2023-03-29 | 136.871794.28 | Mbps

Index | 사업소 | 입출력구분 | 일자 | 평균값최대값 | 단위
3160 | 본사_ERP센터 | 입력 사용량 | 2021-06-03 | 117.05480.63 | Mbps
6791 | 영동 | 출력 사용량 | 2022-05-24 | 4.2550.63 | Mbps
5064 | 삼천포 | 입력 사용량 | 2022-01-11 | 11.9178.63 | Mbps
4735 | 영흥 | 출력 사용량 | 2022-07-30 | 12.18104.33 | Mbps
10587 | 본사_ERP센터 | 출력 사용량 | 2023-04-17 | 140.532342.85 | Mbps
10017 | 영동 | 출력 사용량 | 2023-04-12 | 8.475.24 | Mbps
3065 | 여수 | 출력 사용량 | 2021-12-28 | 67.98359.25 | Mbps
5219 | 삼천포 | 출력 사용량 | 2022-03-29 | 14.12234.34 | Mbps
7097 | 영동 | 출력 사용량 | 2022-10-24 | 14.17261.17 | Mbps
6900 | 영동 | 입력 사용량 | 2022-07-18 | 3.254.02 | Mbps