Overview

Dataset statistics

Number of variables6
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory576.2 KiB
Average record size in memory59.0 B

Variable types

Numeric3
DateTime1
Categorical2

Dataset

Description한국산업안전보건공단 K2B 시스템에 접속 정보들에 대한 내용으로 접속일자, 지사구분, 접속자수,횟수 등에 대한 정보를 제공합니다.
Author한국산업안전보건공단
URLhttps://www.data.go.kr/data/15093368/fileData.do

Alerts

접속자수 is highly overall correlated with 접속횟수High correlation
접속횟수 is highly overall correlated with 접속자수High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 23:57:42.766007
Analysis finished2023-12-12 23:57:44.184038
Duration1.42 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5160.7551
Minimum1
Maximum10345
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T08:57:44.245845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile511.95
Q12579.75
median5152.5
Q37753.25
95-th percentile9829.05
Maximum10345
Range10344
Interquartile range (IQR)5173.5

Descriptive statistics

Standard deviation2989.1692
Coefficient of variation (CV)0.57921159
Kurtosis-1.199959
Mean5160.7551
Median Absolute Deviation (MAD)2586.5
Skewness0.0058905172
Sum51607551
Variance8935132.3
MonotonicityNot monotonic
2023-12-13T08:57:44.363038image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1067 1
 
< 0.1%
2996 1
 
< 0.1%
3273 1
 
< 0.1%
2531 1
 
< 0.1%
8914 1
 
< 0.1%
6815 1
 
< 0.1%
1144 1
 
< 0.1%
5723 1
 
< 0.1%
3428 1
 
< 0.1%
8424 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
10345 1
< 0.1%
10344 1
< 0.1%
10343 1
< 0.1%
10342 1
< 0.1%
10341 1
< 0.1%
10340 1
< 0.1%
10339 1
< 0.1%
10338 1
< 0.1%
10337 1
< 0.1%
10336 1
< 0.1%
Distinct364
Distinct (%)3.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2020-01-01 00:00:00
Maximum2020-12-31 00:00:00
2023-12-13T08:57:44.478348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:57:44.620040image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

접속요일
Categorical

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1474 
1472 
1459 
1452 
1444 
Other values (2)
2699 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
1474
14.7%
1472
14.7%
1459
14.6%
1452
14.5%
1444
14.4%
1373
13.7%
1326
13.3%

Length

2023-12-13T08:57:44.732900image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:57:44.827085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1474
14.7%
1472
14.7%
1459
14.6%
1452
14.5%
1444
14.4%
1373
13.7%
1326
13.3%

지사구분
Categorical

Distinct29
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
부산광역본부
 
356
본부
 
356
대전세종광역본부
 
356
경기지역본부
 
355
울산지역본부
 
355
Other values (24)
8222 

Length

Max length8
Median length6
Mean length5.9288
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row강원동부지사
2nd row충북지역본부
3rd row대구광역본부
4th row경기중부지사
5th row경북동부지사

Common Values

ValueCountFrequency (%)
부산광역본부 356
 
3.6%
본부 356
 
3.6%
대전세종광역본부 356
 
3.6%
경기지역본부 355
 
3.5%
울산지역본부 355
 
3.5%
서울광역본부 355
 
3.5%
인천광역본부 355
 
3.5%
경남지역본부 353
 
3.5%
서울남부지사 353
 
3.5%
대구광역본부 352
 
3.5%
Other values (19) 6454
64.5%

Length

2023-12-13T08:57:44.940077image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
부산광역본부 356
 
3.6%
본부 356
 
3.6%
대전세종광역본부 356
 
3.6%
경기지역본부 355
 
3.5%
울산지역본부 355
 
3.5%
서울광역본부 355
 
3.5%
인천광역본부 355
 
3.5%
경남지역본부 353
 
3.5%
서울남부지사 353
 
3.5%
대구광역본부 352
 
3.5%
Other values (19) 6454
64.5%

접속자수
Real number (ℝ)

HIGH CORRELATION 

Distinct567
Distinct (%)5.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean122.7678
Minimum1
Maximum2835
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T08:57:45.050521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4
Q126
median62
Q3115
95-th percentile224
Maximum2835
Range2834
Interquartile range (IQR)89

Descriptive statistics

Standard deviation311.78494
Coefficient of variation (CV)2.5396312
Kurtosis35.979134
Mean122.7678
Median Absolute Deviation (MAD)39
Skewness5.9648485
Sum1227678
Variance97209.848
MonotonicityNot monotonic
2023-12-13T08:57:45.171291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4 224
 
2.2%
3 156
 
1.6%
1 136
 
1.4%
5 133
 
1.3%
2 127
 
1.3%
23 108
 
1.1%
27 104
 
1.0%
28 101
 
1.0%
25 95
 
0.9%
31 95
 
0.9%
Other values (557) 8721
87.2%
ValueCountFrequency (%)
1 136
1.4%
2 127
1.3%
3 156
1.6%
4 224
2.2%
5 133
1.3%
6 79
 
0.8%
7 71
 
0.7%
8 69
 
0.7%
9 71
 
0.7%
10 78
 
0.8%
ValueCountFrequency (%)
2835 1
< 0.1%
2709 2
< 0.1%
2662 1
< 0.1%
2595 1
< 0.1%
2578 1
< 0.1%
2560 2
< 0.1%
2559 1
< 0.1%
2556 1
< 0.1%
2544 1
< 0.1%
2519 1
< 0.1%

접속횟수
Real number (ℝ)

HIGH CORRELATION 

Distinct2158
Distinct (%)21.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean967.9518
Minimum1
Maximum26780
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T08:57:45.288144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile20
Q1147
median444
Q3892
95-th percentile1909.05
Maximum26780
Range26779
Interquartile range (IQR)745

Descriptive statistics

Standard deviation2591.5251
Coefficient of variation (CV)2.6773287
Kurtosis37.630209
Mean967.9518
Median Absolute Deviation (MAD)334
Skewness6.0546749
Sum9679518
Variance6716002.5
MonotonicityNot monotonic
2023-12-13T08:57:45.402578image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2 45
 
0.4%
6 43
 
0.4%
14 34
 
0.3%
4 33
 
0.3%
8 32
 
0.3%
10 31
 
0.3%
20 31
 
0.3%
21 31
 
0.3%
12 29
 
0.3%
24 29
 
0.3%
Other values (2148) 9662
96.6%
ValueCountFrequency (%)
1 26
0.3%
2 45
0.4%
3 16
 
0.2%
4 33
0.3%
5 17
 
0.2%
6 43
0.4%
7 18
 
0.2%
8 32
0.3%
9 14
 
0.1%
10 31
0.3%
ValueCountFrequency (%)
26780 1
< 0.1%
25137 1
< 0.1%
24516 1
< 0.1%
23571 1
< 0.1%
23326 1
< 0.1%
22896 1
< 0.1%
22820 1
< 0.1%
22763 1
< 0.1%
22286 1
< 0.1%
22087 1
< 0.1%

Interactions

2023-12-13T08:57:43.719945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:57:43.186732image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:57:43.459453image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:57:43.810193image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:57:43.277580image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:57:43.540681image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:57:43.935684image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:57:43.380162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:57:43.624331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T08:57:45.479293image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번접속요일지사구분접속자수접속횟수
연번1.0000.0000.0000.1310.107
접속요일0.0001.0000.0000.1310.114
지사구분0.0000.0001.0000.6730.646
접속자수0.1310.1310.6731.0000.959
접속횟수0.1070.1140.6460.9591.000
2023-12-13T08:57:45.561434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
접속요일지사구분
접속요일1.0000.000
지사구분0.0001.000
2023-12-13T08:57:45.637835image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번접속자수접속횟수접속요일지사구분
연번1.0000.2640.2290.0000.000
접속자수0.2641.0000.9550.0660.310
접속횟수0.2290.9551.0000.0580.290
접속요일0.0000.0660.0581.0000.000
지사구분0.0000.3100.2900.0001.000

Missing values

2023-12-13T08:57:44.037466image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T08:57:44.143034image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번접속일자접속요일지사구분접속자수접속횟수
106610672020-02-11강원동부지사9175
728972902020-09-15충북지역본부1501015
267226732020-04-07대구광역본부102728
243524362020-03-30경기중부지사44535
810281032020-10-14경북동부지사68531
178417852020-03-07경남동부지사626
3273282020-01-13서울동부지사321
708270832020-09-08전남동부지사91643
968596862020-12-08대구서부지사124894
717071712020-09-11전남지역본부80372
연번접속일자접속요일지사구분접속자수접속횟수
822882292020-10-18인천광역본부89549
649564962020-08-19경북동부지사64430
807580762020-10-13광주광역본부1511343
338433852020-05-02경기지역본부48372
395539562020-05-22경기동부지사101544
121312142020-02-16경기중부지사14
10267102682020-12-29경북동부지사6225
654365442020-08-20충북지역본부135892
945194522020-11-30경북동부지사44629
404440452020-05-25경기서부지사1771578