Overview

Dataset statistics

Number of variables: 5
Number of observations: 1045
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 43.0 KiB
Average record size in memory: 42.1 B
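
The average record size follows from the two figures above it. A minimal check, assuming the reported 43.0 KiB is the rounded total and that 1 KiB = 1024 bytes:

```python
# Figures copied from the dataset statistics in this report.
total_size_kib = 43.0   # "Total size in memory" (rounded to one decimal)
n_observations = 1045   # "Number of observations"

total_size_bytes = total_size_kib * 1024            # 1 KiB = 1024 bytes
avg_record_bytes = total_size_bytes / n_observations

print(f"{avg_record_bytes:.1f} B per record")
```

The result rounds to the 42.1 B shown in the report.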

Variable types

DateTime: 1
Numeric: 2
Categorical: 1
Text: 1

Dataset

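Description: This dataset covers the weekly press release plans of the Ministry of Trade, Industry and Energy (산업통상자원부). Details can be looked up by joining on the 글번호 (post number) field of the "산업통상자원부_주간 보도 세부현황" (Ministry of Trade, Industry and Energy, weekly press release details) dataset.
Author: 산업통상자원부 (Ministry of Trade, Industry and Energy)
URL: https://www.data.go.kr/data/15067405/fileData.do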
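
The description says the plan records link to a separate detail dataset through 글번호. A minimal sketch of that lookup in plain Python follows. The two plan rows are taken from the sample at the end of this report; the detail rows and their `detail` field are hypothetical placeholders, since the detail dataset's actual columns are not shown here.

```python
from collections import defaultdict

# Two plan rows copied from the sample section of this report.
plan_rows = [
    {"등록일": "2023-12-29", "발행월일": "01-01~01-06", "글번호": 887},
    {"등록일": "2023-12-22", "발행월일": "12-26~12-30", "글번호": 886},
]

# Hypothetical stand-in for the detail dataset (field names are assumptions).
detail_rows = [
    {"글번호": 887, "detail": "placeholder detail A"},
    {"글번호": 887, "detail": "placeholder detail B"},
    {"글번호": 886, "detail": "placeholder detail C"},
]

# Index the detail rows by 글번호 so each plan row can find its details.
details_by_post = defaultdict(list)
for row in detail_rows:
    details_by_post[row["글번호"]].append(row["detail"])

# Attach the matching details to each plan row (empty list if none match).
joined = [
    {**plan, "details": details_by_post.get(plan["글번호"], [])}
    for plan in plan_rows
]

for row in joined:
    print(row["글번호"], row["details"])
```

Note that 글번호 has 1038 distinct values across 1045 rows, so it is not a unique key in the plan data; a join on it can match more than one plan row.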

Alerts

High correlation: 부서코드 is highly overall correlated with 부서명
High correlation: 부서명 is highly overall correlated with 부서코드
Imbalance: 부서명 is highly imbalanced (92.1%)
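
The 92.1% imbalance figure can be reproduced from the 부서명 category counts reported later (1029, 13, 3) using a normalized-entropy score, 1 - H / log(k), where H is the Shannon entropy and k the number of categories. Whether this is exactly the formula the profiler applies is an assumption; the sketch only shows that it matches the reported value.

```python
import math

def imbalance_score(counts):
    """Return 1 - H/log(k): 0 for a uniform split, near 1 when one class dominates."""
    k = len(counts)
    if k < 2:
        # Undefined for fewer than two classes; treated as 0 in this sketch.
        return 0.0
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log(p) for p in probs)
    return 1.0 - entropy / math.log(k)

# Category counts of 부서명 as reported in this document.
score = imbalance_score([1029, 13, 3])
print(f"{score:.1%}")  # prints 92.1%
```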

Reproduction

Analysis started: 2024-03-15 01:22:00.424617
Analysis finished: 2024-03-15 01:22:01.771368
Duration: 1.35 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
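
The duration is the difference between the two timestamps above. A small check using only the standard library:

```python
from datetime import datetime

# Timestamps copied from the reproduction details in this report.
started = datetime.fromisoformat("2024-03-15 01:22:00.424617")
finished = datetime.fromisoformat("2024-03-15 01:22:01.771368")

duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")  # prints 1.35 seconds
```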

Variables

등록일 (registration date)
DateTime

Distinct: 1028
Distinct (%): 98.4%
Missing: 0
Missing (%): 0.0%
Memory size: 8.3 KiB
Minimum: 2003-09-08 00:00:00
Maximum: 2023-12-29 00:00:00
[Figure: histogram with fixed size bins (bins=50)]

부서코드 (department code)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 16
Distinct (%): 1.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1440153.4
Minimum: 1440000
Maximum: 1450016
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 9.3 KiB

Quantile statistics

Minimum: 1440000
5-th percentile: 1440000
Q1: 1440000
Median: 1440000
Q3: 1440000
95-th percentile: 1440000
Maximum: 1450016
Range: 10016
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 1230.1795
Coefficient of variation (CV): 0.00085420031
Kurtosis: 60.62343
Mean: 1440153.4
Median Absolute Deviation (MAD): 0
Skewness: 7.9061597
Sum: 1.5049603 × 10^9
Variance: 1513341.6
Monotonicity: Not monotonic
[Figure: histogram with fixed size bins (bins=16)]
Common values
Value             Count  Frequency (%)
1440000           1016   97.2%
1450016           13     1.2%
1450006           3      0.3%
1440009           1      0.1%
1440014           1      0.1%
1440013           1      0.1%
1440012           1      0.1%
1440010           1      0.1%
1440008           1      0.1%
1440001           1      0.1%
Other values (6)  6      0.6%

Minimum 10 values
Value    Count  Frequency (%)
1440000  1016   97.2%
1440001  1      0.1%
1440002  1      0.1%
1440003  1      0.1%
1440004  1      0.1%
1440005  1      0.1%
1440006  1      0.1%
1440007  1      0.1%
1440008  1      0.1%
1440009  1      0.1%

Maximum 10 values
Value    Count  Frequency (%)
1450016  13     1.2%
1450006  3      0.3%
1440014  1      0.1%
1440013  1      0.1%
1440012  1      0.1%
1440010  1      0.1%
1440009  1      0.1%
1440008  1      0.1%
1440007  1      0.1%
1440006  1      0.1%

부서명 (department name)
Categorical

HIGH CORRELATION  IMBALANCE 

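Distinct: 3
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 8.3 KiB
Category counts: 산업통상자원부 (Ministry of Trade, Industry and Energy) 1029; 정보관리담당관 (Information Management Officer) 13; 홍보담당관 (Public Relations Officer) 3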

Length

Max length: 7
Median length: 7
Mean length: 6.9942584
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 산업통상자원부
2nd row: 산업통상자원부
3rd row: 산업통상자원부
4th row: 산업통상자원부
5th row: 산업통상자원부

Common Values

Value           Count  Frequency (%)
산업통상자원부  1029   98.5%
정보관리담당관  13     1.2%
홍보담당관      3      0.3%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the category frequencies (산업통상자원부 1029, 98.5%; 정보관리담당관 13, 1.2%; 홍보담당관 3, 0.3%)]

발행월일 (publication period, month-day)
Text

Distinct: 558
Distinct (%): 53.4%
Missing: 0
Missing (%): 0.0%
Memory size: 8.3 KiB

Length

Max length: 11
Median length: 11
Mean length: 10.991388
Min length: 6

Characters and Unicode

Total characters: 11486
Distinct characters: 12
Distinct categories: 3
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 226
Unique (%): 21.6%

Sample

1st row: 01-01~01-06
2nd row: 12-26~12-30
3rd row: 12-18~12-23
4th row: 12-11~12-16
5th row: 12-04~12-09
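
The sample values follow an `MM-DD~MM-DD` pattern with no year. A minimal parsing sketch, assuming the caller supplies the start year (for example, derived from 등록일) and that an end month earlier than the start month means the period crosses into the next year. The function name `parse_period` is illustrative. Since the minimum length is 6, some values do not fit the 11-character pattern, so the function returns `None` for anything it cannot parse.

```python
import re
from datetime import date

# Matches strings such as "12-26~12-30" (two month-day pairs joined by "~").
PERIOD_RE = re.compile(r"^(\d{2})-(\d{2})~(\d{2})-(\d{2})$")

def parse_period(text, start_year):
    """Return (start_date, end_date), or None if the text does not fit the pattern."""
    match = PERIOD_RE.match(text)
    if match is None:
        return None
    start_month, start_day, end_month, end_day = map(int, match.groups())
    # Assumption: an end month before the start month means a year rollover.
    end_year = start_year + 1 if end_month < start_month else start_year
    try:
        return (date(start_year, start_month, start_day),
                date(end_year, end_month, end_day))
    except ValueError:
        # Invalid calendar dates (e.g. "02-30") are treated as unparseable.
        return None

print(parse_period("12-26~12-30", 2023))
print(parse_period("12-26", 2023))
```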

Value               Count  Frequency (%)
06-20~06-25         4      0.4%
05-30~06-04         4      0.4%
06-08~06-13         4      0.4%
03-09~03-14         4      0.4%
10-22~10-27         4      0.4%
10-24~10-29         4      0.4%
11-07~11-12         4      0.4%
11-28~12-03         4      0.4%
11-14~11-19         4      0.4%
01-05~01-10         4      0.4%
Other values (548)  1005   96.2%

Most occurring characters

Value             Count  Frequency (%)
0                 2550   22.2%
-                 2090   18.2%
1                 1834   16.0%
2                 1215   10.6%
~                 1045   9.1%
3                 477    4.2%
5                 390    3.4%
7                 383    3.3%
6                 378    3.3%
8                 377    3.3%
Other values (2)  747    6.5%

Most occurring categories

Value             Count  Frequency (%)
Decimal Number    8351   72.7%
Dash Punctuation  2090   18.2%
Math Symbol       1045   9.1%
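
The three categories above correspond to Unicode general categories: `Nd` (Decimal Number), `Pd` (Dash Punctuation) and `Sm` (Math Symbol). A small sketch using Python's standard `unicodedata` module on the five sample values from this report; it illustrates the classification and is not the profiler's own code.

```python
import unicodedata
from collections import Counter

# The five sample values of 발행월일 shown in this report.
samples = ["01-01~01-06", "12-26~12-30", "12-18~12-23", "12-11~12-16", "12-04~12-09"]

# Count characters by their Unicode general category code.
category_counts = Counter(
    unicodedata.category(ch) for text in samples for ch in text
)

for code, count in category_counts.most_common():
    print(code, count)
```

Each 11-character sample contributes 8 digits, 2 hyphens and 1 tilde, which mirrors the roughly 73% / 18% / 9% split in the full column.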

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
0      2550   30.5%
1      1834   22.0%
2      1215   14.5%
3      477    5.7%
5      390    4.7%
7      383    4.6%
6      378    4.5%
8      377    4.5%
4      374    4.5%
9      373    4.5%

Dash Punctuation
Value  Count  Frequency (%)
-      2090   100.0%

Math Symbol
Value  Count  Frequency (%)
~      1045   100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  11486  100.0%

Most frequent character per script

Common
Value             Count  Frequency (%)
0                 2550   22.2%
-                 2090   18.2%
1                 1834   16.0%
2                 1215   10.6%
~                 1045   9.1%
3                 477    4.2%
5                 390    3.4%
7                 383    3.3%
6                 378    3.3%
8                 377    3.3%
Other values (2)  747    6.5%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  11486  100.0%

Most frequent character per block

ASCII
Value             Count  Frequency (%)
0                 2550   22.2%
-                 2090   18.2%
1                 1834   16.0%
2                 1215   10.6%
~                 1045   9.1%
3                 477    4.2%
5                 390    3.4%
7                 383    3.3%
6                 378    3.3%
8                 377    3.3%
Other values (2)  747    6.5%

글번호 (post number)
Real number (ℝ)

Distinct: 1038
Distinct (%): 99.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2266.4469
Minimum: 38
Maximum: 3037
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 9.3 KiB

Quantile statistics

Minimum: 38
5-th percentile: 783.2
Q1: 2246
Median: 2508
Q3: 2776
95-th percentile: 2984.8
Maximum: 3037
Range: 2999
Interquartile range (IQR): 530

Descriptive statistics

Standard deviation: 745.36983
Coefficient of variation (CV): 0.32887152
Kurtosis: 0.43915877
Mean: 2266.4469
Median Absolute Deviation (MAD): 265
Skewness: -1.321161
Sum: 2368437
Variance: 555576.18
Monotonicity: Not monotonic
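
Several of the 글번호 statistics are derived from others in the same tables, which allows a quick consistency check. The sketch below recomputes the range, IQR, mean, CV and variance from the reported inputs; the variable names are illustrative.

```python
# Inputs copied from the 글번호 statistics in this report.
minimum, maximum = 38, 3037
q1, q3 = 2246, 2776
std_dev = 745.36983
total, n = 2368437, 1045   # "Sum" and "Number of observations"

value_range = maximum - minimum   # reported as 2999
iqr = q3 - q1                     # reported as 530
mean = total / n                  # reported as 2266.4469
cv = std_dev / mean               # reported as 0.32887152
variance = std_dev ** 2           # reported as 555576.18

print(value_range, iqr, round(mean, 4), round(cv, 5), round(variance, 2))
```

All five recomputed values agree with the report to the precision shown.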
[Figure: histogram with fixed size bins (bins=50)]
Common values
Value                Count  Frequency (%)
786                  2      0.2%
886                  2      0.2%
846                  2      0.2%
766                  2      0.2%
826                  2      0.2%
866                  2      0.2%
806                  2      0.2%
887                  1      0.1%
2459                 1      0.1%
2456                 1      0.1%
Other values (1028)  1028   98.4%

Minimum 10 values
Value  Count  Frequency (%)
38     1      0.1%
39     1      0.1%
40     1      0.1%
66     1      0.1%
85     1      0.1%
146    1      0.1%
165    1      0.1%
185    1      0.1%
207    1      0.1%
245    1      0.1%

Maximum 10 values
Value  Count  Frequency (%)
3037   1      0.1%
3036   1      0.1%
3035   1      0.1%
3034   1      0.1%
3033   1      0.1%
3032   1      0.1%
3031   1      0.1%
3030   1      0.1%
3029   1      0.1%
3028   1      0.1%

Interactions

[Figures: four pairwise interaction plots of the numeric variables 부서코드 and 글번호]

Correlations

[Figure: correlation plot]
          부서코드  부서명  글번호
부서코드  1.000     1.000   1.000
부서명    1.000     1.000   0.159
글번호    1.000     0.159   1.000

[Figure: correlation plot]
          부서코드  글번호  부서명
부서코드  1.000     -0.037  1.000
글번호    -0.037    1.000   0.095
부서명    1.000     0.095   1.000

Missing values

[Figure: a simple visualization of nullity by column.]
[Figure: nullity matrix, a data-dense display which lets you quickly visually pick out patterns in data completion.]

Sample

First rows
      등록일      부서코드  부서명          발행월일     글번호
0     2023-12-29  1440000   산업통상자원부  01-01~01-06  887
1     2023-12-22  1440001   산업통상자원부  12-26~12-30  886
2     2023-12-15  1440002   산업통상자원부  12-18~12-23  885
3     2023-12-08  1440003   산업통상자원부  12-11~12-16  884
4     2023-12-01  1440004   산업통상자원부  12-04~12-09  883
5     2023-11-24  1440005   산업통상자원부  11-27~12-02  882
6     2023-11-17  1440006   산업통상자원부  11-20~11-25  881
7     2023-11-10  1440007   산업통상자원부  11-13~11-18  880
8     2023-11-03  1440008   산업통상자원부  11-06~11-11  879
9     2023-10-27  1440009   산업통상자원부  10-30~11-04  878

Last rows
      등록일      부서코드  부서명          발행월일     글번호
1035  2003-10-18  1440000   산업통상자원부  10-20~10-24  245
1036  2003-10-11  1440000   산업통상자원부  10-13~10-20  207
1037  2003-10-04  1440000   산업통상자원부  10-06~10-10  185
1038  2003-09-29  1440000   산업통상자원부  09-29~10-04  165
1039  2003-09-22  1440000   산업통상자원부  09-22~09-29  146
1040  2003-09-09  1440000   산업통상자원부  09-15~09-22  85
1041  2003-09-08  1440000   산업통상자원부  08-18~08-25  38
1042  2003-09-08  1440000   산업통상자원부  08-25~09-01  39
1043  2003-09-08  1440000   산업통상자원부  09-01~09-08  40
1044  2003-09-08  1440000   산업통상자원부  09-08~09-15  66