Overview

Dataset statistics

Number of variables: 4
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 419.9 KiB
Average record size in memory: 43.0 B
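
The average record size follows from the two figures above it. A minimal check, assuming that KiB means 1024 bytes and that the average is simply total size divided by the number of observations:

```python
KIB = 1024  # bytes per KiB (assumed binary unit)

total_kib = 419.9   # "Total size in memory" from the report
n_rows = 10_000     # "Number of observations" from the report

# Average bytes per record = total bytes / number of rows.
avg_record_bytes = total_kib * KIB / n_rows
print(f"{avg_record_bytes:.1f} B")
```

Rounded to one decimal place this agrees with the 43.0 B shown in the table.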

Variable types

Numeric: 2
Categorical: 1
Text: 1

Dataset

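Description: Preprocessed base data, including per-transaction sequence ID, event ID, size, and law information, prepared for public evaluation and demand surveys of the National Law Information Service (국가법령정보서비스).
Author: Ministry of Government Legislation (법제처)
URL: https://www.data.go.kr/data/15049410/fileData.do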

Alerts

transactionID_SIZE has constant value "1" (Constant)
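
A constant alert means the column holds a single distinct value in every row. The sketch below shows one way to detect that with plain Python. The function name `constant_columns` and the toy rows are hypothetical; only the column names and law names are taken from this report.

```python
def constant_columns(rows):
    """Return the names of columns that hold exactly one distinct value."""
    if not rows:
        return []
    distinct = {name: set() for name in rows[0]}
    for row in rows:
        for name, value in row.items():
            distinct[name].add(value)
    return [name for name, values in distinct.items() if len(values) == 1]


# Toy rows (illustrative values, not the real data).
rows = [
    {"transactionID_eventID": 1, "transactionID_SIZE": "1", "law_nm": "건축법"},
    {"transactionID_eventID": 3, "transactionID_SIZE": "1", "law_nm": "혈액관리법"},
    {"transactionID_eventID": 7, "transactionID_SIZE": "1", "law_nm": "철도안전법"},
]
print(constant_columns(rows))
```

A constant column carries no information for modelling, so it is usually a candidate for removal.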

Reproduction

Analysis started: 2023-12-12 20:53:51.352639
Analysis finished: 2023-12-12 20:53:52.260199
Duration: 0.91 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
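
A report of this kind can be regenerated with ydata-profiling. The sketch below is an assumption about how that might look, not the configuration actually used here: the function name `build_report`, the file paths, the title, and the `encoding` default are all hypothetical, and the real settings are in the downloadable config.json.

```python
def build_report(csv_path, html_path, encoding="utf-8"):
    """Profile a CSV file and write the HTML report (illustrative sketch)."""
    # Imported lazily so this module still loads where the
    # third-party packages are not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path, encoding=encoding)  # encoding is an assumption
    report = ProfileReport(df, title="Dataset profile")  # hypothetical title
    report.to_file(html_path)
    return report
```

A call such as `build_report("transactions.csv", "report.html")` would then write the report, where both file names are placeholders.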

Variables

transactionID_sequenceID
Real number (ℝ)

Distinct: 6879
Distinct (%): 68.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 78585.157
Minimum: 6
Maximum: 110965
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 6
5-th percentile: 8298.85
Q1: 66290.75
Median: 93628
Q3: 100407.5
95-th percentile: 108665.1
Maximum: 110965
Range: 110959
Interquartile range (IQR): 34116.75
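
The last two rows are derived from the others: the range is maximum minus minimum, and the IQR is Q3 minus Q1. A short check using the figures reported for transactionID_sequenceID:

```python
# Values copied from the quantile statistics of transactionID_sequenceID.
minimum, maximum = 6, 110965
q1, q3 = 66290.75, 100407.5

value_range = maximum - minimum  # spread of the full data
iqr = q3 - q1                    # spread of the middle 50% of the data

print(value_range, iqr)
```

The IQR is about a third of the full range, which means the middle half of the values is packed into a comparatively narrow band.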

Descriptive statistics

Standard deviation: 33117.116
Coefficient of variation (CV): 0.42141693
Kurtosis: -0.13650919
Mean: 78585.157
Median Absolute Deviation (MAD): 9174
Skewness: -1.1882563
Sum: 7.8585157 × 10^8
Variance: 1.0967433 × 10^9
Monotonicity: Not monotonic
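
Several of these statistics are tied together by simple identities: CV is the standard deviation divided by the mean, variance is the standard deviation squared, and the sum is the mean times the number of observations. The sketch below recomputes them from the rounded figures in the report, so small differences in the last digits are expected.

```python
n_rows = 10_000      # number of observations
mean = 78585.157     # reported mean
std = 33117.116      # reported standard deviation

cv = std / mean          # coefficient of variation
variance = std ** 2      # variance
total = mean * n_rows    # sum of all values

print(f"CV={cv:.8f}  variance={variance:.7e}  sum={total:.7e}")
```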
Histogram with fixed size bins (bins=50)
Value                 Count   Frequency (%)
97726                 49      0.5%
97725                 33      0.3%
99184                 29      0.3%
99198                 26      0.3%
99187                 26      0.3%
99189                 23      0.2%
99197                 22      0.2%
99194                 22      0.2%
99193                 22      0.2%
99182                 20      0.2%
Other values (6869)   9728    97.3%

Value    Count   Frequency (%)
6        1       < 0.1%
10       3       < 0.1%
34       2       < 0.1%
388      1       < 0.1%
412      1       < 0.1%
423      1       < 0.1%
425      4       < 0.1%
433      3       < 0.1%
442      1       < 0.1%
452      3       < 0.1%

Value    Count   Frequency (%)
110965   1       < 0.1%
110964   1       < 0.1%
110959   1       < 0.1%
110958   1       < 0.1%
110957   1       < 0.1%
110953   1       < 0.1%
110950   1       < 0.1%
110943   1       < 0.1%
110941   2       < 0.1%
110933   2       < 0.1%
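
The frequency column in the first table for transactionID_sequenceID is each count divided by the 10000 observations, and the "Other values" row is whatever remains after the ten most common values. A small check, assuming percentages are rounded to one decimal place:

```python
n_rows = 10_000

# Ten most common values and their counts, copied from the report.
top_counts = {
    97726: 49, 97725: 33, 99184: 29, 99198: 26, 99187: 26,
    99189: 23, 99197: 22, 99194: 22, 99193: 22, 99182: 20,
}

def pct(count):
    """Format a count as a percentage of all rows, one decimal place."""
    return f"{100 * count / n_rows:.1f}%"

other = n_rows - sum(top_counts.values())  # rows not in the top ten
print(pct(top_counts[97726]), other, pct(other))
```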

transactionID_eventID
Real number (ℝ)

Distinct: 198
Distinct (%): 2.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 10.3237
Minimum: 1
Maximum: 380
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 1
Median: 3
Q3: 7
95-th percentile: 46
Maximum: 380
Range: 379
Interquartile range (IQR): 6

Descriptive statistics

Standard deviation: 27.261431
Coefficient of variation (CV): 2.6406648
Kurtosis: 58.749886
Mean: 10.3237
Median Absolute Deviation (MAD): 2
Skewness: 6.6353035
Sum: 103237
Variance: 743.18564
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value                Count   Frequency (%)
1                    3398    34.0%
2                    1566    15.7%
3                    875     8.8%
4                    505     5.1%
5                    431     4.3%
6                    386     3.9%
7                    364     3.6%
8                    251     2.5%
9                    226     2.3%
10                   178     1.8%
Other values (188)   1820    18.2%

Value   Count   Frequency (%)
1       3398    34.0%
2       1566    15.7%
3       875     8.8%
4       505     5.1%
5       431     4.3%
6       386     3.9%
7       364     3.6%
8       251     2.5%
9       226     2.3%
10      178     1.8%

Value   Count   Frequency (%)
380     1       < 0.1%
368     1       < 0.1%
366     1       < 0.1%
362     1       < 0.1%
351     2       < 0.1%
350     1       < 0.1%
336     1       < 0.1%
333     1       < 0.1%
327     2       < 0.1%
326     1       < 0.1%

transactionID_SIZE
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
1: 10000

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1
2nd row: 1
3rd row: 1
4th row: 1
5th row: 1

Common Values

Value   Count   Frequency (%)
1       10000   100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value   Count   Frequency (%)
1       10000   100.0%

law_nm
Text

Distinct: 2182
Distinct (%): 21.8%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 85
Median length: 47
Mean length: 13.1398
Min length: 2

Characters and Unicode

Total characters: 131398
Distinct characters: 411
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
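
To illustrate what a Unicode category is, the sketch below counts general categories with Python's standard `unicodedata` module. This is an approximation for illustration and not necessarily the library the report itself relies on. The two-letter codes map to the names used in this report: Lo is Other Letter, Zs is Space Separator, Nd is Decimal Number, Po is Other Punctuation, Ps is Open Punctuation, and Pe is Close Punctuation. The function name `category_counts` is hypothetical, and the middle dot is assumed to be U+00B7.

```python
import unicodedata
from collections import Counter


def category_counts(texts):
    """Count Unicode general categories over all characters in texts."""
    counts = Counter()
    for text in texts:
        for ch in text:
            counts[unicodedata.category(ch)] += 1
    return counts


# Two law names taken from the sample rows of this report.
samples = ["혈액관리법", "건축법 시행규칙"]
counts = category_counts(samples)
print(counts)

# Single characters from the remaining categories listed in the report.
for ch in ["1", "\u00b7", "(", ")"]:
    print(repr(ch), unicodedata.category(ch))
```

Hangul syllables fall under Lo, which is why Other Letter dominates the category counts for law_nm.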

Unique

Unique: 832
Unique (%): 8.3%
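
Distinct and Unique measure different things: Distinct counts the different values present, while Unique counts the values that occur exactly once. That reading is consistent with transactionID_SIZE, which has 1 distinct value and 0 unique values. A minimal sketch, with a hypothetical function name and a toy list built from law names in this report:

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique


# Toy list: "건축법" appears twice, so it is distinct but not unique.
names = ["건축법", "혈액관리법", "건축법", "철도안전법", "건축법 시행규칙"]
print(distinct_and_unique(names))

# A constant column has one distinct value and no unique values.
print(distinct_and_unique(["1"] * 5))
```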

Sample

1st row: 혈액관리법
2nd row: 건축법
3rd row: 철도안전법
4th row: 건축법 시행규칙
5th row: 공무원의 노동조합 설립 및 운영 등에 관한 법률 시행령
Value                 Count   Frequency (%)
관한                  3170    10.4%
법률                  2507    8.2%
시행령                2292    7.5%
시행규칙              1748    5.7%
                      1668    5.5%
등에                  626     2.0%
규칙                  293     1.0%
진흥법                261     0.9%
특별법                246     0.8%
당사자로              216     0.7%
Other values (2010)   17570   57.4%
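
The word table above counts tokens across all law names. A minimal sketch of that kind of count, assuming tokens are split on whitespace only (the report's actual tokenization may differ), using four law names from the sample rows:

```python
from collections import Counter

# Law names copied from the sample rows of this report.
names = [
    "건축법 시행규칙",
    "공무원의 노동조합 설립 및 운영 등에 관한 법률 시행령",
    "공유토지분할에 관한 특례법 시행령",
    "주택건설기준 등에 관한 규정",
]

# Split each name on whitespace and count every token.
words = Counter(word for name in names for word in name.split())
print(words.most_common(3))
```

Even in this tiny sample, connective words such as 관한 and suffix-like tokens such as 시행령 rise to the top, as they do in the full table.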

Most occurring characters

Value                Count   Frequency (%)
                     20597   15.7%
                     9347    7.1%
                     4764    3.6%
                     4736    3.6%
                     4306    3.3%
                     3604    2.7%
                     3418    2.6%
                     2601    2.0%
                     2530    1.9%
                     2429    1.8%
Other values (401)   73066   55.6%

Most occurring categories

Value               Count    Frequency (%)
Other Letter        110676   84.2%
Space Separator     20597    15.7%
Decimal Number      120      0.1%
Other Punctuation   3        < 0.1%
Open Punctuation    1        < 0.1%
Close Punctuation   1        < 0.1%

Most frequent character per category

Other Letter
Value                Count   Frequency (%)
                     9347    8.4%
                     4764    4.3%
                     4736    4.3%
                     4306    3.9%
                     3604    3.3%
                     3418    3.1%
                     2601    2.4%
                     2530    2.3%
                     2429    2.2%
                     2332    2.1%
Other values (387)   70609   63.8%

Decimal Number
Value   Count   Frequency (%)
1       41      34.2%
2       21      17.5%
0       14      11.7%
5       13      10.8%
9       10      8.3%
8       9       7.5%
7       4       3.3%
4       4       3.3%
3       2       1.7%
6       2       1.7%

Space Separator
Value   Count   Frequency (%)
        20597   100.0%

Other Punctuation
Value   Count   Frequency (%)
·       3       100.0%

Open Punctuation
Value   Count   Frequency (%)
(       1       100.0%

Close Punctuation
Value   Count   Frequency (%)
)       1       100.0%

Most occurring scripts

Value    Count    Frequency (%)
Hangul   110676   84.2%
Common   20722    15.8%

Most frequent character per script

Hangul
Value                Count   Frequency (%)
                     9347    8.4%
                     4764    4.3%
                     4736    4.3%
                     4306    3.9%
                     3604    3.3%
                     3418    3.1%
                     2601    2.4%
                     2530    2.3%
                     2429    2.2%
                     2332    2.1%
Other values (387)   70609   63.8%

Common
Value              Count   Frequency (%)
                   20597   99.4%
1                  41      0.2%
2                  21      0.1%
0                  14      0.1%
5                  13      0.1%
9                  10      < 0.1%
8                  9       < 0.1%
7                  4       < 0.1%
4                  4       < 0.1%
·                  3       < 0.1%
Other values (4)   6       < 0.1%

Most occurring blocks

Value         Count    Frequency (%)
Hangul        110083   83.8%
ASCII         20719    15.8%
Compat Jamo   593      0.5%
None          3        < 0.1%

Most frequent character per block

ASCII
Value              Count   Frequency (%)
                   20597   99.4%
1                  41      0.2%
2                  21      0.1%
0                  14      0.1%
5                  13      0.1%
9                  10      < 0.1%
8                  9       < 0.1%
7                  4       < 0.1%
4                  4       < 0.1%
3                  2       < 0.1%
Other values (3)   4       < 0.1%

Hangul
Value                Count   Frequency (%)
                     9347    8.5%
                     4764    4.3%
                     4736    4.3%
                     4306    3.9%
                     3604    3.3%
                     3418    3.1%
                     2601    2.4%
                     2530    2.3%
                     2429    2.2%
                     2332    2.1%
Other values (386)   70016   63.6%

Compat Jamo
Value   Count   Frequency (%)
        593     100.0%

None
Value   Count   Frequency (%)
·       3       100.0%

Interactions

[Figure: pairwise interaction plots for transactionID_sequenceID and transactionID_eventID]

Correlations

                           transactionID_sequenceID   transactionID_eventID
transactionID_sequenceID   1.000                      0.317
transactionID_eventID      0.317                      1.000
                           transactionID_sequenceID   transactionID_eventID
transactionID_sequenceID   1.000                      -0.285
transactionID_eventID      -0.285                     1.000
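
The two matrices give coefficients of opposite sign for the same pair of variables. The report text does not say which coefficient each matrix shows, so the sketch below only illustrates how that can happen: on the same data, a linear coefficient (Pearson) and a rank-based one (Spearman) can disagree when a few extreme values dominate. The data are made up, and the helper names are hypothetical.

```python
from statistics import mean


def pearson(xs, ys):
    """Pearson correlation: covariance scaled by both standard deviations."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    return cov / (var_x * var_y) ** 0.5


def ranks(values):
    """1-based ranks, with tied values sharing their average rank."""
    order = sorted(range(len(values)), key=values.__getitem__)
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result


def spearman(xs, ys):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(ranks(xs), ranks(ys))


# Made-up data: y falls steadily, then has one very large value.
xs = [1, 2, 3, 4, 5, 6]
ys = [6, 5, 4, 3, 2, 100]
print(round(pearson(xs, ys), 3), round(spearman(xs, ys), 3))
```

Here the single large value pulls the linear coefficient positive, while the ranks still show a mostly decreasing relationship. transactionID_eventID is heavily right-skewed (skewness 6.6), so a similar effect is plausible in this dataset, though the report alone does not confirm it.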

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

transactionID_sequenceID   transactionID_eventID   transactionID_SIZE   law_nm
5762799182321혈액관리법
493419727341건축법
302208735511철도안전법
324148851611건축법 시행규칙
5320698126281공무원의 노동조합 설립 및 운영 등에 관한 법률 시행령
12693558151공유토지분할에 관한 특례법 시행령
30707193201노인복지법 시행령
475199634231주택건설기준 등에 관한 규정
232777858131환경기술 및 환경산업 지원법
20245041101국가를 당사자로 하는 계약에 관한 법률 시행규칙
transactionID_sequenceID   transactionID_eventID   transactionID_SIZE   law_nm
412939351911도시 및 주거환경정비법
5473398144331공무원의 노동조합 설립 및 운영 등에 관한 법률
105782003351지방자치법
8005810946811건설기술 진흥법 시행령
67301102948151외국환거래법 시행령
419429386011근로기준법
6985110387171고용상 연령차별금지 및 고령자고용촉진에 관한 법률 시행령
7326510543811도로교통법 시행규칙
492489722861독점규제 및 공정거래에 관한 법률
90671630451건축법