Amino acid dipeptide frequency for Hubei lepidoptera virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
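The counting described above can be sketched in a few lines of Python. This is only an illustration, not the pipeline used to build the table: the function names `dipeptide_permille` and `label` are hypothetical, and the sketch assumes that dipeptides are overlapping pairs counted within each protein (never across protein boundaries) and that every residue outside the 20 standard amino acids is mapped to Xaa.

```python
from collections import Counter
from itertools import product

# One-letter to three-letter codes, in the same order as the table header.
THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}
NAMES = list(THREE.values()) + ["Xaa"]

# Frequency of each standard dipeptide if all 400 were equally likely, in per mille.
UNIFORM_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return the frequency (per mille) of all 441 dipeptides in the given sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything that is not a standard residue becomes Xaa.
        codes = [THREE.get(aa, "Xaa") for aa in seq.upper()]
        # Assumption: overlapping pairs, counted inside one protein only.
        counts.update(a + b for a, b in zip(codes, codes[1:]))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(NAMES, repeat=2)
    }


def label(value_permille):
    """Classify a frequency against the uniform expectation of 2.5 per mille."""
    if value_permille > UNIFORM_PERMILLE:
        return "over"
    if value_permille < UNIFORM_PERMILLE:
        return "under"
    return "expected"
```

With this labelling, a table value such as AlaIle (4.617‰) counts as overrepresented and AlaPro (0.486‰) as underrepresented.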

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
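As a small worked example of the unit conversion, the sketch below reads one table entry and returns the value and its error as plain fractions. The helper name `parse_cell` and the regular expression are illustrative assumptions based only on how the entries appear on this page (for example "AlaIle: 4.617 ± 1.621", optionally preceded by the repeated value).

```python
import re

# Matches e.g. "AlaIle: 4.617 ± 1.621"; a leading repeated value is tolerated
# because the pattern is searched for anywhere in the text.
CELL = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}): ([0-9.]+) ± ([0-9.]+)")


def parse_cell(text):
    """Return (first, second, frequency, error) with both numbers as fractions."""
    match = CELL.search(text)
    if match is None:
        raise ValueError(f"not a dipeptide entry: {text!r}")
    first, second, value, error = match.groups()
    # Per mille to fraction: multiply by 10^-3.
    return first, second, float(value) * 1e-3, float(error) * 1e-3
```

For instance, the AlaIle entry of 4.617‰ corresponds to a fraction of about 0.004617, or roughly 0.46%.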

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
1.458AlaAla: 1.458 ± 0.814
1.458AlaCys: 1.458 ± 0.568
2.187AlaAsp: 2.187 ± 0.532
2.187AlaGlu: 2.187 ± 1.159
1.944AlaPhe: 1.944 ± 1.204
1.458AlaGly: 1.458 ± 0.424
1.215AlaHis: 1.215 ± 0.745
4.617AlaIle: 4.617 ± 1.621
3.888AlaLys: 3.888 ± 0.862
3.645AlaLeu: 3.645 ± 1.027
0.972AlaMet: 0.972 ± 0.296
1.215AlaAsn: 1.215 ± 0.433
0.486AlaPro: 0.486 ± 0.298
0.243AlaGln: 0.243 ± 0.149
0.729AlaArg: 0.729 ± 1.487
3.402AlaSer: 3.402 ± 0.997
0.729AlaThr: 0.729 ± 0.352
3.159AlaVal: 3.159 ± 0.558
0.486AlaTrp: 0.486 ± 0.722
0.729AlaTyr: 0.729 ± 0.177
0.0AlaXaa: 0.0 ± 0.0
Cys
0.729CysAla: 0.729 ± 0.67
0.486CysCys: 0.486 ± 0.298
0.729CysAsp: 0.729 ± 0.177
0.972CysGlu: 0.972 ± 0.296
1.458CysPhe: 1.458 ± 0.653
0.0CysGly: 0.0 ± 0.0
0.243CysHis: 0.243 ± 0.23
2.43CysIle: 2.43 ± 0.289
1.944CysLys: 1.944 ± 0.566
2.673CysLeu: 2.673 ± 0.685
1.458CysMet: 1.458 ± 0.932
0.972CysAsn: 0.972 ± 0.283
1.701CysPro: 1.701 ± 0.928
1.215CysGln: 1.215 ± 0.433
0.729CysArg: 0.729 ± 0.177
3.402CysSer: 3.402 ± 1.135
2.673CysThr: 2.673 ± 0.851
0.729CysVal: 0.729 ± 0.352
0.0CysTrp: 0.0 ± 0.0
0.972CysTyr: 0.972 ± 0.296
0.0CysXaa: 0.0 ± 0.0
Asp
1.701AspAla: 1.701 ± 0.785
2.187AspCys: 2.187 ± 0.547
5.589AspAsp: 5.589 ± 1.329
4.374AspGlu: 4.374 ± 1.238
3.645AspPhe: 3.645 ± 0.814
2.43AspGly: 2.43 ± 0.568
1.215AspHis: 1.215 ± 0.745
2.916AspIle: 2.916 ± 0.71
4.131AspLys: 4.131 ± 1.101
6.075AspLeu: 6.075 ± 0.96
0.972AspMet: 0.972 ± 0.371
3.159AspAsn: 3.159 ± 0.916
1.701AspPro: 1.701 ± 0.722
1.458AspGln: 1.458 ± 0.577
2.673AspArg: 2.673 ± 0.652
6.804AspSer: 6.804 ± 1.563
2.187AspThr: 2.187 ± 0.547
1.458AspVal: 1.458 ± 0.568
0.972AspTrp: 0.972 ± 1.439
2.187AspTyr: 2.187 ± 0.469
0.0AspXaa: 0.0 ± 0.0
Glu
2.187GluAla: 2.187 ± 0.532
1.458GluCys: 1.458 ± 0.424
6.075GluAsp: 6.075 ± 1.686
6.318GluGlu: 6.318 ± 0.66
2.916GluPhe: 2.916 ± 1.461
4.131GluGly: 4.131 ± 1.586
1.215GluHis: 1.215 ± 0.484
5.103GluIle: 5.103 ± 0.583
4.86GluLys: 4.86 ± 1.231
5.589GluLeu: 5.589 ± 1.451
2.673GluMet: 2.673 ± 0.758
3.159GluAsn: 3.159 ± 0.211
1.458GluPro: 1.458 ± 0.704
1.215GluGln: 1.215 ± 0.433
2.673GluArg: 2.673 ± 0.33
6.561GluSer: 6.561 ± 1.996
2.916GluThr: 2.916 ± 1.1
4.131GluVal: 4.131 ± 0.736
0.729GluTrp: 0.729 ± 0.7
1.944GluTyr: 1.944 ± 1.0
0.0GluXaa: 0.0 ± 0.0
Phe
1.944PheAla: 1.944 ± 0.869
0.972PheCys: 0.972 ± 0.283
1.944PheAsp: 1.944 ± 1.204
2.187PheGlu: 2.187 ± 0.532
0.0PhePhe: 0.0 ± 0.0
2.43PheGly: 2.43 ± 1.165
0.729PheHis: 0.729 ± 0.691
3.402PheIle: 3.402 ± 1.242
2.187PheLys: 2.187 ± 0.727
5.346PheLeu: 5.346 ± 1.425
0.486PheMet: 0.486 ± 0.298
4.617PheAsn: 4.617 ± 0.259
2.916PhePro: 2.916 ± 0.36
1.215PheGln: 1.215 ± 0.284
1.215PheArg: 1.215 ± 0.433
4.131PheSer: 4.131 ± 1.438
2.916PheThr: 2.916 ± 0.694
1.458PheVal: 1.458 ± 0.577
0.486PheTrp: 0.486 ± 0.141
1.701PheTyr: 1.701 ± 0.412
0.0PheXaa: 0.0 ± 0.0
Gly
2.43GlyAla: 2.43 ± 0.638
1.215GlyCys: 1.215 ± 0.806
3.888GlyAsp: 3.888 ± 0.156
2.673GlyGlu: 2.673 ± 0.758
2.187GlyPhe: 2.187 ± 0.496
2.916GlyGly: 2.916 ± 0.423
0.486GlyHis: 0.486 ± 0.298
4.617GlyIle: 4.617 ± 1.592
3.159GlyLys: 3.159 ± 0.922
5.103GlyLeu: 5.103 ± 0.639
0.972GlyMet: 0.972 ± 0.283
1.701GlyAsn: 1.701 ± 1.346
0.729GlyPro: 0.729 ± 0.352
1.701GlyGln: 1.701 ± 0.608
1.944GlyArg: 1.944 ± 0.591
4.86GlySer: 4.86 ± 1.733
2.43GlyThr: 2.43 ± 0.568
2.673GlyVal: 2.673 ± 1.009
0.243GlyTrp: 0.243 ± 0.149
1.458GlyTyr: 1.458 ± 0.355
0.0GlyXaa: 0.0 ± 0.0
His
0.486HisAla: 0.486 ± 0.722
0.729HisCys: 0.729 ± 0.177
0.972HisAsp: 0.972 ± 0.602
1.944HisGlu: 1.944 ± 0.591
0.486HisPhe: 0.486 ± 0.298
0.729HisGly: 0.729 ± 0.177
0.243HisHis: 0.243 ± 0.149
0.243HisIle: 0.243 ± 0.149
1.215HisLys: 1.215 ± 0.433
2.43HisLeu: 2.43 ± 0.707
0.0HisMet: 0.0 ± 0.0
2.187HisAsn: 2.187 ± 0.807
0.0HisPro: 0.0 ± 0.0
0.729HisGln: 0.729 ± 0.67
0.729HisArg: 0.729 ± 0.447
1.458HisSer: 1.458 ± 0.424
1.458HisThr: 1.458 ± 0.494
0.972HisVal: 0.972 ± 0.283
0.486HisTrp: 0.486 ± 0.298
1.215HisTyr: 1.215 ± 0.433
0.0HisXaa: 0.0 ± 0.0
Ile
3.888IleAla: 3.888 ± 0.815
1.458IleCys: 1.458 ± 0.814
5.346IleAsp: 5.346 ± 1.261
4.86IleGlu: 4.86 ± 1.137
1.701IlePhe: 1.701 ± 0.464
4.86IleGly: 4.86 ± 1.529
2.187IleHis: 2.187 ± 0.496
7.047IleIle: 7.047 ± 3.296
5.832IleLys: 5.832 ± 1.102
8.019IleLeu: 8.019 ± 2.496
3.402IleMet: 3.402 ± 1.723
9.721IleAsn: 9.721 ± 4.625
3.645IlePro: 3.645 ± 1.146
2.673IleGln: 2.673 ± 0.685
4.617IleArg: 4.617 ± 2.021
6.318IleSer: 6.318 ± 0.392
3.645IleThr: 3.645 ± 0.636
3.645IleVal: 3.645 ± 0.997
1.215IleTrp: 1.215 ± 0.484
3.159IleTyr: 3.159 ± 0.734
0.0IleXaa: 0.0 ± 0.0
Lys
3.645LysAla: 3.645 ± 0.853
2.187LysCys: 2.187 ± 1.383
3.645LysAsp: 3.645 ± 0.853
4.86LysGlu: 4.86 ± 1.137
4.374LysPhe: 4.374 ± 0.64
2.673LysGly: 2.673 ± 0.685
0.486LysHis: 0.486 ± 0.141
7.047LysIle: 7.047 ± 0.984
7.533LysLys: 7.533 ± 1.742
8.748LysLeu: 8.748 ± 1.288
2.187LysMet: 2.187 ± 1.2
5.832LysAsn: 5.832 ± 1.222
1.215LysPro: 1.215 ± 0.284
1.701LysGln: 1.701 ± 0.722
2.43LysArg: 2.43 ± 0.867
8.262LysSer: 8.262 ± 1.821
4.374LysThr: 4.374 ± 1.483
4.617LysVal: 4.617 ± 1.439
0.729LysTrp: 0.729 ± 0.177
0.972LysTyr: 0.972 ± 0.283
0.0LysXaa: 0.0 ± 0.0
Leu
3.402LeuAla: 3.402 ± 0.997
2.43LeuCys: 2.43 ± 0.707
5.589LeuAsp: 5.589 ± 0.535
4.86LeuGlu: 4.86 ± 1.478
5.589LeuPhe: 5.589 ± 1.886
3.159LeuGly: 3.159 ± 0.823
2.916LeuHis: 2.916 ± 0.771
8.748LeuIle: 8.748 ± 2.092
8.262LeuLys: 8.262 ± 0.772
6.318LeuLeu: 6.318 ± 2.597
2.673LeuMet: 2.673 ± 0.758
7.776LeuAsn: 7.776 ± 0.682
4.131LeuPro: 4.131 ± 0.386
1.944LeuGln: 1.944 ± 0.869
4.86LeuArg: 4.86 ± 0.514
10.45LeuSer: 10.45 ± 2.696
5.832LeuThr: 5.832 ± 2.274
4.617LeuVal: 4.617 ± 3.067
1.215LeuTrp: 1.215 ± 0.284
2.673LeuTyr: 2.673 ± 0.625
0.0LeuXaa: 0.0 ± 0.0
Met
0.972MetAla: 0.972 ± 0.602
0.486MetCys: 0.486 ± 0.141
1.458MetAsp: 1.458 ± 0.577
2.187MetGlu: 2.187 ± 1.112
0.972MetPhe: 0.972 ± 0.283
1.215MetGly: 1.215 ± 0.484
0.243MetHis: 0.243 ± 0.149
2.673MetIle: 2.673 ± 2.034
1.944MetLys: 1.944 ± 0.407
1.458MetLeu: 1.458 ± 0.355
1.458MetMet: 1.458 ± 0.894
3.402MetAsn: 3.402 ± 0.875
1.215MetPro: 1.215 ± 0.284
1.458MetGln: 1.458 ± 0.355
1.215MetArg: 1.215 ± 0.566
1.944MetSer: 1.944 ± 1.193
2.187MetThr: 2.187 ± 1.159
0.486MetVal: 0.486 ± 0.298
0.486MetTrp: 0.486 ± 0.141
0.972MetTyr: 0.972 ± 0.602
0.0MetXaa: 0.0 ± 0.0
Asn
2.916AsnAla: 2.916 ± 1.06
1.701AsnCys: 1.701 ± 0.438
3.645AsnAsp: 3.645 ± 0.275
6.075AsnGlu: 6.075 ± 0.32
4.374AsnPhe: 4.374 ± 1.273
3.645AsnGly: 3.645 ± 0.926
0.729AsnHis: 0.729 ± 0.691
5.103AsnIle: 5.103 ± 1.167
5.832AsnLys: 5.832 ± 0.99
6.561AsnLeu: 6.561 ± 0.615
2.43AsnMet: 2.43 ± 0.291
5.346AsnAsn: 5.346 ± 1.532
2.43AsnPro: 2.43 ± 0.638
1.944AsnGln: 1.944 ± 0.452
3.159AsnArg: 3.159 ± 0.813
4.86AsnSer: 4.86 ± 1.653
3.159AsnThr: 3.159 ± 2.299
2.673AsnVal: 2.673 ± 1.31
0.972AsnTrp: 0.972 ± 0.296
2.673AsnTyr: 2.673 ± 0.851
0.0AsnXaa: 0.0 ± 0.0
Pro
0.486ProAla: 0.486 ± 0.141
0.729ProCys: 0.729 ± 0.447
1.944ProAsp: 1.944 ± 1.193
1.944ProGlu: 1.944 ± 1.418
1.458ProPhe: 1.458 ± 0.932
1.944ProGly: 1.944 ± 0.566
0.729ProHis: 0.729 ± 0.818
4.617ProIle: 4.617 ± 1.46
2.673ProLys: 2.673 ± 0.509
2.673ProLeu: 2.673 ± 1.009
0.972ProMet: 0.972 ± 1.439
1.701ProAsn: 1.701 ± 0.464
0.729ProPro: 0.729 ± 0.177
0.486ProGln: 0.486 ± 0.46
1.701ProArg: 1.701 ± 0.438
2.673ProSer: 2.673 ± 0.898
1.944ProThr: 1.944 ± 0.407
0.729ProVal: 0.729 ± 0.177
0.243ProTrp: 0.243 ± 0.149
2.187ProTyr: 2.187 ± 1.017
0.0ProXaa: 0.0 ± 0.0
Gln
0.486GlnAla: 0.486 ± 0.141
0.0GlnCys: 0.0 ± 0.0
1.215GlnAsp: 1.215 ± 0.433
2.187GlnGlu: 2.187 ± 0.469
1.215GlnPhe: 1.215 ± 0.484
2.187GlnGly: 2.187 ± 1.017
0.972GlnHis: 0.972 ± 0.296
1.944GlnIle: 1.944 ± 0.452
1.944GlnLys: 1.944 ± 0.566
1.458GlnLeu: 1.458 ± 0.355
0.243GlnMet: 0.243 ± 0.23
1.458GlnAsn: 1.458 ± 0.424
0.486GlnPro: 0.486 ± 0.141
0.486GlnGln: 0.486 ± 0.298
1.944GlnArg: 1.944 ± 0.591
2.187GlnSer: 2.187 ± 0.727
2.43GlnThr: 2.43 ± 0.867
1.701GlnVal: 1.701 ± 0.438
0.0GlnTrp: 0.0 ± 0.0
1.458GlnTyr: 1.458 ± 0.814
0.0GlnXaa: 0.0 ± 0.0
Arg
1.215ArgAla: 1.215 ± 0.566
0.972ArgCys: 0.972 ± 0.283
2.187ArgAsp: 2.187 ± 0.547
4.617ArgGlu: 4.617 ± 1.009
2.916ArgPhe: 2.916 ± 0.694
2.43ArgGly: 2.43 ± 0.658
0.729ArgHis: 0.729 ± 0.7
4.131ArgIle: 4.131 ± 0.878
3.888ArgLys: 3.888 ± 1.01
5.589ArgLeu: 5.589 ± 2.162
0.486ArgMet: 0.486 ± 0.722
1.215ArgAsn: 1.215 ± 0.433
0.972ArgPro: 0.972 ± 0.596
0.972ArgGln: 0.972 ± 0.296
2.187ArgArg: 2.187 ± 1.017
2.916ArgSer: 2.916 ± 0.887
1.701ArgThr: 1.701 ± 1.044
2.187ArgVal: 2.187 ± 0.547
0.243ArgTrp: 0.243 ± 0.149
2.43ArgTyr: 2.43 ± 1.165
0.0ArgXaa: 0.0 ± 0.0
Ser
3.645SerAla: 3.645 ± 1.658
2.673SerCys: 2.673 ± 1.84
4.86SerAsp: 4.86 ± 1.387
7.29SerGlu: 7.29 ± 0.869
2.43SerPhe: 2.43 ± 0.707
5.103SerGly: 5.103 ± 0.602
2.187SerHis: 2.187 ± 0.496
8.262SerIle: 8.262 ± 2.267
7.776SerLys: 7.776 ± 4.822
10.693SerLeu: 10.693 ± 2.569
1.215SerMet: 1.215 ± 0.566
5.346SerAsn: 5.346 ± 1.261
2.916SerPro: 2.916 ± 1.357
2.673SerGln: 2.673 ± 0.33
3.645SerArg: 3.645 ± 0.449
11.179SerSer: 11.179 ± 0.872
4.86SerThr: 4.86 ± 1.153
4.131SerVal: 4.131 ± 0.383
0.972SerTrp: 0.972 ± 0.296
3.645SerTyr: 3.645 ± 0.997
0.0SerXaa: 0.0 ± 0.0
Thr
1.215ThrAla: 1.215 ± 0.566
1.944ThrCys: 1.944 ± 0.956
2.43ThrAsp: 2.43 ± 0.289
3.159ThrGlu: 3.159 ± 1.299
1.215ThrPhe: 1.215 ± 0.433
2.187ThrGly: 2.187 ± 0.547
1.215ThrHis: 1.215 ± 0.745
4.86ThrIle: 4.86 ± 1.072
2.187ThrLys: 2.187 ± 1.383
7.776ThrLeu: 7.776 ± 2.467
2.43ThrMet: 2.43 ± 0.289
4.131ThrAsn: 4.131 ± 0.976
2.187ThrPro: 2.187 ± 0.32
1.701ThrGln: 1.701 ± 0.412
3.645ThrArg: 3.645 ± 1.171
3.888ThrSer: 3.888 ± 0.856
3.159ThrThr: 3.159 ± 1.174
2.916ThrVal: 2.916 ± 1.418
0.486ThrTrp: 0.486 ± 0.141
0.729ThrTyr: 0.729 ± 0.352
0.0ThrXaa: 0.0 ± 0.0
Val
1.701ValAla: 1.701 ± 0.438
1.458ValCys: 1.458 ± 1.322
2.187ValAsp: 2.187 ± 0.469
2.187ValGlu: 2.187 ± 0.32
1.701ValPhe: 1.701 ± 0.464
2.187ValGly: 2.187 ± 1.159
0.0ValHis: 0.0 ± 0.0
5.832ValIle: 5.832 ± 1.491
4.131ValLys: 4.131 ± 0.717
2.673ValLeu: 2.673 ± 0.509
1.215ValMet: 1.215 ± 0.284
4.131ValAsn: 4.131 ± 1.09
1.701ValPro: 1.701 ± 0.523
1.701ValGln: 1.701 ± 0.464
1.944ValArg: 1.944 ± 0.428
4.374ValSer: 4.374 ± 0.993
2.187ValThr: 2.187 ± 0.727
2.43ValVal: 2.43 ± 0.707
0.243ValTrp: 0.243 ± 0.149
2.43ValTyr: 2.43 ± 0.707
0.0ValXaa: 0.0 ± 0.0
Trp
0.972TrpAla: 0.972 ± 0.596
0.486TrpCys: 0.486 ± 0.298
0.972TrpAsp: 0.972 ± 0.283
0.729TrpGlu: 0.729 ± 0.447
0.243TrpPhe: 0.243 ± 0.149
0.0TrpGly: 0.0 ± 0.0
0.243TrpHis: 0.243 ± 0.149
0.729TrpIle: 0.729 ± 0.177
0.243TrpLys: 0.243 ± 0.23
1.215TrpLeu: 1.215 ± 0.566
0.486TrpMet: 0.486 ± 0.298
0.486TrpAsn: 0.486 ± 0.298
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.486TrpArg: 0.486 ± 0.141
1.215TrpSer: 1.215 ± 1.427
0.243TrpThr: 0.243 ± 0.23
0.972TrpVal: 0.972 ± 0.602
0.0TrpTrp: 0.0 ± 0.0
0.486TrpTyr: 0.486 ± 0.46
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.729TyrAla: 0.729 ± 0.447
0.486TyrCys: 0.486 ± 0.46
0.972TyrAsp: 0.972 ± 0.296
1.701TyrGlu: 1.701 ± 1.264
1.458TyrPhe: 1.458 ± 0.355
1.701TyrGly: 1.701 ± 0.464
0.729TyrHis: 0.729 ± 0.177
2.916TyrIle: 2.916 ± 0.887
3.888TyrLys: 3.888 ± 0.856
3.402TyrLeu: 3.402 ± 0.672
1.458TyrMet: 1.458 ± 0.894
2.916TyrAsn: 2.916 ± 0.36
1.944TyrPro: 1.944 ± 0.428
0.486TyrGln: 0.486 ± 0.141
1.458TyrArg: 1.458 ± 0.355
4.617TyrSer: 4.617 ± 0.508
2.43TyrThr: 2.43 ± 0.568
0.729TyrVal: 0.729 ± 0.177
0.0TyrTrp: 0.0 ± 0.0
1.701TyrTyr: 1.701 ± 0.621
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (4116 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
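A bootstrap at the protein level can be sketched as follows. This is a minimal illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each of 100 resamples, and the standard deviation of those values is taken as the error. The function names, the use of one-letter sequences, and the choice of the standard deviation as the error measure are all assumptions.

```python
import random
from collections import Counter
from statistics import pstdev


def dipeptide_permille(proteins):
    """Frequency (per mille) of each overlapping residue pair, by one-letter code."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Spread of one dipeptide frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the estimate is reproducible
    values = []
    for _ in range(rounds):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        values.append(dipeptide_permille(sample).get(pair, 0.0))
    # Assumption: the reported error is the standard deviation of the replicates.
    return pstdev(values)
```

With only 3 proteins in this proteome, each resample can contain the same protein several times, which is one reason the errors can be large relative to the values themselves.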

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski