Amino acid dipeptide frequency for Magnaporthe oryzae polymycovirus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
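As a minimal sketch of how such a frequency can be computed, the snippet below counts overlapping residue pairs within each protein and reports them in per mille. The function name `dipeptide_permille` and the toy sequences are illustrative assumptions, one-letter codes are used instead of the three-letter codes of the table, and the exact counting and normalization used by the database may differ.

```python
from collections import Counter


def dipeptide_permille(sequences):
    """Return overlapping dipeptide frequencies in per mille.

    Assumption: pairs are counted inside each protein only, so no
    dipeptide spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in sequences:
        seq = seq.upper()
        # Slide a window of width 2 along the sequence.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Made-up toy sequences, not taken from the proteome.
toy_proteins = ["MAAL", "AALV"]
freqs = dipeptide_permille(toy_proteins)
```

Here the two toy proteins yield six dipeptides in total, two of which are `AA`, so `freqs["AA"]` is one third of 1000.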

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
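The arithmetic behind the expected value and the unit conversion can be sketched as follows. The names `classify` and `to_fraction` are hypothetical, and using the uniform expectation as the cut-off between "red" and "blue" is an assumption about how the colouring is decided; the observed values are copied from the table.

```python
# Expected per-mille frequency if all dipeptides were equally likely.
uniform_permille = 1000.0 / (20 ** 2)    # 400 dipeptides -> 2.5 per mille
with_xaa_permille = 1000.0 / (21 ** 2)   # 441 dipeptides when Xaa is included


def to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def classify(value_permille, expected=uniform_permille):
    """Label a dipeptide relative to the uniform expectation (assumed cut-off)."""
    if value_permille > expected:
        return "over"    # would be shown in red
    if value_permille < expected:
        return "under"   # would be shown in blue
    return "as expected"


# A few observed values (per mille) taken from the table.
observed = {"AlaAla": 16.688, "AlaArg": 15.832, "CysVal": 2.567, "CysCys": 0.0}
labels = {pair: classify(value) for pair, value in observed.items()}
```

For example, AlaAla at 16.688‰ corresponds to a fraction of about 0.0167 and lies well above the 2.5‰ expectation, while CysCys at 0.0‰ lies below it.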

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 16.688 ± 3.052
AlaCys: 1.712 ± 1.093
AlaAsp: 7.702 ± 2.672
AlaGlu: 5.563 ± 0.878
AlaPhe: 3.423 ± 1.171
AlaGly: 7.702 ± 0.465
AlaHis: 2.995 ± 1.565
AlaIle: 4.279 ± 0.68
AlaLys: 3.851 ± 0.537
AlaLeu: 12.409 ± 3.412
AlaMet: 3.851 ± 1.129
AlaAsn: 3.851 ± 0.711
AlaPro: 2.139 ± 0.477
AlaGln: 3.851 ± 0.787
AlaArg: 15.832 ± 2.6
AlaSer: 6.846 ± 1.652
AlaThr: 7.274 ± 1.485
AlaVal: 8.13 ± 1.965
AlaTrp: 1.284 ± 0.521
AlaTyr: 2.139 ± 0.577
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.284 ± 0.253
CysCys: 0.0 ± 0.0
CysAsp: 0.856 ± 0.336
CysGlu: 0.428 ± 0.318
CysPhe: 0.428 ± 0.622
CysGly: 0.428 ± 0.405
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.0 ± 0.0
CysLeu: 0.856 ± 0.336
CysMet: 0.0 ± 0.0
CysAsn: 0.428 ± 0.318
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.856 ± 1.244
CysSer: 0.428 ± 0.335
CysThr: 0.856 ± 0.418
CysVal: 2.567 ± 1.216
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 8.558 ± 1.685
AspCys: 0.428 ± 0.318
AspAsp: 2.995 ± 1.933
AspGlu: 3.851 ± 1.63
AspPhe: 1.712 ± 0.586
AspGly: 3.851 ± 0.808
AspHis: 0.856 ± 0.81
AspIle: 2.139 ± 0.754
AspLys: 3.423 ± 0.226
AspLeu: 5.563 ± 0.789
AspMet: 1.284 ± 0.572
AspAsn: 0.856 ± 0.336
AspPro: 5.563 ± 0.853
AspGln: 1.284 ± 0.562
AspArg: 4.279 ± 1.46
AspSer: 4.707 ± 1.535
AspThr: 3.851 ± 0.516
AspVal: 7.702 ± 2.403
AspTrp: 0.0 ± 0.0
AspTyr: 0.856 ± 0.392
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.995 ± 1.468
GluCys: 0.0 ± 0.0
GluAsp: 0.856 ± 0.418
GluGlu: 1.284 ± 0.603
GluPhe: 0.428 ± 0.318
GluGly: 3.423 ± 2.197
GluHis: 1.712 ± 0.421
GluIle: 1.712 ± 0.386
GluLys: 2.139 ± 0.926
GluLeu: 4.279 ± 0.484
GluMet: 0.856 ± 0.336
GluAsn: 0.428 ± 0.335
GluPro: 2.995 ± 0.925
GluGln: 0.856 ± 0.418
GluArg: 3.423 ± 1.256
GluSer: 0.856 ± 0.336
GluThr: 3.423 ± 0.842
GluVal: 3.851 ± 1.513
GluTrp: 1.284 ± 0.622
GluTyr: 3.851 ± 1.128
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.423 ± 1.81
PheCys: 1.284 ± 0.603
PheAsp: 1.712 ± 1.169
PheGlu: 2.139 ± 0.604
PhePhe: 0.428 ± 0.405
PheGly: 3.423 ± 2.273
PheHis: 0.428 ± 0.405
PheIle: 1.284 ± 0.521
PheLys: 0.856 ± 0.392
PheLeu: 1.712 ± 0.897
PheMet: 0.428 ± 0.405
PheAsn: 0.856 ± 0.392
PhePro: 2.139 ± 0.96
PheGln: 0.0 ± 0.0
PheArg: 0.856 ± 0.392
PheSer: 2.567 ± 0.373
PheThr: 2.139 ± 0.512
PheVal: 2.567 ± 1.914
PheTrp: 0.0 ± 0.0
PheTyr: 0.428 ± 0.335
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.274 ± 0.799
GlyCys: 0.0 ± 0.0
GlyAsp: 5.135 ± 1.369
GlyGlu: 1.284 ± 0.603
GlyPhe: 1.284 ± 0.521
GlyGly: 6.846 ± 1.173
GlyHis: 1.284 ± 0.562
GlyIle: 2.995 ± 1.313
GlyLys: 2.995 ± 1.692
GlyLeu: 5.563 ± 2.037
GlyMet: 2.139 ± 1.192
GlyAsn: 1.284 ± 0.794
GlyPro: 6.846 ± 1.22
GlyGln: 1.284 ± 0.521
GlyArg: 6.846 ± 1.943
GlySer: 7.274 ± 1.209
GlyThr: 2.995 ± 0.806
GlyVal: 9.414 ± 1.99
GlyTrp: 0.0 ± 0.0
GlyTyr: 3.423 ± 0.951
GlyXaa: 0.0 ± 0.0
His
HisAla: 3.851 ± 1.556
HisCys: 0.0 ± 0.0
HisAsp: 2.567 ± 0.654
HisGlu: 2.139 ± 0.512
HisPhe: 0.856 ± 0.418
HisGly: 3.423 ± 0.359
HisHis: 0.856 ± 0.418
HisIle: 0.856 ± 0.67
HisLys: 0.428 ± 0.405
HisLeu: 0.428 ± 0.335
HisMet: 0.428 ± 0.318
HisAsn: 0.856 ± 0.336
HisPro: 1.712 ± 0.897
HisGln: 0.0 ± 0.0
HisArg: 0.856 ± 0.336
HisSer: 1.712 ± 0.706
HisThr: 1.284 ± 0.253
HisVal: 2.139 ± 0.828
HisTrp: 0.0 ± 0.0
HisTyr: 0.856 ± 0.418
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.418 ± 1.135
IleCys: 0.428 ± 0.622
IleAsp: 5.563 ± 1.009
IleGlu: 0.856 ± 0.635
IlePhe: 0.856 ± 0.336
IleGly: 2.995 ± 0.684
IleHis: 0.0 ± 0.0
IleIle: 1.712 ± 0.586
IleLys: 0.0 ± 0.0
IleLeu: 3.851 ± 0.204
IleMet: 1.284 ± 0.253
IleAsn: 1.712 ± 0.773
IlePro: 1.712 ± 1.169
IleGln: 0.856 ± 0.392
IleArg: 1.284 ± 0.851
IleSer: 2.139 ± 0.926
IleThr: 2.567 ± 0.629
IleVal: 2.139 ± 1.09
IleTrp: 0.0 ± 0.0
IleTyr: 0.428 ± 0.318
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.712 ± 0.731
LysCys: 0.428 ± 0.405
LysAsp: 1.284 ± 0.253
LysGlu: 0.856 ± 0.67
LysPhe: 0.856 ± 0.63
LysGly: 2.567 ± 1.026
LysHis: 0.856 ± 0.392
LysIle: 1.712 ± 0.783
LysLys: 0.856 ± 0.62
LysLeu: 5.563 ± 2.062
LysMet: 0.0 ± 0.0
LysAsn: 0.0 ± 0.0
LysPro: 2.567 ± 0.934
LysGln: 0.856 ± 0.62
LysArg: 1.712 ± 0.673
LysSer: 2.995 ± 0.928
LysThr: 1.712 ± 0.619
LysVal: 1.284 ± 0.602
LysTrp: 0.428 ± 0.335
LysTyr: 0.0 ± 0.0
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.414 ± 1.819
LeuCys: 0.428 ± 0.405
LeuAsp: 5.563 ± 2.503
LeuGlu: 3.423 ± 0.842
LeuPhe: 4.279 ± 1.053
LeuGly: 4.707 ± 1.107
LeuHis: 3.423 ± 1.285
LeuIle: 2.567 ± 0.904
LeuLys: 2.139 ± 1.82
LeuLeu: 7.274 ± 1.961
LeuMet: 2.139 ± 0.718
LeuAsn: 1.712 ± 0.706
LeuPro: 5.563 ± 1.33
LeuGln: 2.567 ± 0.74
LeuArg: 8.13 ± 1.982
LeuSer: 8.986 ± 1.771
LeuThr: 8.13 ± 1.455
LeuVal: 8.13 ± 1.509
LeuTrp: 0.428 ± 0.318
LeuTyr: 2.995 ± 1.146
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.567 ± 0.767
MetCys: 0.0 ± 0.0
MetAsp: 0.428 ± 0.335
MetGlu: 0.856 ± 0.336
MetPhe: 0.856 ± 0.336
MetGly: 0.428 ± 0.335
MetHis: 0.0 ± 0.0
MetIle: 0.428 ± 0.405
MetLys: 0.428 ± 0.335
MetLeu: 1.712 ± 0.586
MetMet: 0.0 ± 0.0
MetAsn: 1.284 ± 0.622
MetPro: 1.284 ± 0.603
MetGln: 0.428 ± 0.405
MetArg: 1.712 ± 0.897
MetSer: 2.995 ± 1.499
MetThr: 0.856 ± 0.392
MetVal: 2.995 ± 1.007
MetTrp: 0.0 ± 0.0
MetTyr: 2.139 ± 0.4
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.712 ± 1.178
AsnCys: 0.0 ± 0.0
AsnAsp: 0.856 ± 0.674
AsnGlu: 1.284 ± 0.606
AsnPhe: 0.428 ± 0.405
AsnGly: 1.284 ± 0.603
AsnHis: 1.284 ± 0.602
AsnIle: 0.856 ± 0.336
AsnLys: 0.856 ± 0.336
AsnLeu: 2.995 ± 0.524
AsnMet: 0.856 ± 0.579
AsnAsn: 0.856 ± 0.336
AsnPro: 1.284 ± 0.521
AsnGln: 0.428 ± 0.335
AsnArg: 2.567 ± 0.505
AsnSer: 2.139 ± 1.078
AsnThr: 1.712 ± 0.673
AsnVal: 1.284 ± 0.253
AsnTrp: 0.428 ± 0.405
AsnTyr: 0.856 ± 0.67
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.991 ± 1.664
ProCys: 0.0 ± 0.0
ProAsp: 2.567 ± 0.387
ProGlu: 3.851 ± 0.81
ProPhe: 0.856 ± 0.336
ProGly: 7.274 ± 0.852
ProHis: 1.712 ± 0.9
ProIle: 2.567 ± 0.939
ProLys: 1.284 ± 0.606
ProLeu: 8.558 ± 1.33
ProMet: 0.428 ± 0.405
ProAsn: 1.712 ± 0.586
ProPro: 5.563 ± 1.081
ProGln: 1.284 ± 0.521
ProArg: 4.707 ± 1.331
ProSer: 6.418 ± 1.28
ProThr: 2.139 ± 0.926
ProVal: 5.563 ± 0.996
ProTrp: 0.856 ± 0.336
ProTyr: 0.428 ± 0.318
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.284 ± 0.953
GlnCys: 1.284 ± 0.562
GlnAsp: 0.428 ± 0.405
GlnGlu: 1.284 ± 0.723
GlnPhe: 0.0 ± 0.0
GlnGly: 1.284 ± 0.521
GlnHis: 0.428 ± 0.622
GlnIle: 0.428 ± 0.318
GlnLys: 0.428 ± 0.405
GlnLeu: 3.423 ± 0.842
GlnMet: 0.0 ± 0.0
GlnAsn: 0.0 ± 0.0
GlnPro: 0.856 ± 0.336
GlnGln: 0.856 ± 0.635
GlnArg: 2.995 ± 0.954
GlnSer: 0.856 ± 0.674
GlnThr: 2.995 ± 0.616
GlnVal: 2.567 ± 0.636
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.712 ± 0.386
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 12.837 ± 1.75
ArgCys: 0.428 ± 0.622
ArgAsp: 2.567 ± 0.636
ArgGlu: 3.423 ± 0.359
ArgPhe: 2.567 ± 0.654
ArgGly: 5.991 ± 1.309
ArgHis: 1.712 ± 0.421
ArgIle: 3.423 ± 1.45
ArgLys: 0.0 ± 0.0
ArgLeu: 7.274 ± 1.022
ArgMet: 2.139 ± 0.436
ArgAsn: 1.712 ± 0.731
ArgPro: 5.991 ± 1.38
ArgGln: 2.995 ± 0.906
ArgArg: 4.707 ± 1.016
ArgSer: 7.274 ± 1.527
ArgThr: 2.995 ± 0.943
ArgVal: 6.846 ± 1.567
ArgTrp: 0.0 ± 0.0
ArgTyr: 2.567 ± 0.629
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 11.553 ± 1.961
SerCys: 0.428 ± 0.335
SerAsp: 6.846 ± 0.817
SerGlu: 2.567 ± 1.207
SerPhe: 1.712 ± 0.619
SerGly: 7.702 ± 2.084
SerHis: 2.567 ± 0.636
SerIle: 3.423 ± 0.951
SerLys: 1.712 ± 0.773
SerLeu: 7.274 ± 2.819
SerMet: 1.284 ± 0.562
SerAsn: 0.0 ± 0.0
SerPro: 4.707 ± 1.437
SerGln: 0.856 ± 0.336
SerArg: 4.707 ± 1.526
SerSer: 7.274 ± 2.296
SerThr: 3.851 ± 0.958
SerVal: 6.846 ± 0.485
SerTrp: 0.0 ± 0.0
SerTyr: 1.712 ± 0.673
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.991 ± 1.019
ThrCys: 1.284 ± 0.562
ThrAsp: 2.995 ± 0.524
ThrGlu: 2.139 ± 1.331
ThrPhe: 2.567 ± 0.977
ThrGly: 5.135 ± 1.677
ThrHis: 1.284 ± 1.005
ThrIle: 2.995 ± 0.512
ThrLys: 1.712 ± 1.178
ThrLeu: 3.423 ± 1.056
ThrMet: 0.856 ± 0.67
ThrAsn: 1.284 ± 0.521
ThrPro: 6.418 ± 2.196
ThrGln: 1.712 ± 0.731
ThrArg: 3.851 ± 1.146
ThrSer: 4.707 ± 1.023
ThrThr: 5.135 ± 1.445
ThrVal: 6.418 ± 1.679
ThrTrp: 1.284 ± 0.723
ThrTyr: 2.139 ± 0.512
ThrXaa: 0.0 ± 0.0
Val
ValAla: 13.693 ± 2.947
ValCys: 0.856 ± 0.392
ValAsp: 8.13 ± 2.354
ValGlu: 3.423 ± 1.199
ValPhe: 3.851 ± 1.513
ValGly: 3.423 ± 0.628
ValHis: 3.423 ± 1.386
ValIle: 2.567 ± 0.713
ValLys: 4.707 ± 0.397
ValLeu: 7.274 ± 1.154
ValMet: 2.567 ± 1.175
ValAsn: 3.851 ± 1.073
ValPro: 5.991 ± 2.033
ValGln: 0.856 ± 0.81
ValArg: 6.418 ± 1.187
ValSer: 5.135 ± 1.075
ValThr: 6.418 ± 1.777
ValVal: 9.414 ± 1.591
ValTrp: 0.428 ± 0.335
ValTyr: 0.856 ± 0.674
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.428 ± 0.335
TrpCys: 0.428 ± 0.318
TrpAsp: 1.284 ± 0.253
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.428 ± 0.405
TrpGly: 0.428 ± 0.318
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 0.428 ± 0.318
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.428 ± 0.405
TrpArg: 0.428 ± 0.335
TrpSer: 0.856 ± 0.62
TrpThr: 0.0 ± 0.0
TrpVal: 1.284 ± 0.606
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 4.279 ± 1.195
TyrCys: 0.0 ± 0.0
TyrAsp: 3.423 ± 1.285
TyrGlu: 0.0 ± 0.0
TyrPhe: 0.856 ± 0.674
TyrGly: 3.851 ± 1.104
TyrHis: 0.428 ± 0.318
TyrIle: 0.856 ± 0.418
TyrLys: 0.428 ± 0.335
TyrLeu: 2.139 ± 1.223
TyrMet: 0.428 ± 0.335
TyrAsn: 1.284 ± 0.759
TyrPro: 0.428 ± 0.335
TyrGln: 1.712 ± 0.849
TyrArg: 1.284 ± 0.794
TyrSer: 0.856 ± 0.418
TyrThr: 2.995 ± 0.524
TyrVal: 2.139 ± 0.436
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.428 ± 0.335
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (2338 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
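The page does not describe the bootstrap procedure in detail. One plausible reading, sketched below under that assumption, is to resample whole proteins with replacement 100 times, recompute the dipeptide frequency for each resample, and report the standard deviation of the estimates. The function names, the fixed seed and the toy sequences are illustrative, and one-letter codes are used.

```python
import random
import statistics
from collections import Counter


def pair_permille(sequences, pair):
    """Per-mille frequency of one dipeptide across a set of proteins."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Made-up toy sequences, not taken from the proteome.
toy = ["MAALAA", "GAVLK", "AAAAG", "MKLVR"]
err_aa = bootstrap_error(toy, "AA")
err_ww = bootstrap_error(toy, "WW")
```

A dipeptide that occurs in none of the proteins has a frequency of zero in every resample, so its estimated error is zero as well, which matches the "0.0 ± 0.0" entries in the table.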

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski