Amino acid dipepetide frequency for European mountain ash ringspot-associated virus (isolate Sorbus aucuparia) (EMARAV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.435AlaAla: 1.435 ± 1.047
1.722AlaCys: 1.722 ± 1.028
0.861AlaAsp: 0.861 ± 0.264
2.296AlaGlu: 2.296 ± 0.979
1.435AlaPhe: 1.435 ± 0.797
2.296AlaGly: 2.296 ± 0.877
0.574AlaHis: 0.574 ± 0.419
3.731AlaIle: 3.731 ± 0.942
3.731AlaLys: 3.731 ± 1.145
3.157AlaLeu: 3.157 ± 0.431
1.148AlaMet: 1.148 ± 0.899
2.296AlaAsn: 2.296 ± 0.815
0.861AlaPro: 0.861 ± 0.478
1.722AlaGln: 1.722 ± 0.524
0.574AlaArg: 0.574 ± 0.45
2.87AlaSer: 2.87 ± 0.97
2.296AlaThr: 2.296 ± 1.579
1.435AlaVal: 1.435 ± 1.397
0.0AlaTrp: 0.0 ± 0.0
2.009AlaTyr: 2.009 ± 0.908
0.0AlaXaa: 0.0 ± 0.0
Cys
1.148CysAla: 1.148 ± 0.899
0.0CysCys: 0.0 ± 0.0
1.435CysAsp: 1.435 ± 0.566
1.148CysGlu: 1.148 ± 0.643
0.861CysPhe: 0.861 ± 1.3
1.435CysGly: 1.435 ± 0.566
0.574CysHis: 0.574 ± 0.321
2.009CysIle: 2.009 ± 0.386
1.435CysLys: 1.435 ± 0.758
1.148CysLeu: 1.148 ± 0.638
0.0CysMet: 0.0 ± 0.0
2.009CysAsn: 2.009 ± 0.882
0.574CysPro: 0.574 ± 0.321
0.861CysGln: 0.861 ± 0.746
0.574CysArg: 0.574 ± 0.319
1.148CysSer: 1.148 ± 0.483
0.861CysThr: 0.861 ± 0.547
1.722CysVal: 1.722 ± 0.528
0.0CysTrp: 0.0 ± 0.0
0.574CysTyr: 0.574 ± 0.866
0.0CysXaa: 0.0 ± 0.0
Asp
2.009AspAla: 2.009 ± 1.319
1.435AspCys: 1.435 ± 1.064
5.454AspAsp: 5.454 ± 1.193
5.741AspGlu: 5.741 ± 1.345
4.305AspPhe: 4.305 ± 2.247
1.435AspGly: 1.435 ± 1.064
1.722AspHis: 1.722 ± 0.528
5.454AspIle: 5.454 ± 1.447
3.157AspLys: 3.157 ± 1.083
6.315AspLeu: 6.315 ± 2.134
1.722AspMet: 1.722 ± 1.09
3.157AspAsn: 3.157 ± 0.816
3.157AspPro: 3.157 ± 1.754
2.009AspGln: 2.009 ± 0.386
0.861AspArg: 0.861 ± 0.547
5.454AspSer: 5.454 ± 1.051
2.87AspThr: 2.87 ± 0.332
2.296AspVal: 2.296 ± 0.59
0.287AspTrp: 0.287 ± 0.513
4.018AspTyr: 4.018 ± 1.217
0.0AspXaa: 0.0 ± 0.0
Glu
2.009GluAla: 2.009 ± 1.116
1.148GluCys: 1.148 ± 0.295
2.583GluAsp: 2.583 ± 1.044
2.583GluGlu: 2.583 ± 1.022
3.157GluPhe: 3.157 ± 0.861
1.148GluGly: 1.148 ± 0.467
2.009GluHis: 2.009 ± 0.882
4.305GluIle: 4.305 ± 1.48
4.018GluLys: 4.018 ± 1.625
3.731GluLeu: 3.731 ± 1.596
2.009GluMet: 2.009 ± 0.676
2.583GluAsn: 2.583 ± 0.967
2.296GluPro: 2.296 ± 1.285
1.722GluGln: 1.722 ± 0.263
1.722GluArg: 1.722 ± 0.524
5.166GluSer: 5.166 ± 2.74
2.583GluThr: 2.583 ± 1.723
4.018GluVal: 4.018 ± 1.069
0.0GluTrp: 0.0 ± 0.0
4.305GluTyr: 4.305 ± 1.134
0.0GluXaa: 0.0 ± 0.0
Phe
0.861PheAla: 0.861 ± 0.438
1.435PheCys: 1.435 ± 0.758
2.296PheAsp: 2.296 ± 0.815
2.87PheGlu: 2.87 ± 0.755
1.435PhePhe: 1.435 ± 1.064
1.435PheGly: 1.435 ± 0.566
1.148PheHis: 1.148 ± 0.893
5.741PheIle: 5.741 ± 1.434
4.305PheLys: 4.305 ± 1.104
6.315PheLeu: 6.315 ± 2.03
0.287PheMet: 0.287 ± 0.352
3.444PheAsn: 3.444 ± 1.198
0.574PhePro: 0.574 ± 0.419
2.296PheGln: 2.296 ± 0.935
0.861PheArg: 0.861 ± 0.264
3.731PheSer: 3.731 ± 1.188
2.296PheThr: 2.296 ± 0.823
2.87PheVal: 2.87 ± 1.132
0.0PheTrp: 0.0 ± 0.0
2.87PheTyr: 2.87 ± 0.793
0.0PheXaa: 0.0 ± 0.0
Gly
0.574GlyAla: 0.574 ± 0.319
0.861GlyCys: 0.861 ± 0.746
3.444GlyAsp: 3.444 ± 0.664
1.435GlyGlu: 1.435 ± 1.047
3.157GlyPhe: 3.157 ± 0.825
1.148GlyGly: 1.148 ± 0.295
0.287GlyHis: 0.287 ± 0.159
2.583GlyIle: 2.583 ± 0.802
3.444GlyLys: 3.444 ± 0.866
2.87GlyLeu: 2.87 ± 0.708
0.861GlyMet: 0.861 ± 0.478
2.87GlyAsn: 2.87 ± 1.117
0.574GlyPro: 0.574 ± 0.866
0.287GlyGln: 0.287 ± 0.513
1.148GlyArg: 1.148 ± 0.638
3.157GlySer: 3.157 ± 0.632
1.435GlyThr: 1.435 ± 1.064
2.583GlyVal: 2.583 ± 0.43
0.287GlyTrp: 0.287 ± 0.159
2.296GlyTyr: 2.296 ± 0.431
0.0GlyXaa: 0.0 ± 0.0
His
1.435HisAla: 1.435 ± 0.394
0.861HisCys: 0.861 ± 0.478
2.009HisAsp: 2.009 ± 1.383
1.722HisGlu: 1.722 ± 0.634
0.287HisPhe: 0.287 ± 0.159
1.722HisGly: 1.722 ± 0.786
1.722HisHis: 1.722 ± 1.341
2.583HisIle: 2.583 ± 1.247
0.861HisLys: 0.861 ± 0.414
2.583HisLeu: 2.583 ± 0.566
1.435HisMet: 1.435 ± 0.35
1.148HisAsn: 1.148 ± 0.79
0.0HisPro: 0.0 ± 0.0
0.287HisGln: 0.287 ± 0.159
0.574HisArg: 0.574 ± 0.321
1.722HisSer: 1.722 ± 0.964
2.296HisThr: 2.296 ± 0.585
1.148HisVal: 1.148 ± 0.643
0.287HisTrp: 0.287 ± 0.159
2.009HisTyr: 2.009 ± 0.529
0.0HisXaa: 0.0 ± 0.0
Ile
5.454IleAla: 5.454 ± 1.405
3.157IleCys: 3.157 ± 1.006
5.741IleAsp: 5.741 ± 1.012
3.157IleGlu: 3.157 ± 0.788
2.87IlePhe: 2.87 ± 0.793
2.87IleGly: 2.87 ± 1.341
2.009IleHis: 2.009 ± 1.383
7.176IleIle: 7.176 ± 1.338
6.315IleLys: 6.315 ± 0.763
6.602IleLeu: 6.602 ± 0.916
2.583IleMet: 2.583 ± 1.022
6.028IleAsn: 6.028 ± 0.61
2.583IlePro: 2.583 ± 0.293
2.583IleGln: 2.583 ± 0.967
2.583IleArg: 2.583 ± 0.703
10.907IleSer: 10.907 ± 0.974
7.463IleThr: 7.463 ± 1.306
3.444IleVal: 3.444 ± 1.056
0.0IleTrp: 0.0 ± 0.0
4.018IleTyr: 4.018 ± 0.772
0.0IleXaa: 0.0 ± 0.0
Lys
2.009LysAla: 2.009 ± 0.976
0.861LysCys: 0.861 ± 0.478
4.305LysAsp: 4.305 ± 0.555
3.731LysGlu: 3.731 ± 0.946
4.018LysPhe: 4.018 ± 0.236
2.296LysGly: 2.296 ± 0.431
2.583LysHis: 2.583 ± 0.293
5.741LysIle: 5.741 ± 1.267
8.324LysLys: 8.324 ± 1.23
8.898LysLeu: 8.898 ± 3.086
2.296LysMet: 2.296 ± 1.846
4.305LysAsn: 4.305 ± 0.555
3.157LysPro: 3.157 ± 1.006
2.87LysGln: 2.87 ± 0.711
1.722LysArg: 1.722 ± 0.899
5.454LysSer: 5.454 ± 0.707
6.315LysThr: 6.315 ± 1.206
6.315LysVal: 6.315 ± 0.552
0.287LysTrp: 0.287 ± 0.159
5.166LysTyr: 5.166 ± 1.954
0.0LysXaa: 0.0 ± 0.0
Leu
4.305LeuAla: 4.305 ± 1.541
1.435LeuCys: 1.435 ± 0.797
6.028LeuAsp: 6.028 ± 1.081
5.166LeuGlu: 5.166 ± 1.935
4.018LeuPhe: 4.018 ± 1.058
3.444LeuGly: 3.444 ± 1.105
1.435LeuHis: 1.435 ± 0.35
7.463LeuIle: 7.463 ± 0.993
4.879LeuLys: 4.879 ± 0.637
9.759LeuLeu: 9.759 ± 2.204
2.87LeuMet: 2.87 ± 1.125
6.028LeuAsn: 6.028 ± 1.167
2.583LeuPro: 2.583 ± 0.566
3.444LeuGln: 3.444 ± 0.669
4.305LeuArg: 4.305 ± 0.924
8.037LeuSer: 8.037 ± 1.191
4.305LeuThr: 4.305 ± 0.666
7.176LeuVal: 7.176 ± 1.203
0.287LeuTrp: 0.287 ± 0.159
4.592LeuTyr: 4.592 ± 0.548
0.0LeuXaa: 0.0 ± 0.0
Met
1.148MetAla: 1.148 ± 0.483
0.0MetCys: 0.0 ± 0.0
1.722MetAsp: 1.722 ± 0.332
1.148MetGlu: 1.148 ± 1.125
1.435MetPhe: 1.435 ± 0.872
0.861MetGly: 0.861 ± 0.414
0.0MetHis: 0.0 ± 0.0
0.287MetIle: 0.287 ± 0.159
2.87MetLys: 2.87 ± 0.97
2.583MetLeu: 2.583 ± 0.768
0.861MetMet: 0.861 ± 0.264
2.009MetAsn: 2.009 ± 0.953
1.148MetPro: 1.148 ± 0.483
1.435MetGln: 1.435 ± 0.797
1.435MetArg: 1.435 ± 0.651
1.435MetSer: 1.435 ± 0.877
4.592MetThr: 4.592 ± 1.07
2.296MetVal: 2.296 ± 0.585
0.287MetTrp: 0.287 ± 0.159
1.148MetTyr: 1.148 ± 0.899
0.0MetXaa: 0.0 ± 0.0
Asn
1.435AsnAla: 1.435 ± 0.612
0.287AsnCys: 0.287 ± 0.433
2.583AsnAsp: 2.583 ± 0.616
4.305AsnGlu: 4.305 ± 0.765
2.296AsnPhe: 2.296 ± 0.823
1.722AsnGly: 1.722 ± 0.528
1.435AsnHis: 1.435 ± 0.797
6.602AsnIle: 6.602 ± 2.581
7.176AsnLys: 7.176 ± 0.371
7.176AsnLeu: 7.176 ± 1.096
1.148AsnMet: 1.148 ± 0.467
4.879AsnAsn: 4.879 ± 1.306
3.444AsnPro: 3.444 ± 1.367
1.148AsnGln: 1.148 ± 0.295
2.009AsnArg: 2.009 ± 0.435
6.602AsnSer: 6.602 ± 0.996
3.157AsnThr: 3.157 ± 0.54
3.157AsnVal: 3.157 ± 1.068
0.574AsnTrp: 0.574 ± 0.319
2.87AsnTyr: 2.87 ± 1.035
0.0AsnXaa: 0.0 ± 0.0
Pro
0.574ProAla: 0.574 ± 0.803
0.0ProCys: 0.0 ± 0.0
2.87ProAsp: 2.87 ± 0.332
2.009ProGlu: 2.009 ± 0.815
1.148ProPhe: 1.148 ± 0.483
2.009ProGly: 2.009 ± 0.908
0.287ProHis: 0.287 ± 0.159
3.731ProIle: 3.731 ± 0.748
2.583ProLys: 2.583 ± 1.057
1.148ProLeu: 1.148 ± 0.638
0.861ProMet: 0.861 ± 0.478
2.87ProAsn: 2.87 ± 0.332
0.574ProPro: 0.574 ± 0.321
1.435ProGln: 1.435 ± 0.57
1.435ProArg: 1.435 ± 0.562
3.157ProSer: 3.157 ± 0.659
0.861ProThr: 0.861 ± 0.264
1.148ProVal: 1.148 ± 1.434
0.0ProTrp: 0.0 ± 0.0
2.87ProTyr: 2.87 ± 1.122
0.0ProXaa: 0.0 ± 0.0
Gln
1.722GlnAla: 1.722 ± 0.524
0.0GlnCys: 0.0 ± 0.0
0.861GlnAsp: 0.861 ± 0.264
1.435GlnGlu: 1.435 ± 0.57
1.722GlnPhe: 1.722 ± 0.682
2.009GlnGly: 2.009 ± 0.537
1.435GlnHis: 1.435 ± 0.566
2.87GlnIle: 2.87 ± 0.591
2.009GlnLys: 2.009 ± 1.116
2.296GlnLeu: 2.296 ± 0.861
1.148GlnMet: 1.148 ± 0.977
1.148GlnAsn: 1.148 ± 0.988
0.287GlnPro: 0.287 ± 0.159
0.287GlnGln: 0.287 ± 0.433
2.583GlnArg: 2.583 ± 1.435
3.444GlnSer: 3.444 ± 1.048
2.009GlnThr: 2.009 ± 0.67
2.009GlnVal: 2.009 ± 0.813
0.0GlnTrp: 0.0 ± 0.0
2.009GlnTyr: 2.009 ± 1.116
0.0GlnXaa: 0.0 ± 0.0
Arg
0.574ArgAla: 0.574 ± 0.419
0.861ArgCys: 0.861 ± 0.264
1.435ArgAsp: 1.435 ± 0.394
2.296ArgGlu: 2.296 ± 0.675
2.009ArgPhe: 2.009 ± 0.386
0.861ArgGly: 0.861 ± 0.438
1.435ArgHis: 1.435 ± 0.874
2.009ArgIle: 2.009 ± 0.815
2.296ArgLys: 2.296 ± 0.827
3.731ArgLeu: 3.731 ± 1.699
0.287ArgMet: 0.287 ± 0.48
2.009ArgAsn: 2.009 ± 0.991
1.435ArgPro: 1.435 ± 0.758
2.009ArgGln: 2.009 ± 0.725
1.435ArgArg: 1.435 ± 0.394
3.731ArgSer: 3.731 ± 1.317
1.435ArgThr: 1.435 ± 0.562
1.435ArgVal: 1.435 ± 0.296
0.0ArgTrp: 0.0 ± 0.0
2.296ArgTyr: 2.296 ± 0.815
0.0ArgXaa: 0.0 ± 0.0
Ser
3.157SerAla: 3.157 ± 0.526
2.296SerCys: 2.296 ± 1.998
8.324SerAsp: 8.324 ± 1.449
3.444SerGlu: 3.444 ± 1.013
5.454SerPhe: 5.454 ± 1.334
3.157SerGly: 3.157 ± 0.431
3.157SerHis: 3.157 ± 0.729
7.75SerIle: 7.75 ± 1.367
6.315SerLys: 6.315 ± 0.795
8.611SerLeu: 8.611 ± 2.132
2.296SerMet: 2.296 ± 0.567
6.028SerAsn: 6.028 ± 0.814
3.444SerPro: 3.444 ± 1.198
2.583SerGln: 2.583 ± 0.673
3.444SerArg: 3.444 ± 1.174
5.741SerSer: 5.741 ± 0.712
6.315SerThr: 6.315 ± 1.097
3.157SerVal: 3.157 ± 1.193
0.574SerTrp: 0.574 ± 0.321
4.018SerTyr: 4.018 ± 1.519
0.0SerXaa: 0.0 ± 0.0
Thr
1.722ThrAla: 1.722 ± 1.404
1.148ThrCys: 1.148 ± 1.177
6.315ThrAsp: 6.315 ± 1.641
3.157ThrGlu: 3.157 ± 0.861
2.296ThrPhe: 2.296 ± 0.335
1.435ThrGly: 1.435 ± 0.394
1.435ThrHis: 1.435 ± 1.293
7.176ThrIle: 7.176 ± 1.256
5.741ThrLys: 5.741 ± 0.559
7.463ThrLeu: 7.463 ± 0.899
1.435ThrMet: 1.435 ± 0.768
2.87ThrAsn: 2.87 ± 1.122
2.296ThrPro: 2.296 ± 0.862
2.009ThrGln: 2.009 ± 0.42
1.435ThrArg: 1.435 ± 0.35
7.176ThrSer: 7.176 ± 0.913
3.731ThrThr: 3.731 ± 1.966
1.722ThrVal: 1.722 ± 0.524
0.574ThrTrp: 0.574 ± 0.681
3.157ThrTyr: 3.157 ± 0.901
0.0ThrXaa: 0.0 ± 0.0
Val
2.296ValAla: 2.296 ± 0.823
0.287ValCys: 0.287 ± 0.433
1.435ValAsp: 1.435 ± 0.296
2.87ValGlu: 2.87 ± 0.708
2.87ValPhe: 2.87 ± 0.28
1.435ValGly: 1.435 ± 0.296
2.87ValHis: 2.87 ± 1.035
3.731ValIle: 3.731 ± 0.702
4.879ValLys: 4.879 ± 1.828
2.583ValLeu: 2.583 ± 0.293
2.296ValMet: 2.296 ± 0.335
3.444ValAsn: 3.444 ± 1.198
1.148ValPro: 1.148 ± 0.79
1.435ValGln: 1.435 ± 0.296
3.444ValArg: 3.444 ± 0.637
4.879ValSer: 4.879 ± 2.522
4.305ValThr: 4.305 ± 1.249
2.296ValVal: 2.296 ± 1.201
0.287ValTrp: 0.287 ± 0.159
4.305ValTyr: 4.305 ± 1.725
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.287TrpCys: 0.287 ± 0.159
0.0TrpAsp: 0.0 ± 0.0
0.574TrpGlu: 0.574 ± 0.681
0.287TrpPhe: 0.287 ± 0.159
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.287TrpLys: 0.287 ± 0.159
0.861TrpLeu: 0.861 ± 0.478
0.287TrpMet: 0.287 ± 0.159
0.861TrpAsn: 0.861 ± 0.478
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
0.861TrpSer: 0.861 ± 0.264
0.287TrpThr: 0.287 ± 0.513
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.583TyrAla: 2.583 ± 0.961
1.435TyrCys: 1.435 ± 0.394
3.157TyrAsp: 3.157 ± 0.788
2.009TyrGlu: 2.009 ± 0.435
2.296TyrPhe: 2.296 ± 0.823
2.583TyrGly: 2.583 ± 0.616
0.574TyrHis: 0.574 ± 0.319
6.602TyrIle: 6.602 ± 2.134
6.028TyrLys: 6.028 ± 2.399
3.731TyrLeu: 3.731 ± 0.808
2.583TyrMet: 2.583 ± 0.364
4.018TyrAsn: 4.018 ± 0.761
1.435TyrPro: 1.435 ± 0.296
0.861TyrGln: 0.861 ± 0.547
1.435TyrArg: 1.435 ± 0.57
4.592TyrSer: 4.592 ± 1.305
4.879TyrThr: 4.879 ± 2.335
2.87TyrVal: 2.87 ± 2.714
0.861TyrTrp: 0.861 ± 0.478
4.305TyrTyr: 4.305 ± 0.481
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3485 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski