Amino acid dipeptide frequency for Moussa virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
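The counting described above can be sketched in Python as follows. This is only a minimal illustration, not the code used by Proteome-pI: the function name `dipeptide_per_mille`, the use of one-letter codes (with `X` standing in for Xaa), and the choice to count overlapping pairs within each protein but never across protein boundaries are all assumptions.

```python
from collections import Counter
from itertools import product

# One-letter codes of the 20 standard amino acids; "X" plays the role of Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs.

    Assumption: overlapping pairs are counted inside each protein,
    and no pair spans two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map every non-standard or unknown residue to X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


# Toy sequences for illustration only (not Moussa virus proteins).
freqs = dipeptide_per_mille(["MKLLSV", "MSLKB"])
```

With the two toy sequences there are nine dipeptides in total, so each observed pair contributes 1000/9 ‰, and the non-standard `B` is counted as Xaa.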

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
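As a small worked example of the unit conversion and of the comparison with the random expectation, consider the sketch below. The helper names are hypothetical, and the uniform 2.5‰ baseline is an assumption about what "expected" means; the exact threshold used for the red and blue marking is not stated on this page.

```python
# Uniform expectation if all 400 standard dipeptides were equally likely:
# 1/400 = 0.25% = 2.5 per mille (assumed baseline).
EXPECTED_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a per mille value relative to the assumed uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Two values taken from the table on this page.
ser_leu = 10.92   # SerLeu
ala_met = 0.546   # AlaMet

ser_leu_fraction = per_mille_to_fraction(ser_leu)
labels = (classify(ser_leu), classify(ala_met))
```

Under this assumed baseline, SerLeu (10.92‰, a fraction of about 0.0109) would count as overrepresented and AlaMet (0.546‰) as underrepresented.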

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± bootstrap error), grouped by the first residue. Within each group the second residue runs in the order: Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa.

Ala
AlaAla: 1.638 ± 1.564
AlaCys: 2.184 ± 0.544
AlaAsp: 3.003 ± 1.768
AlaGlu: 2.457 ± 0.836
AlaPhe: 1.638 ± 1.278
AlaGly: 2.73 ± 0.778
AlaHis: 1.365 ± 0.789
AlaIle: 4.368 ± 1.822
AlaLys: 1.638 ± 1.155
AlaLeu: 3.549 ± 0.861
AlaMet: 0.546 ± 0.316
AlaAsn: 1.638 ± 0.81
AlaPro: 2.184 ± 1.213
AlaGln: 1.365 ± 0.427
AlaArg: 1.911 ± 0.409
AlaSer: 2.73 ± 1.535
AlaThr: 3.003 ± 0.759
AlaVal: 2.184 ± 0.511
AlaTrp: 0.546 ± 0.398
AlaTyr: 1.638 ± 0.691
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.273 ± 0.398
CysCys: 0.273 ± 0.158
CysAsp: 1.638 ± 0.622
CysGlu: 1.092 ± 0.427
CysPhe: 0.273 ± 0.158
CysGly: 1.365 ± 0.521
CysHis: 0.273 ± 0.444
CysIle: 0.819 ± 0.411
CysLys: 0.273 ± 0.444
CysLeu: 3.549 ± 0.773
CysMet: 0.0 ± 0.0
CysAsn: 1.092 ± 0.28
CysPro: 0.819 ± 1.046
CysGln: 0.546 ± 0.398
CysArg: 0.273 ± 0.158
CysSer: 1.365 ± 0.476
CysThr: 1.092 ± 0.399
CysVal: 0.819 ± 0.474
CysTrp: 0.819 ± 0.311
CysTyr: 0.819 ± 0.311
CysXaa: 0.0 ± 0.0

Asp
AspAla: 1.638 ± 0.853
AspCys: 0.819 ± 0.622
AspAsp: 4.095 ± 1.401
AspGlu: 3.276 ± 0.344
AspPhe: 2.184 ± 1.199
AspGly: 3.549 ± 0.629
AspHis: 0.819 ± 0.305
AspIle: 3.822 ± 0.859
AspLys: 4.368 ± 0.998
AspLeu: 4.095 ± 1.266
AspMet: 1.092 ± 0.606
AspAsn: 4.095 ± 0.794
AspPro: 3.822 ± 1.375
AspGln: 2.457 ± 0.819
AspArg: 1.638 ± 0.612
AspSer: 2.457 ± 1.109
AspThr: 1.911 ± 0.363
AspVal: 1.911 ± 0.563
AspTrp: 0.546 ± 0.316
AspTyr: 2.457 ± 0.781
AspXaa: 0.0 ± 0.0

Glu
GluAla: 1.365 ± 0.602
GluCys: 1.365 ± 0.906
GluAsp: 4.641 ± 0.652
GluGlu: 7.371 ± 1.384
GluPhe: 2.184 ± 1.263
GluGly: 6.006 ± 1.535
GluHis: 3.003 ± 1.532
GluIle: 3.549 ± 0.974
GluLys: 4.914 ± 1.265
GluLeu: 5.733 ± 0.6
GluMet: 3.276 ± 0.786
GluAsn: 2.73 ± 0.742
GluPro: 1.092 ± 0.583
GluGln: 2.457 ± 1.045
GluArg: 3.276 ± 0.892
GluSer: 4.914 ± 1.489
GluThr: 4.914 ± 0.67
GluVal: 3.549 ± 1.571
GluTrp: 0.819 ± 0.311
GluTyr: 1.638 ± 0.612
GluXaa: 0.0 ± 0.0

Phe
PheAla: 1.911 ± 0.602
PheCys: 0.0 ± 0.0
PheAsp: 1.365 ± 0.58
PheGlu: 1.911 ± 0.604
PhePhe: 0.819 ± 0.405
PheGly: 1.638 ± 0.672
PheHis: 0.546 ± 0.29
PheIle: 1.638 ± 0.495
PheLys: 1.911 ± 1.088
PheLeu: 4.368 ± 0.923
PheMet: 1.092 ± 0.367
PheAsn: 1.092 ± 0.479
PhePro: 1.638 ± 0.61
PheGln: 1.092 ± 0.631
PheArg: 1.365 ± 0.583
PheSer: 3.003 ± 1.093
PheThr: 2.184 ± 0.783
PheVal: 3.549 ± 1.095
PheTrp: 1.092 ± 0.756
PheTyr: 1.638 ± 0.738
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 3.276 ± 1.382
GlyCys: 0.819 ± 0.526
GlyAsp: 2.73 ± 0.544
GlyGlu: 4.641 ± 0.576
GlyPhe: 2.73 ± 1.041
GlyGly: 4.368 ± 1.794
GlyHis: 1.911 ± 0.363
GlyIle: 3.276 ± 0.874
GlyLys: 5.187 ± 0.611
GlyLeu: 8.736 ± 1.084
GlyMet: 1.638 ± 0.296
GlyAsn: 2.73 ± 0.909
GlyPro: 1.638 ± 0.478
GlyGln: 3.003 ± 0.31
GlyArg: 3.276 ± 1.069
GlySer: 5.46 ± 0.678
GlyThr: 5.733 ± 2.305
GlyVal: 3.822 ± 0.714
GlyTrp: 1.365 ± 0.241
GlyTyr: 1.911 ± 0.899
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.819 ± 0.386
HisCys: 0.273 ± 0.398
HisAsp: 0.546 ± 0.316
HisGlu: 0.819 ± 0.305
HisPhe: 0.273 ± 0.158
HisGly: 0.546 ± 0.316
HisHis: 1.638 ± 0.772
HisIle: 0.273 ± 0.158
HisLys: 2.457 ± 1.029
HisLeu: 2.73 ± 1.003
HisMet: 0.546 ± 0.519
HisAsn: 1.638 ± 0.738
HisPro: 1.911 ± 0.552
HisGln: 1.092 ± 0.432
HisArg: 1.365 ± 0.583
HisSer: 2.184 ± 0.82
HisThr: 1.911 ± 0.863
HisVal: 1.911 ± 1.028
HisTrp: 0.546 ± 0.29
HisTyr: 0.546 ± 0.316
HisXaa: 0.0 ± 0.0

Ile
IleAla: 1.911 ± 0.843
IleCys: 1.638 ± 0.672
IleAsp: 2.73 ± 1.003
IleGlu: 4.641 ± 0.811
IlePhe: 2.184 ± 0.544
IleGly: 4.095 ± 0.831
IleHis: 0.546 ± 0.316
IleIle: 5.46 ± 1.064
IleLys: 5.187 ± 1.24
IleLeu: 4.368 ± 1.128
IleMet: 1.638 ± 0.59
IleAsn: 4.368 ± 1.463
IlePro: 3.549 ± 0.55
IleGln: 2.457 ± 0.691
IleArg: 3.276 ± 1.193
IleSer: 4.368 ± 1.737
IleThr: 3.549 ± 0.704
IleVal: 3.003 ± 1.795
IleTrp: 1.911 ± 0.863
IleTyr: 4.368 ± 1.288
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.095 ± 0.512
LysCys: 1.092 ± 0.479
LysAsp: 4.095 ± 1.168
LysGlu: 4.641 ± 1.422
LysPhe: 2.184 ± 0.775
LysGly: 4.095 ± 0.835
LysHis: 0.546 ± 0.698
LysIle: 5.46 ± 1.609
LysLys: 3.822 ± 1.523
LysLeu: 9.009 ± 0.895
LysMet: 1.365 ± 0.512
LysAsn: 3.003 ± 0.562
LysPro: 0.819 ± 0.971
LysGln: 1.638 ± 0.612
LysArg: 4.368 ± 0.88
LysSer: 5.733 ± 0.845
LysThr: 3.549 ± 0.605
LysVal: 3.549 ± 0.507
LysTrp: 1.365 ± 0.476
LysTyr: 1.092 ± 0.452
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 4.368 ± 1.252
LeuCys: 1.365 ± 0.58
LeuAsp: 3.549 ± 1.627
LeuGlu: 6.006 ± 0.682
LeuPhe: 3.003 ± 0.634
LeuGly: 6.279 ± 1.842
LeuHis: 2.73 ± 1.252
LeuIle: 7.917 ± 1.614
LeuLys: 6.552 ± 1.843
LeuLeu: 7.644 ± 1.921
LeuMet: 3.003 ± 1.05
LeuAsn: 5.46 ± 1.461
LeuPro: 4.095 ± 0.847
LeuGln: 2.184 ± 0.688
LeuArg: 7.371 ± 1.689
LeuSer: 7.917 ± 1.033
LeuThr: 6.552 ± 2.051
LeuVal: 6.006 ± 1.354
LeuTrp: 0.819 ± 0.838
LeuTyr: 4.095 ± 1.32
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.365 ± 0.521
MetCys: 0.546 ± 0.316
MetAsp: 1.092 ± 0.432
MetGlu: 2.184 ± 1.646
MetPhe: 2.184 ± 0.544
MetGly: 1.911 ± 0.438
MetHis: 0.546 ± 0.316
MetIle: 3.003 ± 0.643
MetLys: 1.911 ± 0.409
MetLeu: 1.092 ± 0.635
MetMet: 0.819 ± 0.305
MetAsn: 1.365 ± 0.473
MetPro: 0.819 ± 0.305
MetGln: 0.819 ± 0.622
MetArg: 2.457 ± 0.952
MetSer: 0.819 ± 0.526
MetThr: 1.911 ± 0.363
MetVal: 1.911 ± 0.657
MetTrp: 0.0 ± 0.0
MetTyr: 1.092 ± 0.631
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.184 ± 1.807
AsnCys: 0.546 ± 0.318
AsnAsp: 1.638 ± 0.458
AsnGlu: 2.73 ± 0.483
AsnPhe: 0.546 ± 0.29
AsnGly: 3.549 ± 0.995
AsnHis: 0.819 ± 0.305
AsnIle: 2.73 ± 1.041
AsnLys: 3.003 ± 0.718
AsnLeu: 4.914 ± 0.484
AsnMet: 1.638 ± 0.607
AsnAsn: 2.73 ± 1.257
AsnPro: 2.457 ± 1.168
AsnGln: 2.184 ± 0.797
AsnArg: 2.73 ± 1.257
AsnSer: 4.368 ± 1.054
AsnThr: 4.641 ± 0.482
AsnVal: 3.003 ± 0.517
AsnTrp: 1.092 ± 0.696
AsnTyr: 1.911 ± 0.748
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 1.911 ± 0.843
ProCys: 0.273 ± 0.158
ProAsp: 3.003 ± 0.743
ProGlu: 3.003 ± 0.683
ProPhe: 0.546 ± 0.29
ProGly: 2.457 ± 0.87
ProHis: 0.819 ± 0.311
ProIle: 3.003 ± 1.086
ProLys: 2.457 ± 1.058
ProLeu: 2.457 ± 0.815
ProMet: 0.819 ± 0.574
ProAsn: 1.911 ± 1.61
ProPro: 1.365 ± 0.906
ProGln: 1.638 ± 1.051
ProArg: 1.365 ± 0.757
ProSer: 4.095 ± 2.319
ProThr: 3.549 ± 0.825
ProVal: 2.457 ± 0.842
ProTrp: 0.546 ± 0.316
ProTyr: 2.457 ± 0.951
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 0.546 ± 0.398
GlnCys: 0.546 ± 0.316
GlnAsp: 1.365 ± 0.58
GlnGlu: 3.276 ± 1.074
GlnPhe: 0.819 ± 0.526
GlnGly: 3.822 ± 0.967
GlnHis: 1.092 ± 0.631
GlnIle: 2.184 ± 0.629
GlnLys: 1.638 ± 0.672
GlnLeu: 3.549 ± 0.687
GlnMet: 1.365 ± 0.423
GlnAsn: 1.638 ± 0.296
GlnPro: 1.638 ± 0.424
GlnGln: 1.365 ± 0.427
GlnArg: 2.457 ± 0.84
GlnSer: 1.911 ± 0.536
GlnThr: 1.365 ± 0.848
GlnVal: 1.638 ± 0.658
GlnTrp: 0.273 ± 0.158
GlnTyr: 0.546 ± 0.29
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.73 ± 1.067
ArgCys: 0.819 ± 0.305
ArgAsp: 3.822 ± 0.728
ArgGlu: 3.276 ± 1.226
ArgPhe: 3.822 ± 0.817
ArgGly: 3.276 ± 0.8
ArgHis: 1.638 ± 0.947
ArgIle: 2.73 ± 0.916
ArgLys: 3.549 ± 1.719
ArgLeu: 3.549 ± 1.056
ArgMet: 2.184 ± 0.734
ArgAsn: 1.092 ± 0.461
ArgPro: 2.184 ± 0.511
ArgGln: 1.092 ± 0.367
ArgArg: 3.003 ± 0.743
ArgSer: 2.457 ± 0.713
ArgThr: 4.641 ± 2.017
ArgVal: 3.549 ± 1.021
ArgTrp: 1.365 ± 0.789
ArgTyr: 2.184 ± 0.651
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 5.187 ± 2.475
SerCys: 1.092 ± 0.479
SerAsp: 4.095 ± 1.157
SerGlu: 5.733 ± 1.279
SerPhe: 2.184 ± 1.661
SerGly: 3.276 ± 1.729
SerHis: 1.365 ± 0.555
SerIle: 4.095 ± 1.187
SerLys: 5.733 ± 1.569
SerLeu: 10.92 ± 1.097
SerMet: 1.092 ± 0.631
SerAsn: 2.457 ± 0.815
SerPro: 2.184 ± 1.219
SerGln: 1.911 ± 0.933
SerArg: 4.914 ± 1.408
SerSer: 7.371 ± 1.733
SerThr: 4.914 ± 0.778
SerVal: 7.371 ± 0.412
SerTrp: 1.911 ± 0.86
SerTyr: 1.638 ± 0.415
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 2.73 ± 0.984
ThrCys: 0.819 ± 0.483
ThrAsp: 3.276 ± 0.815
ThrGlu: 5.46 ± 1.324
ThrPhe: 2.457 ± 1.131
ThrGly: 4.914 ± 0.762
ThrHis: 1.365 ± 0.241
ThrIle: 4.095 ± 0.777
ThrLys: 4.368 ± 0.823
ThrLeu: 9.828 ± 2.762
ThrMet: 2.457 ± 0.845
ThrAsn: 2.73 ± 1.011
ThrPro: 3.549 ± 0.685
ThrGln: 1.638 ± 0.478
ThrArg: 1.638 ± 1.406
ThrSer: 7.098 ± 1.465
ThrThr: 3.276 ± 0.963
ThrVal: 4.641 ± 1.489
ThrTrp: 0.546 ± 0.29
ThrTyr: 1.638 ± 0.622
ThrXaa: 0.0 ± 0.0

Val
ValAla: 2.73 ± 1.62
ValCys: 2.184 ± 0.797
ValAsp: 2.457 ± 0.836
ValGlu: 3.276 ± 0.792
ValPhe: 1.365 ± 0.789
ValGly: 4.914 ± 1.072
ValHis: 1.638 ± 0.981
ValIle: 3.549 ± 0.901
ValLys: 2.184 ± 0.847
ValLeu: 3.549 ± 0.546
ValMet: 1.365 ± 0.433
ValAsn: 4.368 ± 0.355
ValPro: 2.457 ± 1.252
ValGln: 1.911 ± 1.007
ValArg: 3.276 ± 0.841
ValSer: 7.371 ± 1.147
ValThr: 6.825 ± 1.222
ValVal: 4.095 ± 0.814
ValTrp: 1.638 ± 0.296
ValTyr: 1.638 ± 0.474
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.819 ± 0.305
TrpCys: 0.0 ± 0.0
TrpAsp: 0.546 ± 0.316
TrpGlu: 1.365 ± 0.789
TrpPhe: 1.638 ± 0.883
TrpGly: 2.184 ± 0.674
TrpHis: 0.819 ± 0.474
TrpIle: 1.365 ± 0.423
TrpLys: 1.365 ± 0.521
TrpLeu: 0.819 ± 0.521
TrpMet: 1.092 ± 0.695
TrpAsn: 1.092 ± 0.367
TrpPro: 0.546 ± 0.399
TrpGln: 0.0 ± 0.0
TrpArg: 0.546 ± 0.398
TrpSer: 1.911 ± 0.559
TrpThr: 0.273 ± 0.349
TrpVal: 1.092 ± 0.663
TrpTrp: 0.273 ± 0.158
TrpTyr: 0.546 ± 0.316
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 0.819 ± 0.494
TyrCys: 0.819 ± 0.311
TyrAsp: 1.638 ± 0.607
TyrGlu: 1.638 ± 0.474
TyrPhe: 0.819 ± 0.305
TyrGly: 3.276 ± 0.925
TyrHis: 0.546 ± 0.399
TyrIle: 1.911 ± 0.559
TyrLys: 3.003 ± 0.895
TyrLeu: 2.457 ± 0.689
TyrMet: 0.546 ± 0.318
TyrAsn: 2.184 ± 0.783
TyrPro: 1.365 ± 0.473
TyrGln: 2.184 ± 0.735
TyrArg: 2.457 ± 0.61
TyrSer: 1.911 ± 0.559
TyrThr: 2.73 ± 0.778
TyrVal: 2.73 ± 0.373
TyrTrp: 0.819 ± 0.311
TyrTyr: 0.273 ± 0.158
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 5 proteins (3664 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
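One plausible reading of bootstrapping at the protein level is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the standard deviation across replicates is taken as the error. The exact Proteome-pI procedure is not described on this page, so the function names, the fixed seed, and the use of the sample standard deviation are all assumptions.

```python
import random
import statistics
from collections import Counter


def dipeptide_frequency(proteins, dipeptide):
    """Frequency of one dipeptide in per mille over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within each protein (assumed counting scheme).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    values = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        values.append(dipeptide_frequency(sample, dipeptide))
    return statistics.stdev(values)


# Toy sequences for illustration only (not Moussa virus proteins).
proteins = ["MKLLSV", "MSLKLL", "GGSLAA"]
err = bootstrap_error(proteins, "SL")
```

With only 5 proteins in this proteome, each replicate can differ substantially from the original set, which may explain why some of the errors in the table are comparable in size to the values themselves.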

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski