Amino acid dipeptide frequency for Spiroplasma virus SpV1-C74 (SpV1)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides were equally likely in proteins, each would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility, dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
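The counting described above can be sketched in a few lines of Python. This is only a minimal illustration, not the database's own code: the function name `dipeptide_per_mille`, the one-letter alphabet, and the choice to count overlapping pairs within each protein (never across protein boundaries) are assumptions, and the exact counting convention used for the table is not stated on this page.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # 20 standard residues, one-letter codes
ALPHABET = STANDARD + "X"           # X plays the role of Xaa (assumed mapping)

# Expectation if all 400 standard dipeptides were equally likely, in per mille.
UNIFORM_EXPECTATION = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Map anything that is not a standard residue to X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs inside one protein (assumption: no cross-protein pairs).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found; sequences are too short")
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


if __name__ == "__main__":
    # Toy sequences, not taken from the SpV1 proteome.
    freqs = dipeptide_per_mille(["MKKLLN", "MKK"])
    over = sorted(d for d, f in freqs.items() if f > UNIFORM_EXPECTATION)
    print("overrepresented relative to 2.5 per mille:", over)
```

With real data, a dipeptide whose value lies above `UNIFORM_EXPECTATION` would correspond to a red cell and one below it to a blue cell, under the uniform assumption stated in the text.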

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
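As a small worked example of the unit conversion, the snippet below takes the LysLys value from the table (12.75‰) and expresses it as a fraction and as a ratio to the uniform expectation of 2.5‰. The helper name `per_mille_to_fraction` is hypothetical and used only for illustration.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


LYS_LYS_PER_MILLE = 12.75        # LysLys entry from the table
UNIFORM_PER_MILLE = 1000 / 400   # 2.5 per mille if all 400 dipeptides were equally likely

lys_lys_fraction = per_mille_to_fraction(LYS_LYS_PER_MILLE)
fold_over_uniform = LYS_LYS_PER_MILLE / UNIFORM_PER_MILLE

if __name__ == "__main__":
    print(f"LysLys fraction: {lys_lys_fraction:.5f}")
    print(f"LysLys relative to uniform expectation: {fold_over_uniform:.1f}x")
```

So LysLys accounts for roughly 1.3% of all dipeptides in this proteome, about five times what a uniform distribution would give.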

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.455 ± 0.549
AlaCys: 0.0 ± 0.0
AlaAsp: 0.455 ± 0.316
AlaGlu: 0.911 ± 0.536
AlaPhe: 2.277 ± 0.821
AlaGly: 0.455 ± 0.399
AlaHis: 0.455 ± 0.316
AlaIle: 4.098 ± 1.318
AlaLys: 3.643 ± 1.496
AlaLeu: 4.098 ± 0.653
AlaMet: 0.0 ± 0.0
AlaAsn: 3.188 ± 0.698
AlaPro: 1.366 ± 0.776
AlaGln: 0.911 ± 0.417
AlaArg: 0.455 ± 0.433
AlaSer: 1.821 ± 0.617
AlaThr: 0.0 ± 0.0
AlaVal: 2.277 ± 0.771
AlaTrp: 0.911 ± 0.697
AlaTyr: 0.911 ± 0.659
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.455 ± 0.399
CysGlu: 0.0 ± 0.0
CysPhe: 0.911 ± 1.048
CysGly: 0.455 ± 0.524
CysHis: 0.0 ± 0.0
CysIle: 0.455 ± 0.434
CysLys: 0.455 ± 0.5
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 0.911 ± 0.798
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.911 ± 0.866
CysTrp: 0.0 ± 0.0
CysTyr: 0.911 ± 0.674
CysXaa: 0.0 ± 0.0
Asp
AspAla: 0.911 ± 0.798
AspCys: 0.0 ± 0.0
AspAsp: 2.277 ± 1.185
AspGlu: 4.554 ± 1.29
AspPhe: 5.009 ± 1.873
AspGly: 0.455 ± 0.433
AspHis: 0.0 ± 0.0
AspIle: 3.188 ± 1.253
AspLys: 8.652 ± 1.611
AspLeu: 6.375 ± 1.313
AspMet: 0.455 ± 0.415
AspAsn: 3.188 ± 0.6
AspPro: 0.0 ± 0.0
AspGln: 0.0 ± 0.0
AspArg: 0.911 ± 0.417
AspSer: 1.366 ± 0.789
AspThr: 0.911 ± 0.548
AspVal: 2.277 ± 0.613
AspTrp: 1.366 ± 0.701
AspTyr: 2.277 ± 0.78
AspXaa: 0.0 ± 0.0
Glu
GluAla: 0.911 ± 0.461
GluCys: 0.455 ± 0.524
GluAsp: 1.366 ± 0.599
GluGlu: 2.277 ± 0.763
GluPhe: 3.643 ± 1.45
GluGly: 1.821 ± 1.176
GluHis: 0.0 ± 0.0
GluIle: 5.92 ± 1.47
GluLys: 3.643 ± 1.309
GluLeu: 4.098 ± 0.833
GluMet: 0.0 ± 0.0
GluAsn: 5.92 ± 2.947
GluPro: 0.455 ± 0.532
GluGln: 1.821 ± 1.075
GluArg: 3.188 ± 1.439
GluSer: 2.277 ± 0.979
GluThr: 2.277 ± 1.578
GluVal: 1.821 ± 0.875
GluTrp: 0.911 ± 0.417
GluTyr: 0.455 ± 0.512
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.098 ± 0.822
PheCys: 0.911 ± 0.766
PheAsp: 4.098 ± 1.216
PheGlu: 3.188 ± 1.471
PhePhe: 7.286 ± 2.017
PheGly: 3.643 ± 1.372
PheHis: 0.455 ± 0.399
PheIle: 11.384 ± 2.767
PheLys: 7.286 ± 1.985
PheLeu: 10.929 ± 2.501
PheMet: 2.277 ± 0.756
PheAsn: 6.831 ± 2.036
PhePro: 0.911 ± 0.461
PheGln: 1.366 ± 0.422
PheArg: 2.277 ± 0.787
PheSer: 6.375 ± 1.477
PheThr: 2.732 ± 1.373
PheVal: 5.464 ± 1.816
PheTrp: 0.911 ± 0.63
PheTyr: 5.009 ± 1.135
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 0.911 ± 0.478
GlyCys: 0.0 ± 0.0
GlyAsp: 0.911 ± 0.531
GlyGlu: 1.821 ± 0.634
GlyPhe: 3.643 ± 1.798
GlyGly: 1.821 ± 0.875
GlyHis: 0.0 ± 0.0
GlyIle: 5.464 ± 1.133
GlyLys: 5.464 ± 1.924
GlyLeu: 4.554 ± 0.877
GlyMet: 2.277 ± 1.003
GlyAsn: 0.911 ± 0.536
GlyPro: 0.0 ± 0.0
GlyGln: 0.455 ± 0.316
GlyArg: 0.455 ± 0.549
GlySer: 3.643 ± 0.931
GlyThr: 2.732 ± 0.996
GlyVal: 3.643 ± 1.05
GlyTrp: 0.455 ± 0.316
GlyTyr: 2.732 ± 1.131
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.455 ± 0.399
HisCys: 0.0 ± 0.0
HisAsp: 0.455 ± 0.433
HisGlu: 0.0 ± 0.0
HisPhe: 0.455 ± 0.433
HisGly: 0.911 ± 0.659
HisHis: 0.0 ± 0.0
HisIle: 0.911 ± 0.537
HisLys: 0.911 ± 0.536
HisLeu: 0.0 ± 0.0
HisMet: 0.0 ± 0.0
HisAsn: 0.455 ± 0.532
HisPro: 0.911 ± 0.548
HisGln: 0.911 ± 0.798
HisArg: 0.455 ± 0.433
HisSer: 1.821 ± 0.46
HisThr: 0.0 ± 0.0
HisVal: 0.0 ± 0.0
HisTrp: 0.0 ± 0.0
HisTyr: 0.455 ± 0.316
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.821 ± 1.038
IleCys: 0.455 ± 0.399
IleAsp: 5.009 ± 1.25
IleGlu: 3.643 ± 1.099
IlePhe: 9.563 ± 1.401
IleGly: 4.098 ± 1.213
IleHis: 0.911 ± 0.659
IleIle: 9.563 ± 2.359
IleLys: 7.741 ± 2.346
IleLeu: 8.652 ± 2.532
IleMet: 2.277 ± 0.884
IleAsn: 8.197 ± 1.962
IlePro: 3.643 ± 1.002
IleGln: 0.911 ± 0.827
IleArg: 2.732 ± 1.381
IleSer: 5.464 ± 1.006
IleThr: 4.554 ± 1.034
IleVal: 4.098 ± 1.539
IleTrp: 4.098 ± 0.994
IleTyr: 6.831 ± 2.005
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.643 ± 1.639
LysCys: 0.455 ± 0.524
LysAsp: 3.188 ± 1.234
LysGlu: 7.741 ± 1.231
LysPhe: 6.375 ± 0.946
LysGly: 3.643 ± 1.009
LysHis: 1.366 ± 0.645
LysIle: 9.563 ± 2.168
LysLys: 12.75 ± 2.848
LysLeu: 9.107 ± 1.074
LysMet: 4.098 ± 1.428
LysAsn: 12.295 ± 2.473
LysPro: 2.277 ± 1.185
LysGln: 4.554 ± 1.019
LysArg: 2.732 ± 0.68
LysSer: 2.732 ± 1.189
LysThr: 3.643 ± 1.168
LysVal: 3.188 ± 1.139
LysTrp: 2.732 ± 1.612
LysTyr: 7.286 ± 1.829
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 1.821 ± 0.753
LeuCys: 0.911 ± 0.633
LeuAsp: 1.821 ± 0.862
LeuGlu: 3.643 ± 1.13
LeuPhe: 10.474 ± 3.015
LeuGly: 3.643 ± 1.136
LeuHis: 0.911 ± 0.403
LeuIle: 8.652 ± 2.05
LeuLys: 10.018 ± 2.564
LeuLeu: 9.563 ± 2.027
LeuMet: 1.366 ± 1.017
LeuAsn: 7.286 ± 1.728
LeuPro: 1.366 ± 0.649
LeuGln: 5.464 ± 2.144
LeuArg: 1.366 ± 0.704
LeuSer: 8.652 ± 1.085
LeuThr: 7.741 ± 2.958
LeuVal: 6.831 ± 2.134
LeuTrp: 1.821 ± 1.116
LeuTyr: 6.375 ± 2.374
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.455 ± 0.549
MetCys: 0.0 ± 0.0
MetAsp: 0.911 ± 0.618
MetGlu: 0.0 ± 0.0
MetPhe: 1.821 ± 0.88
MetGly: 0.911 ± 0.417
MetHis: 0.0 ± 0.0
MetIle: 2.732 ± 1.026
MetLys: 2.732 ± 0.777
MetLeu: 1.821 ± 1.075
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 0.911 ± 0.77
MetGln: 0.911 ± 0.724
MetArg: 1.366 ± 0.599
MetSer: 0.455 ± 0.505
MetThr: 1.366 ± 0.92
MetVal: 3.188 ± 1.493
MetTrp: 0.911 ± 0.723
MetTyr: 1.366 ± 0.651
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.732 ± 0.92
AsnCys: 0.0 ± 0.0
AsnAsp: 5.464 ± 1.358
AsnGlu: 2.732 ± 1.111
AsnPhe: 8.197 ± 2.184
AsnGly: 4.098 ± 1.205
AsnHis: 0.911 ± 0.537
AsnIle: 7.286 ± 0.996
AsnLys: 7.286 ± 2.121
AsnLeu: 8.197 ± 2.185
AsnMet: 0.911 ± 0.734
AsnAsn: 10.018 ± 2.705
AsnPro: 1.821 ± 0.862
AsnGln: 2.277 ± 1.183
AsnArg: 2.277 ± 0.819
AsnSer: 4.098 ± 1.282
AsnThr: 3.643 ± 1.645
AsnVal: 4.554 ± 1.486
AsnTrp: 2.732 ± 1.196
AsnTyr: 6.375 ± 1.339
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.455 ± 0.316
ProCys: 0.0 ± 0.0
ProAsp: 0.911 ± 0.417
ProGlu: 0.911 ± 0.699
ProPhe: 2.277 ± 0.697
ProGly: 0.455 ± 0.316
ProHis: 0.455 ± 0.433
ProIle: 0.911 ± 0.63
ProLys: 1.821 ± 0.673
ProLeu: 2.732 ± 1.301
ProMet: 0.455 ± 0.5
ProAsn: 1.821 ± 0.615
ProPro: 0.911 ± 0.417
ProGln: 0.911 ± 0.403
ProArg: 1.366 ± 0.599
ProSer: 0.455 ± 0.399
ProThr: 1.821 ± 0.757
ProVal: 1.821 ± 0.753
ProTrp: 0.455 ± 0.433
ProTyr: 0.0 ± 0.0
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.366 ± 0.608
GlnCys: 0.455 ± 0.433
GlnAsp: 1.366 ± 0.599
GlnGlu: 1.366 ± 0.701
GlnPhe: 2.732 ± 0.857
GlnGly: 0.911 ± 0.631
GlnHis: 0.455 ± 0.399
GlnIle: 0.911 ± 0.51
GlnLys: 3.188 ± 0.8
GlnLeu: 2.277 ± 0.801
GlnMet: 0.911 ± 0.633
GlnAsn: 5.009 ± 1.561
GlnPro: 0.0 ± 0.0
GlnGln: 2.277 ± 1.255
GlnArg: 0.911 ± 0.674
GlnSer: 1.366 ± 0.608
GlnThr: 2.277 ± 0.752
GlnVal: 2.277 ± 1.061
GlnTrp: 0.911 ± 0.631
GlnTyr: 3.643 ± 1.031
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.911 ± 0.866
ArgCys: 0.0 ± 0.0
ArgAsp: 0.455 ± 0.433
ArgGlu: 0.455 ± 0.433
ArgPhe: 1.821 ± 0.753
ArgGly: 0.455 ± 0.491
ArgHis: 1.366 ± 0.941
ArgIle: 1.821 ± 0.797
ArgLys: 2.277 ± 0.811
ArgLeu: 2.277 ± 0.822
ArgMet: 1.821 ± 0.738
ArgAsn: 0.911 ± 0.601
ArgPro: 0.455 ± 0.316
ArgGln: 1.366 ± 0.729
ArgArg: 0.0 ± 0.0
ArgSer: 2.277 ± 0.747
ArgThr: 2.277 ± 0.737
ArgVal: 2.732 ± 0.86
ArgTrp: 1.366 ± 0.738
ArgTyr: 3.188 ± 1.141
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.277 ± 0.963
SerCys: 0.455 ± 0.5
SerAsp: 1.821 ± 0.482
SerGlu: 3.643 ± 1.357
SerPhe: 5.009 ± 1.647
SerGly: 3.188 ± 1.386
SerHis: 0.455 ± 0.549
SerIle: 3.643 ± 1.469
SerLys: 3.643 ± 1.054
SerLeu: 9.107 ± 1.163
SerMet: 1.366 ± 0.555
SerAsn: 4.554 ± 1.522
SerPro: 0.911 ± 0.536
SerGln: 1.821 ± 0.666
SerArg: 0.455 ± 0.399
SerSer: 3.643 ± 1.469
SerThr: 4.554 ± 2.225
SerVal: 5.009 ± 1.342
SerTrp: 0.0 ± 0.0
SerTyr: 3.188 ± 0.573
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.277 ± 1.051
ThrCys: 0.0 ± 0.0
ThrAsp: 4.098 ± 1.166
ThrGlu: 0.911 ± 0.737
ThrPhe: 1.366 ± 0.841
ThrGly: 4.554 ± 1.401
ThrHis: 0.0 ± 0.0
ThrIle: 3.643 ± 1.32
ThrLys: 4.554 ± 1.266
ThrLeu: 2.732 ± 0.83
ThrMet: 1.366 ± 0.506
ThrAsn: 3.188 ± 0.814
ThrPro: 1.821 ± 0.81
ThrGln: 1.366 ± 0.755
ThrArg: 2.277 ± 0.79
ThrSer: 2.732 ± 1.075
ThrThr: 1.821 ± 0.972
ThrVal: 3.643 ± 1.469
ThrTrp: 1.366 ± 0.977
ThrTyr: 1.366 ± 0.599
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.821 ± 0.767
ValCys: 0.0 ± 0.0
ValAsp: 2.277 ± 1.169
ValGlu: 2.732 ± 1.175
ValPhe: 7.286 ± 1.64
ValGly: 4.554 ± 1.105
ValHis: 0.911 ± 0.536
ValIle: 5.464 ± 1.82
ValLys: 9.107 ± 1.319
ValLeu: 4.098 ± 1.576
ValMet: 0.455 ± 0.434
ValAsn: 3.643 ± 1.152
ValPro: 1.366 ± 0.422
ValGln: 2.277 ± 0.922
ValArg: 2.277 ± 0.837
ValSer: 2.732 ± 1.116
ValThr: 0.0 ± 0.0
ValVal: 2.277 ± 0.885
ValTrp: 1.821 ± 0.875
ValTyr: 4.554 ± 1.113
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.455 ± 0.399
TrpCys: 0.0 ± 0.0
TrpAsp: 1.821 ± 0.624
TrpGlu: 1.366 ± 0.977
TrpPhe: 0.455 ± 0.512
TrpGly: 0.0 ± 0.0
TrpHis: 0.455 ± 0.316
TrpIle: 3.643 ± 1.274
TrpLys: 3.643 ± 1.045
TrpLeu: 4.098 ± 1.762
TrpMet: 0.455 ± 0.512
TrpAsn: 2.277 ± 0.635
TrpPro: 0.0 ± 0.0
TrpGln: 1.366 ± 0.701
TrpArg: 0.0 ± 0.0
TrpSer: 1.821 ± 0.717
TrpThr: 0.911 ± 0.461
TrpVal: 0.911 ± 0.723
TrpTrp: 1.821 ± 0.619
TrpTyr: 0.455 ± 0.316
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.911 ± 0.531
TyrCys: 1.366 ± 0.889
TyrAsp: 5.464 ± 1.027
TyrGlu: 1.366 ± 0.422
TyrPhe: 6.831 ± 1.614
TyrGly: 1.821 ± 0.955
TyrHis: 0.0 ± 0.0
TyrIle: 5.009 ± 0.883
TyrLys: 4.554 ± 1.66
TyrLeu: 5.009 ± 1.814
TyrMet: 0.911 ± 0.507
TyrAsn: 5.009 ± 1.237
TyrPro: 1.821 ± 0.617
TyrGln: 3.643 ± 1.02
TyrArg: 2.732 ± 1.045
TyrSer: 5.009 ± 1.309
TyrThr: 1.821 ± 0.875
TyrVal: 2.732 ± 0.871
TyrTrp: 1.366 ± 0.754
TyrTyr: 4.554 ± 2.096
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 13 proteins (2197 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
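A protein-level bootstrap of this kind can be sketched as follows. The page states only that 100 bootstrap replicates were drawn at the protein level, so everything else here is an assumption made for illustration: the function names, the fixed random seed, counting pairs within each protein, and reporting the sample standard deviation across replicates as the error.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide over overlapping pairs in all proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Spread of the frequency when whole proteins are resampled with replacement."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_replicates):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    # Assumption: the reported error is the standard deviation across replicates.
    return statistics.stdev(estimates)


if __name__ == "__main__":
    # Toy one-letter sequences, not taken from the SpV1 proteome.
    toy_proteome = ["MKKLLNKK", "MFILKN", "MKNNFI", "MSLLKK"]
    value = dipeptide_per_mille(toy_proteome, "KK")
    error = bootstrap_error(toy_proteome, "KK")
    print(f"KK: {value:.3f} ± {error:.3f}")
```

Under this scheme a dipeptide that occurs in no protein gets the same estimate in every replicate, which would be consistent with the "0.0 ± 0.0" entries in the table. With only 13 proteins, the resampled proteomes differ considerably from one another, so fairly wide errors are to be expected.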

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski