Amino acid dipepetide frequency for Human spumaretrovirus (SFVcpz(hu)) (Human foamy virus)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.434AlaAla: 3.434 ± 0.456
0.981AlaCys: 0.981 ± 0.281
1.226AlaAsp: 1.226 ± 0.321
5.396AlaGlu: 5.396 ± 1.233
1.226AlaPhe: 1.226 ± 0.391
3.434AlaGly: 3.434 ± 0.488
1.962AlaHis: 1.962 ± 0.578
0.981AlaIle: 0.981 ± 0.507
1.472AlaLys: 1.472 ± 0.729
5.396AlaLeu: 5.396 ± 0.781
1.962AlaMet: 1.962 ± 1.074
3.189AlaAsn: 3.189 ± 0.368
2.698AlaPro: 2.698 ± 1.237
1.226AlaGln: 1.226 ± 0.261
1.717AlaArg: 1.717 ± 0.741
4.906AlaSer: 4.906 ± 0.771
4.906AlaThr: 4.906 ± 0.847
3.924AlaVal: 3.924 ± 1.218
0.981AlaTrp: 0.981 ± 0.696
2.208AlaTyr: 2.208 ± 0.91
0.0AlaXaa: 0.0 ± 0.0
Cys
0.491CysAla: 0.491 ± 0.33
0.736CysCys: 0.736 ± 0.354
1.962CysAsp: 1.962 ± 0.899
0.0CysGlu: 0.0 ± 0.0
1.226CysPhe: 1.226 ± 0.505
0.736CysGly: 0.736 ± 0.48
0.0CysHis: 0.0 ± 0.0
0.981CysIle: 0.981 ± 0.442
0.981CysLys: 0.981 ± 0.443
1.962CysLeu: 1.962 ± 0.503
0.0CysMet: 0.0 ± 0.0
0.981CysAsn: 0.981 ± 0.471
0.736CysPro: 0.736 ± 0.516
1.472CysGln: 1.472 ± 0.633
1.226CysArg: 1.226 ± 0.417
1.717CysSer: 1.717 ± 0.792
0.491CysThr: 0.491 ± 0.33
0.736CysVal: 0.736 ± 0.37
0.245CysTrp: 0.245 ± 0.187
0.491CysTyr: 0.491 ± 0.439
0.0CysXaa: 0.0 ± 0.0
Asp
0.981AspAla: 0.981 ± 0.443
1.472AspCys: 1.472 ± 0.807
1.962AspAsp: 1.962 ± 0.629
2.698AspGlu: 2.698 ± 0.977
1.226AspPhe: 1.226 ± 0.603
2.208AspGly: 2.208 ± 0.447
1.226AspHis: 1.226 ± 0.321
3.434AspIle: 3.434 ± 0.973
1.472AspLys: 1.472 ± 0.442
3.924AspLeu: 3.924 ± 1.188
0.736AspMet: 0.736 ± 0.337
1.962AspAsn: 1.962 ± 0.784
4.415AspPro: 4.415 ± 1.551
2.698AspGln: 2.698 ± 0.839
1.717AspArg: 1.717 ± 0.724
3.924AspSer: 3.924 ± 1.273
2.208AspThr: 2.208 ± 0.764
3.189AspVal: 3.189 ± 0.329
1.226AspTrp: 1.226 ± 0.633
2.453AspTyr: 2.453 ± 0.592
0.0AspXaa: 0.0 ± 0.0
Glu
2.208GluAla: 2.208 ± 0.375
0.736GluCys: 0.736 ± 0.423
1.962GluAsp: 1.962 ± 0.51
7.113GluGlu: 7.113 ± 2.17
0.981GluPhe: 0.981 ± 0.539
4.906GluGly: 4.906 ± 1.002
0.981GluHis: 0.981 ± 0.427
5.396GluIle: 5.396 ± 0.787
2.943GluLys: 2.943 ± 1.077
4.17GluLeu: 4.17 ± 0.815
2.943GluMet: 2.943 ± 0.96
2.943GluAsn: 2.943 ± 0.802
3.189GluPro: 3.189 ± 1.028
2.698GluGln: 2.698 ± 1.054
4.66GluArg: 4.66 ± 0.536
4.415GluSer: 4.415 ± 0.827
2.698GluThr: 2.698 ± 0.692
4.17GluVal: 4.17 ± 1.241
0.491GluTrp: 0.491 ± 0.374
1.472GluTyr: 1.472 ± 0.521
0.0GluXaa: 0.0 ± 0.0
Phe
2.698PheAla: 2.698 ± 0.751
0.491PheCys: 0.491 ± 0.33
0.981PheAsp: 0.981 ± 0.547
0.736PheGlu: 0.736 ± 0.485
0.245PhePhe: 0.245 ± 0.187
1.717PheGly: 1.717 ± 0.42
1.472PheHis: 1.472 ± 0.749
2.453PheIle: 2.453 ± 0.772
1.472PheLys: 1.472 ± 0.354
2.208PheLeu: 2.208 ± 0.92
0.981PheMet: 0.981 ± 0.747
0.245PheAsn: 0.245 ± 0.187
1.226PhePro: 1.226 ± 0.677
0.736PheGln: 0.736 ± 0.457
0.491PheArg: 0.491 ± 0.439
1.717PheSer: 1.717 ± 0.614
2.453PheThr: 2.453 ± 0.704
1.472PheVal: 1.472 ± 0.551
1.226PheTrp: 1.226 ± 0.359
1.472PheTyr: 1.472 ± 0.354
0.0PheXaa: 0.0 ± 0.0
Gly
2.698GlyAla: 2.698 ± 0.922
1.226GlyCys: 1.226 ± 0.887
3.679GlyAsp: 3.679 ± 1.177
3.924GlyGlu: 3.924 ± 1.582
1.472GlyPhe: 1.472 ± 0.895
3.189GlyGly: 3.189 ± 1.923
2.698GlyHis: 2.698 ± 0.996
5.151GlyIle: 5.151 ± 0.629
3.189GlyLys: 3.189 ± 1.154
3.679GlyLeu: 3.679 ± 1.057
1.472GlyMet: 1.472 ± 0.633
3.189GlyAsn: 3.189 ± 1.131
5.151GlyPro: 5.151 ± 1.895
3.924GlyGln: 3.924 ± 1.424
3.924GlyArg: 3.924 ± 2.098
3.924GlySer: 3.924 ± 0.943
2.698GlyThr: 2.698 ± 0.499
2.453GlyVal: 2.453 ± 0.61
0.736GlyTrp: 0.736 ± 0.461
3.924GlyTyr: 3.924 ± 0.654
0.0GlyXaa: 0.0 ± 0.0
His
0.981HisAla: 0.981 ± 0.294
0.245HisCys: 0.245 ± 0.345
0.981HisAsp: 0.981 ± 0.532
0.736HisGlu: 0.736 ± 0.423
0.0HisPhe: 0.0 ± 0.0
2.208HisGly: 2.208 ± 1.191
0.491HisHis: 0.491 ± 0.472
1.472HisIle: 1.472 ± 0.26
1.472HisLys: 1.472 ± 0.644
3.189HisLeu: 3.189 ± 0.893
0.736HisMet: 0.736 ± 0.47
0.736HisAsn: 0.736 ± 0.375
2.943HisPro: 2.943 ± 0.529
0.981HisGln: 0.981 ± 0.389
2.208HisArg: 2.208 ± 1.012
1.226HisSer: 1.226 ± 0.452
0.981HisThr: 0.981 ± 0.314
1.472HisVal: 1.472 ± 0.518
0.491HisTrp: 0.491 ± 0.374
1.962HisTyr: 1.962 ± 0.579
0.0HisXaa: 0.0 ± 0.0
Ile
4.17IleAla: 4.17 ± 1.36
0.736IleCys: 0.736 ± 0.658
4.17IleAsp: 4.17 ± 0.384
2.943IleGlu: 2.943 ± 0.714
1.226IlePhe: 1.226 ± 0.505
2.943IleGly: 2.943 ± 0.561
1.717IleHis: 1.717 ± 0.446
2.208IleIle: 2.208 ± 0.92
3.924IleLys: 3.924 ± 1.208
8.339IleLeu: 8.339 ± 1.011
1.226IleMet: 1.226 ± 0.611
1.962IleAsn: 1.962 ± 0.933
5.887IlePro: 5.887 ± 1.354
4.66IleGln: 4.66 ± 0.753
3.679IleArg: 3.679 ± 0.783
1.962IleSer: 1.962 ± 0.705
2.943IleThr: 2.943 ± 1.335
3.189IleVal: 3.189 ± 1.009
0.981IleTrp: 0.981 ± 0.441
1.472IleTyr: 1.472 ± 0.518
0.0IleXaa: 0.0 ± 0.0
Lys
4.66LysAla: 4.66 ± 1.093
0.981LysCys: 0.981 ± 0.47
1.717LysAsp: 1.717 ± 0.826
5.151LysGlu: 5.151 ± 1.271
1.472LysPhe: 1.472 ± 0.354
2.208LysGly: 2.208 ± 0.955
2.208LysHis: 2.208 ± 0.853
3.679LysIle: 3.679 ± 1.32
3.189LysLys: 3.189 ± 1.56
3.434LysLeu: 3.434 ± 0.756
0.245LysMet: 0.245 ± 0.219
2.698LysAsn: 2.698 ± 1.064
4.17LysPro: 4.17 ± 1.781
3.924LysGln: 3.924 ± 1.441
3.924LysArg: 3.924 ± 1.401
3.189LysSer: 3.189 ± 0.777
3.189LysThr: 3.189 ± 1.676
4.906LysVal: 4.906 ± 1.611
1.226LysTrp: 1.226 ± 0.496
1.962LysTyr: 1.962 ± 0.788
0.0LysXaa: 0.0 ± 0.0
Leu
5.151LeuAla: 5.151 ± 0.745
2.208LeuCys: 2.208 ± 1.47
3.679LeuAsp: 3.679 ± 0.694
3.924LeuGlu: 3.924 ± 1.176
2.208LeuPhe: 2.208 ± 0.375
5.887LeuGly: 5.887 ± 0.903
0.981LeuHis: 0.981 ± 0.603
3.434LeuIle: 3.434 ± 1.144
6.377LeuLys: 6.377 ± 1.95
8.585LeuLeu: 8.585 ± 2.887
1.472LeuMet: 1.472 ± 1.095
5.641LeuAsn: 5.641 ± 1.191
6.132LeuPro: 6.132 ± 0.95
7.358LeuGln: 7.358 ± 0.686
4.415LeuArg: 4.415 ± 1.327
4.415LeuSer: 4.415 ± 0.463
6.623LeuThr: 6.623 ± 0.756
5.887LeuVal: 5.887 ± 1.896
1.472LeuTrp: 1.472 ± 0.754
3.679LeuTyr: 3.679 ± 0.667
0.0LeuXaa: 0.0 ± 0.0
Met
1.962MetAla: 1.962 ± 0.566
0.0MetCys: 0.0 ± 0.0
1.226MetAsp: 1.226 ± 0.508
1.717MetGlu: 1.717 ± 0.8
1.472MetPhe: 1.472 ± 0.467
1.717MetGly: 1.717 ± 0.606
1.226MetHis: 1.226 ± 0.355
1.962MetIle: 1.962 ± 0.789
0.491MetLys: 0.491 ± 0.243
0.981MetLeu: 0.981 ± 0.333
0.245MetMet: 0.245 ± 0.24
0.736MetAsn: 0.736 ± 0.374
0.736MetPro: 0.736 ± 0.571
0.736MetGln: 0.736 ± 0.516
0.981MetArg: 0.981 ± 0.498
1.962MetSer: 1.962 ± 1.028
1.962MetThr: 1.962 ± 0.297
1.226MetVal: 1.226 ± 0.362
0.0MetTrp: 0.0 ± 0.0
0.245MetTyr: 0.245 ± 0.187
0.0MetXaa: 0.0 ± 0.0
Asn
3.189AsnAla: 3.189 ± 0.628
0.981AsnCys: 0.981 ± 0.427
1.472AsnAsp: 1.472 ± 0.464
3.189AsnGlu: 3.189 ± 0.963
1.717AsnPhe: 1.717 ± 0.892
1.226AsnGly: 1.226 ± 0.47
0.736AsnHis: 0.736 ± 0.215
3.189AsnIle: 3.189 ± 0.511
2.208AsnLys: 2.208 ± 0.92
4.17AsnLeu: 4.17 ± 0.89
0.981AsnMet: 0.981 ± 0.415
2.698AsnAsn: 2.698 ± 1.185
3.924AsnPro: 3.924 ± 1.214
3.924AsnGln: 3.924 ± 1.731
1.226AsnArg: 1.226 ± 0.43
1.717AsnSer: 1.717 ± 0.614
3.924AsnThr: 3.924 ± 0.902
2.943AsnVal: 2.943 ± 0.646
0.491AsnTrp: 0.491 ± 0.439
1.962AsnTyr: 1.962 ± 0.572
0.0AsnXaa: 0.0 ± 0.0
Pro
3.924ProAla: 3.924 ± 0.833
0.245ProCys: 0.245 ± 0.219
2.208ProAsp: 2.208 ± 0.778
4.906ProGlu: 4.906 ± 1.151
2.943ProPhe: 2.943 ± 1.069
3.189ProGly: 3.189 ± 1.629
2.453ProHis: 2.453 ± 0.563
3.679ProIle: 3.679 ± 0.913
3.434ProLys: 3.434 ± 1.324
6.868ProLeu: 6.868 ± 0.95
1.472ProMet: 1.472 ± 0.573
2.698ProAsn: 2.698 ± 0.384
4.906ProPro: 4.906 ± 1.631
3.189ProGln: 3.189 ± 0.772
7.113ProArg: 7.113 ± 3.016
7.113ProSer: 7.113 ± 1.563
2.943ProThr: 2.943 ± 0.641
5.396ProVal: 5.396 ± 0.776
1.226ProTrp: 1.226 ± 0.266
2.698ProTyr: 2.698 ± 0.454
0.0ProXaa: 0.0 ± 0.0
Gln
2.208GlnAla: 2.208 ± 0.646
0.491GlnCys: 0.491 ± 0.243
3.679GlnAsp: 3.679 ± 1.414
3.434GlnGlu: 3.434 ± 0.58
1.717GlnPhe: 1.717 ± 1.104
6.377GlnGly: 6.377 ± 0.903
2.208GlnHis: 2.208 ± 0.323
3.189GlnIle: 3.189 ± 0.893
4.17GlnLys: 4.17 ± 0.905
5.641GlnLeu: 5.641 ± 1.595
1.226GlnMet: 1.226 ± 0.619
2.943GlnAsn: 2.943 ± 0.73
2.453GlnPro: 2.453 ± 0.71
3.189GlnGln: 3.189 ± 1.418
3.434GlnArg: 3.434 ± 1.659
3.189GlnSer: 3.189 ± 1.138
2.698GlnThr: 2.698 ± 0.877
2.698GlnVal: 2.698 ± 0.874
0.736GlnTrp: 0.736 ± 0.423
1.962GlnTyr: 1.962 ± 0.517
0.0GlnXaa: 0.0 ± 0.0
Arg
2.698ArgAla: 2.698 ± 1.553
0.981ArgCys: 0.981 ± 0.256
2.208ArgAsp: 2.208 ± 1.047
3.434ArgGlu: 3.434 ± 0.749
1.717ArgPhe: 1.717 ± 0.611
5.151ArgGly: 5.151 ± 2.78
0.736ArgHis: 0.736 ± 0.495
1.962ArgIle: 1.962 ± 0.611
3.434ArgLys: 3.434 ± 0.886
3.924ArgLeu: 3.924 ± 0.434
1.226ArgMet: 1.226 ± 0.359
2.453ArgAsn: 2.453 ± 1.305
6.132ArgPro: 6.132 ± 2.232
2.453ArgGln: 2.453 ± 0.276
4.66ArgArg: 4.66 ± 1.68
3.189ArgSer: 3.189 ± 1.12
2.698ArgThr: 2.698 ± 1.115
1.472ArgVal: 1.472 ± 0.275
2.208ArgTrp: 2.208 ± 0.756
1.962ArgTyr: 1.962 ± 0.606
0.0ArgXaa: 0.0 ± 0.0
Ser
3.434SerAla: 3.434 ± 0.934
1.226SerCys: 1.226 ± 0.8
4.415SerAsp: 4.415 ± 0.934
2.208SerGlu: 2.208 ± 0.81
2.943SerPhe: 2.943 ± 0.565
7.849SerGly: 7.849 ± 2.813
1.226SerHis: 1.226 ± 0.321
4.415SerIle: 4.415 ± 0.607
2.698SerLys: 2.698 ± 1.256
6.132SerLeu: 6.132 ± 0.914
0.981SerMet: 0.981 ± 0.539
2.698SerAsn: 2.698 ± 0.802
5.641SerPro: 5.641 ± 1.069
3.434SerGln: 3.434 ± 0.822
2.943SerArg: 2.943 ± 1.174
6.623SerSer: 6.623 ± 2.047
3.924SerThr: 3.924 ± 0.759
2.208SerVal: 2.208 ± 0.847
0.736SerTrp: 0.736 ± 0.304
2.698SerTyr: 2.698 ± 0.762
0.0SerXaa: 0.0 ± 0.0
Thr
3.679ThrAla: 3.679 ± 1.124
1.962ThrCys: 1.962 ± 0.862
2.208ThrAsp: 2.208 ± 0.471
1.717ThrGlu: 1.717 ± 0.695
1.226ThrPhe: 1.226 ± 0.612
2.208ThrGly: 2.208 ± 0.584
0.981ThrHis: 0.981 ± 0.603
3.679ThrIle: 3.679 ± 0.792
5.396ThrLys: 5.396 ± 1.401
4.415ThrLeu: 4.415 ± 0.462
1.717ThrMet: 1.717 ± 0.467
1.472ThrAsn: 1.472 ± 0.456
6.132ThrPro: 6.132 ± 1.431
2.943ThrGln: 2.943 ± 0.716
3.434ThrArg: 3.434 ± 1.011
6.623ThrSer: 6.623 ± 0.9
2.698ThrThr: 2.698 ± 0.506
2.698ThrVal: 2.698 ± 0.709
2.698ThrTrp: 2.698 ± 0.925
2.453ThrTyr: 2.453 ± 0.478
0.0ThrXaa: 0.0 ± 0.0
Val
1.717ValAla: 1.717 ± 0.379
0.245ValCys: 0.245 ± 0.219
2.698ValAsp: 2.698 ± 0.925
2.698ValGlu: 2.698 ± 0.817
1.472ValPhe: 1.472 ± 0.678
2.453ValGly: 2.453 ± 0.83
0.491ValHis: 0.491 ± 0.294
5.641ValIle: 5.641 ± 1.043
5.396ValLys: 5.396 ± 1.26
6.868ValLeu: 6.868 ± 1.349
0.736ValMet: 0.736 ± 0.516
3.434ValAsn: 3.434 ± 0.838
2.698ValPro: 2.698 ± 0.309
2.943ValGln: 2.943 ± 0.937
0.981ValArg: 0.981 ± 0.507
3.434ValSer: 3.434 ± 0.683
5.887ValThr: 5.887 ± 1.763
4.415ValVal: 4.415 ± 1.112
1.472ValTrp: 1.472 ± 0.708
2.698ValTyr: 2.698 ± 0.876
0.0ValXaa: 0.0 ± 0.0
Trp
0.736TrpAla: 0.736 ± 0.374
0.491TrpCys: 0.491 ± 0.388
1.717TrpAsp: 1.717 ± 0.305
1.717TrpGlu: 1.717 ± 0.589
0.0TrpPhe: 0.0 ± 0.0
0.245TrpGly: 0.245 ± 0.24
0.491TrpHis: 0.491 ± 0.349
1.472TrpIle: 1.472 ± 0.55
1.962TrpLys: 1.962 ± 0.682
1.717TrpLeu: 1.717 ± 0.363
0.736TrpMet: 0.736 ± 0.423
1.717TrpAsn: 1.717 ± 0.57
1.226TrpPro: 1.226 ± 0.355
0.736TrpGln: 0.736 ± 0.441
0.491TrpArg: 0.491 ± 0.243
1.472TrpSer: 1.472 ± 0.471
1.226TrpThr: 1.226 ± 0.355
0.736TrpVal: 0.736 ± 0.495
0.981TrpTrp: 0.981 ± 0.403
0.491TrpTyr: 0.491 ± 0.243
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.208TyrAla: 2.208 ± 1.062
0.736TyrCys: 0.736 ± 0.359
0.981TyrAsp: 0.981 ± 0.539
3.434TyrGlu: 3.434 ± 0.672
0.245TyrPhe: 0.245 ± 0.187
2.453TyrGly: 2.453 ± 1.082
1.226TyrHis: 1.226 ± 0.326
1.962TyrIle: 1.962 ± 0.722
2.453TyrLys: 2.453 ± 0.922
3.924TyrLeu: 3.924 ± 1.288
0.245TyrMet: 0.245 ± 0.219
1.472TyrAsn: 1.472 ± 0.551
2.208TyrPro: 2.208 ± 0.738
4.415TyrGln: 4.415 ± 1.506
1.472TyrArg: 1.472 ± 0.749
1.717TyrSer: 1.717 ± 0.51
3.189TyrThr: 3.189 ± 0.441
3.189TyrVal: 3.189 ± 1.378
0.736TyrTrp: 0.736 ± 0.561
2.943TyrTyr: 2.943 ± 1.037
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (4078 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski