Amino acid dipepetide frequency for Squirrel monkey simian foamy virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.649AlaAla: 2.649 ± 1.047
1.177AlaCys: 1.177 ± 0.411
1.766AlaAsp: 1.766 ± 0.716
1.471AlaGlu: 1.471 ± 0.641
1.471AlaPhe: 1.471 ± 0.483
2.649AlaGly: 2.649 ± 1.835
1.471AlaHis: 1.471 ± 0.507
2.06AlaIle: 2.06 ± 0.328
1.766AlaLys: 1.766 ± 0.796
5.592AlaLeu: 5.592 ± 1.322
0.883AlaMet: 0.883 ± 0.614
0.589AlaAsn: 0.589 ± 0.458
1.766AlaPro: 1.766 ± 1.159
2.354AlaGln: 2.354 ± 0.792
4.12AlaArg: 4.12 ± 1.224
4.709AlaSer: 4.709 ± 1.541
2.943AlaThr: 2.943 ± 0.743
4.12AlaVal: 4.12 ± 0.748
1.177AlaTrp: 1.177 ± 0.543
2.354AlaTyr: 2.354 ± 0.317
0.0AlaXaa: 0.0 ± 0.0
Cys
0.589CysAla: 0.589 ± 0.378
0.294CysCys: 0.294 ± 0.359
0.294CysAsp: 0.294 ± 0.324
1.177CysGlu: 1.177 ± 0.763
0.589CysPhe: 0.589 ± 0.504
1.766CysGly: 1.766 ± 1.03
0.294CysHis: 0.294 ± 0.252
0.589CysIle: 0.589 ± 0.353
1.766CysLys: 1.766 ± 0.75
2.06CysLeu: 2.06 ± 0.827
0.589CysMet: 0.589 ± 0.353
1.177CysAsn: 1.177 ± 0.753
0.589CysPro: 0.589 ± 0.504
0.0CysGln: 0.0 ± 0.0
1.471CysArg: 1.471 ± 0.382
0.883CysSer: 0.883 ± 0.532
0.589CysThr: 0.589 ± 0.545
0.294CysVal: 0.294 ± 0.359
0.589CysTrp: 0.589 ± 0.409
0.589CysTyr: 0.589 ± 0.504
0.0CysXaa: 0.0 ± 0.0
Asp
1.177AspAla: 1.177 ± 0.625
1.177AspCys: 1.177 ± 0.763
2.354AspAsp: 2.354 ± 0.705
2.06AspGlu: 2.06 ± 1.253
1.471AspPhe: 1.471 ± 0.367
1.766AspGly: 1.766 ± 0.736
0.589AspHis: 0.589 ± 0.247
3.237AspIle: 3.237 ± 0.858
2.649AspLys: 2.649 ± 0.867
5.297AspLeu: 5.297 ± 1.046
0.589AspMet: 0.589 ± 0.504
1.766AspAsn: 1.766 ± 0.716
3.237AspPro: 3.237 ± 1.122
2.354AspGln: 2.354 ± 0.599
1.177AspArg: 1.177 ± 0.68
2.649AspSer: 2.649 ± 0.632
1.177AspThr: 1.177 ± 0.655
2.649AspVal: 2.649 ± 0.845
0.883AspTrp: 0.883 ± 0.532
4.12AspTyr: 4.12 ± 0.923
0.0AspXaa: 0.0 ± 0.0
Glu
1.766GluAla: 1.766 ± 1.453
1.177GluCys: 1.177 ± 0.553
2.354GluAsp: 2.354 ± 0.686
5.592GluGlu: 5.592 ± 1.474
1.766GluPhe: 1.766 ± 0.818
5.297GluGly: 5.297 ± 0.756
0.883GluHis: 0.883 ± 0.511
5.297GluIle: 5.297 ± 1.754
4.12GluLys: 4.12 ± 1.196
5.003GluLeu: 5.003 ± 1.251
1.177GluMet: 1.177 ± 0.27
2.943GluAsn: 2.943 ± 1.128
1.766GluPro: 1.766 ± 0.716
2.649GluGln: 2.649 ± 0.973
3.826GluArg: 3.826 ± 0.634
2.649GluSer: 2.649 ± 0.756
3.826GluThr: 3.826 ± 0.897
3.237GluVal: 3.237 ± 0.84
0.883GluTrp: 0.883 ± 0.693
2.354GluTyr: 2.354 ± 1.163
0.0GluXaa: 0.0 ± 0.0
Phe
1.766PheAla: 1.766 ± 0.796
0.589PheCys: 0.589 ± 0.247
1.471PheAsp: 1.471 ± 0.568
0.883PheGlu: 0.883 ± 0.515
0.589PhePhe: 0.589 ± 0.769
1.766PheGly: 1.766 ± 1.169
0.589PheHis: 0.589 ± 0.378
1.766PheIle: 1.766 ± 0.564
1.766PheLys: 1.766 ± 0.75
2.354PheLeu: 2.354 ± 0.662
0.294PheMet: 0.294 ± 0.252
1.177PheAsn: 1.177 ± 0.519
2.943PhePro: 2.943 ± 1.389
3.531PheGln: 3.531 ± 0.54
0.589PheArg: 0.589 ± 0.345
2.06PheSer: 2.06 ± 0.609
2.943PheThr: 2.943 ± 0.666
0.589PheVal: 0.589 ± 0.247
1.177PheTrp: 1.177 ± 0.445
1.177PheTyr: 1.177 ± 0.817
0.0PheXaa: 0.0 ± 0.0
Gly
0.883GlyAla: 0.883 ± 0.417
0.294GlyCys: 0.294 ± 0.359
3.237GlyAsp: 3.237 ± 1.076
2.943GlyGlu: 2.943 ± 1.062
2.943GlyPhe: 2.943 ± 0.839
3.531GlyGly: 3.531 ± 1.553
1.177GlyHis: 1.177 ± 0.411
4.414GlyIle: 4.414 ± 1.15
4.414GlyLys: 4.414 ± 1.246
4.414GlyLeu: 4.414 ± 1.05
1.766GlyMet: 1.766 ± 0.878
4.709GlyAsn: 4.709 ± 1.424
2.06GlyPro: 2.06 ± 0.729
4.12GlyGln: 4.12 ± 1.013
3.531GlyArg: 3.531 ± 2.194
3.826GlySer: 3.826 ± 0.546
2.649GlyThr: 2.649 ± 1.358
2.06GlyVal: 2.06 ± 0.478
1.177GlyTrp: 1.177 ± 0.525
3.237GlyTyr: 3.237 ± 0.782
0.0GlyXaa: 0.0 ± 0.0
His
1.177HisAla: 1.177 ± 0.313
0.589HisCys: 0.589 ± 0.247
0.883HisAsp: 0.883 ± 0.36
2.943HisGlu: 2.943 ± 0.606
1.177HisPhe: 1.177 ± 0.678
0.589HisGly: 0.589 ± 0.345
0.589HisHis: 0.589 ± 0.545
1.766HisIle: 1.766 ± 0.741
0.883HisLys: 0.883 ± 0.404
2.06HisLeu: 2.06 ± 1.231
0.294HisMet: 0.294 ± 0.385
0.883HisAsn: 0.883 ± 0.557
2.943HisPro: 2.943 ± 0.921
1.766HisGln: 1.766 ± 1.373
0.0HisArg: 0.0 ± 0.0
1.471HisSer: 1.471 ± 0.859
1.766HisThr: 1.766 ± 0.806
1.177HisVal: 1.177 ± 0.358
0.294HisTrp: 0.294 ± 0.252
0.294HisTyr: 0.294 ± 0.324
0.0HisXaa: 0.0 ± 0.0
Ile
3.531IleAla: 3.531 ± 0.978
1.177IleCys: 1.177 ± 0.756
2.354IleAsp: 2.354 ± 1.028
3.237IleGlu: 3.237 ± 0.641
1.766IlePhe: 1.766 ± 1.169
4.12IleGly: 4.12 ± 1.285
0.589IleHis: 0.589 ± 0.247
5.886IleIle: 5.886 ± 1.168
4.414IleLys: 4.414 ± 1.888
7.946IleLeu: 7.946 ± 2.643
1.177IleMet: 1.177 ± 0.383
3.826IleAsn: 3.826 ± 1.142
5.886IlePro: 5.886 ± 1.73
3.531IleGln: 3.531 ± 1.031
3.826IleArg: 3.826 ± 0.474
2.943IleSer: 2.943 ± 0.923
6.474IleThr: 6.474 ± 1.164
3.237IleVal: 3.237 ± 1.251
0.883IleTrp: 0.883 ± 0.756
2.06IleTyr: 2.06 ± 1.007
0.0IleXaa: 0.0 ± 0.0
Lys
2.649LysAla: 2.649 ± 0.857
1.766LysCys: 1.766 ± 1.227
3.237LysAsp: 3.237 ± 1.15
4.12LysGlu: 4.12 ± 1.083
0.589LysPhe: 0.589 ± 0.353
2.354LysGly: 2.354 ± 0.651
1.766LysHis: 1.766 ± 0.808
4.414LysIle: 4.414 ± 1.291
4.12LysLys: 4.12 ± 1.103
7.652LysLeu: 7.652 ± 2.413
0.294LysMet: 0.294 ± 0.229
2.06LysAsn: 2.06 ± 0.867
5.003LysPro: 5.003 ± 1.441
2.943LysGln: 2.943 ± 1.043
3.237LysArg: 3.237 ± 1.062
3.531LysSer: 3.531 ± 0.798
3.826LysThr: 3.826 ± 1.367
1.471LysVal: 1.471 ± 0.367
2.354LysTrp: 2.354 ± 0.709
2.649LysTyr: 2.649 ± 1.162
0.0LysXaa: 0.0 ± 0.0
Leu
4.414LeuAla: 4.414 ± 1.59
1.177LeuCys: 1.177 ± 0.531
4.414LeuAsp: 4.414 ± 0.303
3.826LeuGlu: 3.826 ± 0.645
2.06LeuPhe: 2.06 ± 0.911
4.709LeuGly: 4.709 ± 0.916
3.531LeuHis: 3.531 ± 0.568
5.003LeuIle: 5.003 ± 1.776
7.063LeuLys: 7.063 ± 2.023
9.417LeuLeu: 9.417 ± 2.421
1.177LeuMet: 1.177 ± 0.728
5.297LeuAsn: 5.297 ± 1.092
7.357LeuPro: 7.357 ± 0.765
5.886LeuGln: 5.886 ± 1.122
6.474LeuArg: 6.474 ± 1.926
7.357LeuSer: 7.357 ± 1.207
6.474LeuThr: 6.474 ± 1.096
6.474LeuVal: 6.474 ± 1.787
1.766LeuTrp: 1.766 ± 0.68
3.826LeuTyr: 3.826 ± 0.855
0.0LeuXaa: 0.0 ± 0.0
Met
1.177MetAla: 1.177 ± 0.525
0.883MetCys: 0.883 ± 0.532
0.883MetAsp: 0.883 ± 0.424
0.589MetGlu: 0.589 ± 0.504
0.589MetPhe: 0.589 ± 0.46
1.766MetGly: 1.766 ± 0.497
0.883MetHis: 0.883 ± 0.509
1.177MetIle: 1.177 ± 0.753
0.294MetLys: 0.294 ± 0.252
1.471MetLeu: 1.471 ± 0.509
0.294MetMet: 0.294 ± 0.324
1.177MetAsn: 1.177 ± 0.678
1.471MetPro: 1.471 ± 0.372
0.883MetGln: 0.883 ± 0.599
0.294MetArg: 0.294 ± 0.229
0.883MetSer: 0.883 ± 0.463
0.883MetThr: 0.883 ± 0.62
0.294MetVal: 0.294 ± 0.229
0.0MetTrp: 0.0 ± 0.0
0.294MetTyr: 0.294 ± 0.229
0.0MetXaa: 0.0 ± 0.0
Asn
3.531AsnAla: 3.531 ± 0.966
0.294AsnCys: 0.294 ± 0.359
1.471AsnAsp: 1.471 ± 0.295
1.766AsnGlu: 1.766 ± 0.263
4.709AsnPhe: 4.709 ± 1.466
3.531AsnGly: 3.531 ± 1.069
0.589AsnHis: 0.589 ± 0.458
4.414AsnIle: 4.414 ± 0.864
2.354AsnLys: 2.354 ± 1.112
3.826AsnLeu: 3.826 ± 1.352
0.0AsnMet: 0.0 ± 0.0
2.943AsnAsn: 2.943 ± 1.014
4.12AsnPro: 4.12 ± 2.451
4.414AsnGln: 4.414 ± 1.995
2.06AsnArg: 2.06 ± 1.467
4.414AsnSer: 4.414 ± 0.754
3.531AsnThr: 3.531 ± 1.136
2.06AsnVal: 2.06 ± 0.446
1.471AsnTrp: 1.471 ± 0.618
1.471AsnTyr: 1.471 ± 0.848
0.0AsnXaa: 0.0 ± 0.0
Pro
3.237ProAla: 3.237 ± 1.031
1.177ProCys: 1.177 ± 0.415
2.06ProAsp: 2.06 ± 0.989
2.354ProGlu: 2.354 ± 0.937
2.943ProPhe: 2.943 ± 1.219
4.12ProGly: 4.12 ± 1.389
2.649ProHis: 2.649 ± 0.569
5.003ProIle: 5.003 ± 0.902
5.003ProLys: 5.003 ± 1.85
10.594ProLeu: 10.594 ± 2.542
2.06ProMet: 2.06 ± 0.52
5.003ProAsn: 5.003 ± 1.858
5.297ProPro: 5.297 ± 1.126
3.531ProGln: 3.531 ± 0.944
2.649ProArg: 2.649 ± 1.299
6.474ProSer: 6.474 ± 1.582
5.003ProThr: 5.003 ± 1.043
3.237ProVal: 3.237 ± 1.129
0.883ProTrp: 0.883 ± 0.444
1.766ProTyr: 1.766 ± 0.838
0.0ProXaa: 0.0 ± 0.0
Gln
2.943GlnAla: 2.943 ± 0.525
0.589GlnCys: 0.589 ± 0.378
3.237GlnAsp: 3.237 ± 0.886
4.709GlnGlu: 4.709 ± 0.633
0.0GlnPhe: 0.0 ± 0.0
4.414GlnGly: 4.414 ± 0.754
1.766GlnHis: 1.766 ± 0.484
3.237GlnIle: 3.237 ± 0.845
3.531GlnLys: 3.531 ± 1.459
4.12GlnLeu: 4.12 ± 1.114
0.294GlnMet: 0.294 ± 0.321
4.414GlnAsn: 4.414 ± 1.524
5.297GlnPro: 5.297 ± 2.303
1.471GlnGln: 1.471 ± 0.629
2.06GlnArg: 2.06 ± 0.956
2.649GlnSer: 2.649 ± 0.793
1.471GlnThr: 1.471 ± 0.317
3.531GlnVal: 3.531 ± 0.354
1.766GlnTrp: 1.766 ± 0.593
1.766GlnTyr: 1.766 ± 1.064
0.0GlnXaa: 0.0 ± 0.0
Arg
3.531ArgAla: 3.531 ± 0.846
0.883ArgCys: 0.883 ± 0.463
1.766ArgAsp: 1.766 ± 0.7
2.649ArgGlu: 2.649 ± 1.054
0.883ArgPhe: 0.883 ± 0.404
2.943ArgGly: 2.943 ± 2.326
1.177ArgHis: 1.177 ± 0.789
2.06ArgIle: 2.06 ± 0.874
2.943ArgLys: 2.943 ± 1.082
3.237ArgLeu: 3.237 ± 1.069
1.766ArgMet: 1.766 ± 0.687
3.237ArgAsn: 3.237 ± 1.601
4.709ArgPro: 4.709 ± 1.05
1.471ArgGln: 1.471 ± 0.399
2.943ArgArg: 2.943 ± 1.185
2.943ArgSer: 2.943 ± 1.37
2.06ArgThr: 2.06 ± 0.743
2.943ArgVal: 2.943 ± 1.762
1.766ArgTrp: 1.766 ± 1.128
1.766ArgTyr: 1.766 ± 1.051
0.0ArgXaa: 0.0 ± 0.0
Ser
5.003SerAla: 5.003 ± 2.027
0.589SerCys: 0.589 ± 0.769
3.531SerAsp: 3.531 ± 0.783
3.531SerGlu: 3.531 ± 1.366
0.883SerPhe: 0.883 ± 0.84
4.414SerGly: 4.414 ± 2.012
1.471SerHis: 1.471 ± 0.483
3.826SerIle: 3.826 ± 1.118
2.354SerLys: 2.354 ± 0.767
5.297SerLeu: 5.297 ± 1.151
1.177SerMet: 1.177 ± 0.474
2.06SerAsn: 2.06 ± 0.792
5.886SerPro: 5.886 ± 1.183
3.826SerGln: 3.826 ± 1.265
1.471SerArg: 1.471 ± 0.915
5.297SerSer: 5.297 ± 1.125
7.652SerThr: 7.652 ± 0.776
3.826SerVal: 3.826 ± 0.904
2.354SerTrp: 2.354 ± 0.563
2.943SerTyr: 2.943 ± 0.655
0.0SerXaa: 0.0 ± 0.0
Thr
2.354ThrAla: 2.354 ± 1.124
1.471ThrCys: 1.471 ± 0.399
1.766ThrAsp: 1.766 ± 0.594
5.297ThrGlu: 5.297 ± 0.625
1.766ThrPhe: 1.766 ± 0.7
3.531ThrGly: 3.531 ± 1.278
1.177ThrHis: 1.177 ± 0.383
4.709ThrIle: 4.709 ± 1.135
4.414ThrLys: 4.414 ± 0.882
4.709ThrLeu: 4.709 ± 0.389
0.294ThrMet: 0.294 ± 0.229
2.649ThrAsn: 2.649 ± 0.72
7.652ThrPro: 7.652 ± 2.158
3.826ThrGln: 3.826 ± 0.959
3.237ThrArg: 3.237 ± 0.948
4.709ThrSer: 4.709 ± 1.288
5.886ThrThr: 5.886 ± 1.074
5.592ThrVal: 5.592 ± 1.183
1.471ThrTrp: 1.471 ± 0.295
1.766ThrTyr: 1.766 ± 0.545
0.0ThrXaa: 0.0 ± 0.0
Val
2.649ValAla: 2.649 ± 1.125
0.0ValCys: 0.0 ± 0.0
2.354ValAsp: 2.354 ± 0.925
3.531ValGlu: 3.531 ± 1.006
2.354ValPhe: 2.354 ± 0.498
1.471ValGly: 1.471 ± 0.629
1.471ValHis: 1.471 ± 0.483
5.003ValIle: 5.003 ± 1.109
2.649ValLys: 2.649 ± 0.612
5.886ValLeu: 5.886 ± 0.391
1.177ValMet: 1.177 ± 0.728
3.826ValAsn: 3.826 ± 1.248
3.237ValPro: 3.237 ± 0.596
2.649ValGln: 2.649 ± 0.5
0.883ValArg: 0.883 ± 0.246
3.826ValSer: 3.826 ± 0.775
3.531ValThr: 3.531 ± 0.568
3.237ValVal: 3.237 ± 0.587
0.589ValTrp: 0.589 ± 0.409
2.943ValTyr: 2.943 ± 0.391
0.0ValXaa: 0.0 ± 0.0
Trp
0.883TrpAla: 0.883 ± 0.404
0.589TrpCys: 0.589 ± 0.545
1.177TrpAsp: 1.177 ± 0.703
3.237TrpGlu: 3.237 ± 0.717
0.589TrpPhe: 0.589 ± 0.353
1.471TrpGly: 1.471 ± 0.731
0.294TrpHis: 0.294 ± 0.252
1.471TrpIle: 1.471 ± 0.601
1.471TrpLys: 1.471 ± 0.399
2.943TrpLeu: 2.943 ± 0.713
0.0TrpMet: 0.0 ± 0.0
0.883TrpAsn: 0.883 ± 0.463
1.177TrpPro: 1.177 ± 0.678
0.589TrpGln: 0.589 ± 0.353
1.766TrpArg: 1.766 ± 0.577
0.294TrpSer: 0.294 ± 0.229
2.354TrpThr: 2.354 ± 0.317
0.294TrpVal: 0.294 ± 0.252
0.589TrpTrp: 0.589 ± 0.317
0.883TrpTyr: 0.883 ± 0.36
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.177TyrAla: 1.177 ± 0.655
0.294TyrCys: 0.294 ± 0.252
1.766TyrAsp: 1.766 ± 0.856
2.649TyrGlu: 2.649 ± 0.715
0.883TyrPhe: 0.883 ± 0.751
1.177TyrGly: 1.177 ± 0.519
0.589TyrHis: 0.589 ± 0.458
3.531TyrIle: 3.531 ± 0.54
2.06TyrLys: 2.06 ± 0.698
3.826TyrLeu: 3.826 ± 1.498
0.589TyrMet: 0.589 ± 0.345
2.354TyrAsn: 2.354 ± 0.662
2.649TyrPro: 2.649 ± 0.39
1.766TyrGln: 1.766 ± 0.796
2.06TyrArg: 2.06 ± 0.885
3.826TyrSer: 3.826 ± 1.353
3.237TyrThr: 3.237 ± 0.645
2.943TyrVal: 2.943 ± 0.923
0.883TyrTrp: 0.883 ± 0.404
2.943TyrTyr: 2.943 ± 1.219
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3399 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski