Amino acid dipepetide frequency for Simian foamy virus type 3 (strain LK3) (SFVagm) (SFV-3)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.344AlaAla: 4.344 ± 0.745
1.448AlaCys: 1.448 ± 0.459
2.606AlaAsp: 2.606 ± 0.882
3.475AlaGlu: 3.475 ± 1.073
2.027AlaPhe: 2.027 ± 0.555
2.606AlaGly: 2.606 ± 0.985
1.738AlaHis: 1.738 ± 0.809
1.448AlaIle: 1.448 ± 0.488
3.765AlaLys: 3.765 ± 1.102
6.082AlaLeu: 6.082 ± 0.82
1.158AlaMet: 1.158 ± 0.691
2.606AlaAsn: 2.606 ± 0.631
3.765AlaPro: 3.765 ± 1.539
4.054AlaGln: 4.054 ± 1.584
4.344AlaArg: 4.344 ± 1.275
3.475AlaSer: 3.475 ± 1.085
4.054AlaThr: 4.054 ± 0.881
3.475AlaVal: 3.475 ± 1.05
0.579AlaTrp: 0.579 ± 0.234
2.027AlaTyr: 2.027 ± 0.883
0.0AlaXaa: 0.0 ± 0.0
Cys
0.869CysAla: 0.869 ± 0.41
0.29CysCys: 0.29 ± 0.234
0.579CysAsp: 0.579 ± 0.359
0.0CysGlu: 0.0 ± 0.0
0.579CysPhe: 0.579 ± 0.468
0.869CysGly: 0.869 ± 0.706
0.0CysHis: 0.0 ± 0.0
0.579CysIle: 0.579 ± 0.234
0.869CysLys: 0.869 ± 0.533
1.738CysLeu: 1.738 ± 0.687
0.869CysMet: 0.869 ± 0.706
0.869CysAsn: 0.869 ± 0.702
0.869CysPro: 0.869 ± 0.496
0.869CysGln: 0.869 ± 1.046
1.448CysArg: 1.448 ± 0.566
0.579CysSer: 0.579 ± 0.468
0.29CysThr: 0.29 ± 0.234
1.158CysVal: 1.158 ± 0.266
0.869CysTrp: 0.869 ± 0.285
0.579CysTyr: 0.579 ± 0.468
0.0CysXaa: 0.0 ± 0.0
Asp
2.317AspAla: 2.317 ± 0.963
1.448AspCys: 1.448 ± 0.851
1.448AspAsp: 1.448 ± 0.614
1.738AspGlu: 1.738 ± 0.495
0.869AspPhe: 0.869 ± 0.702
2.317AspGly: 2.317 ± 0.862
1.448AspHis: 1.448 ± 0.249
4.634AspIle: 4.634 ± 0.317
2.027AspLys: 2.027 ± 1.091
4.344AspLeu: 4.344 ± 1.338
1.158AspMet: 1.158 ± 0.561
1.448AspAsn: 1.448 ± 0.793
4.344AspPro: 4.344 ± 1.694
3.765AspGln: 3.765 ± 0.817
1.448AspArg: 1.448 ± 0.35
2.896AspSer: 2.896 ± 1.176
1.158AspThr: 1.158 ± 0.266
1.738AspVal: 1.738 ± 0.219
0.869AspTrp: 0.869 ± 0.411
2.317AspTyr: 2.317 ± 0.255
0.0AspXaa: 0.0 ± 0.0
Glu
2.317GluAla: 2.317 ± 0.451
0.869GluCys: 0.869 ± 0.702
2.606GluAsp: 2.606 ± 1.144
3.475GluGlu: 3.475 ± 0.851
2.606GluPhe: 2.606 ± 1.048
4.634GluGly: 4.634 ± 1.334
0.579GluHis: 0.579 ± 0.234
3.186GluIle: 3.186 ± 0.537
4.923GluLys: 4.923 ± 2.047
4.344GluLeu: 4.344 ± 0.802
1.158GluMet: 1.158 ± 0.45
3.765GluAsn: 3.765 ± 1.223
2.896GluPro: 2.896 ± 1.493
3.186GluGln: 3.186 ± 1.176
3.765GluArg: 3.765 ± 0.835
2.606GluSer: 2.606 ± 0.855
1.158GluThr: 1.158 ± 0.626
4.634GluVal: 4.634 ± 1.465
0.29GluTrp: 0.29 ± 0.234
1.738GluTyr: 1.738 ± 0.365
0.0GluXaa: 0.0 ± 0.0
Phe
2.317PheAla: 2.317 ± 0.879
0.29PheCys: 0.29 ± 0.234
0.579PheAsp: 0.579 ± 0.234
1.448PheGlu: 1.448 ± 0.732
0.579PhePhe: 0.579 ± 0.451
2.027PheGly: 2.027 ± 0.733
1.158PheHis: 1.158 ± 0.368
0.869PheIle: 0.869 ± 0.396
1.448PheLys: 1.448 ± 0.54
4.054PheLeu: 4.054 ± 0.559
0.0PheMet: 0.0 ± 0.0
1.448PheAsn: 1.448 ± 0.305
2.027PhePro: 2.027 ± 0.649
0.869PheGln: 0.869 ± 0.895
0.579PheArg: 0.579 ± 0.234
1.158PheSer: 1.158 ± 0.903
2.027PheThr: 2.027 ± 0.992
1.158PheVal: 1.158 ± 0.626
0.869PheTrp: 0.869 ± 0.345
1.158PheTyr: 1.158 ± 0.601
0.0PheXaa: 0.0 ± 0.0
Gly
2.027GlyAla: 2.027 ± 0.598
0.579GlyCys: 0.579 ± 0.541
3.186GlyAsp: 3.186 ± 1.227
1.448GlyGlu: 1.448 ± 0.776
2.317GlyPhe: 2.317 ± 1.126
2.606GlyGly: 2.606 ± 2.084
1.738GlyHis: 1.738 ± 0.219
5.213GlyIle: 5.213 ± 1.417
2.027GlyLys: 2.027 ± 0.992
4.344GlyLeu: 4.344 ± 1.306
0.869GlyMet: 0.869 ± 0.396
4.634GlyAsn: 4.634 ± 1.744
4.054GlyPro: 4.054 ± 2.492
5.502GlyGln: 5.502 ± 2.796
4.344GlyArg: 4.344 ± 2.658
4.634GlySer: 4.634 ± 1.901
3.186GlyThr: 3.186 ± 0.344
2.606GlyVal: 2.606 ± 0.704
0.869GlyTrp: 0.869 ± 0.425
2.896GlyTyr: 2.896 ± 1.233
0.0GlyXaa: 0.0 ± 0.0
His
0.29HisAla: 0.29 ± 0.349
0.869HisCys: 0.869 ± 0.41
0.869HisAsp: 0.869 ± 0.411
1.448HisGlu: 1.448 ± 0.305
0.29HisPhe: 0.29 ± 0.349
1.158HisGly: 1.158 ± 0.673
0.869HisHis: 0.869 ± 0.285
1.158HisIle: 1.158 ± 0.369
1.158HisLys: 1.158 ± 0.469
4.054HisLeu: 4.054 ± 1.393
0.0HisMet: 0.0 ± 0.0
1.448HisAsn: 1.448 ± 0.565
2.317HisPro: 2.317 ± 0.717
1.158HisGln: 1.158 ± 0.442
0.869HisArg: 0.869 ± 0.379
1.448HisSer: 1.448 ± 0.574
1.448HisThr: 1.448 ± 0.611
1.448HisVal: 1.448 ± 0.626
0.869HisTrp: 0.869 ± 0.533
0.869HisTyr: 0.869 ± 0.533
0.0HisXaa: 0.0 ± 0.0
Ile
4.634IleAla: 4.634 ± 1.272
1.448IleCys: 1.448 ± 0.729
2.896IleAsp: 2.896 ± 0.494
2.606IleGlu: 2.606 ± 1.337
0.869IlePhe: 0.869 ± 0.228
3.765IleGly: 3.765 ± 1.191
3.186IleHis: 3.186 ± 0.549
4.344IleIle: 4.344 ± 1.486
3.765IleLys: 3.765 ± 1.404
5.502IleLeu: 5.502 ± 0.441
1.448IleMet: 1.448 ± 0.482
3.475IleAsn: 3.475 ± 1.477
4.634IlePro: 4.634 ± 1.348
5.792IleGln: 5.792 ± 0.794
3.765IleArg: 3.765 ± 0.679
3.765IleSer: 3.765 ± 1.207
3.186IleThr: 3.186 ± 0.917
3.475IleVal: 3.475 ± 1.309
1.158IleTrp: 1.158 ± 0.266
1.448IleTyr: 1.448 ± 0.35
0.0IleXaa: 0.0 ± 0.0
Lys
4.344LysAla: 4.344 ± 1.699
1.158LysCys: 1.158 ± 0.937
1.738LysAsp: 1.738 ± 0.682
5.502LysGlu: 5.502 ± 1.767
1.448LysPhe: 1.448 ± 0.463
1.448LysGly: 1.448 ± 1.129
2.317LysHis: 2.317 ± 0.698
4.634LysIle: 4.634 ± 2.077
4.054LysLys: 4.054 ± 1.083
3.765LysLeu: 3.765 ± 1.28
2.027LysMet: 2.027 ± 1.971
1.158LysAsn: 1.158 ± 0.601
3.765LysPro: 3.765 ± 1.739
2.027LysGln: 2.027 ± 1.259
2.317LysArg: 2.317 ± 0.784
2.606LysSer: 2.606 ± 0.862
3.186LysThr: 3.186 ± 1.035
3.765LysVal: 3.765 ± 1.178
1.738LysTrp: 1.738 ± 0.587
3.186LysTyr: 3.186 ± 1.009
0.0LysXaa: 0.0 ± 0.0
Leu
4.634LeuAla: 4.634 ± 0.317
0.869LeuCys: 0.869 ± 0.836
4.344LeuAsp: 4.344 ± 0.927
5.792LeuGlu: 5.792 ± 1.216
2.606LeuPhe: 2.606 ± 0.906
5.502LeuGly: 5.502 ± 1.473
1.448LeuHis: 1.448 ± 0.459
5.213LeuIle: 5.213 ± 1.076
8.978LeuLys: 8.978 ± 2.512
10.136LeuLeu: 10.136 ± 1.676
1.448LeuMet: 1.448 ± 0.611
4.923LeuAsn: 4.923 ± 1.196
6.95LeuPro: 6.95 ± 0.711
6.661LeuGln: 6.661 ± 1.263
5.213LeuArg: 5.213 ± 1.832
4.634LeuSer: 4.634 ± 0.572
6.082LeuThr: 6.082 ± 1.724
6.082LeuVal: 6.082 ± 0.879
1.738LeuTrp: 1.738 ± 0.687
3.186LeuTyr: 3.186 ± 0.636
0.0LeuXaa: 0.0 ± 0.0
Met
1.158MetAla: 1.158 ± 1.056
0.579MetCys: 0.579 ± 0.381
1.448MetAsp: 1.448 ± 1.019
1.448MetGlu: 1.448 ± 0.575
0.0MetPhe: 0.0 ± 0.0
1.738MetGly: 1.738 ± 0.292
0.0MetHis: 0.0 ± 0.0
0.869MetIle: 0.869 ± 0.702
0.869MetLys: 0.869 ± 0.396
1.738MetLeu: 1.738 ± 0.531
0.0MetMet: 0.0 ± 0.0
1.158MetAsn: 1.158 ± 0.626
0.579MetPro: 0.579 ± 0.725
0.869MetGln: 0.869 ± 0.895
1.158MetArg: 1.158 ± 0.785
1.738MetSer: 1.738 ± 1.013
1.158MetThr: 1.158 ± 0.459
0.869MetVal: 0.869 ± 0.677
0.29MetTrp: 0.29 ± 0.298
0.579MetTyr: 0.579 ± 0.234
0.0MetXaa: 0.0 ± 0.0
Asn
2.606AsnAla: 2.606 ± 0.202
0.0AsnCys: 0.0 ± 0.0
1.448AsnAsp: 1.448 ± 0.762
2.896AsnGlu: 2.896 ± 1.137
1.158AsnPhe: 1.158 ± 0.601
2.896AsnGly: 2.896 ± 1.081
0.869AsnHis: 0.869 ± 0.425
4.634AsnIle: 4.634 ± 1.602
2.317AsnLys: 2.317 ± 0.994
5.792AsnLeu: 5.792 ± 1.312
1.448AsnMet: 1.448 ± 0.641
2.896AsnAsn: 2.896 ± 0.868
4.054AsnPro: 4.054 ± 1.309
4.054AsnGln: 4.054 ± 2.384
1.448AsnArg: 1.448 ± 0.328
3.475AsnSer: 3.475 ± 1.059
3.765AsnThr: 3.765 ± 0.75
2.606AsnVal: 2.606 ± 0.865
1.448AsnTrp: 1.448 ± 0.658
0.869AsnTyr: 0.869 ± 0.411
0.0AsnXaa: 0.0 ± 0.0
Pro
3.765ProAla: 3.765 ± 0.99
0.869ProCys: 0.869 ± 0.509
2.606ProAsp: 2.606 ± 1.344
4.344ProGlu: 4.344 ± 0.555
1.738ProPhe: 1.738 ± 0.784
4.634ProGly: 4.634 ± 2.582
1.738ProHis: 1.738 ± 0.821
4.923ProIle: 4.923 ± 1.471
3.475ProLys: 3.475 ± 0.986
7.819ProLeu: 7.819 ± 1.381
1.158ProMet: 1.158 ± 0.607
3.186ProAsn: 3.186 ± 0.852
4.634ProPro: 4.634 ± 0.992
3.765ProGln: 3.765 ± 0.528
5.213ProArg: 5.213 ± 1.435
6.082ProSer: 6.082 ± 1.394
3.475ProThr: 3.475 ± 0.64
5.502ProVal: 5.502 ± 0.822
1.448ProTrp: 1.448 ± 0.577
2.317ProTyr: 2.317 ± 0.64
0.0ProXaa: 0.0 ± 0.0
Gln
3.186GlnAla: 3.186 ± 1.594
1.158GlnCys: 1.158 ± 0.368
2.606GlnAsp: 2.606 ± 1.306
4.344GlnGlu: 4.344 ± 0.949
1.448GlnPhe: 1.448 ± 0.463
5.213GlnGly: 5.213 ± 1.367
1.738GlnHis: 1.738 ± 0.394
2.027GlnIle: 2.027 ± 0.861
2.317GlnLys: 2.317 ± 0.698
6.371GlnLeu: 6.371 ± 2.114
0.579GlnMet: 0.579 ± 0.338
3.765GlnAsn: 3.765 ± 0.758
6.371GlnPro: 6.371 ± 3.66
6.95GlnGln: 6.95 ± 3.612
3.186GlnArg: 3.186 ± 2.571
3.186GlnSer: 3.186 ± 1.25
1.738GlnThr: 1.738 ± 0.518
4.054GlnVal: 4.054 ± 0.708
0.579GlnTrp: 0.579 ± 0.468
2.606GlnTyr: 2.606 ± 0.429
0.0GlnXaa: 0.0 ± 0.0
Arg
3.186ArgAla: 3.186 ± 0.946
0.869ArgCys: 0.869 ± 0.41
4.344ArgAsp: 4.344 ± 1.317
2.606ArgGlu: 2.606 ± 0.834
1.158ArgPhe: 1.158 ± 0.266
4.054ArgGly: 4.054 ± 2.504
0.869ArgHis: 0.869 ± 0.581
2.606ArgIle: 2.606 ± 0.463
2.317ArgLys: 2.317 ± 0.736
4.634ArgLeu: 4.634 ± 0.572
1.158ArgMet: 1.158 ± 0.369
2.317ArgAsn: 2.317 ± 0.596
4.923ArgPro: 4.923 ± 1.279
2.027ArgGln: 2.027 ± 1.349
3.765ArgArg: 3.765 ± 1.266
2.606ArgSer: 2.606 ± 0.704
2.317ArgThr: 2.317 ± 1.337
3.186ArgVal: 3.186 ± 0.926
2.027ArgTrp: 2.027 ± 0.812
1.738ArgTyr: 1.738 ± 0.9
0.0ArgXaa: 0.0 ± 0.0
Ser
5.213SerAla: 5.213 ± 0.77
0.29SerCys: 0.29 ± 0.234
2.896SerAsp: 2.896 ± 0.747
3.186SerGlu: 3.186 ± 1.364
1.448SerPhe: 1.448 ± 0.575
4.634SerGly: 4.634 ± 2.383
1.158SerHis: 1.158 ± 0.368
6.371SerIle: 6.371 ± 0.69
2.317SerLys: 2.317 ± 0.786
6.371SerLeu: 6.371 ± 0.64
1.738SerMet: 1.738 ± 0.69
2.896SerAsn: 2.896 ± 0.369
3.765SerPro: 3.765 ± 0.924
3.186SerGln: 3.186 ± 0.991
2.317SerArg: 2.317 ± 0.929
7.819SerSer: 7.819 ± 2.889
4.634SerThr: 4.634 ± 0.669
1.448SerVal: 1.448 ± 0.611
0.869SerTrp: 0.869 ± 0.518
3.475SerTyr: 3.475 ± 0.873
0.0SerXaa: 0.0 ± 0.0
Thr
5.213ThrAla: 5.213 ± 0.803
0.579ThrCys: 0.579 ± 0.396
2.027ThrAsp: 2.027 ± 0.836
2.317ThrGlu: 2.317 ± 0.548
2.317ThrPhe: 2.317 ± 1.003
3.765ThrGly: 3.765 ± 0.679
1.158ThrHis: 1.158 ± 0.703
2.317ThrIle: 2.317 ± 0.67
3.186ThrLys: 3.186 ± 1.02
3.186ThrLeu: 3.186 ± 0.633
0.29ThrMet: 0.29 ± 0.226
1.448ThrAsn: 1.448 ± 0.451
5.502ThrPro: 5.502 ± 1.697
2.027ThrGln: 2.027 ± 0.672
3.475ThrArg: 3.475 ± 0.957
4.923ThrSer: 4.923 ± 1.06
4.634ThrThr: 4.634 ± 1.237
4.634ThrVal: 4.634 ± 1.039
1.158ThrTrp: 1.158 ± 0.903
2.317ThrTyr: 2.317 ± 1.317
0.0ThrXaa: 0.0 ± 0.0
Val
4.344ValAla: 4.344 ± 1.391
0.579ValCys: 0.579 ± 0.359
2.606ValAsp: 2.606 ± 0.664
2.027ValGlu: 2.027 ± 0.836
1.448ValPhe: 1.448 ± 0.353
1.738ValGly: 1.738 ± 0.506
1.158ValHis: 1.158 ± 0.524
4.923ValIle: 4.923 ± 1.161
4.344ValLys: 4.344 ± 1.254
4.923ValLeu: 4.923 ± 1.438
0.869ValMet: 0.869 ± 0.411
3.765ValAsn: 3.765 ± 0.915
4.054ValPro: 4.054 ± 1.137
2.606ValGln: 2.606 ± 0.614
1.448ValArg: 1.448 ± 0.608
4.923ValSer: 4.923 ± 1.269
4.923ValThr: 4.923 ± 1.028
4.923ValVal: 4.923 ± 1.426
1.738ValTrp: 1.738 ± 0.915
2.896ValTyr: 2.896 ± 0.896
0.0ValXaa: 0.0 ± 0.0
Trp
1.158TrpAla: 1.158 ± 0.459
0.0TrpCys: 0.0 ± 0.0
0.869TrpAsp: 0.869 ± 0.285
2.317TrpGlu: 2.317 ± 1.212
0.0TrpPhe: 0.0 ± 0.0
1.448TrpGly: 1.448 ± 0.98
0.29TrpHis: 0.29 ± 0.226
2.027TrpIle: 2.027 ± 0.432
0.579TrpLys: 0.579 ± 0.234
3.475TrpLeu: 3.475 ± 1.014
0.29TrpMet: 0.29 ± 0.226
1.448TrpAsn: 1.448 ± 0.683
1.158TrpPro: 1.158 ± 0.592
0.869TrpGln: 0.869 ± 0.396
1.448TrpArg: 1.448 ± 0.577
0.579TrpSer: 0.579 ± 0.451
2.027TrpThr: 2.027 ± 0.463
0.29TrpVal: 0.29 ± 0.363
0.579TrpTrp: 0.579 ± 0.278
0.579TrpTyr: 0.579 ± 0.234
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.158TyrAla: 1.158 ± 0.687
0.579TyrCys: 0.579 ± 0.353
2.027TyrAsp: 2.027 ± 0.948
2.317TyrGlu: 2.317 ± 0.912
1.158TyrPhe: 1.158 ± 0.785
2.027TyrGly: 2.027 ± 1.067
0.579TyrHis: 0.579 ± 0.396
3.186TyrIle: 3.186 ± 0.691
1.738TyrLys: 1.738 ± 0.59
3.765TyrLeu: 3.765 ± 1.377
0.29TyrMet: 0.29 ± 0.234
2.027TyrAsn: 2.027 ± 0.362
2.027TyrPro: 2.027 ± 0.592
3.186TyrGln: 3.186 ± 0.699
1.158TyrArg: 1.158 ± 0.824
2.896TyrSer: 2.896 ± 1.392
2.027TyrThr: 2.027 ± 0.618
3.186TyrVal: 3.186 ± 0.793
1.448TyrTrp: 1.448 ± 0.811
2.027TyrTyr: 2.027 ± 1.311
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3454 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski