Amino acid dipeptide frequency for Brown greater galago prosimian foamy virus

Apart from single amino acid frequencies, one can also calculate so-called amino acid dipeptide frequencies.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
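The calculation described above can be sketched in Python. This is only an illustration under stated assumptions, not the code used by the database: the function names `dipeptide_per_mille` and `classify` are hypothetical, overlapping pairs are counted within each protein separately, frequencies are normalized by the total number of pairs, and "over" or "under" is decided simply by comparison with the uniform expectation of 2.5‰.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; anything else is folded into "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs.

    Assumption: overlapping pairs are counted, and the denominator is the
    total number of pairs found in all sequences.
    """
    counts = Counter()
    for seq in sequences:
        # Map non-standard or unknown residues to X before pairing.
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Each sequence is scanned separately, so no pair spans two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def classify(freq_per_mille, expected=1000.0 / 400):
    """Label a frequency relative to the uniform expectation (2.5 per mille)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Toy sequences for illustration only; they are not taken from this proteome.
toy = ["MPPLLPA", "MPPQ"]
freqs = dipeptide_per_mille(toy)
print(freqs["PP"], classify(freqs["PP"]))
```

With real data, the two values from the table can be checked the same way: `classify(12.185)` (LeuPro) gives "over" and `classify(0.42)` (CysAsp) gives "under".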

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second residue. Each cell shows frequency ± error (‰).

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 5.882 ± 0.621 | 1.261 ± 0.548 | 3.361 ± 0.558 | 0.0 ± 0.0 | 2.941 ± 0.606 | 2.941 ± 1.53 | 0.84 ± 0.476 | 3.361 ± 1.114 | 1.681 ± 1.101 | 6.723 ± 1.392 | 1.681 ± 0.549 | 1.261 ± 0.548 | 5.042 ± 2.045 | 3.361 ± 1.603 | 2.101 ± 0.252 | 4.202 ± 1.979 | 4.202 ± 0.77 | 3.361 ± 0.61 | 0.84 ± 0.55 | 1.261 ± 0.417 | 0.0 ± 0.0
Cys | 1.681 ± 0.901 | 0.0 ± 0.0 | 0.42 ± 0.275 | 0.0 ± 0.0 | 0.42 ± 0.453 | 2.521 ± 1.051 | 0.0 ± 0.0 | 0.84 ± 0.55 | 2.101 ± 0.828 | 1.681 ± 0.91 | 0.84 ± 0.404 | 1.681 ± 0.267 | 0.84 ± 0.55 | 0.84 ± 0.404 | 1.261 ± 0.531 | 1.681 ± 0.588 | 1.261 ± 0.531 | 0.42 ± 0.275 | 0.42 ± 0.422 | 2.101 ± 1.03 | 0.0 ± 0.0
Asp | 0.84 ± 0.557 | 1.681 ± 0.588 | 1.681 ± 0.373 | 0.84 ± 0.609 | 1.681 ± 0.267 | 2.101 ± 0.724 | 0.84 ± 0.609 | 5.042 ± 1.439 | 0.42 ± 0.275 | 6.303 ± 1.332 | 1.261 ± 0.299 | 2.941 ± 0.673 | 5.042 ± 2.788 | 2.521 ± 1.651 | 2.101 ± 0.948 | 3.361 ± 1.714 | 2.941 ± 0.988 | 2.941 ± 0.684 | 2.521 ± 0.543 | 1.681 ± 0.267 | 0.0 ± 0.0
Glu | 2.101 ± 1.014 | 2.521 ± 0.9 | 2.941 ± 1.56 | 5.042 ± 1.368 | 1.261 ± 0.531 | 3.361 ± 0.259 | 0.42 ± 0.422 | 1.681 ± 0.767 | 1.681 ± 0.808 | 3.782 ± 1.012 | 1.261 ± 0.759 | 0.84 ± 0.455 | 5.462 ± 2.154 | 2.941 ± 1.087 | 1.681 ± 0.588 | 4.622 ± 2.385 | 1.261 ± 0.429 | 3.782 ± 0.885 | 1.681 ± 0.845 | 0.84 ± 0.309 | 0.0 ± 0.0
Phe | 0.42 ± 0.399 | 1.261 ± 0.548 | 1.681 ± 0.619 | 0.84 ± 0.55 | 0.42 ± 0.453 | 1.681 ± 0.267 | 0.42 ± 0.275 | 0.42 ± 0.453 | 0.84 ± 0.455 | 2.521 ± 0.963 | 0.84 ± 0.455 | 1.261 ± 0.826 | 2.101 ± 0.401 | 3.782 ± 1.15 | 2.101 ± 0.561 | 3.361 ± 1.122 | 2.101 ± 0.252 | 1.681 ± 0.651 | 0.42 ± 0.422 | 0.84 ± 0.455 | 0.0 ± 0.0
Gly | 2.941 ± 1.271 | 0.0 ± 0.0 | 5.042 ± 1.179 | 0.84 ± 0.404 | 2.101 ± 0.47 | 6.303 ± 5.009 | 1.681 ± 0.619 | 3.782 ± 0.864 | 0.42 ± 0.275 | 7.563 ± 0.832 | 3.361 ± 0.558 | 2.941 ± 0.191 | 7.143 ± 3.202 | 5.462 ± 2.467 | 4.202 ± 1.379 | 5.462 ± 1.7 | 3.361 ± 0.984 | 1.261 ± 0.658 | 2.521 ± 0.543 | 2.101 ± 0.678 | 0.0 ± 0.0
His | 0.84 ± 0.55 | 0.42 ± 0.275 | 1.681 ± 0.588 | 3.361 ± 1.35 | 1.261 ± 0.531 | 0.84 ± 0.797 | 0.84 ± 0.404 | 1.681 ± 0.808 | 1.261 ± 0.548 | 4.622 ± 0.745 | 0.84 ± 0.476 | 0.42 ± 0.453 | 1.681 ± 0.267 | 0.42 ± 0.399 | 2.101 ± 1.217 | 0.84 ± 0.557 | 1.261 ± 0.429 | 1.261 ± 0.548 | 0.42 ± 0.275 | 0.0 ± 0.0 | 0.0 ± 0.0
Ile | 1.681 ± 1.101 | 0.84 ± 0.455 | 1.681 ± 0.952 | 3.782 ± 0.604 | 1.261 ± 0.429 | 3.782 ± 0.798 | 0.84 ± 0.309 | 5.882 ± 1.109 | 2.101 ± 1.014 | 6.303 ± 1.386 | 0.42 ± 0.275 | 2.101 ± 0.596 | 5.042 ± 0.951 | 2.101 ± 0.561 | 2.101 ± 0.401 | 2.941 ± 1.068 | 3.361 ± 1.114 | 3.361 ± 0.646 | 1.261 ± 0.548 | 1.681 ± 0.816 | 0.0 ± 0.0
Lys | 2.101 ± 1.376 | 0.42 ± 0.453 | 2.101 ± 1.014 | 2.941 ± 1.304 | 1.261 ± 0.601 | 3.782 ± 1.373 | 0.42 ± 0.275 | 1.261 ± 0.826 | 2.101 ± 0.922 | 1.681 ± 0.767 | 0.0 ± 0.0 | 1.261 ± 0.826 | 2.941 ± 0.735 | 0.42 ± 0.453 | 2.101 ± 0.903 | 2.941 ± 1.112 | 1.681 ± 0.619 | 2.101 ± 0.749 | 0.42 ± 0.275 | 0.84 ± 0.609 | 0.0 ± 0.0
Leu | 6.303 ± 0.839 | 1.261 ± 0.548 | 4.202 ± 0.77 | 4.622 ± 1.708 | 2.941 ± 1.202 | 4.622 ± 0.83 | 6.303 ± 0.706 | 3.782 ± 1.208 | 4.202 ± 2.752 | 10.504 ± 2.375 | 1.261 ± 0.548 | 4.202 ± 1.492 | 12.185 ± 1.926 | 8.403 ± 1.74 | 5.042 ± 0.827 | 2.101 ± 0.252 | 4.202 ± 0.852 | 5.882 ± 2.215 | 1.681 ± 0.549 | 1.681 ± 0.651 | 0.0 ± 0.0
Met | 2.101 ± 0.252 | 0.42 ± 0.453 | 1.261 ± 0.865 | 2.521 ± 0.722 | 1.261 ± 0.658 | 2.941 ± 0.191 | 0.84 ± 0.455 | 1.261 ± 0.429 | 0.42 ± 0.275 | 1.681 ± 0.808 | 0.42 ± 0.422 | 0.84 ± 0.309 | 0.42 ± 0.275 | 1.261 ± 0.77 | 2.101 ± 0.56 | 2.101 ± 0.252 | 1.261 ± 0.779 | 1.681 ± 0.267 | 0.0 ± 0.0 | 0.42 ± 0.422 | 0.0 ± 0.0
Asn | 2.521 ± 0.753 | 0.42 ± 0.453 | 3.361 ± 1.156 | 2.521 ± 0.682 | 2.101 ± 0.56 | 0.84 ± 0.55 | 0.84 ± 0.55 | 3.361 ± 0.868 | 0.84 ± 0.55 | 2.941 ± 1.068 | 1.261 ± 0.271 | 0.42 ± 0.422 | 2.941 ± 1.271 | 2.521 ± 0.928 | 4.202 ± 0.972 | 4.202 ± 1.684 | 4.202 ± 0.624 | 1.261 ± 0.429 | 1.261 ± 0.271 | 1.261 ± 0.417 | 0.0 ± 0.0
Pro | 10.924 ± 3.383 | 0.84 ± 0.455 | 2.941 ± 0.684 | 4.622 ± 2.719 | 2.101 ± 1.047 | 7.563 ± 3.296 | 2.101 ± 0.922 | 4.202 ± 0.59 | 0.84 ± 0.797 | 7.143 ± 1.946 | 1.261 ± 0.271 | 2.521 ± 0.98 | 14.286 ± 6.912 | 6.723 ± 2.901 | 6.303 ± 2.513 | 6.723 ± 2.437 | 2.941 ± 1.314 | 6.723 ± 1.477 | 0.84 ± 0.843 | 2.521 ± 0.963 | 0.0 ± 0.0
Gln | 2.101 ± 1.436 | 2.521 ± 1.063 | 2.101 ± 1.436 | 2.101 ± 0.252 | 0.42 ± 0.453 | 5.462 ± 2.072 | 1.681 ± 0.267 | 2.101 ± 0.561 | 2.521 ± 1.318 | 5.462 ± 1.708 | 0.84 ± 0.404 | 4.622 ± 1.401 | 6.723 ± 3.81 | 7.563 ± 3.351 | 3.782 ± 0.941 | 2.521 ± 0.722 | 3.361 ± 1.114 | 3.361 ± 1.012 | 1.261 ± 0.601 | 1.681 ± 0.767 | 0.0 ± 0.0
Arg | 3.361 ± 0.354 | 1.261 ± 0.429 | 3.782 ± 1.276 | 2.941 ± 1.112 | 1.681 ± 0.813 | 3.782 ± 1.956 | 2.941 ± 0.469 | 1.681 ± 1.13 | 1.261 ± 0.826 | 3.361 ± 1.302 | 2.521 ± 0.833 | 2.941 ± 0.577 | 5.462 ± 0.933 | 1.261 ± 0.934 | 3.361 ± 0.835 | 6.303 ± 0.731 | 3.361 ± 1.527 | 3.361 ± 0.681 | 0.84 ± 0.404 | 1.681 ± 1.594 | 0.0 ± 0.0
Ser | 4.622 ± 1.659 | 2.101 ± 0.895 | 4.202 ± 1.539 | 3.361 ± 0.772 | 1.681 ± 0.588 | 3.782 ± 0.864 | 1.261 ± 0.548 | 2.941 ± 0.987 | 2.101 ± 0.749 | 5.042 ± 0.724 | 3.361 ± 1.186 | 2.521 ± 0.724 | 5.882 ± 2.672 | 4.202 ± 0.77 | 3.782 ± 1.01 | 7.983 ± 2.802 | 4.622 ± 1.318 | 2.101 ± 0.56 | 2.521 ± 0.971 | 3.782 ± 0.84 | 0.0 ± 0.0
Thr | 3.782 ± 0.77 | 0.84 ± 0.404 | 2.101 ± 0.596 | 2.941 ± 0.952 | 0.84 ± 0.404 | 3.782 ± 0.953 | 0.84 ± 0.55 | 1.261 ± 0.429 | 2.521 ± 1.273 | 7.563 ± 0.891 | 1.681 ± 0.767 | 3.361 ± 0.68 | 4.622 ± 0.854 | 4.622 ± 1.856 | 2.941 ± 1.202 | 4.622 ± 0.901 | 5.462 ± 1.969 | 2.521 ± 0.542 | 0.84 ± 0.309 | 2.101 ± 0.696 | 0.0 ± 0.0
Val | 0.84 ± 0.797 | 2.101 ± 0.724 | 2.521 ± 0.963 | 2.101 ± 0.903 | 1.681 ± 0.651 | 2.521 ± 0.971 | 2.101 ± 0.724 | 5.462 ± 1.231 | 2.521 ± 0.834 | 6.723 ± 2.064 | 1.261 ± 0.779 | 2.101 ± 0.724 | 4.622 ± 1.622 | 2.521 ± 0.928 | 2.941 ± 0.469 | 3.782 ± 1.332 | 3.782 ± 1.287 | 2.941 ± 0.462 | 0.84 ± 0.476 | 1.681 ± 0.767 | 0.0 ± 0.0
Trp | 0.0 ± 0.0 | 0.42 ± 0.275 | 0.42 ± 0.275 | 2.101 ± 1.167 | 0.0 ± 0.0 | 2.941 ± 1.309 | 0.42 ± 0.453 | 2.101 ± 0.401 | 0.42 ± 0.422 | 0.84 ± 0.55 | 0.42 ± 0.399 | 2.521 ± 0.9 | 0.84 ± 0.55 | 0.42 ± 0.275 | 1.261 ± 0.601 | 0.42 ± 0.275 | 1.681 ± 0.901 | 2.101 ± 2.108 | 0.0 ± 0.0 | 1.261 ± 0.531 | 0.0 ± 0.0
Tyr | 1.261 ± 0.417 | 0.42 ± 0.422 | 1.261 ± 0.601 | 1.681 ± 1.185 | 1.681 ± 0.767 | 2.941 ± 0.988 | 0.42 ± 0.399 | 0.42 ± 0.275 | 2.521 ± 0.971 | 2.941 ± 1.069 | 0.0 ± 0.0 | 2.521 ± 1.318 | 0.84 ± 0.609 | 0.84 ± 0.309 | 1.681 ± 0.373 | 2.101 ± 0.903 | 2.941 ± 1.304 | 2.941 ± 0.191 | 0.0 ± 0.0 | 2.521 ± 1.273 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 4 proteins (2381 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
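As an illustration of what bootstrapping at the protein level can look like, here is a minimal Python sketch. It rests on assumptions that the page does not confirm: whole proteins are resampled with replacement, the statistic is recomputed for each of 100 resamples, and the reported error is taken to be the standard deviation of those estimates. The names `pair_per_mille` and `bootstrap_error` are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(sequences, pair):
    """Frequency of one dipeptide, in per mille, over all overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Standard deviation of the statistic over protein-level resamples.

    Assumption: each resample draws len(sequences) whole proteins with
    replacement, so residues within a protein always stay together.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        resample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(resample, pair))
    return statistics.stdev(estimates)


# Toy sequences for illustration only; they are not taken from this proteome.
toy = ["MPPLLPA", "MPPQ", "AAGLPPS", "MKLV"]
print(pair_per_mille(toy, "PP"), bootstrap_error(toy, "PP"))
```

Under this scheme a dipeptide that never occurs in any protein gets 0.0 ± 0.0, which matches the zero entries in the table. With only four proteins, each resample often omits one or more of them, so the estimated errors can be large relative to the frequencies.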

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski