Amino acid dipeptide frequency for Chimpanzee faeces associated microphage 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
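As a minimal sketch of how such a frequency could be computed, the snippet below counts adjacent residue pairs in a list of protein sequences and reports them in per mille. The function name, the one-letter input format, and the choice not to count pairs across protein boundaries are assumptions made for illustration; the page does not describe how Proteome-pI handles these details.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: pairs are counted within each protein only, so no
    dipeptide spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along the sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input: made-up sequences in one-letter code, not taken from this proteome.
toy = ["MKVAA", "AAS"]
freqs = dipeptide_frequencies(toy)
```

For the toy input there are six dipeptides in total (MK, KV, VA, AA, AA, AS), so AA accounts for two of six.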

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
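The arithmetic behind the expected value can be sketched as follows. The helper names are hypothetical, and the sketch assumes that over- and underrepresentation are judged against the uniform expectation; the page does not state the exact threshold used for the colouring. The two example values are taken from the table (SerVal and CysCys).

```python
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes

def expected_uniform_permille(alphabet=STANDARD):
    """Expected per-mille frequency of each dipeptide if all were equally likely."""
    n_pairs = sum(1 for _ in product(alphabet, repeat=2))
    return 1000.0 / n_pairs

def classify(freq_permille, expected):
    """Label a frequency relative to the uniform expectation (assumed threshold)."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"

expected = expected_uniform_permille()               # 400 pairs: 2.5 per mille, i.e. 0.25%
expected_with_xaa = expected_uniform_permille(STANDARD + "X")  # 441 pairs

ser_val = classify(11.981, expected)  # SerVal value from the table
cys_cys = classify(0.0, expected)     # CysCys value from the table
```

With Xaa included as a 21st symbol, the uniform expectation drops slightly, to 1000/441 per mille.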

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions; on this scale, the random expectation of 0.25% corresponds to 2.5‰.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.39 ± 3.884
AlaCys: 1.597 ± 1.1
AlaAsp: 7.188 ± 3.095
AlaGlu: 2.396 ± 1.618
AlaPhe: 3.994 ± 1.379
AlaGly: 3.994 ± 1.626
AlaHis: 3.195 ± 1.288
AlaIle: 3.195 ± 1.732
AlaLys: 1.597 ± 1.124
AlaLeu: 7.987 ± 2.425
AlaMet: 1.597 ± 1.055
AlaAsn: 7.188 ± 3.366
AlaPro: 2.396 ± 0.92
AlaGln: 3.994 ± 1.165
AlaArg: 6.39 ± 1.97
AlaSer: 8.786 ± 3.115
AlaThr: 2.396 ± 0.92
AlaVal: 6.39 ± 2.31
AlaTrp: 0.799 ± 0.787
AlaTyr: 3.195 ± 1.35
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.799 ± 0.561
CysCys: 0.0 ± 0.0
CysAsp: 2.396 ± 1.33
CysGlu: 1.597 ± 1.1
CysPhe: 0.799 ± 0.787
CysGly: 0.799 ± 0.787
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.799 ± 0.561
CysLeu: 3.195 ± 2.062
CysMet: 0.0 ± 0.0
CysAsn: 0.799 ± 0.959
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 1.597 ± 0.624
CysSer: 0.799 ± 0.561
CysThr: 0.799 ± 0.787
CysVal: 0.799 ± 0.787
CysTrp: 0.799 ± 0.787
CysTyr: 0.799 ± 0.787
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.591 ± 1.215
AspCys: 1.597 ± 1.1
AspAsp: 3.994 ± 0.856
AspGlu: 3.195 ± 1.35
AspPhe: 5.591 ± 2.15
AspGly: 3.994 ± 1.43
AspHis: 1.597 ± 1.015
AspIle: 2.396 ± 0.88
AspLys: 5.591 ± 2.044
AspLeu: 6.39 ± 0.426
AspMet: 1.597 ± 0.624
AspAsn: 3.195 ± 1.248
AspPro: 3.195 ± 1.082
AspGln: 0.0 ± 0.0
AspArg: 1.597 ± 0.541
AspSer: 5.591 ± 1.483
AspThr: 3.195 ± 1.326
AspVal: 2.396 ± 1.33
AspTrp: 0.0 ± 0.0
AspTyr: 3.195 ± 1.082
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.195 ± 1.699
GluCys: 0.799 ± 0.787
GluAsp: 2.396 ± 1.908
GluGlu: 0.799 ± 0.787
GluPhe: 2.396 ± 1.721
GluGly: 1.597 ± 1.055
GluHis: 2.396 ± 0.889
GluIle: 1.597 ± 1.123
GluLys: 2.396 ± 1.908
GluLeu: 2.396 ± 0.597
GluMet: 0.799 ± 1.011
GluAsn: 2.396 ± 1.132
GluPro: 2.396 ± 1.893
GluGln: 2.396 ± 1.33
GluArg: 4.792 ± 0.973
GluSer: 2.396 ± 0.597
GluThr: 0.0 ± 0.0
GluVal: 3.994 ± 1.621
GluTrp: 0.0 ± 0.0
GluTyr: 3.994 ± 1.43
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.396 ± 1.305
PheCys: 0.799 ± 0.561
PheAsp: 3.994 ± 0.499
PheGlu: 0.799 ± 0.787
PhePhe: 3.994 ± 1.34
PheGly: 5.591 ± 1.26
PheHis: 1.597 ± 0.624
PheIle: 1.597 ± 0.624
PheLys: 2.396 ± 0.889
PheLeu: 1.597 ± 1.015
PheMet: 0.799 ± 0.561
PheAsn: 0.0 ± 0.0
PhePro: 1.597 ± 1.574
PheGln: 2.396 ± 0.843
PheArg: 2.396 ± 1.305
PheSer: 3.994 ± 1.301
PheThr: 4.792 ± 1.3
PheVal: 3.195 ± 1.326
PheTrp: 0.799 ± 0.959
PheTyr: 0.799 ± 0.561
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.792 ± 3.211
GlyCys: 1.597 ± 1.574
GlyAsp: 3.994 ± 0.825
GlyGlu: 7.987 ± 2.14
GlyPhe: 3.195 ± 1.699
GlyGly: 7.188 ± 1.541
GlyHis: 0.0 ± 0.0
GlyIle: 3.195 ± 1.326
GlyLys: 7.188 ± 1.792
GlyLeu: 7.188 ± 3.008
GlyMet: 1.597 ± 0.951
GlyAsn: 3.994 ± 0.825
GlyPro: 0.0 ± 0.0
GlyGln: 0.799 ± 0.711
GlyArg: 1.597 ± 0.624
GlySer: 4.792 ± 1.463
GlyThr: 5.591 ± 2.341
GlyVal: 6.39 ± 1.039
GlyTrp: 2.396 ± 0.597
GlyTyr: 1.597 ± 0.541
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.799 ± 0.787
HisCys: 0.0 ± 0.0
HisAsp: 2.396 ± 1.305
HisGlu: 2.396 ± 2.361
HisPhe: 1.597 ± 1.123
HisGly: 0.799 ± 0.561
HisHis: 0.799 ± 0.561
HisIle: 1.597 ± 1.1
HisLys: 0.0 ± 0.0
HisLeu: 1.597 ± 0.624
HisMet: 0.799 ± 0.49
HisAsn: 0.0 ± 0.0
HisPro: 0.799 ± 0.787
HisGln: 0.799 ± 0.711
HisArg: 0.799 ± 0.787
HisSer: 0.0 ± 0.0
HisThr: 0.0 ± 0.0
HisVal: 0.799 ± 0.561
HisTrp: 0.799 ± 0.561
HisTyr: 0.799 ± 0.787
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.396 ± 0.92
IleCys: 0.0 ± 0.0
IleAsp: 3.195 ± 0.479
IleGlu: 0.799 ± 0.561
IlePhe: 0.0 ± 0.0
IleGly: 6.39 ± 0.426
IleHis: 0.0 ± 0.0
IleIle: 0.799 ± 0.959
IleLys: 1.597 ± 1.123
IleLeu: 2.396 ± 0.92
IleMet: 0.0 ± 0.0
IleAsn: 2.396 ± 1.33
IlePro: 2.396 ± 0.889
IleGln: 0.799 ± 0.959
IleArg: 0.0 ± 0.0
IleSer: 1.597 ± 1.123
IleThr: 2.396 ± 0.88
IleVal: 1.597 ± 1.123
IleTrp: 1.597 ± 1.123
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.987 ± 4.392
LysCys: 1.597 ± 0.624
LysAsp: 3.994 ± 1.626
LysGlu: 1.597 ± 1.123
LysPhe: 3.994 ± 0.499
LysGly: 1.597 ± 1.422
LysHis: 0.0 ± 0.0
LysIle: 0.0 ± 0.0
LysLys: 3.195 ± 2.232
LysLeu: 1.597 ± 0.541
LysMet: 0.799 ± 0.608
LysAsn: 0.799 ± 0.711
LysPro: 3.195 ± 0.985
LysGln: 0.799 ± 0.711
LysArg: 4.792 ± 1.727
LysSer: 4.792 ± 1.686
LysThr: 1.597 ± 1.015
LysVal: 3.195 ± 2.843
LysTrp: 0.0 ± 0.0
LysTyr: 1.597 ± 1.1
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.987 ± 1.68
LeuCys: 0.799 ± 0.561
LeuAsp: 3.195 ± 1.288
LeuGlu: 4.792 ± 2.271
LeuPhe: 2.396 ± 1.655
LeuGly: 7.987 ± 1.483
LeuHis: 1.597 ± 1.574
LeuIle: 2.396 ± 1.684
LeuLys: 3.195 ± 1.811
LeuLeu: 1.597 ± 0.541
LeuMet: 3.195 ± 2.111
LeuAsn: 3.994 ± 1.854
LeuPro: 7.188 ± 2.068
LeuGln: 2.396 ± 0.889
LeuArg: 7.987 ± 2.266
LeuSer: 6.39 ± 1.516
LeuThr: 5.591 ± 1.215
LeuVal: 5.591 ± 1.68
LeuTrp: 1.597 ± 1.055
LeuTyr: 2.396 ± 0.92
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.396 ± 1.618
MetCys: 0.799 ± 0.787
MetAsp: 2.396 ± 0.597
MetGlu: 1.597 ± 1.055
MetPhe: 0.0 ± 0.0
MetGly: 0.799 ± 0.561
MetHis: 0.799 ± 0.787
MetIle: 1.597 ± 1.123
MetLys: 0.799 ± 0.959
MetLeu: 1.597 ± 1.055
MetMet: 0.799 ± 0.711
MetAsn: 1.597 ± 0.541
MetPro: 0.799 ± 0.561
MetGln: 0.799 ± 0.561
MetArg: 1.597 ± 1.574
MetSer: 3.994 ± 0.994
MetThr: 0.799 ± 0.561
MetVal: 0.799 ± 0.787
MetTrp: 0.799 ± 0.561
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.994 ± 2.891
AsnCys: 0.0 ± 0.0
AsnAsp: 1.597 ± 1.123
AsnGlu: 0.0 ± 0.0
AsnPhe: 1.597 ± 1.123
AsnGly: 3.994 ± 2.096
AsnHis: 0.799 ± 0.561
AsnIle: 1.597 ± 1.123
AsnLys: 1.597 ± 0.541
AsnLeu: 6.39 ± 2.048
AsnMet: 0.799 ± 0.561
AsnAsn: 4.792 ± 1.686
AsnPro: 3.195 ± 1.288
AsnGln: 1.597 ± 1.422
AsnArg: 2.396 ± 0.92
AsnSer: 4.792 ± 2.271
AsnThr: 4.792 ± 1.633
AsnVal: 2.396 ± 0.843
AsnTrp: 0.799 ± 0.561
AsnTyr: 1.597 ± 1.123
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.396 ± 0.88
ProCys: 0.799 ± 0.787
ProAsp: 2.396 ± 0.843
ProGlu: 0.799 ± 0.561
ProPhe: 1.597 ± 0.624
ProGly: 2.396 ± 0.843
ProHis: 0.799 ± 0.787
ProIle: 2.396 ± 1.132
ProLys: 1.597 ± 0.624
ProLeu: 6.39 ± 1.616
ProMet: 1.597 ± 0.624
ProAsn: 0.799 ± 0.711
ProPro: 3.195 ± 2.824
ProGln: 5.591 ± 2.311
ProArg: 0.799 ± 0.787
ProSer: 9.585 ± 3.983
ProThr: 3.994 ± 2.267
ProVal: 2.396 ± 1.33
ProTrp: 0.799 ± 0.561
ProTyr: 0.799 ± 0.561
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.195 ± 1.799
GlnCys: 0.799 ± 0.959
GlnAsp: 1.597 ± 1.123
GlnGlu: 2.396 ± 0.843
GlnPhe: 2.396 ± 0.889
GlnGly: 2.396 ± 0.843
GlnHis: 0.0 ± 0.0
GlnIle: 0.799 ± 0.561
GlnLys: 1.597 ± 1.123
GlnLeu: 3.195 ± 2.111
GlnMet: 1.597 ± 0.624
GlnAsn: 1.597 ± 0.541
GlnPro: 0.799 ± 0.711
GlnGln: 1.597 ± 0.541
GlnArg: 3.195 ± 1.029
GlnSer: 3.994 ± 1.874
GlnThr: 0.799 ± 0.561
GlnVal: 0.799 ± 0.561
GlnTrp: 0.799 ± 0.711
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.994 ± 1.301
ArgCys: 0.799 ± 0.787
ArgAsp: 3.994 ± 1.187
ArgGlu: 1.597 ± 1.055
ArgPhe: 0.0 ± 0.0
ArgGly: 3.195 ± 1.029
ArgHis: 0.799 ± 0.561
ArgIle: 2.396 ± 1.242
ArgLys: 4.792 ± 3.166
ArgLeu: 7.987 ± 2.823
ArgMet: 3.195 ± 1.288
ArgAsn: 2.396 ± 0.889
ArgPro: 3.195 ± 1.35
ArgGln: 0.0 ± 0.0
ArgArg: 3.195 ± 1.288
ArgSer: 5.591 ± 0.555
ArgThr: 3.195 ± 1.248
ArgVal: 3.195 ± 1.288
ArgTrp: 2.396 ± 0.843
ArgTyr: 4.792 ± 1.3
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 11.182 ± 2.092
SerCys: 0.799 ± 0.561
SerAsp: 4.792 ± 0.973
SerGlu: 3.994 ± 1.976
SerPhe: 3.195 ± 1.248
SerGly: 7.987 ± 1.649
SerHis: 0.0 ± 0.0
SerIle: 2.396 ± 0.843
SerLys: 0.799 ± 0.711
SerLeu: 3.994 ± 0.499
SerMet: 1.597 ± 1.123
SerAsn: 6.39 ± 1.695
SerPro: 4.792 ± 1.536
SerGln: 1.597 ± 1.055
SerArg: 6.39 ± 1.516
SerSer: 11.182 ± 3.536
SerThr: 4.792 ± 3.368
SerVal: 11.981 ± 3.142
SerTrp: 2.396 ± 1.618
SerTyr: 3.994 ± 1.621
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.39 ± 2.048
ThrCys: 1.597 ± 1.123
ThrAsp: 4.792 ± 1.9
ThrGlu: 1.597 ± 1.015
ThrPhe: 1.597 ± 0.624
ThrGly: 3.994 ± 1.379
ThrHis: 0.0 ± 0.0
ThrIle: 0.799 ± 0.561
ThrLys: 1.597 ± 0.541
ThrLeu: 5.591 ± 1.782
ThrMet: 1.597 ± 0.624
ThrAsn: 0.799 ± 0.561
ThrPro: 3.994 ± 2.657
ThrGln: 2.396 ± 0.843
ThrArg: 4.792 ± 0.973
ThrSer: 4.792 ± 3.368
ThrThr: 6.39 ± 2.652
ThrVal: 0.799 ± 0.959
ThrTrp: 2.396 ± 1.721
ThrTyr: 0.799 ± 0.561
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.396 ± 1.132
ValCys: 0.799 ± 0.561
ValAsp: 1.597 ± 1.1
ValGlu: 2.396 ± 0.88
ValPhe: 4.792 ± 1.366
ValGly: 5.591 ± 0.978
ValHis: 1.597 ± 1.574
ValIle: 1.597 ± 1.015
ValLys: 3.994 ± 2.943
ValLeu: 5.591 ± 1.805
ValMet: 0.799 ± 0.711
ValAsn: 0.799 ± 0.787
ValPro: 7.188 ± 3.008
ValGln: 2.396 ± 0.88
ValArg: 3.195 ± 0.762
ValSer: 5.591 ± 1.239
ValThr: 1.597 ± 1.015
ValVal: 3.994 ± 2.267
ValTrp: 2.396 ± 0.843
ValTyr: 3.195 ± 1.082
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.396 ± 0.88
TrpCys: 0.0 ± 0.0
TrpAsp: 0.799 ± 0.561
TrpGlu: 2.396 ± 1.132
TrpPhe: 0.799 ± 0.787
TrpGly: 0.799 ± 0.711
TrpHis: 0.799 ± 0.561
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 3.195 ± 1.155
TrpMet: 0.0 ± 0.0
TrpAsn: 1.597 ± 1.123
TrpPro: 0.799 ± 0.561
TrpGln: 0.799 ± 0.711
TrpArg: 0.799 ± 0.711
TrpSer: 2.396 ± 0.889
TrpThr: 2.396 ± 0.88
TrpVal: 0.799 ± 0.787
TrpTrp: 0.0 ± 0.0
TrpTyr: 2.396 ± 0.597
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.994 ± 0.499
TyrCys: 1.597 ± 1.574
TyrAsp: 3.195 ± 1.155
TyrGlu: 0.799 ± 0.787
TyrPhe: 1.597 ± 1.123
TyrGly: 3.994 ± 1.187
TyrHis: 0.799 ± 0.787
TyrIle: 0.0 ± 0.0
TyrLys: 2.396 ± 1.33
TyrLeu: 2.396 ± 0.597
TyrMet: 0.799 ± 0.561
TyrAsn: 3.195 ± 1.155
TyrPro: 0.0 ± 0.0
TyrGln: 2.396 ± 0.843
TyrArg: 2.396 ± 1.684
TyrSer: 3.195 ± 1.326
TyrThr: 1.597 ± 0.541
TyrVal: 0.0 ± 0.0
TyrTrp: 1.597 ± 0.624
TyrTyr: 3.195 ± 2.459
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1253 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
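One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of those estimates is reported. This is an illustrative sketch only. The function names, the use of the sample standard deviation as the error, and the fixed seed are assumptions; the page does not specify how Proteome-pI derives the reported ± values.

```python
import random
import statistics
from collections import Counter

def dipeptide_permille(proteins):
    """Per-mille frequency of each dipeptide, counted within proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {p: 1000.0 * n / total for p, n in counts.items()} if total else {}

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of one dipeptide's frequency over protein-level resamples.

    Assumption: the error is the sample standard deviation of the
    bootstrap estimates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample).get(pair, 0.0))
    return statistics.stdev(estimates)

# Toy input: made-up sequences in one-letter code, not this proteome.
toy = ["MKVAA", "AAS", "GGSA"]
err_aa = bootstrap_error(toy, "AA")
err_ww = bootstrap_error(toy, "WW")  # dipeptide absent from every protein
```

A dipeptide that occurs in none of the proteins has a frequency of zero in every resample, so its bootstrap spread is zero as well, which is consistent with the 0.0 ± 0.0 entries in the table.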

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski