Amino acid dipeptide frequency for Escherichia phage MS2 (Bacteriophage MS2)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
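As a rough illustration of the idea, the following Python sketch counts overlapping pairs of adjacent residues and reports them in per mille. It is only a sketch under stated assumptions: the function name `dipeptide_per_mille` is hypothetical, sequences use one-letter codes (the table uses three-letter codes), and pairs are never counted across protein boundaries. The database's exact counting convention is not stated on this page, so its values may differ slightly from what this sketch would give.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Hypothetical helper: counts overlapping windows of length 2 inside
    each protein; pairs never span two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Scale the relative frequency to per mille (parts per thousand).
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy sequences for illustration only, not the MS2 proteome.
freqs = dipeptide_per_mille(["MASNF", "MSKT"])
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

The two toy sequences contain seven distinct dipeptides, each seen once, so every entry comes out at 1000/7 per mille.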

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). Since this is not the case in nature, dipeptides that occur more often than expected are marked in red in the table, and underrepresented ones are marked in blue, for better visibility.
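The comparison against the random expectation can be sketched as follows. This assumes the reference value is simply the uniform expectation (1/400 = 0.25% = 2.5‰, or 1/441 when Xaa is included); the exact rule used to colour the table is not stated here, and the names `expected_per_mille` and `classify` are hypothetical.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / (alphabet_size ** 2)


def classify(value_per_mille, alphabet_size=20):
    """Label a table value relative to the uniform expectation.

    Assumption: "over" corresponds to red cells and "under" to blue
    cells; the page does not document its actual threshold.
    """
    expected = expected_per_mille(alphabet_size)
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"


print(expected_per_mille())      # uniform expectation for 20 amino acids
print(classify(11.384))          # LeuAla value from the table
print(classify(0.876))           # AlaCys value from the table
```

Under this assumption LeuAla (11.384‰) counts as overrepresented and AlaCys (0.876‰) as underrepresented.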

All values are given in per mille (‰); multiply by 10⁻³ to obtain fractions (for example, 7.005‰ = 0.007005).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.005 ± 1.381
AlaCys: 0.876 ± 0.574
AlaAsp: 2.627 ± 0.708
AlaGlu: 2.627 ± 0.825
AlaPhe: 1.751 ± 0.394
AlaGly: 3.503 ± 0.788
AlaHis: 0.876 ± 0.574
AlaIle: 4.378 ± 1.735
AlaLys: 2.627 ± 0.959
AlaLeu: 9.632 ± 3.202
AlaMet: 1.751 ± 1.244
AlaAsn: 4.378 ± 2.031
AlaPro: 6.13 ± 0.879
AlaGln: 2.627 ± 0.825
AlaArg: 5.254 ± 1.918
AlaSer: 6.13 ± 0.837
AlaThr: 7.005 ± 2.124
AlaVal: 3.503 ± 0.98
AlaTrp: 2.627 ± 0.825
AlaTyr: 6.13 ± 1.827
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.876 ± 0.574
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.876 ± 1.088
CysPhe: 0.0 ± 0.0
CysGly: 1.751 ± 1.149
CysHis: 0.0 ± 0.0
CysIle: 0.876 ± 0.574
CysLys: 0.876 ± 0.574
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 0.876 ± 0.685
CysPro: 0.876 ± 0.574
CysGln: 0.0 ± 0.0
CysArg: 0.876 ± 1.205
CysSer: 1.751 ± 1.244
CysThr: 0.876 ± 0.574
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.876 ± 0.685
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.378 ± 2.295
AspCys: 0.876 ± 1.088
AspAsp: 1.751 ± 1.149
AspGlu: 1.751 ± 1.149
AspPhe: 0.876 ± 0.574
AspGly: 4.378 ± 0.7
AspHis: 0.0 ± 0.0
AspIle: 1.751 ± 0.394
AspLys: 0.876 ± 0.574
AspLeu: 4.378 ± 2.872
AspMet: 0.876 ± 0.574
AspAsn: 2.627 ± 0.825
AspPro: 2.627 ± 1.723
AspGln: 2.627 ± 0.708
AspArg: 5.254 ± 1.416
AspSer: 2.627 ± 1.723
AspThr: 0.876 ± 0.574
AspVal: 3.503 ± 0.69
AspTrp: 0.876 ± 0.685
AspTyr: 1.751 ± 1.146
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.378 ± 1.223
GluCys: 0.876 ± 0.574
GluAsp: 2.627 ± 1.794
GluGlu: 0.876 ± 0.685
GluPhe: 1.751 ± 0.394
GluGly: 1.751 ± 0.394
GluHis: 0.0 ± 0.0
GluIle: 2.627 ± 1.723
GluLys: 2.627 ± 0.959
GluLeu: 7.005 ± 2.625
GluMet: 0.876 ± 0.685
GluAsn: 0.876 ± 0.685
GluPro: 0.876 ± 0.574
GluGln: 0.876 ± 0.574
GluArg: 1.751 ± 0.394
GluSer: 3.503 ± 1.228
GluThr: 3.503 ± 1.339
GluVal: 2.627 ± 1.688
GluTrp: 1.751 ± 0.981
GluTyr: 0.0 ± 0.0
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.503 ± 1.962
PheCys: 0.0 ± 0.0
PheAsp: 0.0 ± 0.0
PheGlu: 0.876 ± 0.574
PhePhe: 1.751 ± 1.149
PheGly: 4.378 ± 0.992
PheHis: 0.876 ± 0.574
PheIle: 0.876 ± 0.574
PheLys: 1.751 ± 1.146
PheLeu: 4.378 ± 2.079
PheMet: 0.0 ± 0.0
PheAsn: 2.627 ± 0.959
PhePro: 2.627 ± 1.041
PheGln: 0.876 ± 0.685
PheArg: 3.503 ± 1.228
PheSer: 7.005 ± 2.247
PheThr: 4.378 ± 1.735
PheVal: 3.503 ± 1.158
PheTrp: 0.876 ± 0.574
PheTyr: 2.627 ± 0.708
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.005 ± 2.247
GlyCys: 0.876 ± 0.685
GlyAsp: 3.503 ± 1.581
GlyGlu: 3.503 ± 0.788
GlyPhe: 5.254 ± 1.416
GlyGly: 4.378 ± 2.097
GlyHis: 0.876 ± 0.574
GlyIle: 5.254 ± 0.549
GlyLys: 0.0 ± 0.0
GlyLeu: 2.627 ± 0.825
GlyMet: 0.876 ± 0.574
GlyAsn: 7.881 ± 0.961
GlyPro: 0.876 ± 0.574
GlyGln: 1.751 ± 1.149
GlyArg: 5.254 ± 1.918
GlySer: 4.378 ± 1.781
GlyThr: 5.254 ± 1.202
GlyVal: 7.881 ± 1.577
GlyTrp: 2.627 ± 0.959
GlyTyr: 0.876 ± 0.574
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.876 ± 0.574
HisCys: 0.0 ± 0.0
HisAsp: 1.751 ± 1.149
HisGlu: 0.876 ± 1.205
HisPhe: 1.751 ± 1.149
HisGly: 0.0 ± 0.0
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 0.876 ± 0.574
HisLeu: 0.876 ± 0.685
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 0.876 ± 0.574
HisGln: 0.0 ± 0.0
HisArg: 0.876 ± 0.685
HisSer: 0.876 ± 0.574
HisThr: 1.751 ± 0.394
HisVal: 0.876 ± 0.685
HisTrp: 0.0 ± 0.0
HisTyr: 1.751 ± 0.394
HisXaa: 0.0 ± 0.0
Ile
IleAla: 7.005 ± 2.502
IleCys: 0.876 ± 0.574
IleAsp: 4.378 ± 1.781
IleGlu: 0.876 ± 0.685
IlePhe: 3.503 ± 2.601
IleGly: 1.751 ± 1.149
IleHis: 0.876 ± 0.574
IleIle: 1.751 ± 0.394
IleLys: 2.627 ± 0.825
IleLeu: 1.751 ± 0.394
IleMet: 0.0 ± 0.0
IleAsn: 2.627 ± 0.708
IlePro: 1.751 ± 2.175
IleGln: 0.876 ± 0.685
IleArg: 7.005 ± 2.658
IleSer: 4.378 ± 1.714
IleThr: 1.751 ± 0.394
IleVal: 4.378 ± 0.7
IleTrp: 0.876 ± 0.685
IleTyr: 2.627 ± 1.184
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.378 ± 1.714
LysCys: 0.0 ± 0.0
LysAsp: 0.876 ± 1.088
LysGlu: 0.876 ± 0.574
LysPhe: 4.378 ± 1.223
LysGly: 0.876 ± 0.574
LysHis: 3.503 ± 0.98
LysIle: 0.876 ± 0.574
LysLys: 2.627 ± 1.723
LysLeu: 2.627 ± 0.708
LysMet: 0.0 ± 0.0
LysAsn: 0.876 ± 0.574
LysPro: 2.627 ± 1.723
LysGln: 0.876 ± 0.685
LysArg: 0.876 ± 0.685
LysSer: 3.503 ± 1.228
LysThr: 3.503 ± 1.228
LysVal: 6.13 ± 3.265
LysTrp: 0.0 ± 0.0
LysTyr: 1.751 ± 1.244
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 11.384 ± 1.804
LeuCys: 0.876 ± 0.574
LeuAsp: 4.378 ± 2.295
LeuGlu: 5.254 ± 0.957
LeuPhe: 4.378 ± 1.781
LeuGly: 3.503 ± 1.62
LeuHis: 0.876 ± 0.574
LeuIle: 4.378 ± 1.183
LeuLys: 2.627 ± 0.825
LeuLeu: 8.757 ± 4.407
LeuMet: 1.751 ± 0.394
LeuAsn: 4.378 ± 0.7
LeuPro: 5.254 ± 1.202
LeuGln: 6.13 ± 1.13
LeuArg: 5.254 ± 2.345
LeuSer: 9.632 ± 1.885
LeuThr: 6.13 ± 1.502
LeuVal: 4.378 ± 0.965
LeuTrp: 0.0 ± 0.0
LeuTyr: 2.627 ± 1.354
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.876 ± 1.088
MetCys: 0.0 ± 0.0
MetAsp: 0.0 ± 0.0
MetGlu: 1.751 ± 1.761
MetPhe: 0.876 ± 0.574
MetGly: 1.751 ± 1.149
MetHis: 0.876 ± 0.685
MetIle: 0.876 ± 0.574
MetLys: 0.876 ± 0.574
MetLeu: 2.627 ± 0.959
MetMet: 0.0 ± 0.0
MetAsn: 0.876 ± 0.574
MetPro: 0.0 ± 0.0
MetGln: 0.876 ± 1.088
MetArg: 1.751 ± 1.371
MetSer: 3.503 ± 0.788
MetThr: 0.0 ± 0.0
MetVal: 0.876 ± 0.685
MetTrp: 0.0 ± 0.0
MetTyr: 0.876 ± 0.574
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 0.876 ± 0.574
AsnCys: 0.0 ± 0.0
AsnAsp: 2.627 ± 0.708
AsnGlu: 2.627 ± 0.959
AsnPhe: 3.503 ± 2.487
AsnGly: 5.254 ± 2.368
AsnHis: 0.0 ± 0.0
AsnIle: 1.751 ± 1.371
AsnLys: 0.876 ± 0.574
AsnLeu: 4.378 ± 0.992
AsnMet: 2.627 ± 0.825
AsnAsn: 0.876 ± 0.574
AsnPro: 1.751 ± 1.244
AsnGln: 1.751 ± 1.146
AsnArg: 4.378 ± 1.183
AsnSer: 5.254 ± 3.046
AsnThr: 0.0 ± 0.0
AsnVal: 0.876 ± 0.685
AsnTrp: 1.751 ± 1.371
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.627 ± 1.041
ProCys: 0.876 ± 1.205
ProAsp: 1.751 ± 1.149
ProGlu: 0.876 ± 0.574
ProPhe: 3.503 ± 1.339
ProGly: 0.876 ± 0.685
ProHis: 0.876 ± 0.574
ProIle: 1.751 ± 2.175
ProLys: 1.751 ± 0.981
ProLeu: 1.751 ± 1.371
ProMet: 1.751 ± 0.394
ProAsn: 0.876 ± 0.574
ProPro: 1.751 ± 1.149
ProGln: 2.627 ± 1.041
ProArg: 4.378 ± 2.872
ProSer: 5.254 ± 1.651
ProThr: 3.503 ± 1.228
ProVal: 5.254 ± 1.202
ProTrp: 0.876 ± 0.574
ProTyr: 4.378 ± 1.297
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.627 ± 0.825
GlnCys: 1.751 ± 1.149
GlnAsp: 0.876 ± 0.574
GlnGlu: 1.751 ± 0.394
GlnPhe: 1.751 ± 1.244
GlnGly: 5.254 ± 2.34
GlnHis: 0.0 ± 0.0
GlnIle: 2.627 ± 0.708
GlnLys: 1.751 ± 0.394
GlnLeu: 2.627 ± 2.484
GlnMet: 0.0 ± 0.0
GlnAsn: 2.627 ± 1.688
GlnPro: 0.0 ± 0.0
GlnGln: 4.378 ± 4.658
GlnArg: 2.627 ± 1.041
GlnSer: 4.378 ± 1.183
GlnThr: 4.378 ± 1.636
GlnVal: 1.751 ± 0.394
GlnTrp: 0.876 ± 0.685
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.254 ± 1.183
ArgCys: 0.0 ± 0.0
ArgAsp: 2.627 ± 1.723
ArgGlu: 4.378 ± 1.781
ArgPhe: 1.751 ± 1.366
ArgGly: 5.254 ± 1.183
ArgHis: 0.876 ± 0.574
ArgIle: 2.627 ± 0.708
ArgLys: 3.503 ± 0.69
ArgLeu: 10.508 ± 3.683
ArgMet: 1.751 ± 0.474
ArgAsn: 0.0 ± 0.0
ArgPro: 1.751 ± 1.366
ArgGln: 5.254 ± 3.157
ArgArg: 7.005 ± 3.082
ArgSer: 9.632 ± 2.472
ArgThr: 3.503 ± 1.736
ArgVal: 2.627 ± 0.708
ArgTrp: 1.751 ± 0.394
ArgTyr: 2.627 ± 0.708
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.378 ± 1.679
SerCys: 1.751 ± 0.394
SerAsp: 5.254 ± 1.431
SerGlu: 2.627 ± 1.723
SerPhe: 2.627 ± 0.959
SerGly: 7.881 ± 1.922
SerHis: 0.876 ± 0.574
SerIle: 5.254 ± 1.416
SerLys: 2.627 ± 1.041
SerLeu: 9.632 ± 1.532
SerMet: 3.503 ± 1.228
SerAsn: 4.378 ± 2.972
SerPro: 3.503 ± 0.788
SerGln: 4.378 ± 1.636
SerArg: 3.503 ± 2.256
SerSer: 7.881 ± 1.517
SerThr: 6.13 ± 3.113
SerVal: 7.881 ± 1.021
SerTrp: 1.751 ± 0.394
SerTyr: 4.378 ± 0.7
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.254 ± 1.918
ThrCys: 1.751 ± 1.244
ThrAsp: 2.627 ± 0.708
ThrGlu: 1.751 ± 1.371
ThrPhe: 4.378 ± 1.781
ThrGly: 5.254 ± 1.202
ThrHis: 0.0 ± 0.0
ThrIle: 5.254 ± 1.65
ThrLys: 4.378 ± 0.992
ThrLeu: 5.254 ± 2.081
ThrMet: 1.751 ± 0.892
ThrAsn: 3.503 ± 2.859
ThrPro: 4.378 ± 1.223
ThrGln: 4.378 ± 2.031
ThrArg: 2.627 ± 1.354
ThrSer: 3.503 ± 1.228
ThrThr: 3.503 ± 1.339
ThrVal: 6.13 ± 2.015
ThrTrp: 0.0 ± 0.0
ThrTyr: 0.876 ± 0.685
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.005 ± 4.251
ValCys: 0.0 ± 0.0
ValAsp: 6.13 ± 0.879
ValGlu: 2.627 ± 2.234
ValPhe: 0.876 ± 0.574
ValGly: 7.005 ± 0.661
ValHis: 1.751 ± 1.371
ValIle: 3.503 ± 1.339
ValLys: 4.378 ± 0.7
ValLeu: 4.378 ± 1.183
ValMet: 0.0 ± 0.0
ValAsn: 0.0 ± 0.0
ValPro: 6.13 ± 0.782
ValGln: 0.876 ± 0.685
ValArg: 6.13 ± 0.879
ValSer: 3.503 ± 1.228
ValThr: 6.13 ± 2.714
ValVal: 3.503 ± 1.228
ValTrp: 3.503 ± 0.788
ValTyr: 1.751 ± 0.394
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.876 ± 0.574
TrpCys: 0.0 ± 0.0
TrpAsp: 0.876 ± 0.574
TrpGlu: 1.751 ± 0.394
TrpPhe: 1.751 ± 0.394
TrpGly: 1.751 ± 0.394
TrpHis: 0.876 ± 0.685
TrpIle: 1.751 ± 0.981
TrpLys: 0.0 ± 0.0
TrpLeu: 4.378 ± 2.295
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.876 ± 0.685
TrpGln: 0.0 ± 0.0
TrpArg: 1.751 ± 1.244
TrpSer: 0.876 ± 0.574
TrpThr: 1.751 ± 1.371
TrpVal: 0.876 ± 0.574
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.876 ± 0.685
TyrCys: 0.0 ± 0.0
TyrAsp: 0.0 ± 0.0
TyrGlu: 2.627 ± 0.959
TyrPhe: 0.0 ± 0.0
TyrGly: 4.378 ± 1.781
TyrHis: 0.0 ± 0.0
TyrIle: 3.503 ± 0.788
TyrLys: 3.503 ± 0.69
TyrLeu: 4.378 ± 0.965
TyrMet: 0.876 ± 1.159
TyrAsn: 0.876 ± 0.574
TyrPro: 2.627 ± 1.794
TyrGln: 0.876 ± 0.574
TyrArg: 2.627 ± 0.708
TyrSer: 2.627 ± 0.959
TyrThr: 2.627 ± 0.825
TyrVal: 2.627 ± 1.041
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.751 ± 1.149
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1143 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
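One way such a protein-level bootstrap could look is sketched below. This is not the database's actual procedure, which is not described beyond the note above: the sketch assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each replicate, and that the error is the standard deviation across replicates. The names `pair_per_mille` and `bootstrap_error` are hypothetical, and sequences use one-letter codes.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Per-mille frequency of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency across protein-level bootstrap replicates.

    Assumptions: proteins are drawn with replacement (same number as in
    the input) and the error is the population standard deviation.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)


# Toy sequences for illustration only, not the MS2 proteome.
toy = ["MASNF", "MSKT", "MAAK"]
print(pair_per_mille(toy, "MA"), bootstrap_error(toy, "MA"))
```

With only a handful of proteins, as in this proteome, each replicate can differ a lot from the original set, which is consistent with the relatively large errors in the table.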

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski