Amino acid dipepetide frequency for Gordonia phage Mahdia

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
16.154AlaAla: 16.154 ± 1.464
0.295AlaCys: 0.295 ± 0.143
8.188AlaAsp: 8.188 ± 0.915
9.81AlaGlu: 9.81 ± 0.986
2.434AlaPhe: 2.434 ± 0.663
9.737AlaGly: 9.737 ± 1.083
1.401AlaHis: 1.401 ± 0.365
3.541AlaIle: 3.541 ± 0.588
6.196AlaLys: 6.196 ± 0.548
8.852AlaLeu: 8.852 ± 0.84
2.655AlaMet: 2.655 ± 0.313
3.319AlaAsn: 3.319 ± 0.592
5.237AlaPro: 5.237 ± 0.653
3.393AlaGln: 3.393 ± 0.551
7.007AlaArg: 7.007 ± 0.713
7.155AlaSer: 7.155 ± 0.871
6.565AlaThr: 6.565 ± 0.77
8.63AlaVal: 8.63 ± 0.774
3.172AlaTrp: 3.172 ± 0.506
3.393AlaTyr: 3.393 ± 0.52
0.0AlaXaa: 0.0 ± 0.0
Cys
0.295CysAla: 0.295 ± 0.159
0.0CysCys: 0.0 ± 0.0
0.221CysAsp: 0.221 ± 0.108
0.148CysGlu: 0.148 ± 0.107
0.295CysPhe: 0.295 ± 0.138
0.664CysGly: 0.664 ± 0.233
0.369CysHis: 0.369 ± 0.147
0.074CysIle: 0.074 ± 0.069
0.074CysLys: 0.074 ± 0.088
0.369CysLeu: 0.369 ± 0.176
0.074CysMet: 0.074 ± 0.071
0.295CysAsn: 0.295 ± 0.145
0.221CysPro: 0.221 ± 0.148
0.369CysGln: 0.369 ± 0.161
0.443CysArg: 0.443 ± 0.215
0.443CysSer: 0.443 ± 0.248
0.221CysThr: 0.221 ± 0.131
0.295CysVal: 0.295 ± 0.152
0.074CysTrp: 0.074 ± 0.074
0.148CysTyr: 0.148 ± 0.103
0.0CysXaa: 0.0 ± 0.0
Asp
7.45AspAla: 7.45 ± 0.642
0.295AspCys: 0.295 ± 0.144
3.614AspAsp: 3.614 ± 0.714
2.655AspGlu: 2.655 ± 0.53
0.516AspPhe: 0.516 ± 0.265
5.901AspGly: 5.901 ± 0.674
2.065AspHis: 2.065 ± 0.424
3.024AspIle: 3.024 ± 0.442
2.065AspLys: 2.065 ± 0.328
6.86AspLeu: 6.86 ± 0.731
1.475AspMet: 1.475 ± 0.384
1.918AspAsn: 1.918 ± 0.403
5.975AspPro: 5.975 ± 0.648
2.139AspGln: 2.139 ± 0.44
4.057AspArg: 4.057 ± 0.531
2.213AspSer: 2.213 ± 0.417
4.942AspThr: 4.942 ± 0.59
3.319AspVal: 3.319 ± 0.497
1.992AspTrp: 1.992 ± 0.33
1.918AspTyr: 1.918 ± 0.47
0.0AspXaa: 0.0 ± 0.0
Glu
8.483GluAla: 8.483 ± 0.754
0.369GluCys: 0.369 ± 0.186
4.721GluAsp: 4.721 ± 0.677
4.426GluGlu: 4.426 ± 0.553
1.623GluPhe: 1.623 ± 0.366
4.868GluGly: 4.868 ± 0.516
1.033GluHis: 1.033 ± 0.287
3.688GluIle: 3.688 ± 0.538
0.516GluLys: 0.516 ± 0.175
6.491GluLeu: 6.491 ± 0.618
1.401GluMet: 1.401 ± 0.353
1.401GluAsn: 1.401 ± 0.315
3.319GluPro: 3.319 ± 0.413
1.623GluGln: 1.623 ± 0.377
4.5GluArg: 4.5 ± 0.61
2.729GluSer: 2.729 ± 0.421
4.352GluThr: 4.352 ± 0.594
3.762GluVal: 3.762 ± 0.6
1.328GluTrp: 1.328 ± 0.284
1.401GluTyr: 1.401 ± 0.335
0.0GluXaa: 0.0 ± 0.0
Phe
1.918PheAla: 1.918 ± 0.304
0.221PheCys: 0.221 ± 0.108
1.992PheAsp: 1.992 ± 0.324
1.106PheGlu: 1.106 ± 0.287
0.443PhePhe: 0.443 ± 0.191
1.623PheGly: 1.623 ± 0.395
0.516PheHis: 0.516 ± 0.203
0.959PheIle: 0.959 ± 0.285
1.401PheLys: 1.401 ± 0.422
1.77PheLeu: 1.77 ± 0.364
0.516PheMet: 0.516 ± 0.202
0.664PheAsn: 0.664 ± 0.218
1.623PhePro: 1.623 ± 0.38
1.401PheGln: 1.401 ± 0.334
1.475PheArg: 1.475 ± 0.356
1.475PheSer: 1.475 ± 0.266
2.065PheThr: 2.065 ± 0.334
1.328PheVal: 1.328 ± 0.319
0.369PheTrp: 0.369 ± 0.179
0.811PheTyr: 0.811 ± 0.216
0.0PheXaa: 0.0 ± 0.0
Gly
6.86GlyAla: 6.86 ± 0.956
0.443GlyCys: 0.443 ± 0.203
5.237GlyAsp: 5.237 ± 0.556
5.385GlyGlu: 5.385 ± 0.651
2.36GlyPhe: 2.36 ± 0.401
8.925GlyGly: 8.925 ± 1.205
1.844GlyHis: 1.844 ± 0.432
5.016GlyIle: 5.016 ± 0.677
3.246GlyLys: 3.246 ± 0.5
7.081GlyLeu: 7.081 ± 0.808
1.992GlyMet: 1.992 ± 0.39
2.729GlyAsn: 2.729 ± 0.475
4.352GlyPro: 4.352 ± 0.597
4.131GlyGln: 4.131 ± 0.55
6.417GlyArg: 6.417 ± 0.884
6.417GlySer: 6.417 ± 0.875
6.491GlyThr: 6.491 ± 0.797
7.155GlyVal: 7.155 ± 1.088
1.918GlyTrp: 1.918 ± 0.311
2.36GlyTyr: 2.36 ± 0.472
0.0GlyXaa: 0.0 ± 0.0
His
1.401HisAla: 1.401 ± 0.296
0.074HisCys: 0.074 ± 0.087
1.401HisAsp: 1.401 ± 0.256
0.369HisGlu: 0.369 ± 0.158
0.811HisPhe: 0.811 ± 0.256
1.697HisGly: 1.697 ± 0.47
0.811HisHis: 0.811 ± 0.319
0.443HisIle: 0.443 ± 0.19
0.516HisLys: 0.516 ± 0.176
2.287HisLeu: 2.287 ± 0.399
0.295HisMet: 0.295 ± 0.135
0.811HisAsn: 0.811 ± 0.271
1.623HisPro: 1.623 ± 0.389
1.18HisGln: 1.18 ± 0.32
1.549HisArg: 1.549 ± 0.434
0.443HisSer: 0.443 ± 0.204
0.885HisThr: 0.885 ± 0.264
1.106HisVal: 1.106 ± 0.31
0.074HisTrp: 0.074 ± 0.077
0.443HisTyr: 0.443 ± 0.207
0.0HisXaa: 0.0 ± 0.0
Ile
1.918IleAla: 1.918 ± 0.595
0.148IleCys: 0.148 ± 0.111
2.065IleAsp: 2.065 ± 0.367
0.443IleGlu: 0.443 ± 0.164
0.59IlePhe: 0.59 ± 0.205
4.057IleGly: 4.057 ± 0.578
1.328IleHis: 1.328 ± 0.332
0.959IleIle: 0.959 ± 0.3
1.623IleLys: 1.623 ± 0.361
4.057IleLeu: 4.057 ± 0.61
0.885IleMet: 0.885 ± 0.296
0.885IleAsn: 0.885 ± 0.28
3.098IlePro: 3.098 ± 0.475
2.065IleGln: 2.065 ± 0.435
2.655IleArg: 2.655 ± 0.482
2.877IleSer: 2.877 ± 0.475
4.352IleThr: 4.352 ± 0.631
2.729IleVal: 2.729 ± 0.484
1.18IleTrp: 1.18 ± 0.23
1.18IleTyr: 1.18 ± 0.265
0.0IleXaa: 0.0 ± 0.0
Lys
5.532LysAla: 5.532 ± 0.671
0.148LysCys: 0.148 ± 0.088
2.729LysAsp: 2.729 ± 0.425
2.065LysGlu: 2.065 ± 0.389
0.811LysPhe: 0.811 ± 0.268
2.065LysGly: 2.065 ± 0.415
0.295LysHis: 0.295 ± 0.142
1.697LysIle: 1.697 ± 0.323
0.59LysLys: 0.59 ± 0.194
4.057LysLeu: 4.057 ± 0.541
0.443LysMet: 0.443 ± 0.172
1.623LysAsn: 1.623 ± 0.412
1.401LysPro: 1.401 ± 0.307
0.369LysGln: 0.369 ± 0.165
1.918LysArg: 1.918 ± 0.342
2.065LysSer: 2.065 ± 0.472
2.877LysThr: 2.877 ± 0.453
3.467LysVal: 3.467 ± 0.747
0.738LysTrp: 0.738 ± 0.285
0.959LysTyr: 0.959 ± 0.243
0.0LysXaa: 0.0 ± 0.0
Leu
11.064LeuAla: 11.064 ± 0.962
0.664LeuCys: 0.664 ± 0.203
5.458LeuAsp: 5.458 ± 0.763
6.122LeuGlu: 6.122 ± 0.88
1.844LeuPhe: 1.844 ± 0.357
9.294LeuGly: 9.294 ± 0.705
1.623LeuHis: 1.623 ± 0.289
2.434LeuIle: 2.434 ± 0.469
4.647LeuLys: 4.647 ± 0.616
5.827LeuLeu: 5.827 ± 0.76
1.77LeuMet: 1.77 ± 0.266
1.401LeuAsn: 1.401 ± 0.442
5.532LeuPro: 5.532 ± 0.678
3.688LeuGln: 3.688 ± 0.629
5.827LeuArg: 5.827 ± 0.615
3.024LeuSer: 3.024 ± 0.488
6.712LeuThr: 6.712 ± 0.716
4.795LeuVal: 4.795 ± 0.583
1.106LeuTrp: 1.106 ± 0.244
2.139LeuTyr: 2.139 ± 0.449
0.0LeuXaa: 0.0 ± 0.0
Met
3.467MetAla: 3.467 ± 0.446
0.074MetCys: 0.074 ± 0.076
1.77MetAsp: 1.77 ± 0.362
1.475MetGlu: 1.475 ± 0.337
0.516MetPhe: 0.516 ± 0.168
1.992MetGly: 1.992 ± 0.319
0.074MetHis: 0.074 ± 0.071
0.738MetIle: 0.738 ± 0.21
0.295MetLys: 0.295 ± 0.146
0.738MetLeu: 0.738 ± 0.214
0.295MetMet: 0.295 ± 0.158
0.59MetAsn: 0.59 ± 0.242
1.106MetPro: 1.106 ± 0.301
0.369MetGln: 0.369 ± 0.133
1.328MetArg: 1.328 ± 0.294
2.36MetSer: 2.36 ± 0.506
2.951MetThr: 2.951 ± 0.582
1.623MetVal: 1.623 ± 0.318
0.295MetTrp: 0.295 ± 0.261
0.295MetTyr: 0.295 ± 0.13
0.0MetXaa: 0.0 ± 0.0
Asn
3.246AsnAla: 3.246 ± 0.516
0.148AsnCys: 0.148 ± 0.102
0.885AsnAsp: 0.885 ± 0.21
0.885AsnGlu: 0.885 ± 0.24
0.811AsnPhe: 0.811 ± 0.27
3.319AsnGly: 3.319 ± 0.88
0.295AsnHis: 0.295 ± 0.142
1.623AsnIle: 1.623 ± 0.387
1.549AsnLys: 1.549 ± 0.339
2.065AsnLeu: 2.065 ± 0.389
0.885AsnMet: 0.885 ± 0.23
0.811AsnAsn: 0.811 ± 0.23
2.582AsnPro: 2.582 ± 0.448
0.221AsnGln: 0.221 ± 0.131
1.623AsnArg: 1.623 ± 0.335
2.065AsnSer: 2.065 ± 0.385
2.065AsnThr: 2.065 ± 0.444
2.139AsnVal: 2.139 ± 0.363
0.369AsnTrp: 0.369 ± 0.167
0.516AsnTyr: 0.516 ± 0.187
0.0AsnXaa: 0.0 ± 0.0
Pro
7.671ProAla: 7.671 ± 0.715
0.369ProCys: 0.369 ± 0.217
5.458ProAsp: 5.458 ± 0.803
6.27ProGlu: 6.27 ± 0.774
1.77ProPhe: 1.77 ± 0.468
5.532ProGly: 5.532 ± 0.782
0.885ProHis: 0.885 ± 0.231
2.803ProIle: 2.803 ± 0.485
1.18ProLys: 1.18 ± 0.302
3.688ProLeu: 3.688 ± 0.545
1.254ProMet: 1.254 ± 0.272
1.844ProAsn: 1.844 ± 0.339
2.951ProPro: 2.951 ± 0.555
0.738ProGln: 0.738 ± 0.197
2.508ProArg: 2.508 ± 0.484
3.836ProSer: 3.836 ± 0.681
4.204ProThr: 4.204 ± 0.578
3.909ProVal: 3.909 ± 0.666
1.475ProTrp: 1.475 ± 0.385
1.106ProTyr: 1.106 ± 0.309
0.0ProXaa: 0.0 ± 0.0
Gln
5.606GlnAla: 5.606 ± 0.806
0.074GlnCys: 0.074 ± 0.081
1.992GlnAsp: 1.992 ± 0.356
1.401GlnGlu: 1.401 ± 0.339
1.18GlnPhe: 1.18 ± 0.268
2.36GlnGly: 2.36 ± 0.405
0.295GlnHis: 0.295 ± 0.136
1.844GlnIle: 1.844 ± 0.393
0.443GlnLys: 0.443 ± 0.164
2.877GlnLeu: 2.877 ± 0.526
0.738GlnMet: 0.738 ± 0.245
1.033GlnAsn: 1.033 ± 0.249
1.401GlnPro: 1.401 ± 0.424
1.401GlnGln: 1.401 ± 0.588
2.803GlnArg: 2.803 ± 0.451
0.959GlnSer: 0.959 ± 0.248
2.139GlnThr: 2.139 ± 0.354
3.246GlnVal: 3.246 ± 0.446
0.811GlnTrp: 0.811 ± 0.229
1.106GlnTyr: 1.106 ± 0.353
0.0GlnXaa: 0.0 ± 0.0
Arg
8.114ArgAla: 8.114 ± 0.797
0.516ArgCys: 0.516 ± 0.206
3.246ArgAsp: 3.246 ± 0.525
4.573ArgGlu: 4.573 ± 0.711
1.697ArgPhe: 1.697 ± 0.473
6.86ArgGly: 6.86 ± 0.954
1.328ArgHis: 1.328 ± 0.334
1.328ArgIle: 1.328 ± 0.304
1.77ArgLys: 1.77 ± 0.336
5.532ArgLeu: 5.532 ± 0.793
2.508ArgMet: 2.508 ± 0.48
2.213ArgAsn: 2.213 ± 0.477
4.5ArgPro: 4.5 ± 0.723
1.992ArgGln: 1.992 ± 0.357
5.458ArgArg: 5.458 ± 0.945
3.172ArgSer: 3.172 ± 0.482
3.688ArgThr: 3.688 ± 0.565
5.606ArgVal: 5.606 ± 0.737
1.401ArgTrp: 1.401 ± 0.34
1.697ArgTyr: 1.697 ± 0.305
0.0ArgXaa: 0.0 ± 0.0
Ser
6.122SerAla: 6.122 ± 0.723
0.074SerCys: 0.074 ± 0.07
4.204SerAsp: 4.204 ± 0.612
3.688SerGlu: 3.688 ± 0.547
1.328SerPhe: 1.328 ± 0.291
5.237SerGly: 5.237 ± 0.722
0.811SerHis: 0.811 ± 0.248
1.475SerIle: 1.475 ± 0.256
2.139SerLys: 2.139 ± 0.393
4.795SerLeu: 4.795 ± 0.622
1.033SerMet: 1.033 ± 0.305
1.033SerAsn: 1.033 ± 0.311
3.688SerPro: 3.688 ± 0.639
2.065SerGln: 2.065 ± 0.425
3.688SerArg: 3.688 ± 0.51
3.246SerSer: 3.246 ± 0.75
3.319SerThr: 3.319 ± 0.592
4.573SerVal: 4.573 ± 0.563
1.18SerTrp: 1.18 ± 0.25
1.254SerTyr: 1.254 ± 0.331
0.0SerXaa: 0.0 ± 0.0
Thr
9.81ThrAla: 9.81 ± 0.885
0.59ThrCys: 0.59 ± 0.19
4.131ThrAsp: 4.131 ± 0.489
4.868ThrGlu: 4.868 ± 0.641
1.844ThrPhe: 1.844 ± 0.439
7.229ThrGly: 7.229 ± 0.871
0.885ThrHis: 0.885 ± 0.275
2.139ThrIle: 2.139 ± 0.364
3.172ThrLys: 3.172 ± 0.38
5.901ThrLeu: 5.901 ± 0.897
1.623ThrMet: 1.623 ± 0.338
1.77ThrAsn: 1.77 ± 0.361
3.983ThrPro: 3.983 ± 0.471
1.918ThrGln: 1.918 ± 0.298
4.5ThrArg: 4.5 ± 0.637
4.278ThrSer: 4.278 ± 0.587
5.016ThrThr: 5.016 ± 0.692
6.491ThrVal: 6.491 ± 0.635
1.033ThrTrp: 1.033 ± 0.284
1.328ThrTyr: 1.328 ± 0.354
0.0ThrXaa: 0.0 ± 0.0
Val
7.524ValAla: 7.524 ± 0.704
0.295ValCys: 0.295 ± 0.186
4.942ValAsp: 4.942 ± 0.507
3.909ValGlu: 3.909 ± 0.5
1.401ValPhe: 1.401 ± 0.301
5.532ValGly: 5.532 ± 0.76
1.844ValHis: 1.844 ± 0.391
3.541ValIle: 3.541 ± 0.424
3.172ValLys: 3.172 ± 0.45
6.712ValLeu: 6.712 ± 0.836
1.918ValMet: 1.918 ± 0.365
1.844ValAsn: 1.844 ± 0.346
4.426ValPro: 4.426 ± 0.505
3.024ValGln: 3.024 ± 0.454
5.458ValArg: 5.458 ± 0.679
3.541ValSer: 3.541 ± 0.497
5.827ValThr: 5.827 ± 0.761
7.598ValVal: 7.598 ± 0.923
1.254ValTrp: 1.254 ± 0.381
2.065ValTyr: 2.065 ± 0.495
0.0ValXaa: 0.0 ± 0.0
Trp
1.401TrpAla: 1.401 ± 0.318
0.295TrpCys: 0.295 ± 0.226
1.106TrpAsp: 1.106 ± 0.281
1.475TrpGlu: 1.475 ± 0.265
0.664TrpPhe: 0.664 ± 0.356
0.959TrpGly: 0.959 ± 0.264
0.221TrpHis: 0.221 ± 0.122
0.885TrpIle: 0.885 ± 0.292
0.443TrpLys: 0.443 ± 0.208
2.065TrpLeu: 2.065 ± 0.391
0.148TrpMet: 0.148 ± 0.114
0.885TrpAsn: 0.885 ± 0.385
1.106TrpPro: 1.106 ± 0.245
0.664TrpGln: 0.664 ± 0.173
1.918TrpArg: 1.918 ± 0.409
1.549TrpSer: 1.549 ± 0.425
1.918TrpThr: 1.918 ± 0.388
1.992TrpVal: 1.992 ± 0.462
0.443TrpTrp: 0.443 ± 0.192
0.221TrpTyr: 0.221 ± 0.119
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.098TyrAla: 3.098 ± 0.466
0.0TyrCys: 0.0 ± 0.0
1.033TyrAsp: 1.033 ± 0.319
0.959TyrGlu: 0.959 ± 0.218
0.738TyrPhe: 0.738 ± 0.229
2.508TyrGly: 2.508 ± 0.407
0.295TyrHis: 0.295 ± 0.14
0.885TyrIle: 0.885 ± 0.317
0.811TyrLys: 0.811 ± 0.283
2.951TyrLeu: 2.951 ± 0.455
0.369TyrMet: 0.369 ± 0.177
1.033TyrAsn: 1.033 ± 0.288
1.328TyrPro: 1.328 ± 0.291
0.959TyrGln: 0.959 ± 0.264
2.065TyrArg: 2.065 ± 0.267
1.106TyrSer: 1.106 ± 0.302
1.918TyrThr: 1.918 ± 0.425
2.065TyrVal: 2.065 ± 0.436
0.221TyrTrp: 0.221 ± 0.124
0.369TyrTyr: 0.369 ± 0.175
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 61 proteins (13558 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski