Amino acid dipepetide frequency for Microbacterium phage Didgeridoo

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.549AlaAla: 12.549 ± 1.039
0.369AlaCys: 0.369 ± 0.179
7.972AlaAsp: 7.972 ± 0.822
7.086AlaGlu: 7.086 ± 0.889
3.248AlaPhe: 3.248 ± 0.556
10.408AlaGly: 10.408 ± 0.889
1.698AlaHis: 1.698 ± 0.384
6.348AlaIle: 6.348 ± 0.645
5.905AlaLys: 5.905 ± 0.852
10.039AlaLeu: 10.039 ± 0.807
2.805AlaMet: 2.805 ± 0.359
3.912AlaAsn: 3.912 ± 0.667
5.684AlaPro: 5.684 ± 0.619
5.167AlaGln: 5.167 ± 0.686
5.979AlaArg: 5.979 ± 0.657
4.503AlaSer: 4.503 ± 0.502
7.16AlaThr: 7.16 ± 0.991
8.932AlaVal: 8.932 ± 0.781
2.584AlaTrp: 2.584 ± 0.395
2.288AlaTyr: 2.288 ± 0.545
0.0AlaXaa: 0.0 ± 0.0
Cys
0.221CysAla: 0.221 ± 0.147
0.0CysCys: 0.0 ± 0.0
0.369CysAsp: 0.369 ± 0.176
0.074CysGlu: 0.074 ± 0.074
0.148CysPhe: 0.148 ± 0.124
1.033CysGly: 1.033 ± 0.336
0.074CysHis: 0.074 ± 0.08
0.0CysIle: 0.0 ± 0.0
0.074CysLys: 0.074 ± 0.069
0.148CysLeu: 0.148 ± 0.11
0.074CysMet: 0.074 ± 0.077
0.295CysAsn: 0.295 ± 0.155
0.886CysPro: 0.886 ± 0.376
0.148CysGln: 0.148 ± 0.122
0.369CysArg: 0.369 ± 0.183
0.369CysSer: 0.369 ± 0.207
0.295CysThr: 0.295 ± 0.155
0.148CysVal: 0.148 ± 0.1
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
6.201AspAla: 6.201 ± 0.614
0.148AspCys: 0.148 ± 0.111
4.134AspAsp: 4.134 ± 0.83
3.912AspGlu: 3.912 ± 0.585
2.436AspPhe: 2.436 ± 0.417
5.389AspGly: 5.389 ± 0.612
1.919AspHis: 1.919 ± 0.385
3.027AspIle: 3.027 ± 0.473
1.993AspLys: 1.993 ± 0.343
6.348AspLeu: 6.348 ± 0.662
1.919AspMet: 1.919 ± 0.382
1.919AspAsn: 1.919 ± 0.36
4.355AspPro: 4.355 ± 0.649
1.772AspGln: 1.772 ± 0.327
3.248AspArg: 3.248 ± 0.658
3.765AspSer: 3.765 ± 0.45
3.396AspThr: 3.396 ± 0.577
4.577AspVal: 4.577 ± 0.677
1.476AspTrp: 1.476 ± 0.319
2.731AspTyr: 2.731 ± 0.39
0.0AspXaa: 0.0 ± 0.0
Glu
8.194GluAla: 8.194 ± 0.709
0.0GluCys: 0.0 ± 0.0
3.912GluAsp: 3.912 ± 0.601
3.912GluGlu: 3.912 ± 0.577
1.993GluPhe: 1.993 ± 0.328
3.765GluGly: 3.765 ± 0.517
1.624GluHis: 1.624 ± 0.433
1.033GluIle: 1.033 ± 0.254
2.436GluLys: 2.436 ± 0.504
6.348GluLeu: 6.348 ± 0.796
1.181GluMet: 1.181 ± 0.309
1.845GluAsn: 1.845 ± 0.456
3.469GluPro: 3.469 ± 0.55
2.436GluGln: 2.436 ± 0.39
3.027GluArg: 3.027 ± 0.519
1.993GluSer: 1.993 ± 0.503
3.691GluThr: 3.691 ± 0.522
6.717GluVal: 6.717 ± 0.726
1.476GluTrp: 1.476 ± 0.324
1.329GluTyr: 1.329 ± 0.282
0.0GluXaa: 0.0 ± 0.0
Phe
3.174PheAla: 3.174 ± 0.484
0.148PheCys: 0.148 ± 0.112
1.476PheAsp: 1.476 ± 0.287
2.657PheGlu: 2.657 ± 0.324
1.033PhePhe: 1.033 ± 0.345
2.362PheGly: 2.362 ± 0.55
0.517PheHis: 0.517 ± 0.259
1.698PheIle: 1.698 ± 0.407
1.55PheLys: 1.55 ± 0.333
2.362PheLeu: 2.362 ± 0.473
0.96PheMet: 0.96 ± 0.304
1.255PheAsn: 1.255 ± 0.227
1.624PhePro: 1.624 ± 0.354
1.033PheGln: 1.033 ± 0.296
3.691PheArg: 3.691 ± 0.634
1.845PheSer: 1.845 ± 0.378
2.805PheThr: 2.805 ± 0.441
1.403PheVal: 1.403 ± 0.331
0.517PheTrp: 0.517 ± 0.28
0.591PheTyr: 0.591 ± 0.176
0.0PheXaa: 0.0 ± 0.0
Gly
8.784GlyAla: 8.784 ± 0.829
0.664GlyCys: 0.664 ± 0.22
6.422GlyAsp: 6.422 ± 0.625
3.912GlyGlu: 3.912 ± 0.608
3.248GlyPhe: 3.248 ± 0.474
5.684GlyGly: 5.684 ± 1.046
1.403GlyHis: 1.403 ± 0.372
4.134GlyIle: 4.134 ± 0.453
3.765GlyLys: 3.765 ± 0.62
7.16GlyLeu: 7.16 ± 0.74
2.141GlyMet: 2.141 ± 0.414
2.584GlyAsn: 2.584 ± 0.463
2.879GlyPro: 2.879 ± 0.478
3.174GlyGln: 3.174 ± 0.643
4.724GlyArg: 4.724 ± 0.512
3.986GlySer: 3.986 ± 0.64
5.979GlyThr: 5.979 ± 0.849
6.127GlyVal: 6.127 ± 0.81
1.55GlyTrp: 1.55 ± 0.29
2.362GlyTyr: 2.362 ± 0.41
0.0GlyXaa: 0.0 ± 0.0
His
2.141HisAla: 2.141 ± 0.404
0.221HisCys: 0.221 ± 0.13
1.181HisAsp: 1.181 ± 0.3
0.812HisGlu: 0.812 ± 0.227
0.591HisPhe: 0.591 ± 0.241
1.476HisGly: 1.476 ± 0.309
0.369HisHis: 0.369 ± 0.147
0.96HisIle: 0.96 ± 0.297
0.96HisLys: 0.96 ± 0.367
2.362HisLeu: 2.362 ± 0.458
0.517HisMet: 0.517 ± 0.178
0.664HisAsn: 0.664 ± 0.263
1.107HisPro: 1.107 ± 0.294
0.295HisGln: 0.295 ± 0.163
1.255HisArg: 1.255 ± 0.373
0.591HisSer: 0.591 ± 0.259
0.886HisThr: 0.886 ± 0.273
1.476HisVal: 1.476 ± 0.365
0.295HisTrp: 0.295 ± 0.132
0.96HisTyr: 0.96 ± 0.257
0.0HisXaa: 0.0 ± 0.0
Ile
4.65IleAla: 4.65 ± 0.545
0.0IleCys: 0.0 ± 0.0
3.691IleAsp: 3.691 ± 0.49
3.543IleGlu: 3.543 ± 0.531
0.664IlePhe: 0.664 ± 0.188
2.805IleGly: 2.805 ± 0.512
1.255IleHis: 1.255 ± 0.378
2.362IleIle: 2.362 ± 0.402
2.879IleLys: 2.879 ± 0.751
3.986IleLeu: 3.986 ± 0.491
0.517IleMet: 0.517 ± 0.2
1.624IleAsn: 1.624 ± 0.41
2.436IlePro: 2.436 ± 0.511
1.476IleGln: 1.476 ± 0.274
3.396IleArg: 3.396 ± 0.558
1.772IleSer: 1.772 ± 0.375
3.838IleThr: 3.838 ± 0.516
3.469IleVal: 3.469 ± 0.406
0.591IleTrp: 0.591 ± 0.229
1.255IleTyr: 1.255 ± 0.352
0.0IleXaa: 0.0 ± 0.0
Lys
5.905LysAla: 5.905 ± 1.059
0.295LysCys: 0.295 ± 0.132
2.288LysAsp: 2.288 ± 0.452
1.993LysGlu: 1.993 ± 0.36
1.107LysPhe: 1.107 ± 0.293
2.805LysGly: 2.805 ± 0.542
0.591LysHis: 0.591 ± 0.178
1.55LysIle: 1.55 ± 0.475
1.624LysLys: 1.624 ± 0.353
4.355LysLeu: 4.355 ± 0.692
1.181LysMet: 1.181 ± 0.25
1.181LysAsn: 1.181 ± 0.205
3.396LysPro: 3.396 ± 0.534
1.329LysGln: 1.329 ± 0.32
2.51LysArg: 2.51 ± 0.435
2.436LysSer: 2.436 ± 0.485
3.691LysThr: 3.691 ± 0.532
3.322LysVal: 3.322 ± 0.415
0.443LysTrp: 0.443 ± 0.19
0.96LysTyr: 0.96 ± 0.263
0.0LysXaa: 0.0 ± 0.0
Leu
10.556LeuAla: 10.556 ± 1.049
0.369LeuCys: 0.369 ± 0.169
5.315LeuAsp: 5.315 ± 0.602
5.905LeuGlu: 5.905 ± 0.663
2.288LeuPhe: 2.288 ± 0.428
7.972LeuGly: 7.972 ± 0.665
1.403LeuHis: 1.403 ± 0.355
3.986LeuIle: 3.986 ± 0.666
3.469LeuLys: 3.469 ± 0.522
6.644LeuLeu: 6.644 ± 0.656
2.067LeuMet: 2.067 ± 0.339
3.543LeuAsn: 3.543 ± 0.607
4.355LeuPro: 4.355 ± 0.492
2.879LeuGln: 2.879 ± 0.503
5.315LeuArg: 5.315 ± 0.823
4.872LeuSer: 4.872 ± 0.631
6.274LeuThr: 6.274 ± 0.739
5.758LeuVal: 5.758 ± 0.651
1.403LeuTrp: 1.403 ± 0.309
1.624LeuTyr: 1.624 ± 0.336
0.0LeuXaa: 0.0 ± 0.0
Met
3.469MetAla: 3.469 ± 0.532
0.0MetCys: 0.0 ± 0.0
1.329MetAsp: 1.329 ± 0.385
0.517MetGlu: 0.517 ± 0.221
0.443MetPhe: 0.443 ± 0.176
1.255MetGly: 1.255 ± 0.333
0.148MetHis: 0.148 ± 0.099
0.886MetIle: 0.886 ± 0.29
1.476MetLys: 1.476 ± 0.353
1.329MetLeu: 1.329 ± 0.303
0.295MetMet: 0.295 ± 0.165
1.107MetAsn: 1.107 ± 0.282
1.181MetPro: 1.181 ± 0.303
0.517MetGln: 0.517 ± 0.193
1.624MetArg: 1.624 ± 0.346
1.255MetSer: 1.255 ± 0.313
2.584MetThr: 2.584 ± 0.395
1.476MetVal: 1.476 ± 0.326
0.369MetTrp: 0.369 ± 0.171
0.295MetTyr: 0.295 ± 0.143
0.0MetXaa: 0.0 ± 0.0
Asn
3.1AsnAla: 3.1 ± 0.474
0.148AsnCys: 0.148 ± 0.097
1.698AsnAsp: 1.698 ± 0.287
1.993AsnGlu: 1.993 ± 0.388
0.96AsnPhe: 0.96 ± 0.272
2.879AsnGly: 2.879 ± 0.41
0.812AsnHis: 0.812 ± 0.252
1.698AsnIle: 1.698 ± 0.334
1.403AsnLys: 1.403 ± 0.32
2.657AsnLeu: 2.657 ± 0.412
0.517AsnMet: 0.517 ± 0.174
1.476AsnAsn: 1.476 ± 0.335
3.174AsnPro: 3.174 ± 0.533
0.886AsnGln: 0.886 ± 0.216
2.436AsnArg: 2.436 ± 0.462
2.51AsnSer: 2.51 ± 0.537
2.067AsnThr: 2.067 ± 0.447
2.141AsnVal: 2.141 ± 0.33
0.812AsnTrp: 0.812 ± 0.283
0.96AsnTyr: 0.96 ± 0.262
0.0AsnXaa: 0.0 ± 0.0
Pro
8.046ProAla: 8.046 ± 0.863
0.591ProCys: 0.591 ± 0.259
3.838ProAsp: 3.838 ± 0.516
4.724ProGlu: 4.724 ± 0.626
1.476ProPhe: 1.476 ± 0.316
4.577ProGly: 4.577 ± 0.507
1.107ProHis: 1.107 ± 0.336
2.436ProIle: 2.436 ± 0.397
2.141ProLys: 2.141 ± 0.355
3.691ProLeu: 3.691 ± 0.56
0.517ProMet: 0.517 ± 0.192
2.067ProAsn: 2.067 ± 0.41
2.362ProPro: 2.362 ± 0.554
1.403ProGln: 1.403 ± 0.259
2.288ProArg: 2.288 ± 0.463
3.322ProSer: 3.322 ± 0.512
3.986ProThr: 3.986 ± 0.613
4.281ProVal: 4.281 ± 0.541
1.55ProTrp: 1.55 ± 0.4
0.517ProTyr: 0.517 ± 0.186
0.0ProXaa: 0.0 ± 0.0
Gln
5.02GlnAla: 5.02 ± 0.509
0.148GlnCys: 0.148 ± 0.125
1.624GlnAsp: 1.624 ± 0.348
1.993GlnGlu: 1.993 ± 0.383
1.255GlnPhe: 1.255 ± 0.347
2.584GlnGly: 2.584 ± 0.363
0.738GlnHis: 0.738 ± 0.249
1.476GlnIle: 1.476 ± 0.368
1.403GlnLys: 1.403 ± 0.336
3.986GlnLeu: 3.986 ± 0.503
0.295GlnMet: 0.295 ± 0.135
1.033GlnAsn: 1.033 ± 0.285
1.255GlnPro: 1.255 ± 0.354
0.812GlnGln: 0.812 ± 0.259
1.919GlnArg: 1.919 ± 0.387
1.329GlnSer: 1.329 ± 0.35
1.845GlnThr: 1.845 ± 0.346
2.657GlnVal: 2.657 ± 0.494
0.738GlnTrp: 0.738 ± 0.261
0.664GlnTyr: 0.664 ± 0.229
0.0GlnXaa: 0.0 ± 0.0
Arg
6.201ArgAla: 6.201 ± 0.683
0.443ArgCys: 0.443 ± 0.228
3.838ArgAsp: 3.838 ± 0.601
3.765ArgGlu: 3.765 ± 0.665
2.288ArgPhe: 2.288 ± 0.422
5.093ArgGly: 5.093 ± 0.918
1.403ArgHis: 1.403 ± 0.277
2.805ArgIle: 2.805 ± 0.463
3.027ArgLys: 3.027 ± 0.487
4.798ArgLeu: 4.798 ± 0.47
1.255ArgMet: 1.255 ± 0.314
1.476ArgAsn: 1.476 ± 0.267
2.805ArgPro: 2.805 ± 0.476
2.51ArgGln: 2.51 ± 0.419
5.389ArgArg: 5.389 ± 0.756
3.322ArgSer: 3.322 ± 0.49
3.248ArgThr: 3.248 ± 0.576
5.093ArgVal: 5.093 ± 0.643
1.476ArgTrp: 1.476 ± 0.394
1.329ArgTyr: 1.329 ± 0.263
0.0ArgXaa: 0.0 ± 0.0
Ser
5.241SerAla: 5.241 ± 0.663
0.148SerCys: 0.148 ± 0.101
3.174SerAsp: 3.174 ± 0.553
2.584SerGlu: 2.584 ± 0.576
2.362SerPhe: 2.362 ± 0.434
5.389SerGly: 5.389 ± 0.656
0.886SerHis: 0.886 ± 0.224
2.584SerIle: 2.584 ± 0.523
2.362SerLys: 2.362 ± 0.357
5.093SerLeu: 5.093 ± 0.677
1.255SerMet: 1.255 ± 0.318
1.993SerAsn: 1.993 ± 0.458
2.805SerPro: 2.805 ± 0.424
1.476SerGln: 1.476 ± 0.314
2.953SerArg: 2.953 ± 0.436
2.953SerSer: 2.953 ± 0.563
3.838SerThr: 3.838 ± 0.46
2.731SerVal: 2.731 ± 0.567
1.403SerTrp: 1.403 ± 0.344
1.107SerTyr: 1.107 ± 0.289
0.0SerXaa: 0.0 ± 0.0
Thr
8.415ThrAla: 8.415 ± 1.051
0.443ThrCys: 0.443 ± 0.263
4.872ThrAsp: 4.872 ± 0.706
3.469ThrGlu: 3.469 ± 0.591
2.805ThrPhe: 2.805 ± 0.522
6.422ThrGly: 6.422 ± 0.775
1.107ThrHis: 1.107 ± 0.289
4.134ThrIle: 4.134 ± 0.574
3.322ThrLys: 3.322 ± 0.444
5.02ThrLeu: 5.02 ± 0.561
0.96ThrMet: 0.96 ± 0.237
1.55ThrAsn: 1.55 ± 0.341
4.798ThrPro: 4.798 ± 0.561
1.107ThrGln: 1.107 ± 0.265
3.617ThrArg: 3.617 ± 0.473
3.912ThrSer: 3.912 ± 0.52
5.832ThrThr: 5.832 ± 0.961
5.02ThrVal: 5.02 ± 0.661
1.845ThrTrp: 1.845 ± 0.434
1.919ThrTyr: 1.919 ± 0.403
0.0ThrXaa: 0.0 ± 0.0
Val
8.194ValAla: 8.194 ± 0.83
0.295ValCys: 0.295 ± 0.145
4.65ValAsp: 4.65 ± 0.583
4.503ValGlu: 4.503 ± 0.521
3.174ValPhe: 3.174 ± 0.501
5.389ValGly: 5.389 ± 0.682
1.255ValHis: 1.255 ± 0.367
3.174ValIle: 3.174 ± 0.449
2.362ValLys: 2.362 ± 0.502
6.127ValLeu: 6.127 ± 0.69
1.55ValMet: 1.55 ± 0.348
3.027ValAsn: 3.027 ± 0.573
3.986ValPro: 3.986 ± 0.601
2.362ValGln: 2.362 ± 0.387
4.946ValArg: 4.946 ± 0.567
4.134ValSer: 4.134 ± 0.547
5.462ValThr: 5.462 ± 0.784
6.348ValVal: 6.348 ± 0.672
1.698ValTrp: 1.698 ± 0.404
2.215ValTyr: 2.215 ± 0.383
0.0ValXaa: 0.0 ± 0.0
Trp
2.215TrpAla: 2.215 ± 0.404
0.074TrpCys: 0.074 ± 0.092
0.738TrpAsp: 0.738 ± 0.218
1.403TrpGlu: 1.403 ± 0.347
0.738TrpPhe: 0.738 ± 0.22
1.255TrpGly: 1.255 ± 0.266
0.221TrpHis: 0.221 ± 0.13
1.403TrpIle: 1.403 ± 0.34
0.517TrpLys: 0.517 ± 0.215
1.993TrpLeu: 1.993 ± 0.438
0.738TrpMet: 0.738 ± 0.231
1.033TrpAsn: 1.033 ± 0.307
0.812TrpPro: 0.812 ± 0.309
1.181TrpGln: 1.181 ± 0.43
0.96TrpArg: 0.96 ± 0.336
1.55TrpSer: 1.55 ± 0.367
1.698TrpThr: 1.698 ± 0.34
1.624TrpVal: 1.624 ± 0.326
0.074TrpTrp: 0.074 ± 0.071
0.664TrpTyr: 0.664 ± 0.18
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.436TyrAla: 2.436 ± 0.463
0.221TyrCys: 0.221 ± 0.133
2.141TyrAsp: 2.141 ± 0.393
1.624TyrGlu: 1.624 ± 0.389
0.443TyrPhe: 0.443 ± 0.195
2.141TyrGly: 2.141 ± 0.402
0.591TyrHis: 0.591 ± 0.199
0.517TyrIle: 0.517 ± 0.182
0.295TyrLys: 0.295 ± 0.166
1.55TyrLeu: 1.55 ± 0.278
0.738TyrMet: 0.738 ± 0.225
0.738TyrAsn: 0.738 ± 0.26
1.55TyrPro: 1.55 ± 0.317
0.812TyrGln: 0.812 ± 0.284
1.919TyrArg: 1.919 ± 0.424
1.919TyrSer: 1.919 ± 0.376
1.993TyrThr: 1.993 ± 0.359
1.55TyrVal: 1.55 ± 0.386
0.591TyrTrp: 0.591 ± 0.215
0.369TyrTyr: 0.369 ± 0.176
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 74 proteins (13548 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski