Amino acid dipepetide frequency for Propionibacterium phage Pirate

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.041AlaAla: 11.041 ± 1.661
0.563AlaCys: 0.563 ± 0.245
6.422AlaAsp: 6.422 ± 0.922
5.408AlaGlu: 5.408 ± 0.779
2.817AlaPhe: 2.817 ± 0.595
9.576AlaGly: 9.576 ± 0.972
2.028AlaHis: 2.028 ± 0.503
4.394AlaIle: 4.394 ± 0.68
3.831AlaLys: 3.831 ± 0.605
8.112AlaLeu: 8.112 ± 1.006
3.042AlaMet: 3.042 ± 1.225
2.141AlaAsn: 2.141 ± 0.483
2.479AlaPro: 2.479 ± 0.495
4.732AlaGln: 4.732 ± 0.636
6.196AlaArg: 6.196 ± 1.071
6.084AlaSer: 6.084 ± 0.818
5.746AlaThr: 5.746 ± 0.874
9.351AlaVal: 9.351 ± 1.378
1.465AlaTrp: 1.465 ± 0.387
2.141AlaTyr: 2.141 ± 0.46
0.0AlaXaa: 0.0 ± 0.0
Cys
0.901CysAla: 0.901 ± 0.322
0.113CysCys: 0.113 ± 0.105
1.014CysAsp: 1.014 ± 0.379
1.127CysGlu: 1.127 ± 0.376
0.0CysPhe: 0.0 ± 0.0
1.465CysGly: 1.465 ± 0.422
0.338CysHis: 0.338 ± 0.231
0.225CysIle: 0.225 ± 0.147
0.451CysLys: 0.451 ± 0.254
0.563CysLeu: 0.563 ± 0.215
0.0CysMet: 0.0 ± 0.0
0.338CysAsn: 0.338 ± 0.19
0.901CysPro: 0.901 ± 0.288
0.225CysGln: 0.225 ± 0.139
1.465CysArg: 1.465 ± 0.458
0.451CysSer: 0.451 ± 0.264
1.014CysThr: 1.014 ± 0.327
0.676CysVal: 0.676 ± 0.274
0.338CysTrp: 0.338 ± 0.215
0.225CysTyr: 0.225 ± 0.151
0.0CysXaa: 0.0 ± 0.0
Asp
4.507AspAla: 4.507 ± 0.752
1.014AspCys: 1.014 ± 0.29
5.408AspAsp: 5.408 ± 0.808
3.831AspGlu: 3.831 ± 0.731
1.69AspPhe: 1.69 ± 0.505
5.971AspGly: 5.971 ± 1.438
2.141AspHis: 2.141 ± 0.513
2.929AspIle: 2.929 ± 0.753
2.366AspLys: 2.366 ± 0.363
4.394AspLeu: 4.394 ± 0.916
1.014AspMet: 1.014 ± 0.357
3.831AspAsn: 3.831 ± 0.877
3.718AspPro: 3.718 ± 1.064
1.69AspGln: 1.69 ± 0.527
3.38AspArg: 3.38 ± 0.644
3.38AspSer: 3.38 ± 0.631
5.07AspThr: 5.07 ± 0.764
4.845AspVal: 4.845 ± 0.756
1.69AspTrp: 1.69 ± 0.412
2.479AspTyr: 2.479 ± 0.581
0.0AspXaa: 0.0 ± 0.0
Glu
5.521GluAla: 5.521 ± 1.032
0.789GluCys: 0.789 ± 0.399
2.028GluAsp: 2.028 ± 0.522
2.591GluGlu: 2.591 ± 0.664
1.352GluPhe: 1.352 ± 0.366
3.267GluGly: 3.267 ± 0.583
0.676GluHis: 0.676 ± 0.253
2.366GluIle: 2.366 ± 0.484
1.803GluLys: 1.803 ± 0.454
3.831GluLeu: 3.831 ± 0.739
1.014GluMet: 1.014 ± 0.282
1.803GluAsn: 1.803 ± 0.485
1.465GluPro: 1.465 ± 0.507
2.028GluGln: 2.028 ± 0.471
3.493GluArg: 3.493 ± 0.704
4.394GluSer: 4.394 ± 0.85
3.38GluThr: 3.38 ± 0.753
3.493GluVal: 3.493 ± 0.61
1.69GluTrp: 1.69 ± 0.508
2.141GluTyr: 2.141 ± 0.614
0.0GluXaa: 0.0 ± 0.0
Phe
2.704PheAla: 2.704 ± 0.828
0.338PheCys: 0.338 ± 0.198
1.803PheAsp: 1.803 ± 0.468
1.577PheGlu: 1.577 ± 0.341
1.239PhePhe: 1.239 ± 0.311
2.366PheGly: 2.366 ± 0.494
0.901PheHis: 0.901 ± 0.314
1.014PheIle: 1.014 ± 0.263
1.465PheLys: 1.465 ± 0.402
1.803PheLeu: 1.803 ± 0.447
0.789PheMet: 0.789 ± 0.301
0.789PheAsn: 0.789 ± 0.429
1.352PhePro: 1.352 ± 0.408
0.676PheGln: 0.676 ± 0.204
2.028PheArg: 2.028 ± 0.439
2.253PheSer: 2.253 ± 0.517
2.028PheThr: 2.028 ± 0.422
1.915PheVal: 1.915 ± 0.42
0.563PheTrp: 0.563 ± 0.239
0.113PheTyr: 0.113 ± 0.106
0.0PheXaa: 0.0 ± 0.0
Gly
6.647GlyAla: 6.647 ± 1.275
0.563GlyCys: 0.563 ± 0.256
5.408GlyAsp: 5.408 ± 0.685
4.507GlyGlu: 4.507 ± 0.614
2.817GlyPhe: 2.817 ± 0.694
6.985GlyGly: 6.985 ± 0.878
1.803GlyHis: 1.803 ± 0.515
2.704GlyIle: 2.704 ± 0.669
4.394GlyLys: 4.394 ± 0.669
8.112GlyLeu: 8.112 ± 1.2
1.69GlyMet: 1.69 ± 0.403
3.042GlyAsn: 3.042 ± 0.45
3.831GlyPro: 3.831 ± 0.954
3.267GlyGln: 3.267 ± 0.704
4.281GlyArg: 4.281 ± 0.932
6.76GlySer: 6.76 ± 1.198
4.056GlyThr: 4.056 ± 0.622
9.013GlyVal: 9.013 ± 1.396
2.028GlyTrp: 2.028 ± 0.396
2.817GlyTyr: 2.817 ± 0.536
0.0GlyXaa: 0.0 ± 0.0
His
1.803HisAla: 1.803 ± 0.56
0.789HisCys: 0.789 ± 0.283
1.577HisAsp: 1.577 ± 0.509
0.563HisGlu: 0.563 ± 0.266
0.676HisPhe: 0.676 ± 0.367
1.915HisGly: 1.915 ± 0.411
1.69HisHis: 1.69 ± 0.54
1.69HisIle: 1.69 ± 0.496
1.127HisLys: 1.127 ± 0.441
2.591HisLeu: 2.591 ± 0.76
0.225HisMet: 0.225 ± 0.174
1.239HisAsn: 1.239 ± 0.412
1.127HisPro: 1.127 ± 0.409
1.352HisGln: 1.352 ± 0.396
1.352HisArg: 1.352 ± 0.395
1.352HisSer: 1.352 ± 0.308
2.028HisThr: 2.028 ± 0.594
1.352HisVal: 1.352 ± 0.32
0.451HisTrp: 0.451 ± 0.227
1.014HisTyr: 1.014 ± 0.381
0.0HisXaa: 0.0 ± 0.0
Ile
3.831IleAla: 3.831 ± 0.658
0.789IleCys: 0.789 ± 0.318
4.056IleAsp: 4.056 ± 0.761
3.267IleGlu: 3.267 ± 0.501
1.239IlePhe: 1.239 ± 0.295
2.366IleGly: 2.366 ± 0.749
1.465IleHis: 1.465 ± 0.449
3.155IleIle: 3.155 ± 0.738
1.803IleLys: 1.803 ± 0.434
3.831IleLeu: 3.831 ± 0.741
1.239IleMet: 1.239 ± 0.456
2.253IleAsn: 2.253 ± 0.623
2.817IlePro: 2.817 ± 0.494
1.803IleGln: 1.803 ± 0.452
2.929IleArg: 2.929 ± 0.6
2.479IleSer: 2.479 ± 0.57
4.281IleThr: 4.281 ± 0.817
3.493IleVal: 3.493 ± 0.524
0.338IleTrp: 0.338 ± 0.193
0.901IleTyr: 0.901 ± 0.379
0.0IleXaa: 0.0 ± 0.0
Lys
4.619LysAla: 4.619 ± 0.766
0.225LysCys: 0.225 ± 0.149
2.591LysAsp: 2.591 ± 0.657
1.127LysGlu: 1.127 ± 0.359
0.451LysPhe: 0.451 ± 0.201
3.267LysGly: 3.267 ± 0.609
0.676LysHis: 0.676 ± 0.268
1.577LysIle: 1.577 ± 0.43
1.127LysLys: 1.127 ± 0.378
3.493LysLeu: 3.493 ± 0.522
1.127LysMet: 1.127 ± 0.313
1.915LysAsn: 1.915 ± 0.516
3.155LysPro: 3.155 ± 0.566
2.479LysGln: 2.479 ± 0.629
2.929LysArg: 2.929 ± 0.614
2.253LysSer: 2.253 ± 0.866
3.493LysThr: 3.493 ± 0.841
1.577LysVal: 1.577 ± 0.493
0.676LysTrp: 0.676 ± 0.244
1.127LysTyr: 1.127 ± 0.326
0.0LysXaa: 0.0 ± 0.0
Leu
10.027LeuAla: 10.027 ± 0.964
1.465LeuCys: 1.465 ± 0.366
5.633LeuAsp: 5.633 ± 1.053
3.943LeuGlu: 3.943 ± 0.755
1.915LeuPhe: 1.915 ± 0.521
6.309LeuGly: 6.309 ± 1.226
2.704LeuHis: 2.704 ± 0.57
3.831LeuIle: 3.831 ± 0.793
4.394LeuLys: 4.394 ± 0.647
4.619LeuLeu: 4.619 ± 0.641
1.465LeuMet: 1.465 ± 0.335
2.929LeuAsn: 2.929 ± 0.511
4.394LeuPro: 4.394 ± 0.585
3.493LeuGln: 3.493 ± 0.552
3.042LeuArg: 3.042 ± 0.695
5.858LeuSer: 5.858 ± 0.796
4.281LeuThr: 4.281 ± 0.838
4.056LeuVal: 4.056 ± 0.725
1.014LeuTrp: 1.014 ± 0.357
1.915LeuTyr: 1.915 ± 0.547
0.0LeuXaa: 0.0 ± 0.0
Met
2.817MetAla: 2.817 ± 0.648
0.563MetCys: 0.563 ± 0.251
1.014MetAsp: 1.014 ± 0.327
0.563MetGlu: 0.563 ± 0.257
1.127MetPhe: 1.127 ± 0.304
1.465MetGly: 1.465 ± 0.616
0.451MetHis: 0.451 ± 0.198
1.803MetIle: 1.803 ± 0.419
0.789MetLys: 0.789 ± 0.298
2.366MetLeu: 2.366 ± 0.427
0.338MetMet: 0.338 ± 0.161
1.014MetAsn: 1.014 ± 0.303
1.352MetPro: 1.352 ± 0.587
1.352MetGln: 1.352 ± 0.398
1.465MetArg: 1.465 ± 0.375
2.028MetSer: 2.028 ± 0.552
1.014MetThr: 1.014 ± 0.327
1.803MetVal: 1.803 ± 0.373
0.563MetTrp: 0.563 ± 0.251
1.014MetTyr: 1.014 ± 0.36
0.0MetXaa: 0.0 ± 0.0
Asn
2.817AsnAla: 2.817 ± 0.905
0.113AsnCys: 0.113 ± 0.106
1.577AsnAsp: 1.577 ± 0.416
1.465AsnGlu: 1.465 ± 0.426
0.676AsnPhe: 0.676 ± 0.227
4.732AsnGly: 4.732 ± 0.632
1.014AsnHis: 1.014 ± 0.3
2.479AsnIle: 2.479 ± 0.598
1.352AsnLys: 1.352 ± 0.426
2.479AsnLeu: 2.479 ± 0.621
1.127AsnMet: 1.127 ± 0.277
2.591AsnAsn: 2.591 ± 0.888
3.042AsnPro: 3.042 ± 0.739
2.141AsnGln: 2.141 ± 0.597
2.479AsnArg: 2.479 ± 0.738
1.803AsnSer: 1.803 ± 0.556
2.929AsnThr: 2.929 ± 0.69
2.479AsnVal: 2.479 ± 0.563
0.563AsnTrp: 0.563 ± 0.274
1.127AsnTyr: 1.127 ± 0.379
0.0AsnXaa: 0.0 ± 0.0
Pro
5.521ProAla: 5.521 ± 0.843
0.338ProCys: 0.338 ± 0.205
4.507ProAsp: 4.507 ± 0.835
2.704ProGlu: 2.704 ± 0.573
1.127ProPhe: 1.127 ± 0.405
5.183ProGly: 5.183 ± 0.753
1.239ProHis: 1.239 ± 0.429
1.69ProIle: 1.69 ± 0.39
1.69ProLys: 1.69 ± 0.319
3.155ProLeu: 3.155 ± 0.495
0.901ProMet: 0.901 ± 0.301
2.141ProAsn: 2.141 ± 0.568
3.155ProPro: 3.155 ± 0.781
1.915ProGln: 1.915 ± 0.551
1.577ProArg: 1.577 ± 0.476
3.38ProSer: 3.38 ± 0.636
2.591ProThr: 2.591 ± 0.499
5.183ProVal: 5.183 ± 1.092
1.69ProTrp: 1.69 ± 0.495
0.676ProTyr: 0.676 ± 0.279
0.0ProXaa: 0.0 ± 0.0
Gln
4.732GlnAla: 4.732 ± 1.026
0.451GlnCys: 0.451 ± 0.245
1.69GlnAsp: 1.69 ± 0.488
1.014GlnGlu: 1.014 ± 0.333
0.901GlnPhe: 0.901 ± 0.338
2.366GlnGly: 2.366 ± 0.742
1.915GlnHis: 1.915 ± 0.662
2.591GlnIle: 2.591 ± 0.763
1.239GlnLys: 1.239 ± 0.348
3.831GlnLeu: 3.831 ± 0.729
1.127GlnMet: 1.127 ± 0.406
1.239GlnAsn: 1.239 ± 0.55
2.817GlnPro: 2.817 ± 0.622
3.38GlnGln: 3.38 ± 0.82
2.817GlnArg: 2.817 ± 0.549
1.915GlnSer: 1.915 ± 0.428
2.704GlnThr: 2.704 ± 0.631
2.591GlnVal: 2.591 ± 0.526
1.014GlnTrp: 1.014 ± 0.391
1.239GlnTyr: 1.239 ± 0.402
0.0GlnXaa: 0.0 ± 0.0
Arg
5.521ArgAla: 5.521 ± 0.949
0.338ArgCys: 0.338 ± 0.193
3.605ArgAsp: 3.605 ± 0.508
2.366ArgGlu: 2.366 ± 0.588
2.028ArgPhe: 2.028 ± 0.548
4.507ArgGly: 4.507 ± 0.837
1.239ArgHis: 1.239 ± 0.316
3.38ArgIle: 3.38 ± 0.533
2.817ArgLys: 2.817 ± 0.647
6.196ArgLeu: 6.196 ± 0.965
1.915ArgMet: 1.915 ± 0.539
2.817ArgAsn: 2.817 ± 0.67
1.465ArgPro: 1.465 ± 0.451
2.479ArgGln: 2.479 ± 0.479
4.732ArgArg: 4.732 ± 0.924
4.732ArgSer: 4.732 ± 0.774
2.704ArgThr: 2.704 ± 0.677
5.408ArgVal: 5.408 ± 1.044
0.901ArgTrp: 0.901 ± 0.334
2.028ArgTyr: 2.028 ± 0.457
0.0ArgXaa: 0.0 ± 0.0
Ser
6.76SerAla: 6.76 ± 1.145
0.563SerCys: 0.563 ± 0.262
4.619SerAsp: 4.619 ± 0.589
3.38SerGlu: 3.38 ± 0.581
2.817SerPhe: 2.817 ± 0.498
7.661SerGly: 7.661 ± 1.234
1.239SerHis: 1.239 ± 0.358
3.38SerIle: 3.38 ± 0.523
1.915SerLys: 1.915 ± 0.596
4.845SerLeu: 4.845 ± 0.566
2.704SerMet: 2.704 ± 0.41
2.141SerAsn: 2.141 ± 0.52
2.704SerPro: 2.704 ± 0.633
2.028SerGln: 2.028 ± 0.488
4.169SerArg: 4.169 ± 0.88
3.831SerSer: 3.831 ± 0.758
2.591SerThr: 2.591 ± 0.55
6.422SerVal: 6.422 ± 1.159
2.028SerTrp: 2.028 ± 0.474
0.676SerTyr: 0.676 ± 0.297
0.0SerXaa: 0.0 ± 0.0
Thr
6.309ThrAla: 6.309 ± 1.061
0.901ThrCys: 0.901 ± 0.373
3.831ThrAsp: 3.831 ± 0.612
2.366ThrGlu: 2.366 ± 0.566
2.253ThrPhe: 2.253 ± 0.669
5.633ThrGly: 5.633 ± 0.785
1.465ThrHis: 1.465 ± 0.435
4.281ThrIle: 4.281 ± 0.792
2.253ThrLys: 2.253 ± 0.517
4.732ThrLeu: 4.732 ± 1.02
0.789ThrMet: 0.789 ± 0.292
1.577ThrAsn: 1.577 ± 0.443
3.943ThrPro: 3.943 ± 0.582
2.479ThrGln: 2.479 ± 0.651
3.042ThrArg: 3.042 ± 0.49
3.943ThrSer: 3.943 ± 0.661
4.281ThrThr: 4.281 ± 1.094
5.521ThrVal: 5.521 ± 0.625
0.789ThrTrp: 0.789 ± 0.348
2.253ThrTyr: 2.253 ± 0.637
0.0ThrXaa: 0.0 ± 0.0
Val
7.886ValAla: 7.886 ± 1.536
1.127ValCys: 1.127 ± 0.419
5.408ValAsp: 5.408 ± 0.801
5.07ValGlu: 5.07 ± 0.859
2.028ValPhe: 2.028 ± 0.597
5.971ValGly: 5.971 ± 0.985
1.352ValHis: 1.352 ± 0.242
2.479ValIle: 2.479 ± 0.798
3.493ValLys: 3.493 ± 0.915
5.971ValLeu: 5.971 ± 0.959
2.929ValMet: 2.929 ± 0.684
2.817ValAsn: 2.817 ± 0.651
3.831ValPro: 3.831 ± 0.649
1.915ValGln: 1.915 ± 0.58
4.732ValArg: 4.732 ± 0.666
6.647ValSer: 6.647 ± 0.914
4.619ValThr: 4.619 ± 0.742
6.985ValVal: 6.985 ± 1.4
1.577ValTrp: 1.577 ± 0.467
1.803ValTyr: 1.803 ± 0.503
0.0ValXaa: 0.0 ± 0.0
Trp
1.577TrpAla: 1.577 ± 0.505
0.338TrpCys: 0.338 ± 0.2
0.901TrpAsp: 0.901 ± 0.295
1.014TrpGlu: 1.014 ± 0.324
0.225TrpPhe: 0.225 ± 0.145
1.239TrpGly: 1.239 ± 0.372
0.789TrpHis: 0.789 ± 0.291
1.014TrpIle: 1.014 ± 0.287
0.676TrpLys: 0.676 ± 0.287
1.352TrpLeu: 1.352 ± 0.427
0.901TrpMet: 0.901 ± 0.296
1.239TrpAsn: 1.239 ± 0.409
0.676TrpPro: 0.676 ± 0.234
1.127TrpGln: 1.127 ± 0.382
2.366TrpArg: 2.366 ± 0.615
1.352TrpSer: 1.352 ± 0.522
1.465TrpThr: 1.465 ± 0.392
0.789TrpVal: 0.789 ± 0.385
0.563TrpTrp: 0.563 ± 0.204
0.676TrpTyr: 0.676 ± 0.328
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.028TyrAla: 2.028 ± 0.391
0.451TyrCys: 0.451 ± 0.244
2.253TyrAsp: 2.253 ± 0.477
1.239TyrGlu: 1.239 ± 0.317
0.338TyrPhe: 0.338 ± 0.188
2.479TyrGly: 2.479 ± 0.477
0.789TyrHis: 0.789 ± 0.275
1.352TyrIle: 1.352 ± 0.368
0.901TyrLys: 0.901 ± 0.338
1.014TyrLeu: 1.014 ± 0.333
0.451TyrMet: 0.451 ± 0.237
1.127TyrAsn: 1.127 ± 0.369
2.141TyrPro: 2.141 ± 0.552
1.127TyrGln: 1.127 ± 0.396
2.704TyrArg: 2.704 ± 0.494
1.577TyrSer: 1.577 ± 0.416
2.366TyrThr: 2.366 ± 0.431
1.69TyrVal: 1.69 ± 0.349
0.338TyrTrp: 0.338 ± 0.21
0.901TyrTyr: 0.901 ± 0.303
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 44 proteins (8877 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski