Amino acid dipepetide frequency for Pseudomonas phage PS-1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
17.421AlaAla: 17.421 ± 1.737
1.633AlaCys: 1.633 ± 0.373
6.533AlaAsp: 6.533 ± 0.936
9.391AlaGlu: 9.391 ± 0.869
4.287AlaPhe: 4.287 ± 0.583
10.684AlaGly: 10.684 ± 0.848
2.11AlaHis: 2.11 ± 0.359
4.695AlaIle: 4.695 ± 0.598
6.533AlaLys: 6.533 ± 0.829
11.296AlaLeu: 11.296 ± 1.071
4.015AlaMet: 4.015 ± 0.484
3.471AlaAsn: 3.471 ± 0.63
4.355AlaPro: 4.355 ± 0.508
4.627AlaGln: 4.627 ± 0.657
8.506AlaArg: 8.506 ± 0.843
6.261AlaSer: 6.261 ± 0.84
6.261AlaThr: 6.261 ± 0.991
7.486AlaVal: 7.486 ± 0.667
1.429AlaTrp: 1.429 ± 0.326
2.586AlaTyr: 2.586 ± 0.34
0.0AlaXaa: 0.0 ± 0.0
Cys
1.769CysAla: 1.769 ± 0.393
0.068CysCys: 0.068 ± 0.071
0.612CysAsp: 0.612 ± 0.197
0.953CysGlu: 0.953 ± 0.277
0.34CysPhe: 0.34 ± 0.184
1.225CysGly: 1.225 ± 0.307
0.34CysHis: 0.34 ± 0.14
0.681CysIle: 0.681 ± 0.181
0.476CysLys: 0.476 ± 0.166
1.225CysLeu: 1.225 ± 0.4
0.136CysMet: 0.136 ± 0.082
0.476CysAsn: 0.476 ± 0.245
1.157CysPro: 1.157 ± 0.362
0.544CysGln: 0.544 ± 0.188
0.749CysArg: 0.749 ± 0.202
0.612CysSer: 0.612 ± 0.218
0.34CysThr: 0.34 ± 0.129
0.885CysVal: 0.885 ± 0.244
0.408CysTrp: 0.408 ± 0.173
0.408CysTyr: 0.408 ± 0.154
0.0CysXaa: 0.0 ± 0.0
Asp
7.349AspAla: 7.349 ± 0.698
0.953AspCys: 0.953 ± 0.217
3.675AspAsp: 3.675 ± 0.602
3.811AspGlu: 3.811 ± 0.576
2.042AspPhe: 2.042 ± 0.422
5.92AspGly: 5.92 ± 0.549
1.565AspHis: 1.565 ± 0.318
2.79AspIle: 2.79 ± 0.346
2.926AspLys: 2.926 ± 0.386
4.764AspLeu: 4.764 ± 0.663
1.089AspMet: 1.089 ± 0.291
1.565AspAsn: 1.565 ± 0.274
1.769AspPro: 1.769 ± 0.303
1.905AspGln: 1.905 ± 0.297
3.403AspArg: 3.403 ± 0.422
2.994AspSer: 2.994 ± 0.5
1.769AspThr: 1.769 ± 0.349
3.198AspVal: 3.198 ± 0.509
1.293AspTrp: 1.293 ± 0.322
1.429AspTyr: 1.429 ± 0.336
0.0AspXaa: 0.0 ± 0.0
Glu
8.302GluAla: 8.302 ± 0.839
0.749GluCys: 0.749 ± 0.232
2.382GluAsp: 2.382 ± 0.481
3.471GluGlu: 3.471 ± 0.548
2.246GluPhe: 2.246 ± 0.388
4.423GluGly: 4.423 ± 0.664
1.633GluHis: 1.633 ± 0.335
4.423GluIle: 4.423 ± 0.564
2.858GluLys: 2.858 ± 0.513
6.601GluLeu: 6.601 ± 0.629
1.565GluMet: 1.565 ± 0.396
2.11GluAsn: 2.11 ± 0.456
2.518GluPro: 2.518 ± 0.433
3.471GluGln: 3.471 ± 0.403
6.941GluArg: 6.941 ± 0.756
2.858GluSer: 2.858 ± 0.323
2.926GluThr: 2.926 ± 0.45
3.811GluVal: 3.811 ± 0.509
1.837GluTrp: 1.837 ± 0.339
1.701GluTyr: 1.701 ± 0.319
0.0GluXaa: 0.0 ± 0.0
Phe
4.015PheAla: 4.015 ± 0.487
0.476PheCys: 0.476 ± 0.175
2.586PheAsp: 2.586 ± 0.398
2.382PheGlu: 2.382 ± 0.33
0.817PhePhe: 0.817 ± 0.312
2.79PheGly: 2.79 ± 0.365
0.476PheHis: 0.476 ± 0.149
0.749PheIle: 0.749 ± 0.222
1.497PheLys: 1.497 ± 0.316
1.429PheLeu: 1.429 ± 0.293
0.476PheMet: 0.476 ± 0.169
0.953PheAsn: 0.953 ± 0.258
0.885PhePro: 0.885 ± 0.228
0.885PheGln: 0.885 ± 0.229
2.382PheArg: 2.382 ± 0.426
1.905PheSer: 1.905 ± 0.339
2.11PheThr: 2.11 ± 0.462
1.361PheVal: 1.361 ± 0.232
0.544PheTrp: 0.544 ± 0.192
0.817PheTyr: 0.817 ± 0.271
0.0PheXaa: 0.0 ± 0.0
Gly
7.349GlyAla: 7.349 ± 0.757
1.293GlyCys: 1.293 ± 0.313
4.423GlyAsp: 4.423 ± 0.583
4.627GlyGlu: 4.627 ± 0.743
2.654GlyPhe: 2.654 ± 0.309
5.648GlyGly: 5.648 ± 0.714
1.157GlyHis: 1.157 ± 0.272
3.947GlyIle: 3.947 ± 0.483
5.036GlyLys: 5.036 ± 0.677
5.852GlyLeu: 5.852 ± 0.624
2.314GlyMet: 2.314 ± 0.385
2.722GlyAsn: 2.722 ± 0.445
2.586GlyPro: 2.586 ± 0.463
2.79GlyGln: 2.79 ± 0.443
5.58GlyArg: 5.58 ± 0.752
4.015GlySer: 4.015 ± 0.507
4.491GlyThr: 4.491 ± 0.691
5.512GlyVal: 5.512 ± 0.67
1.701GlyTrp: 1.701 ± 0.326
3.062GlyTyr: 3.062 ± 0.379
0.0GlyXaa: 0.0 ± 0.0
His
1.973HisAla: 1.973 ± 0.361
0.136HisCys: 0.136 ± 0.093
1.633HisAsp: 1.633 ± 0.375
1.225HisGlu: 1.225 ± 0.316
0.408HisPhe: 0.408 ± 0.179
1.701HisGly: 1.701 ± 0.432
0.476HisHis: 0.476 ± 0.272
1.225HisIle: 1.225 ± 0.334
1.429HisLys: 1.429 ± 0.311
1.293HisLeu: 1.293 ± 0.317
0.476HisMet: 0.476 ± 0.181
0.408HisAsn: 0.408 ± 0.166
1.089HisPro: 1.089 ± 0.336
0.749HisGln: 0.749 ± 0.182
0.953HisArg: 0.953 ± 0.278
1.225HisSer: 1.225 ± 0.255
0.749HisThr: 0.749 ± 0.259
1.701HisVal: 1.701 ± 0.328
0.476HisTrp: 0.476 ± 0.162
1.021HisTyr: 1.021 ± 0.281
0.0HisXaa: 0.0 ± 0.0
Ile
6.397IleAla: 6.397 ± 0.835
0.817IleCys: 0.817 ± 0.195
3.334IleAsp: 3.334 ± 0.418
4.083IleGlu: 4.083 ± 0.616
0.817IlePhe: 0.817 ± 0.207
3.539IleGly: 3.539 ± 0.561
1.089IleHis: 1.089 ± 0.293
2.246IleIle: 2.246 ± 0.339
2.178IleLys: 2.178 ± 0.395
2.45IleLeu: 2.45 ± 0.449
0.612IleMet: 0.612 ± 0.232
1.837IleAsn: 1.837 ± 0.354
2.178IlePro: 2.178 ± 0.356
1.837IleGln: 1.837 ± 0.254
3.811IleArg: 3.811 ± 0.496
3.13IleSer: 3.13 ± 0.526
2.518IleThr: 2.518 ± 0.441
2.994IleVal: 2.994 ± 0.395
0.681IleTrp: 0.681 ± 0.23
1.429IleTyr: 1.429 ± 0.333
0.0IleXaa: 0.0 ± 0.0
Lys
6.533LysAla: 6.533 ± 0.726
0.681LysCys: 0.681 ± 0.227
2.314LysAsp: 2.314 ± 0.408
2.654LysGlu: 2.654 ± 0.389
1.089LysPhe: 1.089 ± 0.279
4.015LysGly: 4.015 ± 0.581
1.225LysHis: 1.225 ± 0.256
1.361LysIle: 1.361 ± 0.332
1.293LysLys: 1.293 ± 0.317
4.559LysLeu: 4.559 ± 0.585
0.817LysMet: 0.817 ± 0.237
1.633LysAsn: 1.633 ± 0.355
2.586LysPro: 2.586 ± 0.463
3.13LysGln: 3.13 ± 0.556
3.062LysArg: 3.062 ± 0.464
2.79LysSer: 2.79 ± 0.479
3.062LysThr: 3.062 ± 0.455
3.334LysVal: 3.334 ± 0.482
0.544LysTrp: 0.544 ± 0.175
1.089LysTyr: 1.089 ± 0.285
0.0LysXaa: 0.0 ± 0.0
Leu
9.391LeuAla: 9.391 ± 1.067
1.293LeuCys: 1.293 ± 0.282
4.764LeuAsp: 4.764 ± 0.518
6.056LeuGlu: 6.056 ± 0.758
2.246LeuPhe: 2.246 ± 0.418
5.376LeuGly: 5.376 ± 0.595
1.769LeuHis: 1.769 ± 0.339
3.607LeuIle: 3.607 ± 0.475
3.539LeuLys: 3.539 ± 0.398
6.465LeuLeu: 6.465 ± 0.652
1.973LeuMet: 1.973 ± 0.354
2.994LeuAsn: 2.994 ± 0.353
4.355LeuPro: 4.355 ± 0.48
2.994LeuGln: 2.994 ± 0.494
6.465LeuArg: 6.465 ± 0.547
5.92LeuSer: 5.92 ± 0.639
4.764LeuThr: 4.764 ± 0.748
4.695LeuVal: 4.695 ± 0.617
1.293LeuTrp: 1.293 ± 0.319
1.905LeuTyr: 1.905 ± 0.321
0.0LeuXaa: 0.0 ± 0.0
Met
2.994MetAla: 2.994 ± 0.61
0.204MetCys: 0.204 ± 0.136
1.089MetAsp: 1.089 ± 0.296
1.225MetGlu: 1.225 ± 0.272
0.476MetPhe: 0.476 ± 0.169
1.905MetGly: 1.905 ± 0.334
0.612MetHis: 0.612 ± 0.194
1.089MetIle: 1.089 ± 0.267
1.497MetLys: 1.497 ± 0.339
2.042MetLeu: 2.042 ± 0.363
0.612MetMet: 0.612 ± 0.207
1.565MetAsn: 1.565 ± 0.382
0.885MetPro: 0.885 ± 0.227
1.225MetGln: 1.225 ± 0.35
2.178MetArg: 2.178 ± 0.396
2.314MetSer: 2.314 ± 0.406
2.722MetThr: 2.722 ± 0.386
1.089MetVal: 1.089 ± 0.271
0.136MetTrp: 0.136 ± 0.099
0.136MetTyr: 0.136 ± 0.085
0.0MetXaa: 0.0 ± 0.0
Asn
4.083AsnAla: 4.083 ± 0.62
0.34AsnCys: 0.34 ± 0.13
1.225AsnAsp: 1.225 ± 0.242
1.633AsnGlu: 1.633 ± 0.358
0.612AsnPhe: 0.612 ± 0.21
3.266AsnGly: 3.266 ± 0.469
0.817AsnHis: 0.817 ± 0.266
1.361AsnIle: 1.361 ± 0.585
1.361AsnLys: 1.361 ± 0.379
3.062AsnLeu: 3.062 ± 0.438
1.021AsnMet: 1.021 ± 0.247
1.565AsnAsn: 1.565 ± 0.394
1.497AsnPro: 1.497 ± 0.298
1.089AsnGln: 1.089 ± 0.273
1.837AsnArg: 1.837 ± 0.409
2.926AsnSer: 2.926 ± 0.419
2.042AsnThr: 2.042 ± 0.402
2.178AsnVal: 2.178 ± 0.361
0.34AsnTrp: 0.34 ± 0.143
0.953AsnTyr: 0.953 ± 0.262
0.0AsnXaa: 0.0 ± 0.0
Pro
5.58ProAla: 5.58 ± 0.706
0.749ProCys: 0.749 ± 0.255
2.79ProAsp: 2.79 ± 0.412
3.13ProGlu: 3.13 ± 0.517
1.225ProPhe: 1.225 ± 0.346
2.926ProGly: 2.926 ± 0.536
1.293ProHis: 1.293 ± 0.286
2.11ProIle: 2.11 ± 0.345
1.905ProLys: 1.905 ± 0.347
2.858ProLeu: 2.858 ± 0.559
1.225ProMet: 1.225 ± 0.29
1.225ProAsn: 1.225 ± 0.288
1.293ProPro: 1.293 ± 0.339
2.11ProGln: 2.11 ± 0.483
2.178ProArg: 2.178 ± 0.391
2.586ProSer: 2.586 ± 0.45
2.178ProThr: 2.178 ± 0.384
3.062ProVal: 3.062 ± 0.37
1.089ProTrp: 1.089 ± 0.29
0.885ProTyr: 0.885 ± 0.252
0.0ProXaa: 0.0 ± 0.0
Gln
5.036GlnAla: 5.036 ± 0.603
0.544GlnCys: 0.544 ± 0.183
1.497GlnAsp: 1.497 ± 0.358
2.042GlnGlu: 2.042 ± 0.371
1.633GlnPhe: 1.633 ± 0.335
2.926GlnGly: 2.926 ± 0.385
0.476GlnHis: 0.476 ± 0.156
2.314GlnIle: 2.314 ± 0.365
1.837GlnLys: 1.837 ± 0.317
3.879GlnLeu: 3.879 ± 0.518
1.701GlnMet: 1.701 ± 0.397
0.953GlnAsn: 0.953 ± 0.259
1.973GlnPro: 1.973 ± 0.336
3.403GlnGln: 3.403 ± 0.462
3.198GlnArg: 3.198 ± 0.5
2.314GlnSer: 2.314 ± 0.444
1.769GlnThr: 1.769 ± 0.384
2.654GlnVal: 2.654 ± 0.354
1.021GlnTrp: 1.021 ± 0.246
1.497GlnTyr: 1.497 ± 0.326
0.0GlnXaa: 0.0 ± 0.0
Arg
7.213ArgAla: 7.213 ± 0.724
0.885ArgCys: 0.885 ± 0.268
4.151ArgAsp: 4.151 ± 0.546
5.444ArgGlu: 5.444 ± 0.703
2.314ArgPhe: 2.314 ± 0.406
4.968ArgGly: 4.968 ± 0.581
1.497ArgHis: 1.497 ± 0.324
3.743ArgIle: 3.743 ± 0.475
3.539ArgLys: 3.539 ± 0.547
6.941ArgLeu: 6.941 ± 0.685
2.586ArgMet: 2.586 ± 0.397
2.042ArgAsn: 2.042 ± 0.331
2.79ArgPro: 2.79 ± 0.493
3.539ArgGln: 3.539 ± 0.489
5.104ArgArg: 5.104 ± 0.722
3.743ArgSer: 3.743 ± 0.466
2.654ArgThr: 2.654 ± 0.362
4.423ArgVal: 4.423 ± 0.528
1.361ArgTrp: 1.361 ± 0.304
2.11ArgTyr: 2.11 ± 0.348
0.0ArgXaa: 0.0 ± 0.0
Ser
7.622SerAla: 7.622 ± 1.108
0.749SerCys: 0.749 ± 0.245
3.266SerAsp: 3.266 ± 0.446
3.743SerGlu: 3.743 ± 0.435
1.565SerPhe: 1.565 ± 0.303
4.764SerGly: 4.764 ± 0.62
1.021SerHis: 1.021 ± 0.262
2.858SerIle: 2.858 ± 0.488
3.334SerLys: 3.334 ± 0.531
4.151SerLeu: 4.151 ± 0.536
1.565SerMet: 1.565 ± 0.333
2.722SerAsn: 2.722 ± 0.515
1.905SerPro: 1.905 ± 0.373
2.042SerGln: 2.042 ± 0.348
4.151SerArg: 4.151 ± 0.425
3.13SerSer: 3.13 ± 0.568
3.607SerThr: 3.607 ± 0.701
4.559SerVal: 4.559 ± 0.458
1.157SerTrp: 1.157 ± 0.235
1.837SerTyr: 1.837 ± 0.459
0.0SerXaa: 0.0 ± 0.0
Thr
7.077ThrAla: 7.077 ± 0.915
0.476ThrCys: 0.476 ± 0.183
3.13ThrAsp: 3.13 ± 0.496
3.13ThrGlu: 3.13 ± 0.521
1.905ThrPhe: 1.905 ± 0.307
4.832ThrGly: 4.832 ± 0.558
0.953ThrHis: 0.953 ± 0.284
3.471ThrIle: 3.471 ± 0.51
2.314ThrLys: 2.314 ± 0.411
4.423ThrLeu: 4.423 ± 0.621
0.749ThrMet: 0.749 ± 0.222
1.497ThrAsn: 1.497 ± 0.414
2.994ThrPro: 2.994 ± 0.544
2.042ThrGln: 2.042 ± 0.343
2.654ThrArg: 2.654 ± 0.424
3.266ThrSer: 3.266 ± 0.409
3.062ThrThr: 3.062 ± 0.689
2.654ThrVal: 2.654 ± 0.519
1.225ThrTrp: 1.225 ± 0.245
1.837ThrTyr: 1.837 ± 0.275
0.0ThrXaa: 0.0 ± 0.0
Val
7.077ValAla: 7.077 ± 0.643
0.885ValCys: 0.885 ± 0.26
4.151ValAsp: 4.151 ± 0.39
5.104ValGlu: 5.104 ± 0.685
1.905ValPhe: 1.905 ± 0.339
3.539ValGly: 3.539 ± 0.581
0.749ValHis: 0.749 ± 0.31
3.403ValIle: 3.403 ± 0.527
2.586ValLys: 2.586 ± 0.346
4.9ValLeu: 4.9 ± 0.614
2.042ValMet: 2.042 ± 0.38
2.042ValAsn: 2.042 ± 0.344
3.062ValPro: 3.062 ± 0.48
2.246ValGln: 2.246 ± 0.362
4.015ValArg: 4.015 ± 0.535
3.743ValSer: 3.743 ± 0.573
4.355ValThr: 4.355 ± 0.649
4.151ValVal: 4.151 ± 0.518
1.021ValTrp: 1.021 ± 0.221
1.021ValTyr: 1.021 ± 0.277
0.0ValXaa: 0.0 ± 0.0
Trp
2.722TrpAla: 2.722 ± 0.482
0.272TrpCys: 0.272 ± 0.124
1.429TrpAsp: 1.429 ± 0.266
1.089TrpGlu: 1.089 ± 0.265
0.272TrpPhe: 0.272 ± 0.126
0.408TrpGly: 0.408 ± 0.155
0.544TrpHis: 0.544 ± 0.186
1.225TrpIle: 1.225 ± 0.3
0.749TrpLys: 0.749 ± 0.22
1.769TrpLeu: 1.769 ± 0.35
0.408TrpMet: 0.408 ± 0.176
0.544TrpAsn: 0.544 ± 0.171
1.089TrpPro: 1.089 ± 0.338
0.885TrpGln: 0.885 ± 0.262
1.769TrpArg: 1.769 ± 0.331
1.225TrpSer: 1.225 ± 0.299
0.476TrpThr: 0.476 ± 0.187
0.612TrpVal: 0.612 ± 0.217
0.34TrpTrp: 0.34 ± 0.13
0.408TrpTyr: 0.408 ± 0.144
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.675TyrAla: 3.675 ± 0.478
0.204TyrCys: 0.204 ± 0.133
1.225TyrAsp: 1.225 ± 0.319
1.633TyrGlu: 1.633 ± 0.312
0.681TyrPhe: 0.681 ± 0.357
1.905TyrGly: 1.905 ± 0.306
0.544TyrHis: 0.544 ± 0.225
0.681TyrIle: 0.681 ± 0.235
0.885TyrLys: 0.885 ± 0.237
1.905TyrLeu: 1.905 ± 0.241
0.544TyrMet: 0.544 ± 0.162
0.885TyrAsn: 0.885 ± 0.239
1.429TyrPro: 1.429 ± 0.335
1.157TyrGln: 1.157 ± 0.289
2.178TyrArg: 2.178 ± 0.354
2.79TyrSer: 2.79 ± 0.362
1.837TyrThr: 1.837 ± 0.329
1.633TyrVal: 1.633 ± 0.296
0.34TyrTrp: 0.34 ± 0.123
0.612TyrTyr: 0.612 ± 0.196
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 79 proteins (14696 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski