Amino acid dipeptide frequency for Pseudomonas phage vB_PaeP_C1-14_Or

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequencies.
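
As a rough illustration of how such frequencies could be computed, here is a minimal Python sketch. The function name, the one-letter input alphabet, the use of overlapping windows within each protein, and the normalisation by the total number of counted dipeptides are all assumptions made for this example; the database may use a different procedure or denominator.

```python
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs (incl. X)."""
    alphabet = STANDARD + "X"
    counts = {a + b: 0 for a, b in product(alphabet, repeat=2)}
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {pair: 0.0 for pair in counts}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


freqs = dipeptide_frequencies(["AAC", "ACB"])  # B is non-standard, so it becomes X
print(freqs["AC"])  # 500.0 (2 of the 4 counted dipeptides)
```

Counting within each protein separately avoids creating artificial dipeptides at the junction between two sequences.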

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be expected at a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
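
The uniform expectation can be checked with a few lines of Python. This is only a sketch: the function names are hypothetical, and the plain comparison against the uniform value is an assumption, since the page does not state the exact rule used for the red and blue marking (for example, whether the error estimates are taken into account). Values are handled in the per mille units used by the table, where 0.25% corresponds to 2.5‰.

```python
def expected_uniform_per_mille(n_residue_types=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / n_residue_types ** 2


def classify(value_per_mille, n_residue_types=20):
    """Label a frequency relative to the uniform expectation."""
    expected = expected_uniform_per_mille(n_residue_types)
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


print(expected_uniform_per_mille(20))            # 2.5
print(round(expected_uniform_per_mille(21), 3))  # 2.268 (441 pairs with Xaa)
print(classify(9.044))  # AlaAla from the table: over
print(classify(0.14))   # CysCys from the table: under
```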

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
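
For example, the unit conversion can be written as the following short Python sketch (the helper names are hypothetical and only serve this illustration):

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 per mille = 0.1 %)."""
    return value * 0.1


# AlaAla is listed as 9.044 per mille in the table.
print(f"{per_mille_to_fraction(9.044):.6f}")  # 0.009044
print(f"{per_mille_to_percent(9.044):.4f}")   # 0.9044
```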

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.044 ± 1.527
AlaCys: 1.052 ± 0.363
AlaAsp: 4.417 ± 0.527
AlaGlu: 5.048 ± 0.871
AlaPhe: 2.664 ± 0.38
AlaGly: 6.45 ± 1.069
AlaHis: 1.192 ± 0.309
AlaIle: 4.277 ± 0.583
AlaLys: 4.627 ± 0.614
AlaLeu: 8.203 ± 0.79
AlaMet: 2.594 ± 0.466
AlaAsn: 3.365 ± 0.486
AlaPro: 2.594 ± 0.569
AlaGln: 3.646 ± 0.868
AlaArg: 3.576 ± 0.546
AlaSer: 5.399 ± 0.785
AlaThr: 3.786 ± 0.503
AlaVal: 6.17 ± 0.681
AlaTrp: 0.911 ± 0.269
AlaTyr: 2.594 ± 0.479
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.841 ± 0.227
CysCys: 0.14 ± 0.094
CysAsp: 0.421 ± 0.19
CysGlu: 0.701 ± 0.192
CysPhe: 0.14 ± 0.104
CysGly: 1.332 ± 0.354
CysHis: 0.491 ± 0.173
CysIle: 0.491 ± 0.191
CysLys: 0.351 ± 0.195
CysLeu: 0.771 ± 0.223
CysMet: 0.14 ± 0.094
CysAsn: 0.631 ± 0.201
CysPro: 0.631 ± 0.22
CysGln: 0.421 ± 0.175
CysArg: 0.911 ± 0.264
CysSer: 0.561 ± 0.192
CysThr: 0.491 ± 0.17
CysVal: 0.701 ± 0.271
CysTrp: 0.14 ± 0.086
CysTyr: 0.771 ± 0.217
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.188 ± 0.58
AspCys: 0.491 ± 0.202
AspAsp: 3.786 ± 0.526
AspGlu: 4.908 ± 0.787
AspPhe: 2.314 ± 0.411
AspGly: 4.417 ± 0.618
AspHis: 1.683 ± 0.399
AspIle: 3.856 ± 0.503
AspLys: 2.664 ± 0.574
AspLeu: 5.679 ± 0.497
AspMet: 1.753 ± 0.309
AspAsn: 2.103 ± 0.46
AspPro: 3.506 ± 0.537
AspGln: 1.472 ± 0.273
AspArg: 3.506 ± 0.524
AspSer: 2.875 ± 0.503
AspThr: 3.155 ± 0.467
AspVal: 2.875 ± 0.473
AspTrp: 1.823 ± 0.339
AspTyr: 1.963 ± 0.389
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.959 ± 0.864
GluCys: 0.561 ± 0.19
GluAsp: 5.469 ± 0.638
GluGlu: 6.1 ± 1.232
GluPhe: 2.033 ± 0.347
GluGly: 6.17 ± 0.713
GluHis: 1.262 ± 0.328
GluIle: 2.524 ± 0.374
GluLys: 3.295 ± 0.575
GluLeu: 5.959 ± 0.676
GluMet: 2.244 ± 0.377
GluAsn: 2.454 ± 0.348
GluPro: 1.893 ± 0.409
GluGln: 2.454 ± 0.567
GluArg: 5.188 ± 0.626
GluSer: 2.945 ± 0.476
GluThr: 3.295 ± 0.523
GluVal: 7.221 ± 0.688
GluTrp: 0.771 ± 0.238
GluTyr: 1.823 ± 0.351
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.244 ± 0.414
PheCys: 0.491 ± 0.189
PheAsp: 2.314 ± 0.446
PheGlu: 1.823 ± 0.333
PhePhe: 1.613 ± 0.363
PheGly: 4.066 ± 0.5
PheHis: 0.631 ± 0.226
PheIle: 2.173 ± 0.435
PheLys: 2.033 ± 0.363
PheLeu: 3.506 ± 0.46
PheMet: 0.982 ± 0.203
PheAsn: 1.332 ± 0.272
PhePro: 1.332 ± 0.369
PheGln: 1.472 ± 0.339
PheArg: 1.613 ± 0.445
PheSer: 2.524 ± 0.327
PheThr: 2.314 ± 0.325
PheVal: 3.085 ± 0.494
PheTrp: 0.771 ± 0.294
PheTyr: 1.262 ± 0.364
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.292 ± 1.167
GlyCys: 1.122 ± 0.297
GlyAsp: 5.188 ± 0.642
GlyGlu: 4.417 ± 0.607
GlyPhe: 2.875 ± 0.465
GlyGly: 7.081 ± 1.183
GlyHis: 2.314 ± 0.47
GlyIle: 4.557 ± 0.59
GlyLys: 4.557 ± 0.642
GlyLeu: 6.17 ± 0.723
GlyMet: 2.524 ± 0.401
GlyAsn: 4.207 ± 0.562
GlyPro: 2.875 ± 0.514
GlyGln: 3.225 ± 0.468
GlyArg: 4.627 ± 0.504
GlySer: 6.52 ± 0.91
GlyThr: 4.347 ± 0.614
GlyVal: 5.889 ± 0.558
GlyTrp: 1.052 ± 0.255
GlyTyr: 2.314 ± 0.382
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.192 ± 0.269
HisCys: 0.28 ± 0.142
HisAsp: 0.631 ± 0.229
HisGlu: 1.542 ± 0.381
HisPhe: 0.911 ± 0.258
HisGly: 1.472 ± 0.29
HisHis: 0.491 ± 0.216
HisIle: 1.332 ± 0.328
HisLys: 1.122 ± 0.279
HisLeu: 2.384 ± 0.504
HisMet: 0.491 ± 0.189
HisAsn: 0.631 ± 0.214
HisPro: 0.911 ± 0.262
HisGln: 0.771 ± 0.232
HisArg: 1.262 ± 0.252
HisSer: 0.771 ± 0.213
HisThr: 1.122 ± 0.355
HisVal: 0.982 ± 0.246
HisTrp: 0.701 ± 0.272
HisTyr: 0.561 ± 0.223
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.506 ± 0.507
IleCys: 0.421 ± 0.167
IleAsp: 2.594 ± 0.538
IleGlu: 2.804 ± 0.485
IlePhe: 2.314 ± 0.409
IleGly: 4.978 ± 0.575
IleHis: 0.982 ± 0.265
IleIle: 2.875 ± 0.451
IleLys: 3.085 ± 0.553
IleLeu: 3.996 ± 0.476
IleMet: 0.982 ± 0.27
IleAsn: 1.683 ± 0.365
IlePro: 3.365 ± 0.534
IleGln: 2.734 ± 0.454
IleArg: 3.365 ± 0.508
IleSer: 2.875 ± 0.406
IleThr: 2.314 ± 0.389
IleVal: 3.225 ± 0.6
IleTrp: 0.491 ± 0.171
IleTyr: 2.314 ± 0.404
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.627 ± 0.697
LysCys: 0.421 ± 0.193
LysAsp: 4.066 ± 0.508
LysGlu: 3.856 ± 0.503
LysPhe: 1.963 ± 0.467
LysGly: 3.646 ± 0.609
LysHis: 1.402 ± 0.324
LysIle: 2.314 ± 0.391
LysLys: 3.015 ± 0.432
LysLeu: 4.066 ± 0.643
LysMet: 1.613 ± 0.344
LysAsn: 2.033 ± 0.443
LysPro: 2.594 ± 0.523
LysGln: 1.893 ± 0.293
LysArg: 3.155 ± 0.553
LysSer: 3.015 ± 0.487
LysThr: 2.594 ± 0.351
LysVal: 4.066 ± 0.704
LysTrp: 1.192 ± 0.329
LysTyr: 2.524 ± 0.422
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.59 ± 0.585
LeuCys: 0.701 ± 0.216
LeuAsp: 5.118 ± 0.547
LeuGlu: 6.31 ± 0.81
LeuPhe: 3.015 ± 0.49
LeuGly: 6.24 ± 0.767
LeuHis: 1.122 ± 0.334
LeuIle: 3.576 ± 0.45
LeuLys: 5.399 ± 0.524
LeuLeu: 7.221 ± 0.652
LeuMet: 3.365 ± 0.702
LeuAsn: 3.716 ± 0.464
LeuPro: 3.856 ± 0.586
LeuGln: 3.856 ± 0.667
LeuArg: 6.52 ± 0.655
LeuSer: 5.188 ± 0.629
LeuThr: 4.066 ± 0.544
LeuVal: 5.609 ± 0.733
LeuTrp: 0.911 ± 0.269
LeuTyr: 2.524 ± 0.489
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.295 ± 0.554
MetCys: 0.21 ± 0.121
MetAsp: 1.472 ± 0.279
MetGlu: 1.963 ± 0.357
MetPhe: 0.982 ± 0.245
MetGly: 2.244 ± 0.501
MetHis: 0.351 ± 0.145
MetIle: 1.542 ± 0.313
MetLys: 2.103 ± 0.327
MetLeu: 1.963 ± 0.316
MetMet: 0.841 ± 0.229
MetAsn: 1.262 ± 0.312
MetPro: 1.542 ± 0.276
MetGln: 0.841 ± 0.264
MetArg: 1.402 ± 0.316
MetSer: 2.244 ± 0.411
MetThr: 1.963 ± 0.394
MetVal: 2.103 ± 0.451
MetTrp: 0.561 ± 0.213
MetTyr: 0.701 ± 0.209
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.875 ± 0.471
AsnCys: 0.561 ± 0.212
AsnAsp: 2.103 ± 0.276
AsnGlu: 2.945 ± 0.451
AsnPhe: 1.683 ± 0.383
AsnGly: 4.137 ± 0.58
AsnHis: 0.982 ± 0.254
AsnIle: 3.085 ± 0.52
AsnLys: 1.472 ± 0.367
AsnLeu: 3.506 ± 0.518
AsnMet: 0.841 ± 0.248
AsnAsn: 1.753 ± 0.429
AsnPro: 2.244 ± 0.439
AsnGln: 1.753 ± 0.376
AsnArg: 2.664 ± 0.394
AsnSer: 1.963 ± 0.409
AsnThr: 2.664 ± 0.455
AsnVal: 2.804 ± 0.451
AsnTrp: 0.911 ± 0.231
AsnTyr: 1.613 ± 0.309
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.875 ± 0.456
ProCys: 0.351 ± 0.153
ProAsp: 2.804 ± 0.51
ProGlu: 4.697 ± 0.732
ProPhe: 2.244 ± 0.297
ProGly: 2.945 ± 0.462
ProHis: 0.841 ± 0.241
ProIle: 1.753 ± 0.38
ProLys: 3.015 ± 0.449
ProLeu: 3.155 ± 0.439
ProMet: 1.262 ± 0.288
ProAsn: 1.893 ± 0.381
ProPro: 1.753 ± 0.505
ProGln: 2.173 ± 0.354
ProArg: 1.823 ± 0.42
ProSer: 2.664 ± 0.502
ProThr: 2.314 ± 0.427
ProVal: 2.945 ± 0.495
ProTrp: 0.982 ± 0.252
ProTyr: 1.472 ± 0.341
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.048 ± 0.617
GlnCys: 0.28 ± 0.135
GlnAsp: 2.314 ± 0.409
GlnGlu: 3.786 ± 0.521
GlnPhe: 0.841 ± 0.215
GlnGly: 3.435 ± 0.606
GlnHis: 0.351 ± 0.177
GlnIle: 1.472 ± 0.299
GlnLys: 1.613 ± 0.308
GlnLeu: 3.225 ± 0.607
GlnMet: 1.192 ± 0.383
GlnAsn: 1.683 ± 0.457
GlnPro: 1.122 ± 0.293
GlnGln: 1.542 ± 0.4
GlnArg: 2.875 ± 0.459
GlnSer: 2.173 ± 0.299
GlnThr: 1.613 ± 0.384
GlnVal: 2.314 ± 0.361
GlnTrp: 0.771 ± 0.247
GlnTyr: 1.192 ± 0.261
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.277 ± 0.719
ArgCys: 0.561 ± 0.209
ArgAsp: 3.155 ± 0.599
ArgGlu: 3.015 ± 0.558
ArgPhe: 2.594 ± 0.358
ArgGly: 4.838 ± 0.597
ArgHis: 1.192 ± 0.316
ArgIle: 3.576 ± 0.444
ArgLys: 3.435 ± 0.51
ArgLeu: 5.679 ± 0.744
ArgMet: 2.103 ± 0.437
ArgAsn: 3.435 ± 0.502
ArgPro: 2.734 ± 0.413
ArgGln: 2.454 ± 0.442
ArgArg: 3.435 ± 0.537
ArgSer: 3.015 ± 0.465
ArgThr: 2.804 ± 0.519
ArgVal: 4.487 ± 0.643
ArgTrp: 0.982 ± 0.275
ArgTyr: 1.753 ± 0.384
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.487 ± 0.662
SerCys: 0.911 ± 0.275
SerAsp: 3.365 ± 0.441
SerGlu: 4.417 ± 0.54
SerPhe: 2.314 ± 0.513
SerGly: 5.819 ± 0.53
SerHis: 1.122 ± 0.266
SerIle: 3.506 ± 0.462
SerLys: 2.734 ± 0.521
SerLeu: 4.627 ± 0.522
SerMet: 1.963 ± 0.396
SerAsn: 2.594 ± 0.473
SerPro: 2.524 ± 0.408
SerGln: 2.524 ± 0.325
SerArg: 2.945 ± 0.532
SerSer: 4.137 ± 0.613
SerThr: 2.384 ± 0.461
SerVal: 4.137 ± 0.667
SerTrp: 1.052 ± 0.277
SerTyr: 1.823 ± 0.467
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.066 ± 0.6
ThrCys: 0.351 ± 0.148
ThrAsp: 2.244 ± 0.437
ThrGlu: 3.295 ± 0.494
ThrPhe: 2.384 ± 0.402
ThrGly: 3.996 ± 0.66
ThrHis: 0.841 ± 0.22
ThrIle: 2.384 ± 0.438
ThrLys: 2.454 ± 0.378
ThrLeu: 5.258 ± 0.551
ThrMet: 1.472 ± 0.294
ThrAsn: 1.893 ± 0.38
ThrPro: 3.646 ± 0.508
ThrGln: 2.244 ± 0.388
ThrArg: 2.875 ± 0.346
ThrSer: 3.576 ± 0.634
ThrThr: 2.945 ± 0.446
ThrVal: 3.435 ± 0.5
ThrTrp: 0.911 ± 0.288
ThrTyr: 1.683 ± 0.369
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.539 ± 0.67
ValCys: 1.192 ± 0.313
ValAsp: 4.347 ± 0.524
ValGlu: 4.627 ± 0.655
ValPhe: 3.015 ± 0.503
ValGly: 5.679 ± 0.612
ValHis: 1.613 ± 0.387
ValIle: 3.085 ± 0.511
ValLys: 3.926 ± 0.453
ValLeu: 5.328 ± 0.895
ValMet: 1.963 ± 0.421
ValAsn: 3.506 ± 0.57
ValPro: 2.173 ± 0.405
ValGln: 1.823 ± 0.314
ValArg: 4.137 ± 0.451
ValSer: 4.697 ± 0.623
ValThr: 4.557 ± 0.506
ValVal: 5.539 ± 0.619
ValTrp: 1.122 ± 0.273
ValTyr: 3.085 ± 0.484
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.192 ± 0.263
TrpCys: 0.421 ± 0.205
TrpAsp: 1.402 ± 0.305
TrpGlu: 1.262 ± 0.272
TrpPhe: 0.561 ± 0.228
TrpGly: 1.542 ± 0.293
TrpHis: 0.0 ± 0.0
TrpIle: 0.771 ± 0.228
TrpLys: 1.542 ± 0.361
TrpLeu: 1.122 ± 0.251
TrpMet: 0.561 ± 0.171
TrpAsn: 0.841 ± 0.241
TrpPro: 0.701 ± 0.234
TrpGln: 0.351 ± 0.152
TrpArg: 1.332 ± 0.279
TrpSer: 0.631 ± 0.258
TrpThr: 0.982 ± 0.252
TrpVal: 0.701 ± 0.219
TrpTrp: 0.28 ± 0.127
TrpTyr: 0.561 ± 0.184
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.472 ± 0.3
TyrCys: 0.561 ± 0.194
TyrAsp: 2.804 ± 0.45
TyrGlu: 2.103 ± 0.421
TyrPhe: 1.122 ± 0.249
TyrGly: 2.664 ± 0.361
TyrHis: 0.701 ± 0.207
TyrIle: 1.753 ± 0.333
TyrLys: 1.613 ± 0.336
TyrLeu: 3.085 ± 0.649
TyrMet: 0.631 ± 0.218
TyrAsn: 1.542 ± 0.314
TyrPro: 2.033 ± 0.404
TyrGln: 1.192 ± 0.247
TyrArg: 2.314 ± 0.352
TyrSer: 1.613 ± 0.359
TyrThr: 2.244 ± 0.385
TyrVal: 2.664 ± 0.481
TyrTrp: 0.421 ± 0.181
TyrTyr: 1.472 ± 0.333
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 64 proteins (14264 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
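
A protein-level bootstrap of this kind could look like the following Python sketch. It is an illustration under stated assumptions, not the database's actual code: the function names, the fixed random seed, the one-letter sequence input, and the use of the standard deviation of the resampled estimates as the reported error are all choices made for this example.

```python
import random
import statistics


def pair_frequency(proteins, pair):
    """Per mille frequency of one dipeptide over a list of sequences."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Draw whole proteins with replacement, keeping the sample size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


toy_proteome = ["MAALK", "MKAAAG", "MGSLLA", "MAAV"]  # made-up sequences
print(pair_frequency(toy_proteome, "AA"), bootstrap_error(toy_proteome, "AA"))
```

Resampling whole proteins rather than individual dipeptides keeps the dependence between neighbouring residues within a protein intact. A dipeptide that never occurs yields 0.0 in every resample, which is consistent with the 0.0 ± 0.0 entries in the table.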

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski