Amino acid dipepetide frequency for Synechococcus phage S-CBS2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.221AlaAla: 11.221 ± 1.032
1.016AlaCys: 1.016 ± 0.345
5.541AlaAsp: 5.541 ± 0.468
5.865AlaGlu: 5.865 ± 0.757
3.094AlaPhe: 3.094 ± 0.414
8.266AlaGly: 8.266 ± 0.859
1.293AlaHis: 1.293 ± 0.235
4.618AlaIle: 4.618 ± 0.538
4.895AlaLys: 4.895 ± 0.689
8.543AlaLeu: 8.543 ± 0.588
2.355AlaMet: 2.355 ± 0.313
4.341AlaAsn: 4.341 ± 0.437
3.694AlaPro: 3.694 ± 0.485
3.971AlaGln: 3.971 ± 0.451
5.08AlaArg: 5.08 ± 0.598
5.495AlaSer: 5.495 ± 0.666
6.973AlaThr: 6.973 ± 0.659
5.218AlaVal: 5.218 ± 0.424
1.847AlaTrp: 1.847 ± 0.312
3.233AlaTyr: 3.233 ± 0.346
0.0AlaXaa: 0.0 ± 0.0
Cys
1.108CysAla: 1.108 ± 0.348
0.231CysCys: 0.231 ± 0.169
0.831CysAsp: 0.831 ± 0.288
0.647CysGlu: 0.647 ± 0.187
0.508CysPhe: 0.508 ± 0.16
1.293CysGly: 1.293 ± 0.44
0.277CysHis: 0.277 ± 0.118
0.323CysIle: 0.323 ± 0.178
0.462CysLys: 0.462 ± 0.138
0.924CysLeu: 0.924 ± 0.239
0.185CysMet: 0.185 ± 0.109
0.6CysAsn: 0.6 ± 0.178
0.6CysPro: 0.6 ± 0.204
0.554CysGln: 0.554 ± 0.157
0.785CysArg: 0.785 ± 0.169
1.108CysSer: 1.108 ± 0.383
0.647CysThr: 0.647 ± 0.298
0.739CysVal: 0.739 ± 0.251
0.185CysTrp: 0.185 ± 0.101
0.6CysTyr: 0.6 ± 0.199
0.0CysXaa: 0.0 ± 0.0
Asp
5.403AspAla: 5.403 ± 0.511
0.693AspCys: 0.693 ± 0.333
3.002AspAsp: 3.002 ± 0.398
4.202AspGlu: 4.202 ± 0.518
2.124AspPhe: 2.124 ± 0.317
6.419AspGly: 6.419 ± 0.698
1.108AspHis: 1.108 ± 0.267
2.632AspIle: 2.632 ± 0.313
2.678AspLys: 2.678 ± 0.377
5.865AspLeu: 5.865 ± 0.604
1.478AspMet: 1.478 ± 0.279
2.309AspAsn: 2.309 ± 0.305
3.002AspPro: 3.002 ± 0.396
2.078AspGln: 2.078 ± 0.29
3.094AspArg: 3.094 ± 0.413
3.279AspSer: 3.279 ± 0.392
3.51AspThr: 3.51 ± 0.473
3.879AspVal: 3.879 ± 0.445
1.016AspTrp: 1.016 ± 0.22
2.309AspTyr: 2.309 ± 0.311
0.0AspXaa: 0.0 ± 0.0
Glu
7.527GluAla: 7.527 ± 0.955
0.785GluCys: 0.785 ± 0.277
3.186GluAsp: 3.186 ± 0.406
4.018GluGlu: 4.018 ± 0.514
2.494GluPhe: 2.494 ± 0.33
4.664GluGly: 4.664 ± 0.488
1.154GluHis: 1.154 ± 0.312
3.74GluIle: 3.74 ± 0.383
2.678GluLys: 2.678 ± 0.417
6.696GluLeu: 6.696 ± 0.547
1.247GluMet: 1.247 ± 0.247
1.662GluAsn: 1.662 ± 0.262
2.586GluPro: 2.586 ± 0.386
3.463GluGln: 3.463 ± 0.482
3.556GluArg: 3.556 ± 0.441
3.51GluSer: 3.51 ± 0.314
3.048GluThr: 3.048 ± 0.371
3.463GluVal: 3.463 ± 0.442
1.247GluTrp: 1.247 ± 0.263
1.755GluTyr: 1.755 ± 0.314
0.0GluXaa: 0.0 ± 0.0
Phe
2.494PheAla: 2.494 ± 0.365
0.554PheCys: 0.554 ± 0.165
2.17PheAsp: 2.17 ± 0.263
1.755PheGlu: 1.755 ± 0.273
1.108PhePhe: 1.108 ± 0.252
2.771PheGly: 2.771 ± 0.402
0.6PheHis: 0.6 ± 0.175
1.986PheIle: 1.986 ± 0.305
1.478PheLys: 1.478 ± 0.333
2.586PheLeu: 2.586 ± 0.357
1.201PheMet: 1.201 ± 0.254
1.755PheAsn: 1.755 ± 0.3
1.385PhePro: 1.385 ± 0.278
1.293PheGln: 1.293 ± 0.237
1.709PheArg: 1.709 ± 0.278
2.909PheSer: 2.909 ± 0.337
1.801PheThr: 1.801 ± 0.265
2.078PheVal: 2.078 ± 0.315
0.508PheTrp: 0.508 ± 0.118
1.108PheTyr: 1.108 ± 0.212
0.0PheXaa: 0.0 ± 0.0
Gly
7.712GlyAla: 7.712 ± 0.734
1.062GlyCys: 1.062 ± 0.295
4.572GlyAsp: 4.572 ± 0.488
4.987GlyGlu: 4.987 ± 0.569
3.094GlyPhe: 3.094 ± 0.457
12.099GlyGly: 12.099 ± 3.325
1.108GlyHis: 1.108 ± 0.262
4.341GlyIle: 4.341 ± 0.501
3.833GlyLys: 3.833 ± 0.441
6.604GlyLeu: 6.604 ± 0.388
1.478GlyMet: 1.478 ± 0.239
3.787GlyAsn: 3.787 ± 0.435
3.048GlyPro: 3.048 ± 0.458
3.048GlyGln: 3.048 ± 0.394
4.479GlyArg: 4.479 ± 0.533
7.897GlySer: 7.897 ± 1.104
7.481GlyThr: 7.481 ± 0.849
5.865GlyVal: 5.865 ± 0.59
1.108GlyTrp: 1.108 ± 0.237
3.787GlyTyr: 3.787 ± 0.436
0.0GlyXaa: 0.0 ± 0.0
His
1.016HisAla: 1.016 ± 0.247
0.369HisCys: 0.369 ± 0.124
1.201HisAsp: 1.201 ± 0.238
0.877HisGlu: 0.877 ± 0.261
0.508HisPhe: 0.508 ± 0.165
1.062HisGly: 1.062 ± 0.216
0.369HisHis: 0.369 ± 0.221
0.739HisIle: 0.739 ± 0.226
0.97HisLys: 0.97 ± 0.206
1.478HisLeu: 1.478 ± 0.265
0.231HisMet: 0.231 ± 0.097
0.6HisAsn: 0.6 ± 0.155
1.108HisPro: 1.108 ± 0.212
0.785HisGln: 0.785 ± 0.189
0.739HisArg: 0.739 ± 0.241
0.785HisSer: 0.785 ± 0.199
0.831HisThr: 0.831 ± 0.174
1.247HisVal: 1.247 ± 0.252
0.369HisTrp: 0.369 ± 0.127
0.693HisTyr: 0.693 ± 0.181
0.0HisXaa: 0.0 ± 0.0
Ile
3.925IleAla: 3.925 ± 0.462
0.554IleCys: 0.554 ± 0.168
3.833IleAsp: 3.833 ± 0.355
3.925IleGlu: 3.925 ± 0.44
1.709IlePhe: 1.709 ± 0.261
3.879IleGly: 3.879 ± 0.38
0.739IleHis: 0.739 ± 0.162
2.124IleIle: 2.124 ± 0.379
2.586IleLys: 2.586 ± 0.342
4.11IleLeu: 4.11 ± 0.342
0.647IleMet: 0.647 ± 0.166
2.447IleAsn: 2.447 ± 0.374
2.771IlePro: 2.771 ± 0.357
2.632IleGln: 2.632 ± 0.517
2.771IleArg: 2.771 ± 0.326
4.064IleSer: 4.064 ± 0.483
3.186IleThr: 3.186 ± 0.402
2.678IleVal: 2.678 ± 0.366
0.6IleTrp: 0.6 ± 0.155
1.247IleTyr: 1.247 ± 0.194
0.0IleXaa: 0.0 ± 0.0
Lys
5.495LysAla: 5.495 ± 0.627
0.647LysCys: 0.647 ± 0.211
2.494LysAsp: 2.494 ± 0.383
3.463LysGlu: 3.463 ± 0.466
1.57LysPhe: 1.57 ± 0.276
3.186LysGly: 3.186 ± 0.435
0.831LysHis: 0.831 ± 0.197
2.217LysIle: 2.217 ± 0.394
3.417LysLys: 3.417 ± 0.53
4.341LysLeu: 4.341 ± 0.549
1.016LysMet: 1.016 ± 0.221
1.57LysAsn: 1.57 ± 0.315
2.632LysPro: 2.632 ± 0.355
1.94LysGln: 1.94 ± 0.402
1.94LysArg: 1.94 ± 0.349
2.678LysSer: 2.678 ± 0.345
3.186LysThr: 3.186 ± 0.361
2.678LysVal: 2.678 ± 0.29
0.508LysTrp: 0.508 ± 0.174
1.57LysTyr: 1.57 ± 0.262
0.0LysXaa: 0.0 ± 0.0
Leu
7.666LeuAla: 7.666 ± 0.545
1.247LeuCys: 1.247 ± 0.295
5.033LeuAsp: 5.033 ± 0.507
5.634LeuGlu: 5.634 ± 0.626
2.401LeuPhe: 2.401 ± 0.312
6.65LeuGly: 6.65 ± 0.501
1.016LeuHis: 1.016 ± 0.189
3.371LeuIle: 3.371 ± 0.4
3.463LeuLys: 3.463 ± 0.477
5.865LeuLeu: 5.865 ± 0.688
1.478LeuMet: 1.478 ± 0.291
3.602LeuAsn: 3.602 ± 0.577
3.371LeuPro: 3.371 ± 0.412
3.094LeuGln: 3.094 ± 0.356
4.572LeuArg: 4.572 ± 0.607
5.634LeuSer: 5.634 ± 0.537
6.142LeuThr: 6.142 ± 0.64
5.449LeuVal: 5.449 ± 0.578
1.293LeuTrp: 1.293 ± 0.247
3.14LeuTyr: 3.14 ± 0.355
0.0LeuXaa: 0.0 ± 0.0
Met
3.371MetAla: 3.371 ± 0.359
0.046MetCys: 0.046 ± 0.044
1.108MetAsp: 1.108 ± 0.228
1.108MetGlu: 1.108 ± 0.203
0.462MetPhe: 0.462 ± 0.119
0.924MetGly: 0.924 ± 0.195
0.369MetHis: 0.369 ± 0.132
0.97MetIle: 0.97 ± 0.212
1.108MetLys: 1.108 ± 0.208
1.339MetLeu: 1.339 ± 0.264
0.416MetMet: 0.416 ± 0.164
1.201MetAsn: 1.201 ± 0.213
0.924MetPro: 0.924 ± 0.222
0.647MetGln: 0.647 ± 0.165
0.924MetArg: 0.924 ± 0.246
1.154MetSer: 1.154 ± 0.251
1.385MetThr: 1.385 ± 0.242
0.97MetVal: 0.97 ± 0.245
0.185MetTrp: 0.185 ± 0.119
0.508MetTyr: 0.508 ± 0.146
0.0MetXaa: 0.0 ± 0.0
Asn
4.433AsnAla: 4.433 ± 0.598
0.554AsnCys: 0.554 ± 0.221
2.309AsnAsp: 2.309 ± 0.283
2.447AsnGlu: 2.447 ± 0.365
1.432AsnPhe: 1.432 ± 0.253
5.634AsnGly: 5.634 ± 0.667
0.693AsnHis: 0.693 ± 0.179
2.355AsnIle: 2.355 ± 0.316
1.847AsnLys: 1.847 ± 0.299
3.14AsnLeu: 3.14 ± 0.295
0.6AsnMet: 0.6 ± 0.165
1.339AsnAsn: 1.339 ± 0.247
1.662AsnPro: 1.662 ± 0.25
1.801AsnGln: 1.801 ± 0.34
2.032AsnArg: 2.032 ± 0.261
2.447AsnSer: 2.447 ± 0.428
2.678AsnThr: 2.678 ± 0.388
2.678AsnVal: 2.678 ± 0.446
0.785AsnTrp: 0.785 ± 0.19
1.616AsnTyr: 1.616 ± 0.381
0.0AsnXaa: 0.0 ± 0.0
Pro
3.879ProAla: 3.879 ± 0.387
0.416ProCys: 0.416 ± 0.148
2.309ProAsp: 2.309 ± 0.323
3.14ProGlu: 3.14 ± 0.423
1.385ProPhe: 1.385 ± 0.259
4.018ProGly: 4.018 ± 0.43
0.693ProHis: 0.693 ± 0.185
2.124ProIle: 2.124 ± 0.351
2.263ProLys: 2.263 ± 0.396
3.048ProLeu: 3.048 ± 0.366
0.462ProMet: 0.462 ± 0.13
1.662ProAsn: 1.662 ± 0.264
2.078ProPro: 2.078 ± 0.385
2.078ProGln: 2.078 ± 0.33
1.94ProArg: 1.94 ± 0.332
3.602ProSer: 3.602 ± 0.461
3.233ProThr: 3.233 ± 0.463
2.678ProVal: 2.678 ± 0.312
0.693ProTrp: 0.693 ± 0.17
1.385ProTyr: 1.385 ± 0.301
0.0ProXaa: 0.0 ± 0.0
Gln
4.987GlnAla: 4.987 ± 0.577
0.185GlnCys: 0.185 ± 0.098
2.217GlnAsp: 2.217 ± 0.273
2.309GlnGlu: 2.309 ± 0.381
1.339GlnPhe: 1.339 ± 0.25
3.371GlnGly: 3.371 ± 0.811
0.554GlnHis: 0.554 ± 0.153
2.263GlnIle: 2.263 ± 0.291
2.078GlnLys: 2.078 ± 0.325
3.463GlnLeu: 3.463 ± 0.473
0.785GlnMet: 0.785 ± 0.233
1.57GlnAsn: 1.57 ± 0.281
1.432GlnPro: 1.432 ± 0.203
2.124GlnGln: 2.124 ± 0.454
2.401GlnArg: 2.401 ± 0.317
2.217GlnSer: 2.217 ± 0.398
2.078GlnThr: 2.078 ± 0.327
3.048GlnVal: 3.048 ± 0.387
0.831GlnTrp: 0.831 ± 0.193
1.339GlnTyr: 1.339 ± 0.223
0.0GlnXaa: 0.0 ± 0.0
Arg
4.248ArgAla: 4.248 ± 0.57
0.785ArgCys: 0.785 ± 0.227
2.817ArgAsp: 2.817 ± 0.347
3.233ArgGlu: 3.233 ± 0.452
1.801ArgPhe: 1.801 ± 0.254
3.694ArgGly: 3.694 ± 0.466
1.108ArgHis: 1.108 ± 0.22
2.771ArgIle: 2.771 ± 0.284
3.463ArgLys: 3.463 ± 0.51
3.971ArgLeu: 3.971 ± 0.419
1.154ArgMet: 1.154 ± 0.192
1.709ArgAsn: 1.709 ± 0.365
1.801ArgPro: 1.801 ± 0.292
2.124ArgGln: 2.124 ± 0.315
2.863ArgArg: 2.863 ± 0.497
3.694ArgSer: 3.694 ± 0.388
2.217ArgThr: 2.217 ± 0.286
3.787ArgVal: 3.787 ± 0.376
1.016ArgTrp: 1.016 ± 0.271
2.263ArgTyr: 2.263 ± 0.294
0.0ArgXaa: 0.0 ± 0.0
Ser
5.911SerAla: 5.911 ± 0.635
0.831SerCys: 0.831 ± 0.28
3.279SerAsp: 3.279 ± 0.423
3.417SerGlu: 3.417 ± 0.377
2.263SerPhe: 2.263 ± 0.317
8.358SerGly: 8.358 ± 0.897
1.062SerHis: 1.062 ± 0.236
4.387SerIle: 4.387 ± 0.587
2.817SerLys: 2.817 ± 0.411
5.08SerLeu: 5.08 ± 0.491
1.755SerMet: 1.755 ± 0.317
3.094SerAsn: 3.094 ± 0.46
2.586SerPro: 2.586 ± 0.373
2.447SerGln: 2.447 ± 0.358
2.817SerArg: 2.817 ± 0.39
6.142SerSer: 6.142 ± 0.766
4.433SerThr: 4.433 ± 0.613
5.08SerVal: 5.08 ± 0.519
1.154SerTrp: 1.154 ± 0.253
2.771SerTyr: 2.771 ± 0.448
0.0SerXaa: 0.0 ± 0.0
Thr
6.049ThrAla: 6.049 ± 0.705
0.924ThrCys: 0.924 ± 0.232
4.387ThrAsp: 4.387 ± 0.578
3.879ThrGlu: 3.879 ± 0.444
2.17ThrPhe: 2.17 ± 0.353
7.296ThrGly: 7.296 ± 0.859
0.785ThrHis: 0.785 ± 0.234
3.51ThrIle: 3.51 ± 0.397
2.678ThrLys: 2.678 ± 0.419
4.664ThrLeu: 4.664 ± 0.522
0.877ThrMet: 0.877 ± 0.19
2.909ThrAsn: 2.909 ± 0.544
3.602ThrPro: 3.602 ± 0.402
2.401ThrGln: 2.401 ± 0.352
2.263ThrArg: 2.263 ± 0.388
4.064ThrSer: 4.064 ± 0.626
5.634ThrThr: 5.634 ± 0.835
5.172ThrVal: 5.172 ± 0.576
1.432ThrTrp: 1.432 ± 0.322
2.17ThrTyr: 2.17 ± 0.311
0.0ThrXaa: 0.0 ± 0.0
Val
5.865ValAla: 5.865 ± 0.498
0.739ValCys: 0.739 ± 0.267
5.588ValAsp: 5.588 ± 0.537
4.756ValGlu: 4.756 ± 0.434
2.078ValPhe: 2.078 ± 0.285
4.156ValGly: 4.156 ± 0.577
1.293ValHis: 1.293 ± 0.297
3.694ValIle: 3.694 ± 0.459
2.725ValLys: 2.725 ± 0.336
4.433ValLeu: 4.433 ± 0.411
0.693ValMet: 0.693 ± 0.174
3.233ValAsn: 3.233 ± 0.42
3.417ValPro: 3.417 ± 0.487
2.032ValGln: 2.032 ± 0.326
3.463ValArg: 3.463 ± 0.395
5.357ValSer: 5.357 ± 0.475
4.618ValThr: 4.618 ± 0.508
4.202ValVal: 4.202 ± 0.428
0.739ValTrp: 0.739 ± 0.163
2.078ValTyr: 2.078 ± 0.28
0.0ValXaa: 0.0 ± 0.0
Trp
0.785TrpAla: 0.785 ± 0.187
0.277TrpCys: 0.277 ± 0.102
1.201TrpAsp: 1.201 ± 0.232
0.785TrpGlu: 0.785 ± 0.194
0.693TrpPhe: 0.693 ± 0.211
1.201TrpGly: 1.201 ± 0.206
0.369TrpHis: 0.369 ± 0.142
0.739TrpIle: 0.739 ± 0.18
0.462TrpLys: 0.462 ± 0.134
1.154TrpLeu: 1.154 ± 0.263
0.277TrpMet: 0.277 ± 0.128
1.154TrpAsn: 1.154 ± 0.291
0.416TrpPro: 0.416 ± 0.112
0.647TrpGln: 0.647 ± 0.152
1.247TrpArg: 1.247 ± 0.265
1.201TrpSer: 1.201 ± 0.267
1.293TrpThr: 1.293 ± 0.22
1.524TrpVal: 1.524 ± 0.301
0.231TrpTrp: 0.231 ± 0.095
0.693TrpTyr: 0.693 ± 0.165
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.463TyrAla: 3.463 ± 0.35
0.693TyrCys: 0.693 ± 0.212
3.002TyrAsp: 3.002 ± 0.322
2.032TyrGlu: 2.032 ± 0.267
1.016TyrPhe: 1.016 ± 0.213
2.447TyrGly: 2.447 ± 0.33
0.554TyrHis: 0.554 ± 0.142
1.524TyrIle: 1.524 ± 0.22
1.478TyrLys: 1.478 ± 0.272
2.863TyrLeu: 2.863 ± 0.381
0.739TyrMet: 0.739 ± 0.157
1.94TyrAsn: 1.94 ± 0.284
0.97TyrPro: 0.97 ± 0.235
1.432TyrGln: 1.432 ± 0.274
1.847TyrArg: 1.847 ± 0.318
2.447TyrSer: 2.447 ± 0.271
2.494TyrThr: 2.494 ± 0.358
2.725TyrVal: 2.725 ± 0.325
0.6TyrTrp: 0.6 ± 0.14
1.616TyrTyr: 1.616 ± 0.28
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 102 proteins (21656 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski