Amino acid dipepetide frequency for Prochlorococcus phage P-SSP7

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.17AlaAla: 8.17 ± 0.993
0.563AlaCys: 0.563 ± 0.266
4.648AlaAsp: 4.648 ± 0.779
5.564AlaGlu: 5.564 ± 0.929
3.521AlaPhe: 3.521 ± 0.428
5.0AlaGly: 5.0 ± 0.634
1.127AlaHis: 1.127 ± 0.298
5.493AlaIle: 5.493 ± 0.503
5.352AlaLys: 5.352 ± 1.011
5.916AlaLeu: 5.916 ± 0.663
2.183AlaMet: 2.183 ± 0.364
4.296AlaAsn: 4.296 ± 0.673
3.31AlaPro: 3.31 ± 0.547
3.028AlaGln: 3.028 ± 0.581
3.662AlaArg: 3.662 ± 0.538
5.705AlaSer: 5.705 ± 0.743
6.198AlaThr: 6.198 ± 0.753
3.874AlaVal: 3.874 ± 0.574
1.056AlaTrp: 1.056 ± 0.231
2.465AlaTyr: 2.465 ± 0.376
0.0AlaXaa: 0.0 ± 0.0
Cys
1.127CysAla: 1.127 ± 0.295
0.141CysCys: 0.141 ± 0.098
0.563CysAsp: 0.563 ± 0.189
0.916CysGlu: 0.916 ± 0.257
0.352CysPhe: 0.352 ± 0.143
0.352CysGly: 0.352 ± 0.151
0.211CysHis: 0.211 ± 0.106
0.634CysIle: 0.634 ± 0.252
0.704CysLys: 0.704 ± 0.214
0.775CysLeu: 0.775 ± 0.248
0.282CysMet: 0.282 ± 0.121
0.352CysAsn: 0.352 ± 0.172
0.775CysPro: 0.775 ± 0.278
0.493CysGln: 0.493 ± 0.255
0.493CysArg: 0.493 ± 0.223
1.127CysSer: 1.127 ± 0.346
0.563CysThr: 0.563 ± 0.213
0.916CysVal: 0.916 ± 0.293
0.0CysTrp: 0.0 ± 0.0
0.352CysTyr: 0.352 ± 0.155
0.0CysXaa: 0.0 ± 0.0
Asp
4.789AspAla: 4.789 ± 0.623
0.845AspCys: 0.845 ± 0.278
3.944AspAsp: 3.944 ± 0.781
3.662AspGlu: 3.662 ± 0.398
2.395AspPhe: 2.395 ± 0.299
5.141AspGly: 5.141 ± 0.599
1.268AspHis: 1.268 ± 0.302
4.719AspIle: 4.719 ± 0.42
4.578AspLys: 4.578 ± 0.746
6.057AspLeu: 6.057 ± 0.496
0.845AspMet: 0.845 ± 0.298
1.902AspAsn: 1.902 ± 0.336
2.606AspPro: 2.606 ± 0.396
1.972AspGln: 1.972 ± 0.251
2.042AspArg: 2.042 ± 0.335
4.155AspSer: 4.155 ± 0.516
2.958AspThr: 2.958 ± 0.382
4.014AspVal: 4.014 ± 0.54
1.409AspTrp: 1.409 ± 0.38
2.183AspTyr: 2.183 ± 0.333
0.0AspXaa: 0.0 ± 0.0
Glu
5.141GluAla: 5.141 ± 0.842
0.423GluCys: 0.423 ± 0.154
4.155GluAsp: 4.155 ± 0.599
4.085GluGlu: 4.085 ± 1.008
2.042GluPhe: 2.042 ± 0.313
3.451GluGly: 3.451 ± 0.559
1.479GluHis: 1.479 ± 0.271
4.367GluIle: 4.367 ± 0.676
3.874GluLys: 3.874 ± 0.555
6.057GluLeu: 6.057 ± 0.762
1.127GluMet: 1.127 ± 0.276
2.888GluAsn: 2.888 ± 0.596
2.042GluPro: 2.042 ± 0.36
2.606GluGln: 2.606 ± 0.461
3.028GluArg: 3.028 ± 0.582
3.803GluSer: 3.803 ± 0.468
4.578GluThr: 4.578 ± 0.529
3.028GluVal: 3.028 ± 0.405
0.916GluTrp: 0.916 ± 0.259
2.042GluTyr: 2.042 ± 0.455
0.0GluXaa: 0.0 ± 0.0
Phe
1.62PheAla: 1.62 ± 0.393
0.563PheCys: 0.563 ± 0.2
2.113PheAsp: 2.113 ± 0.281
2.183PheGlu: 2.183 ± 0.435
1.409PhePhe: 1.409 ± 0.335
2.817PheGly: 2.817 ± 0.465
0.704PheHis: 0.704 ± 0.268
1.549PheIle: 1.549 ± 0.343
2.324PheLys: 2.324 ± 0.379
3.31PheLeu: 3.31 ± 0.405
1.127PheMet: 1.127 ± 0.254
3.381PheAsn: 3.381 ± 0.604
1.338PhePro: 1.338 ± 0.374
1.197PheGln: 1.197 ± 0.324
1.902PheArg: 1.902 ± 0.29
2.535PheSer: 2.535 ± 0.416
2.395PheThr: 2.395 ± 0.613
2.817PheVal: 2.817 ± 0.415
0.423PheTrp: 0.423 ± 0.125
1.409PheTyr: 1.409 ± 0.308
0.0PheXaa: 0.0 ± 0.0
Gly
4.789GlyAla: 4.789 ± 0.543
0.563GlyCys: 0.563 ± 0.176
4.155GlyAsp: 4.155 ± 0.524
3.451GlyGlu: 3.451 ± 0.436
2.183GlyPhe: 2.183 ± 0.425
4.014GlyGly: 4.014 ± 0.701
1.409GlyHis: 1.409 ± 0.338
4.719GlyIle: 4.719 ± 0.719
5.282GlyLys: 5.282 ± 0.522
6.057GlyLeu: 6.057 ± 0.731
1.761GlyMet: 1.761 ± 0.397
3.451GlyAsn: 3.451 ± 0.57
1.268GlyPro: 1.268 ± 0.241
2.958GlyGln: 2.958 ± 0.479
3.451GlyArg: 3.451 ± 0.542
4.859GlySer: 4.859 ± 1.085
4.648GlyThr: 4.648 ± 0.71
4.578GlyVal: 4.578 ± 0.556
0.916GlyTrp: 0.916 ± 0.264
3.521GlyTyr: 3.521 ± 0.552
0.0GlyXaa: 0.0 ± 0.0
His
0.916HisAla: 0.916 ± 0.315
0.352HisCys: 0.352 ± 0.148
1.056HisAsp: 1.056 ± 0.299
0.986HisGlu: 0.986 ± 0.262
1.409HisPhe: 1.409 ± 0.369
1.479HisGly: 1.479 ± 0.35
0.423HisHis: 0.423 ± 0.181
0.986HisIle: 0.986 ± 0.286
0.845HisLys: 0.845 ± 0.29
2.324HisLeu: 2.324 ± 0.503
0.352HisMet: 0.352 ± 0.149
0.916HisAsn: 0.916 ± 0.276
0.986HisPro: 0.986 ± 0.196
0.775HisGln: 0.775 ± 0.234
0.563HisArg: 0.563 ± 0.212
0.986HisSer: 0.986 ± 0.331
1.056HisThr: 1.056 ± 0.236
0.704HisVal: 0.704 ± 0.249
0.282HisTrp: 0.282 ± 0.151
0.352HisTyr: 0.352 ± 0.174
0.0HisXaa: 0.0 ± 0.0
Ile
5.071IleAla: 5.071 ± 0.775
0.423IleCys: 0.423 ± 0.203
4.367IleAsp: 4.367 ± 0.797
3.874IleGlu: 3.874 ± 0.634
1.831IlePhe: 1.831 ± 0.368
4.085IleGly: 4.085 ± 0.615
1.479IleHis: 1.479 ± 0.424
2.606IleIle: 2.606 ± 0.363
4.437IleLys: 4.437 ± 0.605
4.437IleLeu: 4.437 ± 0.694
1.268IleMet: 1.268 ± 0.322
3.874IleAsn: 3.874 ± 0.566
3.592IlePro: 3.592 ± 0.585
2.958IleGln: 2.958 ± 0.387
2.395IleArg: 2.395 ± 0.361
3.874IleSer: 3.874 ± 0.545
4.367IleThr: 4.367 ± 0.603
3.169IleVal: 3.169 ± 0.511
0.493IleTrp: 0.493 ± 0.198
1.831IleTyr: 1.831 ± 0.358
0.0IleXaa: 0.0 ± 0.0
Lys
5.634LysAla: 5.634 ± 0.802
0.775LysCys: 0.775 ± 0.238
4.226LysAsp: 4.226 ± 0.655
4.789LysGlu: 4.789 ± 0.694
1.69LysPhe: 1.69 ± 0.336
4.014LysGly: 4.014 ± 0.819
1.197LysHis: 1.197 ± 0.312
3.874LysIle: 3.874 ± 0.417
4.578LysLys: 4.578 ± 0.769
6.972LysLeu: 6.972 ± 0.671
0.986LysMet: 0.986 ± 0.236
4.085LysAsn: 4.085 ± 0.907
3.028LysPro: 3.028 ± 0.53
2.606LysGln: 2.606 ± 0.474
2.747LysArg: 2.747 ± 0.34
3.803LysSer: 3.803 ± 0.606
4.719LysThr: 4.719 ± 0.512
4.859LysVal: 4.859 ± 0.699
0.704LysTrp: 0.704 ± 0.219
3.31LysTyr: 3.31 ± 0.508
0.0LysXaa: 0.0 ± 0.0
Leu
6.198LeuAla: 6.198 ± 0.837
1.056LeuCys: 1.056 ± 0.316
4.93LeuAsp: 4.93 ± 0.63
4.648LeuGlu: 4.648 ± 0.771
3.099LeuPhe: 3.099 ± 0.464
5.564LeuGly: 5.564 ± 0.584
1.761LeuHis: 1.761 ± 0.434
4.437LeuIle: 4.437 ± 0.669
6.057LeuLys: 6.057 ± 0.724
5.986LeuLeu: 5.986 ± 0.568
2.395LeuMet: 2.395 ± 0.473
4.437LeuAsn: 4.437 ± 0.514
4.014LeuPro: 4.014 ± 0.504
4.226LeuGln: 4.226 ± 0.815
4.296LeuArg: 4.296 ± 0.673
5.212LeuSer: 5.212 ± 0.491
5.0LeuThr: 5.0 ± 0.661
4.93LeuVal: 4.93 ± 0.639
0.634LeuTrp: 0.634 ± 0.211
3.099LeuTyr: 3.099 ± 0.555
0.0LeuXaa: 0.0 ± 0.0
Met
1.972MetAla: 1.972 ± 0.392
0.282MetCys: 0.282 ± 0.136
0.986MetAsp: 0.986 ± 0.303
1.902MetGlu: 1.902 ± 0.4
0.845MetPhe: 0.845 ± 0.278
1.127MetGly: 1.127 ± 0.228
0.493MetHis: 0.493 ± 0.282
1.127MetIle: 1.127 ± 0.221
1.761MetLys: 1.761 ± 0.342
1.268MetLeu: 1.268 ± 0.243
0.211MetMet: 0.211 ± 0.125
1.197MetAsn: 1.197 ± 0.255
0.986MetPro: 0.986 ± 0.239
0.986MetGln: 0.986 ± 0.199
1.409MetArg: 1.409 ± 0.318
2.183MetSer: 2.183 ± 0.442
1.69MetThr: 1.69 ± 0.405
1.268MetVal: 1.268 ± 0.258
0.07MetTrp: 0.07 ± 0.068
0.634MetTyr: 0.634 ± 0.196
0.0MetXaa: 0.0 ± 0.0
Asn
4.719AsnAla: 4.719 ± 0.689
0.634AsnCys: 0.634 ± 0.251
2.958AsnAsp: 2.958 ± 0.423
2.254AsnGlu: 2.254 ± 0.416
1.761AsnPhe: 1.761 ± 0.358
4.789AsnGly: 4.789 ± 0.884
0.563AsnHis: 0.563 ± 0.184
3.028AsnIle: 3.028 ± 0.484
3.521AsnLys: 3.521 ± 0.678
3.381AsnLeu: 3.381 ± 0.422
0.916AsnMet: 0.916 ± 0.251
3.028AsnAsn: 3.028 ± 0.707
2.465AsnPro: 2.465 ± 0.349
2.113AsnGln: 2.113 ± 0.371
3.31AsnArg: 3.31 ± 0.503
3.662AsnSer: 3.662 ± 0.689
4.155AsnThr: 4.155 ± 0.84
3.803AsnVal: 3.803 ± 0.698
0.916AsnTrp: 0.916 ± 0.279
1.831AsnTyr: 1.831 ± 0.349
0.0AsnXaa: 0.0 ± 0.0
Pro
2.747ProAla: 2.747 ± 0.538
0.563ProCys: 0.563 ± 0.209
2.676ProAsp: 2.676 ± 0.41
3.169ProGlu: 3.169 ± 0.44
2.042ProPhe: 2.042 ± 0.351
1.69ProGly: 1.69 ± 0.349
0.704ProHis: 0.704 ± 0.196
2.888ProIle: 2.888 ± 0.428
2.324ProLys: 2.324 ± 0.45
2.888ProLeu: 2.888 ± 0.515
0.704ProMet: 0.704 ± 0.267
2.465ProAsn: 2.465 ± 0.419
2.324ProPro: 2.324 ± 0.57
1.409ProGln: 1.409 ± 0.375
0.916ProArg: 0.916 ± 0.294
3.733ProSer: 3.733 ± 0.444
3.099ProThr: 3.099 ± 0.432
2.676ProVal: 2.676 ± 0.52
0.916ProTrp: 0.916 ± 0.215
1.549ProTyr: 1.549 ± 0.344
0.0ProXaa: 0.0 ± 0.0
Gln
4.719GlnAla: 4.719 ± 0.739
0.352GlnCys: 0.352 ± 0.167
1.972GlnAsp: 1.972 ± 0.402
2.606GlnGlu: 2.606 ± 0.365
1.338GlnPhe: 1.338 ± 0.244
2.958GlnGly: 2.958 ± 0.334
0.563GlnHis: 0.563 ± 0.214
3.24GlnIle: 3.24 ± 0.318
2.747GlnLys: 2.747 ± 0.406
3.733GlnLeu: 3.733 ± 0.563
0.775GlnMet: 0.775 ± 0.258
1.197GlnAsn: 1.197 ± 0.322
0.986GlnPro: 0.986 ± 0.26
2.535GlnGln: 2.535 ± 0.533
1.831GlnArg: 1.831 ± 0.315
2.747GlnSer: 2.747 ± 0.357
2.535GlnThr: 2.535 ± 0.4
3.381GlnVal: 3.381 ± 0.426
0.704GlnTrp: 0.704 ± 0.19
1.127GlnTyr: 1.127 ± 0.236
0.0GlnXaa: 0.0 ± 0.0
Arg
3.874ArgAla: 3.874 ± 0.675
0.704ArgCys: 0.704 ± 0.227
2.606ArgAsp: 2.606 ± 0.352
3.099ArgGlu: 3.099 ± 0.557
1.902ArgPhe: 1.902 ± 0.263
2.606ArgGly: 2.606 ± 0.375
0.563ArgHis: 0.563 ± 0.233
2.465ArgIle: 2.465 ± 0.369
3.169ArgLys: 3.169 ± 0.576
4.085ArgLeu: 4.085 ± 0.476
1.62ArgMet: 1.62 ± 0.317
2.606ArgAsn: 2.606 ± 0.451
1.62ArgPro: 1.62 ± 0.42
2.042ArgGln: 2.042 ± 0.359
2.254ArgArg: 2.254 ± 0.629
2.324ArgSer: 2.324 ± 0.315
2.535ArgThr: 2.535 ± 0.465
1.761ArgVal: 1.761 ± 0.389
0.634ArgTrp: 0.634 ± 0.223
2.042ArgTyr: 2.042 ± 0.317
0.0ArgXaa: 0.0 ± 0.0
Ser
5.352SerAla: 5.352 ± 0.863
1.056SerCys: 1.056 ± 0.271
4.578SerAsp: 4.578 ± 0.561
2.817SerGlu: 2.817 ± 0.413
2.676SerPhe: 2.676 ± 0.357
6.55SerGly: 6.55 ± 0.916
0.845SerHis: 0.845 ± 0.231
3.381SerIle: 3.381 ± 0.597
4.014SerLys: 4.014 ± 0.555
4.789SerLeu: 4.789 ± 0.541
1.62SerMet: 1.62 ± 0.326
4.226SerAsn: 4.226 ± 0.684
3.099SerPro: 3.099 ± 0.393
2.817SerGln: 2.817 ± 0.448
3.028SerArg: 3.028 ± 0.571
5.916SerSer: 5.916 ± 1.099
5.634SerThr: 5.634 ± 0.936
3.028SerVal: 3.028 ± 0.468
1.056SerTrp: 1.056 ± 0.3
1.549SerTyr: 1.549 ± 0.368
0.0SerXaa: 0.0 ± 0.0
Thr
5.845ThrAla: 5.845 ± 0.945
0.634ThrCys: 0.634 ± 0.271
4.014ThrAsp: 4.014 ± 0.639
4.155ThrGlu: 4.155 ± 0.501
2.395ThrPhe: 2.395 ± 0.424
5.916ThrGly: 5.916 ± 0.621
0.986ThrHis: 0.986 ± 0.323
4.789ThrIle: 4.789 ± 0.743
4.578ThrLys: 4.578 ± 0.409
6.057ThrLeu: 6.057 ± 0.544
1.479ThrMet: 1.479 ± 0.31
4.367ThrAsn: 4.367 ± 0.637
3.31ThrPro: 3.31 ± 0.475
2.676ThrGln: 2.676 ± 0.448
2.465ThrArg: 2.465 ± 0.4
3.592ThrSer: 3.592 ± 0.59
7.677ThrThr: 7.677 ± 1.523
3.592ThrVal: 3.592 ± 0.755
0.916ThrTrp: 0.916 ± 0.241
2.676ThrTyr: 2.676 ± 0.387
0.0ThrXaa: 0.0 ± 0.0
Val
4.719ValAla: 4.719 ± 0.582
0.493ValCys: 0.493 ± 0.217
3.874ValAsp: 3.874 ± 0.527
3.381ValGlu: 3.381 ± 0.57
2.395ValPhe: 2.395 ± 0.511
3.521ValGly: 3.521 ± 0.48
0.986ValHis: 0.986 ± 0.228
3.944ValIle: 3.944 ± 0.503
4.296ValLys: 4.296 ± 0.477
3.803ValLeu: 3.803 ± 0.551
1.409ValMet: 1.409 ± 0.261
3.24ValAsn: 3.24 ± 0.402
2.183ValPro: 2.183 ± 0.463
2.535ValGln: 2.535 ± 0.392
2.254ValArg: 2.254 ± 0.414
4.648ValSer: 4.648 ± 0.801
5.141ValThr: 5.141 ± 0.765
4.085ValVal: 4.085 ± 0.605
0.563ValTrp: 0.563 ± 0.172
1.549ValTyr: 1.549 ± 0.27
0.0ValXaa: 0.0 ± 0.0
Trp
0.916TrpAla: 0.916 ± 0.254
0.352TrpCys: 0.352 ± 0.197
1.056TrpAsp: 1.056 ± 0.234
0.775TrpGlu: 0.775 ± 0.23
0.563TrpPhe: 0.563 ± 0.17
0.493TrpGly: 0.493 ± 0.196
0.423TrpHis: 0.423 ± 0.151
0.775TrpIle: 0.775 ± 0.208
1.197TrpLys: 1.197 ± 0.331
1.127TrpLeu: 1.127 ± 0.284
0.423TrpMet: 0.423 ± 0.17
0.563TrpAsn: 0.563 ± 0.191
0.423TrpPro: 0.423 ± 0.153
0.634TrpGln: 0.634 ± 0.222
0.493TrpArg: 0.493 ± 0.199
0.916TrpSer: 0.916 ± 0.255
1.127TrpThr: 1.127 ± 0.224
0.634TrpVal: 0.634 ± 0.23
0.141TrpTrp: 0.141 ± 0.084
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.817TyrAla: 2.817 ± 0.432
0.352TyrCys: 0.352 ± 0.192
2.606TyrAsp: 2.606 ± 0.384
2.817TyrGlu: 2.817 ± 0.42
1.268TyrPhe: 1.268 ± 0.283
2.465TyrGly: 2.465 ± 0.447
0.563TyrHis: 0.563 ± 0.191
1.409TyrIle: 1.409 ± 0.337
2.958TyrLys: 2.958 ± 0.599
2.958TyrLeu: 2.958 ± 0.491
0.845TyrMet: 0.845 ± 0.208
1.409TyrAsn: 1.409 ± 0.317
1.127TyrPro: 1.127 ± 0.348
1.479TyrGln: 1.479 ± 0.321
1.972TyrArg: 1.972 ± 0.279
2.254TyrSer: 2.254 ± 0.396
2.042TyrThr: 2.042 ± 0.385
1.902TyrVal: 1.902 ± 0.424
0.282TyrTrp: 0.282 ± 0.141
1.268TyrTyr: 1.268 ± 0.313
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 58 proteins (14200 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski