Amino acid dipepetide frequency for Prochlorococcus phage P-SSP10

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.506AlaAla: 9.506 ± 1.375
0.804AlaCys: 0.804 ± 0.295
5.704AlaAsp: 5.704 ± 0.879
5.631AlaGlu: 5.631 ± 1.012
2.633AlaPhe: 2.633 ± 0.368
6.289AlaGly: 6.289 ± 0.846
1.097AlaHis: 1.097 ± 0.347
5.119AlaIle: 5.119 ± 0.489
4.68AlaLys: 4.68 ± 0.731
6.216AlaLeu: 6.216 ± 0.578
2.048AlaMet: 2.048 ± 0.381
5.265AlaAsn: 5.265 ± 0.853
2.998AlaPro: 2.998 ± 0.634
3.364AlaGln: 3.364 ± 0.627
4.022AlaArg: 4.022 ± 0.612
5.85AlaSer: 5.85 ± 1.109
6.143AlaThr: 6.143 ± 0.593
6.069AlaVal: 6.069 ± 0.8
1.024AlaTrp: 1.024 ± 0.177
2.706AlaTyr: 2.706 ± 0.481
0.0AlaXaa: 0.0 ± 0.0
Cys
0.512CysAla: 0.512 ± 0.209
0.073CysCys: 0.073 ± 0.076
0.731CysAsp: 0.731 ± 0.305
0.512CysGlu: 0.512 ± 0.203
0.146CysPhe: 0.146 ± 0.105
0.293CysGly: 0.293 ± 0.163
0.146CysHis: 0.146 ± 0.111
0.658CysIle: 0.658 ± 0.284
0.804CysLys: 0.804 ± 0.264
0.658CysLeu: 0.658 ± 0.235
0.219CysMet: 0.219 ± 0.138
0.366CysAsn: 0.366 ± 0.162
0.439CysPro: 0.439 ± 0.195
0.512CysGln: 0.512 ± 0.234
0.439CysArg: 0.439 ± 0.182
0.658CysSer: 0.658 ± 0.272
0.512CysThr: 0.512 ± 0.217
0.658CysVal: 0.658 ± 0.248
0.146CysTrp: 0.146 ± 0.145
0.512CysTyr: 0.512 ± 0.229
0.0CysXaa: 0.0 ± 0.0
Asp
5.923AspAla: 5.923 ± 0.566
0.439AspCys: 0.439 ± 0.183
5.119AspAsp: 5.119 ± 0.744
4.095AspGlu: 4.095 ± 0.703
2.633AspPhe: 2.633 ± 0.429
6.143AspGly: 6.143 ± 1.014
1.243AspHis: 1.243 ± 0.332
3.583AspIle: 3.583 ± 0.462
4.241AspLys: 4.241 ± 0.533
5.777AspLeu: 5.777 ± 0.823
1.097AspMet: 1.097 ± 0.311
4.095AspAsn: 4.095 ± 0.717
2.267AspPro: 2.267 ± 0.703
2.121AspGln: 2.121 ± 0.488
2.34AspArg: 2.34 ± 0.586
4.241AspSer: 4.241 ± 0.532
3.071AspThr: 3.071 ± 0.593
3.876AspVal: 3.876 ± 0.435
1.17AspTrp: 1.17 ± 0.292
2.706AspTyr: 2.706 ± 0.492
0.0AspXaa: 0.0 ± 0.0
Glu
5.85GluAla: 5.85 ± 0.852
0.804GluCys: 0.804 ± 0.233
4.461GluAsp: 4.461 ± 0.668
3.071GluGlu: 3.071 ± 0.593
2.267GluPhe: 2.267 ± 0.489
3.51GluGly: 3.51 ± 0.561
1.097GluHis: 1.097 ± 0.277
3.51GluIle: 3.51 ± 0.466
3.218GluLys: 3.218 ± 0.516
7.751GluLeu: 7.751 ± 0.691
0.878GluMet: 0.878 ± 0.229
2.779GluAsn: 2.779 ± 0.516
2.413GluPro: 2.413 ± 0.564
1.828GluGln: 1.828 ± 0.31
2.267GluArg: 2.267 ± 0.449
3.803GluSer: 3.803 ± 0.619
4.388GluThr: 4.388 ± 0.675
4.168GluVal: 4.168 ± 0.572
1.536GluTrp: 1.536 ± 0.372
2.048GluTyr: 2.048 ± 0.302
0.0GluXaa: 0.0 ± 0.0
Phe
1.974PheAla: 1.974 ± 0.386
0.219PheCys: 0.219 ± 0.134
2.121PheAsp: 2.121 ± 0.379
2.34PheGlu: 2.34 ± 0.416
1.609PhePhe: 1.609 ± 0.277
2.413PheGly: 2.413 ± 0.467
1.243PheHis: 1.243 ± 0.389
2.121PheIle: 2.121 ± 0.329
2.486PheLys: 2.486 ± 0.496
2.633PheLeu: 2.633 ± 0.409
1.17PheMet: 1.17 ± 0.298
3.291PheAsn: 3.291 ± 0.504
1.024PhePro: 1.024 ± 0.309
0.951PheGln: 0.951 ± 0.339
1.755PheArg: 1.755 ± 0.443
2.633PheSer: 2.633 ± 0.492
2.194PheThr: 2.194 ± 0.333
2.267PheVal: 2.267 ± 0.325
0.512PheTrp: 0.512 ± 0.221
1.316PheTyr: 1.316 ± 0.324
0.0PheXaa: 0.0 ± 0.0
Gly
5.484GlyAla: 5.484 ± 1.119
0.439GlyCys: 0.439 ± 0.157
5.265GlyAsp: 5.265 ± 0.867
2.998GlyGlu: 2.998 ± 0.473
2.121GlyPhe: 2.121 ± 0.437
5.631GlyGly: 5.631 ± 1.129
0.951GlyHis: 0.951 ± 0.292
4.168GlyIle: 4.168 ± 0.544
5.631GlyLys: 5.631 ± 0.704
6.143GlyLeu: 6.143 ± 0.654
1.755GlyMet: 1.755 ± 0.309
5.192GlyAsn: 5.192 ± 1.462
1.389GlyPro: 1.389 ± 0.279
3.071GlyGln: 3.071 ± 0.45
3.071GlyArg: 3.071 ± 0.562
5.85GlySer: 5.85 ± 1.088
5.923GlyThr: 5.923 ± 0.79
5.192GlyVal: 5.192 ± 0.578
0.804GlyTrp: 0.804 ± 0.212
2.413GlyTyr: 2.413 ± 0.504
0.0GlyXaa: 0.0 ± 0.0
His
1.243HisAla: 1.243 ± 0.274
0.512HisCys: 0.512 ± 0.274
1.463HisAsp: 1.463 ± 0.361
1.316HisGlu: 1.316 ± 0.251
0.512HisPhe: 0.512 ± 0.248
0.951HisGly: 0.951 ± 0.254
0.219HisHis: 0.219 ± 0.155
0.804HisIle: 0.804 ± 0.202
0.658HisLys: 0.658 ± 0.233
1.389HisLeu: 1.389 ± 0.346
0.439HisMet: 0.439 ± 0.171
1.097HisAsn: 1.097 ± 0.297
0.731HisPro: 0.731 ± 0.236
0.731HisGln: 0.731 ± 0.198
1.024HisArg: 1.024 ± 0.248
1.17HisSer: 1.17 ± 0.27
1.316HisThr: 1.316 ± 0.307
1.243HisVal: 1.243 ± 0.339
0.146HisTrp: 0.146 ± 0.12
0.878HisTyr: 0.878 ± 0.262
0.0HisXaa: 0.0 ± 0.0
Ile
5.046IleAla: 5.046 ± 0.902
0.439IleCys: 0.439 ± 0.163
2.559IleAsp: 2.559 ± 0.307
3.51IleGlu: 3.51 ± 0.416
2.048IlePhe: 2.048 ± 0.372
3.144IleGly: 3.144 ± 0.667
1.097IleHis: 1.097 ± 0.28
2.633IleIle: 2.633 ± 0.436
5.192IleLys: 5.192 ± 0.685
3.729IleLeu: 3.729 ± 0.758
0.512IleMet: 0.512 ± 0.258
3.437IleAsn: 3.437 ± 0.747
3.071IlePro: 3.071 ± 0.541
3.071IleGln: 3.071 ± 0.43
2.194IleArg: 2.194 ± 0.363
4.022IleSer: 4.022 ± 0.606
3.583IleThr: 3.583 ± 0.588
2.559IleVal: 2.559 ± 0.581
0.951IleTrp: 0.951 ± 0.255
1.828IleTyr: 1.828 ± 0.424
0.0IleXaa: 0.0 ± 0.0
Lys
5.265LysAla: 5.265 ± 0.68
0.585LysCys: 0.585 ± 0.227
4.461LysAsp: 4.461 ± 0.781
4.753LysGlu: 4.753 ± 0.907
2.34LysPhe: 2.34 ± 0.398
4.022LysGly: 4.022 ± 0.764
1.17LysHis: 1.17 ± 0.225
4.168LysIle: 4.168 ± 0.713
5.265LysLys: 5.265 ± 0.699
5.923LysLeu: 5.923 ± 0.628
1.463LysMet: 1.463 ± 0.349
4.095LysAsn: 4.095 ± 0.731
2.779LysPro: 2.779 ± 0.56
2.559LysGln: 2.559 ± 0.496
3.291LysArg: 3.291 ± 0.53
2.852LysSer: 2.852 ± 0.487
3.291LysThr: 3.291 ± 0.521
3.656LysVal: 3.656 ± 0.668
0.951LysTrp: 0.951 ± 0.239
2.413LysTyr: 2.413 ± 0.321
0.0LysXaa: 0.0 ± 0.0
Leu
6.801LeuAla: 6.801 ± 0.678
0.731LeuCys: 0.731 ± 0.236
5.484LeuAsp: 5.484 ± 0.504
5.119LeuGlu: 5.119 ± 0.625
2.267LeuPhe: 2.267 ± 0.381
5.923LeuGly: 5.923 ± 0.851
1.755LeuHis: 1.755 ± 0.328
3.729LeuIle: 3.729 ± 0.545
4.973LeuLys: 4.973 ± 0.598
5.558LeuLeu: 5.558 ± 0.808
2.779LeuMet: 2.779 ± 0.478
5.119LeuAsn: 5.119 ± 0.668
3.803LeuPro: 3.803 ± 0.62
3.729LeuGln: 3.729 ± 0.478
3.51LeuArg: 3.51 ± 0.686
5.119LeuSer: 5.119 ± 0.657
4.388LeuThr: 4.388 ± 0.743
3.949LeuVal: 3.949 ± 0.485
0.731LeuTrp: 0.731 ± 0.218
2.706LeuTyr: 2.706 ± 0.765
0.0LeuXaa: 0.0 ± 0.0
Met
2.121MetAla: 2.121 ± 0.38
0.439MetCys: 0.439 ± 0.174
1.463MetAsp: 1.463 ± 0.388
1.536MetGlu: 1.536 ± 0.291
1.024MetPhe: 1.024 ± 0.192
2.048MetGly: 2.048 ± 0.515
0.658MetHis: 0.658 ± 0.289
1.024MetIle: 1.024 ± 0.237
1.609MetLys: 1.609 ± 0.33
1.463MetLeu: 1.463 ± 0.372
0.439MetMet: 0.439 ± 0.143
1.097MetAsn: 1.097 ± 0.255
0.439MetPro: 0.439 ± 0.137
1.17MetGln: 1.17 ± 0.248
0.878MetArg: 0.878 ± 0.231
1.316MetSer: 1.316 ± 0.257
2.194MetThr: 2.194 ± 0.303
1.389MetVal: 1.389 ± 0.357
0.293MetTrp: 0.293 ± 0.176
0.658MetTyr: 0.658 ± 0.262
0.0MetXaa: 0.0 ± 0.0
Asn
5.704AsnAla: 5.704 ± 0.996
0.293AsnCys: 0.293 ± 0.143
3.144AsnAsp: 3.144 ± 0.605
3.218AsnGlu: 3.218 ± 0.605
2.852AsnPhe: 2.852 ± 0.496
4.973AsnGly: 4.973 ± 1.166
1.17AsnHis: 1.17 ± 0.291
3.364AsnIle: 3.364 ± 0.497
3.51AsnLys: 3.51 ± 0.679
4.314AsnLeu: 4.314 ± 0.731
1.097AsnMet: 1.097 ± 0.426
3.144AsnAsn: 3.144 ± 0.467
3.144AsnPro: 3.144 ± 0.422
1.974AsnGln: 1.974 ± 0.42
2.779AsnArg: 2.779 ± 0.385
4.899AsnSer: 4.899 ± 0.918
3.729AsnThr: 3.729 ± 0.812
3.803AsnVal: 3.803 ± 0.579
0.878AsnTrp: 0.878 ± 0.254
2.048AsnTyr: 2.048 ± 0.328
0.0AsnXaa: 0.0 ± 0.0
Pro
2.267ProAla: 2.267 ± 0.567
0.366ProCys: 0.366 ± 0.151
2.413ProAsp: 2.413 ± 0.386
2.413ProGlu: 2.413 ± 0.445
1.536ProPhe: 1.536 ± 0.301
1.974ProGly: 1.974 ± 0.489
0.366ProHis: 0.366 ± 0.196
2.121ProIle: 2.121 ± 0.462
2.559ProLys: 2.559 ± 0.514
2.048ProLeu: 2.048 ± 0.402
1.17ProMet: 1.17 ± 0.332
2.34ProAsn: 2.34 ± 0.397
2.267ProPro: 2.267 ± 0.726
1.974ProGln: 1.974 ± 0.336
0.658ProArg: 0.658 ± 0.172
2.779ProSer: 2.779 ± 0.484
3.583ProThr: 3.583 ± 0.666
2.706ProVal: 2.706 ± 0.473
0.951ProTrp: 0.951 ± 0.255
1.463ProTyr: 1.463 ± 0.268
0.0ProXaa: 0.0 ± 0.0
Gln
4.095GlnAla: 4.095 ± 0.714
0.219GlnCys: 0.219 ± 0.102
1.682GlnAsp: 1.682 ± 0.304
2.34GlnGlu: 2.34 ± 0.557
1.609GlnPhe: 1.609 ± 0.342
3.51GlnGly: 3.51 ± 0.583
0.878GlnHis: 0.878 ± 0.217
2.998GlnIle: 2.998 ± 0.47
2.048GlnLys: 2.048 ± 0.433
2.925GlnLeu: 2.925 ± 0.501
0.804GlnMet: 0.804 ± 0.251
2.413GlnAsn: 2.413 ± 0.44
0.658GlnPro: 0.658 ± 0.222
2.559GlnGln: 2.559 ± 0.62
1.609GlnArg: 1.609 ± 0.402
2.779GlnSer: 2.779 ± 0.425
2.048GlnThr: 2.048 ± 0.407
2.34GlnVal: 2.34 ± 0.447
0.512GlnTrp: 0.512 ± 0.218
1.828GlnTyr: 1.828 ± 0.356
0.0GlnXaa: 0.0 ± 0.0
Arg
4.168ArgAla: 4.168 ± 0.584
0.366ArgCys: 0.366 ± 0.193
2.925ArgAsp: 2.925 ± 0.466
2.852ArgGlu: 2.852 ± 0.501
1.974ArgPhe: 1.974 ± 0.325
2.34ArgGly: 2.34 ± 0.844
0.804ArgHis: 0.804 ± 0.203
2.413ArgIle: 2.413 ± 0.383
3.437ArgLys: 3.437 ± 0.703
3.291ArgLeu: 3.291 ± 0.396
0.951ArgMet: 0.951 ± 0.258
2.34ArgAsn: 2.34 ± 0.408
0.951ArgPro: 0.951 ± 0.255
1.682ArgGln: 1.682 ± 0.366
1.755ArgArg: 1.755 ± 0.365
2.706ArgSer: 2.706 ± 0.534
2.706ArgThr: 2.706 ± 0.529
2.194ArgVal: 2.194 ± 0.424
1.097ArgTrp: 1.097 ± 0.314
1.755ArgTyr: 1.755 ± 0.327
0.0ArgXaa: 0.0 ± 0.0
Ser
4.973SerAla: 4.973 ± 1.154
0.512SerCys: 0.512 ± 0.249
4.826SerAsp: 4.826 ± 0.527
4.388SerGlu: 4.388 ± 0.617
2.779SerPhe: 2.779 ± 0.359
6.289SerGly: 6.289 ± 0.886
1.17SerHis: 1.17 ± 0.281
3.949SerIle: 3.949 ± 0.692
3.51SerLys: 3.51 ± 0.653
4.973SerLeu: 4.973 ± 0.573
1.609SerMet: 1.609 ± 0.31
4.607SerAsn: 4.607 ± 1.423
2.413SerPro: 2.413 ± 0.43
2.194SerGln: 2.194 ± 0.363
2.852SerArg: 2.852 ± 0.517
3.51SerSer: 3.51 ± 0.522
4.534SerThr: 4.534 ± 0.519
3.437SerVal: 3.437 ± 0.408
1.097SerTrp: 1.097 ± 0.249
2.194SerTyr: 2.194 ± 0.402
0.0SerXaa: 0.0 ± 0.0
Thr
8.044ThrAla: 8.044 ± 1.037
0.439ThrCys: 0.439 ± 0.207
4.461ThrAsp: 4.461 ± 0.503
4.241ThrGlu: 4.241 ± 0.784
2.486ThrPhe: 2.486 ± 0.53
5.923ThrGly: 5.923 ± 1.033
0.804ThrHis: 0.804 ± 0.239
3.656ThrIle: 3.656 ± 0.679
3.364ThrLys: 3.364 ± 0.608
5.996ThrLeu: 5.996 ± 0.651
1.463ThrMet: 1.463 ± 0.315
2.633ThrAsn: 2.633 ± 0.485
3.218ThrPro: 3.218 ± 0.694
1.682ThrGln: 1.682 ± 0.541
2.925ThrArg: 2.925 ± 0.654
4.168ThrSer: 4.168 ± 0.694
5.265ThrThr: 5.265 ± 0.947
3.876ThrVal: 3.876 ± 0.61
0.658ThrTrp: 0.658 ± 0.222
2.486ThrTyr: 2.486 ± 0.485
0.0ThrXaa: 0.0 ± 0.0
Val
4.899ValAla: 4.899 ± 0.799
0.585ValCys: 0.585 ± 0.271
4.095ValAsp: 4.095 ± 0.485
3.949ValGlu: 3.949 ± 0.463
1.828ValPhe: 1.828 ± 0.422
4.168ValGly: 4.168 ± 0.463
0.585ValHis: 0.585 ± 0.184
1.974ValIle: 1.974 ± 0.338
3.949ValLys: 3.949 ± 0.616
4.022ValLeu: 4.022 ± 0.511
1.755ValMet: 1.755 ± 0.365
4.461ValAsn: 4.461 ± 0.786
2.633ValPro: 2.633 ± 0.497
2.413ValGln: 2.413 ± 0.346
2.706ValArg: 2.706 ± 0.373
4.534ValSer: 4.534 ± 0.747
5.265ValThr: 5.265 ± 1.076
4.534ValVal: 4.534 ± 0.825
0.585ValTrp: 0.585 ± 0.196
2.267ValTyr: 2.267 ± 0.415
0.0ValXaa: 0.0 ± 0.0
Trp
1.243TrpAla: 1.243 ± 0.364
0.073TrpCys: 0.073 ± 0.075
1.243TrpAsp: 1.243 ± 0.268
1.389TrpGlu: 1.389 ± 0.3
0.512TrpPhe: 0.512 ± 0.195
0.585TrpGly: 0.585 ± 0.192
0.439TrpHis: 0.439 ± 0.216
0.731TrpIle: 0.731 ± 0.235
1.316TrpLys: 1.316 ± 0.298
0.804TrpLeu: 0.804 ± 0.288
0.512TrpMet: 0.512 ± 0.203
0.293TrpAsn: 0.293 ± 0.116
0.366TrpPro: 0.366 ± 0.182
0.951TrpGln: 0.951 ± 0.229
0.439TrpArg: 0.439 ± 0.203
0.951TrpSer: 0.951 ± 0.318
0.804TrpThr: 0.804 ± 0.288
1.097TrpVal: 1.097 ± 0.332
0.0TrpTrp: 0.0 ± 0.0
0.658TrpTyr: 0.658 ± 0.262
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.048TyrAla: 2.048 ± 0.31
0.658TyrCys: 0.658 ± 0.282
2.559TyrAsp: 2.559 ± 0.466
2.048TyrGlu: 2.048 ± 0.459
1.097TyrPhe: 1.097 ± 0.26
3.364TyrGly: 3.364 ± 0.479
0.951TyrHis: 0.951 ± 0.312
1.901TyrIle: 1.901 ± 0.425
2.852TyrLys: 2.852 ± 0.589
2.559TyrLeu: 2.559 ± 0.429
1.024TyrMet: 1.024 ± 0.227
1.974TyrAsn: 1.974 ± 0.485
0.951TyrPro: 0.951 ± 0.292
1.316TyrGln: 1.316 ± 0.239
2.194TyrArg: 2.194 ± 0.409
1.974TyrSer: 1.974 ± 0.342
2.852TyrThr: 2.852 ± 0.795
2.194TyrVal: 2.194 ± 0.425
0.366TyrTrp: 0.366 ± 0.174
2.413TyrTyr: 2.413 ± 0.388
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 49 proteins (13676 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski