Amino acid dipeptide frequency for Cyanophage 9515-10a

Apart from single amino acid frequencies, one can also calculate so-called amino acid dipeptide frequencies.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
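As an illustration of the calculation described above, the following is a minimal sketch, not the database's own implementation. It assumes overlapping residue pairs are counted within each protein and normalized by the total number of pairs; the exact counting convention used for the table is not stated here. The names dipeptide_per_mille and representation are hypothetical.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation: 1/400 of all dipeptides, i.e. 2.5 per mille.
UNIFORM_EXPECTED = 1000.0 / len(STANDARD) ** 2


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption: pairs are counted inside each protein only, never across
    protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        # Collapse non-standard or unknown residues to 'X' (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def representation(freq_per_mille, expected=UNIFORM_EXPECTED):
    """Classify a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"
```

With this convention, a value such as 9.177‰ for AlaAla would be classified as "over" and 0.139‰ for CysCys as "under", since the uniform expectation is 2.5‰.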

All values are given in per mille (‰) and must therefore be multiplied by 10^-3 to obtain fractions (for example, 9.177‰ = 0.009177).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.177 ± 1.547
AlaCys: 0.973 ± 0.319
AlaAsp: 5.77 ± 0.686
AlaGlu: 5.006 ± 0.774
AlaPhe: 2.781 ± 0.447
AlaGly: 5.84 ± 0.711
AlaHis: 1.321 ± 0.336
AlaIle: 5.075 ± 0.458
AlaLys: 4.797 ± 0.72
AlaLeu: 6.813 ± 0.768
AlaMet: 2.364 ± 0.506
AlaAsn: 4.241 ± 0.943
AlaPro: 2.989 ± 0.62
AlaGln: 3.824 ± 0.701
AlaArg: 4.38 ± 0.631
AlaSer: 5.353 ± 0.952
AlaThr: 6.118 ± 1.07
AlaVal: 6.187 ± 0.939
AlaTrp: 0.904 ± 0.21
AlaTyr: 2.572 ± 0.468
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.626 ± 0.273
CysCys: 0.139 ± 0.097
CysAsp: 0.765 ± 0.263
CysGlu: 0.487 ± 0.16
CysPhe: 0.209 ± 0.117
CysGly: 0.278 ± 0.151
CysHis: 0.209 ± 0.149
CysIle: 0.626 ± 0.226
CysLys: 0.834 ± 0.308
CysLeu: 0.765 ± 0.243
CysMet: 0.209 ± 0.143
CysAsn: 0.348 ± 0.181
CysPro: 0.417 ± 0.206
CysGln: 0.417 ± 0.16
CysArg: 0.626 ± 0.332
CysSer: 0.904 ± 0.264
CysThr: 0.487 ± 0.212
CysVal: 0.556 ± 0.279
CysTrp: 0.139 ± 0.149
CysTyr: 0.348 ± 0.255
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.396 ± 0.792
AspCys: 0.278 ± 0.181
AspAsp: 3.754 ± 0.637
AspGlu: 3.893 ± 0.656
AspPhe: 2.086 ± 0.295
AspGly: 5.631 ± 1.036
AspHis: 0.765 ± 0.229
AspIle: 3.963 ± 0.436
AspLys: 4.102 ± 0.757
AspLeu: 6.118 ± 0.868
AspMet: 1.738 ± 0.4
AspAsn: 4.31 ± 1.014
AspPro: 2.503 ± 0.645
AspGln: 2.503 ± 0.531
AspArg: 2.225 ± 0.464
AspSer: 4.588 ± 0.616
AspThr: 3.198 ± 0.713
AspVal: 4.171 ± 0.445
AspTrp: 1.112 ± 0.305
AspTyr: 2.642 ± 0.528
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.048 ± 0.828
GluCys: 0.973 ± 0.285
GluAsp: 4.241 ± 0.777
GluGlu: 3.268 ± 0.667
GluPhe: 2.294 ± 0.562
GluGly: 3.128 ± 0.602
GluHis: 1.321 ± 0.244
GluIle: 3.685 ± 0.608
GluLys: 3.128 ± 0.585
GluLeu: 6.744 ± 0.699
GluMet: 0.904 ± 0.23
GluAsn: 2.711 ± 0.499
GluPro: 2.294 ± 0.479
GluGln: 2.225 ± 0.443
GluArg: 2.503 ± 0.449
GluSer: 3.407 ± 0.758
GluThr: 4.38 ± 0.448
GluVal: 4.032 ± 0.51
GluTrp: 1.182 ± 0.316
GluTyr: 2.225 ± 0.449
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.669 ± 0.382
PheCys: 0.209 ± 0.112
PheAsp: 2.225 ± 0.38
PheGlu: 2.503 ± 0.458
PhePhe: 1.529 ± 0.271
PheGly: 2.433 ± 0.536
PheHis: 0.973 ± 0.274
PheIle: 2.016 ± 0.495
PheLys: 2.642 ± 0.446
PheLeu: 2.503 ± 0.607
PheMet: 1.112 ± 0.241
PheAsn: 2.92 ± 0.611
PhePro: 1.182 ± 0.258
PheGln: 1.182 ± 0.34
PheArg: 1.529 ± 0.412
PheSer: 2.85 ± 0.477
PheThr: 2.294 ± 0.389
PheVal: 2.294 ± 0.387
PheTrp: 0.487 ± 0.207
PheTyr: 0.834 ± 0.215
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.701 ± 1.591
GlyCys: 0.348 ± 0.138
GlyAsp: 5.075 ± 1.073
GlyGlu: 3.198 ± 0.399
GlyPhe: 2.294 ± 0.388
GlyGly: 4.658 ± 1.151
GlyHis: 0.556 ± 0.175
GlyIle: 4.241 ± 0.504
GlyLys: 5.701 ± 0.712
GlyLeu: 6.257 ± 0.765
GlyMet: 1.599 ± 0.384
GlyAsn: 3.963 ± 1.224
GlyPro: 1.529 ± 0.275
GlyGln: 2.989 ± 0.418
GlyArg: 3.546 ± 0.526
GlySer: 5.909 ± 1.264
GlyThr: 6.396 ± 1.614
GlyVal: 5.006 ± 0.568
GlyTrp: 0.765 ± 0.224
GlyTyr: 2.781 ± 0.479
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.973 ± 0.307
HisCys: 0.487 ± 0.237
HisAsp: 1.251 ± 0.35
HisGlu: 1.39 ± 0.348
HisPhe: 0.487 ± 0.25
HisGly: 1.182 ± 0.234
HisHis: 0.348 ± 0.183
HisIle: 0.904 ± 0.262
HisLys: 0.626 ± 0.2
HisLeu: 1.321 ± 0.352
HisMet: 0.348 ± 0.162
HisAsn: 1.46 ± 0.434
HisPro: 0.695 ± 0.222
HisGln: 0.556 ± 0.207
HisArg: 0.904 ± 0.318
HisSer: 0.765 ± 0.214
HisThr: 1.112 ± 0.311
HisVal: 1.182 ± 0.206
HisTrp: 0.139 ± 0.125
HisTyr: 0.834 ± 0.247
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.284 ± 0.614
IleCys: 0.348 ± 0.193
IleAsp: 2.85 ± 0.368
IleGlu: 2.92 ± 0.397
IlePhe: 1.947 ± 0.401
IleGly: 2.989 ± 0.511
IleHis: 1.112 ± 0.291
IleIle: 2.433 ± 0.334
IleLys: 4.797 ± 0.666
IleLeu: 3.893 ± 0.599
IleMet: 0.556 ± 0.223
IleAsn: 2.364 ± 0.589
IlePro: 3.476 ± 0.74
IleGln: 2.572 ± 0.466
IleArg: 2.503 ± 0.425
IleSer: 3.963 ± 0.49
IleThr: 3.476 ± 0.572
IleVal: 2.642 ± 0.4
IleTrp: 0.626 ± 0.239
IleTyr: 2.155 ± 0.455
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.562 ± 0.776
LysCys: 0.417 ± 0.273
LysAsp: 5.006 ± 0.904
LysGlu: 4.449 ± 0.948
LysPhe: 2.642 ± 0.382
LysGly: 3.963 ± 0.64
LysHis: 1.112 ± 0.251
LysIle: 3.268 ± 0.338
LysLys: 5.145 ± 0.791
LysLeu: 6.257 ± 0.688
LysMet: 1.251 ± 0.266
LysAsn: 3.963 ± 0.744
LysPro: 2.642 ± 0.497
LysGln: 2.711 ± 0.599
LysArg: 3.059 ± 0.647
LysSer: 3.337 ± 0.418
LysThr: 3.615 ± 0.547
LysVal: 3.615 ± 0.579
LysTrp: 0.904 ± 0.262
LysTyr: 2.433 ± 0.519
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.813 ± 0.719
LeuCys: 0.626 ± 0.235
LeuAsp: 5.909 ± 0.347
LeuGlu: 5.631 ± 0.536
LeuPhe: 2.364 ± 0.453
LeuGly: 6.396 ± 0.77
LeuHis: 1.529 ± 0.372
LeuIle: 3.685 ± 0.766
LeuLys: 5.562 ± 0.533
LeuLeu: 5.423 ± 0.783
LeuMet: 2.642 ± 0.532
LeuAsn: 5.075 ± 0.706
LeuPro: 3.824 ± 0.575
LeuGln: 3.407 ± 0.432
LeuArg: 4.032 ± 0.673
LeuSer: 5.006 ± 0.627
LeuThr: 4.519 ± 0.629
LeuVal: 3.476 ± 0.445
LeuTrp: 0.556 ± 0.224
LeuTyr: 2.503 ± 0.603
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.294 ± 0.372
MetCys: 0.417 ± 0.167
MetAsp: 1.599 ± 0.427
MetGlu: 1.46 ± 0.351
MetPhe: 0.765 ± 0.232
MetGly: 2.086 ± 0.459
MetHis: 0.556 ± 0.294
MetIle: 0.765 ± 0.2
MetLys: 1.877 ± 0.384
MetLeu: 1.529 ± 0.408
MetMet: 0.487 ± 0.179
MetAsn: 0.904 ± 0.289
MetPro: 0.626 ± 0.242
MetGln: 0.973 ± 0.279
MetArg: 1.043 ± 0.299
MetSer: 1.529 ± 0.288
MetThr: 2.294 ± 0.454
MetVal: 1.529 ± 0.369
MetTrp: 0.348 ± 0.153
MetTyr: 0.695 ± 0.268
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.824 ± 0.563
AsnCys: 0.278 ± 0.176
AsnAsp: 2.503 ± 0.411
AsnGlu: 3.685 ± 0.585
AsnPhe: 2.781 ± 0.461
AsnGly: 4.797 ± 1.305
AsnHis: 1.112 ± 0.274
AsnIle: 2.433 ± 0.505
AsnLys: 3.128 ± 0.584
AsnLeu: 4.449 ± 0.749
AsnMet: 0.973 ± 0.31
AsnAsn: 3.128 ± 0.645
AsnPro: 3.337 ± 0.572
AsnGln: 2.016 ± 0.36
AsnArg: 2.572 ± 0.593
AsnSer: 4.032 ± 0.876
AsnThr: 3.754 ± 0.804
AsnVal: 3.337 ± 0.686
AsnTrp: 0.834 ± 0.262
AsnTyr: 2.016 ± 0.334
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.781 ± 0.571
ProCys: 0.417 ± 0.165
ProAsp: 2.711 ± 0.42
ProGlu: 2.225 ± 0.518
ProPhe: 1.599 ± 0.312
ProGly: 2.433 ± 0.51
ProHis: 0.765 ± 0.25
ProIle: 2.225 ± 0.312
ProLys: 2.294 ± 0.384
ProLeu: 2.016 ± 0.439
ProMet: 1.251 ± 0.301
ProAsn: 2.364 ± 0.527
ProPro: 2.503 ± 0.781
ProGln: 1.877 ± 0.324
ProArg: 0.904 ± 0.236
ProSer: 2.572 ± 0.465
ProThr: 3.685 ± 0.601
ProVal: 2.92 ± 0.559
ProTrp: 1.112 ± 0.274
ProTyr: 1.39 ± 0.299
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.102 ± 1.003
GlnCys: 0.209 ± 0.113
GlnAsp: 1.669 ± 0.359
GlnGlu: 2.225 ± 0.533
GlnPhe: 1.599 ± 0.418
GlnGly: 3.546 ± 0.651
GlnHis: 0.834 ± 0.26
GlnIle: 2.433 ± 0.45
GlnLys: 2.572 ± 0.454
GlnLeu: 2.989 ± 0.532
GlnMet: 0.904 ± 0.292
GlnAsn: 1.877 ± 0.498
GlnPro: 0.765 ± 0.261
GlnGln: 2.92 ± 0.664
GlnArg: 1.877 ± 0.424
GlnSer: 2.989 ± 0.473
GlnThr: 1.947 ± 0.31
GlnVal: 2.503 ± 0.513
GlnTrp: 0.626 ± 0.239
GlnTyr: 1.877 ± 0.41
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.615 ± 0.709
ArgCys: 0.417 ± 0.21
ArgAsp: 3.128 ± 0.512
ArgGlu: 2.642 ± 0.576
ArgPhe: 1.877 ± 0.366
ArgGly: 2.503 ± 0.71
ArgHis: 0.765 ± 0.199
ArgIle: 2.503 ± 0.45
ArgLys: 3.337 ± 0.721
ArgLeu: 3.407 ± 0.514
ArgMet: 1.112 ± 0.254
ArgAsn: 2.989 ± 0.565
ArgPro: 1.321 ± 0.358
ArgGln: 1.599 ± 0.331
ArgArg: 1.877 ± 0.425
ArgSer: 2.781 ± 0.522
ArgThr: 3.059 ± 0.468
ArgVal: 2.642 ± 0.374
ArgTrp: 1.112 ± 0.399
ArgTyr: 1.947 ± 0.357
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.006 ± 0.984
SerCys: 0.765 ± 0.284
SerAsp: 5.006 ± 0.623
SerGlu: 4.032 ± 0.634
SerPhe: 2.503 ± 0.352
SerGly: 7.161 ± 1.642
SerHis: 1.321 ± 0.384
SerIle: 3.407 ± 0.433
SerLys: 3.198 ± 0.471
SerLeu: 5.631 ± 0.519
SerMet: 1.669 ± 0.275
SerAsn: 3.685 ± 0.83
SerPro: 2.503 ± 0.451
SerGln: 2.086 ± 0.36
SerArg: 2.85 ± 0.596
SerSer: 4.727 ± 1.156
SerThr: 5.075 ± 1.271
SerVal: 3.893 ± 0.453
SerTrp: 1.112 ± 0.246
SerTyr: 2.503 ± 0.405
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 8.412 ± 1.353
ThrCys: 0.556 ± 0.198
ThrAsp: 4.241 ± 0.73
ThrGlu: 4.241 ± 0.493
ThrPhe: 1.877 ± 0.4
ThrGly: 5.631 ± 1.344
ThrHis: 0.695 ± 0.207
ThrIle: 4.032 ± 0.574
ThrLys: 3.407 ± 0.581
ThrLeu: 5.84 ± 0.686
ThrMet: 1.182 ± 0.282
ThrAsn: 2.85 ± 0.55
ThrPro: 3.128 ± 0.739
ThrGln: 2.086 ± 0.495
ThrArg: 2.711 ± 0.536
ThrSer: 5.423 ± 1.273
ThrThr: 5.909 ± 1.305
ThrVal: 4.449 ± 0.979
ThrTrp: 0.695 ± 0.242
ThrTyr: 2.503 ± 0.439
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.727 ± 0.677
ValCys: 0.695 ± 0.226
ValAsp: 4.449 ± 0.606
ValGlu: 4.032 ± 0.588
ValPhe: 2.294 ± 0.441
ValGly: 4.241 ± 0.529
ValHis: 0.695 ± 0.213
ValIle: 2.572 ± 0.441
ValLys: 4.102 ± 0.565
ValLeu: 3.754 ± 0.417
ValMet: 1.808 ± 0.338
ValAsn: 3.615 ± 0.681
ValPro: 2.781 ± 0.467
ValGln: 2.503 ± 0.491
ValArg: 2.781 ± 0.517
ValSer: 4.936 ± 0.825
ValThr: 5.075 ± 0.971
ValVal: 4.588 ± 0.594
ValTrp: 0.556 ± 0.164
ValTyr: 2.016 ± 0.423
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.321 ± 0.385
TrpCys: 0.139 ± 0.09
TrpAsp: 1.182 ± 0.302
TrpGlu: 0.973 ± 0.321
TrpPhe: 0.556 ± 0.202
TrpGly: 0.417 ± 0.143
TrpHis: 0.209 ± 0.129
TrpIle: 0.834 ± 0.276
TrpLys: 1.043 ± 0.254
TrpLeu: 0.834 ± 0.307
TrpMet: 0.556 ± 0.197
TrpAsn: 0.487 ± 0.157
TrpPro: 0.487 ± 0.179
TrpGln: 0.765 ± 0.217
TrpArg: 0.556 ± 0.285
TrpSer: 0.834 ± 0.301
TrpThr: 0.904 ± 0.224
TrpVal: 0.973 ± 0.282
TrpTrp: 0.07 ± 0.071
TrpTyr: 0.556 ± 0.245
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.155 ± 0.383
TyrCys: 0.695 ± 0.269
TyrAsp: 2.642 ± 0.499
TyrGlu: 2.433 ± 0.417
TyrPhe: 0.973 ± 0.222
TyrGly: 3.198 ± 0.422
TyrHis: 0.765 ± 0.339
TyrIle: 1.808 ± 0.358
TyrLys: 3.059 ± 0.618
TyrLeu: 2.433 ± 0.428
TyrMet: 1.043 ± 0.281
TyrAsn: 1.738 ± 0.432
TyrPro: 1.043 ± 0.373
TyrGln: 1.251 ± 0.23
TyrArg: 2.155 ± 0.433
TyrSer: 2.225 ± 0.357
TyrThr: 2.572 ± 0.564
TyrVal: 2.225 ± 0.513
TyrTrp: 0.417 ± 0.185
TyrTyr: 2.711 ± 0.503
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 55 proteins (14385 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
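To make the note above concrete, here is a minimal sketch of a protein-level bootstrap. It is an assumption about how such an error could be obtained, not the procedure confirmed for this database: whole proteins are resampled with replacement 100 times, the per mille frequency of one dipeptide is recomputed for each resample, and the standard deviation of those estimates is reported. The function names, the use of one-letter codes, and the choice of the population standard deviation are all hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the per mille frequency of `pair`
    across protein-level bootstrap resamples."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)
```

Resampling proteins rather than individual residues keeps each protein's internal composition intact, so the spread reflects variation between proteins in the proteome.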

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski