Amino acid dipeptide frequency for Acinetobacter phage Aristophanes

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
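The counting described above can be sketched as follows. This is a minimal illustration, not the database's own code: the function name `dipeptide_frequencies`, the toy sequences, and the choice to count overlapping pairs within each protein are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code (Xaa is left out of this sketch).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_frequencies(proteins):
    """Return the fraction of each of the 400 standard dipeptides.

    Assumption: dipeptides are counted as overlapping pairs of adjacent
    residues inside each protein, never across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {a + b: 0.0 for a, b in product(AMINO_ACIDS, repeat=2)}
    return {a + b: counts[a + b] / total
            for a, b in product(AMINO_ACIDS, repeat=2)}


# Hypothetical toy sequences, not taken from the phage proteome.
toy_proteins = ["MKKLA", "AAGL"]
freqs = dipeptide_frequencies(toy_proteins)

# Expected frequency if all 400 dipeptides were equally likely.
uniform_expectation = 1 / len(AMINO_ACIDS) ** 2  # 0.0025, i.e. 0.25%
```

With real data, `toy_proteins` would be replaced by the protein sequences of the proteome, and each value in `freqs` could be compared against `uniform_expectation`.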

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
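As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and compares it with the uniform expectation of 1/400. The helper names and the simple above/below rule are assumptions for illustration; the exact rule used to colour the original table is not stated on this page.

```python
# Uniform expectation for 400 equally likely dipeptides, in per mille.
UNIFORM_PER_MILLE = 1000 / 400  # 2.5


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value_per_mille):
    """Label a dipeptide relative to the uniform expectation.

    Assumption: anything above 2.5 per mille counts as overrepresented and
    anything below as underrepresented.
    """
    if value_per_mille > UNIFORM_PER_MILLE:
        return "over"
    if value_per_mille < UNIFORM_PER_MILLE:
        return "under"
    return "expected"


ala_ala = 7.493  # AlaAla value from the table, in per mille
gly_pro = 0.148  # GlyPro value from the table, in per mille

ala_ala_fraction = per_mille_to_fraction(ala_ala)
ala_ala_label = classify(ala_ala)
gly_pro_label = classify(gly_pro)
```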

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
7.493AlaAla: 7.493 ± 0.729
0.668AlaCys: 0.668 ± 0.23
3.932AlaAsp: 3.932 ± 0.75
5.193AlaGlu: 5.193 ± 0.835
3.412AlaPhe: 3.412 ± 0.473
6.602AlaGly: 6.602 ± 0.493
1.632AlaHis: 1.632 ± 0.411
4.08AlaIle: 4.08 ± 0.51
5.045AlaLys: 5.045 ± 0.773
7.938AlaLeu: 7.938 ± 0.949
2.3AlaMet: 2.3 ± 0.493
4.228AlaAsn: 4.228 ± 0.722
2.745AlaPro: 2.745 ± 0.541
4.674AlaGln: 4.674 ± 0.776
3.116AlaArg: 3.116 ± 0.524
3.042AlaSer: 3.042 ± 0.485
5.712AlaThr: 5.712 ± 0.629
5.49AlaVal: 5.49 ± 0.648
1.113AlaTrp: 1.113 ± 0.232
2.967AlaTyr: 2.967 ± 0.348
0.0AlaXaa: 0.0 ± 0.0
Cys
0.668CysAla: 0.668 ± 0.193
0.0CysCys: 0.0 ± 0.0
0.148CysAsp: 0.148 ± 0.11
0.445CysGlu: 0.445 ± 0.188
0.297CysPhe: 0.297 ± 0.218
0.964CysGly: 0.964 ± 0.261
0.371CysHis: 0.371 ± 0.177
0.668CysIle: 0.668 ± 0.269
0.519CysLys: 0.519 ± 0.187
0.593CysLeu: 0.593 ± 0.231
0.593CysMet: 0.593 ± 0.224
0.593CysAsn: 0.593 ± 0.23
0.223CysPro: 0.223 ± 0.118
0.074CysGln: 0.074 ± 0.069
0.371CysArg: 0.371 ± 0.186
0.668CysSer: 0.668 ± 0.266
1.187CysThr: 1.187 ± 0.335
0.964CysVal: 0.964 ± 0.307
0.0CysTrp: 0.0 ± 0.0
0.593CysTyr: 0.593 ± 0.253
0.0CysXaa: 0.0 ± 0.0
Asp
5.49AspAla: 5.49 ± 0.646
0.742AspCys: 0.742 ± 0.241
3.116AspAsp: 3.116 ± 0.632
3.709AspGlu: 3.709 ± 0.485
2.226AspPhe: 2.226 ± 0.374
4.599AspGly: 4.599 ± 0.678
1.261AspHis: 1.261 ± 0.287
3.709AspIle: 3.709 ± 0.492
3.042AspLys: 3.042 ± 0.382
5.415AspLeu: 5.415 ± 0.615
1.929AspMet: 1.929 ± 0.367
3.116AspAsn: 3.116 ± 0.536
2.077AspPro: 2.077 ± 0.321
1.409AspGln: 1.409 ± 0.293
2.596AspArg: 2.596 ± 0.421
2.745AspSer: 2.745 ± 0.565
4.303AspThr: 4.303 ± 0.542
4.896AspVal: 4.896 ± 0.64
1.113AspTrp: 1.113 ± 0.267
2.077AspTyr: 2.077 ± 0.468
0.0AspXaa: 0.0 ± 0.0
Glu
5.341GluAla: 5.341 ± 0.561
1.039GluCys: 1.039 ± 0.447
3.932GluAsp: 3.932 ± 0.552
3.338GluGlu: 3.338 ± 0.559
2.522GluPhe: 2.522 ± 0.47
3.932GluGly: 3.932 ± 0.587
1.706GluHis: 1.706 ± 0.4
2.522GluIle: 2.522 ± 0.409
2.745GluLys: 2.745 ± 0.5
6.677GluLeu: 6.677 ± 0.879
1.335GluMet: 1.335 ± 0.361
2.967GluAsn: 2.967 ± 0.457
2.3GluPro: 2.3 ± 0.361
3.932GluGln: 3.932 ± 0.701
2.893GluArg: 2.893 ± 0.434
2.448GluSer: 2.448 ± 0.414
2.596GluThr: 2.596 ± 0.416
5.267GluVal: 5.267 ± 0.719
0.964GluTrp: 0.964 ± 0.274
3.338GluTyr: 3.338 ± 0.491
0.0GluXaa: 0.0 ± 0.0
Phe
2.671PheAla: 2.671 ± 0.55
0.668PheCys: 0.668 ± 0.214
1.484PheAsp: 1.484 ± 0.359
2.374PheGlu: 2.374 ± 0.384
1.335PhePhe: 1.335 ± 0.299
2.077PheGly: 2.077 ± 0.382
0.964PheHis: 0.964 ± 0.287
1.261PheIle: 1.261 ± 0.338
2.893PheLys: 2.893 ± 0.417
2.226PheLeu: 2.226 ± 0.464
0.593PheMet: 0.593 ± 0.15
2.819PheAsn: 2.819 ± 0.535
1.558PhePro: 1.558 ± 0.354
0.89PheGln: 0.89 ± 0.247
2.077PheArg: 2.077 ± 0.45
3.042PheSer: 3.042 ± 0.506
2.596PheThr: 2.596 ± 0.517
1.855PheVal: 1.855 ± 0.517
0.519PheTrp: 0.519 ± 0.22
1.484PheTyr: 1.484 ± 0.329
0.0PheXaa: 0.0 ± 0.0
Gly
5.712GlyAla: 5.712 ± 0.848
0.519GlyCys: 0.519 ± 0.191
5.045GlyAsp: 5.045 ± 0.459
3.116GlyGlu: 3.116 ± 0.343
2.374GlyPhe: 2.374 ± 0.423
5.193GlyGly: 5.193 ± 1.046
0.593GlyHis: 0.593 ± 0.189
4.525GlyIle: 4.525 ± 0.617
4.599GlyLys: 4.599 ± 0.542
4.896GlyLeu: 4.896 ± 0.587
1.558GlyMet: 1.558 ± 0.288
3.116GlyAsn: 3.116 ± 0.699
0.148GlyPro: 0.148 ± 0.117
3.264GlyGln: 3.264 ± 0.619
3.561GlyArg: 3.561 ± 0.423
4.303GlySer: 4.303 ± 0.802
5.638GlyThr: 5.638 ± 0.792
6.009GlyVal: 6.009 ± 0.721
1.113GlyTrp: 1.113 ± 0.267
3.858GlyTyr: 3.858 ± 0.508
0.0GlyXaa: 0.0 ± 0.0
His
1.039HisAla: 1.039 ± 0.253
0.223HisCys: 0.223 ± 0.145
1.409HisAsp: 1.409 ± 0.396
1.113HisGlu: 1.113 ± 0.372
0.816HisPhe: 0.816 ± 0.259
1.632HisGly: 1.632 ± 0.352
0.519HisHis: 0.519 ± 0.214
1.113HisIle: 1.113 ± 0.284
1.261HisLys: 1.261 ± 0.274
1.706HisLeu: 1.706 ± 0.435
0.519HisMet: 0.519 ± 0.245
1.113HisAsn: 1.113 ± 0.21
0.964HisPro: 0.964 ± 0.24
0.593HisGln: 0.593 ± 0.224
1.039HisArg: 1.039 ± 0.386
1.335HisSer: 1.335 ± 0.303
1.632HisThr: 1.632 ± 0.342
0.964HisVal: 0.964 ± 0.225
0.297HisTrp: 0.297 ± 0.14
0.668HisTyr: 0.668 ± 0.269
0.0HisXaa: 0.0 ± 0.0
Ile
3.412IleAla: 3.412 ± 0.506
0.519IleCys: 0.519 ± 0.215
4.303IleAsp: 4.303 ± 0.657
3.635IleGlu: 3.635 ± 0.49
1.409IlePhe: 1.409 ± 0.434
2.3IleGly: 2.3 ± 0.558
0.816IleHis: 0.816 ± 0.198
3.042IleIle: 3.042 ± 0.57
3.783IleLys: 3.783 ± 0.673
3.783IleLeu: 3.783 ± 0.558
1.484IleMet: 1.484 ± 0.317
2.671IleAsn: 2.671 ± 0.459
2.226IlePro: 2.226 ± 0.397
2.671IleGln: 2.671 ± 0.5
2.745IleArg: 2.745 ± 0.404
3.19IleSer: 3.19 ± 0.412
4.006IleThr: 4.006 ± 0.528
3.042IleVal: 3.042 ± 0.414
0.816IleTrp: 0.816 ± 0.17
1.484IleTyr: 1.484 ± 0.406
0.0IleXaa: 0.0 ± 0.0
Lys
5.638LysAla: 5.638 ± 0.87
0.297LysCys: 0.297 ± 0.132
4.377LysAsp: 4.377 ± 0.599
3.783LysGlu: 3.783 ± 0.631
2.596LysPhe: 2.596 ± 0.406
2.893LysGly: 2.893 ± 0.365
1.558LysHis: 1.558 ± 0.348
2.3LysIle: 2.3 ± 0.493
2.3LysLys: 2.3 ± 0.576
6.602LysLeu: 6.602 ± 0.652
1.409LysMet: 1.409 ± 0.337
2.3LysAsn: 2.3 ± 0.421
2.226LysPro: 2.226 ± 0.488
2.596LysGln: 2.596 ± 0.5
3.932LysArg: 3.932 ± 0.547
4.154LysSer: 4.154 ± 0.55
2.003LysThr: 2.003 ± 0.395
3.932LysVal: 3.932 ± 0.665
0.742LysTrp: 0.742 ± 0.237
3.264LysTyr: 3.264 ± 0.619
0.0LysXaa: 0.0 ± 0.0
Leu
7.122LeuAla: 7.122 ± 0.812
0.742LeuCys: 0.742 ± 0.223
5.267LeuAsp: 5.267 ± 0.63
5.267LeuGlu: 5.267 ± 0.75
2.3LeuPhe: 2.3 ± 0.422
7.196LeuGly: 7.196 ± 0.828
2.151LeuHis: 2.151 ± 0.381
3.709LeuIle: 3.709 ± 0.474
4.377LeuLys: 4.377 ± 0.475
7.344LeuLeu: 7.344 ± 0.894
1.558LeuMet: 1.558 ± 0.386
4.896LeuAsn: 4.896 ± 0.55
3.19LeuPro: 3.19 ± 0.581
4.599LeuGln: 4.599 ± 0.629
4.896LeuArg: 4.896 ± 0.669
5.935LeuSer: 5.935 ± 0.77
4.525LeuThr: 4.525 ± 0.696
5.193LeuVal: 5.193 ± 0.652
0.89LeuTrp: 0.89 ± 0.247
4.303LeuTyr: 4.303 ± 0.763
0.0LeuXaa: 0.0 ± 0.0
Met
1.855MetAla: 1.855 ± 0.375
0.371MetCys: 0.371 ± 0.174
0.742MetAsp: 0.742 ± 0.236
0.742MetGlu: 0.742 ± 0.259
0.668MetPhe: 0.668 ± 0.198
1.409MetGly: 1.409 ± 0.348
0.742MetHis: 0.742 ± 0.197
1.113MetIle: 1.113 ± 0.287
1.632MetLys: 1.632 ± 0.357
2.151MetLeu: 2.151 ± 0.347
0.445MetMet: 0.445 ± 0.185
1.409MetAsn: 1.409 ± 0.22
0.445MetPro: 0.445 ± 0.205
1.78MetGln: 1.78 ± 0.354
1.187MetArg: 1.187 ± 0.278
2.151MetSer: 2.151 ± 0.388
1.484MetThr: 1.484 ± 0.208
1.409MetVal: 1.409 ± 0.332
0.297MetTrp: 0.297 ± 0.162
1.558MetTyr: 1.558 ± 0.255
0.0MetXaa: 0.0 ± 0.0
Asn
4.525AsnAla: 4.525 ± 0.618
0.519AsnCys: 0.519 ± 0.213
2.077AsnAsp: 2.077 ± 0.48
1.78AsnGlu: 1.78 ± 0.346
1.78AsnPhe: 1.78 ± 0.375
3.932AsnGly: 3.932 ± 0.769
1.039AsnHis: 1.039 ± 0.389
2.967AsnIle: 2.967 ± 0.381
2.745AsnLys: 2.745 ± 0.439
4.822AsnLeu: 4.822 ± 0.565
1.187AsnMet: 1.187 ± 0.234
2.151AsnAsn: 2.151 ± 0.469
2.893AsnPro: 2.893 ± 0.403
1.78AsnGln: 1.78 ± 0.49
1.335AsnArg: 1.335 ± 0.302
3.338AsnSer: 3.338 ± 0.644
4.377AsnThr: 4.377 ± 0.565
3.116AsnVal: 3.116 ± 0.525
1.113AsnTrp: 1.113 ± 0.257
2.374AsnTyr: 2.374 ± 0.382
0.0AsnXaa: 0.0 ± 0.0
Pro
2.226ProAla: 2.226 ± 0.365
0.074ProCys: 0.074 ± 0.07
3.19ProAsp: 3.19 ± 0.623
4.97ProGlu: 4.97 ± 0.678
1.261ProPhe: 1.261 ± 0.257
0.0ProGly: 0.0 ± 0.0
0.297ProHis: 0.297 ± 0.166
1.855ProIle: 1.855 ± 0.371
3.042ProLys: 3.042 ± 0.524
2.226ProLeu: 2.226 ± 0.344
1.039ProMet: 1.039 ± 0.295
1.706ProAsn: 1.706 ± 0.4
1.261ProPro: 1.261 ± 0.277
1.558ProGln: 1.558 ± 0.73
1.409ProArg: 1.409 ± 0.266
2.151ProSer: 2.151 ± 0.414
3.116ProThr: 3.116 ± 0.583
4.525ProVal: 4.525 ± 0.707
0.223ProTrp: 0.223 ± 0.143
1.484ProTyr: 1.484 ± 0.321
0.0ProXaa: 0.0 ± 0.0
Gln
5.712GlnAla: 5.712 ± 1.29
0.297GlnCys: 0.297 ± 0.161
2.522GlnAsp: 2.522 ± 0.491
2.819GlnGlu: 2.819 ± 0.405
1.558GlnPhe: 1.558 ± 0.372
3.561GlnGly: 3.561 ± 0.567
0.816GlnHis: 0.816 ± 0.219
2.522GlnIle: 2.522 ± 0.588
2.374GlnLys: 2.374 ± 0.415
3.561GlnLeu: 3.561 ± 0.623
0.593GlnMet: 0.593 ± 0.213
2.3GlnAsn: 2.3 ± 0.437
2.077GlnPro: 2.077 ± 1.1
4.006GlnGln: 4.006 ± 1.463
2.967GlnArg: 2.967 ± 0.519
3.487GlnSer: 3.487 ± 0.412
1.484GlnThr: 1.484 ± 0.384
3.412GlnVal: 3.412 ± 0.519
1.113GlnTrp: 1.113 ± 0.248
2.967GlnTyr: 2.967 ± 0.606
0.0GlnXaa: 0.0 ± 0.0
Arg
3.709ArgAla: 3.709 ± 0.521
0.445ArgCys: 0.445 ± 0.164
2.226ArgAsp: 2.226 ± 0.388
3.561ArgGlu: 3.561 ± 0.581
2.151ArgPhe: 2.151 ± 0.446
3.264ArgGly: 3.264 ± 0.541
0.89ArgHis: 0.89 ± 0.249
3.116ArgIle: 3.116 ± 0.436
2.893ArgLys: 2.893 ± 0.477
4.674ArgLeu: 4.674 ± 0.524
1.187ArgMet: 1.187 ± 0.336
2.003ArgAsn: 2.003 ± 0.42
1.632ArgPro: 1.632 ± 0.286
2.003ArgGln: 2.003 ± 0.604
3.19ArgArg: 3.19 ± 0.567
2.448ArgSer: 2.448 ± 0.412
3.487ArgThr: 3.487 ± 0.608
4.154ArgVal: 4.154 ± 0.694
0.371ArgTrp: 0.371 ± 0.179
2.3ArgTyr: 2.3 ± 0.345
0.0ArgXaa: 0.0 ± 0.0
Ser
5.49SerAla: 5.49 ± 0.614
0.593SerCys: 0.593 ± 0.225
3.116SerAsp: 3.116 ± 0.465
3.487SerGlu: 3.487 ± 0.551
1.632SerPhe: 1.632 ± 0.433
4.599SerGly: 4.599 ± 0.608
0.89SerHis: 0.89 ± 0.342
2.596SerIle: 2.596 ± 0.443
4.228SerLys: 4.228 ± 0.572
4.451SerLeu: 4.451 ± 0.755
1.558SerMet: 1.558 ± 0.35
2.819SerAsn: 2.819 ± 0.394
2.3SerPro: 2.3 ± 0.411
3.19SerGln: 3.19 ± 0.583
2.596SerArg: 2.596 ± 0.39
3.338SerSer: 3.338 ± 0.35
5.119SerThr: 5.119 ± 1.124
4.228SerVal: 4.228 ± 0.835
0.593SerTrp: 0.593 ± 0.206
1.929SerTyr: 1.929 ± 0.451
0.0SerXaa: 0.0 ± 0.0
Thr
5.193ThrAla: 5.193 ± 0.861
0.519ThrCys: 0.519 ± 0.242
4.303ThrAsp: 4.303 ± 0.531
4.451ThrGlu: 4.451 ± 0.504
1.335ThrPhe: 1.335 ± 0.278
4.822ThrGly: 4.822 ± 0.832
1.039ThrHis: 1.039 ± 0.238
4.154ThrIle: 4.154 ± 0.678
3.561ThrLys: 3.561 ± 0.483
4.599ThrLeu: 4.599 ± 0.769
1.113ThrMet: 1.113 ± 0.378
2.819ThrAsn: 2.819 ± 0.471
4.228ThrPro: 4.228 ± 0.555
2.967ThrGln: 2.967 ± 0.41
3.264ThrArg: 3.264 ± 0.51
3.932ThrSer: 3.932 ± 0.585
4.896ThrThr: 4.896 ± 0.887
5.564ThrVal: 5.564 ± 0.932
0.668ThrTrp: 0.668 ± 0.227
2.151ThrTyr: 2.151 ± 0.361
0.0ThrXaa: 0.0 ± 0.0
Val
4.896ValAla: 4.896 ± 0.693
0.816ValCys: 0.816 ± 0.215
3.709ValAsp: 3.709 ± 0.581
4.08ValGlu: 4.08 ± 0.591
3.19ValPhe: 3.19 ± 0.319
5.935ValGly: 5.935 ± 0.647
1.78ValHis: 1.78 ± 0.326
3.264ValIle: 3.264 ± 0.584
4.599ValLys: 4.599 ± 0.642
6.454ValLeu: 6.454 ± 0.92
2.003ValMet: 2.003 ± 0.402
2.819ValAsn: 2.819 ± 0.505
3.783ValPro: 3.783 ± 0.518
5.341ValGln: 5.341 ± 0.912
3.412ValArg: 3.412 ± 0.522
3.932ValSer: 3.932 ± 0.824
3.19ValThr: 3.19 ± 0.586
5.341ValVal: 5.341 ± 0.754
0.89ValTrp: 0.89 ± 0.227
3.561ValTyr: 3.561 ± 0.453
0.0ValXaa: 0.0 ± 0.0
Trp
0.593TrpAla: 0.593 ± 0.221
0.297TrpCys: 0.297 ± 0.148
0.964TrpAsp: 0.964 ± 0.323
1.261TrpGlu: 1.261 ± 0.313
0.445TrpPhe: 0.445 ± 0.174
0.964TrpGly: 0.964 ± 0.306
0.074TrpHis: 0.074 ± 0.071
0.593TrpIle: 0.593 ± 0.207
0.964TrpLys: 0.964 ± 0.206
1.409TrpLeu: 1.409 ± 0.425
0.0TrpMet: 0.0 ± 0.0
1.187TrpAsn: 1.187 ± 0.234
0.0TrpPro: 0.0 ± 0.0
0.445TrpGln: 0.445 ± 0.162
0.371TrpArg: 0.371 ± 0.172
0.89TrpSer: 0.89 ± 0.235
0.964TrpThr: 0.964 ± 0.275
0.816TrpVal: 0.816 ± 0.352
0.297TrpTrp: 0.297 ± 0.152
0.816TrpTyr: 0.816 ± 0.221
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.596TyrAla: 2.596 ± 0.495
0.593TyrCys: 0.593 ± 0.296
3.709TyrAsp: 3.709 ± 0.385
2.671TyrGlu: 2.671 ± 0.386
2.077TyrPhe: 2.077 ± 0.312
3.264TyrGly: 3.264 ± 0.559
0.816TyrHis: 0.816 ± 0.235
2.226TyrIle: 2.226 ± 0.391
2.374TyrLys: 2.374 ± 0.71
3.932TyrLeu: 3.932 ± 0.503
1.113TyrMet: 1.113 ± 0.308
2.819TyrAsn: 2.819 ± 0.489
1.335TyrPro: 1.335 ± 0.333
2.374TyrGln: 2.374 ± 0.472
2.745TyrArg: 2.745 ± 0.481
2.226TyrSer: 2.226 ± 0.275
3.264TyrThr: 3.264 ± 0.594
2.819TyrVal: 2.819 ± 0.562
0.223TyrTrp: 0.223 ± 0.152
2.374TyrTyr: 2.374 ± 0.51
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 46 proteins (13481 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
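A protein-level bootstrap of this kind can be sketched as below. The page does not describe the exact procedure, so this is one plausible reading: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the standard deviation across replicates is reported as the error. All function names and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide in per mille, counting overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: each replicate draws len(proteins) proteins with replacement.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


# Hypothetical toy sequences, not taken from the phage proteome.
toy_proteins = ["MKKLAAGL", "AAGLKK", "MAAKL"]
aa_value = dipeptide_per_mille(toy_proteins, "AA")
aa_error = bootstrap_error(toy_proteins, "AA")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects variation between proteins.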

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski