Amino acid dipepetide frequency for Acinetobacter phage IME-AB2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.11AlaAla: 5.11 ± 0.936
0.592AlaCys: 0.592 ± 0.227
3.703AlaAsp: 3.703 ± 0.494
3.851AlaGlu: 3.851 ± 0.48
2.888AlaPhe: 2.888 ± 0.421
4.073AlaGly: 4.073 ± 0.526
1.111AlaHis: 1.111 ± 0.321
6.295AlaIle: 6.295 ± 0.582
5.925AlaLys: 5.925 ± 0.808
6.813AlaLeu: 6.813 ± 0.625
2.296AlaMet: 2.296 ± 0.551
4.295AlaAsn: 4.295 ± 0.581
2.222AlaPro: 2.222 ± 0.409
3.333AlaGln: 3.333 ± 0.665
2.592AlaArg: 2.592 ± 0.383
4.74AlaSer: 4.74 ± 0.691
4.369AlaThr: 4.369 ± 0.721
3.777AlaVal: 3.777 ± 0.625
0.889AlaTrp: 0.889 ± 0.223
3.407AlaTyr: 3.407 ± 0.401
0.0AlaXaa: 0.0 ± 0.0
Cys
0.815CysAla: 0.815 ± 0.244
0.222CysCys: 0.222 ± 0.14
0.889CysAsp: 0.889 ± 0.258
1.037CysGlu: 1.037 ± 0.293
0.518CysPhe: 0.518 ± 0.175
0.741CysGly: 0.741 ± 0.229
0.222CysHis: 0.222 ± 0.121
0.444CysIle: 0.444 ± 0.17
1.185CysLys: 1.185 ± 0.284
0.667CysLeu: 0.667 ± 0.255
0.37CysMet: 0.37 ± 0.165
0.222CysAsn: 0.222 ± 0.115
0.444CysPro: 0.444 ± 0.186
0.148CysGln: 0.148 ± 0.098
0.741CysArg: 0.741 ± 0.279
0.592CysSer: 0.592 ± 0.243
0.444CysThr: 0.444 ± 0.177
1.407CysVal: 1.407 ± 0.326
0.444CysTrp: 0.444 ± 0.169
0.741CysTyr: 0.741 ± 0.255
0.0CysXaa: 0.0 ± 0.0
Asp
4.518AspAla: 4.518 ± 0.525
0.667AspCys: 0.667 ± 0.215
4.221AspAsp: 4.221 ± 0.617
4.443AspGlu: 4.443 ± 0.654
2.814AspPhe: 2.814 ± 0.421
4.962AspGly: 4.962 ± 0.647
0.518AspHis: 0.518 ± 0.202
4.221AspIle: 4.221 ± 0.638
5.036AspLys: 5.036 ± 0.704
4.888AspLeu: 4.888 ± 0.632
1.481AspMet: 1.481 ± 0.316
2.0AspAsn: 2.0 ± 0.351
1.703AspPro: 1.703 ± 0.36
2.592AspGln: 2.592 ± 0.377
2.592AspArg: 2.592 ± 0.34
3.259AspSer: 3.259 ± 0.506
2.888AspThr: 2.888 ± 0.399
3.703AspVal: 3.703 ± 0.514
1.185AspTrp: 1.185 ± 0.336
2.148AspTyr: 2.148 ± 0.387
0.0AspXaa: 0.0 ± 0.0
Glu
5.258GluAla: 5.258 ± 0.747
0.592GluCys: 0.592 ± 0.234
3.481GluAsp: 3.481 ± 0.582
4.443GluGlu: 4.443 ± 0.601
4.073GluPhe: 4.073 ± 0.688
4.369GluGly: 4.369 ± 0.536
0.963GluHis: 0.963 ± 0.229
5.184GluIle: 5.184 ± 0.654
4.147GluLys: 4.147 ± 0.677
5.999GluLeu: 5.999 ± 0.96
2.074GluMet: 2.074 ± 0.386
3.703GluAsn: 3.703 ± 0.44
1.259GluPro: 1.259 ± 0.327
3.036GluGln: 3.036 ± 0.439
1.407GluArg: 1.407 ± 0.329
5.628GluSer: 5.628 ± 0.517
2.074GluThr: 2.074 ± 0.417
4.369GluVal: 4.369 ± 0.605
1.037GluTrp: 1.037 ± 0.272
3.703GluTyr: 3.703 ± 0.528
0.0GluXaa: 0.0 ± 0.0
Phe
3.851PheAla: 3.851 ± 0.605
1.185PheCys: 1.185 ± 0.268
3.407PheAsp: 3.407 ± 0.603
2.666PheGlu: 2.666 ± 0.463
1.407PhePhe: 1.407 ± 0.336
3.407PheGly: 3.407 ± 0.409
0.815PheHis: 0.815 ± 0.2
3.481PheIle: 3.481 ± 0.432
3.333PheLys: 3.333 ± 0.571
2.814PheLeu: 2.814 ± 0.409
1.629PheMet: 1.629 ± 0.381
2.444PheAsn: 2.444 ± 0.433
0.963PhePro: 0.963 ± 0.28
1.111PheGln: 1.111 ± 0.36
1.851PheArg: 1.851 ± 0.38
2.222PheSer: 2.222 ± 0.413
2.296PheThr: 2.296 ± 0.401
2.814PheVal: 2.814 ± 0.455
0.889PheTrp: 0.889 ± 0.292
2.962PheTyr: 2.962 ± 0.485
0.0PheXaa: 0.0 ± 0.0
Gly
5.11GlyAla: 5.11 ± 0.897
0.815GlyCys: 0.815 ± 0.238
2.888GlyAsp: 2.888 ± 0.502
3.999GlyGlu: 3.999 ± 0.445
4.592GlyPhe: 4.592 ± 0.532
4.814GlyGly: 4.814 ± 0.754
1.037GlyHis: 1.037 ± 0.256
4.443GlyIle: 4.443 ± 0.538
4.147GlyLys: 4.147 ± 0.524
6.295GlyLeu: 6.295 ± 0.513
1.851GlyMet: 1.851 ± 0.376
3.851GlyAsn: 3.851 ± 0.535
0.667GlyPro: 0.667 ± 0.29
2.518GlyGln: 2.518 ± 0.501
2.296GlyArg: 2.296 ± 0.4
4.369GlySer: 4.369 ± 0.531
3.333GlyThr: 3.333 ± 0.518
6.221GlyVal: 6.221 ± 0.638
1.185GlyTrp: 1.185 ± 0.259
2.814GlyTyr: 2.814 ± 0.368
0.0GlyXaa: 0.0 ± 0.0
His
1.333HisAla: 1.333 ± 0.314
0.074HisCys: 0.074 ± 0.085
1.037HisAsp: 1.037 ± 0.284
1.555HisGlu: 1.555 ± 0.319
0.296HisPhe: 0.296 ± 0.168
0.815HisGly: 0.815 ± 0.262
0.222HisHis: 0.222 ± 0.126
1.777HisIle: 1.777 ± 0.359
1.185HisLys: 1.185 ± 0.303
1.037HisLeu: 1.037 ± 0.255
0.444HisMet: 0.444 ± 0.163
0.815HisAsn: 0.815 ± 0.223
0.889HisPro: 0.889 ± 0.236
0.889HisGln: 0.889 ± 0.295
0.667HisArg: 0.667 ± 0.299
0.592HisSer: 0.592 ± 0.2
0.518HisThr: 0.518 ± 0.193
0.963HisVal: 0.963 ± 0.308
0.148HisTrp: 0.148 ± 0.098
1.037HisTyr: 1.037 ± 0.257
0.0HisXaa: 0.0 ± 0.0
Ile
4.592IleAla: 4.592 ± 0.611
1.037IleCys: 1.037 ± 0.289
4.666IleAsp: 4.666 ± 0.618
6.517IleGlu: 6.517 ± 0.819
2.222IlePhe: 2.222 ± 0.409
4.443IleGly: 4.443 ± 0.576
1.555IleHis: 1.555 ± 0.354
4.295IleIle: 4.295 ± 0.524
7.406IleLys: 7.406 ± 0.84
4.443IleLeu: 4.443 ± 0.531
1.703IleMet: 1.703 ± 0.376
3.999IleAsn: 3.999 ± 0.599
3.481IlePro: 3.481 ± 0.484
2.0IleGln: 2.0 ± 0.436
2.74IleArg: 2.74 ± 0.382
5.11IleSer: 5.11 ± 0.699
4.443IleThr: 4.443 ± 0.738
4.666IleVal: 4.666 ± 0.581
0.815IleTrp: 0.815 ± 0.244
2.814IleTyr: 2.814 ± 0.447
0.0IleXaa: 0.0 ± 0.0
Lys
6.295LysAla: 6.295 ± 0.806
0.963LysCys: 0.963 ± 0.283
3.999LysAsp: 3.999 ± 0.57
6.073LysGlu: 6.073 ± 0.92
3.11LysPhe: 3.11 ± 0.512
5.184LysGly: 5.184 ± 0.507
1.111LysHis: 1.111 ± 0.281
6.073LysIle: 6.073 ± 0.95
5.11LysLys: 5.11 ± 0.873
5.702LysLeu: 5.702 ± 0.634
2.518LysMet: 2.518 ± 0.577
4.443LysAsn: 4.443 ± 0.491
2.37LysPro: 2.37 ± 0.423
2.444LysGln: 2.444 ± 0.447
3.333LysArg: 3.333 ± 0.555
5.036LysSer: 5.036 ± 0.567
4.221LysThr: 4.221 ± 0.62
4.74LysVal: 4.74 ± 0.673
1.185LysTrp: 1.185 ± 0.292
2.296LysTyr: 2.296 ± 0.434
0.0LysXaa: 0.0 ± 0.0
Leu
6.591LeuAla: 6.591 ± 0.765
0.741LeuCys: 0.741 ± 0.234
5.258LeuAsp: 5.258 ± 0.693
5.776LeuGlu: 5.776 ± 0.639
3.11LeuPhe: 3.11 ± 0.473
4.814LeuGly: 4.814 ± 0.629
1.185LeuHis: 1.185 ± 0.39
5.702LeuIle: 5.702 ± 0.639
6.665LeuLys: 6.665 ± 0.758
5.925LeuLeu: 5.925 ± 0.626
2.296LeuMet: 2.296 ± 0.449
6.665LeuAsn: 6.665 ± 0.812
1.703LeuPro: 1.703 ± 0.411
2.222LeuGln: 2.222 ± 0.397
3.555LeuArg: 3.555 ± 0.475
5.406LeuSer: 5.406 ± 0.488
4.666LeuThr: 4.666 ± 0.538
4.518LeuVal: 4.518 ± 0.514
0.889LeuTrp: 0.889 ± 0.263
2.296LeuTyr: 2.296 ± 0.445
0.0LeuXaa: 0.0 ± 0.0
Met
1.777MetAla: 1.777 ± 0.376
0.518MetCys: 0.518 ± 0.189
1.185MetAsp: 1.185 ± 0.386
1.629MetGlu: 1.629 ± 0.408
1.407MetPhe: 1.407 ± 0.359
2.074MetGly: 2.074 ± 0.509
0.222MetHis: 0.222 ± 0.119
2.0MetIle: 2.0 ± 0.346
1.629MetLys: 1.629 ± 0.315
2.222MetLeu: 2.222 ± 0.405
0.444MetMet: 0.444 ± 0.186
2.592MetAsn: 2.592 ± 0.467
1.185MetPro: 1.185 ± 0.281
1.777MetGln: 1.777 ± 0.338
1.333MetArg: 1.333 ± 0.341
2.592MetSer: 2.592 ± 0.469
1.925MetThr: 1.925 ± 0.355
1.333MetVal: 1.333 ± 0.348
0.222MetTrp: 0.222 ± 0.128
0.667MetTyr: 0.667 ± 0.218
0.0MetXaa: 0.0 ± 0.0
Asn
4.073AsnAla: 4.073 ± 0.591
0.296AsnCys: 0.296 ± 0.123
3.851AsnAsp: 3.851 ± 0.501
3.925AsnGlu: 3.925 ± 0.491
2.148AsnPhe: 2.148 ± 0.466
4.962AsnGly: 4.962 ± 0.874
1.111AsnHis: 1.111 ± 0.27
3.629AsnIle: 3.629 ± 0.562
3.184AsnLys: 3.184 ± 0.491
5.036AsnLeu: 5.036 ± 0.639
1.629AsnMet: 1.629 ± 0.321
3.407AsnAsn: 3.407 ± 0.628
2.814AsnPro: 2.814 ± 0.477
2.444AsnGln: 2.444 ± 0.479
1.629AsnArg: 1.629 ± 0.302
3.629AsnSer: 3.629 ± 0.635
3.777AsnThr: 3.777 ± 0.479
3.333AsnVal: 3.333 ± 0.608
0.444AsnTrp: 0.444 ± 0.16
2.74AsnTyr: 2.74 ± 0.489
0.0AsnXaa: 0.0 ± 0.0
Pro
1.777ProAla: 1.777 ± 0.405
0.148ProCys: 0.148 ± 0.104
2.0ProAsp: 2.0 ± 0.344
2.666ProGlu: 2.666 ± 0.422
1.481ProPhe: 1.481 ± 0.345
0.0ProGly: 0.0 ± 0.0
0.667ProHis: 0.667 ± 0.232
2.296ProIle: 2.296 ± 0.427
2.518ProLys: 2.518 ± 0.403
2.592ProLeu: 2.592 ± 0.46
1.037ProMet: 1.037 ± 0.365
2.444ProAsn: 2.444 ± 0.509
0.741ProPro: 0.741 ± 0.272
1.333ProGln: 1.333 ± 0.31
1.037ProArg: 1.037 ± 0.238
2.518ProSer: 2.518 ± 0.398
1.555ProThr: 1.555 ± 0.282
2.148ProVal: 2.148 ± 0.351
0.148ProTrp: 0.148 ± 0.105
1.629ProTyr: 1.629 ± 0.363
0.0ProXaa: 0.0 ± 0.0
Gln
3.259GlnAla: 3.259 ± 0.429
0.148GlnCys: 0.148 ± 0.109
2.222GlnAsp: 2.222 ± 0.487
2.592GlnGlu: 2.592 ± 0.47
1.851GlnPhe: 1.851 ± 0.502
2.592GlnGly: 2.592 ± 0.573
0.815GlnHis: 0.815 ± 0.267
2.0GlnIle: 2.0 ± 0.39
3.11GlnLys: 3.11 ± 0.529
3.407GlnLeu: 3.407 ± 0.648
0.889GlnMet: 0.889 ± 0.281
1.851GlnAsn: 1.851 ± 0.455
0.889GlnPro: 0.889 ± 0.274
1.703GlnGln: 1.703 ± 0.387
1.555GlnArg: 1.555 ± 0.409
2.222GlnSer: 2.222 ± 0.393
2.0GlnThr: 2.0 ± 0.33
1.925GlnVal: 1.925 ± 0.431
0.815GlnTrp: 0.815 ± 0.244
1.851GlnTyr: 1.851 ± 0.462
0.0GlnXaa: 0.0 ± 0.0
Arg
2.37ArgAla: 2.37 ± 0.426
0.963ArgCys: 0.963 ± 0.292
2.592ArgAsp: 2.592 ± 0.446
2.444ArgGlu: 2.444 ± 0.387
2.0ArgPhe: 2.0 ± 0.398
2.074ArgGly: 2.074 ± 0.404
0.963ArgHis: 0.963 ± 0.234
2.962ArgIle: 2.962 ± 0.488
3.851ArgLys: 3.851 ± 0.549
3.259ArgLeu: 3.259 ± 0.487
0.518ArgMet: 0.518 ± 0.207
1.407ArgAsn: 1.407 ± 0.262
1.407ArgPro: 1.407 ± 0.322
1.259ArgGln: 1.259 ± 0.307
1.185ArgArg: 1.185 ± 0.296
2.518ArgSer: 2.518 ± 0.419
1.629ArgThr: 1.629 ± 0.413
2.592ArgVal: 2.592 ± 0.449
0.444ArgTrp: 0.444 ± 0.189
1.407ArgTyr: 1.407 ± 0.372
0.0ArgXaa: 0.0 ± 0.0
Ser
3.555SerAla: 3.555 ± 0.606
0.815SerCys: 0.815 ± 0.311
3.629SerAsp: 3.629 ± 0.539
3.629SerGlu: 3.629 ± 0.637
3.703SerPhe: 3.703 ± 0.533
5.036SerGly: 5.036 ± 0.629
1.111SerHis: 1.111 ± 0.325
6.295SerIle: 6.295 ± 0.611
5.406SerLys: 5.406 ± 0.67
5.628SerLeu: 5.628 ± 0.759
2.444SerMet: 2.444 ± 0.396
3.333SerAsn: 3.333 ± 0.667
2.0SerPro: 2.0 ± 0.338
2.296SerGln: 2.296 ± 0.399
2.296SerArg: 2.296 ± 0.469
3.999SerSer: 3.999 ± 0.75
3.777SerThr: 3.777 ± 0.66
3.925SerVal: 3.925 ± 0.558
0.963SerTrp: 0.963 ± 0.266
1.851SerTyr: 1.851 ± 0.348
0.0SerXaa: 0.0 ± 0.0
Thr
3.703ThrAla: 3.703 ± 0.624
0.815ThrCys: 0.815 ± 0.232
3.259ThrAsp: 3.259 ± 0.489
2.518ThrGlu: 2.518 ± 0.417
1.925ThrPhe: 1.925 ± 0.404
4.369ThrGly: 4.369 ± 0.495
0.592ThrHis: 0.592 ± 0.224
3.555ThrIle: 3.555 ± 0.608
3.629ThrLys: 3.629 ± 0.598
4.962ThrLeu: 4.962 ± 0.637
1.333ThrMet: 1.333 ± 0.291
2.444ThrAsn: 2.444 ± 0.488
2.148ThrPro: 2.148 ± 0.416
2.0ThrGln: 2.0 ± 0.429
2.0ThrArg: 2.0 ± 0.41
2.74ThrSer: 2.74 ± 0.554
3.407ThrThr: 3.407 ± 0.843
4.221ThrVal: 4.221 ± 0.59
1.185ThrTrp: 1.185 ± 0.343
1.777ThrTyr: 1.777 ± 0.32
0.0ThrXaa: 0.0 ± 0.0
Val
4.518ValAla: 4.518 ± 0.564
0.667ValCys: 0.667 ± 0.226
3.999ValAsp: 3.999 ± 0.416
3.999ValGlu: 3.999 ± 0.711
3.11ValPhe: 3.11 ± 0.464
4.518ValGly: 4.518 ± 0.605
1.037ValHis: 1.037 ± 0.261
4.962ValIle: 4.962 ± 0.657
5.11ValLys: 5.11 ± 0.476
4.147ValLeu: 4.147 ± 0.65
2.148ValMet: 2.148 ± 0.422
4.666ValAsn: 4.666 ± 0.643
1.851ValPro: 1.851 ± 0.337
2.222ValGln: 2.222 ± 0.591
2.074ValArg: 2.074 ± 0.382
3.851ValSer: 3.851 ± 0.49
3.036ValThr: 3.036 ± 0.492
4.592ValVal: 4.592 ± 0.58
0.889ValTrp: 0.889 ± 0.273
2.518ValTyr: 2.518 ± 0.455
0.0ValXaa: 0.0 ± 0.0
Trp
1.111TrpAla: 1.111 ± 0.293
0.222TrpCys: 0.222 ± 0.122
0.815TrpAsp: 0.815 ± 0.211
0.667TrpGlu: 0.667 ± 0.201
0.889TrpPhe: 0.889 ± 0.258
0.815TrpGly: 0.815 ± 0.217
0.296TrpHis: 0.296 ± 0.123
0.889TrpIle: 0.889 ± 0.213
0.741TrpLys: 0.741 ± 0.237
1.111TrpLeu: 1.111 ± 0.263
0.518TrpMet: 0.518 ± 0.186
0.889TrpAsn: 0.889 ± 0.22
0.074TrpPro: 0.074 ± 0.077
0.667TrpGln: 0.667 ± 0.203
1.111TrpArg: 1.111 ± 0.282
1.407TrpSer: 1.407 ± 0.316
0.592TrpThr: 0.592 ± 0.246
1.037TrpVal: 1.037 ± 0.3
0.296TrpTrp: 0.296 ± 0.133
0.518TrpTyr: 0.518 ± 0.197
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.666TyrAla: 2.666 ± 0.357
0.815TyrCys: 0.815 ± 0.258
2.74TyrAsp: 2.74 ± 0.468
2.37TyrGlu: 2.37 ± 0.447
2.0TyrPhe: 2.0 ± 0.471
3.184TyrGly: 3.184 ± 0.632
0.741TyrHis: 0.741 ± 0.189
2.444TyrIle: 2.444 ± 0.481
2.888TyrLys: 2.888 ± 0.469
3.036TyrLeu: 3.036 ± 0.482
1.037TyrMet: 1.037 ± 0.281
2.592TyrAsn: 2.592 ± 0.475
1.925TyrPro: 1.925 ± 0.426
1.703TyrGln: 1.703 ± 0.421
1.925TyrArg: 1.925 ± 0.34
3.259TyrSer: 3.259 ± 0.461
1.629TyrThr: 1.629 ± 0.378
1.629TyrVal: 1.629 ± 0.313
0.592TyrTrp: 0.592 ± 0.248
1.481TyrTyr: 1.481 ± 0.345
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 82 proteins (13504 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski