Amino acid dipepetide frequency for Acinetobacter phage YMC-13-01-C62

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.009AlaAla: 5.009 ± 0.851
0.653AlaCys: 0.653 ± 0.218
3.92AlaAsp: 3.92 ± 0.525
4.065AlaGlu: 4.065 ± 0.459
2.541AlaPhe: 2.541 ± 0.411
3.993AlaGly: 3.993 ± 0.508
0.871AlaHis: 0.871 ± 0.216
5.445AlaIle: 5.445 ± 0.645
5.154AlaLys: 5.154 ± 0.794
6.606AlaLeu: 6.606 ± 0.687
1.96AlaMet: 1.96 ± 0.508
3.993AlaAsn: 3.993 ± 0.627
2.468AlaPro: 2.468 ± 0.4
3.267AlaGln: 3.267 ± 0.522
2.904AlaArg: 2.904 ± 0.466
3.702AlaSer: 3.702 ± 0.557
4.719AlaThr: 4.719 ± 0.715
4.138AlaVal: 4.138 ± 0.55
0.799AlaTrp: 0.799 ± 0.261
3.267AlaTyr: 3.267 ± 0.496
0.0AlaXaa: 0.0 ± 0.0
Cys
0.799CysAla: 0.799 ± 0.242
0.363CysCys: 0.363 ± 0.136
0.944CysAsp: 0.944 ± 0.27
1.089CysGlu: 1.089 ± 0.309
0.581CysPhe: 0.581 ± 0.224
1.089CysGly: 1.089 ± 0.278
0.073CysHis: 0.073 ± 0.066
0.508CysIle: 0.508 ± 0.206
1.307CysLys: 1.307 ± 0.32
0.871CysLeu: 0.871 ± 0.279
0.436CysMet: 0.436 ± 0.169
0.508CysAsn: 0.508 ± 0.208
0.508CysPro: 0.508 ± 0.19
0.218CysGln: 0.218 ± 0.112
0.726CysArg: 0.726 ± 0.279
0.799CysSer: 0.799 ± 0.27
0.653CysThr: 0.653 ± 0.258
1.307CysVal: 1.307 ± 0.37
0.363CysTrp: 0.363 ± 0.157
0.581CysTyr: 0.581 ± 0.219
0.0CysXaa: 0.0 ± 0.0
Asp
4.574AspAla: 4.574 ± 0.537
0.726AspCys: 0.726 ± 0.223
4.428AspAsp: 4.428 ± 0.799
4.646AspGlu: 4.646 ± 0.634
2.831AspPhe: 2.831 ± 0.42
4.936AspGly: 4.936 ± 0.58
1.089AspHis: 1.089 ± 0.293
4.574AspIle: 4.574 ± 0.687
4.719AspLys: 4.719 ± 0.626
5.009AspLeu: 5.009 ± 0.556
1.815AspMet: 1.815 ± 0.363
2.396AspAsn: 2.396 ± 0.424
1.597AspPro: 1.597 ± 0.319
2.904AspGln: 2.904 ± 0.429
2.904AspArg: 2.904 ± 0.479
3.412AspSer: 3.412 ± 0.555
2.759AspThr: 2.759 ± 0.35
4.283AspVal: 4.283 ± 0.603
1.597AspTrp: 1.597 ± 0.344
2.468AspTyr: 2.468 ± 0.437
0.0AspXaa: 0.0 ± 0.0
Glu
5.445GluAla: 5.445 ± 0.736
0.653GluCys: 0.653 ± 0.259
3.485GluAsp: 3.485 ± 0.628
4.428GluGlu: 4.428 ± 0.531
3.775GluPhe: 3.775 ± 0.716
4.211GluGly: 4.211 ± 0.473
1.089GluHis: 1.089 ± 0.308
5.009GluIle: 5.009 ± 0.613
4.719GluLys: 4.719 ± 0.701
5.88GluLeu: 5.88 ± 0.82
2.396GluMet: 2.396 ± 0.396
3.775GluAsn: 3.775 ± 0.412
1.162GluPro: 1.162 ± 0.309
3.049GluGln: 3.049 ± 0.492
1.815GluArg: 1.815 ± 0.324
5.372GluSer: 5.372 ± 0.496
1.96GluThr: 1.96 ± 0.382
4.428GluVal: 4.428 ± 0.557
1.234GluTrp: 1.234 ± 0.296
3.702GluTyr: 3.702 ± 0.557
0.0GluXaa: 0.0 ± 0.0
Phe
3.412PheAla: 3.412 ± 0.569
1.307PheCys: 1.307 ± 0.32
3.194PheAsp: 3.194 ± 0.446
2.686PheGlu: 2.686 ± 0.464
1.162PhePhe: 1.162 ± 0.287
3.63PheGly: 3.63 ± 0.489
0.726PheHis: 0.726 ± 0.209
3.122PheIle: 3.122 ± 0.414
3.63PheLys: 3.63 ± 0.539
2.759PheLeu: 2.759 ± 0.416
1.67PheMet: 1.67 ± 0.36
2.686PheAsn: 2.686 ± 0.42
1.307PhePro: 1.307 ± 0.277
1.089PheGln: 1.089 ± 0.35
1.597PheArg: 1.597 ± 0.315
2.323PheSer: 2.323 ± 0.464
2.323PheThr: 2.323 ± 0.396
2.759PheVal: 2.759 ± 0.43
0.581PheTrp: 0.581 ± 0.181
2.468PheTyr: 2.468 ± 0.445
0.0PheXaa: 0.0 ± 0.0
Gly
4.791GlyAla: 4.791 ± 0.569
1.452GlyCys: 1.452 ± 0.352
3.122GlyAsp: 3.122 ± 0.617
4.211GlyGlu: 4.211 ± 0.425
4.065GlyPhe: 4.065 ± 0.566
4.283GlyGly: 4.283 ± 0.568
0.799GlyHis: 0.799 ± 0.208
4.864GlyIle: 4.864 ± 0.584
4.138GlyLys: 4.138 ± 0.524
6.025GlyLeu: 6.025 ± 0.526
2.25GlyMet: 2.25 ± 0.478
3.848GlyAsn: 3.848 ± 0.466
0.508GlyPro: 0.508 ± 0.279
2.468GlyGln: 2.468 ± 0.419
2.541GlyArg: 2.541 ± 0.394
4.356GlySer: 4.356 ± 0.593
3.049GlyThr: 3.049 ± 0.532
5.808GlyVal: 5.808 ± 0.654
1.089GlyTrp: 1.089 ± 0.27
3.267GlyTyr: 3.267 ± 0.452
0.0GlyXaa: 0.0 ± 0.0
His
1.234HisAla: 1.234 ± 0.291
0.145HisCys: 0.145 ± 0.098
1.234HisAsp: 1.234 ± 0.289
1.452HisGlu: 1.452 ± 0.328
0.363HisPhe: 0.363 ± 0.152
1.089HisGly: 1.089 ± 0.313
0.363HisHis: 0.363 ± 0.165
1.742HisIle: 1.742 ± 0.37
1.452HisLys: 1.452 ± 0.292
1.162HisLeu: 1.162 ± 0.288
0.508HisMet: 0.508 ± 0.195
0.944HisAsn: 0.944 ± 0.297
0.726HisPro: 0.726 ± 0.239
0.653HisGln: 0.653 ± 0.221
0.218HisArg: 0.218 ± 0.116
0.799HisSer: 0.799 ± 0.229
0.581HisThr: 0.581 ± 0.187
0.871HisVal: 0.871 ± 0.313
0.073HisTrp: 0.073 ± 0.061
1.016HisTyr: 1.016 ± 0.304
0.0HisXaa: 0.0 ± 0.0
Ile
4.791IleAla: 4.791 ± 0.658
0.944IleCys: 0.944 ± 0.29
5.372IleAsp: 5.372 ± 0.56
6.098IleGlu: 6.098 ± 0.774
1.96IlePhe: 1.96 ± 0.352
4.574IleGly: 4.574 ± 0.557
1.67IleHis: 1.67 ± 0.383
4.138IleIle: 4.138 ± 0.503
7.623IleLys: 7.623 ± 0.738
4.936IleLeu: 4.936 ± 0.551
1.234IleMet: 1.234 ± 0.372
4.138IleAsn: 4.138 ± 0.534
3.92IlePro: 3.92 ± 0.523
1.96IleGln: 1.96 ± 0.4
2.613IleArg: 2.613 ± 0.412
4.574IleSer: 4.574 ± 0.689
3.92IleThr: 3.92 ± 0.658
4.646IleVal: 4.646 ± 0.569
0.508IleTrp: 0.508 ± 0.193
2.686IleTyr: 2.686 ± 0.542
0.0IleXaa: 0.0 ± 0.0
Lys
6.098LysAla: 6.098 ± 0.829
0.726LysCys: 0.726 ± 0.283
4.791LysAsp: 4.791 ± 0.816
5.88LysGlu: 5.88 ± 0.746
2.904LysPhe: 2.904 ± 0.478
5.154LysGly: 5.154 ± 0.547
1.089LysHis: 1.089 ± 0.243
6.171LysIle: 6.171 ± 0.774
5.953LysLys: 5.953 ± 0.927
5.808LysLeu: 5.808 ± 0.637
2.686LysMet: 2.686 ± 0.549
4.864LysAsn: 4.864 ± 0.589
2.468LysPro: 2.468 ± 0.451
2.613LysGln: 2.613 ± 0.366
3.485LysArg: 3.485 ± 0.48
4.428LysSer: 4.428 ± 0.564
3.92LysThr: 3.92 ± 0.576
5.154LysVal: 5.154 ± 0.68
1.162LysTrp: 1.162 ± 0.252
2.831LysTyr: 2.831 ± 0.479
0.0LysXaa: 0.0 ± 0.0
Leu
6.171LeuAla: 6.171 ± 0.674
0.726LeuCys: 0.726 ± 0.234
5.227LeuAsp: 5.227 ± 0.678
6.461LeuGlu: 6.461 ± 0.598
3.339LeuPhe: 3.339 ± 0.502
4.719LeuGly: 4.719 ± 0.602
1.089LeuHis: 1.089 ± 0.328
5.517LeuIle: 5.517 ± 0.573
6.461LeuLys: 6.461 ± 0.701
5.808LeuLeu: 5.808 ± 0.67
2.178LeuMet: 2.178 ± 0.403
6.025LeuAsn: 6.025 ± 0.729
1.597LeuPro: 1.597 ± 0.322
2.033LeuGln: 2.033 ± 0.392
3.63LeuArg: 3.63 ± 0.507
5.445LeuSer: 5.445 ± 0.555
4.791LeuThr: 4.791 ± 0.448
4.501LeuVal: 4.501 ± 0.529
0.799LeuTrp: 0.799 ± 0.221
1.887LeuTyr: 1.887 ± 0.352
0.0LeuXaa: 0.0 ± 0.0
Met
1.525MetAla: 1.525 ± 0.394
0.508MetCys: 0.508 ± 0.19
1.234MetAsp: 1.234 ± 0.365
1.742MetGlu: 1.742 ± 0.37
1.452MetPhe: 1.452 ± 0.342
2.033MetGly: 2.033 ± 0.41
0.508MetHis: 0.508 ± 0.205
1.67MetIle: 1.67 ± 0.338
2.105MetLys: 2.105 ± 0.416
2.323MetLeu: 2.323 ± 0.39
0.508MetMet: 0.508 ± 0.186
2.323MetAsn: 2.323 ± 0.412
1.016MetPro: 1.016 ± 0.255
1.67MetGln: 1.67 ± 0.298
1.162MetArg: 1.162 ± 0.301
3.049MetSer: 3.049 ± 0.517
2.033MetThr: 2.033 ± 0.342
1.379MetVal: 1.379 ± 0.286
0.363MetTrp: 0.363 ± 0.186
0.653MetTyr: 0.653 ± 0.222
0.0MetXaa: 0.0 ± 0.0
Asn
3.702AsnAla: 3.702 ± 0.53
0.508AsnCys: 0.508 ± 0.198
4.211AsnAsp: 4.211 ± 0.481
3.92AsnGlu: 3.92 ± 0.613
1.887AsnPhe: 1.887 ± 0.415
5.88AsnGly: 5.88 ± 0.866
1.525AsnHis: 1.525 ± 0.359
4.065AsnIle: 4.065 ± 0.616
3.485AsnLys: 3.485 ± 0.529
4.646AsnLeu: 4.646 ± 0.574
1.597AsnMet: 1.597 ± 0.314
3.267AsnAsn: 3.267 ± 0.56
2.759AsnPro: 2.759 ± 0.465
2.25AsnGln: 2.25 ± 0.396
1.96AsnArg: 1.96 ± 0.442
3.557AsnSer: 3.557 ± 0.529
3.267AsnThr: 3.267 ± 0.482
3.194AsnVal: 3.194 ± 0.52
0.436AsnTrp: 0.436 ± 0.217
2.613AsnTyr: 2.613 ± 0.466
0.0AsnXaa: 0.0 ± 0.0
Pro
1.379ProAla: 1.379 ± 0.303
0.218ProCys: 0.218 ± 0.132
2.178ProAsp: 2.178 ± 0.348
2.759ProGlu: 2.759 ± 0.472
1.67ProPhe: 1.67 ± 0.401
0.073ProGly: 0.073 ± 0.087
0.508ProHis: 0.508 ± 0.194
2.468ProIle: 2.468 ± 0.381
2.904ProLys: 2.904 ± 0.511
2.323ProLeu: 2.323 ± 0.451
1.016ProMet: 1.016 ± 0.314
2.396ProAsn: 2.396 ± 0.475
0.653ProPro: 0.653 ± 0.271
1.379ProGln: 1.379 ± 0.293
0.871ProArg: 0.871 ± 0.274
2.468ProSer: 2.468 ± 0.435
1.96ProThr: 1.96 ± 0.338
1.815ProVal: 1.815 ± 0.329
0.145ProTrp: 0.145 ± 0.104
1.67ProTyr: 1.67 ± 0.321
0.0ProXaa: 0.0 ± 0.0
Gln
3.122GlnAla: 3.122 ± 0.46
0.218GlnCys: 0.218 ± 0.122
2.25GlnAsp: 2.25 ± 0.421
2.613GlnGlu: 2.613 ± 0.427
2.178GlnPhe: 2.178 ± 0.43
2.105GlnGly: 2.105 ± 0.455
0.799GlnHis: 0.799 ± 0.255
2.105GlnIle: 2.105 ± 0.382
3.122GlnLys: 3.122 ± 0.526
2.976GlnLeu: 2.976 ± 0.552
1.016GlnMet: 1.016 ± 0.296
1.815GlnAsn: 1.815 ± 0.395
0.799GlnPro: 0.799 ± 0.273
1.742GlnGln: 1.742 ± 0.354
1.742GlnArg: 1.742 ± 0.383
2.178GlnSer: 2.178 ± 0.393
1.96GlnThr: 1.96 ± 0.36
2.25GlnVal: 2.25 ± 0.417
0.726GlnTrp: 0.726 ± 0.211
1.597GlnTyr: 1.597 ± 0.413
0.0GlnXaa: 0.0 ± 0.0
Arg
2.178ArgAla: 2.178 ± 0.46
0.871ArgCys: 0.871 ± 0.27
2.686ArgAsp: 2.686 ± 0.455
2.686ArgGlu: 2.686 ± 0.52
1.815ArgPhe: 1.815 ± 0.378
2.033ArgGly: 2.033 ± 0.363
0.944ArgHis: 0.944 ± 0.288
2.831ArgIle: 2.831 ± 0.47
4.065ArgLys: 4.065 ± 0.59
2.831ArgLeu: 2.831 ± 0.489
0.799ArgMet: 0.799 ± 0.233
1.597ArgAsn: 1.597 ± 0.316
1.525ArgPro: 1.525 ± 0.282
1.379ArgGln: 1.379 ± 0.27
1.597ArgArg: 1.597 ± 0.343
2.904ArgSer: 2.904 ± 0.577
1.887ArgThr: 1.887 ± 0.372
2.468ArgVal: 2.468 ± 0.394
0.508ArgTrp: 0.508 ± 0.198
1.525ArgTyr: 1.525 ± 0.443
0.0ArgXaa: 0.0 ± 0.0
Ser
3.702SerAla: 3.702 ± 0.519
0.871SerCys: 0.871 ± 0.261
3.775SerAsp: 3.775 ± 0.518
3.485SerGlu: 3.485 ± 0.537
3.122SerPhe: 3.122 ± 0.442
5.517SerGly: 5.517 ± 0.614
1.089SerHis: 1.089 ± 0.288
6.025SerIle: 6.025 ± 0.669
4.936SerLys: 4.936 ± 0.741
5.59SerLeu: 5.59 ± 0.581
2.396SerMet: 2.396 ± 0.48
3.702SerAsn: 3.702 ± 0.611
1.597SerPro: 1.597 ± 0.33
2.25SerGln: 2.25 ± 0.365
2.25SerArg: 2.25 ± 0.433
3.122SerSer: 3.122 ± 0.569
2.976SerThr: 2.976 ± 0.466
3.702SerVal: 3.702 ± 0.595
0.944SerTrp: 0.944 ± 0.259
1.887SerTyr: 1.887 ± 0.409
0.0SerXaa: 0.0 ± 0.0
Thr
3.702ThrAla: 3.702 ± 0.622
1.089ThrCys: 1.089 ± 0.244
3.412ThrAsp: 3.412 ± 0.505
2.105ThrGlu: 2.105 ± 0.331
2.105ThrPhe: 2.105 ± 0.406
3.92ThrGly: 3.92 ± 0.594
0.508ThrHis: 0.508 ± 0.187
3.557ThrIle: 3.557 ± 0.546
3.339ThrLys: 3.339 ± 0.503
4.719ThrLeu: 4.719 ± 0.606
1.089ThrMet: 1.089 ± 0.19
2.323ThrAsn: 2.323 ± 0.453
2.468ThrPro: 2.468 ± 0.473
1.742ThrGln: 1.742 ± 0.401
1.742ThrArg: 1.742 ± 0.307
2.396ThrSer: 2.396 ± 0.386
3.557ThrThr: 3.557 ± 0.632
4.646ThrVal: 4.646 ± 0.586
1.307ThrTrp: 1.307 ± 0.377
1.742ThrTyr: 1.742 ± 0.32
0.0ThrXaa: 0.0 ± 0.0
Val
4.065ValAla: 4.065 ± 0.575
0.944ValCys: 0.944 ± 0.322
4.283ValAsp: 4.283 ± 0.517
4.283ValGlu: 4.283 ± 0.66
3.267ValPhe: 3.267 ± 0.492
4.428ValGly: 4.428 ± 0.539
0.799ValHis: 0.799 ± 0.244
4.864ValIle: 4.864 ± 0.645
5.445ValLys: 5.445 ± 0.504
4.501ValLeu: 4.501 ± 0.699
1.815ValMet: 1.815 ± 0.319
4.719ValAsn: 4.719 ± 0.615
1.815ValPro: 1.815 ± 0.267
2.25ValGln: 2.25 ± 0.501
2.323ValArg: 2.323 ± 0.447
3.92ValSer: 3.92 ± 0.479
2.613ValThr: 2.613 ± 0.444
4.283ValVal: 4.283 ± 0.587
0.871ValTrp: 0.871 ± 0.248
2.759ValTyr: 2.759 ± 0.465
0.0ValXaa: 0.0 ± 0.0
Trp
0.944TrpAla: 0.944 ± 0.355
0.145TrpCys: 0.145 ± 0.079
0.871TrpAsp: 0.871 ± 0.258
0.799TrpGlu: 0.799 ± 0.211
1.234TrpPhe: 1.234 ± 0.321
0.581TrpGly: 0.581 ± 0.174
0.29TrpHis: 0.29 ± 0.162
0.944TrpIle: 0.944 ± 0.27
1.016TrpLys: 1.016 ± 0.266
1.016TrpLeu: 1.016 ± 0.273
0.508TrpMet: 0.508 ± 0.194
1.016TrpAsn: 1.016 ± 0.239
0.073TrpPro: 0.073 ± 0.065
0.653TrpGln: 0.653 ± 0.19
0.944TrpArg: 0.944 ± 0.299
0.871TrpSer: 0.871 ± 0.249
0.871TrpThr: 0.871 ± 0.244
1.016TrpVal: 1.016 ± 0.264
0.218TrpTrp: 0.218 ± 0.113
0.29TrpTyr: 0.29 ± 0.167
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.831TyrAla: 2.831 ± 0.432
0.653TyrCys: 0.653 ± 0.2
3.049TyrAsp: 3.049 ± 0.537
2.25TyrGlu: 2.25 ± 0.474
2.178TyrPhe: 2.178 ± 0.44
2.686TyrGly: 2.686 ± 0.59
0.799TyrHis: 0.799 ± 0.229
2.686TyrIle: 2.686 ± 0.454
2.613TyrLys: 2.613 ± 0.419
2.613TyrLeu: 2.613 ± 0.426
1.307TyrMet: 1.307 ± 0.302
2.396TyrAsn: 2.396 ± 0.458
1.887TyrPro: 1.887 ± 0.397
1.742TyrGln: 1.742 ± 0.357
2.105TyrArg: 2.105 ± 0.413
3.267TyrSer: 3.267 ± 0.433
1.67TyrThr: 1.67 ± 0.344
1.597TyrVal: 1.597 ± 0.321
0.581TyrTrp: 0.581 ± 0.278
1.525TyrTyr: 1.525 ± 0.313
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 84 proteins (13776 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski