Amino acid dipepetide frequency for Pelagibacter phage HTVC120P

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.816AlaAla: 3.816 ± 0.51
0.611AlaCys: 0.611 ± 0.228
4.503AlaAsp: 4.503 ± 0.445
4.045AlaGlu: 4.045 ± 0.57
2.29AlaPhe: 2.29 ± 0.511
3.816AlaGly: 3.816 ± 1.071
0.992AlaHis: 0.992 ± 0.327
5.648AlaIle: 5.648 ± 0.578
5.19AlaLys: 5.19 ± 0.526
4.503AlaLeu: 4.503 ± 0.562
2.061AlaMet: 2.061 ± 0.419
5.19AlaAsn: 5.19 ± 1.083
2.366AlaPro: 2.366 ± 0.448
2.595AlaGln: 2.595 ± 0.437
2.366AlaArg: 2.366 ± 0.532
5.724AlaSer: 5.724 ± 0.726
4.885AlaThr: 4.885 ± 0.751
5.114AlaVal: 5.114 ± 0.975
1.069AlaTrp: 1.069 ± 0.244
3.053AlaTyr: 3.053 ± 0.479
0.0AlaXaa: 0.0 ± 0.0
Cys
0.305CysAla: 0.305 ± 0.155
0.153CysCys: 0.153 ± 0.09
0.382CysAsp: 0.382 ± 0.163
0.229CysGlu: 0.229 ± 0.135
0.305CysPhe: 0.305 ± 0.151
0.611CysGly: 0.611 ± 0.283
0.229CysHis: 0.229 ± 0.138
0.534CysIle: 0.534 ± 0.218
1.069CysLys: 1.069 ± 0.336
0.916CysLeu: 0.916 ± 0.388
0.229CysMet: 0.229 ± 0.172
0.305CysAsn: 0.305 ± 0.239
0.382CysPro: 0.382 ± 0.143
0.611CysGln: 0.611 ± 0.201
0.534CysArg: 0.534 ± 0.251
0.84CysSer: 0.84 ± 0.188
0.382CysThr: 0.382 ± 0.161
0.687CysVal: 0.687 ± 0.215
0.0CysTrp: 0.0 ± 0.0
0.153CysTyr: 0.153 ± 0.11
0.0CysXaa: 0.0 ± 0.0
Asp
4.045AspAla: 4.045 ± 0.659
0.534AspCys: 0.534 ± 0.232
3.816AspAsp: 3.816 ± 0.528
4.045AspGlu: 4.045 ± 0.69
3.74AspPhe: 3.74 ± 0.549
3.053AspGly: 3.053 ± 0.475
0.382AspHis: 0.382 ± 0.153
3.206AspIle: 3.206 ± 0.497
4.732AspLys: 4.732 ± 0.882
6.335AspLeu: 6.335 ± 0.772
1.374AspMet: 1.374 ± 0.375
3.282AspAsn: 3.282 ± 0.436
1.603AspPro: 1.603 ± 0.352
1.374AspGln: 1.374 ± 0.291
1.603AspArg: 1.603 ± 0.271
3.587AspSer: 3.587 ± 0.67
4.808AspThr: 4.808 ± 0.637
4.656AspVal: 4.656 ± 0.789
0.916AspTrp: 0.916 ± 0.317
1.832AspTyr: 1.832 ± 0.266
0.0AspXaa: 0.0 ± 0.0
Glu
3.893GluAla: 3.893 ± 0.722
0.84GluCys: 0.84 ± 0.353
3.893GluAsp: 3.893 ± 0.613
3.587GluGlu: 3.587 ± 0.556
2.366GluPhe: 2.366 ± 0.434
3.893GluGly: 3.893 ± 0.4
1.298GluHis: 1.298 ± 0.324
4.656GluIle: 4.656 ± 0.545
5.114GluLys: 5.114 ± 0.752
5.648GluLeu: 5.648 ± 0.598
1.603GluMet: 1.603 ± 0.382
3.969GluAsn: 3.969 ± 0.54
1.679GluPro: 1.679 ± 0.513
2.519GluGln: 2.519 ± 0.56
2.9GluArg: 2.9 ± 0.537
3.206GluSer: 3.206 ± 0.469
4.122GluThr: 4.122 ± 0.473
2.824GluVal: 2.824 ± 0.575
0.992GluTrp: 0.992 ± 0.341
1.755GluTyr: 1.755 ± 0.408
0.0GluXaa: 0.0 ± 0.0
Phe
2.519PheAla: 2.519 ± 0.476
0.534PheCys: 0.534 ± 0.281
2.824PheAsp: 2.824 ± 0.677
2.29PheGlu: 2.29 ± 0.451
1.755PhePhe: 1.755 ± 0.363
2.29PheGly: 2.29 ± 0.562
0.153PheHis: 0.153 ± 0.111
2.671PheIle: 2.671 ± 0.605
4.274PheLys: 4.274 ± 0.702
3.282PheLeu: 3.282 ± 0.542
1.069PheMet: 1.069 ± 0.338
3.129PheAsn: 3.129 ± 0.481
0.992PhePro: 0.992 ± 0.302
2.442PheGln: 2.442 ± 0.34
1.45PheArg: 1.45 ± 0.289
2.748PheSer: 2.748 ± 0.523
2.9PheThr: 2.9 ± 0.683
2.061PheVal: 2.061 ± 0.332
0.382PheTrp: 0.382 ± 0.14
1.221PheTyr: 1.221 ± 0.334
0.0PheXaa: 0.0 ± 0.0
Gly
3.969GlyAla: 3.969 ± 0.597
0.382GlyCys: 0.382 ± 0.159
4.122GlyAsp: 4.122 ± 0.666
3.816GlyGlu: 3.816 ± 0.587
2.9GlyPhe: 2.9 ± 0.469
4.274GlyGly: 4.274 ± 0.67
1.069GlyHis: 1.069 ± 0.291
5.114GlyIle: 5.114 ± 0.811
4.961GlyLys: 4.961 ± 0.642
5.877GlyLeu: 5.877 ± 0.521
1.45GlyMet: 1.45 ± 0.288
4.503GlyAsn: 4.503 ± 0.784
0.534GlyPro: 0.534 ± 0.208
2.061GlyGln: 2.061 ± 0.42
2.519GlyArg: 2.519 ± 0.622
5.724GlySer: 5.724 ± 1.342
5.801GlyThr: 5.801 ± 1.176
3.511GlyVal: 3.511 ± 0.387
0.534GlyTrp: 0.534 ± 0.203
1.908GlyTyr: 1.908 ± 0.421
0.0GlyXaa: 0.0 ± 0.0
His
1.221HisAla: 1.221 ± 0.273
0.382HisCys: 0.382 ± 0.155
0.611HisAsp: 0.611 ± 0.202
0.382HisGlu: 0.382 ± 0.162
0.611HisPhe: 0.611 ± 0.19
0.992HisGly: 0.992 ± 0.263
0.458HisHis: 0.458 ± 0.203
0.916HisIle: 0.916 ± 0.223
0.611HisLys: 0.611 ± 0.228
1.755HisLeu: 1.755 ± 0.369
0.763HisMet: 0.763 ± 0.214
0.763HisAsn: 0.763 ± 0.233
0.534HisPro: 0.534 ± 0.193
0.229HisGln: 0.229 ± 0.164
0.611HisArg: 0.611 ± 0.205
1.526HisSer: 1.526 ± 0.341
2.061HisThr: 2.061 ± 0.332
1.145HisVal: 1.145 ± 0.353
0.458HisTrp: 0.458 ± 0.216
0.992HisTyr: 0.992 ± 0.218
0.0HisXaa: 0.0 ± 0.0
Ile
6.182IleAla: 6.182 ± 0.559
0.84IleCys: 0.84 ± 0.242
4.961IleAsp: 4.961 ± 0.6
4.732IleGlu: 4.732 ± 0.707
2.137IlePhe: 2.137 ± 0.359
3.893IleGly: 3.893 ± 0.596
1.069IleHis: 1.069 ± 0.303
4.122IleIle: 4.122 ± 0.54
6.03IleLys: 6.03 ± 0.648
4.961IleLeu: 4.961 ± 0.593
1.374IleMet: 1.374 ± 0.299
4.885IleAsn: 4.885 ± 0.899
2.366IlePro: 2.366 ± 0.461
2.061IleGln: 2.061 ± 0.367
3.282IleArg: 3.282 ± 0.458
3.969IleSer: 3.969 ± 0.57
4.274IleThr: 4.274 ± 0.645
3.358IleVal: 3.358 ± 0.561
0.382IleTrp: 0.382 ± 0.163
1.298IleTyr: 1.298 ± 0.342
0.0IleXaa: 0.0 ± 0.0
Lys
5.037LysAla: 5.037 ± 0.611
0.458LysCys: 0.458 ± 0.196
4.732LysAsp: 4.732 ± 0.756
4.885LysGlu: 4.885 ± 0.664
3.664LysPhe: 3.664 ± 0.514
5.114LysGly: 5.114 ± 0.873
1.603LysHis: 1.603 ± 0.329
5.724LysIle: 5.724 ± 0.57
6.946LysLys: 6.946 ± 1.161
6.335LysLeu: 6.335 ± 0.751
2.824LysMet: 2.824 ± 0.544
5.037LysAsn: 5.037 ± 0.738
3.511LysPro: 3.511 ± 0.551
3.206LysGln: 3.206 ± 0.538
3.587LysArg: 3.587 ± 0.693
4.885LysSer: 4.885 ± 0.674
4.732LysThr: 4.732 ± 0.602
5.037LysVal: 5.037 ± 0.683
1.069LysTrp: 1.069 ± 0.361
2.748LysTyr: 2.748 ± 0.53
0.0LysXaa: 0.0 ± 0.0
Leu
6.106LeuAla: 6.106 ± 0.774
0.611LeuCys: 0.611 ± 0.184
4.885LeuAsp: 4.885 ± 0.418
5.419LeuGlu: 5.419 ± 0.603
2.29LeuPhe: 2.29 ± 0.483
4.35LeuGly: 4.35 ± 0.58
1.45LeuHis: 1.45 ± 0.358
5.037LeuIle: 5.037 ± 0.578
7.785LeuLys: 7.785 ± 0.79
6.106LeuLeu: 6.106 ± 0.823
1.679LeuMet: 1.679 ± 0.299
5.953LeuAsn: 5.953 ± 0.597
3.129LeuPro: 3.129 ± 0.422
3.816LeuGln: 3.816 ± 0.413
3.664LeuArg: 3.664 ± 0.562
5.801LeuSer: 5.801 ± 0.713
5.419LeuThr: 5.419 ± 0.697
4.122LeuVal: 4.122 ± 0.633
0.763LeuTrp: 0.763 ± 0.231
2.977LeuTyr: 2.977 ± 0.51
0.0LeuXaa: 0.0 ± 0.0
Met
2.29MetAla: 2.29 ± 0.474
0.534MetCys: 0.534 ± 0.208
1.374MetAsp: 1.374 ± 0.329
0.992MetGlu: 0.992 ± 0.3
1.069MetPhe: 1.069 ± 0.343
1.755MetGly: 1.755 ± 0.404
0.305MetHis: 0.305 ± 0.144
1.374MetIle: 1.374 ± 0.36
1.832MetLys: 1.832 ± 0.411
1.832MetLeu: 1.832 ± 0.414
0.611MetMet: 0.611 ± 0.202
1.603MetAsn: 1.603 ± 0.39
0.84MetPro: 0.84 ± 0.229
0.916MetGln: 0.916 ± 0.36
1.603MetArg: 1.603 ± 0.397
2.213MetSer: 2.213 ± 0.377
1.145MetThr: 1.145 ± 0.327
0.992MetVal: 0.992 ± 0.217
0.305MetTrp: 0.305 ± 0.139
0.687MetTyr: 0.687 ± 0.238
0.0MetXaa: 0.0 ± 0.0
Asn
5.037AsnAla: 5.037 ± 0.99
0.534AsnCys: 0.534 ± 0.228
3.435AsnAsp: 3.435 ± 0.407
3.816AsnGlu: 3.816 ± 0.601
2.748AsnPhe: 2.748 ± 0.513
4.427AsnGly: 4.427 ± 0.616
0.916AsnHis: 0.916 ± 0.317
5.495AsnIle: 5.495 ± 1.042
5.572AsnLys: 5.572 ± 0.861
5.419AsnLeu: 5.419 ± 0.534
1.374AsnMet: 1.374 ± 0.304
4.656AsnAsn: 4.656 ± 0.808
2.366AsnPro: 2.366 ± 0.466
2.137AsnGln: 2.137 ± 0.377
2.595AsnArg: 2.595 ± 0.367
5.572AsnSer: 5.572 ± 1.199
4.122AsnThr: 4.122 ± 0.848
4.274AsnVal: 4.274 ± 0.85
0.382AsnTrp: 0.382 ± 0.165
2.366AsnTyr: 2.366 ± 0.35
0.0AsnXaa: 0.0 ± 0.0
Pro
0.992ProAla: 0.992 ± 0.275
0.305ProCys: 0.305 ± 0.175
1.832ProAsp: 1.832 ± 0.545
3.358ProGlu: 3.358 ± 0.581
1.832ProPhe: 1.832 ± 0.456
1.221ProGly: 1.221 ± 0.396
0.458ProHis: 0.458 ± 0.18
1.374ProIle: 1.374 ± 0.317
2.671ProLys: 2.671 ± 0.472
2.213ProLeu: 2.213 ± 0.42
0.611ProMet: 0.611 ± 0.245
2.9ProAsn: 2.9 ± 0.502
0.611ProPro: 0.611 ± 0.203
1.298ProGln: 1.298 ± 0.301
0.84ProArg: 0.84 ± 0.256
2.061ProSer: 2.061 ± 0.413
3.206ProThr: 3.206 ± 0.458
2.29ProVal: 2.29 ± 0.497
0.382ProTrp: 0.382 ± 0.181
1.221ProTyr: 1.221 ± 0.32
0.0ProXaa: 0.0 ± 0.0
Gln
2.442GlnAla: 2.442 ± 0.389
0.153GlnCys: 0.153 ± 0.101
2.137GlnAsp: 2.137 ± 0.497
3.664GlnGlu: 3.664 ± 0.545
1.603GlnPhe: 1.603 ± 0.425
2.595GlnGly: 2.595 ± 0.419
0.611GlnHis: 0.611 ± 0.238
3.129GlnIle: 3.129 ± 0.419
3.282GlnLys: 3.282 ± 0.579
3.816GlnLeu: 3.816 ± 0.568
1.069GlnMet: 1.069 ± 0.372
1.526GlnAsn: 1.526 ± 0.395
0.992GlnPro: 0.992 ± 0.244
2.29GlnGln: 2.29 ± 0.531
1.603GlnArg: 1.603 ± 0.385
2.519GlnSer: 2.519 ± 0.488
2.519GlnThr: 2.519 ± 0.364
1.984GlnVal: 1.984 ± 0.384
0.611GlnTrp: 0.611 ± 0.184
1.145GlnTyr: 1.145 ± 0.333
0.0GlnXaa: 0.0 ± 0.0
Arg
2.977ArgAla: 2.977 ± 0.523
0.153ArgCys: 0.153 ± 0.101
2.061ArgAsp: 2.061 ± 0.358
2.366ArgGlu: 2.366 ± 0.529
1.679ArgPhe: 1.679 ± 0.42
1.832ArgGly: 1.832 ± 0.42
1.374ArgHis: 1.374 ± 0.302
2.671ArgIle: 2.671 ± 0.558
3.053ArgLys: 3.053 ± 0.676
3.435ArgLeu: 3.435 ± 0.456
1.221ArgMet: 1.221 ± 0.409
1.908ArgAsn: 1.908 ± 0.341
1.679ArgPro: 1.679 ± 0.339
1.832ArgGln: 1.832 ± 0.313
2.137ArgArg: 2.137 ± 0.577
2.061ArgSer: 2.061 ± 0.389
2.671ArgThr: 2.671 ± 0.417
2.29ArgVal: 2.29 ± 0.518
0.687ArgTrp: 0.687 ± 0.274
1.679ArgTyr: 1.679 ± 0.266
0.0ArgXaa: 0.0 ± 0.0
Ser
5.572SerAla: 5.572 ± 0.814
0.305SerCys: 0.305 ± 0.179
4.198SerAsp: 4.198 ± 0.55
4.503SerGlu: 4.503 ± 0.593
2.824SerPhe: 2.824 ± 0.476
6.64SerGly: 6.64 ± 1.233
1.526SerHis: 1.526 ± 0.226
4.35SerIle: 4.35 ± 0.578
5.19SerLys: 5.19 ± 0.676
4.961SerLeu: 4.961 ± 0.59
1.221SerMet: 1.221 ± 0.314
5.419SerAsn: 5.419 ± 1.174
1.908SerPro: 1.908 ± 0.393
3.206SerGln: 3.206 ± 0.395
2.213SerArg: 2.213 ± 0.452
6.106SerSer: 6.106 ± 1.145
5.419SerThr: 5.419 ± 0.879
4.045SerVal: 4.045 ± 0.547
0.763SerTrp: 0.763 ± 0.282
2.061SerTyr: 2.061 ± 0.468
0.0SerXaa: 0.0 ± 0.0
Thr
5.343ThrAla: 5.343 ± 0.652
0.382ThrCys: 0.382 ± 0.216
3.358ThrAsp: 3.358 ± 0.722
3.129ThrGlu: 3.129 ± 0.456
3.282ThrPhe: 3.282 ± 0.566
6.259ThrGly: 6.259 ± 0.843
0.992ThrHis: 0.992 ± 0.304
4.656ThrIle: 4.656 ± 0.636
5.037ThrLys: 5.037 ± 0.605
5.114ThrLeu: 5.114 ± 0.605
1.908ThrMet: 1.908 ± 0.525
5.114ThrAsn: 5.114 ± 0.775
2.442ThrPro: 2.442 ± 0.538
2.748ThrGln: 2.748 ± 0.48
2.213ThrArg: 2.213 ± 0.406
6.03ThrSer: 6.03 ± 0.986
5.266ThrThr: 5.266 ± 1.078
5.419ThrVal: 5.419 ± 1.175
0.763ThrTrp: 0.763 ± 0.204
2.137ThrTyr: 2.137 ± 0.416
0.0ThrXaa: 0.0 ± 0.0
Val
5.724ValAla: 5.724 ± 1.532
0.916ValCys: 0.916 ± 0.251
3.511ValAsp: 3.511 ± 0.578
3.282ValGlu: 3.282 ± 0.47
2.213ValPhe: 2.213 ± 0.444
4.808ValGly: 4.808 ± 0.546
0.992ValHis: 0.992 ± 0.267
3.129ValIle: 3.129 ± 0.446
3.74ValLys: 3.74 ± 0.702
4.732ValLeu: 4.732 ± 0.52
0.992ValMet: 0.992 ± 0.22
4.656ValAsn: 4.656 ± 0.926
2.442ValPro: 2.442 ± 0.36
2.519ValGln: 2.519 ± 0.451
2.671ValArg: 2.671 ± 0.448
4.274ValSer: 4.274 ± 0.687
4.274ValThr: 4.274 ± 0.702
2.977ValVal: 2.977 ± 0.58
0.611ValTrp: 0.611 ± 0.222
1.374ValTyr: 1.374 ± 0.347
0.0ValXaa: 0.0 ± 0.0
Trp
0.84TrpAla: 0.84 ± 0.277
0.076TrpCys: 0.076 ± 0.061
0.458TrpAsp: 0.458 ± 0.142
0.763TrpGlu: 0.763 ± 0.301
0.458TrpPhe: 0.458 ± 0.215
0.687TrpGly: 0.687 ± 0.205
0.153TrpHis: 0.153 ± 0.117
0.687TrpIle: 0.687 ± 0.231
1.145TrpLys: 1.145 ± 0.303
1.45TrpLeu: 1.45 ± 0.312
0.229TrpMet: 0.229 ± 0.125
0.534TrpAsn: 0.534 ± 0.222
0.153TrpPro: 0.153 ± 0.088
0.382TrpGln: 0.382 ± 0.177
0.458TrpArg: 0.458 ± 0.257
1.069TrpSer: 1.069 ± 0.388
0.763TrpThr: 0.763 ± 0.235
1.069TrpVal: 1.069 ± 0.339
0.229TrpTrp: 0.229 ± 0.136
0.229TrpTyr: 0.229 ± 0.119
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.526TyrAla: 1.526 ± 0.381
0.153TyrCys: 0.153 ± 0.089
1.908TyrAsp: 1.908 ± 0.475
1.45TyrGlu: 1.45 ± 0.352
1.221TyrPhe: 1.221 ± 0.208
2.519TyrGly: 2.519 ± 0.477
0.992TyrHis: 0.992 ± 0.266
1.679TyrIle: 1.679 ± 0.49
2.9TyrLys: 2.9 ± 0.44
2.671TyrLeu: 2.671 ± 0.327
0.534TyrMet: 0.534 ± 0.182
1.908TyrAsn: 1.908 ± 0.407
1.145TyrPro: 1.145 ± 0.26
1.45TyrGln: 1.45 ± 0.265
0.916TyrArg: 0.916 ± 0.241
2.519TyrSer: 2.519 ± 0.474
2.824TyrThr: 2.824 ± 0.498
1.984TyrVal: 1.984 ± 0.515
0.534TyrTrp: 0.534 ± 0.232
0.763TyrTyr: 0.763 ± 0.24
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 53 proteins (13103 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski