Amino acid dipeptide frequency for Acinetobacter phage VB_ApiP_XC38

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
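As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping residue pairs in a few toy sequences and reports them in per mille. The function name, the toy sequences, the one-letter codes and the choice to normalise by the total number of dipeptides are assumptions made for this example; the page does not state the exact normalisation used by the database.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return per-mille frequencies of overlapping dipeptides.

    `proteins` is an iterable of one-letter amino acid strings.
    Pairs are counted within each protein only, never across two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of length 2 along the sequence.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalise by the total number of dipeptides observed.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input, not taken from the phage proteome.
toy_proteins = ["MKAAL", "MAAK"]
freqs = dipeptide_frequencies(toy_proteins)
```

Here `freqs["AA"]` is the share of all counted pairs that are Ala followed by Ala, expressed in per mille.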

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each of the 400 dipeptides would appear with a frequency of 0.25% (2.5‰). As this is not the case in nature, dipeptides that occur more often than expected are marked in red in the table, and underrepresented ones in blue, for better visibility.
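The arithmetic behind the expected value can be sketched as follows. The function names are hypothetical, and the simple greater-than/less-than comparison is an assumption: the page does not say which threshold is used to colour a cell red or blue.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Expected per-mille frequency of one dipeptide if all pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_permille, expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected_permille:
        return "over"
    if observed_permille < expected_permille:
        return "under"
    return "as expected"


expected_20 = uniform_expectation_permille(20)  # 400 dipeptides
expected_21 = uniform_expectation_permille(21)  # 441 dipeptides, Xaa included

# Two values taken from the table: AlaAla and CysCys.
ala_ala = classify(8.734, expected_20)
cys_cys = classify(0.207, expected_20)
```

With 20 amino acids the expectation is 2.5‰ per dipeptide, so under this rule AlaAla (8.734‰) counts as overrepresented and CysCys (0.207‰) as underrepresented.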

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
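A minimal sketch of the unit conversion, using hypothetical helper names:

```python
def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per-mille value to percent (divide by 10)."""
    return value_permille / 10.0


# AlaAla from the table: 8.734 per mille.
ala_ala_fraction = permille_to_fraction(8.734)
ala_ala_percent = permille_to_percent(8.734)
```

For example, the AlaAla value of 8.734‰ corresponds to a fraction of about 0.008734, or about 0.87%.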

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰, value ± error), grouped by first residue. Residue order: Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa.
Ala
AlaAla: 8.734 ± 0.934
AlaCys: 0.497 ± 0.164
AlaAsp: 5.298 ± 0.465
AlaGlu: 6.044 ± 0.529
AlaPhe: 2.401 ± 0.47
AlaGly: 4.595 ± 0.55
AlaHis: 1.532 ± 0.24
AlaIle: 5.216 ± 0.432
AlaLys: 6.333 ± 0.683
AlaLeu: 8.817 ± 1.038
AlaMet: 2.318 ± 0.495
AlaAsn: 5.091 ± 0.506
AlaPro: 1.904 ± 0.304
AlaGln: 4.305 ± 0.995
AlaArg: 4.098 ± 0.34
AlaSer: 5.216 ± 0.853
AlaThr: 5.547 ± 0.819
AlaVal: 5.34 ± 0.728
AlaTrp: 0.993 ± 0.192
AlaTyr: 3.229 ± 0.491
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.786 ± 0.248
CysCys: 0.207 ± 0.1
CysAsp: 0.497 ± 0.165
CysGlu: 0.414 ± 0.154
CysPhe: 0.331 ± 0.146
CysGly: 0.497 ± 0.204
CysHis: 0.207 ± 0.092
CysIle: 0.455 ± 0.153
CysLys: 0.621 ± 0.19
CysLeu: 0.704 ± 0.244
CysMet: 0.331 ± 0.111
CysAsn: 0.497 ± 0.168
CysPro: 0.083 ± 0.056
CysGln: 0.373 ± 0.136
CysArg: 0.29 ± 0.153
CysSer: 0.497 ± 0.149
CysThr: 0.455 ± 0.17
CysVal: 0.497 ± 0.167
CysTrp: 0.041 ± 0.033
CysTyr: 0.331 ± 0.132
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.961 ± 0.694
AspCys: 0.497 ± 0.178
AspAsp: 2.277 ± 0.34
AspGlu: 4.057 ± 0.478
AspPhe: 2.442 ± 0.345
AspGly: 4.264 ± 0.525
AspHis: 1.2 ± 0.27
AspIle: 3.808 ± 0.394
AspLys: 3.601 ± 0.426
AspLeu: 5.133 ± 0.463
AspMet: 1.366 ± 0.26
AspAsn: 3.27 ± 0.396
AspPro: 2.732 ± 0.315
AspGln: 2.111 ± 0.328
AspArg: 1.904 ± 0.304
AspSer: 3.767 ± 0.384
AspThr: 4.512 ± 0.424
AspVal: 3.85 ± 0.363
AspTrp: 0.621 ± 0.14
AspTyr: 2.401 ± 0.307
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.878 ± 0.583
GluCys: 0.621 ± 0.281
GluAsp: 3.519 ± 0.43
GluGlu: 4.346 ± 0.435
GluPhe: 2.235 ± 0.272
GluGly: 3.146 ± 0.258
GluHis: 1.449 ± 0.273
GluIle: 3.643 ± 0.367
GluLys: 3.477 ± 0.531
GluLeu: 8.072 ± 0.774
GluMet: 1.614 ± 0.303
GluAsn: 2.525 ± 0.28
GluPro: 1.656 ± 0.47
GluGln: 3.767 ± 0.615
GluArg: 3.063 ± 0.368
GluSer: 3.85 ± 0.401
GluThr: 3.146 ± 0.49
GluVal: 4.967 ± 0.51
GluTrp: 0.497 ± 0.183
GluTyr: 2.442 ± 0.404
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.318 ± 0.366
PheCys: 0.373 ± 0.155
PheAsp: 1.987 ± 0.262
PheGlu: 2.028 ± 0.28
PhePhe: 1.118 ± 0.235
PheGly: 2.028 ± 0.302
PheHis: 0.497 ± 0.165
PheIle: 2.566 ± 0.426
PheLys: 2.732 ± 0.384
PheLeu: 2.401 ± 0.301
PheMet: 1.49 ± 0.387
PheAsn: 2.608 ± 0.372
PhePro: 0.745 ± 0.191
PheGln: 1.449 ± 0.298
PheArg: 1.49 ± 0.26
PheSer: 1.821 ± 0.274
PheThr: 2.649 ± 0.403
PheVal: 2.194 ± 0.354
PheTrp: 0.248 ± 0.098
PheTyr: 1.532 ± 0.267
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.057 ± 0.543
GlyCys: 0.373 ± 0.162
GlyAsp: 2.732 ± 0.351
GlyGlu: 3.022 ± 0.335
GlyPhe: 2.566 ± 0.402
GlyGly: 3.312 ± 0.459
GlyHis: 0.869 ± 0.197
GlyIle: 4.76 ± 0.466
GlyLys: 4.388 ± 0.478
GlyLeu: 5.298 ± 0.521
GlyMet: 1.697 ± 0.279
GlyAsn: 3.643 ± 0.38
GlyPro: 0.911 ± 0.204
GlyGln: 2.525 ± 0.294
GlyArg: 1.946 ± 0.222
GlySer: 3.56 ± 0.411
GlyThr: 4.926 ± 0.439
GlyVal: 5.091 ± 0.452
GlyTrp: 0.621 ± 0.173
GlyTyr: 3.022 ± 0.487
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.283 ± 0.341
HisCys: 0.248 ± 0.104
HisAsp: 1.159 ± 0.222
HisGlu: 1.118 ± 0.237
HisPhe: 0.704 ± 0.191
HisGly: 1.449 ± 0.335
HisHis: 0.621 ± 0.175
HisIle: 1.035 ± 0.251
HisLys: 1.242 ± 0.242
HisLeu: 1.863 ± 0.389
HisMet: 0.414 ± 0.176
HisAsn: 1.118 ± 0.236
HisPro: 0.828 ± 0.231
HisGln: 0.662 ± 0.167
HisArg: 0.745 ± 0.182
HisSer: 0.952 ± 0.191
HisThr: 0.662 ± 0.165
HisVal: 1.283 ± 0.292
HisTrp: 0.29 ± 0.104
HisTyr: 0.704 ± 0.201
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.381 ± 0.38
IleCys: 0.538 ± 0.191
IleAsp: 4.967 ± 0.405
IleGlu: 4.015 ± 0.401
IlePhe: 1.366 ± 0.264
IleGly: 4.057 ± 0.551
IleHis: 1.076 ± 0.231
IleIle: 3.022 ± 0.398
IleLys: 4.843 ± 0.356
IleLeu: 3.684 ± 0.417
IleMet: 1.449 ± 0.266
IleAsn: 3.477 ± 0.489
IlePro: 2.111 ± 0.445
IleGln: 2.442 ± 0.329
IleArg: 2.732 ± 0.275
IleSer: 3.519 ± 0.454
IleThr: 4.636 ± 0.748
IleVal: 4.222 ± 0.532
IleTrp: 0.58 ± 0.197
IleTyr: 2.152 ± 0.378
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.209 ± 0.538
LysCys: 0.414 ± 0.144
LysAsp: 4.843 ± 0.428
LysGlu: 4.76 ± 0.647
LysPhe: 2.649 ± 0.351
LysGly: 3.932 ± 0.526
LysHis: 1.325 ± 0.324
LysIle: 3.022 ± 0.531
LysLys: 3.643 ± 0.361
LysLeu: 6.126 ± 0.608
LysMet: 2.194 ± 0.364
LysAsn: 2.649 ± 0.39
LysPro: 3.022 ± 0.39
LysGln: 3.394 ± 0.644
LysArg: 3.146 ± 0.478
LysSer: 4.76 ± 0.42
LysThr: 3.684 ± 0.411
LysVal: 4.471 ± 0.464
LysTrp: 0.58 ± 0.17
LysTyr: 2.442 ± 0.337
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.913 ± 0.663
LeuCys: 1.035 ± 0.293
LeuAsp: 5.919 ± 0.529
LeuGlu: 6.168 ± 0.536
LeuPhe: 2.566 ± 0.343
LeuGly: 5.216 ± 0.644
LeuHis: 1.449 ± 0.333
LeuIle: 5.547 ± 0.627
LeuLys: 5.464 ± 0.573
LeuLeu: 6.623 ± 0.492
LeuMet: 2.442 ± 0.348
LeuAsn: 5.547 ± 0.452
LeuPro: 4.181 ± 0.475
LeuGln: 4.057 ± 0.458
LeuArg: 4.388 ± 0.411
LeuSer: 6.251 ± 0.873
LeuThr: 5.174 ± 0.589
LeuVal: 5.34 ± 0.52
LeuTrp: 0.704 ± 0.15
LeuTyr: 2.608 ± 0.354
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.566 ± 0.332
MetCys: 0.29 ± 0.133
MetAsp: 1.076 ± 0.201
MetGlu: 1.283 ± 0.273
MetPhe: 1.2 ± 0.22
MetGly: 1.614 ± 0.235
MetHis: 0.704 ± 0.175
MetIle: 1.407 ± 0.222
MetLys: 1.863 ± 0.323
MetLeu: 2.442 ± 0.324
MetMet: 0.745 ± 0.189
MetAsn: 1.449 ± 0.312
MetPro: 1.2 ± 0.329
MetGln: 1.283 ± 0.297
MetArg: 0.662 ± 0.157
MetSer: 2.98 ± 0.484
MetThr: 1.739 ± 0.241
MetVal: 1.573 ± 0.298
MetTrp: 0.207 ± 0.112
MetTyr: 1.076 ± 0.273
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.712 ± 0.745
AsnCys: 0.248 ± 0.114
AsnAsp: 2.856 ± 0.349
AsnGlu: 2.484 ± 0.35
AsnPhe: 2.07 ± 0.334
AsnGly: 2.98 ± 0.386
AsnHis: 0.993 ± 0.22
AsnIle: 2.608 ± 0.378
AsnLys: 3.477 ± 0.445
AsnLeu: 5.588 ± 0.555
AsnMet: 1.573 ± 0.288
AsnAsn: 3.312 ± 0.553
AsnPro: 2.856 ± 0.456
AsnGln: 3.312 ± 0.436
AsnArg: 2.525 ± 0.393
AsnSer: 3.519 ± 0.844
AsnThr: 3.85 ± 0.536
AsnVal: 3.56 ± 0.436
AsnTrp: 0.911 ± 0.255
AsnTyr: 2.484 ± 0.349
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.946 ± 0.288
ProCys: 0.248 ± 0.105
ProAsp: 2.732 ± 0.455
ProGlu: 3.891 ± 0.559
ProPhe: 0.869 ± 0.255
ProGly: 1.407 ± 0.235
ProHis: 0.497 ± 0.168
ProIle: 1.987 ± 0.332
ProLys: 2.152 ± 0.329
ProLeu: 2.359 ± 0.285
ProMet: 1.035 ± 0.242
ProAsn: 1.863 ± 0.319
ProPro: 1.118 ± 0.365
ProGln: 1.946 ± 0.29
ProArg: 1.283 ± 0.295
ProSer: 3.063 ± 0.425
ProThr: 3.146 ± 0.282
ProVal: 2.608 ± 0.445
ProTrp: 0.538 ± 0.179
ProTyr: 0.952 ± 0.242
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.305 ± 0.708
GlnCys: 0.29 ± 0.145
GlnAsp: 2.028 ± 0.345
GlnGlu: 2.566 ± 0.361
GlnPhe: 1.573 ± 0.27
GlnGly: 3.105 ± 0.332
GlnHis: 1.076 ± 0.257
GlnIle: 2.732 ± 0.385
GlnLys: 2.152 ± 0.337
GlnLeu: 4.802 ± 0.461
GlnMet: 1.407 ± 0.269
GlnAsn: 2.732 ± 0.39
GlnPro: 2.028 ± 0.393
GlnGln: 2.401 ± 0.456
GlnArg: 2.111 ± 0.383
GlnSer: 2.649 ± 0.38
GlnThr: 3.229 ± 0.516
GlnVal: 3.353 ± 0.433
GlnTrp: 0.497 ± 0.138
GlnTyr: 1.035 ± 0.224
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.891 ± 0.441
ArgCys: 0.331 ± 0.133
ArgAsp: 2.732 ± 0.459
ArgGlu: 3.022 ± 0.321
ArgPhe: 1.904 ± 0.336
ArgGly: 2.318 ± 0.324
ArgHis: 0.828 ± 0.211
ArgIle: 3.601 ± 0.411
ArgLys: 3.85 ± 0.442
ArgLeu: 2.608 ± 0.268
ArgMet: 1.159 ± 0.188
ArgAsn: 2.318 ± 0.377
ArgPro: 0.869 ± 0.236
ArgGln: 1.532 ± 0.219
ArgArg: 1.159 ± 0.203
ArgSer: 2.235 ± 0.32
ArgThr: 2.484 ± 0.391
ArgVal: 2.815 ± 0.361
ArgTrp: 0.331 ± 0.121
ArgTyr: 1.449 ± 0.239
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.63 ± 1.108
SerCys: 0.455 ± 0.187
SerAsp: 4.015 ± 0.54
SerGlu: 3.353 ± 0.463
SerPhe: 2.152 ± 0.333
SerGly: 4.802 ± 0.458
SerHis: 0.786 ± 0.231
SerIle: 4.76 ± 0.543
SerLys: 3.932 ± 0.382
SerLeu: 6.044 ± 0.731
SerMet: 1.407 ± 0.252
SerAsn: 3.808 ± 0.659
SerPro: 1.656 ± 0.245
SerGln: 3.187 ± 0.492
SerArg: 2.442 ± 0.335
SerSer: 4.222 ± 0.993
SerThr: 4.719 ± 0.825
SerVal: 4.636 ± 0.509
SerTrp: 0.538 ± 0.164
SerTyr: 2.111 ± 0.316
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.582 ± 1.01
ThrCys: 0.662 ± 0.195
ThrAsp: 4.222 ± 0.401
ThrGlu: 3.891 ± 0.45
ThrPhe: 2.318 ± 0.306
ThrGly: 4.76 ± 0.521
ThrHis: 0.911 ± 0.187
ThrIle: 3.932 ± 0.54
ThrLys: 5.423 ± 0.531
ThrLeu: 4.843 ± 0.613
ThrMet: 1.449 ± 0.277
ThrAsn: 4.388 ± 0.74
ThrPro: 2.856 ± 0.425
ThrGln: 2.525 ± 0.413
ThrArg: 2.566 ± 0.324
ThrSer: 4.388 ± 1.024
ThrThr: 5.381 ± 0.851
ThrVal: 4.264 ± 0.731
ThrTrp: 0.911 ± 0.156
ThrTyr: 1.904 ± 0.35
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.63 ± 0.49
ValCys: 0.414 ± 0.152
ValAsp: 4.388 ± 0.327
ValGlu: 4.595 ± 0.535
ValPhe: 1.946 ± 0.318
ValGly: 3.56 ± 0.392
ValHis: 1.325 ± 0.251
ValIle: 3.436 ± 0.351
ValLys: 5.091 ± 0.515
ValLeu: 5.257 ± 0.522
ValMet: 2.028 ± 0.302
ValAsn: 4.139 ± 0.395
ValPro: 3.229 ± 0.41
ValGln: 3.063 ± 0.409
ValArg: 2.856 ± 0.448
ValSer: 4.429 ± 0.693
ValThr: 5.588 ± 0.761
ValVal: 4.057 ± 0.484
ValTrp: 0.621 ± 0.137
ValTyr: 1.946 ± 0.338
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.035 ± 0.185
TrpCys: 0.166 ± 0.089
TrpAsp: 0.248 ± 0.09
TrpGlu: 0.745 ± 0.172
TrpPhe: 0.166 ± 0.082
TrpGly: 0.373 ± 0.189
TrpHis: 0.29 ± 0.121
TrpIle: 0.497 ± 0.159
TrpLys: 0.662 ± 0.189
TrpLeu: 1.325 ± 0.27
TrpMet: 0.331 ± 0.135
TrpAsn: 0.455 ± 0.118
TrpPro: 0.207 ± 0.104
TrpGln: 0.538 ± 0.167
TrpArg: 0.414 ± 0.151
TrpSer: 0.745 ± 0.169
TrpThr: 0.58 ± 0.173
TrpVal: 0.869 ± 0.153
TrpTrp: 0.041 ± 0.044
TrpTyr: 0.497 ± 0.135
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.691 ± 0.378
TyrCys: 0.207 ± 0.098
TyrAsp: 2.277 ± 0.286
TyrGlu: 2.111 ± 0.309
TyrPhe: 1.697 ± 0.277
TyrGly: 1.904 ± 0.376
TyrHis: 0.745 ± 0.22
TyrIle: 2.235 ± 0.452
TyrLys: 2.318 ± 0.324
TyrLeu: 3.643 ± 0.578
TyrMet: 0.869 ± 0.2
TyrAsn: 2.028 ± 0.308
TyrPro: 1.449 ± 0.311
TyrGln: 1.2 ± 0.266
TyrArg: 1.614 ± 0.239
TyrSer: 2.235 ± 0.257
TyrThr: 2.07 ± 0.317
TyrVal: 2.608 ± 0.343
TyrTrp: 0.414 ± 0.124
TyrTyr: 1.242 ± 0.255
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 96 proteins (24159 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
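A protein-level bootstrap can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicate values is reported. Taking the sample standard deviation as the error, the function names, the fixed seed and the toy sequences are all assumptions for illustration; the page states only that bootstrapping was run 100 times at the protein level.

```python
import random
from statistics import stdev


def pair_permille(proteins, pair):
    """Per-mille frequency of one dipeptide among all overlapping pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Spread of the frequency over replicates that resample whole proteins."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    # Assumption: the reported error is the sample standard deviation.
    return stdev(estimates)


# Hypothetical toy proteome (one-letter codes), not the phage data.
toy_proteins = ["MAAK", "MKKL", "GGGG"]
aa_error = bootstrap_error(toy_proteins, "AA")
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so dipeptides concentrated in a few proteins naturally receive larger error estimates.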

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski