Amino acid dipeptide frequency for Sulfitobacter phage EE36phi1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones are marked in blue in the table.
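The calculation described above can be sketched in a few lines of Python. This is only a minimal illustration, not the Proteome-pI implementation: the function name `dipeptide_permille`, the use of one-letter codes, and the choice to count overlapping pairs within each protein (never across protein boundaries) are assumptions made for the example.

```python
from collections import Counter

# One-letter codes in the same order as the table columns (Xaa omitted).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over all adjacent pairs.

    Assumption: pairs are counted with overlapping windows inside each
    protein, and no pair spans two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectation: 1 / 400 of all dipeptides, expressed in per mille.
expected_permille = 1000.0 / len(AMINO_ACIDS) ** 2

# Hypothetical toy input, not taken from the phage proteome.
toy = ["AAG", "GA"]
freqs = dipeptide_permille(toy)
overrepresented = {p for p, f in freqs.items() if f > expected_permille}
```

With real proteome data, comparing each value against `expected_permille` gives the same over/under classification that the red and blue markings convey, under the uniform assumption stated in the text.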

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
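As a quick illustration of the unit conversion, the following sketch turns a per mille value into a fraction and into a percentage. The helper names are hypothetical; the AlaAla value of 7.87‰ is taken from the table.

```python
def permille_to_fraction(value_permille):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert per mille to percent (1% equals 10 per mille)."""
    return value_permille / 10.0


ala_ala = 7.87  # AlaAla frequency from the table, in per mille
ala_ala_fraction = permille_to_fraction(ala_ala)  # roughly 0.00787
ala_ala_percent = permille_to_percent(ala_ala)    # roughly 0.787

# The uniform expectation of 0.25% corresponds to 2.5 per mille.
uniform_percent = permille_to_percent(2.5)
```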

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.87 ± 0.799
AlaCys: 0.609 ± 0.186
AlaAsp: 4.696 ± 0.544
AlaGlu: 5.87 ± 0.8
AlaPhe: 2.565 ± 0.281
AlaGly: 6.305 ± 0.803
AlaHis: 1.174 ± 0.261
AlaIle: 3.652 ± 0.363
AlaLys: 4.87 ± 0.506
AlaLeu: 5.87 ± 0.471
AlaMet: 3.044 ± 0.386
AlaAsn: 4.305 ± 0.538
AlaPro: 2.826 ± 0.31
AlaGln: 3.218 ± 0.531
AlaArg: 3.783 ± 0.431
AlaSer: 4.739 ± 0.317
AlaThr: 4.696 ± 0.53
AlaVal: 6.131 ± 0.781
AlaTrp: 1.174 ± 0.254
AlaTyr: 2.609 ± 0.256
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.478 ± 0.181
CysCys: 0.043 ± 0.041
CysAsp: 0.435 ± 0.187
CysGlu: 0.435 ± 0.134
CysPhe: 0.391 ± 0.138
CysGly: 0.609 ± 0.204
CysHis: 0.348 ± 0.137
CysIle: 0.304 ± 0.17
CysLys: 0.609 ± 0.184
CysLeu: 0.652 ± 0.219
CysMet: 0.391 ± 0.152
CysAsn: 0.087 ± 0.061
CysPro: 0.478 ± 0.163
CysGln: 0.435 ± 0.17
CysArg: 0.391 ± 0.149
CysSer: 0.348 ± 0.145
CysThr: 0.565 ± 0.2
CysVal: 0.348 ± 0.153
CysTrp: 0.087 ± 0.065
CysTyr: 0.261 ± 0.109
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.913 ± 0.638
AspCys: 0.391 ± 0.126
AspAsp: 3.826 ± 0.409
AspGlu: 4.348 ± 0.473
AspPhe: 2.913 ± 0.338
AspGly: 4.739 ± 0.481
AspHis: 1.174 ± 0.236
AspIle: 4.0 ± 0.43
AspLys: 3.0 ± 0.48
AspLeu: 6.392 ± 0.426
AspMet: 1.826 ± 0.328
AspAsn: 2.783 ± 0.325
AspPro: 3.957 ± 0.436
AspGln: 2.565 ± 0.294
AspArg: 3.087 ± 0.352
AspSer: 3.174 ± 0.471
AspThr: 3.348 ± 0.293
AspVal: 4.565 ± 0.478
AspTrp: 1.174 ± 0.148
AspTyr: 2.261 ± 0.372
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.305 ± 0.561
GluCys: 0.565 ± 0.259
GluAsp: 4.087 ± 0.388
GluGlu: 5.783 ± 0.545
GluPhe: 3.348 ± 0.41
GluGly: 4.348 ± 0.566
GluHis: 1.304 ± 0.345
GluIle: 4.131 ± 0.347
GluLys: 3.87 ± 0.499
GluLeu: 5.087 ± 0.514
GluMet: 2.435 ± 0.343
GluAsn: 3.0 ± 0.299
GluPro: 1.783 ± 0.23
GluGln: 3.218 ± 0.398
GluArg: 3.435 ± 0.314
GluSer: 3.565 ± 0.378
GluThr: 3.826 ± 0.552
GluVal: 5.348 ± 0.434
GluTrp: 0.565 ± 0.167
GluTyr: 2.304 ± 0.281
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.609 ± 0.291
PheCys: 0.174 ± 0.092
PheAsp: 2.696 ± 0.347
PheGlu: 2.044 ± 0.368
PhePhe: 1.391 ± 0.226
PheGly: 2.783 ± 0.363
PheHis: 0.696 ± 0.222
PheIle: 2.652 ± 0.371
PheLys: 2.826 ± 0.339
PheLeu: 2.826 ± 0.305
PheMet: 1.565 ± 0.326
PheAsn: 2.435 ± 0.296
PhePro: 1.391 ± 0.181
PheGln: 1.478 ± 0.252
PheArg: 2.131 ± 0.311
PheSer: 2.609 ± 0.304
PheThr: 2.174 ± 0.367
PheVal: 2.261 ± 0.303
PheTrp: 0.304 ± 0.114
PheTyr: 1.348 ± 0.227
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.739 ± 0.629
GlyCys: 0.348 ± 0.157
GlyAsp: 3.957 ± 0.421
GlyGlu: 4.696 ± 0.415
GlyPhe: 3.087 ± 0.319
GlyGly: 5.826 ± 0.812
GlyHis: 1.304 ± 0.217
GlyIle: 3.478 ± 0.484
GlyLys: 4.478 ± 0.482
GlyLeu: 5.739 ± 0.655
GlyMet: 2.435 ± 0.35
GlyAsn: 3.261 ± 0.267
GlyPro: 2.391 ± 0.389
GlyGln: 3.044 ± 0.474
GlyArg: 3.435 ± 0.482
GlySer: 5.435 ± 0.54
GlyThr: 5.261 ± 0.551
GlyVal: 5.218 ± 0.615
GlyTrp: 0.913 ± 0.222
GlyTyr: 3.044 ± 0.288
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.0 ± 0.25
HisCys: 0.174 ± 0.087
HisAsp: 1.13 ± 0.247
HisGlu: 0.913 ± 0.239
HisPhe: 0.652 ± 0.219
HisGly: 1.696 ± 0.365
HisHis: 0.652 ± 0.274
HisIle: 1.391 ± 0.25
HisLys: 1.435 ± 0.351
HisLeu: 2.044 ± 0.398
HisMet: 0.522 ± 0.121
HisAsn: 1.087 ± 0.266
HisPro: 0.783 ± 0.206
HisGln: 0.478 ± 0.133
HisArg: 1.217 ± 0.293
HisSer: 0.87 ± 0.213
HisThr: 0.565 ± 0.187
HisVal: 1.044 ± 0.229
HisTrp: 0.13 ± 0.078
HisTyr: 0.87 ± 0.283
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.174 ± 0.556
IleCys: 0.435 ± 0.155
IleAsp: 3.478 ± 0.345
IleGlu: 3.783 ± 0.382
IlePhe: 2.609 ± 0.303
IleGly: 3.304 ± 0.402
IleHis: 1.0 ± 0.22
IleIle: 3.087 ± 0.364
IleLys: 2.739 ± 0.448
IleLeu: 4.565 ± 0.456
IleMet: 1.13 ± 0.209
IleAsn: 2.739 ± 0.248
IlePro: 3.218 ± 0.371
IleGln: 2.304 ± 0.364
IleArg: 3.261 ± 0.407
IleSer: 3.783 ± 0.359
IleThr: 3.565 ± 0.439
IleVal: 3.087 ± 0.453
IleTrp: 1.044 ± 0.271
IleTyr: 2.304 ± 0.325
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.739 ± 0.598
LysCys: 0.435 ± 0.165
LysAsp: 3.131 ± 0.432
LysGlu: 4.783 ± 0.514
LysPhe: 1.652 ± 0.224
LysGly: 3.565 ± 0.406
LysHis: 1.087 ± 0.305
LysIle: 2.652 ± 0.392
LysLys: 3.739 ± 0.569
LysLeu: 5.0 ± 0.609
LysMet: 2.131 ± 0.255
LysAsn: 2.826 ± 0.37
LysPro: 2.435 ± 0.408
LysGln: 2.435 ± 0.288
LysArg: 2.652 ± 0.306
LysSer: 3.304 ± 0.516
LysThr: 3.304 ± 0.638
LysVal: 3.478 ± 0.357
LysTrp: 0.826 ± 0.221
LysTyr: 1.87 ± 0.331
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.174 ± 0.625
LeuCys: 1.0 ± 0.273
LeuAsp: 5.392 ± 0.436
LeuGlu: 5.218 ± 0.476
LeuPhe: 3.044 ± 0.381
LeuGly: 5.174 ± 0.464
LeuHis: 1.522 ± 0.354
LeuIle: 4.261 ± 0.451
LeuLys: 4.391 ± 0.563
LeuLeu: 5.218 ± 0.726
LeuMet: 2.478 ± 0.31
LeuAsn: 4.696 ± 0.369
LeuPro: 3.696 ± 0.347
LeuGln: 3.391 ± 0.371
LeuArg: 4.0 ± 0.458
LeuSer: 5.174 ± 0.421
LeuThr: 5.435 ± 0.361
LeuVal: 4.696 ± 0.384
LeuTrp: 0.913 ± 0.2
LeuTyr: 2.435 ± 0.395
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.957 ± 0.346
MetCys: 0.261 ± 0.175
MetAsp: 1.87 ± 0.29
MetGlu: 1.435 ± 0.258
MetPhe: 1.174 ± 0.214
MetGly: 2.348 ± 0.316
MetHis: 0.217 ± 0.087
MetIle: 1.565 ± 0.235
MetLys: 1.739 ± 0.227
MetLeu: 2.783 ± 0.24
MetMet: 0.957 ± 0.215
MetAsn: 2.087 ± 0.401
MetPro: 1.696 ± 0.267
MetGln: 1.522 ± 0.232
MetArg: 1.652 ± 0.385
MetSer: 2.522 ± 0.338
MetThr: 2.565 ± 0.354
MetVal: 2.174 ± 0.419
MetTrp: 0.261 ± 0.111
MetTyr: 0.783 ± 0.155
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.435 ± 0.528
AsnCys: 0.522 ± 0.177
AsnAsp: 3.131 ± 0.389
AsnGlu: 3.304 ± 0.519
AsnPhe: 2.261 ± 0.357
AsnGly: 3.739 ± 0.523
AsnHis: 1.087 ± 0.157
AsnIle: 2.652 ± 0.432
AsnLys: 2.652 ± 0.435
AsnLeu: 3.87 ± 0.405
AsnMet: 1.391 ± 0.248
AsnAsn: 3.131 ± 0.432
AsnPro: 3.174 ± 0.364
AsnGln: 3.087 ± 0.311
AsnArg: 2.652 ± 0.288
AsnSer: 2.609 ± 0.287
AsnThr: 3.0 ± 0.366
AsnVal: 3.478 ± 0.378
AsnTrp: 0.913 ± 0.248
AsnTyr: 1.565 ± 0.294
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.957 ± 0.37
ProCys: 0.391 ± 0.147
ProAsp: 4.044 ± 0.341
ProGlu: 3.609 ± 0.302
ProPhe: 1.739 ± 0.301
ProGly: 2.304 ± 0.339
ProHis: 0.87 ± 0.201
ProIle: 2.174 ± 0.322
ProLys: 2.131 ± 0.33
ProLeu: 3.348 ± 0.466
ProMet: 1.0 ± 0.225
ProAsn: 2.261 ± 0.33
ProPro: 1.609 ± 0.256
ProGln: 2.087 ± 0.303
ProArg: 1.696 ± 0.239
ProSer: 3.304 ± 0.423
ProThr: 2.913 ± 0.235
ProVal: 3.261 ± 0.328
ProTrp: 0.696 ± 0.15
ProTyr: 0.826 ± 0.203
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.0 ± 0.423
GlnCys: 0.174 ± 0.083
GlnAsp: 3.044 ± 0.304
GlnGlu: 2.435 ± 0.249
GlnPhe: 1.435 ± 0.285
GlnGly: 2.783 ± 0.375
GlnHis: 1.0 ± 0.237
GlnIle: 2.87 ± 0.369
GlnLys: 2.739 ± 0.383
GlnLeu: 3.174 ± 0.466
GlnMet: 1.826 ± 0.359
GlnAsn: 2.261 ± 0.323
GlnPro: 1.565 ± 0.31
GlnGln: 1.913 ± 0.294
GlnArg: 2.652 ± 0.383
GlnSer: 2.435 ± 0.311
GlnThr: 3.261 ± 0.388
GlnVal: 2.565 ± 0.307
GlnTrp: 0.565 ± 0.173
GlnTyr: 1.522 ± 0.245
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.0 ± 0.388
ArgCys: 0.478 ± 0.189
ArgAsp: 2.826 ± 0.365
ArgGlu: 3.348 ± 0.681
ArgPhe: 2.0 ± 0.363
ArgGly: 3.391 ± 0.467
ArgHis: 0.565 ± 0.162
ArgIle: 3.087 ± 0.315
ArgLys: 2.739 ± 0.309
ArgLeu: 4.261 ± 0.337
ArgMet: 2.261 ± 0.33
ArgAsn: 2.957 ± 0.405
ArgPro: 1.609 ± 0.256
ArgGln: 2.783 ± 0.408
ArgArg: 2.391 ± 0.279
ArgSer: 3.218 ± 0.403
ArgThr: 3.174 ± 0.422
ArgVal: 3.826 ± 0.384
ArgTrp: 0.783 ± 0.217
ArgTyr: 1.913 ± 0.347
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.826 ± 0.396
SerCys: 0.304 ± 0.12
SerAsp: 4.391 ± 0.492
SerGlu: 4.478 ± 0.434
SerPhe: 2.261 ± 0.32
SerGly: 6.609 ± 0.503
SerHis: 1.217 ± 0.324
SerIle: 3.131 ± 0.294
SerLys: 3.478 ± 0.421
SerLeu: 3.957 ± 0.33
SerMet: 2.044 ± 0.302
SerAsn: 3.435 ± 0.348
SerPro: 2.87 ± 0.382
SerGln: 1.913 ± 0.289
SerArg: 3.261 ± 0.392
SerSer: 3.696 ± 0.376
SerThr: 4.044 ± 0.388
SerVal: 4.261 ± 0.557
SerTrp: 0.609 ± 0.231
SerTyr: 1.565 ± 0.257
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.131 ± 0.547
ThrCys: 0.391 ± 0.163
ThrAsp: 4.565 ± 0.522
ThrGlu: 3.522 ± 0.349
ThrPhe: 2.087 ± 0.369
ThrGly: 5.913 ± 0.491
ThrHis: 1.13 ± 0.228
ThrIle: 3.957 ± 0.408
ThrLys: 3.913 ± 0.375
ThrLeu: 4.522 ± 0.528
ThrMet: 1.652 ± 0.313
ThrAsn: 2.87 ± 0.389
ThrPro: 3.131 ± 0.311
ThrGln: 2.739 ± 0.375
ThrArg: 2.913 ± 0.362
ThrSer: 3.391 ± 0.337
ThrThr: 4.131 ± 0.466
ThrVal: 3.609 ± 0.376
ThrTrp: 0.739 ± 0.217
ThrTyr: 1.826 ± 0.262
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.87 ± 0.507
ValCys: 0.391 ± 0.126
ValAsp: 3.826 ± 0.341
ValGlu: 3.913 ± 0.401
ValPhe: 2.261 ± 0.299
ValGly: 4.696 ± 0.447
ValHis: 1.261 ± 0.263
ValIle: 4.261 ± 0.422
ValLys: 2.652 ± 0.381
ValLeu: 5.478 ± 0.546
ValMet: 2.217 ± 0.286
ValAsn: 3.913 ± 0.33
ValPro: 3.087 ± 0.33
ValGln: 2.652 ± 0.319
ValArg: 3.739 ± 0.457
ValSer: 4.957 ± 0.459
ValThr: 4.087 ± 0.381
ValVal: 3.957 ± 0.393
ValTrp: 0.913 ± 0.203
ValTyr: 1.87 ± 0.246
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.044 ± 0.196
TrpCys: 0.13 ± 0.072
TrpAsp: 1.087 ± 0.218
TrpGlu: 1.044 ± 0.222
TrpPhe: 0.478 ± 0.161
TrpGly: 1.13 ± 0.249
TrpHis: 0.348 ± 0.135
TrpIle: 1.217 ± 0.228
TrpLys: 0.913 ± 0.192
TrpLeu: 1.261 ± 0.215
TrpMet: 0.435 ± 0.133
TrpAsn: 0.696 ± 0.167
TrpPro: 0.174 ± 0.084
TrpGln: 0.913 ± 0.179
TrpArg: 0.435 ± 0.13
TrpSer: 0.348 ± 0.119
TrpThr: 0.87 ± 0.195
TrpVal: 0.826 ± 0.214
TrpTrp: 0.565 ± 0.186
TrpTyr: 0.217 ± 0.096
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.783 ± 0.287
TyrCys: 0.478 ± 0.161
TyrAsp: 2.826 ± 0.435
TyrGlu: 2.391 ± 0.369
TyrPhe: 1.044 ± 0.19
TyrGly: 1.826 ± 0.279
TyrHis: 0.696 ± 0.216
TyrIle: 2.087 ± 0.32
TyrLys: 1.13 ± 0.218
TyrLeu: 2.261 ± 0.485
TyrMet: 0.783 ± 0.168
TyrAsn: 1.826 ± 0.213
TyrPro: 1.348 ± 0.255
TyrGln: 2.044 ± 0.302
TyrArg: 1.826 ± 0.222
TyrSer: 1.87 ± 0.308
TyrThr: 1.348 ± 0.223
TyrVal: 1.87 ± 0.27
TyrTrp: 0.957 ± 0.224
TyrTyr: 1.174 ± 0.195
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 79 proteins (23000 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
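A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the Proteome-pI code: the function names are hypothetical, the error is taken to be the sample standard deviation of the resampled estimates, and the toy sequences are invented for the example.

```python
import random
import statistics


def pair_permille(sequences, pair):
    """Frequency of one dipeptide, in per mille, over all adjacent pairs."""
    hits = 0
    total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Resample whole proteins with replacement and return the spread.

    Assumption: the reported error is the standard deviation of the
    frequency across resamples.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        resample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(resample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome in one-letter code.
toy_proteome = ["MAAGK", "MKAAL", "MGGSAA", "MLLK"]
point = pair_permille(toy_proteome, "AA")
error = bootstrap_error(toy_proteome, "AA")
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so the error reflects variation between proteins. A dipeptide that never occurs, such as any pair involving Xaa in this proteome, yields 0.0 ± 0.0 under this scheme.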

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski