Amino acid dipepetide frequency for Pseudomonas virus F116

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
15.397AlaAla: 15.397 ± 1.331
0.99AlaCys: 0.99 ± 0.259
8.317AlaAsp: 8.317 ± 0.724
8.862AlaGlu: 8.862 ± 0.822
3.416AlaPhe: 3.416 ± 0.419
9.951AlaGly: 9.951 ± 0.795
1.931AlaHis: 1.931 ± 0.258
5.347AlaIle: 5.347 ± 0.423
5.545AlaLys: 5.545 ± 0.613
9.802AlaLeu: 9.802 ± 0.803
3.168AlaMet: 3.168 ± 0.357
3.02AlaAsn: 3.02 ± 0.464
7.278AlaPro: 7.278 ± 0.831
6.683AlaGln: 6.683 ± 0.755
9.505AlaArg: 9.505 ± 0.895
5.941AlaSer: 5.941 ± 0.648
5.446AlaThr: 5.446 ± 0.515
7.822AlaVal: 7.822 ± 0.65
1.782AlaTrp: 1.782 ± 0.291
2.772AlaTyr: 2.772 ± 0.445
0.0AlaXaa: 0.0 ± 0.0
Cys
0.891CysAla: 0.891 ± 0.275
0.396CysCys: 0.396 ± 0.153
0.594CysAsp: 0.594 ± 0.18
0.594CysGlu: 0.594 ± 0.202
0.05CysPhe: 0.05 ± 0.057
1.287CysGly: 1.287 ± 0.346
0.495CysHis: 0.495 ± 0.135
0.347CysIle: 0.347 ± 0.121
0.248CysLys: 0.248 ± 0.112
0.792CysLeu: 0.792 ± 0.188
0.149CysMet: 0.149 ± 0.092
0.099CysAsn: 0.099 ± 0.065
0.545CysPro: 0.545 ± 0.185
0.446CysGln: 0.446 ± 0.161
0.941CysArg: 0.941 ± 0.265
0.743CysSer: 0.743 ± 0.213
0.693CysThr: 0.693 ± 0.233
0.396CysVal: 0.396 ± 0.146
0.594CysTrp: 0.594 ± 0.253
0.297CysTyr: 0.297 ± 0.146
0.0CysXaa: 0.0 ± 0.0
Asp
8.713AspAla: 8.713 ± 0.793
0.594AspCys: 0.594 ± 0.196
3.515AspAsp: 3.515 ± 0.447
4.01AspGlu: 4.01 ± 0.495
2.129AspPhe: 2.129 ± 0.319
5.297AspGly: 5.297 ± 0.538
1.287AspHis: 1.287 ± 0.255
2.772AspIle: 2.772 ± 0.339
1.881AspLys: 1.881 ± 0.338
5.099AspLeu: 5.099 ± 0.507
1.832AspMet: 1.832 ± 0.368
1.535AspAsn: 1.535 ± 0.315
3.416AspPro: 3.416 ± 0.435
3.515AspGln: 3.515 ± 0.413
4.604AspArg: 4.604 ± 0.411
2.723AspSer: 2.723 ± 0.461
3.119AspThr: 3.119 ± 0.335
3.466AspVal: 3.466 ± 0.43
0.792AspTrp: 0.792 ± 0.171
1.535AspTyr: 1.535 ± 0.273
0.0AspXaa: 0.0 ± 0.0
Glu
7.624GluAla: 7.624 ± 1.073
0.545GluCys: 0.545 ± 0.17
2.822GluAsp: 2.822 ± 0.419
3.515GluGlu: 3.515 ± 0.442
2.079GluPhe: 2.079 ± 0.424
4.357GluGly: 4.357 ± 0.451
1.287GluHis: 1.287 ± 0.28
2.376GluIle: 2.376 ± 0.444
2.327GluLys: 2.327 ± 0.328
5.842GluLeu: 5.842 ± 0.544
1.188GluMet: 1.188 ± 0.25
1.535GluAsn: 1.535 ± 0.339
3.515GluPro: 3.515 ± 0.489
4.753GluGln: 4.753 ± 0.587
5.198GluArg: 5.198 ± 0.509
3.218GluSer: 3.218 ± 0.415
2.624GluThr: 2.624 ± 0.368
3.466GluVal: 3.466 ± 0.465
0.99GluTrp: 0.99 ± 0.256
1.832GluTyr: 1.832 ± 0.29
0.0GluXaa: 0.0 ± 0.0
Phe
2.921PheAla: 2.921 ± 0.5
0.198PheCys: 0.198 ± 0.104
2.079PheAsp: 2.079 ± 0.336
1.782PheGlu: 1.782 ± 0.331
0.644PhePhe: 0.644 ± 0.159
2.921PheGly: 2.921 ± 0.445
0.594PheHis: 0.594 ± 0.171
1.238PheIle: 1.238 ± 0.259
1.535PheLys: 1.535 ± 0.326
2.129PheLeu: 2.129 ± 0.317
0.446PheMet: 0.446 ± 0.15
1.139PheAsn: 1.139 ± 0.252
1.733PhePro: 1.733 ± 0.257
1.337PheGln: 1.337 ± 0.209
2.228PheArg: 2.228 ± 0.368
1.832PheSer: 1.832 ± 0.304
1.634PheThr: 1.634 ± 0.293
2.426PheVal: 2.426 ± 0.384
0.545PheTrp: 0.545 ± 0.203
0.842PheTyr: 0.842 ± 0.335
0.0PheXaa: 0.0 ± 0.0
Gly
8.218GlyAla: 8.218 ± 0.71
1.386GlyCys: 1.386 ± 0.353
4.852GlyAsp: 4.852 ± 0.567
5.149GlyGlu: 5.149 ± 0.506
3.416GlyPhe: 3.416 ± 0.525
7.179GlyGly: 7.179 ± 0.709
0.99GlyHis: 0.99 ± 0.257
3.862GlyIle: 3.862 ± 0.471
3.515GlyLys: 3.515 ± 0.492
5.743GlyLeu: 5.743 ± 0.48
2.376GlyMet: 2.376 ± 0.329
2.574GlyAsn: 2.574 ± 0.388
2.97GlyPro: 2.97 ± 0.503
4.753GlyGln: 4.753 ± 0.481
6.535GlyArg: 6.535 ± 0.839
4.06GlySer: 4.06 ± 0.386
4.703GlyThr: 4.703 ± 0.655
5.198GlyVal: 5.198 ± 0.645
1.287GlyTrp: 1.287 ± 0.227
2.574GlyTyr: 2.574 ± 0.303
0.0GlyXaa: 0.0 ± 0.0
His
2.228HisAla: 2.228 ± 0.338
0.297HisCys: 0.297 ± 0.134
0.842HisAsp: 0.842 ± 0.261
1.485HisGlu: 1.485 ± 0.242
0.941HisPhe: 0.941 ± 0.265
1.733HisGly: 1.733 ± 0.238
0.743HisHis: 0.743 ± 0.205
0.446HisIle: 0.446 ± 0.152
0.495HisLys: 0.495 ± 0.134
1.881HisLeu: 1.881 ± 0.406
0.495HisMet: 0.495 ± 0.142
0.743HisAsn: 0.743 ± 0.205
1.287HisPro: 1.287 ± 0.288
1.188HisGln: 1.188 ± 0.281
2.129HisArg: 2.129 ± 0.309
0.99HisSer: 0.99 ± 0.285
1.089HisThr: 1.089 ± 0.249
1.04HisVal: 1.04 ± 0.234
0.297HisTrp: 0.297 ± 0.147
0.545HisTyr: 0.545 ± 0.151
0.0HisXaa: 0.0 ± 0.0
Ile
5.693IleAla: 5.693 ± 0.631
0.644IleCys: 0.644 ± 0.192
2.921IleAsp: 2.921 ± 0.312
3.119IleGlu: 3.119 ± 0.423
0.743IlePhe: 0.743 ± 0.214
3.367IleGly: 3.367 ± 0.46
0.99IleHis: 0.99 ± 0.231
1.337IleIle: 1.337 ± 0.258
2.178IleLys: 2.178 ± 0.275
2.525IleLeu: 2.525 ± 0.312
0.743IleMet: 0.743 ± 0.21
1.584IleAsn: 1.584 ± 0.285
2.228IlePro: 2.228 ± 0.283
1.634IleGln: 1.634 ± 0.331
3.02IleArg: 3.02 ± 0.431
1.881IleSer: 1.881 ± 0.335
2.921IleThr: 2.921 ± 0.419
2.079IleVal: 2.079 ± 0.371
0.743IleTrp: 0.743 ± 0.22
0.941IleTyr: 0.941 ± 0.297
0.0IleXaa: 0.0 ± 0.0
Lys
4.852LysAla: 4.852 ± 0.68
0.297LysCys: 0.297 ± 0.119
2.277LysAsp: 2.277 ± 0.336
2.129LysGlu: 2.129 ± 0.372
1.188LysPhe: 1.188 ± 0.265
2.772LysGly: 2.772 ± 0.387
1.04LysHis: 1.04 ± 0.243
1.436LysIle: 1.436 ± 0.459
2.178LysLys: 2.178 ± 0.483
3.367LysLeu: 3.367 ± 0.338
0.644LysMet: 0.644 ± 0.222
0.743LysAsn: 0.743 ± 0.197
2.376LysPro: 2.376 ± 0.263
1.832LysGln: 1.832 ± 0.293
2.772LysArg: 2.772 ± 0.431
1.782LysSer: 1.782 ± 0.231
2.178LysThr: 2.178 ± 0.385
2.871LysVal: 2.871 ± 0.343
0.495LysTrp: 0.495 ± 0.171
0.941LysTyr: 0.941 ± 0.196
0.0LysXaa: 0.0 ± 0.0
Leu
10.446LeuAla: 10.446 ± 0.992
0.644LeuCys: 0.644 ± 0.215
5.347LeuAsp: 5.347 ± 0.524
4.901LeuGlu: 4.901 ± 0.463
2.723LeuPhe: 2.723 ± 0.338
4.951LeuGly: 4.951 ± 0.461
1.634LeuHis: 1.634 ± 0.3
2.426LeuIle: 2.426 ± 0.412
2.574LeuLys: 2.574 ± 0.272
5.842LeuLeu: 5.842 ± 0.694
2.277LeuMet: 2.277 ± 0.338
2.97LeuAsn: 2.97 ± 0.489
4.901LeuPro: 4.901 ± 0.468
3.267LeuGln: 3.267 ± 0.416
6.287LeuArg: 6.287 ± 0.617
5.0LeuSer: 5.0 ± 0.555
4.604LeuThr: 4.604 ± 0.587
5.594LeuVal: 5.594 ± 0.435
0.891LeuTrp: 0.891 ± 0.174
1.634LeuTyr: 1.634 ± 0.34
0.0LeuXaa: 0.0 ± 0.0
Met
2.921MetAla: 2.921 ± 0.435
0.248MetCys: 0.248 ± 0.134
1.188MetAsp: 1.188 ± 0.257
0.842MetGlu: 0.842 ± 0.218
0.347MetPhe: 0.347 ± 0.125
2.079MetGly: 2.079 ± 0.372
0.347MetHis: 0.347 ± 0.119
1.139MetIle: 1.139 ± 0.213
1.139MetLys: 1.139 ± 0.216
1.881MetLeu: 1.881 ± 0.275
0.446MetMet: 0.446 ± 0.15
0.644MetAsn: 0.644 ± 0.15
2.129MetPro: 2.129 ± 0.27
1.089MetGln: 1.089 ± 0.251
1.634MetArg: 1.634 ± 0.321
1.485MetSer: 1.485 ± 0.221
1.931MetThr: 1.931 ± 0.31
1.089MetVal: 1.089 ± 0.23
0.347MetTrp: 0.347 ± 0.159
0.446MetTyr: 0.446 ± 0.146
0.0MetXaa: 0.0 ± 0.0
Asn
3.961AsnAla: 3.961 ± 0.464
0.446AsnCys: 0.446 ± 0.219
1.584AsnAsp: 1.584 ± 0.298
1.287AsnGlu: 1.287 ± 0.277
0.594AsnPhe: 0.594 ± 0.13
2.277AsnGly: 2.277 ± 0.32
0.594AsnHis: 0.594 ± 0.183
1.188AsnIle: 1.188 ± 0.26
0.99AsnLys: 0.99 ± 0.187
2.574AsnLeu: 2.574 ± 0.355
0.891AsnMet: 0.891 ± 0.234
0.644AsnAsn: 0.644 ± 0.235
1.733AsnPro: 1.733 ± 0.22
1.287AsnGln: 1.287 ± 0.279
2.277AsnArg: 2.277 ± 0.33
2.079AsnSer: 2.079 ± 0.315
1.436AsnThr: 1.436 ± 0.189
1.485AsnVal: 1.485 ± 0.314
0.842AsnTrp: 0.842 ± 0.2
0.792AsnTyr: 0.792 ± 0.216
0.0AsnXaa: 0.0 ± 0.0
Pro
7.822ProAla: 7.822 ± 0.666
0.594ProCys: 0.594 ± 0.196
4.505ProAsp: 4.505 ± 0.485
4.258ProGlu: 4.258 ± 0.458
1.485ProPhe: 1.485 ± 0.286
4.357ProGly: 4.357 ± 0.635
1.238ProHis: 1.238 ± 0.251
2.475ProIle: 2.475 ± 0.423
2.03ProLys: 2.03 ± 0.32
3.763ProLeu: 3.763 ± 0.644
1.436ProMet: 1.436 ± 0.321
1.485ProAsn: 1.485 ± 0.325
4.01ProPro: 4.01 ± 0.851
2.178ProGln: 2.178 ± 0.372
3.416ProArg: 3.416 ± 0.499
3.317ProSer: 3.317 ± 0.474
3.862ProThr: 3.862 ± 0.462
3.961ProVal: 3.961 ± 0.409
0.842ProTrp: 0.842 ± 0.208
1.584ProTyr: 1.584 ± 0.301
0.0ProXaa: 0.0 ± 0.0
Gln
8.367GlnAla: 8.367 ± 0.843
0.248GlnCys: 0.248 ± 0.129
2.822GlnAsp: 2.822 ± 0.375
3.317GlnGlu: 3.317 ± 0.512
1.683GlnPhe: 1.683 ± 0.28
4.258GlnGly: 4.258 ± 0.605
1.782GlnHis: 1.782 ± 0.332
2.723GlnIle: 2.723 ± 0.443
1.139GlnLys: 1.139 ± 0.24
2.921GlnLeu: 2.921 ± 0.354
1.188GlnMet: 1.188 ± 0.272
1.139GlnAsn: 1.139 ± 0.349
3.119GlnPro: 3.119 ± 0.311
4.307GlnGln: 4.307 ± 0.791
3.367GlnArg: 3.367 ± 0.43
1.733GlnSer: 1.733 ± 0.369
1.832GlnThr: 1.832 ± 0.255
2.673GlnVal: 2.673 ± 0.304
0.941GlnTrp: 0.941 ± 0.243
1.832GlnTyr: 1.832 ± 0.315
0.0GlnXaa: 0.0 ± 0.0
Arg
8.07ArgAla: 8.07 ± 0.819
1.139ArgCys: 1.139 ± 0.312
4.406ArgAsp: 4.406 ± 0.568
4.703ArgGlu: 4.703 ± 0.635
2.673ArgPhe: 2.673 ± 0.394
5.693ArgGly: 5.693 ± 0.637
1.535ArgHis: 1.535 ± 0.322
3.515ArgIle: 3.515 ± 0.373
3.119ArgLys: 3.119 ± 0.385
7.228ArgLeu: 7.228 ± 0.652
1.386ArgMet: 1.386 ± 0.262
2.574ArgAsn: 2.574 ± 0.288
4.109ArgPro: 4.109 ± 0.545
4.456ArgGln: 4.456 ± 0.533
7.624ArgArg: 7.624 ± 1.071
4.406ArgSer: 4.406 ± 0.558
3.565ArgThr: 3.565 ± 0.542
5.347ArgVal: 5.347 ± 0.421
1.188ArgTrp: 1.188 ± 0.266
1.881ArgTyr: 1.881 ± 0.388
0.0ArgXaa: 0.0 ± 0.0
Ser
6.139SerAla: 6.139 ± 0.507
0.396SerCys: 0.396 ± 0.158
3.317SerAsp: 3.317 ± 0.469
2.772SerGlu: 2.772 ± 0.361
1.436SerPhe: 1.436 ± 0.245
5.693SerGly: 5.693 ± 0.582
0.941SerHis: 0.941 ± 0.236
2.327SerIle: 2.327 ± 0.307
1.931SerLys: 1.931 ± 0.278
4.159SerLeu: 4.159 ± 0.402
1.485SerMet: 1.485 ± 0.203
1.683SerAsn: 1.683 ± 0.305
3.218SerPro: 3.218 ± 0.498
2.228SerGln: 2.228 ± 0.339
3.02SerArg: 3.02 ± 0.316
3.119SerSer: 3.119 ± 0.547
3.565SerThr: 3.565 ± 0.513
3.317SerVal: 3.317 ± 0.367
0.99SerTrp: 0.99 ± 0.203
1.485SerTyr: 1.485 ± 0.312
0.0SerXaa: 0.0 ± 0.0
Thr
7.129ThrAla: 7.129 ± 0.628
0.545ThrCys: 0.545 ± 0.201
3.713ThrAsp: 3.713 ± 0.378
2.475ThrGlu: 2.475 ± 0.35
1.931ThrPhe: 1.931 ± 0.325
5.198ThrGly: 5.198 ± 0.515
0.99ThrHis: 0.99 ± 0.226
2.079ThrIle: 2.079 ± 0.378
1.782ThrLys: 1.782 ± 0.282
3.961ThrLeu: 3.961 ± 0.365
0.99ThrMet: 0.99 ± 0.207
1.683ThrAsn: 1.683 ± 0.233
3.713ThrPro: 3.713 ± 0.425
1.98ThrGln: 1.98 ± 0.303
3.911ThrArg: 3.911 ± 0.43
2.772ThrSer: 2.772 ± 0.382
3.218ThrThr: 3.218 ± 0.49
3.961ThrVal: 3.961 ± 0.479
0.941ThrTrp: 0.941 ± 0.189
1.238ThrTyr: 1.238 ± 0.211
0.0ThrXaa: 0.0 ± 0.0
Val
7.327ValAla: 7.327 ± 0.61
0.446ValCys: 0.446 ± 0.158
4.505ValAsp: 4.505 ± 0.475
4.208ValGlu: 4.208 ± 0.589
1.188ValPhe: 1.188 ± 0.299
4.802ValGly: 4.802 ± 0.476
1.584ValHis: 1.584 ± 0.351
2.97ValIle: 2.97 ± 0.512
2.277ValLys: 2.277 ± 0.389
5.693ValLeu: 5.693 ± 0.562
1.535ValMet: 1.535 ± 0.228
1.634ValAsn: 1.634 ± 0.246
3.911ValPro: 3.911 ± 0.537
2.525ValGln: 2.525 ± 0.326
5.644ValArg: 5.644 ± 0.555
3.713ValSer: 3.713 ± 0.479
3.317ValThr: 3.317 ± 0.438
3.911ValVal: 3.911 ± 0.397
0.594ValTrp: 0.594 ± 0.188
1.139ValTyr: 1.139 ± 0.254
0.0ValXaa: 0.0 ± 0.0
Trp
1.782TrpAla: 1.782 ± 0.255
0.248TrpCys: 0.248 ± 0.133
0.594TrpAsp: 0.594 ± 0.178
0.594TrpGlu: 0.594 ± 0.215
0.396TrpPhe: 0.396 ± 0.156
1.188TrpGly: 1.188 ± 0.322
0.495TrpHis: 0.495 ± 0.137
0.248TrpIle: 0.248 ± 0.129
0.545TrpLys: 0.545 ± 0.136
1.733TrpLeu: 1.733 ± 0.307
0.347TrpMet: 0.347 ± 0.115
0.495TrpAsn: 0.495 ± 0.139
1.04TrpPro: 1.04 ± 0.32
0.693TrpGln: 0.693 ± 0.175
1.782TrpArg: 1.782 ± 0.283
0.743TrpSer: 0.743 ± 0.239
1.04TrpThr: 1.04 ± 0.256
1.188TrpVal: 1.188 ± 0.246
0.446TrpTrp: 0.446 ± 0.134
0.545TrpTyr: 0.545 ± 0.189
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.525TyrAla: 2.525 ± 0.411
0.297TyrCys: 0.297 ± 0.101
1.634TyrAsp: 1.634 ± 0.261
1.089TyrGlu: 1.089 ± 0.282
1.139TyrPhe: 1.139 ± 0.287
1.98TyrGly: 1.98 ± 0.344
0.396TyrHis: 0.396 ± 0.148
1.04TyrIle: 1.04 ± 0.204
0.743TyrLys: 0.743 ± 0.157
1.98TyrLeu: 1.98 ± 0.293
0.297TyrMet: 0.297 ± 0.166
1.04TyrAsn: 1.04 ± 0.249
1.386TyrPro: 1.386 ± 0.242
1.238TyrGln: 1.238 ± 0.219
2.723TyrArg: 2.723 ± 0.387
1.683TyrSer: 1.683 ± 0.387
1.436TyrThr: 1.436 ± 0.254
1.634TyrVal: 1.634 ± 0.396
0.594TyrTrp: 0.594 ± 0.22
0.396TyrTyr: 0.396 ± 0.115
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 70 proteins (20200 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski