Amino acid dipepetide frequency for Qinghai Lake virophage

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.0AlaAla: 0.0 ± 0.0
0.297AlaCys: 0.297 ± 0.197
4.012AlaAsp: 4.012 ± 0.533
1.634AlaGlu: 1.634 ± 0.373
3.12AlaPhe: 3.12 ± 0.647
6.241AlaGly: 6.241 ± 2.949
0.446AlaHis: 0.446 ± 0.233
5.052AlaIle: 5.052 ± 2.053
2.377AlaLys: 2.377 ± 0.799
3.715AlaLeu: 3.715 ± 0.856
0.743AlaMet: 0.743 ± 0.288
2.972AlaAsn: 2.972 ± 0.683
1.932AlaPro: 1.932 ± 0.737
1.634AlaGln: 1.634 ± 0.529
1.634AlaArg: 1.634 ± 0.578
4.012AlaSer: 4.012 ± 1.125
3.715AlaThr: 3.715 ± 1.11
2.675AlaVal: 2.675 ± 1.036
0.0AlaTrp: 0.0 ± 0.0
2.526AlaTyr: 2.526 ± 0.608
0.0AlaXaa: 0.0 ± 0.0
Cys
0.297CysAla: 0.297 ± 0.295
0.446CysCys: 0.446 ± 0.341
1.189CysAsp: 1.189 ± 0.747
0.446CysGlu: 0.446 ± 0.23
0.743CysPhe: 0.743 ± 0.459
0.892CysGly: 0.892 ± 0.387
0.149CysHis: 0.149 ± 0.152
0.743CysIle: 0.743 ± 0.28
0.892CysLys: 0.892 ± 0.333
1.486CysLeu: 1.486 ± 0.41
0.0CysMet: 0.0 ± 0.0
1.189CysAsn: 1.189 ± 0.507
0.743CysPro: 0.743 ± 0.498
0.297CysGln: 0.297 ± 0.206
1.04CysArg: 1.04 ± 0.582
0.594CysSer: 0.594 ± 0.303
0.149CysThr: 0.149 ± 0.149
0.743CysVal: 0.743 ± 0.326
0.149CysTrp: 0.149 ± 0.129
0.892CysTyr: 0.892 ± 0.331
0.0CysXaa: 0.0 ± 0.0
Asp
3.566AspAla: 3.566 ± 0.615
0.892AspCys: 0.892 ± 0.28
3.12AspAsp: 3.12 ± 0.933
4.458AspGlu: 4.458 ± 1.344
3.566AspPhe: 3.566 ± 0.673
3.715AspGly: 3.715 ± 1.024
0.594AspHis: 0.594 ± 0.271
5.349AspIle: 5.349 ± 0.685
5.646AspLys: 5.646 ± 1.408
7.132AspLeu: 7.132 ± 0.993
2.675AspMet: 2.675 ± 0.534
1.486AspAsn: 1.486 ± 0.383
2.675AspPro: 2.675 ± 0.487
0.892AspGln: 0.892 ± 0.357
1.04AspArg: 1.04 ± 0.375
2.526AspSer: 2.526 ± 0.662
3.566AspThr: 3.566 ± 0.629
2.972AspVal: 2.972 ± 0.627
1.04AspTrp: 1.04 ± 0.414
4.012AspTyr: 4.012 ± 0.773
0.0AspXaa: 0.0 ± 0.0
Glu
2.675GluAla: 2.675 ± 0.566
1.486GluCys: 1.486 ± 0.68
5.052GluAsp: 5.052 ± 1.102
5.646GluGlu: 5.646 ± 1.467
2.823GluPhe: 2.823 ± 0.938
4.012GluGly: 4.012 ± 0.887
1.04GluHis: 1.04 ± 0.377
5.498GluIle: 5.498 ± 0.873
5.498GluLys: 5.498 ± 1.193
5.201GluLeu: 5.201 ± 1.159
1.932GluMet: 1.932 ± 0.513
3.715GluAsn: 3.715 ± 1.066
1.932GluPro: 1.932 ± 0.422
2.08GluGln: 2.08 ± 0.568
3.566GluArg: 3.566 ± 0.899
1.634GluSer: 1.634 ± 0.357
2.675GluThr: 2.675 ± 0.516
3.715GluVal: 3.715 ± 0.713
1.04GluTrp: 1.04 ± 0.457
4.012GluTyr: 4.012 ± 0.75
0.0GluXaa: 0.0 ± 0.0
Phe
1.189PheAla: 1.189 ± 0.54
1.04PheCys: 1.04 ± 0.382
1.783PheAsp: 1.783 ± 0.518
2.377PheGlu: 2.377 ± 0.744
2.526PhePhe: 2.526 ± 0.678
2.08PheGly: 2.08 ± 0.468
0.446PheHis: 0.446 ± 0.268
2.377PheIle: 2.377 ± 0.571
4.755PheLys: 4.755 ± 1.272
4.606PheLeu: 4.606 ± 1.024
0.149PheMet: 0.149 ± 0.141
5.201PheAsn: 5.201 ± 0.703
2.08PhePro: 2.08 ± 0.469
1.634PheGln: 1.634 ± 0.77
1.337PheArg: 1.337 ± 0.419
3.12PheSer: 3.12 ± 0.928
1.783PheThr: 1.783 ± 0.577
2.526PheVal: 2.526 ± 0.47
0.446PheTrp: 0.446 ± 0.264
1.932PheTyr: 1.932 ± 0.454
0.0PheXaa: 0.0 ± 0.0
Gly
5.498GlyAla: 5.498 ± 1.789
0.594GlyCys: 0.594 ± 0.259
3.418GlyAsp: 3.418 ± 0.651
3.418GlyGlu: 3.418 ± 0.752
2.823GlyPhe: 2.823 ± 0.853
6.686GlyGly: 6.686 ± 1.71
1.189GlyHis: 1.189 ± 0.38
3.863GlyIle: 3.863 ± 0.725
7.132GlyLys: 7.132 ± 1.285
5.795GlyLeu: 5.795 ± 0.485
1.634GlyMet: 1.634 ± 0.344
4.012GlyAsn: 4.012 ± 0.773
0.149GlyPro: 0.149 ± 0.143
3.863GlyGln: 3.863 ± 2.1
4.012GlyArg: 4.012 ± 1.399
6.241GlySer: 6.241 ± 1.957
3.566GlyThr: 3.566 ± 1.103
2.526GlyVal: 2.526 ± 0.507
0.446GlyTrp: 0.446 ± 0.325
2.972GlyTyr: 2.972 ± 0.582
0.0GlyXaa: 0.0 ± 0.0
His
0.446HisAla: 0.446 ± 0.214
0.0HisCys: 0.0 ± 0.0
0.892HisAsp: 0.892 ± 0.292
0.594HisGlu: 0.594 ± 0.339
0.743HisPhe: 0.743 ± 0.384
1.04HisGly: 1.04 ± 0.472
0.446HisHis: 0.446 ± 0.207
0.743HisIle: 0.743 ± 0.36
1.189HisLys: 1.189 ± 0.559
1.486HisLeu: 1.486 ± 0.388
0.297HisMet: 0.297 ± 0.208
0.0HisAsn: 0.0 ± 0.0
0.892HisPro: 0.892 ± 0.369
0.743HisGln: 0.743 ± 0.317
0.743HisArg: 0.743 ± 0.3
0.892HisSer: 0.892 ± 0.375
1.189HisThr: 1.189 ± 0.427
0.446HisVal: 0.446 ± 0.207
0.149HisTrp: 0.149 ± 0.134
0.446HisTyr: 0.446 ± 0.334
0.0HisXaa: 0.0 ± 0.0
Ile
2.823IleAla: 2.823 ± 0.695
1.04IleCys: 1.04 ± 0.431
4.458IleAsp: 4.458 ± 1.043
3.863IleGlu: 3.863 ± 0.802
2.823IlePhe: 2.823 ± 0.738
7.429IleGly: 7.429 ± 2.012
0.743IleHis: 0.743 ± 0.379
4.606IleIle: 4.606 ± 1.084
8.47IleLys: 8.47 ± 1.222
6.984IleLeu: 6.984 ± 0.91
1.337IleMet: 1.337 ± 0.521
5.201IleAsn: 5.201 ± 0.644
3.863IlePro: 3.863 ± 1.054
1.486IleGln: 1.486 ± 0.388
3.566IleArg: 3.566 ± 0.913
4.16IleSer: 4.16 ± 0.887
4.755IleThr: 4.755 ± 0.738
3.269IleVal: 3.269 ± 0.88
0.594IleTrp: 0.594 ± 0.348
3.418IleTyr: 3.418 ± 0.814
0.0IleXaa: 0.0 ± 0.0
Lys
3.269LysAla: 3.269 ± 0.893
1.486LysCys: 1.486 ± 0.407
6.686LysAsp: 6.686 ± 1.392
8.618LysGlu: 8.618 ± 1.954
3.863LysPhe: 3.863 ± 1.03
5.201LysGly: 5.201 ± 1.083
0.892LysHis: 0.892 ± 0.358
4.755LysIle: 4.755 ± 0.778
11.293LysLys: 11.293 ± 2.1
5.201LysLeu: 5.201 ± 0.808
2.972LysMet: 2.972 ± 0.92
6.092LysAsn: 6.092 ± 1.622
2.823LysPro: 2.823 ± 0.625
2.972LysGln: 2.972 ± 0.675
3.715LysArg: 3.715 ± 0.65
3.566LysSer: 3.566 ± 1.095
4.755LysThr: 4.755 ± 0.609
4.309LysVal: 4.309 ± 1.054
1.04LysTrp: 1.04 ± 0.629
5.201LysTyr: 5.201 ± 1.096
0.0LysXaa: 0.0 ± 0.0
Leu
3.12LeuAla: 3.12 ± 0.522
0.446LeuCys: 0.446 ± 0.211
5.201LeuAsp: 5.201 ± 0.97
5.646LeuGlu: 5.646 ± 1.223
3.566LeuPhe: 3.566 ± 0.64
5.795LeuGly: 5.795 ± 0.992
0.892LeuHis: 0.892 ± 0.346
4.309LeuIle: 4.309 ± 0.863
6.241LeuLys: 6.241 ± 1.407
7.281LeuLeu: 7.281 ± 0.787
1.189LeuMet: 1.189 ± 0.436
8.618LeuAsn: 8.618 ± 1.524
3.418LeuPro: 3.418 ± 0.633
3.12LeuGln: 3.12 ± 0.619
4.458LeuArg: 4.458 ± 0.891
6.241LeuSer: 6.241 ± 1.151
6.389LeuThr: 6.389 ± 0.947
3.566LeuVal: 3.566 ± 0.612
0.743LeuTrp: 0.743 ± 0.372
4.16LeuTyr: 4.16 ± 0.862
0.0LeuXaa: 0.0 ± 0.0
Met
1.189MetAla: 1.189 ± 0.434
0.297MetCys: 0.297 ± 0.205
1.189MetAsp: 1.189 ± 0.429
2.377MetGlu: 2.377 ± 0.657
0.594MetPhe: 0.594 ± 0.285
1.337MetGly: 1.337 ± 0.451
0.0MetHis: 0.0 ± 0.0
1.932MetIle: 1.932 ± 0.466
2.229MetLys: 2.229 ± 0.62
1.486MetLeu: 1.486 ± 0.427
0.892MetMet: 0.892 ± 0.312
1.634MetAsn: 1.634 ± 0.359
0.743MetPro: 0.743 ± 0.281
0.297MetGln: 0.297 ± 0.28
1.189MetArg: 1.189 ± 0.528
1.932MetSer: 1.932 ± 0.565
1.337MetThr: 1.337 ± 0.387
1.783MetVal: 1.783 ± 0.434
0.0MetTrp: 0.0 ± 0.0
0.297MetTyr: 0.297 ± 0.207
0.0MetXaa: 0.0 ± 0.0
Asn
3.715AsnAla: 3.715 ± 1.39
0.743AsnCys: 0.743 ± 0.306
4.16AsnAsp: 4.16 ± 0.926
5.201AsnGlu: 5.201 ± 1.005
2.675AsnPhe: 2.675 ± 0.67
3.863AsnGly: 3.863 ± 0.795
0.594AsnHis: 0.594 ± 0.22
4.755AsnIle: 4.755 ± 0.858
5.646AsnLys: 5.646 ± 0.945
6.389AsnLeu: 6.389 ± 0.778
2.08AsnMet: 2.08 ± 0.485
4.903AsnAsn: 4.903 ± 0.761
3.269AsnPro: 3.269 ± 0.653
2.972AsnGln: 2.972 ± 0.79
3.418AsnArg: 3.418 ± 0.762
4.309AsnSer: 4.309 ± 1.078
3.566AsnThr: 3.566 ± 0.672
2.526AsnVal: 2.526 ± 0.639
0.743AsnTrp: 0.743 ± 0.261
3.12AsnTyr: 3.12 ± 0.776
0.0AsnXaa: 0.0 ± 0.0
Pro
1.932ProAla: 1.932 ± 0.506
0.594ProCys: 0.594 ± 0.323
3.269ProAsp: 3.269 ± 0.673
1.634ProGlu: 1.634 ± 0.336
1.932ProPhe: 1.932 ± 0.759
0.0ProGly: 0.0 ± 0.0
0.892ProHis: 0.892 ± 0.441
5.349ProIle: 5.349 ± 1.297
2.526ProLys: 2.526 ± 0.934
3.269ProLeu: 3.269 ± 0.817
0.297ProMet: 0.297 ± 0.197
2.526ProAsn: 2.526 ± 0.721
1.189ProPro: 1.189 ± 0.417
2.972ProGln: 2.972 ± 0.641
1.932ProArg: 1.932 ± 0.682
2.229ProSer: 2.229 ± 0.842
2.08ProThr: 2.08 ± 0.544
2.823ProVal: 2.823 ± 0.671
0.149ProTrp: 0.149 ± 0.129
1.04ProTyr: 1.04 ± 0.306
0.0ProXaa: 0.0 ± 0.0
Gln
2.675GlnAla: 2.675 ± 0.648
0.0GlnCys: 0.0 ± 0.0
0.743GlnAsp: 0.743 ± 0.361
2.377GlnGlu: 2.377 ± 0.583
1.189GlnPhe: 1.189 ± 0.503
3.566GlnGly: 3.566 ± 2.128
0.594GlnHis: 0.594 ± 0.249
2.823GlnIle: 2.823 ± 0.681
1.634GlnLys: 1.634 ± 0.514
3.269GlnLeu: 3.269 ± 0.538
1.04GlnMet: 1.04 ± 0.346
1.634GlnAsn: 1.634 ± 0.394
2.377GlnPro: 2.377 ± 0.917
2.377GlnGln: 2.377 ± 0.64
1.634GlnArg: 1.634 ± 0.473
2.675GlnSer: 2.675 ± 1.071
3.269GlnThr: 3.269 ± 0.84
1.932GlnVal: 1.932 ± 0.489
0.0GlnTrp: 0.0 ± 0.0
1.04GlnTyr: 1.04 ± 0.312
0.0GlnXaa: 0.0 ± 0.0
Arg
1.634ArgAla: 1.634 ± 0.595
0.892ArgCys: 0.892 ± 0.374
2.526ArgAsp: 2.526 ± 0.749
4.16ArgGlu: 4.16 ± 0.926
2.08ArgPhe: 2.08 ± 0.509
2.675ArgGly: 2.675 ± 0.682
0.446ArgHis: 0.446 ± 0.228
4.606ArgIle: 4.606 ± 0.814
3.269ArgLys: 3.269 ± 0.556
4.755ArgLeu: 4.755 ± 1.027
0.594ArgMet: 0.594 ± 0.231
2.823ArgAsn: 2.823 ± 0.795
1.932ArgPro: 1.932 ± 0.541
0.892ArgGln: 0.892 ± 0.376
2.972ArgArg: 2.972 ± 0.782
1.486ArgSer: 1.486 ± 0.481
2.229ArgThr: 2.229 ± 0.546
2.675ArgVal: 2.675 ± 0.685
0.892ArgTrp: 0.892 ± 0.377
1.337ArgTyr: 1.337 ± 0.428
0.0ArgXaa: 0.0 ± 0.0
Ser
4.309SerAla: 4.309 ± 1.56
0.446SerCys: 0.446 ± 0.31
2.823SerAsp: 2.823 ± 0.54
2.823SerGlu: 2.823 ± 0.497
1.932SerPhe: 1.932 ± 0.641
4.755SerGly: 4.755 ± 1.513
0.446SerHis: 0.446 ± 0.21
6.835SerIle: 6.835 ± 1.108
4.012SerLys: 4.012 ± 0.846
3.863SerLeu: 3.863 ± 1.111
1.04SerMet: 1.04 ± 0.345
4.903SerAsn: 4.903 ± 0.962
1.932SerPro: 1.932 ± 0.682
1.783SerGln: 1.783 ± 1.023
2.675SerArg: 2.675 ± 0.546
5.498SerSer: 5.498 ± 1.655
3.12SerThr: 3.12 ± 0.819
4.755SerVal: 4.755 ± 1.689
0.0SerTrp: 0.0 ± 0.0
2.972SerTyr: 2.972 ± 0.671
0.0SerXaa: 0.0 ± 0.0
Thr
3.269ThrAla: 3.269 ± 0.733
1.04ThrCys: 1.04 ± 0.528
4.16ThrAsp: 4.16 ± 0.693
3.863ThrGlu: 3.863 ± 0.722
2.675ThrPhe: 2.675 ± 0.565
3.269ThrGly: 3.269 ± 0.611
1.04ThrHis: 1.04 ± 0.405
3.269ThrIle: 3.269 ± 0.821
4.012ThrLys: 4.012 ± 0.689
4.606ThrLeu: 4.606 ± 0.692
0.446ThrMet: 0.446 ± 0.254
4.012ThrAsn: 4.012 ± 0.923
3.418ThrPro: 3.418 ± 0.6
2.972ThrGln: 2.972 ± 1.103
1.634ThrArg: 1.634 ± 0.488
3.715ThrSer: 3.715 ± 0.84
5.052ThrThr: 5.052 ± 1.637
1.932ThrVal: 1.932 ± 0.662
0.594ThrTrp: 0.594 ± 0.239
3.566ThrTyr: 3.566 ± 0.621
0.0ThrXaa: 0.0 ± 0.0
Val
4.606ValAla: 4.606 ± 2.721
0.743ValCys: 0.743 ± 0.322
2.972ValAsp: 2.972 ± 0.524
3.269ValGlu: 3.269 ± 0.538
1.634ValPhe: 1.634 ± 0.528
3.863ValGly: 3.863 ± 0.927
0.446ValHis: 0.446 ± 0.23
3.418ValIle: 3.418 ± 0.626
4.755ValLys: 4.755 ± 1.122
3.12ValLeu: 3.12 ± 0.668
1.634ValMet: 1.634 ± 0.536
3.12ValAsn: 3.12 ± 0.926
1.486ValPro: 1.486 ± 0.555
2.08ValGln: 2.08 ± 0.598
2.08ValArg: 2.08 ± 0.421
3.863ValSer: 3.863 ± 0.894
1.634ValThr: 1.634 ± 0.534
2.526ValVal: 2.526 ± 0.782
0.743ValTrp: 0.743 ± 0.256
1.634ValTyr: 1.634 ± 0.468
0.0ValXaa: 0.0 ± 0.0
Trp
0.149TrpAla: 0.149 ± 0.149
0.149TrpCys: 0.149 ± 0.129
0.297TrpAsp: 0.297 ± 0.204
0.594TrpGlu: 0.594 ± 0.303
0.446TrpPhe: 0.446 ± 0.261
0.594TrpGly: 0.594 ± 0.307
0.149TrpHis: 0.149 ± 0.147
1.04TrpIle: 1.04 ± 0.54
1.04TrpLys: 1.04 ± 0.461
0.743TrpLeu: 0.743 ± 0.259
0.446TrpMet: 0.446 ± 0.267
0.743TrpAsn: 0.743 ± 0.297
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.743TrpArg: 0.743 ± 0.224
0.297TrpSer: 0.297 ± 0.181
0.892TrpThr: 0.892 ± 0.256
0.594TrpVal: 0.594 ± 0.372
0.0TrpTrp: 0.0 ± 0.0
0.149TrpTyr: 0.149 ± 0.151
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.377TyrAla: 2.377 ± 0.395
0.446TyrCys: 0.446 ± 0.238
2.972TyrAsp: 2.972 ± 0.549
2.229TyrGlu: 2.229 ± 0.753
1.634TyrPhe: 1.634 ± 0.397
2.823TyrGly: 2.823 ± 0.537
1.932TyrHis: 1.932 ± 0.707
3.566TyrIle: 3.566 ± 0.878
6.241TyrLys: 6.241 ± 1.295
4.012TyrLeu: 4.012 ± 1.0
1.189TyrMet: 1.189 ± 0.426
3.863TyrAsn: 3.863 ± 0.706
1.783TyrPro: 1.783 ± 0.402
1.783TyrGln: 1.783 ± 0.511
1.486TyrArg: 1.486 ± 0.499
1.932TyrSer: 1.932 ± 0.617
2.823TyrThr: 2.823 ± 0.577
1.337TyrVal: 1.337 ± 0.379
0.297TyrTrp: 0.297 ± 0.192
2.823TyrTyr: 2.823 ± 0.979
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 25 proteins (6731 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski