Amino acid dipepetide frequency for Patois virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.964AlaAla: 2.964 ± 1.63
1.976AlaCys: 1.976 ± 1.15
1.976AlaAsp: 1.976 ± 0.621
2.223AlaGlu: 2.223 ± 0.47
2.223AlaPhe: 2.223 ± 0.388
1.729AlaGly: 1.729 ± 1.252
0.741AlaHis: 0.741 ± 0.354
3.211AlaIle: 3.211 ± 0.735
3.458AlaLys: 3.458 ± 0.83
4.2AlaLeu: 4.2 ± 2.104
0.988AlaMet: 0.988 ± 0.571
3.458AlaAsn: 3.458 ± 1.438
0.988AlaPro: 0.988 ± 0.513
1.976AlaGln: 1.976 ± 0.304
2.964AlaArg: 2.964 ± 0.786
1.235AlaSer: 1.235 ± 0.479
1.976AlaThr: 1.976 ± 1.822
1.976AlaVal: 1.976 ± 1.41
0.247AlaTrp: 0.247 ± 0.156
2.223AlaTyr: 2.223 ± 0.912
0.0AlaXaa: 0.0 ± 0.0
Cys
2.223CysAla: 2.223 ± 0.503
0.247CysCys: 0.247 ± 0.156
0.494CysAsp: 0.494 ± 0.312
0.988CysGlu: 0.988 ± 0.91
1.729CysPhe: 1.729 ± 0.927
2.223CysGly: 2.223 ± 1.061
1.482CysHis: 1.482 ± 0.49
2.964CysIle: 2.964 ± 1.043
2.47CysLys: 2.47 ± 1.28
1.235CysLeu: 1.235 ± 0.494
0.494CysMet: 0.494 ± 0.152
2.223CysAsn: 2.223 ± 0.788
0.741CysPro: 0.741 ± 0.354
0.741CysGln: 0.741 ± 0.208
1.482CysArg: 1.482 ± 0.707
1.482CysSer: 1.482 ± 1.026
2.223CysThr: 2.223 ± 1.061
1.729CysVal: 1.729 ± 1.592
0.247CysTrp: 0.247 ± 0.156
1.482CysTyr: 1.482 ± 0.455
0.0CysXaa: 0.0 ± 0.0
Asp
1.976AspAla: 1.976 ± 1.121
1.235AspCys: 1.235 ± 0.8
2.47AspAsp: 2.47 ± 0.657
2.964AspGlu: 2.964 ± 1.004
3.706AspPhe: 3.706 ± 1.424
2.717AspGly: 2.717 ± 0.441
0.741AspHis: 0.741 ± 0.354
5.929AspIle: 5.929 ± 0.933
3.211AspLys: 3.211 ± 0.944
7.411AspLeu: 7.411 ± 0.885
2.964AspMet: 2.964 ± 1.408
1.976AspAsn: 1.976 ± 0.669
2.47AspPro: 2.47 ± 1.427
3.211AspGln: 3.211 ± 0.859
1.976AspArg: 1.976 ± 1.02
1.729AspSer: 1.729 ± 1.018
2.717AspThr: 2.717 ± 0.365
1.729AspVal: 1.729 ± 0.782
0.988AspTrp: 0.988 ± 1.208
2.964AspTyr: 2.964 ± 0.793
0.0AspXaa: 0.0 ± 0.0
Glu
3.211GluAla: 3.211 ± 0.861
1.235GluCys: 1.235 ± 0.8
3.706GluAsp: 3.706 ± 0.622
3.953GluGlu: 3.953 ± 1.079
5.682GluPhe: 5.682 ± 2.517
1.482GluGly: 1.482 ± 0.415
1.482GluHis: 1.482 ± 0.707
6.423GluIle: 6.423 ± 1.819
3.706GluLys: 3.706 ± 1.201
7.658GluLeu: 7.658 ± 1.523
2.717GluMet: 2.717 ± 1.063
3.706GluAsn: 3.706 ± 0.478
1.235GluPro: 1.235 ± 0.781
3.458GluGln: 3.458 ± 0.935
4.447GluArg: 4.447 ± 1.2
3.458GluSer: 3.458 ± 1.069
2.717GluThr: 2.717 ± 0.841
3.953GluVal: 3.953 ± 1.083
0.988GluTrp: 0.988 ± 0.656
1.976GluTyr: 1.976 ± 0.387
0.0GluXaa: 0.0 ± 0.0
Phe
0.741PheAla: 0.741 ± 0.208
1.976PheCys: 1.976 ± 0.935
3.706PheAsp: 3.706 ± 0.633
3.953PheGlu: 3.953 ± 0.489
2.717PhePhe: 2.717 ± 0.201
2.964PheGly: 2.964 ± 1.346
0.988PheHis: 0.988 ± 0.303
2.717PheIle: 2.717 ± 0.713
5.435PheLys: 5.435 ± 0.492
4.694PheLeu: 4.694 ± 2.616
1.482PheMet: 1.482 ± 0.522
3.706PheAsn: 3.706 ± 0.985
1.482PhePro: 1.482 ± 1.157
0.494PheGln: 0.494 ± 0.152
2.223PheArg: 2.223 ± 0.247
4.2PheSer: 4.2 ± 0.48
2.964PheThr: 2.964 ± 0.83
2.47PheVal: 2.47 ± 0.654
0.988PheTrp: 0.988 ± 0.625
2.964PheTyr: 2.964 ± 1.679
0.0PheXaa: 0.0 ± 0.0
Gly
0.988GlyAla: 0.988 ± 1.27
2.223GlyCys: 2.223 ± 1.061
2.964GlyAsp: 2.964 ± 0.232
3.211GlyGlu: 3.211 ± 0.768
1.482GlyPhe: 1.482 ± 1.263
0.0GlyGly: 0.0 ± 0.0
0.247GlyHis: 0.247 ± 0.227
3.706GlyIle: 3.706 ± 1.599
2.964GlyLys: 2.964 ± 0.383
3.706GlyLeu: 3.706 ± 1.238
0.741GlyMet: 0.741 ± 0.682
3.211GlyAsn: 3.211 ± 0.457
0.741GlyPro: 0.741 ± 0.354
1.729GlyGln: 1.729 ± 0.358
1.729GlyArg: 1.729 ± 0.64
2.964GlySer: 2.964 ± 1.641
3.211GlyThr: 3.211 ± 2.099
2.47GlyVal: 2.47 ± 0.989
0.494GlyTrp: 0.494 ± 0.152
0.988GlyTyr: 0.988 ± 0.303
0.0GlyXaa: 0.0 ± 0.0
His
0.494HisAla: 0.494 ± 0.455
1.235HisCys: 1.235 ± 0.562
0.741HisAsp: 0.741 ± 0.208
1.235HisGlu: 1.235 ± 0.494
0.988HisPhe: 0.988 ± 0.335
1.482HisGly: 1.482 ± 0.516
0.494HisHis: 0.494 ± 0.152
1.976HisIle: 1.976 ± 0.682
0.988HisLys: 0.988 ± 0.335
0.741HisLeu: 0.741 ± 0.632
0.247HisMet: 0.247 ± 0.227
2.717HisAsn: 2.717 ± 0.713
0.494HisPro: 0.494 ± 0.152
0.741HisGln: 0.741 ± 0.783
0.494HisArg: 0.494 ± 0.674
1.976HisSer: 1.976 ± 0.528
2.47HisThr: 2.47 ± 0.411
1.482HisVal: 1.482 ± 0.455
0.247HisTrp: 0.247 ± 0.156
0.741HisTyr: 0.741 ± 0.354
0.0HisXaa: 0.0 ± 0.0
Ile
3.953IleAla: 3.953 ± 2.721
2.964IleCys: 2.964 ± 2.387
3.706IleAsp: 3.706 ± 0.692
6.67IleGlu: 6.67 ± 1.324
3.211IlePhe: 3.211 ± 1.087
3.953IleGly: 3.953 ± 1.833
1.976IleHis: 1.976 ± 0.935
5.435IleIle: 5.435 ± 1.669
8.152IleLys: 8.152 ± 1.462
8.646IleLeu: 8.646 ± 1.683
1.976IleMet: 1.976 ± 0.528
3.706IleAsn: 3.706 ± 1.038
1.976IlePro: 1.976 ± 0.669
2.47IleGln: 2.47 ± 1.28
2.964IleArg: 2.964 ± 0.793
8.646IleSer: 8.646 ± 2.324
4.694IleThr: 4.694 ± 1.246
4.694IleVal: 4.694 ± 0.647
0.741IleTrp: 0.741 ± 0.208
2.223IleTyr: 2.223 ± 0.991
0.0IleXaa: 0.0 ± 0.0
Lys
2.47LysAla: 2.47 ± 1.383
2.223LysCys: 2.223 ± 1.061
5.682LysAsp: 5.682 ± 0.812
5.682LysGlu: 5.682 ± 0.94
3.953LysPhe: 3.953 ± 0.675
3.706LysGly: 3.706 ± 0.623
2.223LysHis: 2.223 ± 0.553
5.435LysIle: 5.435 ± 0.854
4.941LysLys: 4.941 ± 0.425
9.14LysLeu: 9.14 ± 0.263
3.211LysMet: 3.211 ± 0.944
1.976LysAsn: 1.976 ± 0.528
3.458LysPro: 3.458 ± 0.274
3.211LysGln: 3.211 ± 0.457
2.223LysArg: 2.223 ± 0.887
6.176LysSer: 6.176 ± 1.669
6.423LysThr: 6.423 ± 0.578
4.2LysVal: 4.2 ± 0.442
0.988LysTrp: 0.988 ± 0.513
3.953LysTyr: 3.953 ± 1.15
0.0LysXaa: 0.0 ± 0.0
Leu
4.447LeuAla: 4.447 ± 1.097
1.976LeuCys: 1.976 ± 0.426
5.188LeuAsp: 5.188 ± 1.475
6.917LeuGlu: 6.917 ± 1.21
4.694LeuPhe: 4.694 ± 0.923
1.729LeuGly: 1.729 ± 1.264
2.223LeuHis: 2.223 ± 0.912
6.917LeuIle: 6.917 ± 2.124
10.128LeuLys: 10.128 ± 3.25
9.14LeuLeu: 9.14 ± 0.52
2.964LeuMet: 2.964 ± 0.91
6.176LeuAsn: 6.176 ± 1.314
3.953LeuPro: 3.953 ± 0.675
2.717LeuGln: 2.717 ± 1.35
5.682LeuArg: 5.682 ± 2.596
8.399LeuSer: 8.399 ± 2.386
5.929LeuThr: 5.929 ± 3.27
4.941LeuVal: 4.941 ± 1.517
0.247LeuTrp: 0.247 ± 0.156
2.964LeuTyr: 2.964 ± 1.004
0.0LeuXaa: 0.0 ± 0.0
Met
0.741MetAla: 0.741 ± 0.208
0.988MetCys: 0.988 ± 0.303
2.47MetAsp: 2.47 ± 0.816
2.717MetGlu: 2.717 ± 0.365
1.235MetPhe: 1.235 ± 0.49
1.235MetGly: 1.235 ± 0.754
0.494MetHis: 0.494 ± 0.152
4.2MetIle: 4.2 ± 1.209
1.482MetLys: 1.482 ± 0.455
3.706MetLeu: 3.706 ± 1.201
0.988MetMet: 0.988 ± 0.303
1.235MetAsn: 1.235 ± 0.479
2.223MetPro: 2.223 ± 0.784
0.988MetGln: 0.988 ± 0.595
0.988MetArg: 0.988 ± 0.335
2.47MetSer: 2.47 ± 0.583
1.729MetThr: 1.729 ± 0.534
1.235MetVal: 1.235 ± 0.479
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.964AsnAla: 2.964 ± 0.51
1.482AsnCys: 1.482 ± 0.455
3.953AsnAsp: 3.953 ± 1.586
3.706AsnGlu: 3.706 ± 0.364
1.729AsnPhe: 1.729 ± 0.462
2.223AsnGly: 2.223 ± 1.485
1.976AsnHis: 1.976 ± 0.682
4.2AsnIle: 4.2 ± 1.272
5.188AsnLys: 5.188 ± 1.402
3.706AsnLeu: 3.706 ± 0.692
2.223AsnMet: 2.223 ± 0.623
3.211AsnAsn: 3.211 ± 0.257
1.976AsnPro: 1.976 ± 1.026
1.729AsnGln: 1.729 ± 1.093
2.964AsnArg: 2.964 ± 0.793
4.694AsnSer: 4.694 ± 0.889
4.447AsnThr: 4.447 ± 0.76
0.494AsnVal: 0.494 ± 0.152
0.741AsnTrp: 0.741 ± 0.208
2.223AsnTyr: 2.223 ± 0.811
0.0AsnXaa: 0.0 ± 0.0
Pro
1.235ProAla: 1.235 ± 0.479
0.0ProCys: 0.0 ± 0.0
1.729ProAsp: 1.729 ± 0.534
2.223ProGlu: 2.223 ± 1.09
1.729ProPhe: 1.729 ± 0.64
2.223ProGly: 2.223 ± 0.967
0.741ProHis: 0.741 ± 0.682
3.953ProIle: 3.953 ± 1.214
2.964ProLys: 2.964 ± 0.801
2.223ProLeu: 2.223 ± 0.991
0.741ProMet: 0.741 ± 0.691
1.482ProAsn: 1.482 ± 0.516
0.247ProPro: 0.247 ± 0.156
0.494ProGln: 0.494 ± 0.312
1.729ProArg: 1.729 ± 1.278
2.47ProSer: 2.47 ± 0.958
2.47ProThr: 2.47 ± 2.278
1.235ProVal: 1.235 ± 0.692
0.494ProTrp: 0.494 ± 0.312
1.235ProTyr: 1.235 ± 0.328
0.0ProXaa: 0.0 ± 0.0
Gln
2.47GlnAla: 2.47 ± 0.981
0.988GlnCys: 0.988 ± 0.335
2.223GlnAsp: 2.223 ± 2.429
3.458GlnGlu: 3.458 ± 0.924
1.729GlnPhe: 1.729 ± 0.534
1.482GlnGly: 1.482 ± 0.707
0.741GlnHis: 0.741 ± 0.354
2.717GlnIle: 2.717 ± 1.635
2.223GlnLys: 2.223 ± 0.553
3.458GlnLeu: 3.458 ± 0.924
0.988GlnMet: 0.988 ± 0.303
1.482GlnAsn: 1.482 ± 0.625
0.741GlnPro: 0.741 ± 0.354
1.235GlnGln: 1.235 ± 1.205
1.235GlnArg: 1.235 ± 0.479
1.482GlnSer: 1.482 ± 0.49
2.223GlnThr: 2.223 ± 0.613
2.223GlnVal: 2.223 ± 0.553
0.0GlnTrp: 0.0 ± 0.0
1.235GlnTyr: 1.235 ± 0.49
0.0GlnXaa: 0.0 ± 0.0
Arg
2.47ArgAla: 2.47 ± 1.548
1.976ArgCys: 1.976 ± 0.846
2.47ArgAsp: 2.47 ± 0.583
2.964ArgGlu: 2.964 ± 0.232
2.964ArgPhe: 2.964 ± 0.786
1.482ArgGly: 1.482 ± 0.455
1.235ArgHis: 1.235 ± 1.268
4.2ArgIle: 4.2 ± 1.149
3.211ArgLys: 3.211 ± 0.909
4.694ArgLeu: 4.694 ± 1.032
0.988ArgMet: 0.988 ± 0.455
2.47ArgAsn: 2.47 ± 1.562
1.235ArgPro: 1.235 ± 0.568
1.482ArgGln: 1.482 ± 1.86
1.482ArgArg: 1.482 ± 0.415
2.47ArgSer: 2.47 ± 0.583
0.741ArgThr: 0.741 ± 0.208
2.717ArgVal: 2.717 ± 0.859
0.0ArgTrp: 0.0 ± 0.0
1.482ArgTyr: 1.482 ± 0.629
0.0ArgXaa: 0.0 ± 0.0
Ser
2.964SerAla: 2.964 ± 0.793
1.976SerCys: 1.976 ± 1.15
3.953SerAsp: 3.953 ± 0.771
3.953SerGlu: 3.953 ± 0.377
4.941SerPhe: 4.941 ± 0.828
1.235SerGly: 1.235 ± 0.49
1.482SerHis: 1.482 ± 0.393
6.176SerIle: 6.176 ± 2.395
8.399SerLys: 8.399 ± 1.538
9.14SerLeu: 9.14 ± 2.912
1.482SerMet: 1.482 ± 0.629
4.2SerAsn: 4.2 ± 1.754
1.729SerPro: 1.729 ± 0.467
2.47SerGln: 2.47 ± 0.583
2.223SerArg: 2.223 ± 0.247
5.188SerSer: 5.188 ± 1.265
4.447SerThr: 4.447 ± 1.118
4.694SerVal: 4.694 ± 0.783
0.494SerTrp: 0.494 ± 0.312
3.211SerTyr: 3.211 ± 0.524
0.0SerXaa: 0.0 ± 0.0
Thr
3.211ThrAla: 3.211 ± 0.776
1.976ThrCys: 1.976 ± 0.921
4.447ThrAsp: 4.447 ± 0.505
3.953ThrGlu: 3.953 ± 1.89
3.458ThrPhe: 3.458 ± 1.023
2.964ThrGly: 2.964 ± 1.725
0.494ThrHis: 0.494 ± 0.152
5.435ThrIle: 5.435 ± 1.13
4.694ThrLys: 4.694 ± 0.973
4.941ThrLeu: 4.941 ± 3.28
1.729ThrMet: 1.729 ± 0.467
2.223ThrAsn: 2.223 ± 0.623
2.223ThrPro: 2.223 ± 1.18
1.976ThrGln: 1.976 ± 1.201
2.964ThrArg: 2.964 ± 0.83
6.176ThrSer: 6.176 ± 0.999
4.2ThrThr: 4.2 ± 1.448
1.235ThrVal: 1.235 ± 0.328
0.988ThrTrp: 0.988 ± 0.595
3.953ThrTyr: 3.953 ± 1.056
0.0ThrXaa: 0.0 ± 0.0
Val
1.482ValAla: 1.482 ± 0.455
0.988ValCys: 0.988 ± 0.303
1.729ValAsp: 1.729 ± 1.362
2.964ValGlu: 2.964 ± 0.232
2.717ValPhe: 2.717 ± 0.937
2.47ValGly: 2.47 ± 0.989
0.988ValHis: 0.988 ± 0.335
2.223ValIle: 2.223 ± 0.388
3.706ValLys: 3.706 ± 0.607
4.694ValLeu: 4.694 ± 0.506
2.47ValMet: 2.47 ± 0.913
2.964ValAsn: 2.964 ± 0.232
1.729ValPro: 1.729 ± 0.927
1.729ValGln: 1.729 ± 1.058
1.482ValArg: 1.482 ± 0.49
5.188ValSer: 5.188 ± 0.232
3.458ValThr: 3.458 ± 0.935
1.729ValVal: 1.729 ± 1.018
0.247ValTrp: 0.247 ± 0.227
2.47ValTyr: 2.47 ± 0.739
0.0ValXaa: 0.0 ± 0.0
Trp
0.741TrpAla: 0.741 ± 0.632
0.0TrpCys: 0.0 ± 0.0
0.494TrpAsp: 0.494 ± 0.312
0.494TrpGlu: 0.494 ± 0.312
0.494TrpPhe: 0.494 ± 0.152
0.988TrpGly: 0.988 ± 0.595
0.0TrpHis: 0.0 ± 0.0
0.247TrpIle: 0.247 ± 0.156
0.0TrpLys: 0.0 ± 0.0
0.988TrpLeu: 0.988 ± 0.303
0.0TrpMet: 0.0 ± 0.0
0.741TrpAsn: 0.741 ± 0.208
0.0TrpPro: 0.0 ± 0.0
0.494TrpGln: 0.494 ± 0.312
0.247TrpArg: 0.247 ± 0.696
1.482TrpSer: 1.482 ± 0.516
0.247TrpThr: 0.247 ± 0.156
0.741TrpVal: 0.741 ± 0.691
0.0TrpTrp: 0.0 ± 0.0
0.741TrpTyr: 0.741 ± 0.208
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.988TyrAla: 0.988 ± 0.839
1.235TyrCys: 1.235 ± 1.137
0.988TyrAsp: 0.988 ± 0.335
2.964TyrGlu: 2.964 ± 0.83
1.482TyrPhe: 1.482 ± 0.629
1.482TyrGly: 1.482 ± 0.393
1.235TyrHis: 1.235 ± 0.328
4.694TyrIle: 4.694 ± 1.539
3.953TyrLys: 3.953 ± 0.551
3.211TyrLeu: 3.211 ± 0.859
1.482TyrMet: 1.482 ± 0.415
2.964TyrAsn: 2.964 ± 0.83
1.976TyrPro: 1.976 ± 1.052
0.988TyrGln: 0.988 ± 0.588
1.482TyrArg: 1.482 ± 1.2
2.47TyrSer: 2.47 ± 0.739
3.953TyrThr: 3.953 ± 1.871
1.482TyrVal: 1.482 ± 0.629
0.0TyrTrp: 0.0 ± 0.0
1.235TyrTyr: 1.235 ± 0.328
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (4049 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski