Amino acid dipeptide frequency for bat polyomavirus 5b1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
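
As an illustration of what such a calculation can look like, here is a minimal Python sketch. It assumes that dipeptides are counted as overlapping windows of two residues and that pairs never span two proteins; the page does not state how Proteome-pI handles these details, so both choices are assumptions. The function name dipeptide_per_mille and the toy sequences are hypothetical, and the sketch uses one-letter codes where the table uses three-letter codes.

```python
from collections import Counter

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping windows of length 2, counted within each
    protein separately (no pair spans the end of one protein and the
    start of the next).
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Scale the fraction to per mille, the unit used in the table.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy sequences (hypothetical, not taken from the bat polyomavirus proteome).
freqs = dipeptide_per_mille(["MKKLA", "AKK"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

Dipeptides that never occur are simply absent from the returned dictionary, which corresponds to the 0.0 entries in the table.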

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins and all amino acids were equally frequent, each of the 400 dipeptides would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
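
The uniform expectation and the per mille scaling described above can be checked with a few lines of Python. This is only a sketch: the function names are hypothetical, and the rule the page actually uses to colour a cell red or blue is not stated here, so comparing against the uniform expectation is an assumption.

```python
def uniform_expectation_per_mille(n_residue_types=20):
    """Per mille frequency of each dipeptide if all pairs were equally likely."""
    return 1000.0 / (n_residue_types ** 2)

def per_mille_to_fraction(value_per_mille):
    """Convert a table value in per mille to a plain fraction."""
    return value_per_mille * 1e-3

def classify(value_per_mille, n_residue_types=20):
    """Label a value relative to the uniform expectation (assumed rule)."""
    expected = uniform_expectation_per_mille(n_residue_types)
    if value_per_mille > expected:
        return "over"    # would be marked in red
    if value_per_mille < expected:
        return "under"   # would be marked in blue
    return "expected"

print(uniform_expectation_per_mille(20))  # 2.5 (400 possible dipeptides)
print(classify(11.142))                   # GluLeu value from the table
print(classify(0.0))                      # CysHis value from the table
```

Passing n_residue_types=21 gives the expectation for the 441-dipeptide case that includes Xaa.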

For more information, see the sequence space article on Wikipedia.

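Columns (second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Rows are grouped by the first residue. Each entry gives the cell value, followed by the dipeptide and its value ± error (all in ‰).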
Ala
6.267AlaAla: 6.267 ± 4.88
2.089AlaCys: 2.089 ± 0.825
2.089AlaAsp: 2.089 ± 0.629
8.357AlaGlu: 8.357 ± 3.478
0.696AlaPhe: 0.696 ± 0.864
2.089AlaGly: 2.089 ± 0.734
0.696AlaHis: 0.696 ± 0.771
6.267AlaIle: 6.267 ± 2.128
5.571AlaLys: 5.571 ± 2.864
6.964AlaLeu: 6.964 ± 5.825
2.089AlaMet: 2.089 ± 1.677
2.089AlaAsn: 2.089 ± 0.629
3.482AlaPro: 3.482 ± 1.564
2.089AlaGln: 2.089 ± 1.591
3.482AlaArg: 3.482 ± 1.144
2.089AlaSer: 2.089 ± 0.734
3.482AlaThr: 3.482 ± 2.006
5.571AlaVal: 5.571 ± 1.04
2.089AlaTrp: 2.089 ± 0.817
2.786AlaTyr: 2.786 ± 1.163
0.0AlaXaa: 0.0 ± 0.0
Cys
2.089CysAla: 2.089 ± 0.807
1.393CysCys: 1.393 ± 1.727
0.696CysAsp: 0.696 ± 0.679
1.393CysGlu: 1.393 ± 0.9
0.696CysPhe: 0.696 ± 0.864
0.696CysGly: 0.696 ± 0.679
0.0CysHis: 0.0 ± 0.0
2.089CysIle: 2.089 ± 1.362
2.786CysLys: 2.786 ± 0.908
2.089CysLeu: 2.089 ± 1.704
2.089CysMet: 2.089 ± 1.134
1.393CysAsn: 1.393 ± 0.9
1.393CysPro: 1.393 ± 0.604
0.696CysGln: 0.696 ± 0.454
0.0CysArg: 0.0 ± 0.0
2.089CysSer: 2.089 ± 1.202
0.696CysThr: 0.696 ± 0.454
1.393CysVal: 1.393 ± 0.944
0.0CysTrp: 0.0 ± 0.0
3.482CysTyr: 3.482 ± 2.585
0.0CysXaa: 0.0 ± 0.0
Asp
2.786AspAla: 2.786 ± 0.839
0.696AspCys: 0.696 ± 0.864
4.875AspAsp: 4.875 ± 3.177
3.482AspGlu: 3.482 ± 1.187
3.482AspPhe: 3.482 ± 1.863
6.267AspGly: 6.267 ± 1.77
0.696AspHis: 0.696 ± 0.454
2.786AspIle: 2.786 ± 1.271
4.178AspLys: 4.178 ± 2.267
2.786AspLeu: 2.786 ± 1.163
0.0AspMet: 0.0 ± 0.0
1.393AspAsn: 1.393 ± 0.604
3.482AspPro: 3.482 ± 0.747
1.393AspGln: 1.393 ± 0.908
1.393AspArg: 1.393 ± 0.9
3.482AspSer: 3.482 ± 0.961
2.786AspThr: 2.786 ± 1.207
2.786AspVal: 2.786 ± 1.186
1.393AspTrp: 1.393 ± 0.601
2.089AspTyr: 2.089 ± 1.306
0.0AspXaa: 0.0 ± 0.0
Glu
9.749GluAla: 9.749 ± 7.145
1.393GluCys: 1.393 ± 0.908
4.178GluAsp: 4.178 ± 0.948
6.267GluGlu: 6.267 ± 2.419
2.786GluPhe: 2.786 ± 1.799
2.786GluGly: 2.786 ± 1.19
1.393GluHis: 1.393 ± 0.604
0.696GluIle: 0.696 ± 0.771
6.267GluLys: 6.267 ± 1.643
11.142GluLeu: 11.142 ± 4.519
0.696GluMet: 0.696 ± 0.454
5.571GluAsn: 5.571 ± 1.04
1.393GluPro: 1.393 ± 0.604
0.696GluGln: 0.696 ± 0.771
2.089GluArg: 2.089 ± 1.306
2.786GluSer: 2.786 ± 0.839
0.696GluThr: 0.696 ± 0.864
2.089GluVal: 2.089 ± 1.48
0.0GluTrp: 0.0 ± 0.0
2.089GluTyr: 2.089 ± 1.134
0.0GluXaa: 0.0 ± 0.0
Phe
5.571PheAla: 5.571 ± 1.973
2.786PheCys: 2.786 ± 2.548
0.0PheAsp: 0.0 ± 0.0
2.089PheGlu: 2.089 ± 1.362
1.393PhePhe: 1.393 ± 0.604
2.089PheGly: 2.089 ± 0.807
1.393PheHis: 1.393 ± 1.022
1.393PheIle: 1.393 ± 0.601
2.786PheLys: 2.786 ± 1.816
5.571PheLeu: 5.571 ± 1.835
2.089PheMet: 2.089 ± 1.553
2.089PheAsn: 2.089 ± 1.032
2.089PhePro: 2.089 ± 0.734
2.089PheGln: 2.089 ± 0.817
1.393PheArg: 1.393 ± 0.604
1.393PheSer: 1.393 ± 0.601
2.786PheThr: 2.786 ± 1.559
2.089PheVal: 2.089 ± 0.734
0.696PheTrp: 0.696 ± 0.864
2.089PheTyr: 2.089 ± 0.825
0.0PheXaa: 0.0 ± 0.0
Gly
3.482GlyAla: 3.482 ± 1.802
0.696GlyCys: 0.696 ± 0.454
4.875GlyAsp: 4.875 ± 1.216
3.482GlyGlu: 3.482 ± 1.564
3.482GlyPhe: 3.482 ± 1.144
6.964GlyGly: 6.964 ± 1.674
0.696GlyHis: 0.696 ± 0.454
4.875GlyIle: 4.875 ± 1.851
2.786GlyLys: 2.786 ± 1.186
4.875GlyLeu: 4.875 ± 1.851
2.089GlyMet: 2.089 ± 2.037
1.393GlyAsn: 1.393 ± 0.9
6.267GlyPro: 6.267 ± 0.895
4.178GlyGln: 4.178 ± 0.953
2.089GlyArg: 2.089 ± 1.306
2.089GlySer: 2.089 ± 0.734
1.393GlyThr: 1.393 ± 1.358
6.267GlyVal: 6.267 ± 1.024
0.0GlyTrp: 0.0 ± 0.0
1.393GlyTyr: 1.393 ± 1.358
0.0GlyXaa: 0.0 ± 0.0
His
1.393HisAla: 1.393 ± 0.908
2.089HisCys: 2.089 ± 1.134
2.089HisAsp: 2.089 ± 0.734
0.696HisGlu: 0.696 ± 0.864
1.393HisPhe: 1.393 ± 1.358
0.0HisGly: 0.0 ± 0.0
1.393HisHis: 1.393 ± 0.601
0.0HisIle: 0.0 ± 0.0
1.393HisLys: 1.393 ± 0.9
2.786HisLeu: 2.786 ± 1.207
0.0HisMet: 0.0 ± 0.0
0.696HisAsn: 0.696 ± 0.454
1.393HisPro: 1.393 ± 0.9
0.696HisGln: 0.696 ± 0.771
0.696HisArg: 0.696 ± 0.454
1.393HisSer: 1.393 ± 1.543
0.696HisThr: 0.696 ± 0.864
1.393HisVal: 1.393 ± 0.908
0.696HisTrp: 0.696 ± 0.864
1.393HisTyr: 1.393 ± 0.908
0.0HisXaa: 0.0 ± 0.0
Ile
1.393IleAla: 1.393 ± 1.358
1.393IleCys: 1.393 ± 0.604
2.786IleAsp: 2.786 ± 1.816
5.571IleGlu: 5.571 ± 3.588
0.0IlePhe: 0.0 ± 0.0
0.696IleGly: 0.696 ± 0.771
0.0IleHis: 0.0 ± 0.0
2.089IleIle: 2.089 ± 0.734
2.089IleLys: 2.089 ± 1.362
6.267IleLeu: 6.267 ± 1.872
2.089IleMet: 2.089 ± 1.265
2.786IleAsn: 2.786 ± 0.839
4.178IlePro: 4.178 ± 1.252
0.696IleGln: 0.696 ± 0.454
1.393IleArg: 1.393 ± 0.601
2.089IleSer: 2.089 ± 2.314
6.267IleThr: 6.267 ± 3.01
2.786IleVal: 2.786 ± 1.207
1.393IleTrp: 1.393 ± 1.543
3.482IleTyr: 3.482 ± 1.996
0.0IleXaa: 0.0 ± 0.0
Lys
3.482LysAla: 3.482 ± 1.777
2.089LysCys: 2.089 ± 0.807
1.393LysAsp: 1.393 ± 0.604
2.089LysGlu: 2.089 ± 1.704
2.089LysPhe: 2.089 ± 1.134
6.267LysGly: 6.267 ± 2.753
2.089LysHis: 2.089 ± 1.362
5.571LysIle: 5.571 ± 3.189
9.053LysLys: 9.053 ± 2.324
6.267LysLeu: 6.267 ± 2.21
4.875LysMet: 4.875 ± 1.657
2.089LysAsn: 2.089 ± 0.825
3.482LysPro: 3.482 ± 2.269
2.089LysGln: 2.089 ± 1.134
6.267LysArg: 6.267 ± 1.104
3.482LysSer: 3.482 ± 1.187
6.964LysThr: 6.964 ± 3.192
2.089LysVal: 2.089 ± 1.134
0.0LysTrp: 0.0 ± 0.0
1.393LysTyr: 1.393 ± 0.604
0.0LysXaa: 0.0 ± 0.0
Leu
5.571LeuAla: 5.571 ± 3.35
2.089LeuCys: 2.089 ± 0.807
5.571LeuAsp: 5.571 ± 2.372
4.875LeuGlu: 4.875 ± 1.836
7.66LeuPhe: 7.66 ± 1.345
4.178LeuGly: 4.178 ± 1.764
2.089LeuHis: 2.089 ± 1.134
5.571LeuIle: 5.571 ± 1.973
3.482LeuLys: 3.482 ± 1.863
8.357LeuLeu: 8.357 ± 2.081
2.786LeuMet: 2.786 ± 0.949
6.267LeuAsn: 6.267 ± 1.536
6.964LeuPro: 6.964 ± 1.426
7.66LeuGln: 7.66 ± 1.699
3.482LeuArg: 3.482 ± 0.344
4.875LeuSer: 4.875 ± 1.061
6.267LeuThr: 6.267 ± 2.042
5.571LeuVal: 5.571 ± 1.241
2.089LeuTrp: 2.089 ± 1.704
2.089LeuTyr: 2.089 ± 0.734
0.0LeuXaa: 0.0 ± 0.0
Met
2.786MetAla: 2.786 ± 1.163
0.696MetCys: 0.696 ± 0.864
2.786MetAsp: 2.786 ± 1.799
0.696MetGlu: 0.696 ± 0.454
0.696MetPhe: 0.696 ± 0.454
2.089MetGly: 2.089 ± 0.629
1.393MetHis: 1.393 ± 0.604
1.393MetIle: 1.393 ± 0.9
3.482MetLys: 3.482 ± 1.996
2.786MetLeu: 2.786 ± 1.202
1.393MetMet: 1.393 ± 0.9
2.089MetAsn: 2.089 ± 1.134
0.696MetPro: 0.696 ± 0.679
0.696MetGln: 0.696 ± 0.454
0.0MetArg: 0.0 ± 0.0
2.786MetSer: 2.786 ± 0.621
2.089MetThr: 2.089 ± 1.399
1.393MetVal: 1.393 ± 0.604
0.696MetTrp: 0.696 ± 0.679
0.696MetTyr: 0.696 ± 0.679
0.0MetXaa: 0.0 ± 0.0
Asn
2.089AsnAla: 2.089 ± 0.817
1.393AsnCys: 1.393 ± 0.908
0.696AsnAsp: 0.696 ± 0.454
4.178AsnGlu: 4.178 ± 0.675
1.393AsnPhe: 1.393 ± 0.908
2.089AsnGly: 2.089 ± 1.48
0.696AsnHis: 0.696 ± 0.454
1.393AsnIle: 1.393 ± 0.908
3.482AsnLys: 3.482 ± 1.863
6.267AsnLeu: 6.267 ± 2.46
0.696AsnMet: 0.696 ± 0.771
1.393AsnAsn: 1.393 ± 0.601
3.482AsnPro: 3.482 ± 1.968
2.786AsnGln: 2.786 ± 0.52
0.696AsnArg: 0.696 ± 0.771
1.393AsnSer: 1.393 ± 0.601
2.786AsnThr: 2.786 ± 1.163
4.178AsnVal: 4.178 ± 0.953
0.696AsnTrp: 0.696 ± 0.771
2.089AsnTyr: 2.089 ± 0.817
0.0AsnXaa: 0.0 ± 0.0
Pro
4.178ProAla: 4.178 ± 1.738
1.393ProCys: 1.393 ± 0.9
6.267ProAsp: 6.267 ± 1.135
2.786ProGlu: 2.786 ± 0.52
1.393ProPhe: 1.393 ± 0.908
5.571ProGly: 5.571 ± 0.651
2.089ProHis: 2.089 ± 0.817
1.393ProIle: 1.393 ± 0.604
6.267ProLys: 6.267 ± 2.042
4.875ProLeu: 4.875 ± 1.061
1.393ProMet: 1.393 ± 0.9
0.0ProAsn: 0.0 ± 0.0
6.964ProPro: 6.964 ± 1.376
1.393ProGln: 1.393 ± 0.601
2.089ProArg: 2.089 ± 1.48
3.482ProSer: 3.482 ± 0.961
1.393ProThr: 1.393 ± 1.358
4.875ProVal: 4.875 ± 3.04
0.0ProTrp: 0.0 ± 0.0
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
3.482GlnAla: 3.482 ± 1.025
0.696GlnCys: 0.696 ± 0.454
2.786GlnAsp: 2.786 ± 0.839
1.393GlnGlu: 1.393 ± 0.908
2.786GlnPhe: 2.786 ± 0.908
1.393GlnGly: 1.393 ± 0.604
1.393GlnHis: 1.393 ± 1.022
4.875GlnIle: 4.875 ± 0.906
2.786GlnLys: 2.786 ± 1.474
1.393GlnLeu: 1.393 ± 0.601
1.393GlnMet: 1.393 ± 1.358
2.786GlnAsn: 2.786 ± 1.063
1.393GlnPro: 1.393 ± 0.604
2.089GlnGln: 2.089 ± 1.134
1.393GlnArg: 1.393 ± 0.601
2.786GlnSer: 2.786 ± 1.816
2.089GlnThr: 2.089 ± 1.032
3.482GlnVal: 3.482 ± 1.564
0.0GlnTrp: 0.0 ± 0.0
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
1.393ArgAla: 1.393 ± 1.543
0.696ArgCys: 0.696 ± 0.864
2.786ArgAsp: 2.786 ± 0.839
4.178ArgGlu: 4.178 ± 1.632
3.482ArgPhe: 3.482 ± 1.863
1.393ArgGly: 1.393 ± 0.604
0.0ArgHis: 0.0 ± 0.0
2.786ArgIle: 2.786 ± 1.271
2.089ArgLys: 2.089 ± 1.202
3.482ArgLeu: 3.482 ± 2.822
2.089ArgMet: 2.089 ± 0.807
2.089ArgAsn: 2.089 ± 0.629
1.393ArgPro: 1.393 ± 0.601
0.696ArgGln: 0.696 ± 0.771
3.482ArgArg: 3.482 ± 2.822
0.696ArgSer: 0.696 ± 0.454
0.696ArgThr: 0.696 ± 0.679
1.393ArgVal: 1.393 ± 0.601
1.393ArgTrp: 1.393 ± 1.022
2.786ArgTyr: 2.786 ± 1.93
0.0ArgXaa: 0.0 ± 0.0
Ser
5.571SerAla: 5.571 ± 2.269
2.089SerCys: 2.089 ± 2.037
2.089SerAsp: 2.089 ± 0.825
1.393SerGlu: 1.393 ± 0.908
4.875SerPhe: 4.875 ± 0.554
3.482SerGly: 3.482 ± 1.263
0.696SerHis: 0.696 ± 0.454
2.786SerIle: 2.786 ± 1.93
2.786SerLys: 2.786 ± 1.19
5.571SerLeu: 5.571 ± 1.5
1.393SerMet: 1.393 ± 0.965
1.393SerAsn: 1.393 ± 0.908
2.786SerPro: 2.786 ± 1.207
2.786SerGln: 2.786 ± 1.186
1.393SerArg: 1.393 ± 0.9
7.66SerSer: 7.66 ± 2.045
3.482SerThr: 3.482 ± 0.747
3.482SerVal: 3.482 ± 1.564
1.393SerTrp: 1.393 ± 0.965
1.393SerTyr: 1.393 ± 0.9
0.0SerXaa: 0.0 ± 0.0
Thr
2.786ThrAla: 2.786 ± 1.31
0.696ThrCys: 0.696 ± 0.679
0.696ThrAsp: 0.696 ± 0.679
4.178ThrGlu: 4.178 ± 1.704
1.393ThrPhe: 1.393 ± 0.601
5.571ThrGly: 5.571 ± 3.117
2.786ThrHis: 2.786 ± 0.908
0.696ThrIle: 0.696 ± 0.454
2.089ThrLys: 2.089 ± 1.202
7.66ThrLeu: 7.66 ± 2.75
0.696ThrMet: 0.696 ± 0.454
2.089ThrAsn: 2.089 ± 1.61
2.786ThrPro: 2.786 ± 1.271
2.786ThrGln: 2.786 ± 1.186
3.482ThrArg: 3.482 ± 0.344
2.786ThrSer: 2.786 ± 1.857
4.875ThrThr: 4.875 ± 1.992
4.178ThrVal: 4.178 ± 1.581
0.0ThrTrp: 0.0 ± 0.0
2.089ThrTyr: 2.089 ± 1.202
0.0ThrXaa: 0.0 ± 0.0
Val
4.178ValAla: 4.178 ± 1.257
2.089ValCys: 2.089 ± 1.704
3.482ValAsp: 3.482 ± 1.372
4.875ValGlu: 4.875 ± 1.559
1.393ValPhe: 1.393 ± 0.908
4.875ValGly: 4.875 ± 2.236
0.696ValHis: 0.696 ± 0.679
2.089ValIle: 2.089 ± 0.734
4.875ValLys: 4.875 ± 1.628
4.875ValLeu: 4.875 ± 3.054
0.0ValMet: 0.0 ± 0.0
4.875ValAsn: 4.875 ± 1.076
2.089ValPro: 2.089 ± 0.629
2.089ValGln: 2.089 ± 0.629
2.786ValArg: 2.786 ± 2.333
6.267ValSer: 6.267 ± 0.177
2.786ValThr: 2.786 ± 1.163
4.875ValVal: 4.875 ± 1.559
1.393ValTrp: 1.393 ± 0.944
0.696ValTyr: 0.696 ± 0.454
0.0ValXaa: 0.0 ± 0.0
Trp
1.393TrpAla: 1.393 ± 1.727
0.0TrpCys: 0.0 ± 0.0
0.696TrpAsp: 0.696 ± 0.771
1.393TrpGlu: 1.393 ± 0.944
1.393TrpPhe: 1.393 ± 1.727
2.089TrpGly: 2.089 ± 1.591
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.696TrpLys: 0.696 ± 0.454
0.696TrpLeu: 0.696 ± 0.771
0.696TrpMet: 0.696 ± 0.771
0.696TrpAsn: 0.696 ± 0.454
0.0TrpPro: 0.0 ± 0.0
1.393TrpGln: 1.393 ± 0.9
0.696TrpArg: 0.696 ± 0.771
1.393TrpSer: 1.393 ± 1.358
0.0TrpThr: 0.0 ± 0.0
0.696TrpVal: 0.696 ± 0.771
0.696TrpTrp: 0.696 ± 0.864
0.696TrpTyr: 0.696 ± 0.454
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.696TyrAla: 0.696 ± 0.679
0.696TyrCys: 0.696 ± 0.454
1.393TyrAsp: 1.393 ± 0.965
1.393TyrGlu: 1.393 ± 1.022
1.393TyrPhe: 1.393 ± 1.358
2.786TyrGly: 2.786 ± 1.31
2.089TyrHis: 2.089 ± 1.134
0.696TyrIle: 0.696 ± 0.771
4.178TyrLys: 4.178 ± 2.267
3.482TyrLeu: 3.482 ± 0.344
2.089TyrMet: 2.089 ± 1.362
0.696TyrAsn: 0.696 ± 0.454
2.089TyrPro: 2.089 ± 1.399
1.393TyrGln: 1.393 ± 0.908
0.696TyrArg: 0.696 ± 0.864
3.482TyrSer: 3.482 ± 0.961
2.089TyrThr: 2.089 ± 1.202
0.696TyrVal: 0.696 ± 0.454
0.696TyrTrp: 0.696 ± 0.454
1.393TyrTyr: 1.393 ± 1.543
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1437 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
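
A protein-level bootstrap of this kind can be sketched as follows. The sketch assumes that each replicate draws as many proteins as the proteome contains, with replacement, recomputes the dipeptide frequency, and reports the standard deviation over the replicates as the error. The exact procedure used by Proteome-pI is not described on this page, so the function names, the fixed seed, and the choice of the sample standard deviation are all assumptions.

```python
import random
import statistics

def pair_per_mille(proteins, pair):
    """Per mille frequency of one dipeptide across a list of sequences."""
    total = sum(max(len(seq) - 1, 0) for seq in proteins)
    hits = sum(
        1
        for seq in proteins
        for i in range(len(seq) - 1)
        if seq[i:i + 2] == pair
    )
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)

# Toy proteome (hypothetical sequences, one-letter codes).
toy = ["MKKLA", "AKK", "GGKKG", "LLLL"]
print(pair_per_mille(toy, "KK"), bootstrap_error(toy, "KK"))
```

With only a handful of proteins, as here, the resampled proteomes differ strongly from one another, which is one plausible reason why some errors in the table are of the same order as the values themselves. A dipeptide that never occurs gets a frequency and an error of exactly zero.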

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski