Amino acid dipepetide frequency for Taggert virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.584AlaAla: 2.584 ± 1.504
1.378AlaCys: 1.378 ± 0.402
1.378AlaAsp: 1.378 ± 1.807
1.723AlaGlu: 1.723 ± 0.636
2.756AlaPhe: 2.756 ± 0.951
3.101AlaGly: 3.101 ± 0.789
0.517AlaHis: 0.517 ± 0.418
3.618AlaIle: 3.618 ± 1.252
2.584AlaLys: 2.584 ± 0.339
3.445AlaLeu: 3.445 ± 0.203
1.723AlaMet: 1.723 ± 0.842
2.929AlaAsn: 2.929 ± 0.869
1.55AlaPro: 1.55 ± 0.141
1.206AlaGln: 1.206 ± 0.452
3.618AlaArg: 3.618 ± 0.555
4.479AlaSer: 4.479 ± 1.009
2.412AlaThr: 2.412 ± 1.538
3.618AlaVal: 3.618 ± 0.299
1.378AlaTrp: 1.378 ± 0.747
0.861AlaTyr: 0.861 ± 0.866
0.0AlaXaa: 0.0 ± 0.0
Cys
2.239CysAla: 2.239 ± 1.138
1.723CysCys: 1.723 ± 0.536
1.206CysAsp: 1.206 ± 0.406
1.378CysGlu: 1.378 ± 0.488
0.345CysPhe: 0.345 ± 0.143
0.689CysGly: 0.689 ± 0.514
0.172CysHis: 0.172 ± 0.086
1.723CysIle: 1.723 ± 0.447
1.378CysLys: 1.378 ± 0.602
2.239CysLeu: 2.239 ± 0.608
0.345CysMet: 0.345 ± 0.173
1.378CysAsn: 1.378 ± 0.602
1.378CysPro: 1.378 ± 1.028
1.206CysGln: 1.206 ± 0.605
1.55CysArg: 1.55 ± 0.416
2.067CysSer: 2.067 ± 0.673
2.584CysThr: 2.584 ± 1.1
1.378CysVal: 1.378 ± 0.79
0.517CysTrp: 0.517 ± 0.325
0.689CysTyr: 0.689 ± 0.514
0.0CysXaa: 0.0 ± 0.0
Asp
2.584AspAla: 2.584 ± 0.975
1.206AspCys: 1.206 ± 0.606
2.239AspAsp: 2.239 ± 0.068
3.445AspGlu: 3.445 ± 0.731
1.378AspPhe: 1.378 ± 0.275
1.895AspGly: 1.895 ± 0.572
0.861AspHis: 0.861 ± 0.268
3.79AspIle: 3.79 ± 0.983
4.479AspLys: 4.479 ± 2.089
4.651AspLeu: 4.651 ± 0.462
1.206AspMet: 1.206 ± 0.263
2.584AspAsn: 2.584 ± 0.954
1.55AspPro: 1.55 ± 0.141
1.206AspGln: 1.206 ± 0.45
1.55AspArg: 1.55 ± 0.518
5.857AspSer: 5.857 ± 1.526
2.239AspThr: 2.239 ± 0.536
2.239AspVal: 2.239 ± 0.507
1.206AspTrp: 1.206 ± 0.406
2.412AspTyr: 2.412 ± 0.398
0.0AspXaa: 0.0 ± 0.0
Glu
3.618GluAla: 3.618 ± 1.254
1.55GluCys: 1.55 ± 0.141
4.479GluAsp: 4.479 ± 1.419
5.168GluGlu: 5.168 ± 1.625
2.412GluPhe: 2.412 ± 0.675
3.618GluGly: 3.618 ± 0.299
1.723GluHis: 1.723 ± 0.7
2.929GluIle: 2.929 ± 1.058
3.273GluLys: 3.273 ± 0.26
9.302GluLeu: 9.302 ± 2.837
1.895GluMet: 1.895 ± 0.805
3.273GluAsn: 3.273 ± 0.86
1.723GluPro: 1.723 ± 0.191
3.101GluGln: 3.101 ± 0.975
3.962GluArg: 3.962 ± 0.485
7.235GluSer: 7.235 ± 1.89
4.307GluThr: 4.307 ± 0.581
4.479GluVal: 4.479 ± 0.979
0.517GluTrp: 0.517 ± 0.139
1.378GluTyr: 1.378 ± 0.364
0.0GluXaa: 0.0 ± 0.0
Phe
1.723PheAla: 1.723 ± 0.636
0.861PheCys: 0.861 ± 0.465
1.895PheAsp: 1.895 ± 0.175
3.618PheGlu: 3.618 ± 0.79
2.756PhePhe: 2.756 ± 0.722
1.895PheGly: 1.895 ± 0.63
0.861PheHis: 0.861 ± 0.268
2.584PheIle: 2.584 ± 0.241
3.101PheLys: 3.101 ± 0.31
3.79PheLeu: 3.79 ± 0.969
1.378PheMet: 1.378 ± 0.691
1.206PheAsn: 1.206 ± 0.312
1.895PhePro: 1.895 ± 0.63
1.378PheGln: 1.378 ± 0.848
1.206PheArg: 1.206 ± 0.452
4.651PheSer: 4.651 ± 0.604
2.239PheThr: 2.239 ± 0.584
0.689PheVal: 0.689 ± 0.399
0.345PheTrp: 0.345 ± 0.173
1.378PheTyr: 1.378 ± 0.499
0.0PheXaa: 0.0 ± 0.0
Gly
2.239GlyAla: 2.239 ± 0.986
1.723GlyCys: 1.723 ± 0.93
2.929GlyAsp: 2.929 ± 1.479
2.756GlyGlu: 2.756 ± 0.327
0.861GlyPhe: 0.861 ± 0.318
1.895GlyGly: 1.895 ± 0.355
1.206GlyHis: 1.206 ± 0.263
3.101GlyIle: 3.101 ± 0.665
4.651GlyLys: 4.651 ± 0.961
6.718GlyLeu: 6.718 ± 1.057
1.723GlyMet: 1.723 ± 0.365
2.239GlyAsn: 2.239 ± 1.061
1.895GlyPro: 1.895 ± 0.76
1.723GlyGln: 1.723 ± 0.133
3.618GlyArg: 3.618 ± 0.145
3.962GlySer: 3.962 ± 0.373
4.651GlyThr: 4.651 ± 1.464
3.101GlyVal: 3.101 ± 0.909
0.517GlyTrp: 0.517 ± 0.259
1.378GlyTyr: 1.378 ± 0.402
0.0GlyXaa: 0.0 ± 0.0
His
2.067HisAla: 2.067 ± 0.019
0.689HisCys: 0.689 ± 0.182
0.517HisAsp: 0.517 ± 0.139
1.034HisGlu: 1.034 ± 0.325
0.517HisPhe: 0.517 ± 0.139
1.723HisGly: 1.723 ± 0.435
0.689HisHis: 0.689 ± 0.182
1.378HisIle: 1.378 ± 0.402
1.723HisLys: 1.723 ± 0.191
2.756HisLeu: 2.756 ± 0.727
0.517HisMet: 0.517 ± 0.259
0.517HisAsn: 0.517 ± 0.259
1.034HisPro: 1.034 ± 0.817
0.345HisGln: 0.345 ± 0.511
0.517HisArg: 0.517 ± 0.259
2.584HisSer: 2.584 ± 0.804
1.206HisThr: 1.206 ± 0.45
1.206HisVal: 1.206 ± 0.263
0.345HisTrp: 0.345 ± 0.143
0.861HisTyr: 0.861 ± 0.465
0.0HisXaa: 0.0 ± 0.0
Ile
2.067IleAla: 2.067 ± 0.555
1.55IleCys: 1.55 ± 0.416
3.273IleAsp: 3.273 ± 1.323
4.479IleGlu: 4.479 ± 1.216
2.412IlePhe: 2.412 ± 0.654
2.067IleGly: 2.067 ± 0.68
1.55IleHis: 1.55 ± 0.416
3.79IleIle: 3.79 ± 0.724
5.34IleLys: 5.34 ± 1.382
6.374IleLeu: 6.374 ± 0.699
1.378IleMet: 1.378 ± 0.38
2.067IleAsn: 2.067 ± 1.633
2.756IlePro: 2.756 ± 0.564
2.756IleGln: 2.756 ± 1.382
3.273IleArg: 3.273 ± 0.812
5.685IleSer: 5.685 ± 1.055
3.618IleThr: 3.618 ± 0.416
3.618IleVal: 3.618 ± 0.54
0.517IleTrp: 0.517 ± 0.259
2.412IleTyr: 2.412 ± 0.154
0.0IleXaa: 0.0 ± 0.0
Lys
3.962LysAla: 3.962 ± 0.776
1.034LysCys: 1.034 ± 0.428
4.823LysAsp: 4.823 ± 1.048
7.752LysGlu: 7.752 ± 1.777
2.412LysPhe: 2.412 ± 0.675
3.79LysGly: 3.79 ± 0.674
1.206LysHis: 1.206 ± 0.263
3.962LysIle: 3.962 ± 1.102
5.34LysLys: 5.34 ± 1.158
8.441LysLeu: 8.441 ± 1.513
0.861LysMet: 0.861 ± 0.866
4.134LysAsn: 4.134 ± 0.037
3.445LysPro: 3.445 ± 0.895
1.55LysGln: 1.55 ± 0.571
1.895LysArg: 1.895 ± 0.74
4.651LysSer: 4.651 ± 1.206
5.168LysThr: 5.168 ± 0.832
4.134LysVal: 4.134 ± 0.037
1.206LysTrp: 1.206 ± 1.315
0.517LysTyr: 0.517 ± 0.948
0.0LysXaa: 0.0 ± 0.0
Leu
4.651LeuAla: 4.651 ± 0.462
2.584LeuCys: 2.584 ± 0.605
4.823LeuAsp: 4.823 ± 0.751
8.441LeuGlu: 8.441 ± 1.688
3.273LeuPhe: 3.273 ± 0.775
5.857LeuGly: 5.857 ± 0.586
3.618LeuHis: 3.618 ± 0.789
6.029LeuIle: 6.029 ± 1.56
8.441LeuLys: 8.441 ± 1.701
12.748LeuLeu: 12.748 ± 2.361
1.378LeuMet: 1.378 ± 0.364
6.718LeuAsn: 6.718 ± 2.016
3.618LeuPro: 3.618 ± 0.416
2.412LeuGln: 2.412 ± 0.81
5.34LeuArg: 5.34 ± 1.296
9.819LeuSer: 9.819 ± 0.941
9.13LeuThr: 9.13 ± 1.255
7.58LeuVal: 7.58 ± 1.082
0.345LeuTrp: 0.345 ± 0.173
2.584LeuTyr: 2.584 ± 0.532
0.0LeuXaa: 0.0 ± 0.0
Met
0.861MetAla: 0.861 ± 0.318
0.345MetCys: 0.345 ± 0.143
1.034MetAsp: 1.034 ± 0.418
1.034MetGlu: 1.034 ± 0.278
1.034MetPhe: 1.034 ± 0.325
1.55MetGly: 1.55 ± 0.141
0.517MetHis: 0.517 ± 0.439
2.412MetIle: 2.412 ± 0.693
1.55MetLys: 1.55 ± 0.571
4.823MetLeu: 4.823 ± 1.591
0.689MetMet: 0.689 ± 0.904
1.034MetAsn: 1.034 ± 0.278
0.0MetPro: 0.0 ± 0.0
0.689MetGln: 0.689 ± 0.399
1.723MetArg: 1.723 ± 0.93
2.756MetSer: 2.756 ± 0.384
1.723MetThr: 1.723 ± 0.133
1.034MetVal: 1.034 ± 0.817
0.0MetTrp: 0.0 ± 0.0
0.345MetTyr: 0.345 ± 0.143
0.0MetXaa: 0.0 ± 0.0
Asn
1.206AsnAla: 1.206 ± 0.769
1.206AsnCys: 1.206 ± 0.406
1.206AsnAsp: 1.206 ± 0.263
2.412AsnGlu: 2.412 ± 0.398
2.239AsnPhe: 2.239 ± 1.192
2.067AsnGly: 2.067 ± 0.68
1.206AsnHis: 1.206 ± 0.262
4.823AsnIle: 4.823 ± 0.542
2.412AsnLys: 2.412 ± 0.154
5.34AsnLeu: 5.34 ± 0.631
1.723AsnMet: 1.723 ± 0.536
2.412AsnAsn: 2.412 ± 0.524
1.723AsnPro: 1.723 ± 0.691
1.55AsnGln: 1.55 ± 0.141
2.584AsnArg: 2.584 ± 0.419
6.029AsnSer: 6.029 ± 0.706
3.618AsnThr: 3.618 ± 0.299
3.618AsnVal: 3.618 ± 1.771
0.861AsnTrp: 0.861 ± 0.268
1.206AsnTyr: 1.206 ± 0.452
0.0AsnXaa: 0.0 ± 0.0
Pro
2.067ProAla: 2.067 ± 0.019
0.861ProCys: 0.861 ± 0.421
2.412ProAsp: 2.412 ± 0.155
3.273ProGlu: 3.273 ± 0.39
1.895ProPhe: 1.895 ± 1.183
1.723ProGly: 1.723 ± 0.191
1.034ProHis: 1.034 ± 0.428
1.378ProIle: 1.378 ± 0.364
1.895ProLys: 1.895 ± 0.492
2.756ProLeu: 2.756 ± 1.203
0.689ProMet: 0.689 ± 0.345
1.55ProAsn: 1.55 ± 0.312
0.689ProPro: 0.689 ± 0.373
1.206ProGln: 1.206 ± 0.606
2.067ProArg: 2.067 ± 0.661
2.929ProSer: 2.929 ± 0.325
2.239ProThr: 2.239 ± 0.507
2.929ProVal: 2.929 ± 0.719
0.861ProTrp: 0.861 ± 0.623
0.689ProTyr: 0.689 ± 0.182
0.0ProXaa: 0.0 ± 0.0
Gln
2.239GlnAla: 2.239 ± 1.594
0.689GlnCys: 0.689 ± 0.182
1.034GlnAsp: 1.034 ± 0.279
2.412GlnGlu: 2.412 ± 0.693
1.723GlnPhe: 1.723 ± 0.619
1.723GlnGly: 1.723 ± 0.691
0.517GlnHis: 0.517 ± 0.325
2.584GlnIle: 2.584 ± 0.975
2.239GlnLys: 2.239 ± 1.123
3.962GlnLeu: 3.962 ± 0.373
1.378GlnMet: 1.378 ± 0.364
1.206GlnAsn: 1.206 ± 0.262
1.895GlnPro: 1.895 ± 0.175
2.584GlnGln: 2.584 ± 0.746
0.517GlnArg: 0.517 ± 0.259
2.239GlnSer: 2.239 ± 0.068
2.584GlnThr: 2.584 ± 0.532
1.895GlnVal: 1.895 ± 0.662
0.172GlnTrp: 0.172 ± 0.086
0.689GlnTyr: 0.689 ± 0.182
0.0GlnXaa: 0.0 ± 0.0
Arg
1.895ArgAla: 1.895 ± 0.945
1.206ArgCys: 1.206 ± 0.312
2.067ArgAsp: 2.067 ± 0.501
2.412ArgGlu: 2.412 ± 0.155
2.756ArgPhe: 2.756 ± 0.952
3.101ArgGly: 3.101 ± 1.036
1.206ArgHis: 1.206 ± 0.605
2.239ArgIle: 2.239 ± 0.576
2.412ArgLys: 2.412 ± 0.812
5.168ArgLeu: 5.168 ± 0.96
1.55ArgMet: 1.55 ± 0.688
2.584ArgAsn: 2.584 ± 0.481
0.689ArgPro: 0.689 ± 0.285
2.412ArgGln: 2.412 ± 0.527
3.962ArgArg: 3.962 ± 0.16
3.273ArgSer: 3.273 ± 0.819
3.618ArgThr: 3.618 ± 0.295
2.929ArgVal: 2.929 ± 0.649
0.517ArgTrp: 0.517 ± 0.439
2.239ArgTyr: 2.239 ± 1.09
0.0ArgXaa: 0.0 ± 0.0
Ser
3.445SerAla: 3.445 ± 0.91
2.239SerCys: 2.239 ± 0.723
4.651SerAsp: 4.651 ± 1.282
7.924SerGlu: 7.924 ± 0.57
4.307SerPhe: 4.307 ± 0.051
4.996SerGly: 4.996 ± 0.734
2.067SerHis: 2.067 ± 0.856
5.34SerIle: 5.34 ± 0.8
7.063SerLys: 7.063 ± 1.532
9.647SerLeu: 9.647 ± 1.918
2.067SerMet: 2.067 ± 0.277
5.685SerAsn: 5.685 ± 1.225
3.618SerPro: 3.618 ± 1.071
3.101SerGln: 3.101 ± 1.071
2.584SerArg: 2.584 ± 0.694
12.059SerSer: 12.059 ± 2.736
4.479SerThr: 4.479 ± 1.485
5.168SerVal: 5.168 ± 1.345
1.378SerTrp: 1.378 ± 0.79
2.412SerTyr: 2.412 ± 0.154
0.0SerXaa: 0.0 ± 0.0
Thr
3.445ThrAla: 3.445 ± 1.312
2.239ThrCys: 2.239 ± 0.669
2.929ThrAsp: 2.929 ± 0.414
4.307ThrGlu: 4.307 ± 1.105
3.445ThrPhe: 3.445 ± 0.382
5.512ThrGly: 5.512 ± 1.683
1.55ThrHis: 1.55 ± 0.518
3.79ThrIle: 3.79 ± 0.35
4.307ThrLys: 4.307 ± 0.535
6.546ThrLeu: 6.546 ± 0.652
1.378ThrMet: 1.378 ± 0.691
2.412ThrAsn: 2.412 ± 0.524
1.895ThrPro: 1.895 ± 0.76
2.756ThrGln: 2.756 ± 0.909
2.756ThrArg: 2.756 ± 2.126
4.134ThrSer: 4.134 ± 1.172
4.307ThrThr: 4.307 ± 1.235
4.996ThrVal: 4.996 ± 1.548
1.206ThrTrp: 1.206 ± 0.606
2.067ThrTyr: 2.067 ± 0.019
0.0ThrXaa: 0.0 ± 0.0
Val
3.445ValAla: 3.445 ± 1.312
1.034ValCys: 1.034 ± 0.278
2.929ValAsp: 2.929 ± 0.478
4.823ValGlu: 4.823 ± 0.719
1.034ValPhe: 1.034 ± 0.518
2.239ValGly: 2.239 ± 0.068
0.517ValHis: 0.517 ± 0.139
3.101ValIle: 3.101 ± 0.31
6.029ValLys: 6.029 ± 0.657
5.857ValLeu: 5.857 ± 0.734
1.55ValMet: 1.55 ± 0.312
3.445ValAsn: 3.445 ± 0.406
2.412ValPro: 2.412 ± 0.693
1.895ValGln: 1.895 ± 1.738
3.618ValArg: 3.618 ± 0.971
6.029ValSer: 6.029 ± 0.544
3.618ValThr: 3.618 ± 0.941
3.445ValVal: 3.445 ± 0.995
0.861ValTrp: 0.861 ± 0.249
1.895ValTyr: 1.895 ± 0.889
0.0ValXaa: 0.0 ± 0.0
Trp
0.172TrpAla: 0.172 ± 0.191
0.517TrpCys: 0.517 ± 0.139
1.034TrpAsp: 1.034 ± 0.428
0.689TrpGlu: 0.689 ± 0.285
0.689TrpPhe: 0.689 ± 1.021
1.378TrpGly: 1.378 ± 0.848
0.345TrpHis: 0.345 ± 0.173
0.517TrpIle: 0.517 ± 0.439
1.034TrpLys: 1.034 ± 0.325
1.378TrpLeu: 1.378 ± 0.364
0.689TrpMet: 0.689 ± 0.398
0.689TrpAsn: 0.689 ± 0.399
0.345TrpPro: 0.345 ± 0.143
0.345TrpGln: 0.345 ± 0.143
0.689TrpArg: 0.689 ± 0.399
1.034TrpSer: 1.034 ± 0.651
0.517TrpThr: 0.517 ± 0.139
0.689TrpVal: 0.689 ± 0.182
0.345TrpTrp: 0.345 ± 0.143
0.345TrpTyr: 0.345 ± 0.143
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.345TyrAla: 0.345 ± 0.173
1.55TyrCys: 1.55 ± 0.771
1.723TyrAsp: 1.723 ± 0.191
1.034TyrGlu: 1.034 ± 0.418
1.206TyrPhe: 1.206 ± 0.406
2.067TyrGly: 2.067 ± 0.242
0.861TyrHis: 0.861 ± 0.432
1.895TyrIle: 1.895 ± 0.105
1.034TyrLys: 1.034 ± 0.325
3.445TyrLeu: 3.445 ± 0.266
0.861TyrMet: 0.861 ± 0.268
1.378TyrAsn: 1.378 ± 0.364
1.034TyrPro: 1.034 ± 0.817
0.861TyrGln: 0.861 ± 0.421
1.206TyrArg: 1.206 ± 0.312
2.756TyrSer: 2.756 ± 0.564
1.723TyrThr: 1.723 ± 0.7
1.034TyrVal: 1.034 ± 0.34
0.345TyrTrp: 0.345 ± 0.452
1.206TyrTyr: 1.206 ± 0.769
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (5806 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski