Amino acid dipepetide frequency for Currant virus A

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.351AlaAla: 3.351 ± 3.232
1.676AlaCys: 1.676 ± 0.214
2.346AlaAsp: 2.346 ± 0.075
6.032AlaGlu: 6.032 ± 1.662
2.681AlaPhe: 2.681 ± 0.716
3.686AlaGly: 3.686 ± 0.652
1.34AlaHis: 1.34 ± 0.577
4.021AlaIle: 4.021 ± 0.138
4.021AlaLys: 4.021 ± 0.796
3.686AlaLeu: 3.686 ± 0.652
1.005AlaMet: 1.005 ± 0.433
3.351AlaAsn: 3.351 ± 1.362
1.34AlaPro: 1.34 ± 0.577
1.34AlaGln: 1.34 ± 0.577
4.357AlaArg: 4.357 ± 1.864
3.016AlaSer: 3.016 ± 1.299
2.011AlaThr: 2.011 ± 2.874
4.021AlaVal: 4.021 ± 1.073
0.67AlaTrp: 0.67 ± 0.646
2.681AlaTyr: 2.681 ± 1.154
0.0AlaXaa: 0.0 ± 0.0
Cys
1.005CysAla: 1.005 ± 0.433
0.0CysCys: 0.0 ± 0.0
0.335CysAsp: 0.335 ± 0.144
0.67CysGlu: 0.67 ± 0.646
1.676CysPhe: 1.676 ± 0.721
1.34CysGly: 1.34 ± 0.358
0.335CysHis: 0.335 ± 0.144
1.005CysIle: 1.005 ± 0.433
1.34CysLys: 1.34 ± 0.577
1.676CysLeu: 1.676 ± 0.214
1.005CysMet: 1.005 ± 0.433
1.005CysAsn: 1.005 ± 0.433
0.67CysPro: 0.67 ± 0.646
1.005CysGln: 1.005 ± 0.502
0.67CysArg: 0.67 ± 0.289
3.016CysSer: 3.016 ± 0.571
2.346CysThr: 2.346 ± 0.075
0.67CysVal: 0.67 ± 0.289
0.335CysTrp: 0.335 ± 0.144
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.357AspAla: 4.357 ± 1.864
1.34AspCys: 1.34 ± 0.577
3.016AspAsp: 3.016 ± 0.364
4.357AspGlu: 4.357 ± 1.864
5.027AspPhe: 5.027 ± 0.294
0.335AspGly: 0.335 ± 0.144
0.335AspHis: 0.335 ± 0.144
2.681AspIle: 2.681 ± 0.716
2.681AspLys: 2.681 ± 0.716
5.362AspLeu: 5.362 ± 1.374
0.335AspMet: 0.335 ± 0.144
1.676AspAsn: 1.676 ± 1.149
2.346AspPro: 2.346 ± 1.795
3.016AspGln: 3.016 ± 0.364
3.016AspArg: 3.016 ± 2.441
8.043AspSer: 8.043 ± 2.147
0.67AspThr: 0.67 ± 0.289
2.681AspVal: 2.681 ± 0.219
0.335AspTrp: 0.335 ± 0.144
1.34AspTyr: 1.34 ± 0.358
0.0AspXaa: 0.0 ± 0.0
Glu
3.686GluAla: 3.686 ± 0.652
2.011GluCys: 2.011 ± 0.866
2.681GluAsp: 2.681 ± 1.651
2.011GluGlu: 2.011 ± 0.866
3.351GluPhe: 3.351 ± 3.232
4.357GluGly: 4.357 ± 0.929
2.681GluHis: 2.681 ± 0.219
5.362GluIle: 5.362 ± 2.309
2.346GluLys: 2.346 ± 1.01
6.702GluLeu: 6.702 ± 1.951
1.676GluMet: 1.676 ± 0.721
2.681GluAsn: 2.681 ± 0.219
2.346GluPro: 2.346 ± 1.01
0.67GluGln: 0.67 ± 0.289
3.016GluArg: 3.016 ± 0.571
6.702GluSer: 6.702 ± 0.854
2.346GluThr: 2.346 ± 1.01
2.681GluVal: 2.681 ± 1.154
0.335GluTrp: 0.335 ± 0.144
1.676GluTyr: 1.676 ± 0.214
0.0GluXaa: 0.0 ± 0.0
Phe
3.016PheAla: 3.016 ± 1.299
1.34PheCys: 1.34 ± 0.577
4.021PheAsp: 4.021 ± 3.878
4.692PheGlu: 4.692 ± 1.085
4.692PhePhe: 4.692 ± 0.785
3.686PheGly: 3.686 ± 0.283
0.67PheHis: 0.67 ± 0.289
4.021PheIle: 4.021 ± 2.008
4.357PheLys: 4.357 ± 0.941
6.367PheLeu: 6.367 ± 0.998
2.346PheMet: 2.346 ± 1.01
5.362PheAsn: 5.362 ± 0.496
1.676PhePro: 1.676 ± 0.721
2.681PheGln: 2.681 ± 2.586
3.016PheArg: 3.016 ± 0.571
7.038PheSer: 7.038 ± 0.225
2.011PheThr: 2.011 ± 0.866
1.005PheVal: 1.005 ± 0.502
0.335PheTrp: 0.335 ± 0.144
2.346PheTyr: 2.346 ± 1.01
0.0PheXaa: 0.0 ± 0.0
Gly
2.346GlyAla: 2.346 ± 0.86
0.335GlyCys: 0.335 ± 0.791
3.686GlyAsp: 3.686 ± 1.218
4.021GlyGlu: 4.021 ± 0.796
3.686GlyPhe: 3.686 ± 1.218
3.016GlyGly: 3.016 ± 1.506
1.34GlyHis: 1.34 ± 0.577
3.351GlyIle: 3.351 ± 1.443
4.021GlyLys: 4.021 ± 0.796
5.362GlyLeu: 5.362 ± 0.439
0.67GlyMet: 0.67 ± 0.646
2.346GlyAsn: 2.346 ± 0.075
0.67GlyPro: 0.67 ± 0.646
2.011GlyGln: 2.011 ± 0.866
2.346GlyArg: 2.346 ± 1.795
5.697GlySer: 5.697 ± 1.287
1.34GlyThr: 1.34 ± 0.358
4.021GlyVal: 4.021 ± 0.796
1.676GlyTrp: 1.676 ± 0.214
0.67GlyTyr: 0.67 ± 0.289
0.0GlyXaa: 0.0 ± 0.0
His
2.011HisAla: 2.011 ± 1.004
0.0HisCys: 0.0 ± 0.0
1.676HisAsp: 1.676 ± 0.721
0.335HisGlu: 0.335 ± 0.144
1.676HisPhe: 1.676 ± 0.721
1.005HisGly: 1.005 ± 0.433
1.34HisHis: 1.34 ± 0.358
1.005HisIle: 1.005 ± 0.502
1.34HisLys: 1.34 ± 0.577
4.692HisLeu: 4.692 ± 0.15
0.67HisMet: 0.67 ± 0.289
0.67HisAsn: 0.67 ± 0.289
1.005HisPro: 1.005 ± 0.433
2.011HisGln: 2.011 ± 0.866
1.005HisArg: 1.005 ± 0.502
1.34HisSer: 1.34 ± 0.577
0.335HisThr: 0.335 ± 0.144
1.005HisVal: 1.005 ± 0.502
0.0HisTrp: 0.0 ± 0.0
1.34HisTyr: 1.34 ± 0.577
0.0HisXaa: 0.0 ± 0.0
Ile
4.021IleAla: 4.021 ± 0.796
2.011IleCys: 2.011 ± 0.866
4.692IleAsp: 4.692 ± 2.655
3.351IleGlu: 3.351 ± 1.443
2.346IlePhe: 2.346 ± 1.795
4.692IleGly: 4.692 ± 0.15
1.005IleHis: 1.005 ± 0.502
2.346IleIle: 2.346 ± 0.86
7.373IleLys: 7.373 ± 2.239
4.021IleLeu: 4.021 ± 0.796
1.005IleMet: 1.005 ± 0.433
4.692IleAsn: 4.692 ± 1.085
1.676IlePro: 1.676 ± 0.721
5.027IleGln: 5.027 ± 0.641
2.011IleArg: 2.011 ± 1.004
6.032IleSer: 6.032 ± 0.208
3.686IleThr: 3.686 ± 1.218
2.681IleVal: 2.681 ± 1.154
0.335IleTrp: 0.335 ± 0.144
2.681IleTyr: 2.681 ± 0.716
0.0IleXaa: 0.0 ± 0.0
Lys
4.357LysAla: 4.357 ± 1.876
0.335LysCys: 0.335 ± 0.144
3.016LysAsp: 3.016 ± 0.364
2.681LysGlu: 2.681 ± 0.219
3.016LysPhe: 3.016 ± 1.506
5.362LysGly: 5.362 ± 1.431
1.676LysHis: 1.676 ± 0.214
5.027LysIle: 5.027 ± 1.229
6.367LysLys: 6.367 ± 0.998
6.702LysLeu: 6.702 ± 0.081
2.346LysMet: 2.346 ± 1.01
4.357LysAsn: 4.357 ± 0.941
2.011LysPro: 2.011 ± 0.069
2.681LysGln: 2.681 ± 0.716
4.357LysArg: 4.357 ± 0.941
6.702LysSer: 6.702 ± 1.016
3.351LysThr: 3.351 ± 0.508
4.021LysVal: 4.021 ± 1.731
0.0LysTrp: 0.0 ± 0.0
1.34LysTyr: 1.34 ± 0.577
0.0LysXaa: 0.0 ± 0.0
Leu
3.686LeuAla: 3.686 ± 0.652
2.011LeuCys: 2.011 ± 0.069
3.686LeuAsp: 3.686 ± 1.587
4.692LeuGlu: 4.692 ± 0.15
7.708LeuPhe: 7.708 ± 1.449
6.367LeuGly: 6.367 ± 0.872
2.346LeuHis: 2.346 ± 0.86
9.048LeuIle: 9.048 ± 1.091
8.713LeuLys: 8.713 ± 0.947
8.043LeuLeu: 8.043 ± 1.593
3.686LeuMet: 3.686 ± 1.587
5.362LeuAsn: 5.362 ± 2.309
4.021LeuPro: 4.021 ± 0.138
2.681LeuGln: 2.681 ± 2.586
3.351LeuArg: 3.351 ± 1.443
10.054LeuSer: 10.054 ± 1.524
5.362LeuThr: 5.362 ± 2.309
4.692LeuVal: 4.692 ± 1.085
1.005LeuTrp: 1.005 ± 0.433
1.34LeuTyr: 1.34 ± 0.358
0.0LeuXaa: 0.0 ± 0.0
Met
1.005MetAla: 1.005 ± 0.433
0.67MetCys: 0.67 ± 0.289
1.005MetAsp: 1.005 ± 0.433
0.0MetGlu: 0.0 ± 0.0
0.67MetPhe: 0.67 ± 0.646
0.0MetGly: 0.0 ± 0.0
0.67MetHis: 0.67 ± 0.289
1.676MetIle: 1.676 ± 0.721
1.676MetLys: 1.676 ± 0.721
2.346MetLeu: 2.346 ± 1.01
0.67MetMet: 0.67 ± 0.289
1.34MetAsn: 1.34 ± 0.577
1.676MetPro: 1.676 ± 0.721
1.676MetGln: 1.676 ± 0.214
2.011MetArg: 2.011 ± 0.866
3.686MetSer: 3.686 ± 0.283
0.67MetThr: 0.67 ± 0.289
0.67MetVal: 0.67 ± 0.289
0.67MetTrp: 0.67 ± 0.289
1.676MetTyr: 1.676 ± 0.214
0.0MetXaa: 0.0 ± 0.0
Asn
3.016AsnAla: 3.016 ± 0.364
2.681AsnCys: 2.681 ± 1.651
2.681AsnAsp: 2.681 ± 0.716
3.351AsnGlu: 3.351 ± 1.443
4.357AsnPhe: 4.357 ± 1.876
2.346AsnGly: 2.346 ± 0.075
2.011AsnHis: 2.011 ± 0.866
2.346AsnIle: 2.346 ± 0.075
2.346AsnLys: 2.346 ± 0.86
8.043AsnLeu: 8.043 ± 1.593
0.67AsnMet: 0.67 ± 0.646
2.011AsnAsn: 2.011 ± 0.866
3.686AsnPro: 3.686 ± 1.587
1.34AsnGln: 1.34 ± 0.358
3.016AsnArg: 3.016 ± 1.506
6.032AsnSer: 6.032 ± 0.727
0.67AsnThr: 0.67 ± 0.289
3.016AsnVal: 3.016 ± 0.571
0.0AsnTrp: 0.0 ± 0.0
1.34AsnTyr: 1.34 ± 0.358
0.0AsnXaa: 0.0 ± 0.0
Pro
2.011ProAla: 2.011 ± 1.004
0.0ProCys: 0.0 ± 0.0
2.681ProAsp: 2.681 ± 1.154
4.357ProGlu: 4.357 ± 1.864
2.011ProPhe: 2.011 ± 0.866
1.676ProGly: 1.676 ± 0.721
0.335ProHis: 0.335 ± 0.144
3.016ProIle: 3.016 ± 0.571
2.346ProLys: 2.346 ± 0.86
3.016ProLeu: 3.016 ± 0.364
1.005ProMet: 1.005 ± 0.433
3.686ProAsn: 3.686 ± 1.218
2.011ProPro: 2.011 ± 0.069
1.676ProGln: 1.676 ± 0.721
1.34ProArg: 1.34 ± 1.293
2.346ProSer: 2.346 ± 0.075
1.34ProThr: 1.34 ± 0.358
1.34ProVal: 1.34 ± 0.577
0.0ProTrp: 0.0 ± 0.0
0.67ProTyr: 0.67 ± 0.289
0.0ProXaa: 0.0 ± 0.0
Gln
2.681GlnAla: 2.681 ± 1.651
0.67GlnCys: 0.67 ± 0.289
1.005GlnAsp: 1.005 ± 0.433
2.011GlnGlu: 2.011 ± 0.866
2.011GlnPhe: 2.011 ± 1.004
3.686GlnGly: 3.686 ± 2.153
0.335GlnHis: 0.335 ± 0.144
1.676GlnIle: 1.676 ± 0.721
2.011GlnLys: 2.011 ± 0.866
6.367GlnLeu: 6.367 ± 0.063
0.67GlnMet: 0.67 ± 0.289
1.005GlnAsn: 1.005 ± 0.433
1.005GlnPro: 1.005 ± 0.433
2.346GlnGln: 2.346 ± 0.075
0.335GlnArg: 0.335 ± 0.791
5.362GlnSer: 5.362 ± 2.366
1.676GlnThr: 1.676 ± 0.214
2.681GlnVal: 2.681 ± 0.716
0.0GlnTrp: 0.0 ± 0.0
1.005GlnTyr: 1.005 ± 0.433
0.0GlnXaa: 0.0 ± 0.0
Arg
3.686ArgAla: 3.686 ± 1.218
0.335ArgCys: 0.335 ± 0.791
3.351ArgAsp: 3.351 ± 3.232
3.351ArgGlu: 3.351 ± 0.427
4.021ArgPhe: 4.021 ± 0.138
3.016ArgGly: 3.016 ± 1.506
0.67ArgHis: 0.67 ± 0.289
2.011ArgIle: 2.011 ± 0.069
3.351ArgLys: 3.351 ± 1.362
3.686ArgLeu: 3.686 ± 1.587
0.0ArgMet: 0.0 ± 0.0
2.011ArgAsn: 2.011 ± 0.866
2.681ArgPro: 2.681 ± 3.521
1.676ArgGln: 1.676 ± 0.214
3.686ArgArg: 3.686 ± 1.218
5.362ArgSer: 5.362 ± 3.301
1.676ArgThr: 1.676 ± 0.214
2.681ArgVal: 2.681 ± 0.219
0.335ArgTrp: 0.335 ± 0.144
1.676ArgTyr: 1.676 ± 0.214
0.0ArgXaa: 0.0 ± 0.0
Ser
6.032SerAla: 6.032 ± 1.143
2.681SerCys: 2.681 ± 1.154
5.027SerAsp: 5.027 ± 1.576
6.702SerGlu: 6.702 ± 0.854
7.373SerPhe: 7.373 ± 0.369
3.016SerGly: 3.016 ± 0.364
3.686SerHis: 3.686 ± 0.652
8.378SerIle: 8.378 ± 2.938
6.702SerLys: 6.702 ± 1.016
10.724SerLeu: 10.724 ± 2.747
3.351SerMet: 3.351 ± 0.589
6.367SerAsn: 6.367 ± 2.868
2.681SerPro: 2.681 ± 0.716
3.686SerGln: 3.686 ± 0.283
4.692SerArg: 4.692 ± 1.72
13.405SerSer: 13.405 ± 5.448
2.681SerThr: 2.681 ± 1.651
6.367SerVal: 6.367 ± 0.063
1.34SerTrp: 1.34 ± 0.577
3.016SerTyr: 3.016 ± 1.299
0.0SerXaa: 0.0 ± 0.0
Thr
1.676ThrAla: 1.676 ± 0.214
1.005ThrCys: 1.005 ± 0.502
2.681ThrAsp: 2.681 ± 0.219
2.011ThrGlu: 2.011 ± 0.069
3.351ThrPhe: 3.351 ± 0.427
2.011ThrGly: 2.011 ± 1.004
1.34ThrHis: 1.34 ± 0.577
2.346ThrIle: 2.346 ± 0.075
1.005ThrLys: 1.005 ± 0.433
4.692ThrLeu: 4.692 ± 0.15
0.67ThrMet: 0.67 ± 0.289
1.005ThrAsn: 1.005 ± 0.433
1.34ThrPro: 1.34 ± 0.577
0.67ThrGln: 0.67 ± 0.289
3.351ThrArg: 3.351 ± 0.427
3.686ThrSer: 3.686 ± 2.153
1.34ThrThr: 1.34 ± 0.358
1.676ThrVal: 1.676 ± 0.721
1.005ThrTrp: 1.005 ± 0.433
1.005ThrTyr: 1.005 ± 0.433
0.0ThrXaa: 0.0 ± 0.0
Val
2.681ValAla: 2.681 ± 1.154
1.005ValCys: 1.005 ± 0.433
2.346ValAsp: 2.346 ± 0.86
2.011ValGlu: 2.011 ± 0.866
3.016ValPhe: 3.016 ± 1.299
1.34ValGly: 1.34 ± 0.577
2.011ValHis: 2.011 ± 0.866
4.357ValIle: 4.357 ± 0.929
6.367ValLys: 6.367 ± 0.063
2.681ValLeu: 2.681 ± 0.219
1.34ValMet: 1.34 ± 0.358
3.686ValAsn: 3.686 ± 0.652
2.011ValPro: 2.011 ± 0.069
1.005ValGln: 1.005 ± 0.433
1.676ValArg: 1.676 ± 1.149
6.032ValSer: 6.032 ± 1.662
2.346ValThr: 2.346 ± 0.075
2.011ValVal: 2.011 ± 0.866
0.0ValTrp: 0.0 ± 0.0
1.34ValTyr: 1.34 ± 0.577
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.335TrpGlu: 0.335 ± 0.144
0.0TrpPhe: 0.0 ± 0.0
0.335TrpGly: 0.335 ± 0.144
0.0TrpHis: 0.0 ± 0.0
0.67TrpIle: 0.67 ± 0.289
0.335TrpLys: 0.335 ± 0.791
1.005TrpLeu: 1.005 ± 0.433
0.335TrpMet: 0.335 ± 0.411
0.67TrpAsn: 0.67 ± 0.289
0.67TrpPro: 0.67 ± 0.289
0.335TrpGln: 0.335 ± 0.144
0.67TrpArg: 0.67 ± 0.289
1.34TrpSer: 1.34 ± 0.577
0.67TrpThr: 0.67 ± 0.289
1.005TrpVal: 1.005 ± 0.433
0.0TrpTrp: 0.0 ± 0.0
0.335TrpTyr: 0.335 ± 0.144
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.011TyrAla: 2.011 ± 0.866
0.0TyrCys: 0.0 ± 0.0
3.016TyrAsp: 3.016 ± 1.299
2.346TyrGlu: 2.346 ± 1.01
2.011TyrPhe: 2.011 ± 0.069
1.34TyrGly: 1.34 ± 0.577
0.335TyrHis: 0.335 ± 0.791
1.34TyrIle: 1.34 ± 0.577
1.005TyrLys: 1.005 ± 0.433
2.681TyrLeu: 2.681 ± 1.154
0.67TyrMet: 0.67 ± 0.289
1.676TyrAsn: 1.676 ± 0.721
1.34TyrPro: 1.34 ± 1.293
1.005TyrGln: 1.005 ± 0.502
1.34TyrArg: 1.34 ± 1.293
3.351TyrSer: 3.351 ± 0.508
1.34TyrThr: 1.34 ± 0.577
0.335TyrVal: 0.335 ± 0.144
0.335TyrTrp: 0.335 ± 0.144
0.67TyrTyr: 0.67 ± 0.289
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2985 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski