Amino acid dipepetide frequency for Trichoderma atroviride mycovirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.327AlaAla: 8.327 ± 3.26
0.379AlaCys: 0.379 ± 0.289
4.921AlaAsp: 4.921 ± 1.053
4.921AlaGlu: 4.921 ± 0.549
3.785AlaPhe: 3.785 ± 0.317
6.813AlaGly: 6.813 ± 0.677
1.514AlaHis: 1.514 ± 0.087
4.542AlaIle: 4.542 ± 1.329
1.893AlaLys: 1.893 ± 1.226
4.542AlaLeu: 4.542 ± 0.795
0.757AlaMet: 0.757 ± 0.043
3.407AlaAsn: 3.407 ± 0.463
6.056AlaPro: 6.056 ± 1.254
4.164AlaGln: 4.164 ± 1.63
7.57AlaArg: 7.57 ± 0.969
6.813AlaSer: 6.813 ± 0.677
7.192AlaThr: 7.192 ± 0.922
5.678AlaVal: 5.678 ± 0.593
1.514AlaTrp: 1.514 ± 0.621
3.785AlaTyr: 3.785 ± 0.217
0.0AlaXaa: 0.0 ± 0.0
Cys
1.893CysAla: 1.893 ± 0.376
0.0CysCys: 0.0 ± 0.0
1.136CysAsp: 1.136 ± 0.332
0.379CysGlu: 0.379 ± 0.245
0.379CysPhe: 0.379 ± 0.245
0.757CysGly: 0.757 ± 0.577
0.379CysHis: 0.379 ± 0.289
0.0CysIle: 0.0 ± 0.0
0.757CysLys: 0.757 ± 0.043
1.514CysLeu: 1.514 ± 0.447
0.757CysMet: 0.757 ± 0.491
0.0CysAsn: 0.0 ± 0.0
0.757CysPro: 0.757 ± 0.043
0.0CysGln: 0.0 ± 0.0
0.757CysArg: 0.757 ± 0.043
1.893CysSer: 1.893 ± 0.91
0.0CysThr: 0.0 ± 0.0
1.136CysVal: 1.136 ± 0.736
0.0CysTrp: 0.0 ± 0.0
0.757CysTyr: 0.757 ± 0.491
0.0CysXaa: 0.0 ± 0.0
Asp
6.056AspAla: 6.056 ± 0.882
0.757AspCys: 0.757 ± 0.491
1.893AspAsp: 1.893 ± 0.376
1.893AspGlu: 1.893 ± 0.692
2.271AspPhe: 2.271 ± 0.938
4.164AspGly: 4.164 ± 1.63
2.271AspHis: 2.271 ± 0.664
3.785AspIle: 3.785 ± 0.851
2.271AspLys: 2.271 ± 1.198
3.785AspLeu: 3.785 ± 0.751
1.514AspMet: 1.514 ± 0.267
1.893AspAsn: 1.893 ± 0.158
1.514AspPro: 1.514 ± 0.621
2.65AspGln: 2.65 ± 1.717
3.407AspArg: 3.407 ± 1.139
5.678AspSer: 5.678 ± 1.127
4.921AspThr: 4.921 ± 1.083
3.028AspVal: 3.028 ± 0.174
0.757AspTrp: 0.757 ± 0.043
1.514AspTyr: 1.514 ± 0.447
0.0AspXaa: 0.0 ± 0.0
Glu
4.542GluAla: 4.542 ± 0.807
0.379GluCys: 0.379 ± 0.245
2.65GluAsp: 2.65 ± 0.649
1.514GluGlu: 1.514 ± 0.981
4.164GluPhe: 4.164 ± 0.506
2.65GluGly: 2.65 ± 0.649
1.514GluHis: 1.514 ± 0.087
1.136GluIle: 1.136 ± 0.202
1.893GluLys: 1.893 ± 0.91
3.407GluLeu: 3.407 ± 2.207
0.379GluMet: 0.379 ± 0.289
2.65GluAsn: 2.65 ± 0.115
2.271GluPro: 2.271 ± 0.404
1.514GluGln: 1.514 ± 1.155
4.164GluArg: 4.164 ± 1.574
2.271GluSer: 2.271 ± 0.938
3.407GluThr: 3.407 ± 1.531
3.028GluVal: 3.028 ± 0.894
0.379GluTrp: 0.379 ± 0.289
2.271GluTyr: 2.271 ± 0.13
0.0GluXaa: 0.0 ± 0.0
Phe
3.407PheAla: 3.407 ± 0.463
0.757PheCys: 0.757 ± 0.491
2.65PheAsp: 2.65 ± 0.649
2.65PheGlu: 2.65 ± 0.115
2.271PhePhe: 2.271 ± 0.664
2.65PheGly: 2.65 ± 0.649
0.0PheHis: 0.0 ± 0.0
1.893PheIle: 1.893 ± 0.158
1.136PheLys: 1.136 ± 0.202
3.028PheLeu: 3.028 ± 0.708
0.0PheMet: 0.0 ± 0.0
4.542PheAsn: 4.542 ± 0.795
2.271PhePro: 2.271 ± 0.938
1.136PheGln: 1.136 ± 0.202
1.893PheArg: 1.893 ± 0.692
3.785PheSer: 3.785 ± 0.851
3.785PheThr: 3.785 ± 1.385
3.407PheVal: 3.407 ± 0.463
0.757PheTrp: 0.757 ± 0.043
0.757PheTyr: 0.757 ± 0.043
0.0PheXaa: 0.0 ± 0.0
Gly
6.056GlyAla: 6.056 ± 0.348
0.757GlyCys: 0.757 ± 0.043
4.542GlyAsp: 4.542 ± 0.807
4.164GlyGlu: 4.164 ± 0.506
1.893GlyPhe: 1.893 ± 0.692
4.164GlyGly: 4.164 ± 0.506
1.136GlyHis: 1.136 ± 0.202
1.136GlyIle: 1.136 ± 0.332
2.271GlyLys: 2.271 ± 1.198
7.57GlyLeu: 7.57 ± 0.969
1.893GlyMet: 1.893 ± 0.376
3.028GlyAsn: 3.028 ± 0.36
0.379GlyPro: 0.379 ± 0.245
2.271GlyGln: 2.271 ± 0.938
3.785GlyArg: 3.785 ± 1.385
4.542GlySer: 4.542 ± 2.409
3.028GlyThr: 3.028 ± 0.894
2.65GlyVal: 2.65 ± 0.419
0.757GlyTrp: 0.757 ± 0.043
2.65GlyTyr: 2.65 ± 0.115
0.0GlyXaa: 0.0 ± 0.0
His
2.65HisAla: 2.65 ± 0.419
0.0HisCys: 0.0 ± 0.0
1.893HisAsp: 1.893 ± 0.376
0.379HisGlu: 0.379 ± 0.289
2.271HisPhe: 2.271 ± 0.404
0.757HisGly: 0.757 ± 0.043
1.136HisHis: 1.136 ± 0.332
1.514HisIle: 1.514 ± 0.087
1.136HisLys: 1.136 ± 0.202
1.136HisLeu: 1.136 ± 0.332
0.757HisMet: 0.757 ± 0.043
1.136HisAsn: 1.136 ± 0.202
1.514HisPro: 1.514 ± 0.087
0.757HisGln: 0.757 ± 0.043
1.136HisArg: 1.136 ± 0.866
1.514HisSer: 1.514 ± 0.087
3.028HisThr: 3.028 ± 0.894
2.65HisVal: 2.65 ± 0.419
0.379HisTrp: 0.379 ± 0.289
1.893HisTyr: 1.893 ± 0.376
0.0HisXaa: 0.0 ± 0.0
Ile
5.299IleAla: 5.299 ± 1.372
0.757IleCys: 0.757 ± 0.577
3.028IleAsp: 3.028 ± 0.174
1.893IleGlu: 1.893 ± 0.376
2.65IlePhe: 2.65 ± 0.115
1.893IleGly: 1.893 ± 0.692
1.514IleHis: 1.514 ± 1.155
2.65IleIle: 2.65 ± 0.649
1.893IleLys: 1.893 ± 1.444
4.542IleLeu: 4.542 ± 2.397
1.514IleMet: 1.514 ± 1.155
2.65IleAsn: 2.65 ± 0.115
2.271IlePro: 2.271 ± 0.664
3.028IleGln: 3.028 ± 0.174
2.271IleArg: 2.271 ± 0.404
3.028IleSer: 3.028 ± 1.428
3.407IleThr: 3.407 ± 0.463
4.164IleVal: 4.164 ± 0.506
1.136IleTrp: 1.136 ± 0.736
0.757IleTyr: 0.757 ± 0.491
0.0IleXaa: 0.0 ± 0.0
Lys
1.893LysAla: 1.893 ± 0.158
1.514LysCys: 1.514 ± 0.087
2.271LysAsp: 2.271 ± 0.664
3.028LysGlu: 3.028 ± 0.174
1.514LysPhe: 1.514 ± 1.155
1.893LysGly: 1.893 ± 1.226
0.757LysHis: 0.757 ± 0.577
3.028LysIle: 3.028 ± 0.708
0.757LysLys: 0.757 ± 0.577
3.028LysLeu: 3.028 ± 1.776
0.379LysMet: 0.379 ± 0.245
1.136LysAsn: 1.136 ± 0.202
2.271LysPro: 2.271 ± 0.938
3.028LysGln: 3.028 ± 1.242
3.407LysArg: 3.407 ± 0.071
2.271LysSer: 2.271 ± 0.664
2.271LysThr: 2.271 ± 0.664
2.271LysVal: 2.271 ± 1.198
1.136LysTrp: 1.136 ± 0.736
2.65LysTyr: 2.65 ± 1.487
0.0LysXaa: 0.0 ± 0.0
Leu
7.57LeuAla: 7.57 ± 0.435
2.271LeuCys: 2.271 ± 0.13
3.407LeuAsp: 3.407 ± 0.071
4.164LeuGlu: 4.164 ± 0.506
1.893LeuPhe: 1.893 ± 0.692
6.056LeuGly: 6.056 ± 1.788
2.271LeuHis: 2.271 ± 1.198
3.785LeuIle: 3.785 ± 0.751
4.542LeuLys: 4.542 ± 1.863
9.084LeuLeu: 9.084 ± 3.725
2.271LeuMet: 2.271 ± 1.198
6.056LeuAsn: 6.056 ± 1.95
6.813LeuPro: 6.813 ± 1.211
3.785LeuGln: 3.785 ± 1.285
3.028LeuArg: 3.028 ± 1.776
9.084LeuSer: 9.084 ± 0.013
5.299LeuThr: 5.299 ± 0.838
1.136LeuVal: 1.136 ± 0.332
1.136LeuTrp: 1.136 ± 0.332
1.136LeuTyr: 1.136 ± 0.332
0.0LeuXaa: 0.0 ± 0.0
Met
1.514MetAla: 1.514 ± 0.087
0.0MetCys: 0.0 ± 0.0
0.757MetAsp: 0.757 ± 0.577
0.379MetGlu: 0.379 ± 0.245
0.757MetPhe: 0.757 ± 0.577
0.757MetGly: 0.757 ± 0.043
0.379MetHis: 0.379 ± 0.289
2.271MetIle: 2.271 ± 0.664
0.0MetLys: 0.0 ± 0.0
1.136MetLeu: 1.136 ± 0.202
0.379MetMet: 0.379 ± 0.289
2.271MetAsn: 2.271 ± 0.13
0.379MetPro: 0.379 ± 0.245
1.136MetGln: 1.136 ± 0.202
1.893MetArg: 1.893 ± 0.91
1.514MetSer: 1.514 ± 0.621
0.379MetThr: 0.379 ± 0.289
1.136MetVal: 1.136 ± 0.332
0.379MetTrp: 0.379 ± 0.289
0.379MetTyr: 0.379 ± 0.289
0.0MetXaa: 0.0 ± 0.0
Asn
4.542AsnAla: 4.542 ± 0.261
0.757AsnCys: 0.757 ± 0.043
4.164AsnAsp: 4.164 ± 1.096
0.757AsnGlu: 0.757 ± 0.043
1.514AsnPhe: 1.514 ± 0.087
1.893AsnGly: 1.893 ± 0.376
1.893AsnHis: 1.893 ± 0.376
1.893AsnIle: 1.893 ± 0.158
1.893AsnLys: 1.893 ± 0.91
4.164AsnLeu: 4.164 ± 1.04
0.757AsnMet: 0.757 ± 0.215
2.271AsnAsn: 2.271 ± 0.938
3.028AsnPro: 3.028 ± 0.894
2.271AsnGln: 2.271 ± 0.404
3.028AsnArg: 3.028 ± 0.174
3.785AsnSer: 3.785 ± 0.851
4.164AsnThr: 4.164 ± 0.028
3.785AsnVal: 3.785 ± 0.317
1.893AsnTrp: 1.893 ± 0.158
3.028AsnTyr: 3.028 ± 0.174
0.0AsnXaa: 0.0 ± 0.0
Pro
5.299ProAla: 5.299 ± 1.298
0.0ProCys: 0.0 ± 0.0
3.785ProAsp: 3.785 ± 0.217
1.893ProGlu: 1.893 ± 0.692
2.65ProPhe: 2.65 ± 1.183
4.164ProGly: 4.164 ± 0.562
1.136ProHis: 1.136 ± 0.736
2.65ProIle: 2.65 ± 0.419
3.407ProLys: 3.407 ± 0.605
5.678ProLeu: 5.678 ± 1.009
0.757ProMet: 0.757 ± 0.577
3.028ProAsn: 3.028 ± 1.428
4.921ProPro: 4.921 ± 2.121
1.136ProGln: 1.136 ± 0.202
2.65ProArg: 2.65 ± 0.115
4.542ProSer: 4.542 ± 0.261
3.407ProThr: 3.407 ± 0.605
3.785ProVal: 3.785 ± 1.385
0.379ProTrp: 0.379 ± 0.289
1.893ProTyr: 1.893 ± 0.692
0.0ProXaa: 0.0 ± 0.0
Gln
3.407GlnAla: 3.407 ± 1.139
0.379GlnCys: 0.379 ± 0.289
1.514GlnAsp: 1.514 ± 0.447
1.514GlnGlu: 1.514 ± 0.087
2.271GlnPhe: 2.271 ± 0.938
1.136GlnGly: 1.136 ± 0.202
0.379GlnHis: 0.379 ± 0.245
1.893GlnIle: 1.893 ± 0.91
0.757GlnLys: 0.757 ± 0.043
3.407GlnLeu: 3.407 ± 0.463
0.379GlnMet: 0.379 ± 0.245
3.028GlnAsn: 3.028 ± 1.428
2.65GlnPro: 2.65 ± 0.649
1.514GlnGln: 1.514 ± 0.087
5.678GlnArg: 5.678 ± 0.059
4.164GlnSer: 4.164 ± 0.028
2.65GlnThr: 2.65 ± 0.649
1.136GlnVal: 1.136 ± 0.202
1.136GlnTrp: 1.136 ± 0.202
1.514GlnTyr: 1.514 ± 0.621
0.0GlnXaa: 0.0 ± 0.0
Arg
5.678ArgAla: 5.678 ± 0.059
0.757ArgCys: 0.757 ± 0.043
4.164ArgAsp: 4.164 ± 0.562
3.028ArgGlu: 3.028 ± 0.894
1.514ArgPhe: 1.514 ± 0.447
3.785ArgGly: 3.785 ± 0.751
2.271ArgHis: 2.271 ± 0.404
3.785ArgIle: 3.785 ± 0.317
2.271ArgLys: 2.271 ± 0.13
7.57ArgLeu: 7.57 ± 0.969
1.136ArgMet: 1.136 ± 0.866
2.65ArgAsn: 2.65 ± 0.649
2.65ArgPro: 2.65 ± 0.115
1.893ArgGln: 1.893 ± 0.692
3.785ArgArg: 3.785 ± 0.317
7.57ArgSer: 7.57 ± 2.571
2.65ArgThr: 2.65 ± 0.649
4.542ArgVal: 4.542 ± 0.807
1.893ArgTrp: 1.893 ± 1.226
2.65ArgTyr: 2.65 ± 0.115
0.0ArgXaa: 0.0 ± 0.0
Ser
6.056SerAla: 6.056 ± 2.322
1.136SerCys: 1.136 ± 0.202
4.921SerAsp: 4.921 ± 0.519
4.921SerGlu: 4.921 ± 1.053
3.407SerPhe: 3.407 ± 0.071
6.435SerGly: 6.435 ± 0.636
2.65SerHis: 2.65 ± 0.115
4.921SerIle: 4.921 ± 1.617
3.785SerLys: 3.785 ± 0.751
7.57SerLeu: 7.57 ± 0.099
1.136SerMet: 1.136 ± 0.332
3.785SerAsn: 3.785 ± 0.217
4.164SerPro: 4.164 ± 1.096
3.785SerGln: 3.785 ± 0.317
6.056SerArg: 6.056 ± 0.72
3.785SerSer: 3.785 ± 0.217
4.921SerThr: 4.921 ± 0.015
4.542SerVal: 4.542 ± 0.261
0.757SerTrp: 0.757 ± 0.577
1.893SerTyr: 1.893 ± 0.158
0.0SerXaa: 0.0 ± 0.0
Thr
4.542ThrAla: 4.542 ± 0.273
0.0ThrCys: 0.0 ± 0.0
3.407ThrAsp: 3.407 ± 0.071
1.893ThrGlu: 1.893 ± 0.376
3.785ThrPhe: 3.785 ± 1.285
3.407ThrGly: 3.407 ± 0.997
2.271ThrHis: 2.271 ± 0.404
3.028ThrIle: 3.028 ± 0.36
5.299ThrLys: 5.299 ± 0.764
3.785ThrLeu: 3.785 ± 1.285
0.379ThrMet: 0.379 ± 0.289
3.407ThrAsn: 3.407 ± 0.605
6.435ThrPro: 6.435 ± 2.034
2.271ThrGln: 2.271 ± 0.13
5.299ThrArg: 5.299 ± 1.298
4.164ThrSer: 4.164 ± 1.096
7.57ThrThr: 7.57 ± 2.235
4.164ThrVal: 4.164 ± 1.04
0.757ThrTrp: 0.757 ± 0.043
2.65ThrTyr: 2.65 ± 0.953
0.0ThrXaa: 0.0 ± 0.0
Val
3.028ValAla: 3.028 ± 0.894
1.514ValCys: 1.514 ± 0.621
3.028ValAsp: 3.028 ± 1.242
2.65ValGlu: 2.65 ± 0.419
2.271ValPhe: 2.271 ± 0.938
3.785ValGly: 3.785 ± 0.217
2.65ValHis: 2.65 ± 0.115
3.407ValIle: 3.407 ± 1.531
2.271ValLys: 2.271 ± 0.664
5.678ValLeu: 5.678 ± 0.593
0.379ValMet: 0.379 ± 0.245
2.271ValAsn: 2.271 ± 0.664
3.407ValPro: 3.407 ± 1.139
2.271ValGln: 2.271 ± 0.938
4.921ValArg: 4.921 ± 1.053
6.813ValSer: 6.813 ± 0.143
3.407ValThr: 3.407 ± 0.463
1.514ValVal: 1.514 ± 0.087
0.379ValTrp: 0.379 ± 0.289
1.514ValTyr: 1.514 ± 0.087
0.0ValXaa: 0.0 ± 0.0
Trp
3.028TrpAla: 3.028 ± 0.894
0.379TrpCys: 0.379 ± 0.245
1.514TrpAsp: 1.514 ± 0.621
1.514TrpGlu: 1.514 ± 0.087
0.379TrpPhe: 0.379 ± 0.289
0.379TrpGly: 0.379 ± 0.289
0.379TrpHis: 0.379 ± 0.289
0.757TrpIle: 0.757 ± 0.491
0.0TrpLys: 0.0 ± 0.0
0.757TrpLeu: 0.757 ± 0.491
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
1.136TrpPro: 1.136 ± 0.866
0.379TrpGln: 0.379 ± 0.245
0.379TrpArg: 0.379 ± 0.245
1.514TrpSer: 1.514 ± 0.087
1.514TrpThr: 1.514 ± 0.447
0.757TrpVal: 0.757 ± 0.043
1.136TrpTrp: 1.136 ± 0.736
0.757TrpTyr: 0.757 ± 0.577
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.785TyrAla: 3.785 ± 1.285
0.757TyrCys: 0.757 ± 0.491
0.757TyrAsp: 0.757 ± 0.577
2.65TyrGlu: 2.65 ± 0.115
1.136TyrPhe: 1.136 ± 0.736
1.136TyrGly: 1.136 ± 0.332
1.136TyrHis: 1.136 ± 0.736
2.271TyrIle: 2.271 ± 0.664
2.271TyrLys: 2.271 ± 0.664
3.407TyrLeu: 3.407 ± 1.531
1.893TyrMet: 1.893 ± 0.158
2.271TyrAsn: 2.271 ± 0.664
2.271TyrPro: 2.271 ± 0.13
1.514TyrGln: 1.514 ± 0.447
1.136TyrArg: 1.136 ± 0.736
1.893TyrSer: 1.893 ± 0.158
1.514TyrThr: 1.514 ± 0.621
2.65TyrVal: 2.65 ± 0.115
0.0TyrTrp: 0.0 ± 0.0
0.757TyrTyr: 0.757 ± 0.043
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2643 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski