Amino acid dipeptide frequency for Cryphonectria parasitica bipartite mycovirus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
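As an illustration only, a dipeptide frequency can be computed by sliding a window of two residues along each protein sequence. The sketch below is not the code used by Proteome-pI: the function names are hypothetical, and it assumes that dipeptides are counted within each protein (never across protein boundaries), that any letter outside the 20 standard one-letter codes is mapped to X (Xaa), and that counts are normalised by the total number of dipeptides. The database may normalise differently.

```python
from collections import Counter
from itertools import product

# One-letter codes in the same order as the table columns; X stands for Xaa.
ALPHABET = "ACDEFGHIKLMNPQRSTVWYX"


def dipeptide_counts(proteins):
    """Count overlapping dipeptides inside each protein sequence."""
    counts = Counter()
    for seq in proteins:
        # Assumption: every non-standard or unknown residue becomes X (Xaa).
        seq = "".join(c if c in ALPHABET else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    return counts


def dipeptide_permille(proteins):
    """Return all 441 dipeptide frequencies in per mille."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


if __name__ == "__main__":
    # Toy sequences, not taken from the proteome described here.
    freqs = dipeptide_permille(["MKVLAAG", "MAAK"])
    print(round(freqs["AA"], 3))
```

Returning an entry for every one of the 441 pairs, including those with zero counts, keeps the result directly comparable to a full 21 x 21 table.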

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred with equal probability, each would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red and underrepresented ones in blue in the table.

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions.
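The arithmetic behind the uniform expectation and the per mille unit can be sketched as follows. The helper names are hypothetical, and the exact cut-off used to colour the table is not stated on this page, so the simple comparison against the uniform value is an assumption.

```python
def expected_permille(n_residue_types=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / n_residue_types ** 2


def permille_to_fraction(value_permille):
    """Convert a table value in per mille to a plain fraction."""
    return value_permille * 1e-3


def classify(value_permille, n_residue_types=20):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    expected = expected_permille(n_residue_types)
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "expected"


if __name__ == "__main__":
    print(expected_permille(20))   # 400 possible dipeptides
    print(expected_permille(21))   # 441 possible dipeptides, with Xaa
    print(classify(12.184))        # value listed for AlaAla
    print(classify(0.937))         # value listed for AlaCys
```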

For more information, see the sequence space article on Wikipedia.

| First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 12.184 ± 3.9 | 0.937 ± 0.551 | 7.498 ± 1.926 | 8.435 ± 3.639 | 1.874 ± 1.102 | 4.686 ± 0.935 | 3.749 ± 1.483 | 3.749 ± 1.017 | 2.812 ± 0.899 | 4.686 ± 0.935 | 4.686 ± 0.941 | 0.937 ± 0.908 | 8.435 ± 1.643 | 1.874 ± 1.102 | 12.184 ± 7.687 | 3.749 ± 0.594 | 9.372 ± 3.735 | 7.498 ± 2.1 | 2.812 ± 1.653 | 3.749 ± 2.204 | 0.0 ± 0.0 |
| Cys | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.874 ± 1.102 | 0.937 ± 0.551 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.874 ± 1.032 | 0.0 ± 0.0 | 0.937 ± 0.551 | 0.0 ± 0.0 | 0.937 ± 0.551 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Asp | 6.56 ± 1.622 | 0.0 ± 0.0 | 4.686 ± 0.711 | 1.874 ± 1.816 | 2.812 ± 0.548 | 2.812 ± 1.653 | 1.874 ± 1.102 | 1.874 ± 0.508 | 2.812 ± 1.217 | 8.435 ± 1.643 | 0.937 ± 0.908 | 3.749 ± 0.974 | 5.623 ± 0.397 | 0.0 ± 0.0 | 5.623 ± 1.606 | 2.812 ± 2.084 | 2.812 ± 0.899 | 0.937 ± 0.551 | 0.937 ± 0.551 | 2.812 ± 0.548 | 0.0 ± 0.0 |
| Glu | 6.56 ± 1.481 | 0.0 ± 0.0 | 2.812 ± 2.033 | 0.937 ± 1.122 | 3.749 ± 0.974 | 2.812 ± 1.365 | 0.937 ± 0.908 | 0.937 ± 0.908 | 2.812 ± 0.548 | 1.874 ± 1.816 | 4.686 ± 2.289 | 1.874 ± 0.508 | 2.812 ± 0.899 | 2.812 ± 0.548 | 7.498 ± 1.188 | 2.812 ± 1.653 | 3.749 ± 1.483 | 4.686 ± 2.289 | 2.812 ± 1.217 | 2.812 ± 0.548 | 0.0 ± 0.0 |
| Phe | 2.812 ± 0.548 | 0.0 ± 0.0 | 3.749 ± 0.974 | 0.937 ± 1.122 | 0.937 ± 0.551 | 0.0 ± 0.0 | 0.937 ± 0.551 | 1.874 ± 1.102 | 0.937 ± 0.551 | 0.937 ± 0.551 | 0.0 ± 0.0 | 1.874 ± 1.368 | 1.874 ± 1.102 | 3.749 ± 2.204 | 0.0 ± 0.0 | 2.812 ± 0.548 | 0.0 ± 0.0 | 5.623 ± 1.096 | 0.0 ± 0.0 | 1.874 ± 1.102 | 0.0 ± 0.0 |
| Gly | 8.435 ± 4.41 | 0.0 ± 0.0 | 5.623 ± 2.017 | 2.812 ± 0.548 | 0.0 ± 0.0 | 10.309 ± 4.812 | 1.874 ± 1.032 | 3.749 ± 2.262 | 4.686 ± 1.485 | 6.56 ± 3.035 | 0.0 ± 0.0 | 1.874 ± 1.102 | 1.874 ± 1.816 | 2.812 ± 1.365 | 2.812 ± 1.653 | 4.686 ± 1.709 | 1.874 ± 0.508 | 2.812 ± 1.653 | 0.0 ± 0.0 | 3.749 ± 1.483 | 0.0 ± 0.0 |
| His | 3.749 ± 2.735 | 0.937 ± 0.551 | 1.874 ± 1.102 | 0.937 ± 0.551 | 0.0 ± 0.0 | 0.937 ± 0.551 | 0.0 ± 0.0 | 0.937 ± 0.551 | 1.874 ± 1.368 | 1.874 ± 0.508 | 0.937 ± 0.551 | 0.937 ± 0.551 | 0.937 ± 0.551 | 0.0 ± 0.0 | 0.937 ± 0.551 | 0.937 ± 0.551 | 1.874 ± 1.368 | 0.937 ± 0.551 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Ile | 4.686 ± 0.902 | 0.937 ± 0.551 | 3.749 ± 1.582 | 3.749 ± 1.017 | 0.0 ± 0.0 | 1.874 ± 0.508 | 0.937 ± 0.908 | 3.749 ± 0.974 | 0.937 ± 0.908 | 3.749 ± 0.974 | 0.0 ± 0.0 | 2.812 ± 1.365 | 1.874 ± 1.102 | 4.686 ± 2.289 | 3.749 ± 1.483 | 1.874 ± 1.032 | 2.812 ± 0.899 | 0.937 ± 0.908 | 0.937 ± 0.908 | 0.937 ± 0.551 | 0.0 ± 0.0 |
| Lys | 2.812 ± 1.217 | 0.0 ± 0.0 | 1.874 ± 0.508 | 4.686 ± 1.485 | 1.874 ± 1.102 | 1.874 ± 2.243 | 0.0 ± 0.0 | 3.749 ± 3.632 | 1.874 ± 0.508 | 3.749 ± 0.594 | 0.937 ± 0.551 | 0.937 ± 0.551 | 1.874 ± 0.508 | 0.937 ± 0.908 | 5.623 ± 1.525 | 2.812 ± 0.548 | 3.749 ± 1.017 | 1.874 ± 1.032 | 0.0 ± 0.0 | 2.812 ± 1.653 | 0.0 ± 0.0 |
| Leu | 9.372 ± 3.198 | 0.937 ± 0.551 | 3.749 ± 1.582 | 2.812 ± 1.653 | 1.874 ± 0.508 | 10.309 ± 1.93 | 0.937 ± 0.551 | 1.874 ± 1.102 | 4.686 ± 2.033 | 4.686 ± 2.756 | 1.874 ± 1.102 | 1.874 ± 1.102 | 2.812 ± 0.548 | 7.498 ± 2.146 | 5.623 ± 1.606 | 5.623 ± 3.307 | 5.623 ± 1.745 | 5.623 ± 0.397 | 0.937 ± 0.551 | 0.937 ± 0.551 | 0.0 ± 0.0 |
| Met | 2.812 ± 0.899 | 0.0 ± 0.0 | 0.937 ± 1.122 | 5.623 ± 3.151 | 0.937 ± 0.908 | 0.937 ± 0.551 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.874 ± 0.508 | 1.874 ± 0.711 | 2.812 ± 1.365 | 0.937 ± 0.908 | 1.874 ± 1.102 | 1.874 ± 1.102 | 0.937 ± 0.551 | 0.937 ± 0.908 | 5.623 ± 1.606 | 0.937 ± 0.551 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Asn | 1.874 ± 0.508 | 0.0 ± 0.0 | 2.812 ± 1.365 | 1.874 ± 1.032 | 3.749 ± 0.974 | 0.937 ± 0.551 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.937 ± 0.908 | 1.874 ± 1.368 | 1.874 ± 0.508 | 0.0 ± 0.0 | 2.812 ± 0.548 | 1.874 ± 0.508 | 1.874 ± 0.508 | 3.749 ± 1.017 | 0.937 ± 0.551 | 2.812 ± 0.548 | 0.0 ± 0.0 | 0.937 ± 0.551 | 0.0 ± 0.0 |
| Pro | 5.623 ± 1.125 | 0.937 ± 0.551 | 5.623 ± 1.125 | 3.749 ± 1.017 | 0.0 ± 0.0 | 1.874 ± 1.102 | 1.874 ± 1.102 | 6.56 ± 0.218 | 0.937 ± 0.908 | 4.686 ± 0.902 | 0.937 ± 1.122 | 2.812 ± 1.217 | 3.749 ± 0.974 | 3.749 ± 0.594 | 1.874 ± 1.102 | 3.749 ± 1.483 | 3.749 ± 1.017 | 5.623 ± 1.525 | 0.937 ± 0.551 | 0.937 ± 0.551 | 0.0 ± 0.0 |
| Gln | 3.749 ± 1.017 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.874 ± 0.508 | 1.874 ± 1.102 | 3.749 ± 1.017 | 1.874 ± 1.102 | 5.623 ± 1.096 | 0.937 ± 0.551 | 2.812 ± 1.653 | 0.0 ± 0.0 | 0.937 ± 0.551 | 3.749 ± 1.582 | 3.749 ± 1.017 | 3.749 ± 0.594 | 3.749 ± 1.582 | 3.749 ± 1.483 | 1.874 ± 0.508 | 1.874 ± 1.032 | 1.874 ± 1.816 | 0.0 ± 0.0 |
| Arg | 7.498 ± 2.1 | 0.0 ± 0.0 | 4.686 ± 1.485 | 6.56 ± 2.62 | 2.812 ± 1.653 | 8.435 ± 3.749 | 0.0 ± 0.0 | 0.937 ± 0.551 | 7.498 ± 0.692 | 8.435 ± 2.681 | 2.812 ± 0.548 | 0.0 ± 0.0 | 3.749 ± 0.974 | 6.56 ± 1.42 | 8.435 ± 1.277 | 0.937 ± 1.122 | 3.749 ± 1.017 | 5.623 ± 2.523 | 0.937 ± 1.122 | 2.812 ± 0.548 | 0.0 ± 0.0 |
| Ser | 7.498 ± 1.188 | 0.0 ± 0.0 | 0.937 ± 0.551 | 0.937 ± 0.551 | 2.812 ± 0.548 | 2.812 ± 1.653 | 1.874 ± 1.102 | 6.56 ± 2.278 | 3.749 ± 1.017 | 6.56 ± 1.622 | 3.749 ± 1.483 | 2.812 ± 1.365 | 0.937 ± 1.122 | 0.937 ± 0.551 | 0.937 ± 0.551 | 4.686 ± 0.902 | 3.749 ± 1.582 | 0.937 ± 1.122 | 1.874 ± 1.102 | 1.874 ± 1.102 | 0.0 ± 0.0 |
| Thr | 8.435 ± 2.696 | 0.0 ± 0.0 | 2.812 ± 2.033 | 1.874 ± 0.508 | 1.874 ± 1.032 | 3.749 ± 2.262 | 0.937 ± 1.122 | 1.874 ± 1.032 | 0.937 ± 0.551 | 2.812 ± 1.653 | 0.0 ± 0.0 | 2.812 ± 2.724 | 4.686 ± 0.935 | 2.812 ± 1.653 | 8.435 ± 2.696 | 4.686 ± 1.849 | 6.56 ± 2.847 | 2.812 ± 2.084 | 1.874 ± 1.368 | 2.812 ± 0.548 | 0.0 ± 0.0 |
| Val | 5.623 ± 1.745 | 0.937 ± 1.122 | 4.686 ± 1.849 | 7.498 ± 0.738 | 0.0 ± 0.0 | 6.56 ± 1.622 | 0.937 ± 0.908 | 0.937 ± 1.122 | 2.812 ± 0.548 | 3.749 ± 0.594 | 1.874 ± 1.102 | 0.0 ± 0.0 | 5.623 ± 0.397 | 1.874 ± 0.508 | 4.686 ± 2.756 | 2.812 ± 1.365 | 5.623 ± 2.83 | 3.749 ± 2.064 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Trp | 0.937 ± 0.908 | 0.0 ± 0.0 | 1.874 ± 1.102 | 0.0 ± 0.0 | 0.937 ± 1.122 | 0.0 ± 0.0 | 0.937 ± 1.122 | 0.0 ± 0.0 | 0.937 ± 0.908 | 4.686 ± 2.188 | 0.937 ± 0.551 | 0.937 ± 0.551 | 0.937 ± 0.551 | 0.0 ± 0.0 | 1.874 ± 1.102 | 1.874 ± 1.102 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.937 ± 0.551 | 0.0 ± 0.0 |
| Tyr | 4.686 ± 1.485 | 0.937 ± 0.551 | 0.0 ± 0.0 | 2.812 ± 1.365 | 0.937 ± 0.551 | 0.937 ± 0.551 | 0.937 ± 0.551 | 0.0 ± 0.0 | 1.874 ± 0.508 | 5.623 ± 2.017 | 1.874 ± 1.78 | 0.937 ± 0.551 | 2.812 ± 1.653 | 0.0 ± 0.0 | 3.749 ± 0.974 | 0.937 ± 0.551 | 0.937 ± 0.551 | 0.0 ± 0.0 | 0.937 ± 0.551 | 0.937 ± 0.551 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
Statistics based on 3 proteins (1068 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
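A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the resampled values is taken as the error. This is only an illustration under assumptions. The function names are hypothetical, the use of the sample standard deviation as the reported error is assumed, and the exact procedure used by the database is not described on this page.

```python
import random
import statistics
from collections import Counter


def permille(proteins, dipeptide):
    """Frequency of one dipeptide, in per mille, over a list of proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(permille(sample, dipeptide))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    # Toy sequences, not taken from the proteome described here.
    toy = ["MKVLAAG", "MAAKAA", "MGGLR"]
    print(permille(toy, "AA"), bootstrap_error(toy, "AA"))
```

With only a few proteins in a proteome, each resample often repeats or omits an entire protein, so the estimated error can be large relative to the value itself.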

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski