Amino acid dipeptide frequency for Lynx canadensis associated microvirus CLP 9413

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred in proteins at random with equal probability, each of the 400 would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
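The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own code: the function names `dipeptide_permille` and `classify` are hypothetical, one-letter residue codes are used instead of the three-letter codes shown in the table, and the counting convention (overlapping pairs within each protein, normalized by the total number of pairs) is an assumption.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code; "X" stands in for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_permille(proteins, alphabet=STANDARD):
    """Return {dipeptide: frequency in per mille} for all 441 combinations.

    Assumption: overlapping pairs are counted inside each protein only,
    so no pair spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map every non-standard or unknown residue to X (Xaa).
        seq = "".join(aa if aa in alphabet else "X" for aa in seq.upper())
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    letters = alphabet + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(letters, repeat=2)
    }


def classify(freq_permille, n_letters=20):
    """Compare a frequency with the uniform expectation (2.5 per mille for 20 letters)."""
    expected = 1000.0 / n_letters ** 2
    if freq_permille > expected:
        return "over"   # would be marked red in the table
    if freq_permille < expected:
        return "under"  # would be marked blue in the table
    return "expected"


if __name__ == "__main__":
    freqs = dipeptide_permille(["MKAAV", "MAAD"])
    print(round(freqs["AA"], 3), classify(freqs["AA"]))
```

Multiplying any returned value by 10^-3 gives the plain fraction of dipeptides of that type.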

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 13.655 ± 7.348
AlaCys: 0.0 ± 0.0
AlaAsp: 11.245 ± 3.04
AlaGlu: 4.016 ± 2.396
AlaPhe: 2.41 ± 0.493
AlaGly: 6.426 ± 1.125
AlaHis: 0.803 ± 0.787
AlaIle: 5.622 ± 3.666
AlaLys: 7.229 ± 2.555
AlaLeu: 5.622 ± 1.791
AlaMet: 4.819 ± 4.724
AlaAsn: 4.016 ± 1.994
AlaPro: 0.803 ± 0.529
AlaGln: 4.819 ± 2.721
AlaArg: 5.622 ± 1.686
AlaSer: 5.622 ± 2.296
AlaThr: 5.622 ± 1.667
AlaVal: 10.442 ± 2.977
AlaTrp: 3.213 ± 1.333
AlaTyr: 2.41 ± 1.538
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.803 ± 0.874
CysCys: 0.0 ± 0.0
CysAsp: 0.803 ± 0.874
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.803 ± 0.874
CysLeu: 0.803 ± 0.874
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.803 ± 1.293
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.622 ± 1.09
AspCys: 0.803 ± 1.293
AspAsp: 4.016 ± 1.633
AspGlu: 2.41 ± 1.587
AspPhe: 3.213 ± 1.179
AspGly: 2.41 ± 1.452
AspHis: 3.213 ± 1.495
AspIle: 4.016 ± 0.778
AspLys: 4.819 ± 1.92
AspLeu: 6.426 ± 2.998
AspMet: 0.0 ± 0.0
AspAsn: 1.606 ± 0.668
AspPro: 2.41 ± 1.452
AspGln: 0.803 ± 0.874
AspArg: 3.213 ± 1.336
AspSer: 4.016 ± 1.634
AspThr: 4.819 ± 1.021
AspVal: 7.229 ± 2.173
AspTrp: 1.606 ± 1.575
AspTyr: 7.229 ± 1.103
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.819 ± 2.455
GluCys: 0.0 ± 0.0
GluAsp: 1.606 ± 0.748
GluGlu: 3.213 ± 3.04
GluPhe: 0.0 ± 0.0
GluGly: 0.803 ± 0.874
GluHis: 1.606 ± 1.264
GluIle: 1.606 ± 1.058
GluLys: 0.803 ± 0.529
GluLeu: 0.803 ± 0.874
GluMet: 2.41 ± 0.956
GluAsn: 2.41 ± 1.587
GluPro: 1.606 ± 1.748
GluGln: 1.606 ± 0.668
GluArg: 2.41 ± 0.493
GluSer: 4.016 ± 2.777
GluThr: 4.016 ± 0.992
GluVal: 3.213 ± 1.333
GluTrp: 0.803 ± 0.529
GluTyr: 4.016 ± 0.992
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.819 ± 1.275
PheCys: 0.0 ± 0.0
PheAsp: 4.016 ± 0.968
PheGlu: 1.606 ± 1.633
PhePhe: 0.803 ± 0.529
PheGly: 4.819 ± 1.002
PheHis: 0.0 ± 0.0
PheIle: 0.0 ± 0.0
PheLys: 1.606 ± 1.058
PheLeu: 0.803 ± 0.874
PheMet: 1.606 ± 1.633
PheAsn: 2.41 ± 0.493
PhePro: 0.803 ± 0.874
PheGln: 2.41 ± 0.493
PheArg: 4.016 ± 1.633
PheSer: 1.606 ± 1.375
PheThr: 1.606 ± 0.748
PheVal: 2.41 ± 1.587
PheTrp: 0.0 ± 0.0
PheTyr: 2.41 ± 1.452
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.213 ± 2.121
GlyCys: 0.0 ± 0.0
GlyAsp: 3.213 ± 1.333
GlyGlu: 4.819 ± 0.98
GlyPhe: 4.016 ± 1.382
GlyGly: 8.032 ± 2.061
GlyHis: 0.0 ± 0.0
GlyIle: 4.016 ± 1.382
GlyLys: 4.016 ± 1.417
GlyLeu: 6.426 ± 0.932
GlyMet: 0.803 ± 0.787
GlyAsn: 4.819 ± 1.824
GlyPro: 2.41 ± 0.956
GlyGln: 0.803 ± 0.529
GlyArg: 0.803 ± 0.874
GlySer: 8.032 ± 3.649
GlyThr: 5.622 ± 3.102
GlyVal: 7.229 ± 1.126
GlyTrp: 0.803 ± 0.529
GlyTyr: 3.213 ± 2.121
GlyXaa: 0.0 ± 0.0
His
HisAla: 4.016 ± 2.797
HisCys: 0.0 ± 0.0
HisAsp: 0.803 ± 0.529
HisGlu: 0.0 ± 0.0
HisPhe: 0.803 ± 0.529
HisGly: 1.606 ± 1.058
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 0.803 ± 0.874
HisLeu: 0.803 ± 0.529
HisMet: 0.0 ± 0.0
HisAsn: 1.606 ± 0.748
HisPro: 0.803 ± 0.874
HisGln: 0.803 ± 0.787
HisArg: 0.0 ± 0.0
HisSer: 0.803 ± 0.529
HisThr: 0.803 ± 0.529
HisVal: 0.803 ± 1.293
HisTrp: 0.803 ± 0.529
HisTyr: 0.803 ± 0.874
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.016 ± 1.382
IleCys: 0.803 ± 0.874
IleAsp: 3.213 ± 1.538
IleGlu: 2.41 ± 1.538
IlePhe: 0.803 ± 0.529
IleGly: 2.41 ± 0.956
IleHis: 0.0 ± 0.0
IleIle: 0.0 ± 0.0
IleLys: 0.803 ± 0.529
IleLeu: 3.213 ± 1.179
IleMet: 1.606 ± 1.264
IleAsn: 5.622 ± 1.37
IlePro: 5.622 ± 3.703
IleGln: 1.606 ± 1.058
IleArg: 3.213 ± 2.646
IleSer: 4.819 ± 2.594
IleThr: 3.213 ± 1.333
IleVal: 1.606 ± 0.668
IleTrp: 0.803 ± 0.529
IleTyr: 1.606 ± 1.058
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.622 ± 2.692
LysCys: 0.803 ± 0.874
LysAsp: 2.41 ± 1.587
LysGlu: 0.803 ± 0.874
LysPhe: 0.803 ± 0.874
LysGly: 7.229 ± 2.847
LysHis: 0.0 ± 0.0
LysIle: 4.016 ± 2.958
LysLys: 2.41 ± 2.622
LysLeu: 6.426 ± 4.775
LysMet: 0.0 ± 0.0
LysAsn: 4.819 ± 1.275
LysPro: 1.606 ± 0.748
LysGln: 2.41 ± 1.509
LysArg: 3.213 ± 1.252
LysSer: 2.41 ± 0.912
LysThr: 2.41 ± 1.443
LysVal: 0.803 ± 0.529
LysTrp: 0.0 ± 0.0
LysTyr: 4.016 ± 1.824
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.426 ± 1.376
LeuCys: 0.0 ± 0.0
LeuAsp: 4.016 ± 1.818
LeuGlu: 1.606 ± 0.748
LeuPhe: 4.016 ± 1.417
LeuGly: 3.213 ± 1.18
LeuHis: 1.606 ± 0.949
LeuIle: 4.819 ± 2.618
LeuLys: 4.016 ± 1.824
LeuLeu: 1.606 ± 0.668
LeuMet: 1.606 ± 1.583
LeuAsn: 4.016 ± 1.824
LeuPro: 6.426 ± 2.705
LeuGln: 4.819 ± 2.243
LeuArg: 4.016 ± 1.417
LeuSer: 4.819 ± 0.788
LeuThr: 5.622 ± 1.223
LeuVal: 0.0 ± 0.0
LeuTrp: 0.803 ± 0.529
LeuTyr: 4.016 ± 2.255
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.213 ± 2.121
MetCys: 0.0 ± 0.0
MetAsp: 1.606 ± 0.668
MetGlu: 0.803 ± 1.293
MetPhe: 0.0 ± 0.0
MetGly: 3.213 ± 1.336
MetHis: 0.0 ± 0.0
MetIle: 0.803 ± 0.787
MetLys: 1.606 ± 0.748
MetLeu: 0.803 ± 0.874
MetMet: 0.803 ± 0.529
MetAsn: 0.803 ± 0.529
MetPro: 1.606 ± 1.264
MetGln: 0.0 ± 0.0
MetArg: 0.803 ± 0.529
MetSer: 6.426 ± 3.156
MetThr: 2.41 ± 1.167
MetVal: 0.803 ± 0.529
MetTrp: 0.803 ± 0.529
MetTyr: 1.606 ± 0.748
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 10.442 ± 4.959
AsnCys: 0.0 ± 0.0
AsnAsp: 0.803 ± 0.529
AsnGlu: 0.803 ± 0.529
AsnPhe: 0.803 ± 1.293
AsnGly: 1.606 ± 1.575
AsnHis: 0.0 ± 0.0
AsnIle: 5.622 ± 1.223
AsnLys: 1.606 ± 1.575
AsnLeu: 4.819 ± 0.788
AsnMet: 0.0 ± 0.496
AsnAsn: 2.41 ± 2.362
AsnPro: 2.41 ± 1.361
AsnGln: 2.41 ± 1.587
AsnArg: 4.016 ± 1.812
AsnSer: 6.426 ± 3.348
AsnThr: 4.016 ± 1.994
AsnVal: 0.803 ± 0.529
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.606 ± 0.748
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.41 ± 1.646
ProCys: 0.0 ± 0.0
ProAsp: 6.426 ± 3.788
ProGlu: 2.41 ± 1.538
ProPhe: 4.016 ± 1.633
ProGly: 3.213 ± 1.352
ProHis: 1.606 ± 1.633
ProIle: 3.213 ± 0.379
ProLys: 2.41 ± 0.956
ProLeu: 1.606 ± 0.748
ProMet: 2.41 ± 1.206
ProAsn: 1.606 ± 1.058
ProPro: 2.41 ± 2.622
ProGln: 2.41 ± 1.587
ProArg: 0.803 ± 0.529
ProSer: 4.819 ± 1.275
ProThr: 3.213 ± 1.769
ProVal: 4.819 ± 2.626
ProTrp: 0.0 ± 0.0
ProTyr: 0.803 ± 0.874
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.016 ± 2.895
GlnCys: 0.0 ± 0.0
GlnAsp: 0.0 ± 0.0
GlnGlu: 1.606 ± 0.668
GlnPhe: 0.803 ± 0.787
GlnGly: 2.41 ± 1.587
GlnHis: 0.0 ± 0.0
GlnIle: 3.213 ± 2.116
GlnLys: 1.606 ± 0.748
GlnLeu: 6.426 ± 2.16
GlnMet: 0.803 ± 0.787
GlnAsn: 1.606 ± 1.575
GlnPro: 0.0 ± 0.0
GlnGln: 2.41 ± 0.912
GlnArg: 3.213 ± 0.379
GlnSer: 4.819 ± 1.021
GlnThr: 1.606 ± 0.668
GlnVal: 1.606 ± 0.748
GlnTrp: 0.803 ± 0.874
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.803 ± 0.529
ArgCys: 0.0 ± 0.0
ArgAsp: 7.229 ± 3.246
ArgGlu: 3.213 ± 1.333
ArgPhe: 2.41 ± 1.587
ArgGly: 1.606 ± 0.668
ArgHis: 0.0 ± 0.0
ArgIle: 0.803 ± 0.787
ArgLys: 4.819 ± 1.831
ArgLeu: 5.622 ± 2.339
ArgMet: 2.41 ± 0.902
ArgAsn: 2.41 ± 1.167
ArgPro: 4.819 ± 2.243
ArgGln: 0.803 ± 0.874
ArgArg: 0.803 ± 0.874
ArgSer: 1.606 ± 1.058
ArgThr: 1.606 ± 0.748
ArgVal: 2.41 ± 0.956
ArgTrp: 0.0 ± 0.0
ArgTyr: 4.819 ± 0.98
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 12.048 ± 5.75
SerCys: 0.803 ± 0.874
SerAsp: 5.622 ± 2.177
SerGlu: 4.016 ± 3.25
SerPhe: 3.213 ± 1.452
SerGly: 11.245 ± 6.736
SerHis: 3.213 ± 1.18
SerIle: 1.606 ± 0.668
SerLys: 5.622 ± 1.37
SerLeu: 4.819 ± 1.824
SerMet: 2.41 ± 1.361
SerAsn: 3.213 ± 2.121
SerPro: 4.016 ± 1.634
SerGln: 1.606 ± 1.575
SerArg: 4.016 ± 0.778
SerSer: 6.426 ± 2.009
SerThr: 4.819 ± 2.092
SerVal: 5.622 ± 2.222
SerTrp: 0.803 ± 0.874
SerTyr: 1.606 ± 0.748
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.426 ± 3.348
ThrCys: 0.803 ± 0.874
ThrAsp: 4.819 ± 1.275
ThrGlu: 2.41 ± 1.587
ThrPhe: 4.016 ± 3.045
ThrGly: 7.229 ± 3.046
ThrHis: 0.0 ± 0.0
ThrIle: 1.606 ± 1.058
ThrLys: 2.41 ± 0.493
ThrLeu: 4.016 ± 1.914
ThrMet: 0.803 ± 1.293
ThrAsn: 2.41 ± 1.361
ThrPro: 4.016 ± 2.255
ThrGln: 2.41 ± 1.361
ThrArg: 3.213 ± 2.116
ThrSer: 6.426 ± 2.091
ThrThr: 3.213 ± 1.352
ThrVal: 2.41 ± 0.956
ThrTrp: 1.606 ± 0.668
ThrTyr: 1.606 ± 1.058
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.426 ± 0.932
ValCys: 0.0 ± 0.0
ValAsp: 4.016 ± 2.645
ValGlu: 3.213 ± 1.452
ValPhe: 1.606 ± 1.058
ValGly: 4.016 ± 1.508
ValHis: 0.803 ± 0.529
ValIle: 2.41 ± 1.587
ValLys: 3.213 ± 1.452
ValLeu: 3.213 ± 1.333
ValMet: 2.41 ± 0.956
ValAsn: 4.016 ± 1.508
ValPro: 6.426 ± 2.776
ValGln: 0.803 ± 1.293
ValArg: 1.606 ± 0.668
ValSer: 6.426 ± 3.076
ValThr: 4.016 ± 0.783
ValVal: 2.41 ± 0.912
ValTrp: 1.606 ± 0.748
ValTyr: 1.606 ± 1.264
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.606 ± 1.058
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.803 ± 0.874
TrpPhe: 0.803 ± 0.529
TrpGly: 0.0 ± 0.0
TrpHis: 0.803 ± 0.529
TrpIle: 1.606 ± 1.058
TrpLys: 0.0 ± 0.0
TrpLeu: 0.0 ± 0.0
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.803 ± 0.529
TrpGln: 2.41 ± 2.362
TrpArg: 0.803 ± 0.529
TrpSer: 4.016 ± 0.778
TrpThr: 0.0 ± 0.0
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.606 ± 0.748
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 4.016 ± 0.968
TyrCys: 0.0 ± 0.0
TyrAsp: 4.016 ± 3.25
TyrGlu: 1.606 ± 1.058
TyrPhe: 3.213 ± 1.495
TyrGly: 0.803 ± 0.874
TyrHis: 2.41 ± 0.956
TyrIle: 2.41 ± 0.956
TyrLys: 1.606 ± 0.748
TyrLeu: 3.213 ± 0.379
TyrMet: 2.41 ± 0.912
TyrAsn: 1.606 ± 1.575
TyrPro: 1.606 ± 0.949
TyrGln: 1.606 ± 1.058
TyrArg: 2.41 ± 1.167
TyrSer: 3.213 ± 0.379
TyrThr: 3.213 ± 2.459
TyrVal: 4.819 ± 0.98
TyrTrp: 0.803 ± 0.529
TyrTyr: 3.213 ± 1.495
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1246 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level
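A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: the source says no more than "bootstrapping (x100) at the protein level", so resampling whole proteins with replacement and reporting the sample standard deviation of the resampled frequencies are both assumptions, and the names `pair_permille` and `bootstrap_error` are hypothetical. Sequences use one-letter codes.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within each protein (assumed counting convention).
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the dipeptide frequency across protein-level resamples.

    Each round draws len(proteins) whole proteins with replacement and
    recomputes the frequency; the standard deviation over all rounds is
    returned (an assumed choice of error measure).
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["MKAAV", "MAAD", "MSGAAK", "MVLS"]  # made-up sequences
    print(pair_permille(toy, "AA"), bootstrap_error(toy, "AA"))
```

With only a handful of proteins, as here, each resample can differ strongly from the original set, which is one reason the reported errors can be large relative to the frequencies themselves.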

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski