Amino acid dipeptide frequency for Capybara microvirus Cap1_SP_135

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
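The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the database's own code: it uses one-letter amino acid codes, counts overlapping pairs within each protein (never across protein boundaries), and normalizes by the total number of pairs. The function names `dipeptide_per_mille` and `classify` are hypothetical, and the exact normalization used for the table may differ.

```python
from collections import Counter

# Expected frequency of each of the 400 dipeptides under a uniform model,
# expressed in per mille (1000 / 400 = 2.5).
EXPECTED_PER_MILLE = 1000 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}


def classify(freq, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq > expected:
        return "over"    # would be marked red in the table
    if freq < expected:
        return "under"   # would be marked blue in the table
    return "expected"


if __name__ == "__main__":
    toy = ["AAAC", "ACD"]  # hypothetical toy sequences, not real data
    for pair, freq in sorted(dipeptide_per_mille(toy).items()):
        print(pair, round(freq, 3), classify(freq))
```

With only a handful of short sequences nearly every observed pair is far above 2.5‰, so the comparison with the uniform expectation is only meaningful for a proteome-sized input.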

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions; for example, 4.225‰ corresponds to 0.004225, or 0.4225%.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
0.604AlaAla: 0.604 ± 1.149
0.0AlaCys: 0.0 ± 0.0
3.621AlaAsp: 3.621 ± 1.613
4.225AlaGlu: 4.225 ± 4.371
4.225AlaPhe: 4.225 ± 1.532
4.225AlaGly: 4.225 ± 2.194
0.0AlaHis: 0.0 ± 0.0
4.828AlaIle: 4.828 ± 1.162
4.225AlaLys: 4.225 ± 2.886
5.432AlaLeu: 5.432 ± 1.761
0.604AlaMet: 0.604 ± 0.458
4.828AlaAsn: 4.828 ± 2.455
4.828AlaPro: 4.828 ± 1.301
3.018AlaGln: 3.018 ± 0.904
1.811AlaArg: 1.811 ± 0.801
4.225AlaSer: 4.225 ± 1.311
1.207AlaThr: 1.207 ± 0.612
1.811AlaVal: 1.811 ± 2.24
2.414AlaTrp: 2.414 ± 1.216
4.225AlaTyr: 4.225 ± 1.721
0.0AlaXaa: 0.0 ± 0.0
Cys
0.604CysAla: 0.604 ± 0.475
0.0CysCys: 0.0 ± 0.0
1.811CysAsp: 1.811 ± 2.24
1.811CysGlu: 1.811 ± 1.771
1.207CysPhe: 1.207 ± 0.916
0.604CysGly: 0.604 ± 0.458
1.207CysHis: 1.207 ± 0.473
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
1.811CysLeu: 1.811 ± 0.831
0.0CysMet: 0.0 ± 0.0
0.604CysAsn: 0.604 ± 0.475
0.604CysPro: 0.604 ± 0.475
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
0.604CysThr: 0.604 ± 0.458
1.207CysVal: 1.207 ± 0.951
0.604CysTrp: 0.604 ± 0.475
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.414AspAla: 2.414 ± 0.836
0.0AspCys: 0.0 ± 0.0
6.035AspAsp: 6.035 ± 2.366
5.432AspGlu: 5.432 ± 2.183
5.432AspPhe: 5.432 ± 2.218
2.414AspGly: 2.414 ± 1.58
3.018AspHis: 3.018 ± 1.233
6.035AspIle: 6.035 ± 0.787
1.811AspLys: 1.811 ± 0.801
3.621AspLeu: 3.621 ± 0.935
1.207AspMet: 1.207 ± 1.03
2.414AspAsn: 2.414 ± 1.268
1.207AspPro: 1.207 ± 0.951
1.811AspGln: 1.811 ± 1.109
2.414AspArg: 2.414 ± 0.579
4.828AspSer: 4.828 ± 1.245
4.828AspThr: 4.828 ± 1.674
3.621AspVal: 3.621 ± 1.093
0.604AspTrp: 0.604 ± 0.475
6.035AspTyr: 6.035 ± 2.019
0.0AspXaa: 0.0 ± 0.0
Glu
7.242GluAla: 7.242 ± 2.928
0.0GluCys: 0.0 ± 0.0
1.207GluAsp: 1.207 ± 0.951
3.621GluGlu: 3.621 ± 1.355
1.811GluPhe: 1.811 ± 0.801
4.828GluGly: 4.828 ± 0.857
0.604GluHis: 0.604 ± 1.074
1.811GluIle: 1.811 ± 1.138
7.242GluLys: 7.242 ± 3.521
5.432GluLeu: 5.432 ± 2.175
0.0GluMet: 0.0 ± 1.056
5.432GluAsn: 5.432 ± 3.137
0.604GluPro: 0.604 ± 0.624
0.604GluGln: 0.604 ± 0.624
3.018GluArg: 3.018 ± 1.457
1.811GluSer: 1.811 ± 0.801
1.811GluThr: 1.811 ± 1.279
3.018GluVal: 3.018 ± 0.903
1.811GluTrp: 1.811 ± 0.831
8.449GluTyr: 8.449 ± 2.037
0.0GluXaa: 0.0 ± 0.0
Phe
4.828PheAla: 4.828 ± 1.835
0.604PheCys: 0.604 ± 1.149
7.242PheAsp: 7.242 ± 3.204
2.414PheGlu: 2.414 ± 0.946
3.018PhePhe: 3.018 ± 1.073
6.639PheGly: 6.639 ± 3.222
0.604PheHis: 0.604 ± 0.475
0.604PheIle: 0.604 ± 0.458
0.604PheLys: 0.604 ± 0.624
3.018PheLeu: 3.018 ± 1.464
0.604PheMet: 0.604 ± 0.458
3.621PheAsn: 3.621 ± 1.248
1.811PhePro: 1.811 ± 2.054
1.207PheGln: 1.207 ± 0.473
4.828PheArg: 4.828 ± 1.343
2.414PheSer: 2.414 ± 0.946
2.414PheThr: 2.414 ± 1.833
3.018PheVal: 3.018 ± 1.233
0.0PheTrp: 0.0 ± 0.0
2.414PheTyr: 2.414 ± 0.946
0.0PheXaa: 0.0 ± 0.0
Gly
4.225GlyAla: 4.225 ± 0.864
0.0GlyCys: 0.0 ± 0.0
5.432GlyAsp: 5.432 ± 3.459
3.621GlyGlu: 3.621 ± 1.123
1.811GlyPhe: 1.811 ± 0.801
2.414GlyGly: 2.414 ± 0.836
1.207GlyHis: 1.207 ± 0.473
4.225GlyIle: 4.225 ± 2.137
4.225GlyLys: 4.225 ± 0.996
4.828GlyLeu: 4.828 ± 2.129
1.207GlyMet: 1.207 ± 1.137
3.621GlyAsn: 3.621 ± 1.765
0.0GlyPro: 0.0 ± 0.0
4.828GlyGln: 4.828 ± 1.756
1.811GlyArg: 1.811 ± 0.801
4.828GlySer: 4.828 ± 2.619
4.225GlyThr: 4.225 ± 1.415
2.414GlyVal: 2.414 ± 0.946
0.0GlyTrp: 0.0 ± 0.0
1.207GlyTyr: 1.207 ± 0.473
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
1.207HisCys: 1.207 ± 0.473
1.207HisAsp: 1.207 ± 0.473
1.207HisGlu: 1.207 ± 0.609
1.207HisPhe: 1.207 ± 0.916
0.0HisGly: 0.0 ± 0.0
0.604HisHis: 0.604 ± 0.475
0.604HisIle: 0.604 ± 0.475
1.811HisLys: 1.811 ± 1.426
0.604HisLeu: 0.604 ± 0.475
0.604HisMet: 0.604 ± 0.624
0.0HisAsn: 0.0 ± 0.0
0.604HisPro: 0.604 ± 0.458
0.0HisGln: 0.0 ± 0.0
1.811HisArg: 1.811 ± 1.006
0.604HisSer: 0.604 ± 0.458
0.0HisThr: 0.0 ± 0.0
1.811HisVal: 1.811 ± 0.985
0.0HisTrp: 0.0 ± 0.0
2.414HisTyr: 2.414 ± 1.268
0.0HisXaa: 0.0 ± 0.0
Ile
1.207IleAla: 1.207 ± 0.612
0.604IleCys: 0.604 ± 0.475
5.432IleAsp: 5.432 ± 1.064
1.811IleGlu: 1.811 ± 1.138
2.414IlePhe: 2.414 ± 1.154
3.018IleGly: 3.018 ± 1.715
0.0IleHis: 0.0 ± 0.0
3.018IleIle: 3.018 ± 1.457
3.621IleLys: 3.621 ± 1.835
4.225IleLeu: 4.225 ± 1.25
1.811IleMet: 1.811 ± 1.148
3.621IleAsn: 3.621 ± 1.436
1.811IlePro: 1.811 ± 0.897
1.811IleGln: 1.811 ± 0.897
3.018IleArg: 3.018 ± 1.739
4.828IleSer: 4.828 ± 1.343
3.621IleThr: 3.621 ± 1.47
7.242IleVal: 7.242 ± 2.762
0.0IleTrp: 0.0 ± 0.0
3.018IleTyr: 3.018 ± 0.973
0.0IleXaa: 0.0 ± 0.0
Lys
3.621LysAla: 3.621 ± 1.725
0.604LysCys: 0.604 ± 1.074
3.621LysAsp: 3.621 ± 2.191
2.414LysGlu: 2.414 ± 0.603
3.621LysPhe: 3.621 ± 1.602
3.621LysGly: 3.621 ± 0.897
0.0LysHis: 0.0 ± 0.0
3.621LysIle: 3.621 ± 1.432
5.432LysLys: 5.432 ± 1.665
6.639LysLeu: 6.639 ± 1.948
3.018LysMet: 3.018 ± 1.285
4.225LysAsn: 4.225 ± 2.137
2.414LysPro: 2.414 ± 1.216
4.225LysGln: 4.225 ± 2.016
3.621LysArg: 3.621 ± 0.775
3.621LysSer: 3.621 ± 1.846
4.828LysThr: 4.828 ± 1.343
3.621LysVal: 3.621 ± 0.756
0.0LysTrp: 0.0 ± 0.0
6.639LysTyr: 6.639 ± 1.608
0.0LysXaa: 0.0 ± 0.0
Leu
6.035LeuAla: 6.035 ± 3.309
3.018LeuCys: 3.018 ± 1.339
3.621LeuAsp: 3.621 ± 1.793
4.828LeuGlu: 4.828 ± 2.519
3.018LeuPhe: 3.018 ± 2.812
8.449LeuGly: 8.449 ± 3.168
1.811LeuHis: 1.811 ± 1.426
4.225LeuIle: 4.225 ± 2.023
4.828LeuLys: 4.828 ± 1.673
6.639LeuLeu: 6.639 ± 0.711
0.0LeuMet: 0.0 ± 0.0
4.828LeuAsn: 4.828 ± 1.505
3.621LeuPro: 3.621 ± 2.414
4.225LeuGln: 4.225 ± 2.974
6.639LeuArg: 6.639 ± 1.439
5.432LeuSer: 5.432 ± 0.999
4.225LeuThr: 4.225 ± 1.319
1.811LeuVal: 1.811 ± 1.109
0.0LeuTrp: 0.0 ± 0.0
2.414LeuTyr: 2.414 ± 1.268
0.0LeuXaa: 0.0 ± 0.0
Met
3.018MetAla: 3.018 ± 2.355
0.0MetCys: 0.0 ± 0.0
0.604MetAsp: 0.604 ± 0.624
1.811MetGlu: 1.811 ± 1.138
0.604MetPhe: 0.604 ± 0.458
0.604MetGly: 0.604 ± 0.624
0.0MetHis: 0.0 ± 0.0
1.207MetIle: 1.207 ± 0.609
3.018MetLys: 3.018 ± 1.654
0.604MetLeu: 0.604 ± 1.074
1.207MetMet: 1.207 ± 0.612
1.207MetAsn: 1.207 ± 0.612
1.811MetPro: 1.811 ± 0.801
0.604MetGln: 0.604 ± 0.458
1.811MetArg: 1.811 ± 1.006
1.811MetSer: 1.811 ± 1.298
1.207MetThr: 1.207 ± 0.473
1.207MetVal: 1.207 ± 0.612
0.0MetTrp: 0.0 ± 0.0
2.414MetTyr: 2.414 ± 1.154
0.0MetXaa: 0.0 ± 0.0
Asn
4.828AsnAla: 4.828 ± 1.209
2.414AsnCys: 2.414 ± 1.572
4.225AsnAsp: 4.225 ± 0.769
3.018AsnGlu: 3.018 ± 0.903
2.414AsnPhe: 2.414 ± 0.946
3.018AsnGly: 3.018 ± 1.654
0.0AsnHis: 0.0 ± 0.0
5.432AsnIle: 5.432 ± 1.545
4.225AsnLys: 4.225 ± 3.502
6.035AsnLeu: 6.035 ± 2.924
4.225AsnMet: 4.225 ± 1.415
2.414AsnAsn: 2.414 ± 1.218
3.621AsnPro: 3.621 ± 1.039
3.018AsnGln: 3.018 ± 1.277
4.828AsnArg: 4.828 ± 0.866
4.225AsnSer: 4.225 ± 1.495
3.621AsnThr: 3.621 ± 0.935
1.811AsnVal: 1.811 ± 1.138
0.604AsnTrp: 0.604 ± 0.475
2.414AsnTyr: 2.414 ± 0.946
0.0AsnXaa: 0.0 ± 0.0
Pro
1.207ProAla: 1.207 ± 1.308
0.0ProCys: 0.0 ± 0.0
1.207ProAsp: 1.207 ± 1.03
5.432ProGlu: 5.432 ± 1.098
2.414ProPhe: 2.414 ± 0.946
3.018ProGly: 3.018 ± 0.586
1.811ProHis: 1.811 ± 0.897
2.414ProIle: 2.414 ± 0.579
2.414ProLys: 2.414 ± 1.572
4.828ProLeu: 4.828 ± 1.754
1.207ProMet: 1.207 ± 0.476
1.207ProAsn: 1.207 ± 0.473
0.604ProPro: 0.604 ± 1.074
2.414ProGln: 2.414 ± 2.096
0.604ProArg: 0.604 ± 0.475
1.207ProSer: 1.207 ± 2.148
1.811ProThr: 1.811 ± 0.985
1.811ProVal: 1.811 ± 1.006
0.0ProTrp: 0.0 ± 0.0
1.811ProTyr: 1.811 ± 1.148
0.0ProXaa: 0.0 ± 0.0
Gln
4.225GlnAla: 4.225 ± 1.994
0.0GlnCys: 0.0 ± 0.0
0.604GlnAsp: 0.604 ± 0.475
4.828GlnGlu: 4.828 ± 1.484
4.225GlnPhe: 4.225 ± 0.729
2.414GlnGly: 2.414 ± 0.579
1.207GlnHis: 1.207 ± 0.612
2.414GlnIle: 2.414 ± 0.946
4.828GlnLys: 4.828 ± 2.468
4.828GlnLeu: 4.828 ± 2.318
0.0GlnMet: 0.0 ± 0.0
6.035GlnAsn: 6.035 ± 1.47
1.811GlnPro: 1.811 ± 0.378
3.621GlnGln: 3.621 ± 1.827
1.207GlnArg: 1.207 ± 0.609
4.225GlnSer: 4.225 ± 2.245
2.414GlnThr: 2.414 ± 0.965
1.207GlnVal: 1.207 ± 0.609
1.811GlnTrp: 1.811 ± 1.873
1.811GlnTyr: 1.811 ± 0.378
0.0GlnXaa: 0.0 ± 0.0
Arg
1.811ArgAla: 1.811 ± 0.378
0.604ArgCys: 0.604 ± 0.458
3.018ArgAsp: 3.018 ± 1.265
2.414ArgGlu: 2.414 ± 1.218
3.018ArgPhe: 3.018 ± 1.233
2.414ArgGly: 2.414 ± 1.042
1.207ArgHis: 1.207 ± 0.916
3.621ArgIle: 3.621 ± 1.47
3.621ArgLys: 3.621 ± 1.25
5.432ArgLeu: 5.432 ± 1.842
0.604ArgMet: 0.604 ± 0.624
4.828ArgAsn: 4.828 ± 1.859
1.811ArgPro: 1.811 ± 1.279
3.621ArgGln: 3.621 ± 1.644
0.604ArgArg: 0.604 ± 0.458
3.018ArgSer: 3.018 ± 1.654
2.414ArgThr: 2.414 ± 1.239
1.811ArgVal: 1.811 ± 1.182
0.0ArgTrp: 0.0 ± 0.0
3.621ArgTyr: 3.621 ± 1.661
0.0ArgXaa: 0.0 ± 0.0
Ser
5.432SerAla: 5.432 ± 1.103
1.811SerCys: 1.811 ± 0.831
4.828SerAsp: 4.828 ± 1.657
4.828SerGlu: 4.828 ± 2.663
1.207SerPhe: 1.207 ± 0.473
2.414SerGly: 2.414 ± 0.918
0.604SerHis: 0.604 ± 0.475
3.621SerIle: 3.621 ± 0.756
3.621SerLys: 3.621 ± 1.355
3.018SerLeu: 3.018 ± 1.018
3.018SerMet: 3.018 ± 0.913
3.018SerAsn: 3.018 ± 1.073
1.811SerPro: 1.811 ± 0.985
5.432SerGln: 5.432 ± 1.532
3.621SerArg: 3.621 ± 0.948
7.242SerSer: 7.242 ± 2.814
2.414SerThr: 2.414 ± 1.338
5.432SerVal: 5.432 ± 1.44
0.0SerTrp: 0.0 ± 0.0
1.207SerTyr: 1.207 ± 0.473
0.0SerXaa: 0.0 ± 0.0
Thr
3.621ThrAla: 3.621 ± 1.123
0.0ThrCys: 0.0 ± 0.0
3.018ThrAsp: 3.018 ± 1.731
3.621ThrGlu: 3.621 ± 0.897
3.621ThrPhe: 3.621 ± 1.677
0.0ThrGly: 0.0 ± 0.0
0.604ThrHis: 0.604 ± 0.475
3.621ThrIle: 3.621 ± 1.846
2.414ThrLys: 2.414 ± 0.918
3.621ThrLeu: 3.621 ± 0.862
0.604ThrMet: 0.604 ± 0.475
3.621ThrAsn: 3.621 ± 1.112
3.621ThrPro: 3.621 ± 0.775
3.018ThrGln: 3.018 ± 2.66
0.604ThrArg: 0.604 ± 0.458
3.621ThrSer: 3.621 ± 1.401
3.621ThrThr: 3.621 ± 2.32
6.035ThrVal: 6.035 ± 1.994
0.604ThrTrp: 0.604 ± 0.458
3.018ThrTyr: 3.018 ± 1.654
0.0ThrXaa: 0.0 ± 0.0
Val
2.414ValAla: 2.414 ± 0.603
0.0ValCys: 0.0 ± 0.0
3.018ValAsp: 3.018 ± 1.018
1.207ValGlu: 1.207 ± 1.03
1.207ValPhe: 1.207 ± 0.916
3.621ValGly: 3.621 ± 1.763
0.604ValHis: 0.604 ± 1.149
2.414ValIle: 2.414 ± 1.248
6.639ValLys: 6.639 ± 2.171
3.621ValLeu: 3.621 ± 1.145
0.604ValMet: 0.604 ± 0.458
3.018ValAsn: 3.018 ± 1.099
4.828ValPro: 4.828 ± 3.037
3.621ValGln: 3.621 ± 1.436
4.225ValArg: 4.225 ± 0.91
3.621ValSer: 3.621 ± 1.093
4.225ValThr: 4.225 ± 0.873
1.811ValVal: 1.811 ± 1.006
0.0ValTrp: 0.0 ± 0.0
3.018ValTyr: 3.018 ± 1.265
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.604TrpCys: 0.604 ± 0.475
0.0TrpAsp: 0.0 ± 0.0
1.207TrpGlu: 1.207 ± 0.473
1.207TrpPhe: 1.207 ± 0.951
0.0TrpGly: 0.0 ± 0.0
0.604TrpHis: 0.604 ± 0.458
1.207TrpIle: 1.207 ± 0.916
0.0TrpLys: 0.0 ± 0.0
0.604TrpLeu: 0.604 ± 0.475
0.0TrpMet: 0.0 ± 0.0
1.811TrpAsn: 1.811 ± 1.138
0.0TrpPro: 0.0 ± 0.0
1.207TrpGln: 1.207 ± 0.609
0.0TrpArg: 0.0 ± 0.0
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
0.604TrpVal: 0.604 ± 0.458
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.018TyrAla: 3.018 ± 0.586
1.207TyrCys: 1.207 ± 0.916
5.432TyrAsp: 5.432 ± 0.55
1.207TyrGlu: 1.207 ± 0.612
4.225TyrPhe: 4.225 ± 1.688
2.414TyrGly: 2.414 ± 0.579
0.604TyrHis: 0.604 ± 0.475
1.207TyrIle: 1.207 ± 0.473
4.828TyrLys: 4.828 ± 1.756
4.828TyrLeu: 4.828 ± 1.546
3.621TyrMet: 3.621 ± 0.862
6.035TyrAsn: 6.035 ± 1.193
0.0TyrPro: 0.0 ± 0.0
5.432TyrGln: 5.432 ± 1.098
3.018TyrArg: 3.018 ± 1.726
3.018TyrSer: 3.018 ± 1.233
3.018TyrThr: 3.018 ± 1.265
2.414TyrVal: 2.414 ± 1.268
0.604TyrTrp: 0.604 ± 0.475
4.225TyrTyr: 4.225 ± 1.699
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1658 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
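A protein-level bootstrap of this kind can be sketched as follows. The source does not state which statistic is reported after the ± sign, so this sketch assumes it is the standard deviation of the frequency across replicates. It also assumes that each replicate resamples whole proteins with replacement and recomputes the per mille frequency. The function names `pair_per_mille` and `bootstrap_error` are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2 within each protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: the reported error is the spread across replicates;
    the source page does not specify the exact statistic.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the same count.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["AAAC", "ACD", "CCAA"]  # hypothetical toy sequences
    print(pair_per_mille(toy, "AA"), "±", bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, which is why a proteome of only 5 proteins can give errors that are large relative to the values themselves.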

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski