Amino acid dipepetide frequency for Halhan virus 3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.891AlaAla: 4.891 ± 1.481
0.752AlaCys: 0.752 ± 0.413
4.138AlaAsp: 4.138 ± 0.109
4.515AlaGlu: 4.515 ± 1.289
4.138AlaPhe: 4.138 ± 1.299
4.138AlaGly: 4.138 ± 0.704
1.505AlaHis: 1.505 ± 0.231
6.772AlaIle: 6.772 ± 0.448
3.386AlaLys: 3.386 ± 0.669
5.643AlaLeu: 5.643 ± 1.909
0.752AlaMet: 0.752 ± 0.259
3.386AlaAsn: 3.386 ± 1.117
3.762AlaPro: 3.762 ± 0.315
4.138AlaGln: 4.138 ± 0.487
2.634AlaArg: 2.634 ± 0.851
6.02AlaSer: 6.02 ± 2.648
6.396AlaThr: 6.396 ± 1.846
4.515AlaVal: 4.515 ± 0.098
1.505AlaTrp: 1.505 ± 0.231
1.129AlaTyr: 1.129 ± 0.571
0.0AlaXaa: 0.0 ± 0.0
Cys
1.505CysAla: 1.505 ± 0.827
0.0CysCys: 0.0 ± 0.0
0.376CysAsp: 0.376 ± 0.207
0.376CysGlu: 0.376 ± 0.389
0.752CysPhe: 0.752 ± 0.413
0.752CysGly: 0.752 ± 0.413
0.0CysHis: 0.0 ± 0.0
1.881CysIle: 1.881 ± 0.438
1.129CysLys: 1.129 ± 0.571
0.376CysLeu: 0.376 ± 0.207
0.376CysMet: 0.376 ± 0.207
0.376CysAsn: 0.376 ± 0.207
0.376CysPro: 0.376 ± 0.207
0.752CysGln: 0.752 ± 0.413
0.752CysArg: 0.752 ± 0.413
0.376CysSer: 0.376 ± 0.389
0.376CysThr: 0.376 ± 0.207
0.376CysVal: 0.376 ± 0.207
0.0CysTrp: 0.0 ± 0.0
0.752CysTyr: 0.752 ± 0.182
0.0CysXaa: 0.0 ± 0.0
Asp
2.257AspAla: 2.257 ± 0.546
0.376AspCys: 0.376 ± 0.207
4.515AspAsp: 4.515 ± 1.093
3.386AspGlu: 3.386 ± 1.86
2.257AspPhe: 2.257 ± 0.049
3.01AspGly: 3.01 ± 1.653
0.752AspHis: 0.752 ± 0.413
3.01AspIle: 3.01 ± 0.133
2.634AspLys: 2.634 ± 0.256
9.406AspLeu: 9.406 ± 0.193
1.881AspMet: 1.881 ± 0.158
3.386AspAsn: 3.386 ± 0.074
3.386AspPro: 3.386 ± 0.669
1.505AspGln: 1.505 ± 0.364
1.881AspArg: 1.881 ± 0.438
6.396AspSer: 6.396 ± 0.655
5.267AspThr: 5.267 ± 1.87
3.762AspVal: 3.762 ± 3.292
0.752AspTrp: 0.752 ± 0.413
1.505AspTyr: 1.505 ± 0.231
0.0AspXaa: 0.0 ± 0.0
Glu
5.643GluAla: 5.643 ± 1.313
0.752GluCys: 0.752 ± 0.413
1.881GluAsp: 1.881 ± 0.438
4.891GluGlu: 4.891 ± 1.496
2.634GluPhe: 2.634 ± 0.256
0.376GluGly: 0.376 ± 0.207
0.0GluHis: 0.0 ± 0.0
4.138GluIle: 4.138 ± 0.487
1.881GluLys: 1.881 ± 1.033
4.138GluLeu: 4.138 ± 1.678
0.376GluMet: 0.376 ± 0.207
1.881GluAsn: 1.881 ± 0.158
2.257GluPro: 2.257 ± 0.049
0.376GluGln: 0.376 ± 0.207
3.762GluArg: 3.762 ± 0.876
3.762GluSer: 3.762 ± 0.911
3.762GluThr: 3.762 ± 1.471
3.01GluVal: 3.01 ± 0.728
0.752GluTrp: 0.752 ± 0.182
3.762GluTyr: 3.762 ± 0.876
0.0GluXaa: 0.0 ± 0.0
Phe
1.505PheAla: 1.505 ± 0.96
0.0PheCys: 0.0 ± 0.0
2.257PheAsp: 2.257 ± 0.049
2.634PheGlu: 2.634 ± 0.851
1.129PhePhe: 1.129 ± 0.025
3.762PheGly: 3.762 ± 0.315
0.376PheHis: 0.376 ± 0.207
2.257PheIle: 2.257 ± 0.644
3.386PheLys: 3.386 ± 0.669
3.386PheLeu: 3.386 ± 0.669
3.01PheMet: 3.01 ± 0.462
3.01PheAsn: 3.01 ± 0.728
2.257PhePro: 2.257 ± 1.142
1.129PheGln: 1.129 ± 0.571
4.138PheArg: 4.138 ± 1.082
6.02PheSer: 6.02 ± 2.116
6.396PheThr: 6.396 ± 3.037
2.634PheVal: 2.634 ± 0.256
1.129PheTrp: 1.129 ± 0.62
1.129PheTyr: 1.129 ± 1.166
0.0PheXaa: 0.0 ± 0.0
Gly
1.881GlyAla: 1.881 ± 0.438
0.0GlyCys: 0.0 ± 0.0
5.643GlyAsp: 5.643 ± 1.909
3.386GlyGlu: 3.386 ± 0.669
3.762GlyPhe: 3.762 ± 0.28
2.257GlyGly: 2.257 ± 0.049
0.376GlyHis: 0.376 ± 0.207
3.386GlyIle: 3.386 ± 0.074
3.386GlyLys: 3.386 ± 0.074
4.138GlyLeu: 4.138 ± 1.678
0.752GlyMet: 0.752 ± 0.413
3.762GlyAsn: 3.762 ± 2.101
1.129GlyPro: 1.129 ± 0.571
0.752GlyGln: 0.752 ± 0.413
3.386GlyArg: 3.386 ± 0.669
6.02GlySer: 6.02 ± 0.329
4.515GlyThr: 4.515 ± 1.093
4.138GlyVal: 4.138 ± 0.704
0.752GlyTrp: 0.752 ± 0.778
2.634GlyTyr: 2.634 ± 0.34
0.0GlyXaa: 0.0 ± 0.0
His
1.881HisAla: 1.881 ± 1.033
0.376HisCys: 0.376 ± 0.207
1.129HisAsp: 1.129 ± 0.025
1.129HisGlu: 1.129 ± 0.62
1.129HisPhe: 1.129 ± 0.62
1.881HisGly: 1.881 ± 1.033
0.0HisHis: 0.0 ± 0.0
1.129HisIle: 1.129 ± 0.62
0.376HisLys: 0.376 ± 0.207
0.752HisLeu: 0.752 ± 0.413
0.376HisMet: 0.376 ± 0.207
0.752HisAsn: 0.752 ± 0.413
0.376HisPro: 0.376 ± 0.207
0.0HisGln: 0.0 ± 0.0
0.752HisArg: 0.752 ± 0.778
2.257HisSer: 2.257 ± 0.049
1.129HisThr: 1.129 ± 0.025
1.505HisVal: 1.505 ± 0.231
0.0HisTrp: 0.0 ± 0.0
0.376HisTyr: 0.376 ± 0.207
0.0HisXaa: 0.0 ± 0.0
Ile
4.515IleAla: 4.515 ± 0.497
0.376IleCys: 0.376 ± 0.207
5.643IleAsp: 5.643 ± 0.473
3.01IleGlu: 3.01 ± 0.728
2.257IlePhe: 2.257 ± 1.24
4.515IleGly: 4.515 ± 0.694
1.505IleHis: 1.505 ± 0.827
2.634IleIle: 2.634 ± 0.256
1.129IleLys: 1.129 ± 0.62
4.138IleLeu: 4.138 ± 0.487
0.376IleMet: 0.376 ± 0.207
2.634IleAsn: 2.634 ± 0.256
4.891IlePro: 4.891 ± 2.077
2.257IleGln: 2.257 ± 0.546
3.762IleArg: 3.762 ± 0.876
6.396IleSer: 6.396 ± 1.727
4.138IleThr: 4.138 ± 1.299
6.02IleVal: 6.02 ± 0.862
0.376IleTrp: 0.376 ± 0.207
0.376IleTyr: 0.376 ± 0.207
0.0IleXaa: 0.0 ± 0.0
Lys
2.634LysAla: 2.634 ± 1.447
1.129LysCys: 1.129 ± 0.62
1.505LysAsp: 1.505 ± 0.827
2.257LysGlu: 2.257 ± 0.049
1.505LysPhe: 1.505 ± 0.96
1.881LysGly: 1.881 ± 0.438
1.505LysHis: 1.505 ± 0.231
2.257LysIle: 2.257 ± 0.049
2.257LysLys: 2.257 ± 1.24
6.02LysLeu: 6.02 ± 1.52
1.129LysMet: 1.129 ± 0.62
2.257LysAsn: 2.257 ± 0.644
2.257LysPro: 2.257 ± 0.546
0.752LysGln: 0.752 ± 0.413
3.386LysArg: 3.386 ± 1.264
3.386LysSer: 3.386 ± 0.522
2.634LysThr: 2.634 ± 0.34
2.634LysVal: 2.634 ± 1.447
0.752LysTrp: 0.752 ± 0.182
0.752LysTyr: 0.752 ± 0.413
0.0LysXaa: 0.0 ± 0.0
Leu
6.772LeuAla: 6.772 ± 1.338
0.752LeuCys: 0.752 ± 0.413
7.524LeuAsp: 7.524 ± 0.56
3.762LeuGlu: 3.762 ± 0.876
4.138LeuPhe: 4.138 ± 0.487
3.762LeuGly: 3.762 ± 0.315
3.01LeuHis: 3.01 ± 1.653
3.386LeuIle: 3.386 ± 0.669
4.515LeuLys: 4.515 ± 1.289
8.277LeuLeu: 8.277 ± 0.974
1.505LeuMet: 1.505 ± 0.827
6.772LeuAsn: 6.772 ± 0.147
6.396LeuPro: 6.396 ± 0.536
1.505LeuGln: 1.505 ± 0.231
7.148LeuArg: 7.148 ± 2.028
7.901LeuSer: 7.901 ± 1.958
5.643LeuThr: 5.643 ± 0.718
6.02LeuVal: 6.02 ± 2.052
0.752LeuTrp: 0.752 ± 0.182
2.634LeuTyr: 2.634 ± 0.935
0.0LeuXaa: 0.0 ± 0.0
Met
2.634MetAla: 2.634 ± 0.34
0.376MetCys: 0.376 ± 0.207
1.129MetAsp: 1.129 ± 0.62
1.129MetGlu: 1.129 ± 0.025
1.505MetPhe: 1.505 ± 0.231
0.752MetGly: 0.752 ± 0.182
0.752MetHis: 0.752 ± 0.182
1.505MetIle: 1.505 ± 0.364
1.129MetLys: 1.129 ± 0.025
2.257MetLeu: 2.257 ± 0.644
0.376MetMet: 0.376 ± 0.207
1.129MetAsn: 1.129 ± 0.62
1.129MetPro: 1.129 ± 0.62
0.752MetGln: 0.752 ± 0.413
1.129MetArg: 1.129 ± 0.62
0.752MetSer: 0.752 ± 0.182
1.129MetThr: 1.129 ± 0.025
0.752MetVal: 0.752 ± 0.413
0.376MetTrp: 0.376 ± 0.207
0.752MetTyr: 0.752 ± 0.413
0.0MetXaa: 0.0 ± 0.0
Asn
3.762AsnAla: 3.762 ± 0.911
1.129AsnCys: 1.129 ± 0.025
1.881AsnAsp: 1.881 ± 0.158
1.505AsnGlu: 1.505 ± 0.364
1.881AsnPhe: 1.881 ± 1.033
4.515AsnGly: 4.515 ± 0.694
0.752AsnHis: 0.752 ± 0.182
2.257AsnIle: 2.257 ± 1.142
2.257AsnLys: 2.257 ± 0.644
3.01AsnLeu: 3.01 ± 0.462
1.505AsnMet: 1.505 ± 0.231
0.752AsnAsn: 0.752 ± 0.778
3.01AsnPro: 3.01 ± 1.058
1.129AsnGln: 1.129 ± 0.62
1.505AsnArg: 1.505 ± 0.231
4.891AsnSer: 4.891 ± 2.077
4.515AsnThr: 4.515 ± 1.093
3.762AsnVal: 3.762 ± 0.315
0.376AsnTrp: 0.376 ± 0.389
1.505AsnTyr: 1.505 ± 0.364
0.0AsnXaa: 0.0 ± 0.0
Pro
3.01ProAla: 3.01 ± 0.728
0.752ProCys: 0.752 ± 0.778
3.762ProAsp: 3.762 ± 1.506
2.257ProGlu: 2.257 ± 0.049
2.634ProPhe: 2.634 ± 0.256
2.257ProGly: 2.257 ± 0.546
0.752ProHis: 0.752 ± 0.182
2.257ProIle: 2.257 ± 0.049
1.505ProLys: 1.505 ± 0.827
5.643ProLeu: 5.643 ± 1.909
1.881ProMet: 1.881 ± 0.438
1.129ProAsn: 1.129 ± 0.62
1.505ProPro: 1.505 ± 0.231
1.505ProGln: 1.505 ± 0.231
4.138ProArg: 4.138 ± 1.082
6.02ProSer: 6.02 ± 1.457
2.257ProThr: 2.257 ± 0.049
3.01ProVal: 3.01 ± 1.919
1.129ProTrp: 1.129 ± 0.62
1.881ProTyr: 1.881 ± 1.348
0.0ProXaa: 0.0 ± 0.0
Gln
3.01GlnAla: 3.01 ± 0.133
0.0GlnCys: 0.0 ± 0.0
2.257GlnAsp: 2.257 ± 1.142
1.505GlnGlu: 1.505 ± 0.231
1.505GlnPhe: 1.505 ± 0.231
1.129GlnGly: 1.129 ± 0.571
0.752GlnHis: 0.752 ± 0.413
1.505GlnIle: 1.505 ± 0.364
0.752GlnLys: 0.752 ± 0.413
2.634GlnLeu: 2.634 ± 0.256
0.376GlnMet: 0.376 ± 0.207
0.752GlnAsn: 0.752 ± 0.413
1.505GlnPro: 1.505 ± 0.231
1.505GlnGln: 1.505 ± 0.96
2.257GlnArg: 2.257 ± 0.644
0.376GlnSer: 0.376 ± 0.207
0.752GlnThr: 0.752 ± 0.182
3.01GlnVal: 3.01 ± 0.133
0.376GlnTrp: 0.376 ± 0.207
0.376GlnTyr: 0.376 ± 0.207
0.0GlnXaa: 0.0 ± 0.0
Arg
6.02ArgAla: 6.02 ± 0.329
1.881ArgCys: 1.881 ± 0.438
3.01ArgAsp: 3.01 ± 0.462
2.257ArgGlu: 2.257 ± 0.049
3.386ArgPhe: 3.386 ± 0.522
3.01ArgGly: 3.01 ± 0.133
0.752ArgHis: 0.752 ± 0.413
6.772ArgIle: 6.772 ± 0.743
1.129ArgLys: 1.129 ± 0.62
3.01ArgLeu: 3.01 ± 0.133
0.752ArgMet: 0.752 ± 0.413
3.01ArgAsn: 3.01 ± 0.462
3.762ArgPro: 3.762 ± 0.315
1.505ArgGln: 1.505 ± 0.827
3.386ArgArg: 3.386 ± 0.074
3.01ArgSer: 3.01 ± 0.133
4.138ArgThr: 4.138 ± 0.487
5.643ArgVal: 5.643 ± 0.123
0.0ArgTrp: 0.0 ± 0.0
1.881ArgTyr: 1.881 ± 0.753
0.0ArgXaa: 0.0 ± 0.0
Ser
6.02SerAla: 6.02 ± 0.266
0.0SerCys: 0.0 ± 0.0
4.891SerAsp: 4.891 ± 1.481
4.891SerGlu: 4.891 ± 0.9
4.515SerPhe: 4.515 ± 0.497
6.02SerGly: 6.02 ± 0.862
0.752SerHis: 0.752 ± 0.413
3.762SerIle: 3.762 ± 0.315
4.138SerLys: 4.138 ± 1.082
9.782SerLeu: 9.782 ± 0.581
2.257SerMet: 2.257 ± 1.142
4.891SerAsn: 4.891 ± 0.9
3.01SerPro: 3.01 ± 1.058
3.762SerGln: 3.762 ± 1.506
4.138SerArg: 4.138 ± 0.704
11.287SerSer: 11.287 ± 0.946
6.396SerThr: 6.396 ± 0.655
7.524SerVal: 7.524 ± 0.63
1.129SerTrp: 1.129 ± 0.025
1.881SerTyr: 1.881 ± 0.753
0.0SerXaa: 0.0 ± 0.0
Thr
6.396ThrAla: 6.396 ± 2.441
0.752ThrCys: 0.752 ± 0.413
2.257ThrAsp: 2.257 ± 0.049
1.881ThrGlu: 1.881 ± 0.438
3.762ThrPhe: 3.762 ± 0.911
5.267ThrGly: 5.267 ± 0.679
1.881ThrHis: 1.881 ± 0.438
4.891ThrIle: 4.891 ± 0.291
2.634ThrLys: 2.634 ± 0.935
8.277ThrLeu: 8.277 ± 2.003
1.129ThrMet: 1.129 ± 0.025
1.505ThrAsn: 1.505 ± 0.96
4.138ThrPro: 4.138 ± 1.895
1.881ThrGln: 1.881 ± 0.158
3.762ThrArg: 3.762 ± 0.911
5.643ThrSer: 5.643 ± 0.718
4.515ThrThr: 4.515 ± 1.688
5.267ThrVal: 5.267 ± 1.275
1.129ThrTrp: 1.129 ± 0.025
3.386ThrTyr: 3.386 ± 0.074
0.0ThrXaa: 0.0 ± 0.0
Val
6.396ValAla: 6.396 ± 3.632
1.505ValCys: 1.505 ± 0.231
4.891ValAsp: 4.891 ± 2.672
3.762ValGlu: 3.762 ± 0.876
4.891ValPhe: 4.891 ± 0.291
4.891ValGly: 4.891 ± 0.291
0.0ValHis: 0.0 ± 0.0
4.515ValIle: 4.515 ± 1.289
3.386ValLys: 3.386 ± 0.669
6.772ValLeu: 6.772 ± 2.234
1.129ValMet: 1.129 ± 0.025
3.762ValAsn: 3.762 ± 0.911
3.386ValPro: 3.386 ± 0.522
0.0ValGln: 0.0 ± 0.0
3.386ValArg: 3.386 ± 0.074
6.02ValSer: 6.02 ± 1.457
2.634ValThr: 2.634 ± 2.126
5.643ValVal: 5.643 ± 1.664
0.752ValTrp: 0.752 ± 0.182
5.267ValTyr: 5.267 ± 0.511
0.0ValXaa: 0.0 ± 0.0
Trp
0.376TrpAla: 0.376 ± 0.207
0.376TrpCys: 0.376 ± 0.389
1.129TrpAsp: 1.129 ± 0.62
0.0TrpGlu: 0.0 ± 0.0
0.376TrpPhe: 0.376 ± 0.389
0.376TrpGly: 0.376 ± 0.207
0.752TrpHis: 0.752 ± 0.413
0.752TrpIle: 0.752 ± 0.778
0.376TrpLys: 0.376 ± 0.207
0.752TrpLeu: 0.752 ± 0.413
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.376TrpPro: 0.376 ± 0.207
0.0TrpGln: 0.0 ± 0.0
1.129TrpArg: 1.129 ± 0.025
1.881TrpSer: 1.881 ± 0.158
1.505TrpThr: 1.505 ± 0.231
1.129TrpVal: 1.129 ± 0.571
0.0TrpTrp: 0.0 ± 0.0
1.129TrpTyr: 1.129 ± 0.62
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.386TyrAla: 3.386 ± 0.669
0.376TyrCys: 0.376 ± 0.207
1.505TyrAsp: 1.505 ± 0.827
1.505TyrGlu: 1.505 ± 0.231
3.386TyrPhe: 3.386 ± 0.522
1.505TyrGly: 1.505 ± 0.827
1.129TyrHis: 1.129 ± 0.025
1.881TyrIle: 1.881 ± 0.158
1.505TyrLys: 1.505 ± 0.364
3.762TyrLeu: 3.762 ± 0.911
1.129TyrMet: 1.129 ± 0.532
0.752TyrAsn: 0.752 ± 0.778
0.376TyrPro: 0.376 ± 0.207
1.129TyrGln: 1.129 ± 0.025
1.881TyrArg: 1.881 ± 1.348
2.634TyrSer: 2.634 ± 1.531
2.257TyrThr: 2.257 ± 0.049
2.634TyrVal: 2.634 ± 0.935
0.376TyrTrp: 0.376 ± 0.207
0.752TyrTyr: 0.752 ± 0.182
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2659 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski