Amino acid dipeptide frequency for Beihai toti-like virus 4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
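The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used by Proteome-pI: the function names, the one-letter alphabet, the mapping of unknown residues to "X", the rule that dipeptides do not span two proteins, and the use of the uniform 2.5‰ value as the over/under threshold are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; anything else is mapped to "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all given sequences.

    Assumption: dipeptides are overlapping windows of two adjacent residues
    and never span two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found")
    # Report all 441 combinations, including those that never occur.
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


def classify(freq_per_mille, expected=1000.0 / 400):
    """Label a frequency relative to the uniform expectation (2.5 per mille)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Toy input (hypothetical sequences, not the viral proteome).
freqs = dipeptide_per_mille(["MKLLA", "ALLK"])
```

With the values from the table, `classify(3.294)` would label AlaAla as overrepresented and `classify(0.941)` would label AlaCys as underrepresented under this assumed threshold.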

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
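As a small worked example of the unit conversion, the helpers below (hypothetical names, introduced only for illustration) turn a per mille value from the table into a fraction or a percentage and show the uniform expectation in the same unit.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10**-3)."""
    return value / 1000.0


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 percent = 10 per mille)."""
    return value / 10.0


# Uniform expectation for 400 dipeptides, expressed in per mille.
uniform_expectation = 1000.0 / 400

ala_ala_fraction = per_mille_to_fraction(3.294)  # AlaAla entry from the table
```

Here 3.294‰ corresponds to a fraction of about 0.003294, and the uniform expectation of 2.5‰ is the same quantity as the 0.25% quoted above.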

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.294 ± 0.486
AlaCys: 0.941 ± 0.139
AlaAsp: 1.412 ± 0.765
AlaGlu: 2.353 ± 0.626
AlaPhe: 1.412 ± 0.116
AlaGly: 4.235 ± 0.347
AlaHis: 0.471 ± 0.255
AlaIle: 4.235 ± 1.645
AlaLys: 3.294 ± 0.163
AlaLeu: 6.588 ± 2.271
AlaMet: 1.412 ± 0.116
AlaAsn: 2.824 ± 1.53
AlaPro: 4.235 ± 0.996
AlaGln: 3.765 ± 1.39
AlaArg: 2.824 ± 0.881
AlaSer: 3.294 ± 0.486
AlaThr: 3.294 ± 0.486
AlaVal: 3.765 ± 0.741
AlaTrp: 0.941 ± 0.139
AlaTyr: 1.412 ± 0.116
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.941 ± 0.51
CysCys: 0.0 ± 0.0
CysAsp: 0.471 ± 0.255
CysGlu: 0.941 ± 0.789
CysPhe: 1.882 ± 0.928
CysGly: 0.471 ± 0.255
CysHis: 0.0 ± 0.0
CysIle: 0.941 ± 0.789
CysLys: 0.941 ± 0.789
CysLeu: 1.412 ± 1.183
CysMet: 0.0 ± 0.0
CysAsn: 0.941 ± 0.789
CysPro: 0.471 ± 0.255
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 0.471 ± 0.394
CysThr: 0.471 ± 0.255
CysVal: 0.941 ± 0.789
CysTrp: 0.471 ± 0.255
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.294 ± 0.486
AspCys: 0.471 ± 0.394
AspAsp: 4.235 ± 2.25
AspGlu: 1.882 ± 1.02
AspPhe: 1.882 ± 0.279
AspGly: 2.824 ± 1.53
AspHis: 0.0 ± 0.0
AspIle: 6.118 ± 3.827
AspLys: 2.824 ± 2.366
AspLeu: 4.235 ± 0.952
AspMet: 0.471 ± 0.255
AspAsn: 0.941 ± 0.139
AspPro: 1.882 ± 0.371
AspGln: 1.412 ± 0.116
AspArg: 1.882 ± 0.371
AspSer: 3.765 ± 0.092
AspThr: 1.882 ± 0.371
AspVal: 5.176 ± 0.442
AspTrp: 2.353 ± 1.322
AspTyr: 2.353 ± 0.024
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.765 ± 0.741
GluCys: 0.471 ± 0.394
GluAsp: 1.882 ± 0.279
GluGlu: 0.941 ± 0.51
GluPhe: 2.353 ± 0.673
GluGly: 2.353 ± 0.626
GluHis: 1.882 ± 0.928
GluIle: 1.882 ± 0.928
GluLys: 3.294 ± 2.111
GluLeu: 3.765 ± 0.092
GluMet: 1.882 ± 0.371
GluAsn: 2.353 ± 0.673
GluPro: 1.412 ± 0.765
GluGln: 2.353 ± 0.626
GluArg: 1.882 ± 0.279
GluSer: 3.294 ± 0.812
GluThr: 1.882 ± 1.02
GluVal: 2.824 ± 1.067
GluTrp: 0.471 ± 0.394
GluTyr: 3.294 ± 0.163
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.294 ± 1.785
PheCys: 0.941 ± 0.139
PheAsp: 1.882 ± 0.928
PheGlu: 0.941 ± 0.789
PhePhe: 0.471 ± 0.255
PheGly: 3.765 ± 0.741
PheHis: 0.0 ± 0.0
PheIle: 2.353 ± 1.275
PheLys: 1.412 ± 0.534
PheLeu: 4.235 ± 0.347
PheMet: 0.471 ± 0.266
PheAsn: 3.294 ± 0.812
PhePro: 3.294 ± 1.136
PheGln: 1.412 ± 0.765
PheArg: 2.353 ± 1.275
PheSer: 6.588 ± 1.625
PheThr: 1.882 ± 1.02
PheVal: 4.235 ± 0.302
PheTrp: 0.941 ± 0.789
PheTyr: 1.882 ± 0.371
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.765 ± 2.04
GlyCys: 0.471 ± 0.394
GlyAsp: 4.706 ± 0.602
GlyGlu: 2.824 ± 0.231
GlyPhe: 1.412 ± 0.765
GlyGly: 4.706 ± 1.251
GlyHis: 1.882 ± 1.02
GlyIle: 4.706 ± 0.602
GlyLys: 3.765 ± 0.741
GlyLeu: 5.176 ± 0.857
GlyMet: 2.353 ± 0.626
GlyAsn: 3.294 ± 0.486
GlyPro: 2.824 ± 0.231
GlyGln: 2.353 ± 0.626
GlyArg: 1.882 ± 0.371
GlySer: 4.706 ± 0.047
GlyThr: 5.176 ± 0.857
GlyVal: 4.706 ± 1.251
GlyTrp: 0.941 ± 0.789
GlyTyr: 4.235 ± 0.347
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.941 ± 0.51
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 2.353 ± 0.024
HisPhe: 0.471 ± 0.255
HisGly: 0.471 ± 0.394
HisHis: 0.0 ± 0.0
HisIle: 1.882 ± 0.279
HisLys: 1.412 ± 0.116
HisLeu: 1.882 ± 0.371
HisMet: 0.471 ± 0.255
HisAsn: 1.412 ± 0.765
HisPro: 2.353 ± 0.024
HisGln: 0.0 ± 0.0
HisArg: 0.941 ± 0.139
HisSer: 0.471 ± 0.255
HisThr: 0.941 ± 0.139
HisVal: 0.941 ± 0.139
HisTrp: 0.0 ± 0.0
HisTyr: 0.941 ± 0.139
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.765 ± 0.741
IleCys: 0.471 ± 0.255
IleAsp: 3.294 ± 0.163
IleGlu: 3.765 ± 0.557
IlePhe: 3.765 ± 0.092
IleGly: 3.294 ± 1.136
IleHis: 0.0 ± 0.0
IleIle: 5.176 ± 2.389
IleLys: 3.294 ± 2.111
IleLeu: 5.176 ± 1.091
IleMet: 2.353 ± 0.024
IleAsn: 4.235 ± 0.952
IlePro: 3.294 ± 0.163
IleGln: 2.353 ± 0.024
IleArg: 3.765 ± 1.207
IleSer: 4.706 ± 0.602
IleThr: 5.647 ± 0.463
IleVal: 3.765 ± 0.557
IleTrp: 0.471 ± 0.394
IleTyr: 1.412 ± 0.116
IleXaa: 0.0 ± 0.0
Lys
LysAla: 0.471 ± 0.255
LysCys: 0.941 ± 0.789
LysAsp: 2.824 ± 0.231
LysGlu: 2.353 ± 0.673
LysPhe: 2.353 ± 0.673
LysGly: 2.353 ± 1.322
LysHis: 0.941 ± 0.789
LysIle: 2.824 ± 1.716
LysLys: 1.882 ± 0.928
LysLeu: 7.529 ± 2.413
LysMet: 1.412 ± 0.534
LysAsn: 6.588 ± 3.572
LysPro: 2.824 ± 0.881
LysGln: 0.471 ± 0.255
LysArg: 1.882 ± 0.928
LysSer: 2.353 ± 1.971
LysThr: 2.824 ± 0.881
LysVal: 3.765 ± 0.092
LysTrp: 1.882 ± 0.279
LysTyr: 2.824 ± 2.366
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.706 ± 0.047
LeuCys: 0.471 ± 0.394
LeuAsp: 4.706 ± 0.047
LeuGlu: 5.176 ± 1.74
LeuPhe: 4.706 ± 1.251
LeuGly: 8.941 ± 0.949
LeuHis: 1.412 ± 0.534
LeuIle: 5.647 ± 2.134
LeuLys: 4.235 ± 0.302
LeuLeu: 6.588 ± 0.326
LeuMet: 3.765 ± 0.092
LeuAsn: 3.765 ± 0.092
LeuPro: 4.706 ± 1.251
LeuGln: 3.765 ± 0.092
LeuArg: 3.765 ± 1.207
LeuSer: 7.059 ± 0.72
LeuThr: 6.588 ± 0.323
LeuVal: 6.118 ± 1.367
LeuTrp: 1.412 ± 0.534
LeuTyr: 2.353 ± 0.673
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.353 ± 0.024
MetCys: 0.0 ± 0.0
MetAsp: 0.941 ± 0.139
MetGlu: 0.471 ± 0.394
MetPhe: 1.882 ± 0.279
MetGly: 1.412 ± 0.116
MetHis: 0.0 ± 0.0
MetIle: 1.412 ± 0.116
MetLys: 0.471 ± 0.255
MetLeu: 3.294 ± 0.486
MetMet: 0.471 ± 0.255
MetAsn: 0.941 ± 0.51
MetPro: 1.882 ± 0.279
MetGln: 0.941 ± 0.139
MetArg: 2.353 ± 0.673
MetSer: 2.353 ± 1.275
MetThr: 2.353 ± 0.673
MetVal: 2.353 ± 0.626
MetTrp: 0.471 ± 0.394
MetTyr: 0.471 ± 0.394
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.824 ± 0.418
AsnCys: 1.412 ± 1.183
AsnAsp: 2.824 ± 0.418
AsnGlu: 2.824 ± 0.418
AsnPhe: 4.706 ± 0.047
AsnGly: 2.353 ± 0.626
AsnHis: 1.882 ± 1.02
AsnIle: 6.118 ± 0.068
AsnLys: 2.824 ± 1.067
AsnLeu: 3.765 ± 0.741
AsnMet: 1.882 ± 0.279
AsnAsn: 4.235 ± 0.347
AsnPro: 3.765 ± 1.39
AsnGln: 4.706 ± 0.602
AsnArg: 2.353 ± 1.971
AsnSer: 3.294 ± 0.812
AsnThr: 6.118 ± 0.718
AsnVal: 5.647 ± 2.134
AsnTrp: 0.471 ± 0.255
AsnTyr: 3.294 ± 0.163
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.824 ± 0.881
ProCys: 0.0 ± 0.0
ProAsp: 1.412 ± 0.116
ProGlu: 0.941 ± 0.789
ProPhe: 2.353 ± 0.626
ProGly: 3.765 ± 1.39
ProHis: 1.412 ± 0.765
ProIle: 4.706 ± 1.9
ProLys: 1.882 ± 0.371
ProLeu: 7.529 ± 2.132
ProMet: 0.941 ± 0.139
ProAsn: 3.294 ± 1.136
ProPro: 1.882 ± 0.371
ProGln: 3.294 ± 0.486
ProArg: 1.412 ± 0.116
ProSer: 4.235 ± 0.347
ProThr: 4.235 ± 1.645
ProVal: 6.118 ± 0.718
ProTrp: 2.353 ± 0.626
ProTyr: 0.471 ± 0.255
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.765 ± 2.04
GlnCys: 0.0 ± 0.0
GlnAsp: 1.412 ± 0.534
GlnGlu: 2.353 ± 0.024
GlnPhe: 0.941 ± 0.139
GlnGly: 2.353 ± 0.626
GlnHis: 0.941 ± 0.51
GlnIle: 2.824 ± 1.53
GlnLys: 1.412 ± 0.534
GlnLeu: 6.118 ± 1.367
GlnMet: 0.471 ± 0.394
GlnAsn: 4.235 ± 0.996
GlnPro: 2.353 ± 1.275
GlnGln: 1.882 ± 1.02
GlnArg: 0.941 ± 0.789
GlnSer: 2.824 ± 1.53
GlnThr: 2.353 ± 0.626
GlnVal: 2.824 ± 0.881
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.471 ± 0.255
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.471 ± 0.255
ArgCys: 0.0 ± 0.0
ArgAsp: 4.706 ± 2.644
ArgGlu: 1.882 ± 0.279
ArgPhe: 2.824 ± 0.231
ArgGly: 2.353 ± 0.024
ArgHis: 1.412 ± 0.534
ArgIle: 2.824 ± 0.418
ArgLys: 2.353 ± 1.322
ArgLeu: 1.412 ± 0.116
ArgMet: 2.353 ± 0.673
ArgAsn: 0.941 ± 0.789
ArgPro: 0.941 ± 0.51
ArgGln: 2.353 ± 0.626
ArgArg: 1.412 ± 0.116
ArgSer: 2.824 ± 0.231
ArgThr: 2.824 ± 0.231
ArgVal: 3.294 ± 2.111
ArgTrp: 2.824 ± 0.231
ArgTyr: 0.0 ± 0.0
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.588 ± 1.622
SerCys: 0.471 ± 0.255
SerAsp: 3.765 ± 0.092
SerGlu: 3.765 ± 0.092
SerPhe: 3.765 ± 2.04
SerGly: 7.529 ± 0.184
SerHis: 1.882 ± 0.279
SerIle: 2.824 ± 1.067
SerLys: 4.235 ± 2.25
SerLeu: 5.647 ± 2.134
SerMet: 0.941 ± 0.139
SerAsn: 3.765 ± 0.741
SerPro: 2.824 ± 1.53
SerGln: 2.824 ± 1.53
SerArg: 1.412 ± 0.534
SerSer: 6.588 ± 0.975
SerThr: 5.176 ± 0.208
SerVal: 5.647 ± 0.187
SerTrp: 0.941 ± 0.789
SerTyr: 2.353 ± 0.673
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.647 ± 0.463
ThrCys: 0.471 ± 0.255
ThrAsp: 2.824 ± 0.881
ThrGlu: 3.294 ± 0.163
ThrPhe: 1.882 ± 0.279
ThrGly: 2.824 ± 0.881
ThrHis: 1.882 ± 0.371
ThrIle: 2.353 ± 0.024
ThrLys: 3.765 ± 2.505
ThrLeu: 6.588 ± 0.326
ThrMet: 1.412 ± 0.765
ThrAsn: 7.059 ± 1.877
ThrPro: 5.647 ± 1.761
ThrGln: 1.882 ± 1.02
ThrArg: 2.824 ± 0.418
ThrSer: 5.647 ± 1.761
ThrThr: 5.647 ± 2.41
ThrVal: 3.294 ± 0.486
ThrTrp: 1.882 ± 0.279
ThrTyr: 3.294 ± 1.136
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.882 ± 1.02
ValCys: 2.824 ± 1.067
ValAsp: 3.294 ± 2.111
ValGlu: 1.882 ± 0.371
ValPhe: 2.824 ± 0.881
ValGly: 6.588 ± 2.92
ValHis: 0.0 ± 0.0
ValIle: 3.765 ± 1.39
ValLys: 4.706 ± 1.251
ValLeu: 5.176 ± 3.039
ValMet: 1.882 ± 0.279
ValAsn: 9.412 ± 2.692
ValPro: 4.235 ± 0.302
ValGln: 2.353 ± 0.626
ValArg: 3.294 ± 0.163
ValSer: 5.647 ± 0.836
ValThr: 4.706 ± 0.602
ValVal: 4.235 ± 0.996
ValTrp: 1.882 ± 0.928
ValTyr: 2.824 ± 0.418
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.941 ± 0.139
TrpCys: 0.471 ± 0.255
TrpAsp: 0.941 ± 0.789
TrpGlu: 0.941 ± 0.139
TrpPhe: 0.471 ± 0.394
TrpGly: 1.882 ± 0.928
TrpHis: 0.941 ± 0.51
TrpIle: 0.0 ± 0.0
TrpLys: 0.941 ± 0.139
TrpLeu: 3.294 ± 0.812
TrpMet: 0.941 ± 0.139
TrpAsn: 0.471 ± 0.394
TrpPro: 0.471 ± 0.255
TrpGln: 0.471 ± 0.255
TrpArg: 1.412 ± 0.534
TrpSer: 0.941 ± 0.51
TrpThr: 2.353 ± 1.971
TrpVal: 2.353 ± 0.673
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.471 ± 0.394
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.471 ± 0.394
TyrCys: 0.941 ± 0.789
TyrAsp: 2.353 ± 0.024
TyrGlu: 2.353 ± 0.024
TyrPhe: 3.294 ± 0.163
TyrGly: 1.412 ± 0.765
TyrHis: 0.941 ± 0.139
TyrIle: 1.412 ± 1.183
TyrLys: 2.353 ± 1.971
TyrLeu: 0.471 ± 0.255
TyrMet: 0.471 ± 0.423
TyrAsn: 3.294 ± 0.812
TyrPro: 3.765 ± 0.741
TyrGln: 1.882 ± 0.371
TyrArg: 1.412 ± 0.534
TyrSer: 1.882 ± 0.371
TyrThr: 3.765 ± 0.741
TyrVal: 1.412 ± 0.534
TyrTrp: 0.0 ± 0.0
TyrTyr: 2.824 ± 0.231
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2126 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
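A protein-level bootstrap of this kind could look roughly like the sketch below. The page does not describe the exact procedure, so the details are assumptions: whole proteins are resampled with replacement, the frequency is recomputed on each resample, and the standard deviation over the replicates is taken as the error. The function names, the default of 100 rounds, and the fixed seed are illustrative only.

```python
import random
import statistics


def count_dipeptide(seq, dipeptide):
    """Count overlapping occurrences of a two-letter dipeptide in one sequence."""
    return sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == dipeptide)


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the spread of a dipeptide frequency (per mille).

    Each round draws len(proteins) proteins with replacement, recomputes the
    frequency on that resample, and the standard deviation over all rounds is
    returned as the error estimate (an assumed definition of the error).
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = [rng.choice(proteins) for _ in proteins]
        hits = sum(count_dipeptide(p, dipeptide) for p in sample)
        windows = sum(max(len(p) - 1, 0) for p in sample)
        estimates.append(1000.0 * hits / windows if windows else 0.0)
    return statistics.pstdev(estimates)


# Toy input (hypothetical sequences, not the viral proteome).
toy_error = bootstrap_error(["MKLLA", "ALLK"], "LL")
```

With only two proteins, as in this proteome, a resample can contain only three distinct combinations of proteins, so error estimates obtained this way are necessarily coarse.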

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in UniProt
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski