
Why is mobile data collection always blocked?
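Anyone who has done mobile data collection knows that the biggest headache is IP blocking. This is especially true with Appium automation: when every device runs its scripts over the same WiFi network, the target server can spot the pattern within minutes. Last week, someone working on e-commerce price comparison told me that all 10 of their company's test devices were banned after just two days of running, to the point where they could no longer even receive verification codes.
This is where proxy IPs become a lifesaver. Put simply, each device goes out through a different network exit, which hides its real IP. The proxy services on the market vary wildly, though: some static IPs are dead after two days, and others have latency so high you could make a bowl of instant noodles while you wait.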
Appium + proxy IP configuration
Let's start with how to integrate a proxy into Appium. The key is to add the proxy settings to the Desired Capabilities. Here is an example for Android (the iOS setup is similar):
desired_caps = {
    'platformName': 'Android',
    # Placeholder values: replace with the host:port of your ipipgo dynamic proxy
    'proxy': {
        'httpProxy': 'ipipgoDynamicProxyAddress:Port',
        'sslProxy': 'ipipgoDynamicProxyAddress:Port'
    }
}
Note that this example uses ipipgo's Dynamic Residential Proxy, which is specifically optimized for mobile devices. Don't go cheap and use data center IPs: app risk-control systems are now so fine-grained that server-room IPs get caught right away.
| Proxy Type | Applicable Scenarios | IP Lifetime |
|---|---|---|
| Dynamic residential | High-frequency collection | Single session |
| Static residential | Long-term login | 24+ hours |
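Since the point is for every device to leave through a different exit, here is a minimal sketch of building one capabilities dictionary per device. The proxy endpoints, device IDs, and the helper name `caps_for_devices` are hypothetical and only illustrate the idea. The `proxy` key simply mirrors the example above; whether a given driver honors it is something to verify in your own setup.

```python
from copy import deepcopy

# Hypothetical proxy endpoints; replace with the addresses your provider issues.
PROXIES = ["proxy-a.example:8000", "proxy-b.example:8000", "proxy-c.example:8000"]

BASE_CAPS = {"platformName": "Android"}


def caps_for_devices(device_ids, proxies, base_caps=BASE_CAPS):
    """Return {device_id: capabilities}, giving each device its own proxy.

    Proxies are assigned round-robin, so two devices only share an exit
    when there are more devices than proxies.
    """
    if not proxies:
        raise ValueError("at least one proxy endpoint is required")
    result = {}
    for index, device_id in enumerate(device_ids):
        endpoint = proxies[index % len(proxies)]
        caps = deepcopy(base_caps)  # never mutate the shared base dictionary
        # 'udid' selects the device; Appium 2 expects the prefixed 'appium:udid'.
        caps["udid"] = device_id
        caps["proxy"] = {"httpProxy": endpoint, "sslProxy": endpoint}
        result[device_id] = caps
    return result
```

Calling `caps_for_devices(["emulator-5554", "emulator-5556"], PROXIES)` yields two dictionaries that differ only in `udid` and proxy endpoint, each of which can be passed to its own Appium session.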
Three Tips for Avoiding Collection Pitfalls
1. Make your IP rotation strategy clever enough: Don't naively cycle through IPs in a fixed order. We recommend ipipgo's Intelligent Switching Mode: their API can automatically select the fastest node based on the target website's response speed, and in practice collection efficiency can improve by 40%.
2. Don't let device fingerprints give you away: Besides changing IPs, remember to randomize device information. Put the following parameters into a configuration file and pick from them at random:
- User-Agent (top priority)
- Screen resolution
- Time zone setting
3. Give your operations a natural rhythm: Don't use fixed time intervals; use normally distributed random delays instead. For example, an average interval of 3 seconds that varies by ±1.5 seconds looks much more like a real person.
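The three tips can be sketched together in plain Python. This is only an illustration under stated assumptions, not ipipgo's API: `LatencyAwareRotator` is a hypothetical local stand-in for response-speed-based switching, the values in `PROFILE_OPTIONS` are made-up examples, and the standard deviation of `spread / 3` in `human_delay` is my own choice for turning "3 seconds ± 1.5" into a normal distribution.

```python
import random

# Hypothetical example values; load real ones from your own config file.
PROFILE_OPTIONS = {
    "user_agent": [
        "Mozilla/5.0 (Linux; Android 13; Pixel 7) AppleWebKit/537.36 "
        "(KHTML, like Gecko) Chrome/120.0.0.0 Mobile Safari/537.36",
        "Mozilla/5.0 (Linux; Android 12; SM-G991B) AppleWebKit/537.36 "
        "(KHTML, like Gecko) Chrome/119.0.0.0 Mobile Safari/537.36",
    ],
    "resolution": [(1080, 2400), (1080, 2340), (720, 1600)],
    "timezone": ["Asia/Shanghai", "Asia/Tokyo", "Europe/Berlin"],
}


class LatencyAwareRotator:
    """Tip 1: pick proxies at random, favouring lower observed latency."""

    def __init__(self, proxies, initial_latency=1.0, smoothing=0.3, rng=random):
        if not proxies:
            raise ValueError("at least one proxy is required")
        self._latency = {proxy: initial_latency for proxy in proxies}
        self._smoothing = smoothing
        self._rng = rng

    def record(self, proxy, seconds):
        """Blend a new response time into the proxy's running average."""
        old = self._latency[proxy]
        self._latency[proxy] = (1 - self._smoothing) * old + self._smoothing * seconds

    def pick(self):
        """Return one proxy, weighted by the inverse of its average latency."""
        proxies = list(self._latency)
        weights = [1.0 / max(self._latency[p], 1e-3) for p in proxies]
        return self._rng.choices(proxies, weights=weights, k=1)[0]


def random_profile(options=PROFILE_OPTIONS, rng=random):
    """Tip 2: draw one random value for every fingerprint parameter."""
    return {name: rng.choice(values) for name, values in options.items()}


def human_delay(mean=3.0, spread=1.5, rng=random):
    """Tip 3: normally distributed delay, clamped to mean ± spread seconds."""
    value = rng.gauss(mean, spread / 3)  # sigma = spread / 3 is an assumption
    return min(max(value, mean - spread), mean + spread)
```

A collection loop would then call `pick()` and `random_profile()` before each session, `record()` after each response, and `time.sleep(human_delay())` between actions. Weighted random choice is used instead of always taking the single fastest proxy so that traffic does not pile onto one exit.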
Q&A: Common Pitfalls for Beginners
Q: Why was I blocked even though I used a proxy?
A: Check three things: ① whether the proxy is high-anonymity (ipipgo proxies come with obfuscation by default); ② whether the app cache has been cleared; ③ whether the device hardware information is unique for each device.
Q: What should I do if proxy speed is hurting collection efficiency?
A: Choose a proxy that supports the SOCKS5 protocol (supported by ipipgo's enterprise version), which is more than 3 times faster than an HTTP proxy. If you are collecting images, it is recommended to combine multithreading with region-specific proxies.
Q: How do I manage proxies across a large number of test devices?
A: Use ipipgo's API Dynamic Extraction, and remember to set up a whitelist that binds device MAC addresses so that your IPs cannot be used by anyone else.
A Few Honest Words
Proxy IPs are really not something that any random free service can handle. Last year, our team tested 18 proxy services on the market, and in the end only ipipgo held up under a stress test of 500,000 requests in a single day. In particular, latency on their exclusive mobile access channel can be kept under 200 ms, which is critical for automated testing.
Finally, a reminder: collect data responsibly, and don't hammer a single website to death. A reasonable collection frequency paired with high-quality proxies is the sustainable approach. If you want to test the ipipgo service, remember to use the coupon code MOBILE2023: new users can try the enterprise-level proxies free for three days.

